Skip to content
Category

Speech recognition

page 1
natural language processing
field of computer science and linguistics
speech recognition
automatic conversion of spoken language into text
n-gram
An '''n-gram' is a sequence of n adjacent symbols in a particular order. The symbols may be n'' adjacent letters (including punctuation marks and blanks), syllables, or rarely whole words found in a language dataset; or adjacent phonemes extracted from a speech-recording dataset, or adjacent base pairs extracted from a genome. They are collected from a text corpus or speech corpus.
voice user interface
makes spoken human interaction with computers possible, using speech recognition to understand spoken commands and answer questions
artificial intelligence content detection
algorithms to detect AI-generated content
trigram
Trigrams are a special case of the n-gram, where n is 3. They are often used in natural language processing for performing statistical analysis of texts and in cryptography for control and use of ciphers and codes. See results of analysis of "Letter Frequencies in the English Language".
speech corpus
speech audio files and text transcriptions
Lexical Markup Framework
ISO standard for Natural Language Processing (NLP) lexicons and Machine Readable Dictionaries (MRD)
voice activity detection
technique used in speech processing in which the presence or absence of human speech is detected
voice search
allows the user to use a voice command to search
Motor theory of speech perception
Hypothesis of spoken word identification
Google Read Along
Android language-learning app
Linguatec
The Linguatec Sprachtechnologien GmbH is a language technology provider, specialized in the field of machine translation, speech synthesis and speech recognition. Linguatec was founded in Munich in 1996 and its headquarters are in Pasing.
text simplification
automated process
Subvocal recognition
the art of taking subvocalization and converting the detected results to a digital text-based output
word error rate
computer language processing metric
Stenomask
thumb|right|Court reporter tests his stenomask. A stenomask is a hand-held microphone built into a padded, soundproof enclosure that fits over the speaker's mouth or nose and mouth. Some lightweight versions may be fitted with an elastic neck strap to hold them in place while freeing the user's hands for other tasks. The purpose of a stenomask is to allow a person to speak without being heard by other people, and to keep background noise away from the microphone.