Speech Recognition Xiaofeng Lai
What is speech recognition? Speech recognition : This is the ability of a machine or program to identify words and phrases in spoken language and convert them to a machine-readable format.
Outline Brief history of speech recognition. Introduction to how it works Applications Dragon Dictation
Brief History 1950’s, AT&T Bell Laboratories designed “Audrey”. 1960’s, IBM demonstrated “Shoebox”. 1970’s, with the help of DoD’s DARPA. 1980’s, The Hidden Markov Model helped. 1990’s, the software for speech recognition came to people, for example, Dragon. 2000’s, computer speech recognition sort of stalls. Like, google voice research, Siri.
How it works Input Speech Statistical modeling systems Output ADC
How it works Input speech Discrete Continuous Analog-to-digital converter (ADC) The speech recognition technology converts these created vibrations to digital format. Extract phonemes Organize grammar
How it works Statistical modeling systems The Hidden Markov model (HMM) Most common used in everything from data compression to sound recognition. Artificial neural networks (ANN) Were originally developed to model of human brain function. Biology The difference HMM is a special case of the ANN ANNs are capable of modeling extremely complex biological functions
The Hidden Markov Model It is a directed graph augmented with probability scores. N1 N2 N3 = 0.4 X 0.8 X 0.5 = 0.16 N1 N2 N2 N2 N3 N3 N3 N3 N3 = 0.4 x 0.2 x 0.2 x 0.8 x 0.5 x 0.5 x 0.5 x 0.5 = N1 N1 N2 N2 N3 = 0.6 x 0.4 x 0.2 x 0.8 x 0.5 = 0.192
Example t ow m aa t ow - British English t ah m ey t ow - American English t ah mey t a - Possibly pronunciation when speaking quickly
Applications Healthcare Military Telephone Business People with disabilities Google’s Voice Search, however, has been available on Android and iPhones.
Applications Dragon Dictation Powered by Nuance’s world-renowned Dragon NaturallySpeaking software 2.0, you can send text or to your friends, send notes and reminders to yourself … all using your voice.
Applications
Thank you Questions?
References /howsrworks.asp /howsrworks.asp ets/high-tech-gadgets/speech- recognition1.htm ets/high-tech-gadgets/speech- recognition1.htm nition nition