Speech Recognition Xiaofeng Lai. What is speech recognition?  Speech recognition :  This is the ability of a machine or program to identify words and.

Slides:



Advertisements
Similar presentations
Artificial Intelligence By: David Hunt Lee Evans Jonathan Moreton Rachel Moss.
Advertisements

Character Recognition using Hidden Markov Models Anthony DiPirro Ji Mei Sponsor:Prof. William Sverdlik.
Speech Recognition Part 3 Back end processing. Speech recognition simplified block diagram Speech Capture Speech Capture Feature Extraction Feature Extraction.
Artificial Intelligence
Speech in Multimedia Hao Jiang Computer Science Department Boston College Oct. 9, 2007.
SPEECH RECOGNITION Kunal Shalia and Dima Smirnov.
The Potential for Use of Voice Recognition Software in Appraisal of Oral History Tapes Sonia Yaco University of Wisconsin-Madison School of Library and.
Voice Recognition Technology Kathleen Kennedy COMP 1631 Winter 2010.
4/25/2001ECE566 Philip Felber1 Speech Recognition A report of an Isolated Word experiment. By Philip Felber Illinois Institute of Technology April 25,
Spik v1.0 Voice Commands Execution in a Windows Environment Dekel Abelson Eliran Dahan Instructor: Ari Todtfeld.
Automatic Speech Recognition
SPEECH RECOGNITION FOR MOBILE SYSTEMS BY: PRATIBHA CHANNAMSETTY SHRUTHI SAMBASIVAN.
Natural Language Processing and Speech Enabled Applications by Pavlovic Nenad.
Assistive Technology By: Roxanne Majeski, Oscar Guerin, Tasha Reaves, Elias Luna.
By Chance Berman and Clark Baumgartner. 1. Introduction 2. History 3. Modern Applications 4. Case Study 5. Ethical Analysis.
Automatic Transcript Generation Helmer Strik A 2 RT Dept. of Language & Speech University of Nijmegen.
A VERY USEFUL E-LEARNING TOOL FOR TEACHERS, RESEARCHERS, AND STUDENTS.
Practical AT session 3 WP4-D4.2. Prepared by: Shams Eldin Mohamed Ahmed Hassan Speech, Text and Braille AT.
1 Dragon ® NaturallySpeaking ® Professional Dragon NaturallySpeaking for Education.
 The most intelligent device - “Human Brain”.  The machine that revolutionized the whole world – “computer”.  Inefficiencies of the computer has lead.
A Multimedia English Learning System Using HMMs to Improve Phonemic Awareness for English Learning Yen-Shou Lai, Hung-Hsu Tsai and Pao-Ta Yu Chun-Yu Chen.
Outline Grammar-based speech recognition Statistical language model-based recognition Speech Synthesis Dialog Management Natural Language Processing ©
+ New Media Production CA ~ Siri By Eva Lucey. + Introduction to Siri Apple’s latest iPhone feature – New Application First seen in October 2011 – iPhone.
Lesson 3 Hearing Things Hà Nội Eclipse + ADT + Android SDK Robot SDK Reference:
COLLABORATOR – WD 08 Literature Review IT Layansan R IT Sarmilan S IT Banujan C IT Aravinth S.
Speech Recognition Speech Recognition lets you speak into a microphone to control your computer. You can give commands that the computer will carry out.
Speech Recognition with CMU Sphinx Srikar Nadipally Hareesh Lingareddy.
Higher Vision, language and movement. Strong AI Is the belief that AI will eventually lead to the development of an autonomous intelligent machine. Some.
© 2013 by Larson Technical Services
ARTIFICIALINTELLIGENCE ARTIFICIAL INTELLIGENCE EXPERT SYSTEMS.
ARTIFICIAL INTELLIGENCE FOR SPEECH RECOGNITION. Introduction What is Speech Recognition?  also known as automatic speech recognition or computer speech.
BY KALP SHAH Sentence Recognizer. Sphinx4 Sphinx4 is the best and versatile recognition system. Sphinx4 is a speech recognition system which is written.
VocaLinks Speech Recognition Seminar Assistive Technology, Consulting, Training, Sales and Support.
Reducing uncertainty in speech recognition Controlling mobile devices through voice activated commands Neil Gow, GWXNEI001 Stephen Breyer-Menke, BRYSTE003.
AT and Handheld Devices in ELA Lauchie MacKinnon.
Statistical techniques for video analysis and searching chapter Anton Korotygin.
Introduction. Natural Languages are languages of the Humans These Languages tend to be ambiguous A sentence in natural may not have just one meaning I.
Pattern Recognition NTUEE 高奕豪 2005/4/14. Outline Introduction Definition, Examples, Related Fields, System, and Design Approaches Bayesian, Hidden Markov.
Speech Recognition Created By : Kanjariya Hardik G.
Simple Project on Digit Recognition By: Class: Faculty: Manish Ravlani Speech Recognition Dr. Kepuska.
PREPARED BY MANOJ TALUKDAR MSC 4 TH SEM ROLL-NO 05 GUKC-2012 IN THE GUIDENCE OF DR. SANJIB KR KALITA.
Christoph Prinz / Automatic Speech Recognition Research Progress Hits the Road.
Speech Recognition through Neural Networks By Mohammad Usman Afzal Mohammad Waseem.
1 Speech Recognition. 2 Introduction What is Speech Recognition? - Voice Recognition? Where can it be used? - Dictation - System control/navigation -
How can speech technology be used to help people with disabilities?
Speech Recognition
Classification of models
Natural Language Processing and Speech Enabled Applications
Speech Processing AEGIS RET All-Hands Meeting
Artificial Intelligence for Speech Recognition
A presentation on Basics of Speech Recognition Systems
Speech Recognition Amit Sharma CSE 8th.
Hidden Markov Models (HMM)
Google translate app demo
Derek Hunt Education Commons
7 INPUT AND OUTPUT CHAPTER
Implementing AI solutions using the cognitive services in Azure
(No need of Desktop computer)
Creating Transcripts of Your Narrated PowerPoints Richard Oliver Department of Information Systems 2018 Quality in Online Education Conference.
Kocaeli University Introduction to Engineering Applications
شبکه عصبی تنظیم: بهروز نصرالهی-فریده امدادی استاد محترم: سرکار خانم کریمی دانشگاه آزاد اسلامی واحد شهرری.
EEG Recognition Using The Kaldi Speech Recognition Toolkit
Natural Language Processing
Adrian Alexander Costarella
Digital Speech Processing
Prepared by: Mahmoud Rafeek Al-Farra
Phoneme Recognition Using Neural Networks by Albert VanderMeulen
Artificial Intelligence
Artificial Intelligence 2004 Speech & Natural Language Processing
The Application of Hidden Markov Models in Speech Recognition
Presentation transcript:

Speech Recognition Xiaofeng Lai

What is speech recognition?  Speech recognition :  This is the ability of a machine or program to identify words and phrases in spoken language and convert them to a machine-readable format.

Outline  Brief history of speech recognition.  Introduction to how it works  Applications  Dragon Dictation

Brief History  1950’s, AT&T Bell Laboratories designed “Audrey”.  1960’s, IBM demonstrated “Shoebox”.  1970’s, with the help of DoD’s DARPA.  1980’s, The Hidden Markov Model helped.  1990’s, the software for speech recognition came to people, for example, Dragon.  2000’s, computer speech recognition sort of stalls. Like, google voice research, Siri.

How it works Input Speech Statistical modeling systems Output ADC

How it works  Input speech  Discrete  Continuous  Analog-to-digital converter (ADC)  The speech recognition technology converts these created vibrations to digital format.  Extract phonemes  Organize grammar

How it works  Statistical modeling systems  The Hidden Markov model (HMM)  Most common used in everything from data compression to sound recognition.  Artificial neural networks (ANN)  Were originally developed to model of human brain function.  Biology  The difference  HMM is a special case of the ANN  ANNs are capable of modeling extremely complex biological functions

The Hidden Markov Model  It is a directed graph augmented with probability scores.  N1 N2 N3 = 0.4 X 0.8 X 0.5 = 0.16  N1 N2 N2 N2 N3 N3 N3 N3 N3 = 0.4 x 0.2 x 0.2 x 0.8 x 0.5 x 0.5 x 0.5 x 0.5 =  N1 N1 N2 N2 N3 = 0.6 x 0.4 x 0.2 x 0.8 x 0.5 = 0.192

 Example t ow m aa t ow - British English t ah m ey t ow - American English t ah mey t a - Possibly pronunciation when speaking quickly

Applications  Healthcare  Military  Telephone  Business  People with disabilities  Google’s Voice Search, however, has been available on Android and iPhones.

Applications  Dragon Dictation  Powered by Nuance’s world-renowned Dragon NaturallySpeaking software  2.0, you can send text or to your friends, send notes and reminders to yourself … all using your voice.

Applications

Thank you  Questions?

References  /howsrworks.asp /howsrworks.asp  ets/high-tech-gadgets/speech- recognition1.htm ets/high-tech-gadgets/speech- recognition1.htm  nition nition