Speech Processing August 4, 2005 12/2/2018.

Slides:



Advertisements
Similar presentations
Natural Language Systems
Advertisements

II. PHONOLOGY             .
SPandH Overview January 2010 Phil Green Speech and Hearing Research Group Dept of Computer Science University of Sheffield
Voice Biometric Overview for SfTelephony Meetup March 10, 2011 Dan Miller Opus Research.
Centro per la Ricerca Scientifica e Tecnologica Spoken language technologies: recent advances and future challenges Gianni Lazzari VIENNA July 26.
Spoken Language Technologies: A review of application areas and research issues Analysis and synthesis of F0 contours Agnieszka Wagner Department of Phonetics,
Binary Decision Diagrams1 BINARY DECISION DIAGRAMS.
Designing a Multi-Lingual Corpus Collection System Jonathan Law Naresh Trilok Pace University 04/19/2002 Advisors: Dr. Charles Tappert (Pace University)
Advanced Technology Center Stuttgart EMOTIONAL SPACE IMPROVES EMOTION RECOGNITION Raquel Tato, Rocio Santos, Ralf Kompe Man Machine Interface Lab Advance.
Tanja Schultz, Alan Black, Bob Frederking Carnegie Mellon University West Palm Beach, March 28, 2003 Towards Dolphin Recognition.
Introduction to Speech Production Lecture 1. Phonetics and Phonology Phonetics: The physical manifestation of language in sound waves. –How sounds are.
ASR Evaluation Julia Hirschberg CS Outline Intrinsic Methods –Transcription Accuracy Word Error Rate Automatic methods, toolkits Limitations –Concept.
Auditory User Interfaces
Bootstrapping pronunciation models: a South African case study Presented at the CSIR Research and Innovation Conference Marelie Davel & Etienne Barnard.
Speech Recognition Calculator ECE L02 - Group 8 Alfredo Herrera John Holmes Josh Liang Alex Kee.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
ISSUES IN SPEECH RECOGNITION Shraddha Sharma
An Evaluation Framework for Natural Language Understanding in Spoken Dialogue Systems Joshua B. Gordon and Rebecca J. Passonneau Columbia University.
BravoBrava Mississippi State University Can Advances in Speech Recognition make Spoken Language as Convenient and as Accessible as Online Text? Joseph.
Multimedia Specification Design and Production 2013 / Semester 2 / week 3 Lecturer: Dr. Nikos Gazepidis
Speech and Language Processing
» Jun 9, 2003 Speaker Verification Secure AND Efficient, Deployments in Finance and Banking Jonathan Moav Director of Marketing
Outline Grammar-based speech recognition Statistical language model-based recognition Speech Synthesis Dialog Management Natural Language Processing ©
Algoritmi e Programmazione Avanzata
Introduction to Linguistics n Phonetics and Phonetic Transcription.
Creating User Interfaces Directed Speech. XML. VoiceXML Classwork/Homework: Sign up to be Voxeo developer. Do tutorials.
12/5/20151 Spoken Language Processing Julia Hirschberg CS 4706.
S PEECH T ECHNOLOGY Answers to some Questions. S PEECH T ECHNOLOGY WHAT IS SPEECH TECHNOLOGY ABOUT ?? SPEECH TECHNOLOGY IS ABOUT PROCESSING HUMAN SPEECH.
Using Google's Web Speech API with Moodle for language learning tasks
Unit 5 Phonetics and Phonology. Phonetics Sounds produced by the human speech organs are called the “phonic/auditory medium” Phonetics is the study of.
ARTIFICIAL INTELLIGENCE FOR SPEECH RECOGNITION. Introduction What is Speech Recognition?  also known as automatic speech recognition or computer speech.
Experimentation Duration is the most significant feature with around 40% correlation. Experimentation Duration is the most significant feature with around.
金聲玉振 Taiwan Univ. & Academia Sinica 1 Spoken Dialogue in Information Retrieval Jia-lin Shen Oct. 22, 1998.
Lecture 1 Phonetics – the study of speech sounds
Chapter 7 Speech Recognition Framework  7.1 The main form and application of speech recognition  7.2 The main factors of speech recognition  7.3 The.
Language Technologies Capability Demonstration Alon Lavie, Lori Levin, Alex Waibel Language Technologies Institute Carnegie Mellon University CATANAL Planning.
Presented By: O. Govinda Rao 3 rd MCA AITAM CH. Hari Prasad 3 rd MCA AITAM.
PREPARED BY MANOJ TALUKDAR MSC 4 TH SEM ROLL-NO 05 GUKC-2012 IN THE GUIDENCE OF DR. SANJIB KR KALITA.
Recapitulation. 2 Phonetics and Phonology Main differences between phonetics and phonology Airstream mechanism Speech Organs ConsonantsVowels Major features.
International Telecommunication Union The Fully Networked Car Geneva, 3-4 March 2010 Human Machine Interface (HMI) and signal processing for Intelligent.
How can speech technology be used to help people with disabilities?
SPEECH TECHNOLOGY An Overview Gopala Krishna. A
CS 224S / LINGUIST 285 Spoken Language Processing
Speech Recognition Calculator
Speech recognition in mobile environment Robust ASR with dual Mic
Artificial Intelligence for Speech Recognition
Speech Technology Center Solutions
3.0 Map of Subject Areas.
Biometrics Reg: AMP/HNDIT/F/F/E/2013/067.
Why Study Spoken Language?
Spoken Language Processing
Issues in Spoken Dialogue Systems
Dr. Debaleena Chattopadhyay Department of Computer Science
Speech Processing Speech Recognition
Why Study Spoken Language?
Advanced NLP: Speech Research and Technologies
Sfax University, Tunisia
Advanced NLP: Speech Research and Technologies
Automatic Speech Recognition: Conditional Random Fields for ASR
University of West Bohemia – Department of Cybernetics
Ala’a Spaih Abeer Abu-Hantash Directed by Dr.Allam Mousa
TECHNOLOGICAL PROGRESS
A HCL Proprietary Utility for
A maximum likelihood estimation and training on the fly approach
Indian Institute of Technology Bombay
Speaker Identification:
Spoken Language Processing
Emre Yılmaz, Henk van den Heuvel and David A. van Leeuwen
1-P-30 Speech-to-Speech Translation using Dual Learning and Prosody Conversion Zhaojie Luo, Yoichi Takashima, Tetsuya Takiguchi, and Yasuo Ariki (Kobe.
Presentation transcript:

Speech Processing August 4, 2005 12/2/2018

CS 224S / LINGUIST 236 Speech Recognition and Synthesis Dan Jurafsky Lecture 1: Overview and Articulatory Phonetics 12/2/2018

Applications of Speech Recognition/Understanding (ASR/ASU) Dictation Telephone-based Information (directions, air travel, banking, etc) Hands-free (in car) Second language ('L2') (accent reduction) Audio archive searching 12/2/2018

Applications of Speech Synthesis/Text-to-Speech (TTS) Games Telephone-based Information (directions, air travel, banking, etc) Eyes-free (in car) Reading/speaking for disabled Education (Reading tutors, L2) 12/2/2018

Applications of Speaker/ Language Recognition Language recognition for call routing Speaker Recognition: Speaker verification (binary decision) Voice password, telephone assistant Speaker identification (one of N) Criminal investigation 12/2/2018

State of the Art ASR speaker-independent, continuous, no noise, world’s best research systems: Human-human speech: ~13-20% Word Error Rate (WER) Human-machine speech: ~3-5% WER 12/2/2018