Spoken Dialogue Systems and the GALAXY Architecture 29 October 2000 Advanced Technology Laboratories 1 Federal Street A&E Building 2W Camden, New Jersey.

Slides:



Advertisements
Similar presentations
Spoken Dialogue Systems SIG-AI Fall 2003 By: Sachin Kamboj.
Advertisements

Non-Native Users in the Let s Go!! Spoken Dialogue System: Dealing with Linguistic Mismatch Antoine Raux & Maxine Eskenazi Language Technologies Institute.
1990s DARPA Programmes WSJ and BN Dapo Durosinmi-Etti Bo Xu Xiaoxiao Zheng.
TECHNOLOGY FOR MOBILE ADVERTISING SEARCH & COMMERCE © 2007 Apptera Inc. Optimizing Software Architecture for Voice Search SpeechTek 2007.
Dialogue – Driven Intranet Search Suma Adindla School of Computer Science & Electronic Engineering 8th LANGUAGE & COMPUTATION DAY 2009.
CSE111: Great Ideas in Computer Science Dr. Carl Alphonce 219 Bell Hall Office hours: M-F 11:00-11:
U1, Speech in the interface:2. Dialogue Management1 Module u1: Speech in the Interface 2: Dialogue Management Jacques Terken HG room 2:40 tel. (247) 5254.
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
SPOKEN LANGUAGE SYSTEMS MIT Computer Science and Artificial Intelligence Laboratory Mitchell Peabody, Chao Wang, and Stephanie Seneff June 19, 2004 Lexical.
ITCS 6010 Spoken Language Systems: Architecture. Elements of a Spoken Language System Endpointing Feature extraction Recognition Natural language understanding.
Speech recognition, understanding and conversational interfaces Alexander Rudnicky School of Computer Science
Jumping Off Points Ideas of possible tasks Examples of possible tasks Categories of possible tasks.
Conversational Agent 1.Two layers: Dialogue manager and Conversational agent. 2.Rule-Based Translator (ELIZA and PARRY) 3. Layer one: Dialogue Manager.
Equal-party Conversation System for Language Learning Chih-yu Chao (advisor: Stephanie Seneff) April 14 th, 2006 Dialogs on Dialogs Reading Group.
Spoken Dialog Management for an Astronaut’s Procedure Assistant Presented by: Dan Bohus Collaborators: Gregory Aist, RIALIST Group.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
High-quality Speech Translation for Language Learning Chao Wang and Stephanie Seneff June 24, 2004 Spoken Language Systems Group MIT Computer Science and.
Natural Language Understanding
Track: Speech Technology Kishore Prahallad Assistant Professor, IIIT-Hyderabad 1Winter School, 2010, IIIT-H.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
MMP - M204 Information Design/Cross Media Publishing - Spoken Language Interfaces - Dr. Ingrid Kirschning (UDLA)1 4. Speech Synthesis –Introduction to.
1M4 speech recognition University of Sheffield M4 speech recognition Martin Karafiát*, Steve Renals, Vincent Wan.
1 7-Speech Recognition (Cont’d) HMM Calculating Approaches Neural Components Three Basic HMM Problems Viterbi Algorithm State Duration Modeling Training.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Role-plays for CALL: System Architecture and Resources Sabrina Wilske & Magdalena Wolska Saarland University ICL, Villach, September.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
Aardvark Anatomy of a Large-Scale Social Search Engine.
Research Challenges for Spoken Language Dialog Systems Julie Baca, Ph.D. Assistant Research Professor Center for Advanced Vehicular Systems Mississippi.
7-Speech Recognition Speech Recognition Concepts
Experiments on Building Language Resources for Multi-Modal Dialogue Systems Goals identification of a methodology for adapting linguistic resources for.
1 Computational Linguistics Ling 200 Spring 2006.
Controlling Agents with Natural Language Jared Allen 2005 University of Arkansas EiA Agent team.
CMU Shpinx Speech Recognition Engine Reporter : Chun-Feng Liao NCCU Dept. of Computer Sceince Intelligent Media Lab.
May 2006CLINT-CS Verbmobil1 CLINT-CS Dialogue II Verbmobil.
DS-UCAT: A New Multimodal Dialogue System For An Academic Application Ramón López-Cózar 1, Zoraida Callejas 1, Germán Montoro 2 1 Dept. of Languages and.
Natural Language Processing Rogelio Dávila Pérez Profesor – Investigador
Lessons Learned Mokusei: Multilingual Conversational Interfaces Future Plans Explore language-independent approaches to speech understanding and generation.
Topics in Artificial Intelligence: Discourse and Dialogue CS 359 Gina-Anne Levow September 25, 2001.
DIALOG SYSTEMS FOR AUTOMOTIVE ENVIRONMENTS Presenter: Joseph Picone Inst. for Signal and Info. Processing Dept. Electrical and Computer Eng. Mississippi.
Weak AI: Can Machines Act Intelligently? Some things they can do: –Computer vision: face recognition from a large set –Robotics: autonomous (mostly) car.
ENTERFACE 08 Project 2 “multimodal high-level data integration” Mid-term presentation August 19th, 2008.
1 Boostrapping language models for dialogue systems Karl Weilhammer, Matthew N Stuttle, Steve Young Presenter: Hsuan-Sheng Chiu.
16.0 Spoken Dialogues References: , Chapter 17 of Huang 2. “Conversational Interfaces: Advances and Challenges”, Proceedings of the IEEE,
ENTERFACE 08 Project 1 “MultiParty Communication with a Tour Guide ECA” Mid-term presentation August 19th, 2008.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Introduction to Dialogue Systems. User Input System Output ?
L C SL C S SpeechBuilder: Facilitating Spoken Dialogue System Creation Eugene Weinstein Project Oxygen Core Team MIT Laboratory for Computer Science
DIALOG SYSTEMS FOR AUTOMOTIVE ENVIRONMENTS Presenter: Joseph Picone Inst. for Signal and Info. Processing Dept. Electrical and Computer Eng. Mississippi.
Theban Stanley, Julie Baca, Matt Elliott and Joseph Picone Human and Systems Engineering Center for Advanced Vehicular Systems Mississippi State University.
Explainable Adaptive Assistants Deborah L. McGuinness, Tetherless World Constellation, RPI Alyssa Glass, Stanford University Michael Wolverton, SRI International.
M. Liu, T. Stanley, J. Baca and J. Picone Intelligent Electronic Systems Center for Advanced Vehicular Systems Mississippi State University URL:
1 Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents S. Kawamoto, et al. October 27, 2004.
金聲玉振 Taiwan Univ. & Academia Sinica 1 Spoken Dialogue in Information Retrieval Jia-lin Shen Oct. 22, 1998.
Presentation Title 1 1/27/2016 Lucent Technologies - Proprietary Voice Interface On Wireless Applications Protocol A PDA Implementation Sherif Abdou Qiru.
Language in Cognitive Science. Research Areas for Language Computational models of speech production and perception Signal processing for speech analysis,
L C S Spoken Language Systems Group Stephanie Seneff Spoken Language Systems Group MIT Laboratory for Computer Science January 13, 2000 Multilingual Conversational.
Language Technologies Capability Demonstration Alon Lavie, Lori Levin, Alex Waibel Language Technologies Institute Carnegie Mellon University CATANAL Planning.
Preparing for the 2008 Beijing Olympics : The LingTour and KNOWLISTICS projects. MAO Yuhang, DING Xiao-Qing, NI Yang, LIN Shiuan-Sung, Laurence LIKFORMAN,
MERL 1 COLLAGEN: Applying Collaborative Discourse Theory to Human-Computer Interaction Charles Rich Candace L. Sidner Neal Lesh Mitsubishi Electric Research.
Message Source Linguistic Channel Articulatory Channel Acoustic Channel Observable: MessageWordsSounds Features Bayesian formulation for speech recognition:
1 7-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches Recognition Theories Bayse Rule Simple Language Model P(A|W) Network Types.
Automated Assistant for Crisis Management (Reflective Agent with Distributed Adaptive Reasoning) RADAR.
AQUAINT Mid-Year PI Meeting – June 2002 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
Assessing Grammar Module 5 Activity 5.
Assessing Grammar Module 5 Activity 5.
CSCE 590 Web Scraping – Information Retrieval
Spoken Language Processing
Issues in Spoken Dialogue Systems
Advanced NLP: Speech Research and Technologies
Dialogue State Tracking & Dialogue Corpus Survey
Presentation transcript:

Spoken Dialogue Systems and the GALAXY Architecture 29 October 2000 Advanced Technology Laboratories 1 Federal Street A&E Building 2W Camden, New Jersey Jerry Franke Senior Member, Engineering Staff

September 99 Talk Outline Spoken Language DevelopmentSpoken Language Development GALAXY II SystemGALAXY II System SUMMIT (speech recognition)SUMMIT (speech recognition) TINA (natural language parsing)TINA (natural language parsing) GENESIS (natural language generation)GENESIS (natural language generation) Turn ManagementTurn Management GALAXY II DomainsGALAXY II Domains Overview/DemonstrationOverview/Demonstration

September 99 Spoken Language Development UniversitiesUniversities –MIT, CMU, Colorado Basic research labsBasic research labs –ATT, SRI Software developersSoftware developers –Nuance, SpeechWorks Domain developersDomain developers –Lockheed Martin - ATL

September 99 GALAXY II System Developed by MIT Spoken Language Systems groupDeveloped by MIT Spoken Language Systems group Multiple servers performing parts of the dialogue processMultiple servers performing parts of the dialogue process Speech Recognition Language Understanding Turn Management Language Generation Speech Synthesis SoundSound SoundSound SUMMITTINAGENESIS

September 99 SUMMIT (speech recognition) Three elements: vocabulary, language models, acoustic modelsThree elements: vocabulary, language models, acoustic models Pause words stripped outPause words stripped out AM: segment-based models and boundary-based diphone modelsAM: segment-based models and boundary-based diphone models LM: Forward Viterbi search with a class bigram model, followed by a backward A* search with a class trigram modelLM: Forward Viterbi search with a class bigram model, followed by a backward A* search with a class trigram model Produces N-best list or word graph of possible utterancesProduces N-best list or word graph of possible utterances Models trained on domain corpusModels trained on domain corpus Models achieve speaker-independenceModels achieve speaker-independence

September 99 TINA (natural language parsing) Selects from N-best list depending on grammatical parseSelects from N-best list depending on grammatical parse Grammars reflect both syntactic and semantic structureGrammars reflect both syntactic and semantic structure Result is a semantic frameResult is a semantic frame Example:Example: “Where is the library in Swain Hall?” Clause: LOCATE Topic: PUBLIC-BUILDING Quantifier: DEF Name: library Predicate: IN Topic: HALL Name: Swain

September 99 GENESIS (natural language generation) Processes semantic framesProcesses semantic frames Embeds semantic frame components into context-dependent message templatesEmbeds semantic frame components into context-dependent message templates Two types of output:Two types of output: –natural language messages –messages are sent to some speech synthesis module –possibility of output in multiple languages –keyword-value pairs –useful structure for the turn management backend Can be used to map between (translate) languagesCan be used to map between (translate) languages

September 99 Turn Management Manages the system’s part of the dialogueManages the system’s part of the dialogue Fuses current utterance with dialogue history for full contextFuses current utterance with dialogue history for full context Five main tasks:Five main tasks: –Answer user’s requests (information retrieval) –Initiate sub-dialogues to clarify the user’s request –Track progress through the dialogue –Control response to the user –Provide assistance in using the system when needed

September 99 GALAXY II Domains From MIT: –Jupiter - weather forecasts –Pegasus - airline scheduling –Voyager - Cambridge, Massachusetts city guide –Dinex - Boston restaurant guide –Wheels - automobile classified ads –Mercury - airline flight booking From Lockheed Martin - ATL: –DARPA Communicator - airline flight, hotel, car rental booking –DARPA LCS-Marine/Marine Small Unit Logistics - supply request –DARPA LCS-Army - data collection during equipment tests

September 99 Overview/Demonstration Booking airline flights (round trip and one way)Booking airline flights (round trip and one way) Hotel, car rentalHotel, car rental Uses user profile to fill in some information about reservationUses user profile to fill in some information about reservation Uses some real-world knowledgeUses some real-world knowledge Information retrieval via mobile agentsInformation retrieval via mobile agents

September 99 Try It Yourself CMU CMU MIT MIT Colorado