Seminar on Spoken Dialogue Systems

Slides:



Advertisements
Similar presentations
Hidden Information State System A Statistical Spoken Dialogue System M. Gašić, F. Jurčíček, S. Keizer, F. Mairesse, B. Thomson, K. Yu and S. Young Cambridge.
Advertisements

Dialogue Policy Optimisation
Probabilistic Adaptive Real-Time Learning And Natural Conversational Engine Seventh Framework Programme FP7-ICT
Spoken Language Understanding in Dialogue Systems Svetlana Stoyanchev 02/02/2015.
Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007) Learning for Semantic Parsing Advisor: Hsin-His.
OpenDial Framework Svetlana Stoyanchev SDS seminar 3/23.
1 Team Talk – A Report. 2 Introduction Project done as part of , Spring ’03. Design and implementation of a spoken dialog system. Thrust of project:
Heterogeneous Multi-Robot Dialogues for Search Tasks Thomas K Harris, Satanjeev (Bano) Banerjee Alexander Rudnicky AAAI Spring Symposium 2005: Dialogical.
U1, Speech in the interface:2. Dialogue Management1 Module u1: Speech in the Interface 2: Dialogue Management Jacques Terken HG room 2:40 tel. (247) 5254.
Application architectures
Programming Languages Structure
WP1 UGOT demos 2nd year review Saarbrucken Mar 2006.
Lecture Nine Database Planning, Design, and Administration
DFKI Approach to Dialogue Management Norbert Reithinger, Elsa Pecourt, Markus Löckelt
MDE In A Sales Management System: A Case Study © 2008 INRIA, TUD & SAP MDE In A Sales Management System: A Case Study Mathias Fritzsche, Hugo Bruneliere,
Course Instructor: Aisha Azeem
Find The Better Way Expand Your Voice with VXML May 10 th, 2005.
What is it? A mobile robotics system controls a manned or partially manned vehicle-car, submarine, space vehicle | Website for Students.
SDS Architectures Julia Hirschberg COMS 4706 (Thanks to Josh Gordon for slides.) 1.
Mobile Multimodal Applications. Dr. Roman Englert, Gregor Glass March 23 rd, 2006.
DQA meeting: : Learning more effective dialogue strategies using limited dialogue move features Matthew Frampton & Oliver Lemon, Coling/ACL-2006.
The Old World Meets the New: Utilizing Java Technology to Revitalize and Enhance NASA Scientific Legacy Code Michael D. Elder Furman University Hayden.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
Spoken Dialogue Systems Introduction Svetlana Stoyanchev Columbia University 01/26/2014.
Develop a fast semantic decoder for dialogue systems Capability to parse 10 – 100 ASR hypotheses in real time Robust to speech recognition noise Semantic.
ITCS 6010 SALT. Speech Application Language Tags (SALT) Speech interface markup language Extension of HTML and other markup languages Adds speech and.
Dialogue Modeling and Dialogue Management Frameworks Svetlana Stoyanchev Seminar on SDS, Columbia 2/16/2015.
Evaluation of SDS Svetlana Stoyanchev 3/2/2015. Goal of dialogue evaluation Assess system performance Challenges of evaluation of SDS systems – SDS developer.
UML based expert system generation Using Enterprise Architect to model and generate a web-based expert system.
Chapter 3 DECISION SUPPORT SYSTEMS CONCEPTS, METHODOLOGIES, AND TECHNOLOGIES: AN OVERVIEW Study sub-sections: , 3.12(p )
Spoken Dialog Systems and Voice XML Lecturer: Prof. Esther Levin.
Reinforcement Learning for Spoken Dialogue Systems: Comparing Strengths & Weaknesses for Practical Deployment Tim Paek Microsoft Research Dialogue on Dialogues.
Learning Automata based Approach to Model Dialogue Strategy in Spoken Dialogue System: A Performance Evaluation G.Kumaravelan Pondicherry University, Karaikal.
SIMO SIMulation and Optimization ”New generation forest planning system” Antti Mäkinen Dept. of Forest Resource Management / University of Helsinki.
DIALOG SYSTEMS FOR AUTOMOTIVE ENVIRONMENTS Presenter: Joseph Picone Inst. for Signal and Info. Processing Dept. Electrical and Computer Eng. Mississippi.
4/5/2007Data handling and transfer in the LHCb experiment1 Data handling and transfer in the LHCb experiment RT NPSS Real Time 2007 FNAL - 4 th May 2007.
1 Boostrapping language models for dialogue systems Karl Weilhammer, Matthew N Stuttle, Steve Young Presenter: Hsuan-Sheng Chiu.
16.0 Spoken Dialogues References: , Chapter 17 of Huang 2. “Conversational Interfaces: Advances and Challenges”, Proceedings of the IEEE,
Systems Analysis and Design in a Changing World, Fourth Edition
ENTERFACE 08 Project 1 “MultiParty Communication with a Tour Guide ECA” Mid-term presentation August 19th, 2008.
A Common Ground for Virtual Humans: Using an Ontology in a Natural Language Oriented Virtual Human Architecture Arno Hartholt (ICT), Thomas Russ (ISI),
Framework for MDO Studies Amitay Isaacs Center for Aerospace System Design and Engineering IIT Bombay.
Architecture of Decision Support System
SIMO SIMulation and Optimization ”New generation forest planning system” Antti Mäkinen & Jussi Rasinmäki Dept. of Forest Resource Management.
Intelligent Robot Architecture (1-3)  Background of research  Research objectives  By recognizing and analyzing user’s utterances and actions, an intelligent.
DIALOG SYSTEMS FOR AUTOMOTIVE ENVIRONMENTS Presenter: Joseph Picone Inst. for Signal and Info. Processing Dept. Electrical and Computer Eng. Mississippi.
Information state and dialogue management in the TRINDI Dialogue Move Engine Toolkit, Larsson and Traum 2000 D&QA Reading Group, Feb 20 th 2007 Genevieve.
1 Context-Aware Internet Sharma Chakravarthy UT Arlington December 19, 2008.
Usable Security – CS 6204 – Fall, 2009 – Dennis Kafura – Virginia Tech Policy Authoring Matthew Dunlop Usable Security – CS 6204 – Fall, 2009 – Dennis.
SQL Based Knowledge Representation And Knowledge Editor UMAIR ABDULLAH AFTAB AHMED MOHAMMAD JAMIL SAWAR (Presented by Lei Jiang)
ARCHITECTURES AND STANDARDS FOR IVAS AT THE SOCIAL COGNITIVE SYSTEMS GROUP H. van Welbergen, K. Bergmann, H. Buschmeier, S. Kahl, I. de Kok, A. Sadeghipour,
Listener-Control Navigation of VoiceXML. Nuance Speech Analysis 92% of customer service is through phone. 84% of industrialists believe speech better.
Conceptual Design Dr. Dania Bilal IS588 Spring 2008.
Develop a fast semantic decoder for dialogue systems Capability to parse 10 – 100 ASR hypothesis in real time Robust to speech recognition noise Trainable.
Seminar on SDS Final class 5/4/2015. Topics discussed ASR, NLU in dialogue Dialogue management NLG, information presentation Evaluation Error recovery.
W3C Multimodal Interaction Activities Deborah A. Dahl August 9, 2006.
VoiceXML. Nuance Speech Analysis 92% of customer service is through phone. 84% of industrialists believe speech better than web.
Ganga/Dirac Data Management meeting October 2003 Gennady Kuznetsov Production Manager Tools and Ganga (New Architecture)
Learning Procedural Knowledge through Observation -Michael van Lent, John E. Laird – 인터넷 기술 전공 022ITI02 성유진.
B. Franek, poster presented at Computing in High Energy and Nuclear Physics, Praha – Czech Republic, 21 – 27 March 2009 This framework provides: -A method.
Inquiry learning and SimQuest
Learning to Program D is for Digital.
Self Healing and Dynamic Construction Framework:
Chapter 6. Data Collection in a Wizard-of-Oz Experiment in Reinforcement Learning for Adaptive Dialogue Systems by: Rieser & Lemon. Course: Autonomous.
Issues in Spoken Dialogue Systems
Towards Understanding End-of-trip Instructions in a Taxi Ride Scenario
Lecture 1: Multi-tier Architecture Overview
Voice Activation for Wealth Management
Chapter 5 Architectural Design.
Spoken Dialogue Systems
Presentation transcript:

Seminar on Spoken Dialogue Systems Columbia University Svetlana Stoyanchev 2/23/15 2/23/15

Today’s agenda Discussion of the POMDP paper (Edward Li) Project presentations by the teams Information Presentation in SDS, paper presentations: Taghi Paksima, Kallirroi Georgila, and Johanna D. Moore. Evaluating the Effectiveness of Information Presentation in a Full End-to-End Dialogue System. In Proceedings of the 10th Annual SIGdial Meeting on Discourse and Dialogue, pp. 1-10, London, UK, 2009. [Presenter: Samara Trilling ] Margaret Mitchell, Dan Bohus, Ece Kamar Crowdsourcing Language Generation Templates for Dialogue Systems. INLG, 2014 [presenter:Sarah Ita Levitan] Discussion of frameworks 2/23/15

Frameworks Wit.AI Vs. OpenDial 2/23/15

Wit.AI NLU/ASR interface Provides interface to create and annotate utterance examples for your domain Interface with different platforms: iOS, Android, Ruby, Python, Web (javascript), etc. NO Dialogue Management framework You would need to implement DM yourself. 2/23/15

ExampleSystem Architecture using Wit.AI Wit AI interface Open-source/freeware TTS (Festival, Nuance, AT&T) json text Intent + concepts Python DM While (get input): perform action given intent&concepts: Ask for cuisine Ask for price range Ask for neighborhood Look up restaurant Present result State/Frame, e.g Restaurant domain Cuisine : ? Price range : ? Neighborhood: ? 2/23/15

OpenDial DM Framework implemented by Pier Lison for his PhD thesis, 2014 Blackboard architecture Multiple modules “cooperate” to Interpret system input Maintain SDS state Decide on the action to perform DM components can run either synchronously or asynchronously Seminar on SDS; Intro

OpenDial ASR/TTS: OpenDial comes with support for commercial off-the shelve ASR (Nuance & AT&T Watson) Components: Rule-structured (Rules are defined in XML): collection of probabilistic rules with trigger variables NLU, DM, NLG External components: operate by monitoring dialogue state ASR, TTS, domain specific (visualization, back-end, robot input sensors) 2/23/15

OpenDial Aims to combine flexibility of Information State architecture (Traum & Larsson) with robustness and flexibility of statistically optimized SDS Use of N-best list of hypotheses allows to take advantage of the uncertainty of ASR Declarative specification of the dialogue in terms of rule-structured models facilitated SDS development 2/23/15

Open Dial Examples See opendial->domains->examples-> Pizzas Simulator-step-by-step.xml (not probabilistic) Instructions to a robot to move around a grid Example with probabilistic DM (util values are set in the rules) Simulator (probabilistic) Flight booking Flight-booking-nlu.xml Flight-booking-nlg.xml Flignt-booking-dm.xml 2/23/15

Open Dial Defines dialogue in terms of utterances and dialogue acts U_u - Utterance user U_m – Utterance machine A_u – dialogue Act user A_m – dialogue Act machine Define dialogue rules using ‘triggers’ and ‘effects’ (if-then-else) using XML representation Allows probabilistic rules 2/23/15

NL Understanding Define rules that trigger conditionally on content of a user utterance 2/23/15

Rules for user utterances (trigger=“u_u”) Simple NLU rule testing equality of a user utterance and setting a response utterance u_ms 2/23/15

NLU for simulator-step-by-step Conditional NLU rule testing for a user utterance and setting response dialog act 2/23/15

DM (action selection) for simulator-step-by-step 2/23/15

NLG for simulator-step-by-step 2/23/15

Overall picture NLU DM Trigger: u_u (user utterance) Effect: set a_u (user act) DM Trigger: a_u (act user) Effect: a_m (act machine) NLG Trigger: a_m (act machine) Effect: u_m (utterance machine) 2/23/15

Probabilistic rules The DM applies probability/utility rules (see Lison’s thesis chapter 4) Probability rules Form: If … then … else … condition effect probability (of the effect) cumulative effect of a condition sum to 1 Utility most utility rules only include one single decision variable the possibility to integrate multiple decision variables is helpful in domains where the system can execute multiple actions in parallel (for instance to communicate through both verbal and non-verbal channels) 2/23/15

Example of a utility rule Read chapter 4 of P.Lison’s thesis for more info on probability and utility rules 2/23/15

Suggestions If using OpenDial, read chapter 7 of P. Lison’s thesis Develop DM functionality using text interface Start by understanding example dialogs Start with non-probabilistic DM Explore probabilistic rules to handle ASR errors Plug in speech Post questions on: the class discussion board OpenDial discussion forum 2/23/15

Next Class (3/2) Feedback on your ideas 1 paper on NLG Brian McMahan and Matthew Stone Training an integrated sentence planner on user dialogue SigDial, 2014 [ presenter: Ananya Aniruddh Poddar] 2 papers on Evaluation F. Jurčíček and S. Keizer and F. Mairesse and B. Thomson and K. Yu and S. Young Real user evaluation of spoken dialogue systems using Amazon Mechanical Turk. in Proceedings of Interstpeech, 2011 K. Georgila, J. Henderson, and O. Lemon. 2005. Learning User Simulations for Information State Update Dialogue Systems. In Proceedings of Interspeech. 2/23/15