MITRE Dialog Management Workshop – a review Dan Bohus Dialogs on Dialogs reading group CMU, November 2003.

Slides:



Advertisements
Similar presentations
Key architectural details RavenClaw: Dialog Management Using Hierarchical Task Decomposition and an Expectation Agenda Dan BohusAlex Rudnicky School of.
Advertisements

Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 4 Shieber 1993; van Deemter 2002.
Goteborg University Dialogue Systems Lab Motivation for using GF with GoDiS TALK meeting Edinburgh 7/
An information state approach to natural interactive dialogue Staffan Larsson, Robin Cooper Department of linguistics Göteborg University, Sweden.
CS 484 – Artificial Intelligence1 Announcements Choose Research Topic by today Project 1 is due Thursday, October 11 Midterm is Thursday, October 18 Book.
© Chinese University, CSE Dept. Software Engineering / Software Engineering Topic 1: Software Engineering: A Preview Your Name: ____________________.
INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING NLP-AI IIIT-Hyderabad CIIL, Mysore ICON DECEMBER, 2003.
Task Learning in COLLAGEN The COLLAGEN Architecture; Task Learning from Demonstrations Mitshubishi Electric Research Labs Andrew Garland, Neal.
COLLAGEN: When Agents Collaborate with People Charles Rich and Candace L. Sidner Presented by Daniel Schulman.
Chapter 1 Introduction. “How do I send picture by ?” “Click on Attach button, or paper clip icon, select the picture and click attach” The instructions.
Dialogue in Intelligent Tutoring Systems Dialogs on Dialogs Reading Group CMU, November 2002.
LE TRINDIKIT A toolkit for building and experimenting with dialogue move engines and systems, based on the information state approach.
1 Learning from Behavior Performances vs Abstract Behavior Descriptions Tolga Konik University of Michigan.
Goteborg University Dialogue Systems Lab WP1: GoDiS VCR application Edinburgh TALK meeting 7/
Question Accommodation and Information States in Dialogue
Spoken Dialog Management for an Astronaut’s Procedure Assistant Presented by: Dan Bohus Collaborators: Gregory Aist, RIALIST Group.
Madeleine, a RavenClaw Exercise in the Medical Diagnosis Domain Dan Bohus, Alex Rudnicky MITRE Workshop on Dialog Management, Boston, October 2003.
TrindiKit A toolkit for building and experimenting with dialogue move engines and systems, based on the information state approach.
LE A toolkit for building and experimenting with dialogue move engines and systems, based on the information state approach TrindiKit.
Goteborg University Dialogue Systems Lab GoDiS and TrindiKit MITRE workshop 27/10-03 Staffan Larsson Göteborg University Sweden.
QUALITY ASSURANCE PROJECT Conducting Effective Meetings The purpose of this module is to enhance participants’ knowledge and skill in observing team meetings.
Productive Math Talk Math Alliance April 3, 2012.
31 st October, 2012 CSE-435 Tashwin Kaur Khurana.
Increasing Parent Involvement
Chapter 7 Requirement Modeling : Flow, Behaviour, Patterns And WebApps.
May Distribution authorized to U.S. Government Agencies only Symmetric Multimodal Interactive Intelligent Development Environments Dramatic reduction.
1 Design and Integration: Part 1 Nuggets about Design vs Project Management.
Learning Law Orientation: August 16, Synthesis Judgment 4. Problem Solving 3. Spotting Issues 2. Understanding 1. Knowledge 1. Recognition vs.
Author: James Allen, Nathanael Chambers, etc. By: Rex, Linger, Xiaoyi Nov. 23, 2009.
Chapter 7. BEAT: the Behavior Expression Animation Toolkit
Introduction CS 3358 Data Structures. What is Computer Science? Computer Science is the study of algorithms, including their  Formal and mathematical.
Course Instructor: K ashif I hsan 1. Chapter # 2 Kashif Ihsan, Lecturer CS, MIHE2.
1 PLAN RECOGNITION & USER INTERFACES Sony Jacob March 4 th, 2005.
MERL 1 COLLAGEN: Middleware for Building Mixed-Initiative Problem Solving Assistants ( Neal Lesh, Andy Garland, Chris Lee, David McDonald, Egon Pasztor,
Requirements Engineering Requirements Elicitation Process Lecture-8.
An information state approach to natural interactive dialogue Staffan Larsson, Robin Cooper Department of linguistics Göteborg University, Sweden.
Introduction CS 3358 Data Structures. What is Computer Science? Computer Science is the study of algorithms, including their  Formal and mathematical.
1 Introduction to Software Engineering Lecture 1.
Well-behaved objects Main concepts to be covered Testing Debugging Test automation Writing for maintainability Objects First with Java - A Practical.
Dept. of Computer Science University of Rochester Rochester, NY By: James F. Allen, Donna K. Byron, Myroslava Dzikovska George Ferguson, Lucian Galescu,
Issues in Multiparty Dialogues Ronak Patel. Current Trend  Only two-party case (a person and a Dialog system  Multi party (more than two persons Ex.
ENTERFACE 08 Project 1 “MultiParty Communication with a Tour Guide ECA” Mid-term presentation August 19th, 2008.
A Common Ground for Virtual Humans: Using an Ontology in a Natural Language Oriented Virtual Human Architecture Arno Hartholt (ICT), Thomas Russ (ISI),
1 Design and Integration: Part 2. 2 Plus Delta Feedback Reading and lecture repeat Ambiguous questions on quizzes Attendance quizzes Boring white lecture.
Information state and dialogue management in the TRINDI Dialogue Move Engine Toolkit, Larsson and Traum 2000 D&QA Reading Group, Feb 20 th 2007 Genevieve.
Generic Tasks by Ihab M. Amer Graduate Student Computer Science Dept. AUC, Cairo, Egypt.
Human Computer Interaction
Information-State Dialogue Modelling in Several Versions HS Dialogmanagement, SS 2002 Universität Saarbrücken Michael Götze.
Article Summary of The Structural Complexity of Software: An Experimental Test By Darcy, Kemerer, Slaughter and Tomayko In IEEE Transactions of Software.
FORESTUR How to work… …with this training platform? …with this methodology?
Dialog Models September 18, 2003 Thomas Harris.
Effective Presentation Skills. Why Are You Here? What do you want to achieve by the end of this workshop? What do you want to achieve by the end of this.
1 Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents S. Kawamoto, et al. October 27, 2004.
1 USC INFORMATION SCIENCES INSTITUTE EXPECT TEMPLE: TEMPLate Extension Through Knowledge Acquisition Yolanda Gil Jim Blythe Information Sciences Institute.
University of North Alabama
ANALYSIS PHASE OF BUSINESS SYSTEM DEVELOPMENT METHODOLOGY.
MERL 1 COLLAGEN: Applying Collaborative Discourse Theory to Human-Computer Interaction Charles Rich Candace L. Sidner Neal Lesh Mitsubishi Electric Research.
n Taking Notes and Keeping a Journal n Listening Skills n Working Together n Managing Your Time.
A Speech Interface to Virtual Environment Authors Scott McGlashan and Tomas Axling Swedish Institute of Computer Science.
Agent-Based Dialogue Management Discourse & Dialogue CMSC November 10, 2006.
1 Automating Network Diagnostics to Help End-Users Dave Thaler
Design Evaluation Overview Introduction Model for Interface Design Evaluation Types of Evaluation –Conceptual Design –Usability –Learning Outcome.
16 April 2011 Alan, Edison, etc, Saturday.. Knowledge, Planning and Robotics 1.Knowledge 2.Types of knowledge 3.Representation of knowledge 4.Planning.
Andrew Garland, Neal Lesh, and Charles Rich Mitsubishi Electric Research Laboratories Responding to and Recovering from Mistakes during Collaboration.
The Semiotic Engineering of Human-Computer Interaction Section I Foundation Chapter 1 Introduction.
DSS: Decision Support Systems and AI: Artificial Intelligence
Architecture Components
Sphinx Lunch Talk Carnegie Mellon University, October 2004
Applying Use Cases (Chapters 25,26)
Applying Use Cases (Chapters 25,26)
Presentation transcript:

MITRE Dialog Management Workshop – a review Dan Bohus Dialogs on Dialogs reading group CMU, November 2003

MITRE Dialog Management Workshop The Workshop  MITRE Dialog Workshop MITRE, Bedford/Boston  October 27-28, 2003  Idea  Bring together researchers working on dialog management  Give them a homework Adapt you dialog manager to a medical diagnosis domain (details in a sec)  Discuss, compare, learn workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop The Homework  Implement a dialog system for the medical diagnosis domain  Task left open-ended (diagnosis, tutoring, etc)  No speech, just text in and out  Backend provided backend.docbackend.doc Java version and web-based interface version 3 diseases: malaria, coccidioidomycosis, another one List of symptoms: headache, nausea, muscle pain, etc. Decision tree involving symptoms and tests (fever, blood tests, travel patterns, etc)  Small enough to presumably not be lots of work, but large enough to allow illustration of functionalities, and provide some skeleton to the discussions… workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Participants  MITRE (Carl Burke et al) MiDiKi  Gothenburg (Staffan Larsson) GoDiS (TRINDIKit)  USC ICT (David Traum) ICT Dialogue Manager  NTT/CMU (Matthias Denecke) Ariadne  CMU (Dan, Alex) RavenClaw  Ames (Beth-Ann Hockey) NASA Dialogue Manager  DFKI (Norbert Reithinger) DFKI Dialogue Manager  MERL (Candy Sidner, Charles Rich) COLLAGEN … and others invited but not present workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop GoDiS workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop GoDiS  TRINDIKit – information state update dialogue management toolkit  Information state Private: dialog plan, beliefs, agenda (short term goals) Shared: established facts, QUD, last utterance information  Dialog moves  Update rules  GoDiS: dialog management system implemented in TRINDIKit, handing:  information oriented dialogue  action oriented dialogue workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop TRINDIKit / GoDiS architecture input inter- pret TIS DEVICES LEXICON DOMAIN backend interface control update select gene- rate output lexicon domain knowledge DME Dialog plans Ontology Connectio n to Java Backend workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop GoDiS: Task Representation  Plans; propositional logic  Dialogue plans for dealing with diagnosis (issues opened at dialogue start)  ?x.disease(x): ”which disease is diagnosed?”  ?confirmed_by_interview: ”Is the diagnosis confirmed by additional information?”  ?confirmed_by_tests: ”Is the diagnosis confirmed by medical tests?”  Additional plans  ?x.info(x): ”What information is there about a given disease?”  ?x.treatment(x): ”What treatment is there for a given disease?” workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop GoDiS: Alternate Tasks  User-driven dialogue (implemented)  Not load issues when resetting; user has to raise all issues  User can ask system to Provide a diagnosis Confirm whether user has given disease  Decision trees as dialogue plans  Move backend knowledge into dialogue plans  Information conversion could be done automatically  Separate genre: expert system dialogue  Add special purpose update rules  Dynamic dialogue planning by expert workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop GoDiS: Highlights / Lowlights  Highlights:  Reuse, you get for free: Grounding Accomodation / plan recognition Multiple simultaneous issues & info sharing  High-level abstraction for dialog plans Rapid prototyping  Lowlights  Not used in this type of domain so far, so not entirely straight-forward (update rule changes)  Dynamic dialog plans (backend decides) workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop GoDiS RavenClaw workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop RavenClaw  Captures all domain-specific dialog (task) logic with a hierarchical description  The authoring effort is focused entirely here Dialog Task (Specification) Domain-independent Dialog Engine  Manages dialog by executing the dialog task specification  Provides domain-independent conversational strategies workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop RavenClaw Architecture Dialog Stack Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome Expectation Agenda general_feeling chart have_fever diagnostic workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop RavenClaw Architecture Dialog Stack Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome Expectation Agenda general_feeling chart have_fever diagnostic workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop RavenClaw Architecture Dialog Stack Madeleine Welcome Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome Expectation Agenda general_feeling chart have_fever diagnostic workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop RavenClaw Architecture Dialog Stack Madeleine Hi, this is Madeleine, the automated… Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome Expectation Agenda general_feeling chart have_fever diagnostic workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop RavenClaw Architecture Dialog Stack Madeleine Hi, this is Madeleine, the automated… Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome LoadSymptoms R:HeadacheR: Expectation Agenda general_feeling chart have_fever diagnostic headache workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop RavenClaw Architecture Dialog Stack Madeleine Hi, this is Madeleine, the automated… Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome R:HeadacheR: Expectation Agenda general_feeling chart have_fever diagnostic headache workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop RavenClaw Architecture Dialog Stack Madeleine Hi, this is Madeleine, the automated… Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome R:HeadacheR: GeneralFeel Expectation Agenda general_feeling chart have_fever diagnostic headache workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop RavenClaw Architecture Dialog Stack Madeleine Hi, this is Madeleine, the automated… Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome R:HeadacheR: GeneralFeel How are you feeling today? general_feeling chart have_fever diagnostic HowAreYou Expectation Agenda general_feeling: [good], [bad], [soso] GeneralFeel I:GladI:Sorry Not so good, I think I have a fever [soso](not so good) [fever](I think I have a fever) headache GeneralFeel workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Illustrated Features  Dynamic generation of dialog task structure  Symptoms loaded from backend, appropriate structures to “talk about them” created on-the-fly  New symptoms – no DM changes  Dynamic dialog control policy  The order in which symptoms are addressed is controlled by the backend  Conversational skills workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Illustrated Features  Dynamic generation of dialog task structure  Symptoms loaded from backend, appropriate structures to “talk about them” created on-the-fly  New symptoms – no DM changes  Dynamic dialog control policy  The order in which symptoms are addressed is controlled by the backend  Conversational skills workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Dynamic Dialog Control … Dialog Stack Madeleine Hi, this is Madeleine, the automated… How are you today? Not so good, I think I have a headache Sorry to hear you’re not feeling so good, Tell me more about your symptoms… Do you have abdominal pain? Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome R:HeadacheR: Diagnose Expectation Agenda general_feeling chart have_fever diagnostic headache Backend Decision Tree workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Illustrated Features  Dynamic generation of dialog task structure  Symptoms loaded from backend, appropriate structures to “talk about them” created on-the-fly  New symptoms – no DM changes  Dynamic dialog control policy  The order in which symptoms are addressed is controlled by the backend  Conversational skills workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Conversational Skills  Corresponding agencies added automatically to the dialog task tree  Help  What Can I Say?  Repeat  Suspend / Resume  Start Over  Timeout handling (not illustrated)  Still need all the language generation prompts and grammar, but some of those are develop-once, too workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop RavenClaw Conclusion  Highlights  Set task posed no challenges to the framework Easy to implement  Dynamic dialog structure and control  Automatic use of domain-independent conversational skills  Lowlights?  Toolkit perspective: how easy would it be for someone else to build it?  Asynchronous behaviors? (timing)  Couple of bugs / fixes (or is that a highlight?) workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop GoDiS Collagen workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop COLLAGEN Collaborative Interface Agent communicate interact observe plan tree focus stack * Collagen workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop COLLAGEN Systems  air travel planning  reading and responding (w. IBM/Lotus)  GUI design tool operation  car navigation system operation  airport landing path planning (w. MITRE)  gas turbine operator training (w. USC/ISI)  personal video recorder operation  programmable thermostat operation (with Delft U.)  multi-modal web-based form-filling workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Collagen: Theory and Implementation Intentional purposes, contributes Linguistic segments, lexical items Attentional focus spaces, focus stack SharedPlan Discourse Theory (Grosz, Sidner, Kraus, Lochbaum ) Java Implementation focus stack purpose tree workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Collagen: Discourse Segments and Purposes (Grosz, 1974) E: Replace the pump and belt please. A: Ok, I found a belt in the back. A: Is that where it should be? A: [removes belt] A: It’s done. E: Now remove the pump. … E: First you have to remove the flywheel. … E: Now take the pump off the base plate. A: Already did. replace belt replace pump replace pump and belt (fixing an air compressor, E = expert, A = apprentice) workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Discourse state representation E: Replace the pump and belt please. A: Ok, I found a belt in the back. A: Is that where it should be? A: [removes belt] A: It’s done Focus Stack replace belt replace pump and belt Purpose Tree replace pump and belt replace pumpreplace belt current focus space (Grosz & Sidner, 1986) replace belt replace pump and belt workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Discourse interpretation algorithm (Lochbaum, 1998) starts a new segment/focus space (push) ends the current segment/focus space (pop) continues (contributes to) the current segment/... (add) The current (communication or manipulation) act either: focus stack directly achieves the purpose is a step in the plan for the purpose * identifies the recipe used to achieve the purpose identifies who should perform the purpose or a step in the plan identifies a parameter of the purpose or a step in the plan An act contributes to the purpose of a segment if it: purpose tree * does not include recursive plan recognition (see later topic) workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop COLLAGEN … my take  Separation of task from dialog/discourse engine  Recipes / Domain plans / Task tree  Full-blown HTN Hierarchical Preconditions (constraints) Effects Completion / failure Live nodes  Stack to keep track of focus and discourse structure  Tree explicitly contains agent and user nodes  Formalized / descriptive recipe specs (actually Java underneath), with procedure overwrites… workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop GoDiS Themes … workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Themes: Task Representation  Task representation  Separation of task representation from dialog engine  High-level representations of task  Descriptive rather than procedural Procedural will be unavoidable for complex tasks Expressive power  GoDiS, RavenClaw, Collagen: plan based representations of task workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Themes: Task/Domain/Gendre  The notion of dialog gendre  Tutoring  Diagnosis  Information Access  Where to fold it in a dialog manager?  GoDiS: update/select rules  Ariadne: plugins  RavenClaw: collapsed with task  How clear is that separation: task vs. gendre? workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Themes: Development time  Systems took on the order of 3-5 days to develop  Significant effort in the backend connection Some sites shortcut it  Significant effort in grammar/language generation development Some sites shortcut it  Everyone that had an implementation: “fixed a couple of bugs, but no major changes required” workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Themes: Development tools  Regression testing (GoDiS)  Systems are complex. Change something in a dialog management framework, can you prove that it did not screw up things that used to work?  System-wise, very intractable  Component-wise, maybe: i.e. DM with DM inputs/outputs  System diagnosis / log visualization tools (Collagen) workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Themes: Timing  (Micro)timing  unaddressed  Turn-taking models  in general, very rudimentary  Asynchronous behaviors  Could be accomplished, but no-one seemed to have it  Multi-party conversation  unaddressed workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Themes: the important problems  Different people have different views of what those are:  Plan / Intention recognition  Reference resolution  Backup in complex systems  Tense problems  Negations  Grounding; error prevention / recovery workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Themes: Reasoning  Dialog Managers vs Backends  Where to draw the line?  Who does the reasoning?  Can we avoid duplicating it?  How rich is the interaction between them? Dialog systems - use language to act in a domain, so they are generally strongly tied Basic set of conversational skills can be identified  Drawing that line is still an “art”, no general agreement or solutions exist workshop : godis : ravenclaw : collagen : themes

MITRE Dialog Management Workshop Themes: Science of Dialog?  How much science do we have?  Theory vs. experiment  Interesting Collagen / RavenClaw similarities  Representation or not?  GUI analogy  Do we have the checkboxes and radio-buttons? workshop : godis : ravenclaw : collagen : themes