Goteborg University Dialogue Systems Lab Introduction to dialogue systems Staffan Larsson Dialogsystem HT04.

Slides:



Advertisements
Similar presentations
Lecture 2 - Revenue Models
Advertisements

Chapter 11 Designing the User Interface
Map of Human Computer Interaction
Technology and teaching A l(IT)eracy perspective.
Chapter 11 user support. Issues –different types of support at different times –implementation and presentation both important –all need careful design.
An information state approach to natural interactive dialogue Staffan Larsson, Robin Cooper Department of linguistics Göteborg University, Sweden.
Negotiative dialogue some definitions and ideas. Negotiation vs. acceptance Clark’s ladder: –1. A attends to B’s utterance –2. A percieves B’s utterance.
Introduction to E-Commerce
INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING NLP-AI IIIT-Hyderabad CIIL, Mysore ICON DECEMBER, 2003.
Dialogue types GSLT course on dialogue systems spring 2002 Staffan Larsson.
What is the Internet? Internet: The Internet, in simplest terms, is the large group of millions of computers around the world that are all connected to.
LE TRINDIKIT A toolkit for building and experimenting with dialogue move engines and systems, based on the information state approach.
Ambient Computational Environments Sprint Research Symposium March 8-9, 2000 Professor Gary J. Minden The University of Kansas Electrical Engineering and.
1 SWE Introduction to Software Engineering Lecture 22 – Architectural Design (Chapter 13)
A preliminary classification of dialogue genres or Correlating properties of activities with properties of dialogue systems Staffan Larsson Dept. of linguistics.
ISTD 2003, Audio / Speech Interactive Systems Technical Design Seminar work: Audio / Speech Ville-Mikko Rautio Timo Salminen Vesa Hyvönen.
Research about dialogue and dialogue systems and the department of linguistics goal: –develop theories about human dialogue which can be used when building.
Application architectures
Information, action and negotiation in dialogue systems Staffan Larsson Kings College, Jan 2001.
TrindiKit A toolkit for building and experimenting with dialogue move engines and systems, based on the information state approach.
Object Oriented Design Goals  OOD meets input from the Web  Design workshop u Form teams u Brainstorm projects.
What is communication? What are the issues in interpersonal communication? What is the nature of communication in organizations? How can we build more.
Goteborg University Dialogue Systems Lab GoDiS and TrindiKit MITRE workshop 27/10-03 Staffan Larsson Göteborg University Sweden.
Chapter 13: Designing the User Interface
Mobile Multimodal Applications. Dr. Roman Englert, Gregor Glass March 23 rd, 2006.
Website Content, Forms and Dynamic Web Pages. Electronic Portfolios Portfolio: – A collection of work that clearly illustrates effort, progress, knowledge,
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Communication Skills - Chapter 2 Mr. Sherpinsky Business Management Class Council Rock School District.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
WELCOME TO UNIT 7 Customer Service MT 221 Marilyn Radu, Instructor.
Search Engines and Information Retrieval Chapter 1.
Chapter 12 The Impact of Globalization on Customer Service
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
1 INTRODUCTION TO DATABASE MANAGEMENT SYSTEM L E C T U R E
Speech User Interfaces Katherine Everitt CSE 490 JL Section Wednesday, Oct 27.
COMPUTER ASSISTED / AIDED LANGUAGE LEARNING (CALL) By: Sugeili Liliana Chan Santos.
Module 3: Business Information Systems Chapter 8: Electronic and Mobile Commerce.
1 PLAN RECOGNITION & USER INTERFACES Sony Jacob March 4 th, 2005.
The Information State approach to dialogue modelling Staffan Larsson Dundee, Jan 2001.
1 COMPSCI 110 Operating Systems Who - Introductions How - Policies and Administrative Details Why - Objectives and Expectations What - Our Topic: Operating.
COMPUTER-ASSISTED LANGUAGE LEARNING (CALL)
An information state approach to natural interactive dialogue Staffan Larsson, Robin Cooper Department of linguistics Göteborg University, Sweden.
Chapter 3 DECISION SUPPORT SYSTEMS CONCEPTS, METHODOLOGIES, AND TECHNOLOGIES: AN OVERVIEW Study sub-sections: , 3.12(p )
User Support Chapter 8. Overview Assumption/IDEALLY: If a system is properly design, it should be completely of ease to use, thus user will require little.
Dept. of Computer Science University of Rochester Rochester, NY By: James F. Allen, Donna K. Byron, Myroslava Dzikovska George Ferguson, Lucian Galescu,
INTRODUCTION TO USER DOCUMENTATION Function and purpose Production specifications Evaluate the effectiveness.
NLP ? Natural Language is one of fundamental aspects of human behaviors. One of the final aim of human-computer communication. Provide easy interaction.
Towards a Theoretical Framework for the Integration of Dialogue Models into Human-Agent Interaction John R. Lee Assistive Intelligence Inc. Andrew B. Williams.
ENTERFACE 08 Project 1 “MultiParty Communication with a Tour Guide ECA” Mid-term presentation August 19th, 2008.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Computer Science, Algorithms, Abstractions, & Information CSC 2001.
1 Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents S. Kawamoto, et al. October 27, 2004.
A preliminary classification of dialogue genres Staffan Larsson Internkonferens 2003.
Creating User Interfaces Ideas & Trends Homework: Post constructive comments. Work on project.
Writing to Teach - Tutorials Chapter 2. Writing to Teach - Tutorials The purpose of a tutorial is to accommodate information to the needs of the user.
Goteborg University Dialogue Systems Lab Comments on ”A Framework for Dialogue Act Specification” 4th Workshop on Multimodal Semantic Representation January.
Speech Processing 1 Introduction Waldemar Skoberla phone: fax: WWW:
Slide no 1 Cognitive Systems in FP6 scope and focus Colette Maloney DG Information Society.
Software Architecture for Multimodal Interactive Systems : Voice-enabled Graphical Notebook.
Agent-Based Dialogue Management Discourse & Dialogue CMSC November 10, 2006.
Application architectures Advisor : Dr. Moneer Al_Mekhlafi By : Ahmed AbdAllah Al_Homaidi.
6. (supplemental) User Interface Design. User Interface Design System users often judge a system by its interface rather than its functionality A poorly.
A seminar by Ramesh Kumar Raju S CSSE 07121A1547.
INTRODUCTION TO WIRELESS SENSOR NETWORKS
The Semiotic Engineering of Human-Computer Interaction Section I Foundation Chapter 1 Introduction.
OVERVIEW OF DISCOURSE ANALYSIS
Module 4: Strategy Formulation: Customer Interface
Data Warehousing and Data Mining
Chapter 11 user support.
Presentation transcript:

Goteborg University Dialogue Systems Lab Introduction to dialogue systems Staffan Larsson Dialogsystem HT04

Goteborg University Dialogue Systems Lab Overview Why Develop Speech Applications for the Telephone (Larson ch.1) Dialogue and dialogue genres Dialogue modeling and dialogue systems Research areas & local projects History of dialogue systems Methodology for dialogue systems design (Agenter, dialog och talakter) (Dialogspel)

Goteborg University Dialogue Systems Lab Chapter 1. Why Develop Speech Applications for the Telephone ©2002 Larson Technical Services

Goteborg University Dialogue Systems Lab Discussion Questions Why are speaking and listening fundamental to our lives? Why use speech to interact with a computer? When might speech not be appropriate for interacting with a computer? Why are speech applications possible today? What are the limitations of speech as a user interface for a computer? What can you do with a single call? What are some example speech applications? ©2002 Larson Technical Services

Goteborg University Dialogue Systems Lab Speaking and Listening Are Fundamental to Our Lives. People interact by speaking and listening. It’s “unnatural” when people don’t speak. We spend the first three years of our lives learning how to speak and listen. Section 1.1©2002 Larson Technical Services

Goteborg University Dialogue Systems Lab Speaking and Listening to a Computer Is Natural and Convenient. Despite physical handicaps such as blindness or poor physical dexterity To bypass the limitations of small keyboards and screens When the device has no keyboard When callers work with their hands and eyes At anytime during the day With instant connection without being placed on “hold.” When languages do not lend themselves to keyboarding To convey emotion—“earcons” To access all types of time-sensitive data To access all types of location-sensitive data To access all types of public and personal information To control computerized processes and activities Section 1.2©2002 Larson Technical Services

Goteborg University Dialogue Systems Lab When Speaking and Listening to a Computer May Be Inappropriate. Graphics Pointing Selecting Limitation of human memory Impact of noise –The computer cannot hear –The caller cannot hear Concern about privacy Section 1.3©2002 Larson Technical Services

Goteborg University Dialogue Systems Lab Speech Applications are Possible Today Speech application enablers –Increased computing power at less expense –Improved algorithms –Improved dialog design –Availability of telephones and cell phones Section 1.4©2002 Larson Technical Services

Goteborg University Dialogue Systems Lab Speech Application Challenges Limitations of speech interfaces –Speech technologies are not perfect. –Callers have false expectations. –Speech is a transient medium. Section 1.5©2002 Larson Technical Services

Goteborg University Dialogue Systems Lab Types of Speech Applications Application Classes Technologies Touch- Speech Touch- Multimodal ToneOnlyTone + OnlySpeech Speech inNoYesYesYes KeypadYesNoYesYes Mouse and keyboardNoNoNoYes Speech outYesYesYesYes Display monitorNoNoNoYes ©2002 Larson Technical Services

Goteborg University Dialogue Systems Lab What Can You Do with a Single Phone Call? Commerce –Self-service queries and transactions –Support desks, order tracking airline arrival and departure, cinema and theater booking, home banking, e-commerce Content –Public information—Community information; local, national, and international news; entertainment information –Personal information—Calendar, addresses, telephone lists, to- do lists, shopping lists, calorie and exercise logs, personal diaries Communication –Initiate telephone calls, sending and receiving and voice- mail Section 1.6©2002 Larson Technical Services

Goteborg University Dialogue Systems Lab What Can You Do with a Single Phone Call? New dial tone—“How may I help you?” Voice portals—Verbal Web sites that enable caller interaction with multiple services by speaking and listening Section 1.6©2002 Larson Technical Services

Goteborg University Dialogue Systems Lab Voice Portals Mass market voice portal – –Driving instructions –Traffic conditions –Weather –Telephone number search –Business reminders –Local information –Stock quotes –Personalized news –Entertainment information –Sports –Horoscopes Corporate external voice portal –Telephone attendant –Product and service information –Order entry –Help desk –Banking –Sales Corporate internal voice portal –Customer relationship management –Product availability and pricing –Order status –Human resource information –Supply chain management –Customer account information Section 1.7©2002 Larson Technical Services

Goteborg University Dialogue Systems Lab Key Concepts Speech enables new applications and new users. Speech enables the hands-free and eyes-free computer access. Speech enables callers to access computers from whereever they are—at work, but away from their desk; at home; or on the road. Speech has drawbacks: –No pointing –Stretches limits of human short-term memory –Privacy Callers use voice applications to access commerce, content, and communication services. ©2002 Larson Technical Services

Goteborg University Dialogue Systems Lab Dialogue and dialogue genres

Goteborg University Dialogue Systems Lab Dialogue & dialogue systems Dia logos = through language What is interesting for dialogue system applications? –Spoken natural language human-computer dialogue in specific domains But maybe also –Written? –Multimodal (incl. graphics)?

Goteborg University Dialogue Systems Lab Discourse and Dialogue Discourse (in one sense of the word) –Text; monologue Dialog –Speech; multiple participants Really two independent dimensions –Modality: text / speech ( / gesture / image ) –monologue or dialogue

Goteborg University Dialogue Systems Lab Dahlbäck (1997) taxonomy Modality: spoken/written Kinds of agents: human/computer Interaction: dialogue/monologue Context : spatial, temporal Number & type of tasks –Simultaneous? Dialogue-task distance –Similarity of dialogue structure – task structure Kinds of shared knowledge exploited –Perceptual, linguistic, cultural

Goteborg University Dialogue Systems Lab Discussion: Dahlbäck Several dimensions, some relevant but some not –We currently assume spoken human-computer dialogue –Dialogue-task distance perhaps too abstract –Context, kinds of shared knowledge used, and number of tasks relevant, but not yet included in our classification –Type of task similar to our concept of activity

Goteborg University Dialogue Systems Lab Allen et. al. (2001) technique usedexample tasktask complexity dialogue phenomena handled finite-state scriptlong-distance dialing least complexuser answers questions frame-basedgetting train timetable info user asks questions, simple clarifications by system sets of contextstravel booking agent shifts between predetermined topics plan-based models kitchen design consultant dynamically generated topic structures, collaborative negotiation subdialogues agent-based models disaster relief management most complex different modalities (e.g. planned world and actual world)

Goteborg University Dialogue Systems Lab Discussion:Allen et. al. Relates properties of system to properties of activity, BUT Based on technologies, not properties of activities –Dialogue phenomena don’t necessarily come in lumps Focus on information seeking and collaborative planning; some types of dialogue not included –Tutorial, Explanatory, Instructional…

Goteborg University Dialogue Systems Lab Desiderata for a classification of dialogue Based on multiple independent properties of (dialogue in) different activities Relating properties of activity to properties of system Covering not only information seeking and collaborative planning dialogue

Goteborg University Dialogue Systems Lab Some initial dimensions of classification Inquiry-oriented vs. Action-oriented dialogue Type of result: simple/complex Type of external process: active/passive Distribution of decision rights: shared/disjoint

Goteborg University Dialogue Systems Lab Inquiry-oriented vs. action-oriented dialogue IOD: raising and addressing issues –E.g. database search AOD: introduces (non-communicative) actions to performed (requests) –E.g. programming a Video Recorder Dialogue genre Moves/rulesInformation State components Inquiry- Oriented (IOD) ask answer Question stack Action- Oriented (AOD) request confirm Action stack

Goteborg University Dialogue Systems Lab Result type Is the primary result of the dialogue a simple or a complex information object? –Simple: proposition, action –Complex: plan, proof, explanation Complex results require update rules and information state components (e.g. a tree) enabling incremental construction Example: offline planning –U: Get me coffee –R: How do I do that? –U: First, go to the kitchen. –R: OK. And then? –U: Go to the coffee machine. –…

Goteborg University Dialogue Systems Lab Proactivity of external process Passive: database, simple device (e.g. Video Recorder) (Pro)active: device, e.g. robot, burglar alarm –May need to interrupt current dialogue, perhaps even interrupt user utterances This dimension correlates with –the way the system is connected to the device Is the device interface a resource (passive) or a module (active)? –System intitiative and turntaking mechanisms

Goteborg University Dialogue Systems Lab Distribution of decision rights Disjoint: each question directed to a specific DP ; this DP decides on the answer and does not need to negotiate Shared: some question(s) should be answered jointly; negotiation may be needed Dialogue system requirements for negotiation: –Dialogue move: propose –Information state component: a stack of pairs of issue under negotiation, and alternative solutions/answers to this issue N.B.: we here refer to collaborative negotiation (non- conflicting goals) –E.g. SunDial furniture selection task

Goteborg University Dialogue Systems Lab activityIOD/AO D result typeexternal process decision rights database searchIODsimple: price etc. complex: itinerary passive (database) disjoint ticket bookingAOD+ IOD simple: flightpassive (database) disjoint simple device control AOD+ IOD simple: actionspassive or active disjoint instructional (sys instructs usr) AOD+ IOD simple: actionspassive (manual) disjoint offline planning, incl. itinerary planning, complex device control AODcomplex: plan(s)passive (planner) shared online planning, e.g. TRIPS AOD+ IOD complex: planactive (device+ planner) shared explanationIODcomplex: proof or explanation passive (inference engine) shared tutorialIOD/ AOD complex?passive (planner) disjoint narrationIODcomplex: narrativepassivedisjoint

Goteborg University Dialogue Systems Lab Possible additional activity-related factors Distribution of information –Symmetric: DPs have same kind of information –Asymmetric: DPs have different kinds of information –Relation to distribution of decision rights? Shared or conflicting goals –Conflicting goals may lead to non-collaborative negotiation, which would require argumentation acts, including rhetorical acts Number of simultaneous tasks (one or several) –But probably very few activities with just one task …

Goteborg University Dialogue Systems Lab Comments What we really are classifying are activities –Table shows a classification of activities according to features of a dialogue system needed to particitpate in dialogues in these activities How specific should our activities, or activity types, be? –Action oriented dialogue? Device control? VCR control? Dialogue with Panasonic VCR 4500? Is ”genre” still a useful term? –Could perhaps be reserved for very basic properties, such as IOD/AOD –Or have genres like ”AOD for active devices and collaborative negotiation and asymmetric distrubution of information”

Goteborg University Dialogue Systems Lab Dialogue modelling and dialogue systems

Goteborg University Dialogue Systems Lab Dialogue modelling Theoretical motivations –find structure of dialogue –explain structure –relate dialogue structure to informational and intentional structure Practical motivations –build dialogue systems to enable natural human- computer interaction –speech-to-speech translation –...

Goteborg University Dialogue Systems Lab Informal approaches to dialogue modelling speech act theory (Austin, Searle,...) –utterances are actions –illocutionary acts: ask, assert, instruct etc. discourse analysis (Schegloff, Sacks,...) –turn-taking, pre-sequences etc. dialogue games (Sinclair & Coulthard,...) –structure of dialogue segments (rather than separate utterances) –can e.g. be encoded as regular expressions or finite automata qna-game -> question qna-game* answer

Goteborg University Dialogue Systems Lab Dialogue management frameworks (computational approaches) Industry systems –finite state automata –form-based (VoiceXML) Research systems: –plan-based speech acts as plan operators –general reasoning (Sadek,...) often combined with plan-based Information state approach –generalises over all the above

Goteborg University Dialogue Systems Lab Why build dialogue systems? theoretical: test theories –e.g. what kind of information does the system need to keep track of? –problems complex system with many components how to evaluate (Turing test not so useful) practical: natural language interfaces –databases (train timetables etc) –electronic devices (mobile phones,...) –instructional/helpdesk systems –booking flights etc –tutorial systems

Goteborg University Dialogue Systems Lab What does a system need to be able to do? speech recognition parsing, syntactic and semantic interpretation –resolve ambiguities –anaphora and ellipsis resolution, etc... dialogue management –how does an utterance change the state of the dialogue? –given the current state of the dialogue, what should the system do? natural language generation speech synthesis

Goteborg University Dialogue Systems Lab Why spoken dialogue? Spoken dialogue is the natural way for people to communicate –computers should adapt to humans rather than the other way around important to enable system and user to communicate in a natural (human-like) way –mixed initiative –turntaking, feedback, barge-in –handle embedded subdialogues –...

Goteborg University Dialogue Systems Lab What’s happening with dialogue systems Simple systems are being used commercially –Command systems (user command + system response) –Form-filling (system questions + user responses; system delivers info) Limited domains –need to encode domain-specific knowledge; –a general system would require general world knowledge, which may not be feasible –speech recognition is harder with large lexicon Need to bridge gap between dialogue theory and working systems

Goteborg University Dialogue Systems Lab framework dataflow, datastructures, etc. Framework level Framework –Takes care of low-level programming: dataflow, datastructures etc. Examples –Current, TrindiKit, OAA, Communicator, SOAR

Goteborg University Dialogue Systems Lab framework basic dialogue theory basic system dataflow, datastructures, etc. Basic dialogue system Formulate an application-independent dialogue theory to instantiate the framework Examples –GoDiS, VoiceXML, TRIPS,...

Goteborg University Dialogue Systems Lab basic dialogue theory basic system genre-specific theory additions genre-specific system Genre- and modality-specific system Add genre- and modality-dependent components framework dataflow, datastructures, etc.

Goteborg University Dialogue Systems Lab basic dialogue theory domain & language resources basic system application genre-specific theory additions genre-specific system Application Add application-specific resources framework dataflow, datastructures, etc.