Wrapping Up Ling575 Spoken Dialog Systems June 5, 2013.

Slides:



Advertisements
Similar presentations
1 © 2005 CHIL KTH ASIDE 2005, Aalborg, Applications of distributed dialogue systems: The KTH Connector Jens Edlund & Anna Hjalmarsson Applications.
Advertisements

Empirical and Data-Driven Models of Multimodality Advanced Methods for Multimodal Communication Computational Models of Multimodality Adequate.
Languages & The Media, 4 Nov 2004, Berlin 1 Multimodal multilingual information processing for automatic subtitle generation: Resources, Methods and System.
Forschungszentrum Telekommunikation Wien [Telecommunications Research Center Vienna] Interfaces between Speech and Non-Speech Audio Technology Michael.
Results Clear distinction between two question intonations: perception and understanding level Three distinct prototypes for different interpretations.
Course Overview Lecture 1 Spoken Language Processing Prof. Andrew Rosenberg.
Shallow Processing: Summary Shallow Processing Techniques for NLP Ling570 December 7, 2011.
Dialogue Management Ling575 Discourse and Dialogue May 18, 2011.
Combining Prosodic and Text Features for Segmentation of Mandarin Broadcast News Gina-Anne Levow University of Chicago SIGHAN July 25, 2004.
Error detection in spoken dialogue systems GSLT Dialogue Systems, 5p Gabriel Skantze TT Centrum för talteknologi.
John Hu Nov. 9, 2004 Multimodal Interfaces Oviatt, S. Multimodal interfaces Mankoff, J., Hudson, S.E., & Abowd, G.D. Interaction techniques for ambiguity.
Prosodic Cues to Discourse Segment Boundaries in Human-Computer Dialogue SIGDial 2004 Gina-Anne Levow April 30, 2004.
Spoken Dialogue Technology How can Jerry Springer contribute to Computer Science Research Projects?
ITCS 6010 Spoken Language Systems: Architecture. Elements of a Spoken Language System Endpointing Feature extraction Recognition Natural language understanding.
Detecting missrecognitions Predicting with prosody.
1 error handling – Higgins / Galatea Dialogs on Dialogs Group July 2005.
The Use of Speech in Speech-to-Speech Translation Andrew Rosenberg 8/31/06 Weekly Speech Lab Talk.
Turn-taking in Mandarin Dialogue: Interactions of Tone and Intonation Gina-Anne Levow University of Chicago October 14, 2005.
6/28/20151 Spoken Dialogue Systems: Human and Machine Julia Hirschberg CS 4706.
Towards Learning Dialogue Structures from Speech Data and Domain Knowledge: Challenges to Conceptual Clustering using Multiple and Complex Knowledge Source.
MUSCLE Multimodal e-team related activity Technical University of Crete Speech Processing and Dialog Systems Group Presenter: Prof. Alex Potamianos Technical.
1 Chapter 19: Dialogue and Conversational Agents Nadia Hamrouni and Ahmed Abbasi 12/5/2006.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
SDS Architectures Julia Hirschberg COMS 4706 (Thanks to Josh Gordon for slides.) 1.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Facial Feature Detection
9/8/20151 Natural Language Processing Lecture Notes 1.
Li Deng Microsoft Research Redmond, WA Presented at the Banff Workshop, July 2009 From Recognition To Understanding Expanding traditional scope of signal.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Interactive Dialogue Systems Professor Diane Litman Computer Science Department & Learning Research and Development Center University of Pittsburgh Pittsburgh,
Recognition of meeting actions using information obtained from different modalities Natasa Jovanovic TKI University of Twente.
Exploiting lexical information for Meeting Structuring Alfred Dielmann, Steve Renals (University of Edinburgh) {
APML, a Markup Language for Believable Behavior Generation Soft computing Laboratory Yonsei University October 25, 2004.
Area Report Machine Translation Hervé Blanchon CLIPS-IMAG A Roadmap for Computational Linguistics COLING 2002 Post-Conference Workshop.
Cognitive Systems Foresight Language and Speech. Cognitive Systems Foresight Language and Speech How does the human system organise itself, as a neuro-biological.
Machine Learning in Spoken Language Processing Lecture 21 Spoken Language Processing Prof. Andrew Rosenberg.
circle Adding Spoken Dialogue to a Text-Based Tutorial Dialogue System Diane J. Litman Learning Research and Development Center & Computer Science Department.
Machine Translation  Machine translation is of one of the earliest uses of AI  Two approaches:  Traditional approach using grammars, rewrite rules,
Introduction to CL & NLP CMSC April 1, 2003.
Turn-taking Discourse and Dialogue CS 359 November 6, 2001.
1 Natural Language Processing Lecture Notes 14 Chapter 19.
Introduction to Computational Linguistics
Automatic Cue-Based Dialogue Act Tagging Discourse & Dialogue CMSC November 3, 2006.
ICS 482: Natural language Processing Pre-introduction
Recognizing Discourse Structure: Speech Discourse & Dialogue CMSC October 11, 2006.
Cognitive Systems Foresight Language and Speech. Cognitive Systems Foresight Language and Speech How does the human system organise itself, as a neuro-biological.
1 Workshop « Multimodal Corpora » Jean-Claude MARTIN Patrizia PAGGIO Peter KÜEHNLEIN Rainer STIEFELHAGEN Fabio PIANESI.
Multimodality, universals, natural interaction… and some other stories… Kostas Karpouzis & Stefanos Kollias ICCS/NTUA HUMAINE WP4.
LREC 2004, 26 May 2004, Lisbon 1 Multimodal Multilingual Resources in the Subtitling Process S.Piperidis, I.Demiros, P.Prokopidis, P.Vanroose, A. Hoethker,
1 Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents S. Kawamoto, et al. October 27, 2004.
Challenges in Dialogue Discourse and Dialogue CMSC October 27, 2006.
1 An Introduction to Computational Linguistics Mohammad Bahrani.
User Responses to Prosodic Variation in Fragmentary Grounding Utterances in Dialog Gabriel Skantze, David House & Jens Edlund.
Adapting Dialogue Models Discourse & Dialogue CMSC November 19, 2006.
Seminar on SDS Final class 5/4/2015. Topics discussed ASR, NLU in dialogue Dialogue management NLG, information presentation Evaluation Error recovery.
Software Architecture for Multimodal Interactive Systems : Voice-enabled Graphical Notebook.
Concept-Based Analysis of Scientific Literature Chen-Tse Tsai, Gourab Kundu, Dan Roth UIUC.
A Simple English-to-Punjabi Translation System By : Shailendra Singh.
Tight Coupling between ASR and MT in Speech-to-Speech Translation Arthur Chan Prepared for Advanced Machine Translation Seminar.
COMMUNICATION OF MEANING
Landmark-Based Speech Recognition: Spectrogram Reading, Support Vector Machines, Dynamic Bayesian Networks, and Phonology Mark Hasegawa-Johnson
Linguistic knowledge for Speech recognition
NSF Grant Number: IIS PI: Joseph Picone Institution: Mississippi State University Title: Integrating Prosody, Speech Recognition, Parsing In Spoken-Language.
Investigating Pitch Accent Recognition in Non-native Speech
Issues in Spoken Dialogue Systems
Spoken Dialogue Systems
Spoken Dialogue Systems
Spoken Dialogue Systems: System Overview
Applied Linguistics Chapter Four: Corpus Linguistics
Presentation transcript:

Wrapping Up Ling575 Spoken Dialog Systems June 5, 2013

Roadmap Overview Distinctive factors in dialog: Human-human Human-computer Dialog components & dialog management Specialized topics: Detailed analysis of: Distinctive factors Techniques and applications Discussion: Trends, techniques, interrelations

Characteristics of Dialog Human-human: Multi-party interaction: Flexible turn-taking, mixed initiative Speech acts: Actions via speech, levels of interpretation Implicature: Grice’s maxims Cooperativity & closure: Grounding and levels of display Corrections, repairs, and confirmations

Characteristics of Dialog Human-computer – most deployed systems Multi-party interaction:

Characteristics of Dialog Human-computer – most deployed systems Multi-party interaction: Rigid silence-based turn-taking, system or “mixed” initiative Speech acts:

Characteristics of Dialog Human-computer – most deployed systems Multi-party interaction: Rigid silence-based turn-taking, system or “mixed” initiative Speech acts: Actions via speech: dialog acts, NLU Implicature:

Characteristics of Dialog Human-computer – most deployed systems Multi-party interaction: Rigid silence-based turn-taking, system or “mixed” initiative Speech acts: Actions via speech: dialog acts, NLU Implicature: Um… depends on dialog management, NLU Grounding:

Characteristics of Dialog Human-computer – most deployed systems Multi-party interaction: Rigid silence-based turn-taking, system or “mixed” initiative Speech acts: Actions via speech: dialog acts, NLU Implicature: Um… depends on dialog management, NLU Grounding: Confirmation: implicit/explicit: learned? Corrections, repairs: problematic Why?

Characteristics of Dialog Human-computer – most deployed systems Multi-party interaction: Rigid silence-based turn-taking, system or “mixed” initiative Speech acts: Actions via speech: dialog acts, NLU Implicature: Um… depends on dialog management, NLU Grounding: Confirmation: implicit/explicit: learned? Corrections, repairs: problematic Constrained by complexity, processing, speed, etc

Dialog System Components HMM-based ASR models NLU: call-routing, semantic grammars Dialog acts and recognition Dialog management: Finite-state Frame-based VoiceXML Information state Statistical dialog management Lots of examples!

Topics In-depth discussions: Computational approaches to make human-computer interaction more like human-human interaction Many issues raised in characterizing dialog: Multi-party

Topics In-depth discussions: Computational approaches to make human-computer interaction more like human-human interaction Many issues raised in characterizing dialog: Multi-party: multi-party interaction, turn-taking, initiative Grounding

Topics In-depth discussions: Computational approaches to make human-computer interaction more like human-human interaction Many issues raised in characterizing dialog: Multi-party: multi-party interaction, turn-taking, initiative Grounding: Miscommunication & repair, incremental processing Interpretation:

Topics In-depth discussions: Computational approaches to make human-computer interaction more like human-human interaction Many issues raised in characterizing dialog: Multi-party: multi-party interaction, turn-taking, initiative Grounding: Miscommunication & repair, incremental processing Interpretation: Reference, affect, subjectivity, personification, information structure, prosody Multi-modality Applications and issues: Tutoring, machine translation, information-seeking Non-native speech

Interconnections Sentiment Reference Persona Turn- taking Apps: MT Multi-party Prosody TutoringNon-native Multi- modality Miscommu nication Info. Struct Increment Affect Initiative

Interconnections Sentiment Reference Persona Turn- taking Apps: MT Multi-party Prosody TutoringNon-native Multi- modality Miscommu nication Info. Struct Increment Affect Initiative

Techniques & Sources of Information Range of techniques:

Techniques & Sources of Information Range of techniques: Deep processing, shallow processing, manual rules Machine learning:

Techniques & Sources of Information Range of techniques: Deep processing, shallow processing, manual rules Machine learning: Anything from decision trees to POMDPs Information sources:

Techniques & Sources of Information Range of techniques: Deep processing, shallow processing, manual rules Machine learning: Anything from decision trees to POMDPs Information sources: Acoustic, lexical, prosodic, timing, syntactic, semantic, pragmatic, etc Multimodal: gaze, gesture, etc Integration

Techniques & Sources of Information Range of techniques: Deep processing, shallow processing, manual rules Machine learning: Anything from decision trees to POMDPs Information sources: Acoustic, lexical, prosodic, timing, syntactic, semantic, pragmatic, etc Multimodal: gaze, gesture, etc Integration: Complex and varied Huge feature vectors, tandem models, blackboards, learned Substantial strides, but huge remaining challenges

Questions? Favorite topic? Most surprising result? Most obvious result? Most surprising gap?