Dialogue State Tracking & Dialogue Corpus Survey


Suyi Li, Tao Yu, Dragomir Radev
LILY Lab, Yale University

Introduction

A dialogue system is an automatic human-computer conversation system that converses with a human in a coherent structure. With advances in big data and deep learning, building such a system has become a realistic goal. One of the challenging tasks in this area is dialogue state tracking, which aims to identify the requests and goals expressed in user utterances. Encoder-decoder architectures and attention mechanisms have been applied to this problem and have proven effective. In the future, we will build our new dialogue system task on top of dialogue state tracking.

My work includes:
1. Surveying current dialogue system papers.
2. Analyzing the corpora currently used to build dialogue systems.
3. Collaborating with Tao to define and refine the task settings for the new dialogue task.

Future work includes:
1. Building a large-scale, cross-domain corpus for the new dialogue system task.
2. Developing baseline models on the corpus to demonstrate its usability.

Dialogue Corpus

One of the big challenges in developing a large-scale, cross-domain dialogue system is the lack of large corpora with high-quality labels. To define the settings of the upcoming dialogue task in detail, I surveyed existing dialogue corpora, including MultiWOZ 2.0 and Frames, focusing mainly on their data collection pipelines. MultiWOZ was developed entirely through crowd-sourcing, without hiring professional annotators. The data collection process is human-to-human and uses Amazon Mechanical Turk. In these conversations, the system acts as an assistant that answers users' questions and helps with tasks such as restaurant reservation, information queries, and taxi booking. Another dataset is Frames, which consists of 1369 human-human dialogues with an average of 15 turns per dialogue. Its authors formalized the frame tracking task, which extends state tracking to a setting where several semantic frames are tracked simultaneously throughout the dialogue. All values discussed in the conversation are recorded in the frames, so users can return to earlier constraints at any point. DSTC2 and WOZ2.0 are similar datasets. More details can be found in Table 1.

Table 1. Comparison of existing dialogue corpora.

Metric                     MultiWOZ   WOZ2.0   Frames   DSTC2
#Dialogues                 8438       600      1369     1612
Avg. turns per dialogue    13.68      7.45     14.60    14.49
Avg. tokens per turn       13.18      11.24    12.60    8.54
Total unique tokens        24071      2142     12043    986
#Slots                     25         4        61       8
#Values                    4510       99       3871     212

Dialogue System Research

In this project, I surveyed and replicated several papers in dialogue system research. These papers are related to dialogue state tracking and to our future dialogue system work. One influential work is the Global-Locally Self-Attentive Dialogue State Tracker (GLAD) [1]. The model focuses on slot-value pairs that are rare in the training data and is able to track them. It employs two modules: the global module shares parameters across slots, while the local module learns slot-specific features. Experimental results in [1] show that GLAD outperforms prior state-of-the-art models.

Figure 2. Overview of the GLAD architecture [1].

Another important work is that of Suhr et al. [2], which focuses on mapping natural language sentences to SQL queries, a task related to both semantic parsing and dialogue systems. The model adopts an encoder-decoder architecture with several additional mechanisms to improve performance. For example, a turn-level encoder is added to capture the context of the conversation. Experimental results show that the mechanisms for handling context lead to higher denotation accuracy.

Figure 3. Illustration of the model architecture during the third decoding step while processing the instruction "which ones arrive at 7pm" from the interaction [2].

Table 2. Replication results of the GLAD model.

Conclusion and Future Work

This semester, I focused mainly on the dialogue state tracking challenge and on how to develop a dialogue corpus for a new dialogue system task. Encoder-decoder architectures with attention mechanisms have proven effective in dialogue state tracking research, and I surveyed and replicated several papers in this area. However, one of the biggest challenges for dialogue system tasks is the lack of high-quality cross-domain corpora. In the coming months, I will work with Tao to develop our corpus and build models that demonstrate its usability and set a baseline for future studies.

Reference

[1] Victor Zhong, Caiming Xiong, and Richard Socher. "Global-Locally Self-Attentive Dialogue State Tracker". In: Association for Computational Linguistics (ACL). 2018.
[2] Alane Suhr, Srinivasan Iyer, and Yoav Artzi. "Learning to Map Context-Dependent Sentences to Executable Formal Queries". In: North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT). 2018.

Acknowledgement

I would like to thank Prof. Dragomir Radev for supervising my project. I am also grateful to Tao Yu for his guidance and help throughout the project.

Figure 4. Example of the dialogue state tracking task.
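
To make the dialogue state tracking task more concrete, the following is a minimal Python sketch of how a dialogue state can be accumulated turn by turn. It is not the GLAD model and not code from the poster. It assumes the turn-level labels (which slot-value pairs the user states and which slots the user requests) are already given, whereas a real tracker has to predict them from the utterances. The names `TurnLabel`, `DialogueState`, and `track_state`, and the restaurant slots and values, are hypothetical and used only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set, Tuple


@dataclass
class TurnLabel:
    """Hypothetical turn-level annotation for one user utterance."""
    # Constraints the user states in this turn, as (slot, value) pairs.
    inform: List[Tuple[str, str]] = field(default_factory=list)
    # Slots whose values the user asks for in this turn.
    request: Set[str] = field(default_factory=set)


@dataclass
class DialogueState:
    """State after a turn: accumulated goals plus the current requests."""
    goals: Dict[str, str]
    requests: Set[str]


def track_state(turns: List[TurnLabel]) -> List[DialogueState]:
    """Accumulate goal constraints across turns; requests apply per turn."""
    goals: Dict[str, str] = {}
    states: List[DialogueState] = []
    for turn in turns:
        # A newer value for a slot overrides the earlier one.
        for slot, value in turn.inform:
            goals[slot] = value
        # Copy the containers so earlier states are not mutated later.
        states.append(DialogueState(goals=dict(goals), requests=set(turn.request)))
    return states


# Illustrative three-turn dialogue (assumed slots and values).
example_turns = [
    TurnLabel(inform=[("food", "italian")]),
    TurnLabel(inform=[("area", "north"), ("food", "chinese")]),
    TurnLabel(request={"phone"}),
]
states = track_state(example_turns)
for turn_index, state in enumerate(states, start=1):
    print(turn_index, state.goals, sorted(state.requests))
```

In this sketch the goal constraints persist across turns while requests are reset every turn, which matches the usual distinction between joint goals and turn requests. A learned tracker would replace the given `TurnLabel` inputs with predictions made from the raw utterances.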