AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.

Slides:



Advertisements
Similar presentations
UCLA : GSE&IS : Department of Information StudiesJF : 276lec1.ppt : 5/2/2015 : 1 I N F S I N F O R M A T I O N R E T R I E V A L S Y S T E M S Week.
Advertisements

For Friday No reading Homework –Chapter 23, exercises 1, 13, 14, 19 –Not as bad as it sounds –Do them IN ORDER – do not read ahead here.
Semantic Role Chunking Combining Complementary Syntactic Views Sameer Pradhan, Kadri Hacioglu, Wayne Ward, James H. Martin, Daniel Jurafsky  Center for.
Information Retrieval in Practice
Sunita Sarawagi.  Enables richer forms of queries  Facilitates source integration and queries spanning sources “Information Extraction refers to the.
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
Search Engines and Information Retrieval
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
The Informative Role of WordNet in Open-Domain Question Answering Marius Paşca and Sanda M. Harabagiu (NAACL 2001) Presented by Shauna Eggers CS 620 February.
Basi di dati distribuite Prof. M.T. PAZIENZA a.a
Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang National Central University
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Overview of Search Engines
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Information Retrieval in Practice
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
JAVELIN Project Briefing 1 AQUAINT Year I Mid-Year Review Language Technologies Institute Carnegie Mellon University Status Update for Mid-Year Program.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Search Engines and Information Retrieval Chapter 1.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
CIG Conference Norwich September 2006 AUTINDEX 1 AUTINDEX: Automatic Indexing and Classification of Texts Catherine Pease & Paul Schmidt IAI, Saarbrücken.
Interactive Dialogue Systems Professor Diane Litman Computer Science Department & Learning Research and Development Center University of Pittsburgh Pittsburgh,
©2008 Srikanth Kallurkar, Quantum Leap Innovations, Inc. All rights reserved. Apollo – Automated Content Management System Srikanth Kallurkar Quantum Leap.
2007. Software Engineering Laboratory, School of Computer Science S E Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying.
Experiments on Building Language Resources for Multi-Modal Dialogue Systems Goals identification of a methodology for adapting linguistic resources for.
Automatic Detection of Tags for Political Blogs Khairun-nisa Hassanali and Vasileios Hatzivassiloglou Human Language Technology Research Institute The.
1 Text Summarization: News and Beyond Kathleen McKeown Department of Computer Science Columbia University.
Automatic Detection of Tags for Political Blogs Khairun-nisa Hassanali Vasileios Hatzivassiloglou The University.
Carnegie Mellon School of Computer Science Copyright © 2001, Carnegie Mellon. All Rights Reserved. JAVELIN Project Briefing 1 AQUAINT Phase I Kickoff December.
Pete Bohman Adam Kunk.  ChronoSearch: A System for Extracting a Chronological Timeline ChronoChrono.
21/11/2002 The Integration of Lexical Knowledge and External Resources for QA Hui YANG, Tat-Seng Chua Pris, School of Computing.
Edinburg March 2001CROSSMARC Kick-off meetingICDC ICDC background and know-how and expectations from CROSSMARC CROSSMARC Project IST Kick-off.
AQUAINT Workshop – June 2003 Improved Semantic Role Parsing Kadri Hacioglu, Sameer Pradhan, Valerie Krugler, Steven Bethard, Ashley Thornton, Wayne Ward,
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
AQUAINT Kickoff Meeting Advanced Techniques for Answer Extraction and Formulation Language Computer Corporation Dallas, Texas.
Summarization Focusing on Polarity or Opinion Fragments in Blogs Yohei Seki Toyohashi University of Technology Visiting Scholar at Columbia University.
AQUAINT Phase II Six Month Workshop – October 2004 Fusing Rich Information Extracted from Multiple Media and Languages to Generate Contextualized, Complex.
Introduction to Dialogue Systems. User Input System Output ?
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Friday Finish chapter 24 No written homework.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
Summarizing Encyclopedic Term Descriptions on the Web from Coling 2004 Atsushi Fujii and Tetsuya Ishikawa Graduate School of Library, Information and Media.
AQUAINT June 2002 Workshop June 2002 Just-in-Time Interactive Question Answering Sanda Harabagiu: PI Language Computer Corporation.
MedKAT Medical Knowledge Analysis Tool December 2009.
Faculty Faculty Richard Fikes Edward Feigenbaum (Director) (Emeritus) (Director) (Emeritus) Knowledge Systems Laboratory Stanford University “In the knowledge.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
Answer Mining by Combining Extraction Techniques with Abductive Reasoning Sanda Harabagiu, Dan Moldovan, Christine Clark, Mitchell Bowden, Jown Williams.
Acquisition of Categorized Named Entities for Web Search Marius Pasca Google Inc. from Conference on Information and Knowledge Management (CIKM) ’04.
1 An Introduction to Computational Linguistics Mohammad Bahrani.
Relevance Models and Answer Granularity for Question Answering W. Bruce Croft and James Allan CIIR University of Massachusetts, Amherst.
Event-Based Extractive Summarization E. Filatova and V. Hatzivassiloglou Department of Computer Science Columbia University (ACL 2004)
AQUAINT R&D Program Phase I Kickoff Workshop WELCOME.
1 Question Answering and Logistics. 2 Class Logistics  Comments on proposals will be returned next week and may be available as early as Monday  Look.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Annotating and measuring Temporal relations in texts Philippe Muller and Xavier Tannier IRIT,Université Paul Sabatier COLING 2004.
NTNU Speech Lab 1 Topic Themes for Multi-Document Summarization Sanda Harabagiu and Finley Lacatusu Language Computer Corporation Presented by Yi-Ting.
AQUAINT Mid-Year PI Meeting – June 2002 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
AQUAINT Mid-Year Workshop: Observations and Comments Jimmy Lin MIT Artificial Intelligence Laboratory.
A Database of Narrative Schemas A 2010 paper by Nathaniel Chambers and Dan Jurafsky Presentation by Julia Kelly.
Evaluation Issues: June 2002 Donna Harman Ellen Voorhees.
Trends in NL Analysis Jim Critz University of New York in Prague EurOpen.CZ 12 December 2008.
Information Retrieval in Practice
Search Engine Architecture
Ling573 NLP Systems and Applications May 30, 2013
What is IR? In the 70’s and 80’s, much of the research focused on document retrieval In 90’s TREC reinforced the view that IR = document retrieval Document.
Presentation transcript:

AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering Vasileios Hatzivassiloglou, Kathleen R. McKeown Columbia University Dan Jurafsky, Wayne H. Ward, James H. Martin University of Colorado

AQUAINT Kickoff Meeting – December 2001 Current State of the Art TREC Q&A track Technology enabling focused retrieval Good coverage (up to 70%) of facts (who, when, where) Issues: –System must understand the question type –Answers extracted verbatim from the source –One source at a time

AQUAINT Kickoff Meeting – December 2001 Our focus – Question type Facts, but not absolute facts Rather, questions with answers that depend on –source –perspective –time When Mullah Omar was born vs. Who controls Jalalabad?

AQUAINT Kickoff Meeting – December 2001 Our focus – Multiple sources Integrate answers from multiple sources Use similarities across sources to locate core part of the answer Highlight important differences between sources

AQUAINT Kickoff Meeting – December 2001 Our focus – Answer form Answer contains –Core part where sources agree –Differences in perspective –Trends in time Text is not copied verbatim Text generation allows for concise combination of materials from multiple sources

AQUAINT Kickoff Meeting – December 2001 Our focus – Q&A Environment Spoken and written questions Specialized language model for accepting questions in realistic, noisy environments Context management system allows for –clarifications –follow-up questions

AQUAINT Kickoff Meeting – December 2001 Technology innovations Specialized speech recognition and dialog management Semantic parsing of questions and source text Event recognition Information fusion

AQUAINT Kickoff Meeting – December 2001 Architecture

AQUAINT Kickoff Meeting – December 2001 Semantic Parsing Use of semantic information important in many of the current best Q&A systems (Srihari et al 2000; Hovy et al 2001; Harabagiu et al 2001) Semantics often derived from lexical relationships and taxonomies, e.g., WordNet We will use instead an explicit representation of semantic roles, FrameNet Shallow, efficiently computable semantic representation

AQUAINT Kickoff Meeting – December 2001 FrameNet FrameNet provides database of words with associated semantic roles

AQUAINT Kickoff Meeting – December 2001 Filling in semantic roles Initial statistical syntactic parse identifies dependencies Statistical classifier assigns roles to phrases Issues: –Prominent role for named entities –Generalization across domains –FrameNet coverage and fallback techniques

AQUAINT Kickoff Meeting – December 2001 Event recognition Events vs. topics Events as a basis for segmenting documents and classifying document fragments as matching a question Event algebra will allow –grouping sub-events –linking related events –detecting updates

AQUAINT Kickoff Meeting – December 2001 Detecting an event Hypothesis: Events can be detected on –participants (named entities, semantic roles) –time –location –limited constraints on verbs

AQUAINT Kickoff Meeting – December 2001 Information fusion Combining answer fragments in a concise response Summarization for question answering –Clustering of related fragments at the phrase level –Identification of syntactic dependencies and semantic roles –Fusion of matching entities, including paraphrases –Regeneration of the answer

AQUAINT Kickoff Meeting – December 2001 Technologies for information fusion and generation Learning paraphrases, syntactic and lexical (countless – lots of, repulsion – aversion) Detecting important differences Merging descriptions Learning and formulating content plans

AQUAINT Kickoff Meeting – December 2001 Speech and Dialog for Q&A Adapt speech recognition technology for –unknown words (using robust semantic classes) –named entities –noisy environment Dialog system for clarification and follow-up –use semantic annotation to generalize across domains –maintain focus on important semantic elements

AQUAINT Kickoff Meeting – December 2001 Data Input will come from TREC and TDT-3 –extended time span –multiple sources reporting on same event Annotation –TREC: focus on absolute facts –local: focus on questions with multiple answers –AQUAINT-level?

AQUAINT Kickoff Meeting – December 2001 Evaluation Component evaluation (several levels) –Identifying granularity of evaluation units –IR measures such as precision and recall –Human judgments for equivalence of output End-to-end evaluation –Participation in TREC –Extension of TREC methodology to questions with multiple answers

AQUAINT Kickoff Meeting – December 2001 Tools we have (and can share) C OMMUNICATOR speech recognition system (see our demo) FrameNet semantic resources and statistical semantic parser Document- and sentence-level clustering tools Summarization software (for the last two, see our NEWSBLASTER demo)

AQUAINT Kickoff Meeting – December 2001 Tools we are looking for Robust named entity recognizers Tools for classifying questions and identifying important elements in questions Tools for aiding human annotators

AQUAINT Kickoff Meeting – December 2001 Goals for the first six months Initial FrameNet parser (limited coverage) Identification of participants, time, location Identifying paraphrases from comparable news reports on the same event Adapting information fusion from summarization to question-answering Building prototype Q&A system