AQUAINT 18-Month Workshop 1 Light Semantic Processing for QA Language Technologies Institute, Carnegie Mellon B. Van Durme, Y. Huang, A. Kupsc and E. Nyberg.

Slides:

Advertisements

Similar presentations

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.

Advertisements

CILC2011 A framework for structured knowledge extraction and representation from natural language via deep sentence analysis Stefania Costantini Niva Florio.

QA-LaSIE Components The question document and each candidate answer document pass through all nine components of the QA-LaSIE system in the order shown.

Learning Semantic Information Extraction Rules from News The Dutch-Belgian Database Day 2013 (DBDBD 2013) Frederik Hogenboom Erasmus.

SEARCHING QUESTION AND ANSWER ARCHIVES Dr. Jiwoon Jeon Presented by CHARANYA VENKATESH KUMAR.

A Linguistic Approach for Semantic Web Service Discovery International Symposium on Management Intelligent Systems 2012 (IS-MiS 2012) July 13, 2012 Jordy.

CSE 425: Logic Programming I Logic and Programs Most programs use Boolean expressions over data Logic statements can express program semantics –I.e., axiomatic.

Robust Textual Inference via Graph Matching Aria Haghighi Andrew Ng Christopher Manning.

 Christel Kemke 2007/08 COMP 4060 Natural Language Processing Semantics.

Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.

NLP and Speech Course Review. Morphological Analyzer Lexicon Part-of-Speech (POS) Tagging Grammar Rules Parser thethe – determiner Det NP → Det.

A Probabilistic Framework for Information Integration and Retrieval on the Semantic Web by Livia Predoiu, Heiner Stuckenschmidt Institute of Computer Science,

Ang Sun Ralph Grishman Wei Xu Bonan Min November 15, 2011 TAC 2011 Workshop Gaithersburg, Maryland USA.

Victoria Uren, Simon Buckingham Shum, Gangmin Li, John Domingue, Enrico Motta Knowledge Media Institute The Open University 21 May 2003 Scholarly Publishing.

Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.

By : Vanessa López, Enrico Motta Knowledge Media Institute. Open University Ontology-driven question answering in: AQUALog 9 th International Conference.

Fuzzy Medical Image Segmentation

Ontology-Based Free-Form Query Processing for the Semantic Web Mark Vickers Brigham Young University MS Thesis Defense Supported by:

Translation Divergence LING 580MT Fei Xia 1/10/06.

Knowledge-Based NLP and the Semantic Web Sergei Nirenburg Institute for Language and Information Technologies University of Maryland Baltimore County Workshop.

Latent Semantic Analysis (LSA). Introduction to LSA Learning Model Uses Singular Value Decomposition (SVD) to simulate human learning of word and passage.

Employing Two Question Answering Systems in TREC 2005 Harabagiu, Moldovan, et al 2005 Language Computer Corporation.

Ontology Learning and Population from Text: Algorithms, Evaluation and Applications Chapters Presented by Sole.

OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR

AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.

JAVELIN Project Briefing 1 AQUAINT Year I Mid-Year Review Language Technologies Institute Carnegie Mellon University Status Update for Mid-Year Program.

Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.

Tree Kernels for Parsing: (Collins & Duffy, 2001) Advanced Statistical Methods in NLP Ling 572 February 28, 2012.

BACKGROUND KNOWLEDGE IN ONTOLOGY MATCHING Pavel Shvaiko joint work with Fausto Giunchiglia and Mikalai Yatskevich INFINT 2007 Bertinoro Workshop on Information.

Survey of Semantic Annotation Platforms

Machine Learning Approach for Ontology Mapping using Multiple Concept Similarity Measures IEEE/ACIS International Conference on Computer and Information.

Scott Duvall, Brett South, Stéphane Meystre A Hands-on Introduction to Natural Language Processing in Healthcare Annotation as a Central Task for Development.

BLAST: A Case Study Lecture 25. BLAST: Introduction The Basic Local Alignment Search Tool, BLAST, is a fast approach to finding similar strings of characters.

1 Evaluating top-k Queries over Web-Accessible Databases Paper By: Amelie Marian, Nicolas Bruno, Luis Gravano Presented By Bhushan Chaudhari University.

A Probabilistic Graphical Model for Joint Answer Ranking in Question Answering Jeongwoo Ko, Luo Si, Eric Nyberg (SIGIR ’ 07) Speaker: Cho, Chin Wei Advisor:

Abstract Question answering is an important task of natural language processing. Unification-based grammars have emerged as formalisms for reasoning about.

Carnegie Mellon School of Computer Science Copyright © 2001, Carnegie Mellon. All Rights Reserved. JAVELIN Project Briefing 1 AQUAINT Phase I Kickoff December.

AQUAINT BBN’s AQUA Project Ana Licuanan, Jonathan May, Scott Miller, Ralph Weischedel, Jinxi Xu 3 December 2002.

A Language Independent Method for Question Classification COLING 2004.

80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.

EasyQuerier: A Keyword Interface in Web Database Integration System Xian Li 1, Weiyi Meng 2, Xiaofeng Meng 1 1 WAMDM Lab, RUC & 2 SUNY Binghamton.

LOD for the Rest of Us Tim Finin, Anupam Joshi, Varish Mulwad and Lushan Han University of Maryland, Baltimore County 15 March 2012

A Classification of Schema-based Matching Approaches Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan.

JAVELIN Project Briefing AQUAINT Program 1 AQUAINT 6-month Meeting 10/08/04 JAVELIN II: Scenarios and Variable Precision Reasoning for Advanced QA from.

Module networks Sushmita Roy BMI/CS 576 Nov 18 th & 20th, 2014.

GREGORY SILVER KUSHEL RIA BELLPADY JOHN MILLER KRYS KOCHUT WILLIAM YORK Supporting Interoperability Using the Discrete-event Modeling Ontology (DeMO)

AQUAINT Kickoff Meeting Advanced Techniques for Answer Extraction and Formulation Language Computer Corporation Dallas, Texas.

1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )

Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar

Lexico-semantic Patterns for Information Extraction from Text The International Conference on Operations Research 2013 (OR 2013) Frederik Hogenboom

Towards Linguistically Grounded Ontologies Paul Buitelaar, Philipp Cimiano, Peter Haase, and Michael Sintek Proceedings of the 6 th European Semantic Web.

Automatic Question Answering  Introduction  Factoid Based Question Answering.

HITIQA: Scenario Based Question Answering Tomek Strzalkowski, et al The State University of New York at Albany Paul Kantor, et al Rutgers University Boris.

AQUAINT IBM PIQUANT ARDACYCORP Subcontractor: IBM Question Answering Update piQuAnt ARDA/AQUAINT December 2002 Workshop This work was supported in part.

Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.

Service discovery with semantic alignment Alberto Fernández AT COST WG1 meeting, Cyprus, Dec, 2009.

THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.

Commonsense Reasoning in and over Natural Language Hugo Liu, Push Singh Media Laboratory of MIT The 8 th International Conference on Knowledge- Based Intelligent.

Answer Mining by Combining Extraction Techniques with Abductive Reasoning Sanda Harabagiu, Dan Moldovan, Christine Clark, Mitchell Bowden, Jown Williams.

Keyword Translation Accuracy and Cross-Lingual Question Answering in Chinese and Japanese Teruko Mitamura Mengqiu Wang Hideki Shima Frank Lin In CMU EACL.

1 Question Answering and Logistics. 2 Class Logistics  Comments on proposals will be returned next week and may be available as early as Monday  Look.

Text Summarization using Lexical Chains. Summarization using Lexical Chains Summarization? What is Summarization? Advantages… Challenges…

AQUAINT Mid-Year PI Meeting – June 2002 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.

GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011

AQUAINT Mid-Year Workshop: Observations and Comments Jimmy Lin MIT Artificial Intelligence Laboratory.

Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin

Ontology Evolution: A Methodological Overview

Web IR: Recent Trends; Future of Web Search

Traditional Question Answering System: an Overview

Donna M. Gates Carnegie Mellon University

Presentation transcript:

AQUAINT 18-Month Workshop 1 Light Semantic Processing for QA Language Technologies Institute, Carnegie Mellon B. Van Durme, Y. Huang, A. Kupsc and E. Nyberg Towards Light Semantic Processing for Question Answering

AQUAINT 18-Month Workshop 2 Light Semantic Processing for QA Overview of This Talk Motivation Components of the Approach –Logical Form –Similarity Measure –Unification Strategy Incorporation into JAVELIN Future Work / Next Steps

AQUAINT 18-Month Workshop 3 Light Semantic Processing for QA Example of Extraction Error Question: “When was Wendy’s founded?” Passage candidate: –“The renowned Murano glassmaking industry, on an island in the Venetian lagoon, has gone through several reincarnations since it was founded in Three exhibitions of 20th-century Murano glass are coming up in New York. By Wendy Moonan.” Statistical extractor: 20th-century

AQUAINT 18-Month Workshop 4 Light Semantic Processing for QA Basic Idea Q: “xxx xxxx xxxx xxxx xxxxxxxxxx xx xxxxx?”P: “xxx xxxx xxxx xxxx xxxxx xx xxxxx.” A(?,C)A(B,C)A(B,C) ? = B extract Unification on simple predicates representing basic argument structure will provide a more accurate way to match questions with appropriate answer(s) Two Challenges: * Where do predicates come from? * Flexibility in interpretation… partial interpretation

AQUAINT 18-Month Workshop 5 Light Semantic Processing for QA Associating Tokens with Concepts Imprecise Reference, e.g.: “John W. was greeted by William Clinton” “Bill greeted Mr. Wright” Definite Description, e.g. “Mr. Bush” vs. “the president” Anaphoric Reference UNIFY( {GREET(“William Clinton”,”John W.”)}, {GREET(“Bill”,”Mr. Wright”)} ) Interpretation of tokens must be: Approximate, not exact Context-sensitive

AQUAINT 18-Month Workshop 6 Light Semantic Processing for QA Language Processing Tools BBN IdentiFinder (BBN, 2000) Link Grammar parser (Grinberg et al., 1995) KANTOO parser (Nyberg & Mitamura, 2000) Brill part-of-speech tagger (Brill, 1995) WordNet (Fellbaum, 1998) Lexical Conceptual Structure (LCS) Database (Dorr 2001)

AQUAINT 18-Month Workshop 7 Light Semantic Processing for QA Representation Formula: a set of literals Literal: a predicate, plus two terms Extrinsic literal: a relation mapping a label to a label –SUBJECT(x1,x2) Intrinsic literal: a relation mapping a label to a value –ROOT(x1,|Benjamin|) Value: EVENT, past, +, |Mary Smith|,…

AQUAINT 18-Month Workshop 8 Light Semantic Processing for QA Example Q = Who killed Jefferson? ROOT(x1,?a0),ROOT(x2,|kill|),ROOT(x3,|Jefferson|), TYPE(x2,|event|),TYPE(x1,|person|),TYPE(x3,|person|), SUBJECT(x2,x1),OBJECT(x2,x3),ANS(?a0) P = Benjamin murdered Jefferson. ROOT(y1,|Benjamin|),ROOT(y2,|murder|),ROOT(y3,|Jef ferson|), TYPE(y2,|event|),TYPE(y1,|person|),TYPE(y3,|person|), SUBJECT(y2,y1),OBJECT(y2,y3)

AQUAINT 18-Month Workshop 9 Light Semantic Processing for QA Graphically ?a0 x1 x2 kill x3 Jefferson person Benjamin y1 y2 murder y3 Jefferson person event person event SUBJECT OBJECT ROOT TYPE

AQUAINT 18-Month Workshop 10 Light Semantic Processing for QA Similarity Functions A zero-to-one function that returns a value representing similarity between the formulae for question, passage Unification requires similarity measurement between literal values sim(“Who killed Jefferson?”, ”Benjamin murdered Jefferson.”) = 0.9

AQUAINT 18-Month Workshop 11 Light Semantic Processing for QA sim(formula0,formula1) Given two formulae, we define the similarity to be the geometric mean of the similarity between the separate extrinsic literals.

AQUAINT 18-Month Workshop 12 Light Semantic Processing for QA sim(extrinsicLiteral0,extrinsicLiteral1) To measure the similarity between two extrinsic literals, we take the square root of the product of the similarity between each of the two pairs of labels.

AQUAINT 18-Month Workshop 13 Light Semantic Processing for QA sim(label0,label1) To measure the similarity of two labels, we find the maximum possible value of taking the geometric mean of the similarity of each pairwise combination of intrinsic literals that are shared by the two labels.

AQUAINT 18-Month Workshop 14 Light Semantic Processing for QA sim(intrinsicLiteral0,intrinsicLiteral1) The similarity between two intrinsic literals is measured by similarity of the paired words, times the weight of the first literal.

AQUAINT 18-Month Workshop 15 Light Semantic Processing for QA sim(word0,word1) sim(|kill|,|murder|) = 0.8 –via WordNet distance function sim(?a0,|Benjamin|) = 1.0 –zero cost for variable binding

AQUAINT 18-Month Workshop 16 Light Semantic Processing for QA Example

AQUAINT 18-Month Workshop 17 Light Semantic Processing for QA Answer Find the maximum possible similarity score, return the term bound to ?a0 ?a0/|Benjamin| sim(Q,P) = 0.9 Answer = Benjamin, 0.9

AQUAINT 18-Month Workshop 18 Light Semantic Processing for QA Current Status, Future Work First version implemented, testing now Short Term: Test “NLP IX” against statistical extraction module on factoid questions Longer Term: –Support simple reasoning about questions and passages –Investigate approach in narrower domains Question answering based on CNS data on terrorism and weapons of mass destruction –Extend similarity metric at word level Word co-occurrence information Distance metrics on ontologies other than WordNet –Incorporate LCS Lexicon

AQUAINT 18-Month Workshop 19 Light Semantic Processing for QA Summary We believe complex question answering requires more than statistical extraction methods Knowledge bottleneck forces compromise in depth of language processing Robust unification based on heuristic measure of similarity offers short-term solution

AQUAINT 18-Month Workshop 20 Light Semantic Processing for QA Additional Resources Paper available: B. Van Durme, Y. Huang, A. Kupsc and E. Nyberg (2003). “Towards Light Semantic Processing for Question Answering”, presented at the HLT/NAACL 2003 Workshop on Text Meaning. This and other papers at the JAVELIN web site:

AQUAINT 18-Month Workshop 21 Light Semantic Processing for QA Questions?

AQUAINT 18-Month Workshop 22 Light Semantic Processing for QA Logical Form := + := (, ) := | := |[a-nA-Z0-9\s]+| := [a-z]+[0-9]+ := [A-Z]+ Extrinsic literal: (, ) Intrinsic literal: (, )