Robust Textual Inference via Graph Matching Aria Haghighi Andrew Ng Christopher Manning.

Slides:



Advertisements
Similar presentations
CILC2011 A framework for structured knowledge extraction and representation from natural language via deep sentence analysis Stefania Costantini Niva Florio.
Advertisements

COGEX at the Second RTE Marta Tatu, Brandon Iles, John Slavick, Adrian Novischi, Dan Moldovan Language Computer Corporation April 10 th, 2006.
COGEX at the Second RTE Marta Tatu, Brandon Iles, John Slavick, Adrian Novischi, Dan Moldovan Language Computer Corporation April 10 th, 2006.
Recognizing Textual Entailment Challenge PASCAL Suleiman BaniHani.
Summarization Evaluation Using Transformed Basic Elements Stephen Tratz and Eduard Hovy Information Sciences Institute University of Southern California.
Max-Margin Matching for Semantic Role Labeling David Vickrey James Connor Daphne Koller Stanford University.
Explanation Producing Combination of NLP and Logical Reasoning through Translation of Text to KR Formalisms CHITTA BARAL ARIZONA STATE UNIVERSITY 1 School.
 Graph Graph  Types of Graphs Types of Graphs  Data Structures to Store Graphs Data Structures to Store Graphs  Graph Definitions Graph Definitions.
A Joint Model For Semantic Role Labeling Aria Haghighi, Kristina Toutanova, Christopher D. Manning Computer Science Department Stanford University.
Learning with Probabilistic Features for Improved Pipeline Models Razvan C. Bunescu Electrical Engineering and Computer Science Ohio University Athens,
Object Detection by Matching Longin Jan Latecki. Contour-based object detection Database shapes: …..
Normalized alignment of dependency trees for detecting textual entailment Erwin Marsi & Emiel Krahmer Tilburg University Wauter Bosma & Mariët Theune University.
PCFG Parsing, Evaluation, & Improvements Ling 571 Deep Processing Techniques for NLP January 24, 2011.
Stanford Rajat Raina, Aria Haghighi, Christopher Cox, Jenny Finkel, Jeff Michels, Kristina Toutanova, Bill MacCartney, Marie-Catherine de Marneffe,
Reranking Parse Trees with a SRL system Charles Sutton and Andrew McCallum University of Massachusetts June 30, 2005.
1 Quasi-Synchronous Grammars  Based on key observations in MT: translated sentences often have some isomorphic syntactic structure, but not usually in.
Discriminative Learning of Extraction Sets for Machine Translation John DeNero and Dan Klein UC Berkeley TexPoint fonts used in EMF. Read the TexPoint.
Using Information Extraction for Question Answering Done by Rani Qumsiyeh.
Automatic Classification of Semantic Relations between Facts and Opinions Koji Murakami, Eric Nichols, Junta Mizuno, Yotaro Watanabe, Hayato Goto, Megumi.
What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.
Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang National Central University
Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley.
A Confidence Model for Syntactically-Motivated Entailment Proofs Asher Stern & Ido Dagan ISCOL June 2011, Israel 1.
Discrete Mathematics Lecture 9 Alexander Bukharovich New York University.
An Information Theoretic Approach to Bilingual Word Clustering Manaal Faruqui & Chris Dyer Language Technologies Institute SCS, CMU.
Part I: Introductory Materials Introduction to Graph Theory Dr. Nagiza F. Samatova Department of Computer Science North Carolina State University and Computer.
Page 1 Relation Alignment for Textual Entailment Recognition Department of Computer Science University of Illinois at Urbana-Champaign Mark Sammons, V.G.Vinod.
Hubert CARDOTJY- RAMELRashid-Jalal QURESHI Université François Rabelais de Tours, Laboratoire d'Informatique 64, Avenue Jean Portalis, TOURS – France.
Answer Validation Exercise - AVE QA subtrack at Cross-Language Evaluation Forum 2007 UNED (coord.) Anselmo Peñas Álvaro Rodrigo Valentín Sama Felisa Verdejo.
Copyright © Cengage Learning. All rights reserved.
Knowledge and Tree-Edits in Learnable Entailment Proofs Asher Stern and Ido Dagan (earlier partial version by Roy Bar-Haim) Download at:
Fabio Massimo Zanzotto
“How much context do you need?” An experiment about context size in Interactive Cross-language Question Answering B. Navarro, L. Moreno-Monteagudo, E.
Knowledge and Tree-Edits in Learnable Entailment Proofs Asher Stern, Amnon Lotan, Shachar Mirkin, Eyal Shnarch, Lili Kotlerman, Jonathan Berant and Ido.
METEOR-Ranking & M-BLEU: Flexible Matching & Parameter Tuning for MT Evaluation Alon Lavie and Abhaya Agarwal Language Technologies Institute Carnegie.
Triplet Extraction from Sentences Lorand Dali Blaž “Jožef Stefan” Institute, Ljubljana 17 th of October 2008.
AQUAINT 18-Month Workshop 1 Light Semantic Processing for QA Language Technologies Institute, Carnegie Mellon B. Van Durme, Y. Huang, A. Kupsc and E. Nyberg.
1 Learning Sub-structures of Document Semantic Graphs for Document Summarization 1 Jure Leskovec, 1 Marko Grobelnik, 2 Natasa Milic-Frayling 1 Jozef Stefan.
CIKM Recognition and Classification of Noun Phrases in Queries for Effective Retrieval Wei Zhang 1 Shuang Liu 2 Clement Yu 1
Opinion Holders in Opinion Text from Online Newspapers Youngho Kim, Yuchul Jung and Sung-Hyon Myaeng Reporter: Chia-Ying Lee Advisor: Prof. Hsin-Hsi Chen.
A Systematic Exploration of the Feature Space for Relation Extraction Jing Jiang & ChengXiang Zhai Department of Computer Science University of Illinois,
1 Intelligente Analyse- und Informationssysteme Frank Reichartz, Hannes Korte & Gerhard Paass Fraunhofer IAIS, Sankt Augustin, Germany Dependency Tree.
Artificial Intelligence Research Center Pereslavl-Zalessky, Russia Program Systems Institute, RAS.
Relation Alignment for Textual Entailment Recognition Cognitive Computation Group, University of Illinois Experimental ResultsTitle Mark Sammons, V.G.Vinod.
August 17, 2005Question Answering Passage Retrieval Using Dependency Parsing 1/28 Question Answering Passage Retrieval Using Dependency Parsing Hang Cui.
NP-Completeness (Nondeterministic Polynomial Completeness) Sushanth Sivaram Vallath & Z. Joseph.
Evaluating Answer Validation in multi- stream Question Answering Álvaro Rodrigo, Anselmo Peñas, Felisa Verdejo UNED NLP & IR group nlp.uned.es The Second.
2003 (c) University of Pennsylvania1 Better MT Using Parallel Dependency Trees Yuan Ding University of Pennsylvania.
Commonsense Reasoning in and over Natural Language Hugo Liu, Push Singh Media Laboratory of MIT The 8 th International Conference on Knowledge- Based Intelligent.
SALSA-WS 09/05 Approximating Textual Entailment with LFG and FrameNet Frames Aljoscha Burchardt, Anette Frank Computational Linguistics Department Saarland.
Event-Based Extractive Summarization E. Filatova and V. Hatzivassiloglou Department of Computer Science Columbia University (ACL 2004)
CONTEXTUAL SEARCH AND NAME DISAMBIGUATION IN USING GRAPHS EINAT MINKOV, WILLIAM W. COHEN, ANDREW Y. NG SIGIR’06 Date: 2008/7/17 Advisor: Dr. Koh,
Structured learning: overview Sunita Sarawagi IIT Bombay TexPoint fonts used in EMF. Read the TexPoint manual before.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Graphs Definition: a graph is an abstract representation of a set of objects where some pairs of the objects are connected by links. The interconnected.
Dan Roth University of Illinois, Urbana-Champaign 7 Sequential Models Tutorial on Machine Learning in Natural.
Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.
Statistical Natural Language Parsing Parsing: The rise of data and statistics.
Rule-based Reasoning in Semantic Text Analysis
Linguistic Graph Similarity for News Sentence Searching
Approaches to Machine Translation
Web News Sentence Searching Using Linguistic Graph Similarity
Learning Textual Entailment from Examples
Semantic Parsing for Question Answering
Reading Report: Open QA Systems
Recognizing Partial Textual Entailment
Donna M. Gates Carnegie Mellon University
Approaches to Machine Translation
CS246: Information Retrieval
The Winograd Schema Challenge Hector J. Levesque AAAI, 2011
Presentation transcript:

Robust Textual Inference via Graph Matching Aria Haghighi Andrew Ng Christopher Manning

Textual Entailment Examples TEXT (T): A Filipino hostage in Iraq was released. HYPOTHESIS (H): A Filipino hostage was freed in Iraq. Entailed Only Need Lexical Similarity Matching

Another Example T: The Psychobiology Institute of Israel was established in H: Israel was established in Not Entailed Must go beyond matching only words

The Need For Relations H: Israel was founded in T: The Psychobiolgy Institute of Israel was founded in No match for important relation in H! Must match words and relations between them

Our Approach Dependency Graph Represent words / phrases as vertices and edges as syntactic / semantic relations Graph Matching Approximate notion of Isomorphism H is entailed from T if the cost of matching H to T low.

Representation Pipeline Raw Text John’s mother walked to the store. Modified parser of [Klein and Manning ‘03] Handle collocations: John rang_up Mary Phrase Structure Parse PP S NPVP to the store. John’s mother walked

Representation Pipeline Phrase Structure Parse PP S NPVP to the store. John’s mother walked Dependency Tree walked (VBD) mother (NN) John (NNP) store (NN) subj poss to Modified Collins’ Head Rules Typed relations via tgrep expressions

Representation Pipeline Local dependencies not enough Additional Analysis Semantic Role Labeling [Toutanova et al ‘05] Named Entity Recognition: Collapse named entities into single vertex [Finkel et al ‘04] Coreference Resolution: T: Since its formation in 1948, Israel … H: Israel was established in 1948.

Matching Example Hypothesis Text

Cost Model Matching: A mapping from vertices of H to those of T (and NULL vertex) Cost of matching H to T determined by lowest cost matching

Vertex Cost Model Penalize for each vertex substitution

Vertex Substitution VertexSub(v,M(v)) Exact Match Synonym Match Hypernym Match: v is a “kind of” M(v) WordNet Similarity (Resnik Measure) Distributional Similarity Part-Of-Speech Match

Vertex Weight Weights for Vertex Importance Part-Of-Speech Named Entity Type TF-IDF

Relation Matching Partial Match (and Stem Match) T: The Japanese invasion of Manchuria. H: Japan invaded Manchuria. Ancestor Match T: John is studying French farming practices. H: John is studying French farming.

Relation Cost For each edge e in H, is the image under M, a path in T Weigh each edge according to “importance” of typed relation

Cost Model PathSub(v v’, M(v) M(v’)) Exact Match: Matching preserves edge and edge label Partial Match: Match preserves edge but not label Ancestor Match: M(v) is an ancestor of M(v’) Kinked Match: M(v) and M(v’) share a common ancestor Costs Scale with Length of Path

Final Cost Model Combine VertexCost and RelationCost

Matching Example Hypothesis Text

Finding Minimal Matching With VertexCost only, minimal matching found with Bipartite Graph Matching NP-Hard: RelationCost(M) = 0 if and only if H isomorphic to sub-graph of T Approximate Search Initialize M to best matching using only VertexCost(M) [Bipartite Graph Matching] Do Greedy Hill-climbing with full cost model Seems to do well in practice

Learning Weights Parameterize Substitution Costs Problem: We don’t know matchings in training data. If we did, training would be easy. Solution: Alternate between finding matchings and re-estimating parameters

Experiments CWS = Confidence Weighted Score Data: Recognizing Textual Entailment ‘05 [Dagan et al, ‘05] 567 Development Pairs 800 Test Pairs

Problem Cases Monotonicity Assumptions Superlatives T: Osaka is the tallest tower in western Japan. H: Osaka is the tallest tower in Japan. Non-Factive Verbs T: It is rumored that John is dating Sally. H: John is dating Sally.

Conclusions What’s been done Learned Graph Matching framework New edge and vertex features Fast effective search procedure What’s Needed? More Resources! Lexical Resources: Problems with Recall Better Dependency Parsing Measures of Phrasal Similarity

Thanks! Aria Haghighi Andrew Ng Christopher Manning

Examples T: C and D Technologies announced that it has closed the acquisition of Datel Inc. H: Datel Acquired C and D technologies. Not Entailed Recognize switch in argument structure. Note nominilization

Textual Entailment Problem Definition Given text and hypothesis (T,H) Determine if H ‘follows’ from T ? Not strict logical entailment Applications Information Extraction Question Answering