SALSA-WS 09/05 Approximating Textual Entailment with LFG and FrameNet Frames Aljoscha Burchardt, Anette Frank Computational Linguistics Department Saarland.

Slides:



Advertisements
Similar presentations
CILC2011 A framework for structured knowledge extraction and representation from natural language via deep sentence analysis Stefania Costantini Niva Florio.
Advertisements

COGEX at the Second RTE Marta Tatu, Brandon Iles, John Slavick, Adrian Novischi, Dan Moldovan Language Computer Corporation April 10 th, 2006.
COGEX at the Second RTE Marta Tatu, Brandon Iles, John Slavick, Adrian Novischi, Dan Moldovan Language Computer Corporation April 10 th, 2006.
Recognizing Textual Entailment Challenge PASCAL Suleiman BaniHani.
Logic Programming Infrastructure for Inferences on FrameNet Peter Baumgartner MPI Informatik, Saarbrücken Aljoscha Burchardt Universitaet des Saarlandes,
FATE: a FrameNet Annotated corpus for Textual Entailment Marco Pennacchiotti, Aljoscha Burchardt Computerlinguistik Saarland University, Germany LREC 2008,
The SALSA experience: semantic role annotation Katrin Erk University of Texas at Austin.
Building Text Meaning Representations from Contextually Related Frames – A Case Study – Aljoscha Burchardt Anette Frank Manfred Pinkal Saarland University.
Recognizing Implicit Discourse Relations in the Penn Discourse Treebank Ziheng Lin, Min-Yen Kan, and Hwee Tou Ng Department of Computer Science National.
1 Unsupervised Semantic Parsing Hoifung Poon and Pedro Domingos EMNLP 2009 Best Paper Award Speaker: Hao Xiong.
FATE: a FrameNet Annotated corpus for Textual Entailment Marco Pennacchiotti, Aljoscha Burchardt Computerlinguistik Saarland University, Germany LREC 2008,
Robust Textual Inference via Graph Matching Aria Haghighi Andrew Ng Christopher Manning.
Components for a semantic textual similarity system Focus on word and sentence similarity Formal side: define similarity in principle.
Normalized alignment of dependency trees for detecting textual entailment Erwin Marsi & Emiel Krahmer Tilburg University Wauter Bosma & Mariët Theune University.
Inducing Information Extraction Systems for New Languages via Cross-Language Projection Ellen Riloff University of Utah Charles Schafer, David Yarowksy.
Automatic Classification of Semantic Relations between Facts and Opinions Koji Murakami, Eric Nichols, Junta Mizuno, Yotaro Watanabe, Hayato Goto, Megumi.
Introduction to CL Session 1: 7/08/2011. What is computational linguistics? Processing natural language text by computers  for practical applications.
Shallow semantic parsing: Making most of limited training data Katrin Erk Sebastian Pado Saarland University.
Using Maximal Embedded Subtrees for Textual Entailment Recognition Sophia Katrenko & Pieter Adriaans Adaptive Information Disclosure project Human Computer.
UNED at PASCAL RTE-2 Challenge IR&NLP Group at UNED nlp.uned.es Jesús Herrera Anselmo Peñas Álvaro Rodrigo Felisa Verdejo.
Article by: Feiyu Xu, Daniela Kurz, Jakub Piskorski, Sven Schmeier Article Summary by Mark Vickers.
Comments on Guillaume Pitel: “Using bilingual LSA for FrameNet annotation of French text from generic resources” Gerd Fliedner Computational Linguistics.
Page 1 Relation Alignment for Textual Entailment Recognition Department of Computer Science University of Illinois at Urbana-Champaign Mark Sammons, V.G.Vinod.
Outline P1EDA’s simple features currently implemented –And their ablation test Features we have reviewed from Literature –(Let’s briefly visit them) –Iftene’s.
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,
Ontology Learning from Text: A Survey of Methods Source: LDV Forum,Volume 20, Number 2, 2005 Authors: Chris Biemann Reporter:Yong-Xiang Chen.
Andreea Bodnari, 1 Peter Szolovits, 1 Ozlem Uzuner 2 1 MIT, CSAIL, Cambridge, MA, USA 2 Department of Information Studies, University at Albany SUNY, Albany,
Knowledge and Tree-Edits in Learnable Entailment Proofs Asher Stern and Ido Dagan (earlier partial version by Roy Bar-Haim) Download at:
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Fabio Massimo Zanzotto
“How much context do you need?” An experiment about context size in Interactive Cross-language Question Answering B. Navarro, L. Moreno-Monteagudo, E.
Distributional Part-of-Speech Tagging Hinrich Schütze CSLI, Ventura Hall Stanford, CA , USA NLP Applications.
Assessing the Impact of Frame Semantics on Textual Entailment Authors: Aljoscha Burchardt, Marco Pennacchiotti, Stefan Thater, Manfred Pinkal Saarland.
Knowledge and Tree-Edits in Learnable Entailment Proofs Asher Stern, Amnon Lotan, Shachar Mirkin, Eyal Shnarch, Lili Kotlerman, Jonathan Berant and Ido.
SALSA The Saarbrücken Lexical Semantics Annotation & Acquisition Project Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Pado,
Syntactic and semantic models and algorithms in Question Answering Alexander Solovyev Bauman Moscow Sate Technical University RCDL.
Based on “Semi-Supervised Semantic Role Labeling via Structural Alignment” by Furstenau and Lapata, 2011 Advisors: Prof. Michael Elhadad and Mr. Avi Hayoun.
Scott Duvall, Brett South, Stéphane Meystre A Hands-on Introduction to Natural Language Processing in Healthcare Annotation as a Central Task for Development.
Semantic Matching Fausto Giunchiglia work in collaboration with Pavel Shvaiko The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, 2003.
This work is supported by the Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior National Business Center contract number.
Using a Lemmatizer to Support the Development and Validation of the Greek WordNet Harry Kornilakis 1, Maria Grigoriadou 1, Eleni Galiotou 1,2, Evangelos.
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
A Cascaded Finite-State Parser for German Michael Schiehlen Institut für Maschinelle Sprachverarbeitung Universität Stuttgart
Modelling Human Thematic Fit Judgments IGK Colloquium 3/2/2005 Ulrike Padó.
Approaches to Machine Translation CSC 5930 Machine Translation Fall 2012 Dr. Tom Way.
Opinion Holders in Opinion Text from Online Newspapers Youngho Kim, Yuchul Jung and Sung-Hyon Myaeng Reporter: Chia-Ying Lee Advisor: Prof. Hsin-Hsi Chen.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Fall 2005-Lecture 4.
Relation Alignment for Textual Entailment Recognition Cognitive Computation Group, University of Illinois Experimental ResultsTitle Mark Sammons, V.G.Vinod.
GermaNet-WS II A WordNet “Detour” to FrameNet Aljoscha Burchardt Katrin Erk Anette Frank* Saarland University, DFKI* Saarbrücken
Deep structure (semantic) Structure of language Surface structure (grammatical, lexical, phonological) Semantic units have all meaning components such.
Towards the Use of Linguistic Information in Automatic MT Evaluation Metrics Projecte de Tesi Elisabet Comelles Directores Irene Castellon i Victoria Arranz.
Support Vector Machines and Kernel Methods for Co-Reference Resolution 2007 Summer Workshop on Human Language Technology Center for Language and Speech.
Commonsense Reasoning in and over Natural Language Hugo Liu, Push Singh Media Laboratory of MIT The 8 th International Conference on Knowledge- Based Intelligent.
1 Measuring the Semantic Similarity of Texts Author : Courtney Corley and Rada Mihalcea Source : ACL-2005 Reporter : Yong-Xiang Chen.
1 Fine-grained and Coarse-grained Word Sense Disambiguation Jinying Chen, Hoa Trang Dang, Martha Palmer August 22, 2003.
Exploiting Named Entity Taggers in a Second Language Thamar Solorio Computer Science Department National Institute of Astrophysics, Optics and Electronics.
Knowledge Structure Vijay Meena ( ) Gaurav Meena ( )
SIMS 296a-4 Text Data Mining Marti Hearst UC Berkeley SIMS.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
A Database of Narrative Schemas A 2010 paper by Nathaniel Chambers and Dan Jurafsky Presentation by Julia Kelly.
Recognising Textual Entailment Johan Bos School of Informatics University of Edinburgh Scotland,UK.
Relation Extraction (RE) via Supervised Classification See: Jurafsky & Martin SLP book, Chapter 22 Exploring Various Knowledge in Relation Extraction.
Identifying Expressions of Opinion in Context Eric Breck and Yejin Choi and Claire Cardie IJCAI 2007.
PRESENTED BY: PEAR A BHUIYAN
Learning Textual Entailment from Examples
Improving a Pipeline Architecture for Shallow Discourse Parsing
Kiril Simov1, Alexander Popov1, Iliana Simova2, Petya Osenova1
Automatic Detection of Causal Relations for Question Answering
CS224N Section 3: Corpora, etc.
Presentation transcript:

SALSA-WS 09/05 Approximating Textual Entailment with LFG and FrameNet Frames Aljoscha Burchardt, Anette Frank Computational Linguistics Department Saarland University, Saarbr ü cken Second Pascal Challenge Workshop Venice, April 2006

SALSA-WS 09/05 Outline of this Talk Frame Semantics A baseline system for approximating Textual Entailment –LFG syntactical analyses with –Frame semantics –Statistical decision: entailed? Walk-through example from RTE 2006 RTE 2006 results / brief conclusions

SALSA-WS 09/05 Frame Semantics (Fillmore 1976, Fillmore et. al. 2003) Lexical semantic classification of predicates and their argument structure A frame represents a prototypical situation (e.g. Commercial_transaction, Theft, Awareness) A set of roles identifies the participants or propositions involved Frames are organized in a hierarchy Berkeley FrameNet Project db: 600 frames, lexical units, annotated sentences

SALSA-WS 09/05 SellerBMW bought Rover from British Aerospace. Buyer Rover was bought by BMW, which financed [...] the new Range Rover. Goods BMW, which acquired Rover in 1994, is now dismantling the company. Money BMW‘s purchase of Rover for $1.2 billion was a good move. Linguistic Normalizations (Frame: Commerce_buy) Voice: active / passive POS: verb / noun Lexicalization

SALSA-WS 09/05 Frame Semantics for RTE Focusing on lexical semantic classes and role- based argument structure –Built-in normalizations help to determine semantic similarity at a high level of abstraction –Disregarding aspects of “deep“ semantics: negation, modality, quantification,... –Open for deeper modeling on demand (e.g. our treatment of modality)

SALSA-WS 09/05 A Baseline System for Approximating Textual Entailment Fine-grained LFG-based syntactic analysis –English LFG grammar (Riezler et al. 2002) –Wide-coverage with high-quality probabilistic disambiguation Frame Semantics –Shallow lexical-semantic classification of predicate-argument structure –Extensions: WordNet senses, SUMO concepts Computing structural and semantic overlap of t and h –Hypothesis: large overlap ≈ entailment text hypothesis

Statistical Decision: Entailment? Computing Semantic Overlap Linguistic Analyses hypothesis LFG f-structure graph w/ frames & concepts text LFG f-structure graph w/ frames & concepts text-hypothesis match graph different types of matches (aspects of similarity) Feature extraction lexical, syntactic, semantic structure & overlap measures Model training & classification A Baseline System for Approximating Textual Entailment

SALSA-WS 09/05 Rule-based: extend & refine sem. proj. NEs, Locations Co-reference Modality, etc. Linguistic Components XLE parsing: LFG f-structure F-structure w/ semantics projection WordNet-based WSD: WordNet & SUMO Fred / Detour / Rosy: frames & roles Using XLE term rewriting system (Crouch 2005)

SALSA-WS 09/05 Example from RTE 2006 Pair 716 Text In 1983, Aki Kaurismäki directed his first full-time feature. Hypothesis Aki Kaurismäki directed a film.

LFG F-Structures

SALSA-WS 09/05 Automatic Frame Annotation for Text (SALTO Viewer) Fred & Rosy frames & roles (statistical) Detour System frames (via WordNet) Collins Parse

SALSA-WS 09/05 Automatic Frame Annotation for Hypothesis 716_h: Aki Karusmäki directed a film.

SALSA-WS 09/05 LFG + Frames for Hypothesis (FEFViewer) Aki Kaurismäki directed a film. Rule-based (LFG-NER)

SALSA-WS 09/05 Hypothesis-Text-Match Graphs Computing Structural and Semantic overlap Match graph bundles overlapping partial graphs marked by match types Aspects of similarity –Syntax-based (i.e. lexical and structural): Identical predicates (attributes) trigger node (edge) matches. – Semantics-based: Identical frames/concepts (roles) trigger node (edge) matches. Degrees of similarity –Strict matching –Weak matching conditions for non-identical predicates: “Structurally related” e.g. via coreference (relative clauses, appositives, pronominals) “Semantically related” via WordNet, Frame-Relations

h: Aki Kaurismäki directed a film. WordNet related t: In 1983, Aki Kaurismäki directed his first full-time feature. Grammatically related

Statistical Modeling Feature extraction on the basis of –Syntactic, Semantic matches (of different types) –Matching clusters’ sizes –Ratio (matched vs. hypothesis) –(Non-)matching modality –RTE-task, fragmentary (parse),… Training/classification with WEKA tool –Feature selection 1.Predicate Matches 2.Frame overlap 3.Matching cluster size –Model 1: Conjunctive rule (Feat. 1,2) –Model 2: LogitBoost (Feat. 1,2,3)

RTE 2006 Results all tasksIEIRQASUM Model Model SUM (and IR) are natural tasks for Frame Semantics, IE and QA need more deeper modeling (aboutness vs. factivity) Error analysis –True positives: high semantic overlap –True negatives: 27% involve modality mismatches –False examples: poor modeling of dissimalrity Many high-frequency features measuring similarity Few low-frequency features measuring dissimilarity

SALSA-WS 09/05 Brief Conclusions Good approximation of semantic similarity –Deep LFG syntactical analyses integrated with –Shallow lexical Frame Semantics (plus other lex. resources) –Match graph measuring overlap Need better model for semantic dissimilarity –Too few rejections (false positives >> false negatives) Towards deeper modeling –Treatment of modal contexts –Integration of lexical inferences –Open for collaborations

stmt_type(f(0),declarative). tense(f(0),past). pred(f(0),direct). mood(f(0),indicative). dsubj(f(0),f(7)). dobj(f(0),f(2)). pred(f(2),film). num(f(2),sg). det_type(f(2),indef). proper(f(7),name). pred(f(7),'Kaurismaki'). num(f(7),sg). mod(f(7),f(10)). proper(f(10),name). pred(f(10),'Aki'). num(f(10),sg). sslink(f(0),s(41)). sslink(f(2),s(42)). sslink(f(7),s(45)). sslink(f(10),s(59)). frame(s(41),'Behind_the_scenes'). artist(s(41),s(45)). production(s(41),s(42)). frame(s(42),'Behind_the_scenes'). frame(s(45),'People'). person(s(45),s(59)). person(s(45),s(45)). ont(s(41),s(48)). ont(s(42),s(49)). ont(s(45),s(56)). wn_syn(s(48),'direct#v#11'). sumo_sub(s(48),'Steering'). milo_sub(s(48),'Steering'). wn_syn(s(49),'film#n#1'). sumo_sub(s(49),'MotionPicture'). milo_sub(s(49),'MotionPicture'). sumo_syn(s(56),'Human'). sumo_syn(s(58),'Human'). LFG + Frames for Hypothesis (FEF)