Presentation is loading. Please wait.

Presentation is loading. Please wait.

SALSA-WS 09/05 Approximating Textual Entailment with LFG and FrameNet Frames Aljoscha Burchardt, Anette Frank Computational Linguistics Department Saarland.

Similar presentations


Presentation on theme: "SALSA-WS 09/05 Approximating Textual Entailment with LFG and FrameNet Frames Aljoscha Burchardt, Anette Frank Computational Linguistics Department Saarland."— Presentation transcript:

1 SALSA-WS 09/05 Approximating Textual Entailment with LFG and FrameNet Frames Aljoscha Burchardt, Anette Frank Computational Linguistics Department Saarland University, Saarbr ü cken Second Pascal Challenge Workshop Venice, April 2006

2 SALSA-WS 09/05 Outline of this Talk Frame Semantics A baseline system for approximating Textual Entailment –LFG syntactical analyses with –Frame semantics –Statistical decision: entailed? Walk-through example from RTE 2006 RTE 2006 results / brief conclusions

3 SALSA-WS 09/05 Frame Semantics (Fillmore 1976, Fillmore et. al. 2003) Lexical semantic classification of predicates and their argument structure A frame represents a prototypical situation (e.g. Commercial_transaction, Theft, Awareness) A set of roles identifies the participants or propositions involved Frames are organized in a hierarchy Berkeley FrameNet Project db: 600 frames, 9.000 lexical units, 135.000 annotated sentences

4 SALSA-WS 09/05 SellerBMW bought Rover from British Aerospace. Buyer Rover was bought by BMW, which financed [...] the new Range Rover. Goods BMW, which acquired Rover in 1994, is now dismantling the company. Money BMW‘s purchase of Rover for $1.2 billion was a good move. Linguistic Normalizations (Frame: Commerce_buy) Voice: active / passive POS: verb / noun Lexicalization

5 SALSA-WS 09/05 Frame Semantics for RTE Focusing on lexical semantic classes and role- based argument structure –Built-in normalizations help to determine semantic similarity at a high level of abstraction –Disregarding aspects of “deep“ semantics: negation, modality, quantification,... –Open for deeper modeling on demand (e.g. our treatment of modality)

6 SALSA-WS 09/05 A Baseline System for Approximating Textual Entailment Fine-grained LFG-based syntactic analysis –English LFG grammar (Riezler et al. 2002) –Wide-coverage with high-quality probabilistic disambiguation Frame Semantics –Shallow lexical-semantic classification of predicate-argument structure –Extensions: WordNet senses, SUMO concepts Computing structural and semantic overlap of t and h –Hypothesis: large overlap ≈ entailment text hypothesis

7 Statistical Decision: Entailment? Computing Semantic Overlap Linguistic Analyses hypothesis LFG f-structure graph w/ frames & concepts text LFG f-structure graph w/ frames & concepts text-hypothesis match graph different types of matches (aspects of similarity) Feature extraction lexical, syntactic, semantic structure & overlap measures Model training & classification A Baseline System for Approximating Textual Entailment

8 SALSA-WS 09/05 Rule-based: extend & refine sem. proj. NEs, Locations Co-reference Modality, etc. Linguistic Components XLE parsing: LFG f-structure F-structure w/ semantics projection WordNet-based WSD: WordNet & SUMO Fred / Detour / Rosy: frames & roles Using XLE term rewriting system (Crouch 2005)

9 SALSA-WS 09/05 Example from RTE 2006 Pair 716 Text In 1983, Aki Kaurismäki directed his first full-time feature. Hypothesis Aki Kaurismäki directed a film.

10 LFG F-Structures

11 SALSA-WS 09/05 Automatic Frame Annotation for Text (SALTO Viewer) Fred & Rosy frames & roles (statistical) Detour System frames (via WordNet) Collins Parse

12 SALSA-WS 09/05 Automatic Frame Annotation for Hypothesis 716_h: Aki Karusmäki directed a film.

13 SALSA-WS 09/05 LFG + Frames for Hypothesis (FEFViewer) Aki Kaurismäki directed a film. Rule-based (LFG-NER)

14 SALSA-WS 09/05 Hypothesis-Text-Match Graphs Computing Structural and Semantic overlap Match graph bundles overlapping partial graphs marked by match types Aspects of similarity –Syntax-based (i.e. lexical and structural): Identical predicates (attributes) trigger node (edge) matches. – Semantics-based: Identical frames/concepts (roles) trigger node (edge) matches. Degrees of similarity –Strict matching –Weak matching conditions for non-identical predicates: “Structurally related” e.g. via coreference (relative clauses, appositives, pronominals) “Semantically related” via WordNet, Frame-Relations

15 h: Aki Kaurismäki directed a film. WordNet related t: In 1983, Aki Kaurismäki directed his first full-time feature. Grammatically related

16 Statistical Modeling Feature extraction on the basis of –Syntactic, Semantic matches (of different types) –Matching clusters’ sizes –Ratio (matched vs. hypothesis) –(Non-)matching modality –RTE-task, fragmentary (parse),… Training/classification with WEKA tool –Feature selection 1.Predicate Matches 2.Frame overlap 3.Matching cluster size –Model 1: Conjunctive rule (Feat. 1,2) –Model 2: LogitBoost (Feat. 1,2,3)

17 RTE 2006 Results all tasksIEIRQASUM Model 159.049.559.554.572.5 Model 257.848.558.557.067.0 SUM (and IR) are natural tasks for Frame Semantics, IE and QA need more deeper modeling (aboutness vs. factivity) Error analysis –True positives: high semantic overlap –True negatives: 27% involve modality mismatches –False examples: poor modeling of dissimalrity Many high-frequency features measuring similarity Few low-frequency features measuring dissimilarity

18 SALSA-WS 09/05 Brief Conclusions Good approximation of semantic similarity –Deep LFG syntactical analyses integrated with –Shallow lexical Frame Semantics (plus other lex. resources) –Match graph measuring overlap Need better model for semantic dissimilarity –Too few rejections (false positives >> false negatives) Towards deeper modeling –Treatment of modal contexts –Integration of lexical inferences –Open for collaborations

19 stmt_type(f(0),declarative). tense(f(0),past). pred(f(0),direct). mood(f(0),indicative). dsubj(f(0),f(7)). dobj(f(0),f(2)). pred(f(2),film). num(f(2),sg). det_type(f(2),indef). proper(f(7),name). pred(f(7),'Kaurismaki'). num(f(7),sg). mod(f(7),f(10)). proper(f(10),name). pred(f(10),'Aki'). num(f(10),sg). sslink(f(0),s(41)). sslink(f(2),s(42)). sslink(f(7),s(45)). sslink(f(10),s(59)). frame(s(41),'Behind_the_scenes'). artist(s(41),s(45)). production(s(41),s(42)). frame(s(42),'Behind_the_scenes'). frame(s(45),'People'). person(s(45),s(59)). person(s(45),s(45)). ont(s(41),s(48)). ont(s(42),s(49)). ont(s(45),s(56)). wn_syn(s(48),'direct#v#11'). sumo_sub(s(48),'Steering'). milo_sub(s(48),'Steering'). wn_syn(s(49),'film#n#1'). sumo_sub(s(49),'MotionPicture'). milo_sub(s(49),'MotionPicture'). sumo_syn(s(56),'Human'). sumo_syn(s(58),'Human'). LFG + Frames for Hypothesis (FEF)


Download ppt "SALSA-WS 09/05 Approximating Textual Entailment with LFG and FrameNet Frames Aljoscha Burchardt, Anette Frank Computational Linguistics Department Saarland."

Similar presentations


Ads by Google