University of Texas at Austin Machine Learning Group, Department of Computer Sciences: Learning a Compositional Semantic Parser.

Presentation transcript:

Learning a Compositional Semantic Parser using an Existing Syntactic Parser
Ruifang Ge and Raymond J. Mooney
Machine Learning Group, Department of Computer Sciences, University of Texas at Austin
August 4, 2009

2 Semantic Parsing
Semantic Parsing: maps a natural-language sentence into a completely formal meaning representation (MR) in a meaning representation language (MRL)
Applications –CLang (Chen et al., 2003) –Geoquery (Zelle & Mooney, 1996)

3 Example: CLang (RoboCup Coach Language)
In the RoboCup Coach competition, teams compete to coach simulated soccer players. The coaching instructions are given in a formal language called CLang.
Coach: "If our player 2 has the ball, then position our player 5 in the midfield."
Semantic Parsing ↓
CLang: ((bowner (player our {2})) (do (player our {5}) (pos (midfield))))
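The CLang MR above is an s-expression; a minimal sketch (illustrative helper, not the authors' code) of reading such an expression into a nested-list structure:

```python
import re

def parse_clang(s):
    """Parse a CLang s-expression into nested Python lists.
    Parentheses and braces both open sub-lists."""
    tokens = re.findall(r"[(){}]|[^\s(){}]+", s)
    pos = 0
    def parse():
        nonlocal pos
        tok = tokens[pos]; pos += 1
        if tok in "({":
            close = ")" if tok == "(" else "}"
            items = []
            while tokens[pos] != close:
                items.append(parse())
            pos += 1  # consume the closing delimiter
            return items
        return tok
    return parse()

mr = "((bowner (player our {2})) (do (player our {5}) (pos (midfield))))"
tree = parse_clang(mr)
# tree[0] is the condition, tree[1] is the directive
```

The nested structure mirrors the MR's predicate-argument layout, which is what the later composition steps operate on.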

4 GeoQuery: A Database Query Application
Query application for a U.S. geography database [Zelle & Mooney, 1996]
User: "How many states does the Mississippi run through?"
Semantic Parsing ↓
Query: answer(A, count(B, (state(B), C=riverid(mississippi), traverse(C,B)), A))
Database ↓
Answer: 10

5 Outline Prior work on learning semantic parsers Learning a compositional semantic parser using an existing syntactic parser Experimental results Conclusions

6 Prior Work: SCISSOR (Ge & Mooney, 2005)
A syntax-driven semantic parser.
Uses an integrated syntactic-semantic parsing model to generate a semantically augmented parse tree (SAPT) for an NL sentence.
Gold-standard SAPTs are required for training.

7 Meaning Representation Language Grammar (MRLG)
Assumes the MRL is defined by an unambiguous context-free grammar.
Each production rule introduces a single predicate into the MRL.
The parse of an MR gives its predicate-argument structure.

Production                      Predicate
CONDITION → (bowner PLAYER)     P_BOWNER
PLAYER → (player TEAM {UNUM})   P_PLAYER
UNUM → 2                        P_UNUM
TEAM → our                      P_OUR
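Under the MRLG assumption, a predicate's argument structure can be read off its production; a small illustrative sketch (the encoding mirrors the slide's table and is not the system's code):

```python
import re

# Each MRL production introduces exactly one predicate: predicate -> (LHS, RHS)
MRLG = {
    "P_BOWNER": ("CONDITION", "(bowner PLAYER)"),
    "P_PLAYER": ("PLAYER", "(player TEAM {UNUM})"),
    "P_UNUM":   ("UNUM", "2"),
    "P_OUR":    ("TEAM", "our"),
}
NONTERMINALS = {lhs for lhs, _ in MRLG.values()}

def arity(predicate):
    """A predicate's arity = number of non-terminals on its production's RHS."""
    _, rhs = MRLG[predicate]
    return sum(tok in NONTERMINALS for tok in re.findall(r"\w+", rhs))
```

So P_PLAYER takes two arguments (a TEAM and a UNUM), while P_UNUM and P_OUR are constants with arity 0.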

8 Example: Scissor (SAPT)
[Figure: SAPT for "our player 2 has the ball". Leaf labels: our (PRP$, P_OUR), player (NN, λa1 λa2 P_PLAYER), 2 (CD, P_UNUM), has (VB, λa1 P_BOWNER), the (DET, NULL), ball (NN, NULL); the internal NP, VP and S nodes carry the composed predicates P_PLAYER and P_BOWNER.]

9 Example: Scissor (Meaning Composition)
[Figure: the same SAPT with composed MRs at the internal nodes: NP "our player 2" → player(our {2}), VP "has the ball" → bowner(a1), root S → bowner(player(our {2})).]

10 Problem with Scissor
Requires extensive SAPT annotation for training, including both syntactic parse trees and semantic labels.
Must learn both syntax and semantics from the same limited training corpus.
High-performance syntactic parsers are available that are trained on existing large corpora (Collins, 1997; Charniak & Johnson, 2005).

11 Outline Prior work on learning semantic parsers Learning a compositional semantic parser using an existing syntactic parser Experimental results Conclusions

12 System Overview
Before training & testing: each training/test sentence s is run through a syntactic parser to produce a syntactic parse tree t.
Training: given the training set {(s, t, m)} (m is the sentence's MR) and the unambiguous CFG of the MRL, semantic knowledge acquisition produces a semantic lexicon & composition rules; parameter estimation then fits a probabilistic parsing model.
Testing: semantic parsing maps an input sentence s to an output MR m.

13 Outline Prior work on learning semantic parsers Learning a compositional semantic parser using an existing syntactic parser –Semantic knowledge acquisition –Parameter estimation Experimental results Conclusions

14 Learning a Semantic Lexicon Use a statistical MT approach based on Wong and Mooney (2006) to construct word alignments between NL sentences and their MRs. Train IBM Model 5 word alignment (GIZA++) to generate the top 5 word/predicate alignments for each training example. Assume each word alignment defines a possible mapping from words to predicates for composing the correct MR.

15 Example: Word Alignment
[Figure: alignment links for "our player 2 has the ball": our → P_OUR, player → P_PLAYER, 2 → P_UNUM, has → P_BOWNER; "the" and "ball" are unaligned.]
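A hedged sketch of how such alignment links could be collected into a semantic lexicon mapping words to candidate predicates (the pair encoding is illustrative; GIZA++'s actual output format differs):

```python
from collections import defaultdict

def lexicon_from_alignments(alignments):
    """alignments: one list of (word, predicate) links per
    (sentence, MR) training pair, e.g. the top-5 alignments
    produced for each example."""
    lex = defaultdict(set)
    for links in alignments:
        for word, pred in links:
            lex[word].add(pred)
    return lex

top_alignments = [
    [("our", "P_OUR"), ("player", "P_PLAYER"), ("2", "P_UNUM"), ("has", "P_BOWNER")],
    [("our", "P_OUR"), ("player", "P_PLAYER")],
]
lex = lexicon_from_alignments(top_alignments)
```

Each word may end up with several candidate predicates; the probabilistic parsing model later disambiguates among them.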

16 Learning Semantic Composition Rules
Assume each word alignment, together with the syntactic parse, defines a possible SAPT for composing the correct MR.
A bottom-up, left-to-right process composes the MRs of internal nodes so that the tree yields the sentence's correct MR.
Semantic composition rules are extracted directly during this process.

17 Learning Semantic Composition Rules
[Figure: syntactic parse of "our player 2 has the ball" with the aligned predicates at the leaves: our/PRP$ → P_OUR, player/NN → λa1 λa2 P_PLAYER, 2/CD → P_UNUM, has/VB → λa1 P_BOWNER, the/DET → NULL, ball/NN → NULL.]

18 Learning Semantic Composition Rules
[Figure: the same parse; composition proceeds bottom-up and left-to-right from these leaves.]

19 Learning Semantic Composition Rules
[Figure: composing the NP "player 2": extracted rule λa1 λa2 P_PLAYER + P_UNUM → {λa1 P_PLAYER, a2=c2}; the NP node receives λa1 P_PLAYER.]

20 Learning Semantic Composition Rules
[Figure: composing the NP "our player 2": extracted rule P_OUR + λa1 P_PLAYER → {P_PLAYER, a1=c1}; the NP node receives P_PLAYER.]

21 Learning Semantic Composition Rules
[Figure: composing the VP "has the ball": the NULL nodes ("the", "ball") contribute nothing, so the VP receives λa1 P_BOWNER.]

22 Learning Semantic Composition Rules
[Figure: composing the root S: extracted rule P_PLAYER + λa1 P_BOWNER → {P_BOWNER, a1=c1}; S receives P_BOWNER, completing the MR.]
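Each extracted rule fills one argument slot of the head's lambda term with a child's MR; a simplified sketch of that composition step (the representation is hypothetical, not the authors' implementation):

```python
from dataclasses import dataclass, field

@dataclass
class MR:
    pred: str
    args: dict = field(default_factory=dict)  # filled argument slots
    open_slots: tuple = ()                    # still-unfilled slots, e.g. ("a1", "a2")

def compose(head, child, fill_slot):
    """Apply a rule like λa1 λa2 P_PLAYER + P_UNUM → {λa1 P_PLAYER, a2=c2}:
    bind one open slot of the head to the child's MR."""
    args = dict(head.args)
    args[fill_slot] = child
    slots = tuple(s for s in head.open_slots if s != fill_slot)
    return MR(head.pred, args, slots)

player = MR("P_PLAYER", open_slots=("a1", "a2"))
step1 = compose(player, MR("P_UNUM"), "a2")  # λa1 P_PLAYER
step2 = compose(step1, MR("P_OUR"), "a1")    # P_PLAYER, fully applied
```

After both steps the NP's MR has no open slots, mirroring slide 20's fully composed P_PLAYER node.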

23 Ensuring Meaning Composition The syntactic structure of the NL is sometimes different from the syntactic structure of the MR. Introduce macro-predicates that combine multiple predicates. These ensure that each MR in the training set can be composed using the syntactic parse of its corresponding NL given its word alignments.

24 Ensuring Meaning Composition
[Figure: syntactic parse of "then position our player 5 in the midfield" with leaf predicates: then/RB → λa1 λa2 P_DO, position/VB → λa1 P_POS, our player 5/NP → P_PLAYER, in the midfield/PP → P_MIDFIELD.]

25 Ensuring Meaning Composition
[Figure: composing the VP "position our player 5": λa1 P_POS + P_PLAYER → ?; P_PLAYER cannot fill P_POS's argument (which expects a REGION), so no ordinary composition rule applies.]

26 Ensuring Meaning Composition
[Figure: a macro-predicate P_DO_POS (a1: PLAYER, a2: REGION) combines P_DO and P_POS; extracted rule: λa1 P_POS + P_PLAYER → {λp1 λa2 P_DO_POS, a1=c2}.]

27 Ensuring Meaning Composition
[Figure: extracted rule λp1 λa2 P_DO_POS + P_MIDFIELD → {λp1 P_DO_POS, a2=c2}, where λp1 P_DO_POS = λp1 P_DO.]

28 Ensuring Meaning Composition
[Figure: at the root, extracted rule λa1 λa2 P_DO + λp1 P_DO → {P_DO, a1=(c2,a1), a2=(c2,a2)} completes the meaning composition.]

29 Outline Prior work on learning semantic parsers Learning a compositional semantic parser using an existing syntactic parser –Semantic knowledge acquisition –Parameter estimation Experimental results Conclusions

30 Parameter Estimation
Use a maximum-entropy model similar to that of Zettlemoyer & Collins (2006) and Wong & Mooney (2006) (s: sentence; t: syntactic parse; s_a: SAPT)
Training finds a parameter vector that (approximately) maximizes the sum of the conditional log-likelihoods of the training set's MRs (m: MR)
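The slide's equations did not survive the transcript; a standard form for such a max-ent model over latent SAPTs, reconstructed as an assumption consistent with the cited work, is:

```latex
P_\theta(s_a \mid s, t) \;=\;
  \frac{\exp\big(\theta \cdot f(s_a)\big)}
       {\sum_{s_a'} \exp\big(\theta \cdot f(s_a')\big)},
\qquad
\theta^{*} \;=\; \arg\max_\theta \sum_{i} \log
  \sum_{s_a:\, \mathrm{MR}(s_a)=m_i} P_\theta(s_a \mid s_i, t_i)
```

where f(s_a) is the feature vector of a SAPT and the inner sum marginalizes over all SAPTs that yield the correct MR m_i.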

31 Features Lexical features: –Unigram features: the number of times a word is assigned a particular predicate. –Bigram features: the number of times a word is assigned a particular predicate given its previous/subsequent word. Rule features: the number of times a particular composition rule is applied in a derivation
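These three feature classes are simple counts over a derivation; a hedged sketch (the derivation encoding here is hypothetical):

```python
from collections import Counter

def features(derivation):
    """derivation = (assignments, rules): a list of (word, predicate)
    assignments in sentence order, plus the composition rules applied."""
    assignments, rules = derivation
    f = Counter()
    for i, (w, p) in enumerate(assignments):
        f[("uni", w, p)] += 1                          # unigram feature
        prev = assignments[i - 1][0] if i > 0 else "<s>"
        f[("bi", prev, w, p)] += 1                     # bigram feature (previous word)
    for r in rules:
        f[("rule", r)] += 1                            # rule feature
    return f

example = ([("our", "P_OUR"), ("player", "P_PLAYER"), ("2", "P_UNUM")],
           ["NP_RULE", "NP_RULE"])
f = features(example)
```

The resulting sparse count vector is what the max-ent model's dot product θ·f(s_a) is taken over.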

32 Outline Prior work on learning semantic parsers Learning a compositional semantic parser using an existing syntactic parser Experimental results Conclusions

33 Experimental Corpora
CLang –300 randomly selected rules from the log files of the 2003 RoboCup Coach Competition –each CLang rule is annotated with NL sentences –22.52 words per sentence
GeoQuery [Zelle & Mooney, 1996] –Prolog logical forms –880 queries for a U.S. geography database –7.48 words per sentence

34 Experimental Methodology Evaluated using standard 10-fold cross validation Correctness –CLang: output exactly matches the correct representation –Geoquery: query retrieves correct answer, reflecting the quality of the final result returned to the user

35 Experimental Methodology
Metrics (standard for this task):
Precision: percentage of returned MRs that are correct
Recall: percentage of test sentences for which the correct MR is returned
F-measure: harmonic mean of precision and recall
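A small sketch of these metrics (assuming `num_parsed` counts sentences for which the parser produced any MR; the function name is illustrative):

```python
def prf(num_correct, num_parsed, num_sentences):
    """Precision, recall, and F-measure as standardly defined for
    semantic parsing evaluation."""
    p = num_correct / num_parsed
    r = num_correct / num_sentences
    f = 2 * p * r / (p + r)
    return p, r, f

p, r, f = prf(80, 90, 100)
```

Precision and recall differ only when the parser fails to return an MR for some sentences, which is why both are reported.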

36 Compared Systems
SCISSOR (Ge & Mooney, 2005) –an integrated syntactic-semantic parser; training requires SAPTs
KRISP (Kate & Mooney, 2006) –an SVM-based parser using string kernels
Lu (Lu et al., 2008) –a generative model with discriminative reranking
WASP (Wong & Mooney, 2006; Wong & Mooney, 2007) –a system based on synchronous grammars
Z&C (Zettlemoyer & Collins, 2007) –a probabilistic parser based on relaxed CCG grammars, requiring a hand-built, ambiguous CCG grammar template
Our system –requires a syntactic parser

37 Syntactic Parses Utilized
Gold-standard syntactic parses (GoldSyn)
Collins's head-driven syntactic parsing model II trained on WSJ plus a small number of in-domain training sentences –CLang: Syn20 (88.21%) –Geoquery: Syn40 (91.46%)
Collins's parser trained only on WSJ –CLang: Syn0 (82.15%) –Geoquery: Syn0 (76.44%)

38 Performance on CLang
[Table: precision, recall and F-measure for GOLDSYN, SYN20, SYN0, WASP, KRISP, SCISSOR and Lu; numeric values not preserved in the transcript.]

39 Performance on Geoquery
[Table: precision, recall and F-measure for GOLDSYN, SYN40, SYN0, WASP, Z&C, SCISSOR, KRISP and Lu; numeric values not preserved in the transcript.]

40 Experiments with Less Training Data
CLang40: 40 randomly selected examples from the CLang training set
GEO250 (Zelle & Mooney, 1996): an existing 250-example subset

41 Performance on CLang40
[Table: precision, recall and F-measure for GOLDSYN, SYN20, SYN0, WASP, KRISP and SCISSOR; numeric values not preserved in the transcript.]

42 Performance on GEO250
[Table: precision, recall and F-measure for GOLDSYN, SYN40, SYN0, WASP, SCISSOR, KRISP and Lu; numeric values not preserved in the transcript.]

43 Conclusion Presented a new approach to learning a semantic parser that exploits an existing syntactic parser to produce disambiguated parse trees. The experiments showed that the resulting system produces improved results on standard corpora. The improvements of our system with less training data demonstrate the advantage of utilizing an existing syntactic parser. –Syntactic structure is learned from large open domain treebanks instead of relying just on the training data.

44 Thank You! Questions?