SEMANTIC ROLE LABELING BY TAGGING SYNTACTIC CHUNKS

Slides:



Advertisements
Similar presentations
School of something FACULTY OF OTHER School of Computing FACULTY OF ENGINEERING Chunking: Shallow Parsing Eric Atwell, Language Research Group.
Advertisements

Information Extraction Lecture 7 – Linear Models (Basic Machine Learning) CIS, LMU München Winter Semester Dr. Alexander Fraser, CIS.
CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 2 (06/01/06) Prof. Pushpak Bhattacharyya IIT Bombay Part of Speech (PoS)
Page 1 SRL via Generalized Inference Vasin Punyakanok, Dan Roth, Wen-tau Yih, Dav Zimak, Yuancheng Tu Department of Computer Science University of Illinois.
Sequence Classification: Chunking Shallow Processing Techniques for NLP Ling570 November 28, 2011.
Prototype-Driven Grammar Induction Aria Haghighi and Dan Klein Computer Science Division University of California Berkeley.
E XTRACTING SEMANTIC ROLE INFORMATION FROM UNSTRUCTURED TEXTS Diana Trandab ă 1 and Alexandru Trandab ă 2 1 Faculty of Computer Science, University “Al.
Part-Of-Speech Tagging and Chunking using CRF & TBL
Max-Margin Matching for Semantic Role Labeling David Vickrey James Connor Daphne Koller Stanford University.
计算机科学与技术学院 Chinese Semantic Role Labeling with Dependency-driven Constituent Parse Tree Structure Hongling Wang, Bukang Wang Guodong Zhou NLP Lab, School.
Statistical NLP: Lecture 3
Semantic Role Labeling Abdul-Lateef Yussiff
A Joint Model For Semantic Role Labeling Aria Haghighi, Kristina Toutanova, Christopher D. Manning Computer Science Department Stanford University.
Recognizing Implicit Discourse Relations in the Penn Discourse Treebank Ziheng Lin, Min-Yen Kan, and Hwee Tou Ng Department of Computer Science National.
Towards Parsing Unrestricted Text into PropBank Predicate- Argument Structures ACL4 Project NCLT Seminar Presentation, 7th June 2006 Conor Cafferkey.
CS460/449 : Speech, Natural Language Processing and the Web/Topics in AI Programming (Lecture 2– Introduction+ML and NLP) Pushpak Bhattacharyya CSE Dept.,
Semantic Role Chunking Combining Complementary Syntactic Views Sameer Pradhan, Kadri Hacioglu, Wayne Ward, James H. Martin, Daniel Jurafsky  Center for.
NLP and Speech Course Review. Morphological Analyzer Lexicon Part-of-Speech (POS) Tagging Grammar Rules Parser thethe – determiner Det NP → Det.
SRL using complete syntactic analysis Mihai Surdeanu and Jordi Turmo TALP Research Center Universitat Politècnica de Catalunya.
Two-Phase Semantic Role Labeling based on Support Vector Machines Kyung-Mi Park Young-Sook Hwang Hae-Chang Rim NLP Lab. Korea Univ.
1 CSC 594 Topics in AI – Applied Natural Language Processing Fall 2009/ Shallow Parsing.
Page 1 Generalized Inference with Multiple Semantic Role Labeling Systems Peter Koomen, Vasin Punyakanok, Dan Roth, (Scott) Wen-tau Yih Department of Computer.
A transformation-based approach to argument labeling Derrick Higgins Educational Testing Service
A Memory-Based Approach to Semantic Role Labeling Beata Kouchnir Tübingen University 05/07/04.
1 Natural Language Processing for the Web Prof. Kathleen McKeown 722 CEPSR, Office Hours: Wed, 1-2; Tues 4-5 TA: Yves Petinot 719 CEPSR,
Semantic Role Labeling using Maximum Entropy Model Joon-Ho Lim NLP Lab. Korea Univ.
Semantic Role Labeling
Richard Socher Cliff Chiung-Yu Lin Andrew Y. Ng Christopher D. Manning
Probabilistic Parsing Reading: Chap 14, Jurafsky & Martin This slide set was adapted from J. Martin, U. Colorado Instructor: Paul Tarau, based on Rada.
Authors: Ting Wang, Yaoyong Li, Kalina Bontcheva, Hamish Cunningham, Ji Wang Presented by: Khalifeh Al-Jadda Automatic Extraction of Hierarchical Relations.
Ling 570 Day 17: Named Entity Recognition Chunking.
HW7 Extracting Arguments for % Ang Sun March 25, 2012.
Methods for the Automatic Construction of Topic Maps Eric Freese, Senior Consultant ISOGEN International.
Semantic Role Labeling for Arabic using Kernel Methods Mona Diab Alessandro Moschitti Daniele Pighin.
AQUAINT Workshop – June 2003 Improved Semantic Role Parsing Kadri Hacioglu, Sameer Pradhan, Valerie Krugler, Steven Bethard, Ashley Thornton, Wayne Ward,
INSTITUTE OF COMPUTING TECHNOLOGY Forest-based Semantic Role Labeling Hao Xiong, Haitao Mi, Yang Liu and Qun Liu Institute of Computing Technology Academy.
A Cascaded Finite-State Parser for German Michael Schiehlen Institut für Maschinelle Sprachverarbeitung Universität Stuttgart
A Systematic Exploration of the Feature Space for Relation Extraction Jing Jiang & ChengXiang Zhai Department of Computer Science University of Illinois,
Linguistic Essentials
AQUAINT Phase II Six Month Workshop – October 2004 Fusing Rich Information Extracted from Multiple Media and Languages to Generate Contextualized, Complex.
Prototype-Driven Learning for Sequence Models Aria Haghighi and Dan Klein University of California Berkeley Slides prepared by Andrew Carlson for the Semi-
CPE 480 Natural Language Processing Lecture 4: Syntax Adapted from Owen Rambow’s slides for CSc Fall 2006.
Combining Lexical Resources: Mapping Between PropBank and VerbNet Edward Loper,Szu-ting Yi, Martha Palmer September 2006.
CS : Speech, NLP and the Web/Topics in AI Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture-14: Probabilistic parsing; sequence labeling, PCFG.
Deep Learning for Efficient Discriminative Parsing Niranjan Balasubramanian September 2 nd, 2015 Slides based on Ronan Collobert’s Paper and video from.
NLP. Introduction to NLP Background –From the early ‘90s –Developed at the University of Pennsylvania –(Marcus, Santorini, and Marcinkiewicz 1993) Size.
Sentences Ed McCorduck English 402--Grammar SUNY Cortland
POS Tagger and Chunker for Tamil
Shallow Parsing for South Asian Languages -Himanshu Agrawal.
11 Project, Part 3. Outline Basics of supervised learning using Naïve Bayes (using a simpler example) Features for the project 2.
NLP. Introduction to NLP Last week, Min broke the window with a hammer. The window was broken with a hammer by Min last week With a hammer, Min broke.
Exploiting Named Entity Taggers in a Second Language Thamar Solorio Computer Science Department National Institute of Astrophysics, Optics and Electronics.
Virtual Examples for Text Classification with Support Vector Machines Manabu Sassano Proceedings of the 2003 Conference on Emprical Methods in Natural.
CS : Speech, NLP and the Web/Topics in AI Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture-15: Probabilistic parsing; PCFG (contd.)
NLP. Parsing ( (S (NP-SBJ (NP (NNP Pierre) (NNP Vinken) ) (,,) (ADJP (NP (CD 61) (NNS years) ) (JJ old) ) (,,) ) (VP (MD will) (VP (VB join) (NP (DT.
Prototype-Driven Grammar Induction Aria Haghighi and Dan Klein Computer Science Division University of California Berkeley.
A Unified Architecture for Natural Language Processing: Deep Neural Networks with Multitask Learning Ronan Collobert Jason Weston Presented by Jie Peng.
Multi-Class Sentiment Analysis with Clustering and Score Representation Yan Zhu.
Dan Roth University of Illinois, Urbana-Champaign 7 Sequential Models Tutorial on Machine Learning in Natural.
Relation Extraction (RE) via Supervised Classification See: Jurafsky & Martin SLP book, Chapter 22 Exploring Various Knowledge in Relation Extraction.
Semantic Role Labelling Using Chunk Sequences Ulrike Baldewein Katrin Erk Sebastian Padó Saarland University Saarbrücken Detlef Prescher Amsterdam University.
Natural Language Processing Information Extraction Jim Martin (slightly modified by Jason Baldridge)
English Grammar Lecture 5: Sentences
Statistical NLP: Lecture 3
INAGO Project Automatic Knowledge Base Generation from Text for Interactive Question Answering.
CSCI 5832 Natural Language Processing
Linguistic Essentials
Sentences Ed McCorduck English 402--Grammar SUNY Cortland
CSCI 5832 Natural Language Processing
Progress report on Semantic Role Labeling
Presentation transcript:

SEMANTIC ROLE LABELING BY TAGGING SYNTACTIC CHUNKS Kadri Hacioglu1, Sameer Pradhan1, Wayne Ward1 James H. Martin1, Daniel Jurafsky2 1 The Center for Spoken Language Research 2 Stanford NLP Group Stanford University University of Colorado at Boulder

OUTLINE Semantic Role Labeling (SRL) Nature of Shared Task Data Our Strategy System Description & Features Experiments Concluding Remarks

SEMANTIC ROLE LABELING Based on predicate-argument structure: First explored by (Gildea & Jurafsky, 2000) Predicate: pursue A1 (Theme) A0 (Agent) AM-MNR (Manner) we completion of this transaction aggressively PropBank style Thematic role [A0We] are prepared to [PREDpursue] [A1aggressively] [AM-MNR completion of this transaction] he says

EXAMPLE OF SHARED TASK DATA words POS tags Clause tags Semantic labels Sales NNS B-NP (S* - (A1*A1) declined VBD B-VP * decline (V*V) 10 CD B-NP * - (A2* % NN I-NP * - *A2) to TO B-PP * - * $ $ B-NP * - (A4* 251.2 CD I-NP * - * million CD I-NP * - *A4) from IN B-PP * - * $ $ B-NP * - (A3* 287.7 CD I-NP * - * million CD I-NP * - *A3) . . O *S) - * BP tags (BOI2) Predicate Info

OUTLINE OF OUR STRATEGY Change Shared Task Representation make sure that it is reversible Engineer additional features use intuition, experience and data analysis Optimize system settings context size SVM parameters; degree of polynomial, C

CHANGE IN REPRESENTATION Restructure available information - words collapsed into respective BPs - only headwords are retained (rightmost words) - exceptions: VPs with the predicate; Outside (O) chunks Modify semantic role labeling - BOI2 scheme instead of bracketing scheme

NEW REPRESENTATION BPs POS tags Clause tags Semantic labels (BOI2) NP Sales NNS B-NP (S* - B-A1 VP declined VBD B-VP * decline B-V NP % NN I-NP * - B-A2 PP to TO B-PP * - O NP million CD I-NP * - B-A4 PP from IN B-PP * - O NP million CD I-NP * - B-A3 O . . O *S) - O Headwords BP tags (BOI2) Predicate Info

DIFFERENCES BETWEEN REPRESENTATIONS Original Representation New Tokens words base phrases Lexical Info all words headwords #Tagging Steps larger fewer Context span narrower wider # Role Labels greater smaller Info Loss - yes Performance worse better

SYSTEM DESCRIPTION Phrase-by-phrase Left-to-right Binary feature encoding Discriminative Deterministic SVM based (YamCha toolkit, developed by Taku Kudo) Simple post-processing (for consistent bracketing)

BASE FEATURES Words Predicate lemmas Part of speech tags Base phrase IOB2 tags Clause bracketing tags Named Entities

ADDITIONAL FEATURES Token level Sentence level Token position Path Clause bracket patterns Clause Position Headword suffixes Distance Length Predicate POS tag Predicate Frequency Predicate Context (POS, BP) Predicate Argument Frames Number of predicates

EXPERIMENTAL SET-UP Corpus: Flattened PropBank (2004 release) Training set: Sections 15-18 Dev set: Section 20 Test set: Section 21 SVMs: 78 OVA classes, polynomial kernel, d=2, C=0.01 Context: sliding +2/-2 tokens window

RESULTS Base features, W-by-W & P-by-P approaches, dev set Method Precision Recall F1 W-by-W 68.34% 45.16% 54.39 P-by-P 69.04% 54.68% 61.02 All features, P-by-P approach Data Precision Recall F1 Dev set 74.17% 69.42% 71.72 Test set 72.43% 66.77% 69.49

CONCLUSIONS We have done SRL by tagging base phrase chunks - original representation has been changed - additional features have been engineered - SVMs have been used Improved performance with new representation and additional features Compared to W-by-W approach, our method - classifies larger units - uses wider context - runs faster - performs better

THANK YOU! So so… Boring! Cool! Wow!… That’s OK!… Awesome! Not too bad! Yawning..

CLAUSE FEATURES Clause (CL) markers CL pattern to predicate One CD B-NP (S* - OUT (S*(S**S) - troubling VBG I-NP * - OUT (S**S) (S* aspect NN I-NP * - OUT (S**S) (S* of IN B-PP * - OUT (S**S) (S* DEC NNP B-NP * - OUT (S**S) (S* 's POS B-NP * - OUT (S**S) (S* results NNS I-NP * - OUT (S**S) (S* , , O * - OUT (S**S) (S* analysts NNS B-NP (S* - IN (S**S) (S* said VBD B-VP *S) say IN - - , , O * - OUT *S) *S) was VBD B-VP * - OUT *S) *S) its PRP$ B-NP * - OUT *S) *S) performance NN I-NP * - OUT *S) *S) in IN B-PP * - OUT *S) *S) Europe NNP B-NP * - OUT *S) *S) . . O *S) - OUT *S)*S) - CL pattern to sentence begin predicate CL pattern to sentence end

SUFFIXES  suffixes of length 2-4 as features for head words are tried The confusion B-AM-MNR  B-AM-TMP single word cases: fetchingly, tacitly, provocatively  suffixes of length 2-4 as features for head words are tried