Two-Phase Semantic Role Labeling based on Support Vector Machines Kyung-Mi Park Young-Sook Hwang Hae-Chang Rim NLP Lab. Korea Univ.

Slides:



Advertisements
Similar presentations
SEMANTIC ROLE LABELING BY TAGGING SYNTACTIC CHUNKS
Advertisements

SVM - Support Vector Machines A new classification method for both linear and nonlinear data It uses a nonlinear mapping to transform the original training.
Classification / Regression Support Vector Machines
Progress update Lin Ziheng. System overview 2 Components – Connective classifier Features from Pitler and Nenkova (2009): – Connective: because – Self.
Page 1 SRL via Generalized Inference Vasin Punyakanok, Dan Roth, Wen-tau Yih, Dav Zimak, Yuancheng Tu Department of Computer Science University of Illinois.
Sequence Classification: Chunking Shallow Processing Techniques for NLP Ling570 November 28, 2011.
Support Vector Machines
Machine learning continued Image source:
Modeling the Evolution of Product Entities Priya Radhakrishnan 1, Manish Gupta 1,2, Vasudeva Varma 1 1 Search and Information Extraction Lab, IIIT-Hyderabad,
Max-Margin Matching for Semantic Role Labeling David Vickrey James Connor Daphne Koller Stanford University.
计算机科学与技术学院 Chinese Semantic Role Labeling with Dependency-driven Constituent Parse Tree Structure Hongling Wang, Bukang Wang Guodong Zhou NLP Lab, School.
IMAN SAUDY UMUT OGUR NORBERT KISS GEORGE TEPES-NICA BARLEY SEEDS CLASSIFICATION.
Statistical NLP: Lecture 3
Chunk Parsing CS1573: AI Application Development, Spring 2003 (modified from Steven Bird’s notes)
Semantic Role Labeling Abdul-Lateef Yussiff
Recognizing Implicit Discourse Relations in the Penn Discourse Treebank Ziheng Lin, Min-Yen Kan, and Hwee Tou Ng Department of Computer Science National.
Applications of Sequence Learning CMPT 825 Mashaal A. Memon
1 Fast Methods for Kernel-based Text Analysis Taku Kudo 工藤 拓 Yuji Matsumoto 松本 裕治 NAIST (Nara Institute of Science and Technology) 41st Annual Meeting.
SRL using complete syntactic analysis Mihai Surdeanu and Jordi Turmo TALP Research Center Universitat Politècnica de Catalunya.
Page 1 Generalized Inference with Multiple Semantic Role Labeling Systems Peter Koomen, Vasin Punyakanok, Dan Roth, (Scott) Wen-tau Yih Department of Computer.
A transformation-based approach to argument labeling Derrick Higgins Educational Testing Service
A Memory-Based Approach to Semantic Role Labeling Beata Kouchnir Tübingen University 05/07/04.
Course Summary LING 572 Fei Xia 03/06/07. Outline Problem description General approach ML algorithms Important concepts Assignments What’s next?
Boğaziçi University Artificial Intelligence Lab. Artificial Intelligence Laboratory Department of Computer Engineering Boğaziçi University Techniques for.
What is Learning All about ?  Get knowledge of by study, experience, or being taught  Become aware by information or from observation  Commit to memory.
Part-of-Speech Tagging & Sequence Labeling
The classification problem (Recap from LING570) LING 572 Fei Xia, Dan Jinguji Week 1: 1/10/08 1.
Chunking Pierre Bourreau Cristina España i Bonet LSI-UPC PLN-PTM.
Text Classification With Labeled and Unlabeled Data Presenter: Aleksandar Milisic Supervisor: Dr. David Albrecht.
Semantic Role Labeling using Maximum Entropy Model Joon-Ho Lim NLP Lab. Korea Univ.
Semantic Role Labeling
1 Data-Driven Dependency Parsing. 2 Background: Natural Language Parsing Syntactic analysis String to (tree) structure He likes fish S NP VP NP VNPrn.
Authors: Ting Wang, Yaoyong Li, Kalina Bontcheva, Hamish Cunningham, Ji Wang Presented by: Khalifeh Al-Jadda Automatic Extraction of Hierarchical Relations.
Ling 570 Day 17: Named Entity Recognition Chunking.
A search-based Chinese Word Segmentation Method ——WWW 2007 Xin-Jing Wang: IBM China Wen Liu: Huazhong Univ. China Yong Qin: IBM China.
HW7 Extracting Arguments for % Ang Sun March 25, 2012.
Methods for the Automatic Construction of Topic Maps Eric Freese, Senior Consultant ISOGEN International.
Semantic Role Labeling for Arabic using Kernel Methods Mona Diab Alessandro Moschitti Daniele Pighin.
A Language Independent Method for Question Classification COLING 2004.
Automatic classification for implicit discourse relations Lin Ziheng.
AQUAINT Workshop – June 2003 Improved Semantic Role Parsing Kadri Hacioglu, Sameer Pradhan, Valerie Krugler, Steven Bethard, Ashley Thornton, Wayne Ward,
INSTITUTE OF COMPUTING TECHNOLOGY Forest-based Semantic Role Labeling Hao Xiong, Haitao Mi, Yang Liu and Qun Liu Institute of Computing Technology Academy.
Opinion Holders in Opinion Text from Online Newspapers Youngho Kim, Yuchul Jung and Sung-Hyon Myaeng Reporter: Chia-Ying Lee Advisor: Prof. Hsin-Hsi Chen.
A Systematic Exploration of the Feature Space for Relation Extraction Jing Jiang & ChengXiang Zhai Department of Computer Science University of Illinois,
Considering Cost Asymmetry in Learning Classifiers Presented by Chunping Wang Machine Learning Group, Duke University May 21, 2007 by Bach, Heckerman and.
Prior Knowledge Driven Domain Adaptation Gourab Kundu, Ming-wei Chang, and Dan Roth Hyphenated compounds are tagged as NN. Example: H-ras Digit letter.
University of Texas at Austin Machine Learning Group Department of Computer Sciences University of Texas at Austin Support Vector Machines.
Introduction to Syntactic Parsing Roxana Girju November 18, 2004 Some slides were provided by Michael Collins (MIT) and Dan Moldovan (UT Dallas)
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Psychiatric document retrieval using a discourse-aware model Presenter : Wu, Jia-Hao Authors : Liang-Chih.
NLP. Introduction to NLP Background –From the early ‘90s –Developed at the University of Pennsylvania –(Marcus, Santorini, and Marcinkiewicz 1993) Size.
POS Tagger and Chunker for Tamil
Support Vector Machines and Kernel Methods for Co-Reference Resolution 2007 Summer Workshop on Human Language Technology Center for Language and Speech.
11 Project, Part 3. Outline Basics of supervised learning using Naïve Bayes (using a simpler example) Features for the project 2.
Virtual Examples for Text Classification with Support Vector Machines Manabu Sassano Proceedings of the 2003 Conference on Emprical Methods in Natural.
An Improved Algorithm for Decision-Tree-Based SVM Sindhu Kuchipudi INSTRUCTOR Dr.DONGCHUL KIM.
NLP. Parsing ( (S (NP-SBJ (NP (NNP Pierre) (NNP Vinken) ) (,,) (ADJP (NP (CD 61) (NNS years) ) (JJ old) ) (,,) ) (VP (MD will) (VP (VB join) (NP (DT.
Part-of-Speech Tagging & Sequence Labeling Hongning Wang
Prototype-Driven Grammar Induction Aria Haghighi and Dan Klein Computer Science Division University of California Berkeley.
A Unified Architecture for Natural Language Processing: Deep Neural Networks with Multitask Learning Ronan Collobert Jason Weston Presented by Jie Peng.
Multi-Class Sentiment Analysis with Clustering and Score Representation Yan Zhu.
Relation Extraction (RE) via Supervised Classification See: Jurafsky & Martin SLP book, Chapter 22 Exploring Various Knowledge in Relation Extraction.
Semantic Role Labelling Using Chunk Sequences Ulrike Baldewein Katrin Erk Sebastian Padó Saarland University Saarbrücken Detlef Prescher Amsterdam University.
Question Classification Ling573 NLP Systems and Applications April 25, 2013.
Statistical NLP: Lecture 3
Improving a Pipeline Architecture for Shallow Discourse Parsing
Prototype-Driven Learning for Sequence Models
Pawan Lingras and Cory Butz
Chunk Parsing CS1573: AI Application Development, Spring 2003
CSCI 5832 Natural Language Processing
Progress report on Semantic Role Labeling
Presentation transcript:

Two-Phase Semantic Role Labeling based on Support Vector Machines Kyung-Mi Park Young-Sook Hwang Hae-Chang Rim NLP Lab. Korea Univ.

2 Contents  Introduction  Two-phase semantic role labeling based on SVMs Semantic argument boundary identification phase Semantic role classification phase  Experiments  Conclusion

3 Introduction(1)  Advantages of using SVMs high generalization performance in high dimensional feature spaces learning with combination of multiple features is possible by virtue of polynomial kernel functions  Semantic Role Labeling(SRL) task is one of the multiclass classification task since SVM is a binary classifier, we have to extend SVMs to multiclass classification task we are often confronted with the unbalanced class distribution problem in a multiclass classification task

4 Introduction(2)  If we try to apply SVMs in the SRL task we have to find a method of resolving the unbalanced class distribution problem  Propose a two-phase SRL method Boundary identification phase + Role classification phase  We can alleviate the unbalanced class distribution problem In the identification phase, only three SVM classifiers are required to identify B-ARG, I-ARG, O. We can decrease the number of negative examples. In the classification phase, we can ignore non-arguments constituents

5 Two-phase Semantic Role Labeling(1)  First phase: semantic argument identification Phase Identify the boundary of semantic arguments First, segment a sentence into syntactic constituents(c) using a unit of chunk or subclause Second, classify syntactic constituents into B-ARG, I-ARG, O  Second phase: semantic role classification phase assign appropriate semantic roles to the identified semantic arguments

6 Two-phase Semantic Role Labeling(2) InputOutput Under the existing constract, Rockwell said, it has already delivered 793 of the shipsets to Boeing. IN DT VBG NN, NNP VBD, PRP VBZ RB VBN CD IN DT NNS TO NNP. B-PP B-NP I-NP O B-NP B-VP O B-NP B-VP I-VP B-NP B-PP B-NP I-NP B-PP B-NP O (S* * (S* *S) * *S) O B-ORG O B-MISC O - exist - deliver - * (V*V) (A1*A1) * (AM-LOC* * *AM-LOC) * (A0*A0) * (AM-TMP*AM-TMP) (V*V) (A1* * *A1) * (A2*A2) * Underthe existingcontract, Rockwell said,it has already delivered 793ofthe shipsetstoBoeing CCCCCCCPCCCCC B-ARGI-ARG OOOB-ARGP I-ARG OB-ARG ARG P AM-LOCA0PA1A2

7 Semantic Argument Boundary Identification(1)  Restrict the search space in terms of the constituents the left search boundary is set to the left boundary of the second upper clause the right search boundary is set to the right boundary of the immediate clause  Utilize features for identifying syntactic constituents which are dependent to a predicate Semantic arguments are dependent on the predicate Features for finding dependency relations are implicitly represented

8 Semantic Argument Boundary Identification(2)  29 features are used for representing syntactic and semantic information related to dependency relationships between syntactic constituents and predicate FeaturesValues predicate-constituent (intervening features) position distance # of VP, NP, SBAR # of POS [CC] [,] [:] POS[“]&POS[”] path -2, -1, 1 0, 1, 2… -1, 0, 1 VP-PP-NP, … predicate itself & context headword, headword’s POS, chunk type beginning word’s POS context-1: headword, headword’s POS, chunk type MD, TO, VBZ, … constituent itself & context headword, headword’s POS, chunk type context-2: headword, headword’s POS, chunk type context-1: headword, headword’s POS, chunk type context+1: headword, headword’s POS, chunk type

9 Semantic Role Classification(1)  We consider only 18 semantic roles based on frequency in the training data AM-MOD, AM-NEG are post-processed by hand-crafted rules we do not consider 19 semantic roles that appear less than 36 times in the training data — A5, AM-PRD, AM-REC, AA — R-A3, R-AA, R-AM-TMP, R-AM-LOC, R-AM-MNR, R-AM-ADV, R-AM- PNC — C-A0, C-A2, C-A3, C-AM-MNR, C-AM-ADV, C-AM-EXT, C-AM-DIS, C- AM-CAU 18 semantic roles A0, A1, A2, A3, A4, R-A0, R-A1, R-A2, C-A1 AM-TMP, AM-ADV, AM-MNR, AM-LOC, AM-DIS AM-PNC, AM-CAU, AM-DIR, AM-EXT

10 Semantic Role Classification(2)  This phase also uses all features applied in the identification phase except for # of POS[:] and POS[“] & POS[”]  In addition, we use voice feature This is a binary feature identifying whether the target phrase is active or passive  Named-entity information is not used performance is decreased when NE information is included

11 Experiments(1)  We used SVM light package (  In both phases, we used a polynomial kernel (degree 2) with the one-vs-rest classification method  Results on the development set (closed challenge) PrecisionRecallF-measureAccuracy Overall67.27%64.36%65.78%- Identification75.96%72.30%74.08%- Classification %

12 Experiments(2)  Results on the test set (closed challenge) prec.rec.F1 Overall A A A A A A50.00 AM-ADV AM-CAU AM-DIR20.00 AM-DIS AM-EXT AM-LOC AM-MNR prec.rec.F1 AM-MOD AM-NEG AM-PNC AM-PRD0.00 AM-TMP R-A R-A R-A R-A30.00 R-AM-LOC0.00 R-AM-MNR0.00 R-AM-PNC0.00 R-AM-TMP0.00 V96.66

13 Conclusion  proposed a method of two-phase semantic role labeling based on the support vector machines  By applying the two-phase method, we can alleviate the unbalanced class distribution problem caused by the negative examples  Our system obtains F-measure of % on the test set and % on the development set