Automatic Extraction of Hierarchical Relations from Text

Slides:



Advertisements
Similar presentations
University of Sheffield NLP Machine Learning in GATE Angus Roberts, Horacio Saggion, Genevieve Gorrell.
Advertisements

University of Sheffield NLP Module 11: Advanced Machine Learning.
Specialized models and ranking for coreference resolution Pascal Denis ALPAGE Project Team INRIA Rocquencourt F Le Chesnay, France Jason Baldridge.
Search-Based Structured Prediction
Discovering Severity and Body Site Modifiers Dmitriy Dligach, Ph.D. Boston Children’s Hospital and Harvard Medical School.
Sequence Classification: Chunking Shallow Processing Techniques for NLP Ling570 November 28, 2011.
计算机科学与技术学院 Chinese Semantic Role Labeling with Dependency-driven Constituent Parse Tree Structure Hongling Wang, Bukang Wang Guodong Zhou NLP Lab, School.
Multi-Task Transfer Learning for Weakly- Supervised Relation Extraction Jing Jiang Singapore Management University ACL-IJCNLP 2009.
Semantic Role Labeling Abdul-Lateef Yussiff
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
CS4705.  Idea: ‘extract’ or tag particular types of information from arbitrary text or transcribed speech.
Learning syntactic patterns for automatic hypernym discovery Rion Snow, Daniel Jurafsky and Andrew Y. Ng Prepared by Ang Sun
Extracting Interest Tags from Twitter User Biographies Ying Ding, Jing Jiang School of Information Systems Singapore Management University AIRS 2014, Kuching,
Towards a semantic extraction of named entities Diana Maynard, Kalina Bontcheva, Hamish Cunningham University of Sheffield, UK.
Recognition of Multi-sentence n-ary Subcellular Localization Mentions in Biomedical Abstracts G. Melli, M. Ester, A. Sarkar Dec. 6, 2007
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Authors: Ting Wang, Yaoyong Li, Kalina Bontcheva, Hamish Cunningham, Ji Wang Presented by: Khalifeh Al-Jadda Automatic Extraction of Hierarchical Relations.
Extracting Semantic Constraint from Description Text for Semantic Web Service Discovery Dengping Wei, Ting Wang, Ji Wang, and Yaodong Chen Reporter: Ting.
Ling 570 Day 17: Named Entity Recognition Chunking.
1 Technologies for (semi-) automatic metadata creation Diana Maynard.
Complex Linguistic Features for Text Classification: A Comprehensive Study Alessandro Moschitti and Roberto Basili University of Texas at Dallas, University.
Combining terminology resources and statistical methods for entity recognition: an evaluation Angus Roberts, Robert Gaizauskas, Mark Hepple, Yikun Guo.
A Language Independent Method for Question Classification COLING 2004.
Experiments of Opinion Analysis On MPQA and NTCIR-6 Yaoyong Li, Kalina Bontcheva, Hamish Cunningham Department of Computer Science University of Sheffield.
1 Learning Sub-structures of Document Semantic Graphs for Document Summarization 1 Jure Leskovec, 1 Marko Grobelnik, 2 Natasa Milic-Frayling 1 Jozef Stefan.
Natural language processing tools Lê Đức Trọng 1.
A Systematic Exploration of the Feature Space for Relation Extraction Jing Jiang & ChengXiang Zhai Department of Computer Science University of Illinois,
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
1 Language Technologies (2) Valentin Tablan University of Sheffield, UK ACAI 05 ADVANCED COURSE ON KNOWLEDGE DISCOVERY.
4. Relationship Extraction Part 4 of Information Extraction Sunita Sarawagi 9/7/2012CS 652, Peter Lindes1.
Support Vector Machines and Kernel Methods for Co-Reference Resolution 2007 Summer Workshop on Human Language Technology Center for Language and Speech.
11 Project, Part 3. Outline Basics of supervised learning using Naïve Bayes (using a simpler example) Features for the project 2.
Using Wikipedia for Hierarchical Finer Categorization of Named Entities Aasish Pappu Language Technologies Institute Carnegie Mellon University PACLIC.
Virtual Examples for Text Classification with Support Vector Machines Manabu Sassano Proceedings of the 2003 Conference on Emprical Methods in Natural.
Department of Computer Science The University of Texas at Austin USA Joint Entity and Relation Extraction using Card-Pyramid Parsing Rohit J. Kate Raymond.
Learning Event Durations from Event Descriptions Feng Pan, Rutu Mulkar, Jerry R. Hobbs University of Southern California ACL ’ 06.
Convolution Kernels on Constituent, Dependency and Sequential Structures for Relation Extraction Advisor: Hsin-His Chen Reporter: Chi-Hsin Yu Date:
Relation Extraction: Rule-based Approaches CSCI-GA.2590 Ralph Grishman NYU.
Dan Roth University of Illinois, Urbana-Champaign 7 Sequential Models Tutorial on Machine Learning in Natural.
Question Classification Ling573 NLP Systems and Applications April 25, 2013.
Automatically Labeled Data Generation for Large Scale Event Extraction
Investigating Pitch Accent Recognition in Non-native Speech
Sentiment analysis algorithms and applications: A survey
CRF &SVM in Medication Extraction
Longhua Qian School of Computer Science and Technology
Relation Extraction CSCI-GA.2591
Natural Language Processing (NLP)
张昊.
Supervised Machine Learning
Social Knowledge Mining
Machine Learning in Natural Language Processing
Discriminative Frequent Pattern Analysis for Effective Classification
Review-Level Aspect-Based Sentiment Analysis Using an Ontology
Automatic Detection of Causal Relations for Question Answering
CSCI 5832 Natural Language Processing
Cost Sensitive Evaluation Measures for F-term Classification
SVM Based Learning System for F-term Patent Classification
Perceptron Learning for Chinese Word Segmentation
Text Mining & Natural Language Processing
Using Uneven Margins SVM and Perceptron for IE
Natural Language Processing (NLP)
CS224N Section 3: Corpora, etc.
A Path-based Transfer Model for Machine Translation
Hierarchical, Perceptron-like Learning for OBIE
By Hossein Hematialam and Wlodek Zadrozny Presented by
Rachit Saluja 03/20/2019 Relation Extraction with Matrix Factorization and Universal Schemas Sebastian Riedel, Limin Yao, Andrew.
Tokenizing Search/regex Statistics
Extracting Why Text Segment from Web Based on Grammar-gram
Stance Classification of Ideological Debates
Natural Language Processing (NLP)
Presentation transcript:

Automatic Extraction of Hierarchical Relations from Text Ting Wang1, Yaoyong Li2, Kalina Bontcheva2, Hamish Cunningham2, Ji Wang1 Presented by Valentin Tablan2 1National University of Defense Technology, Changsha, China 2University of Sheffield, Sheffield, UK http://gate.ac.uk/ http://nlp.shef.ac.uk/

Outline Relation extraction and machine learning. ACE04 corpus for relation extraction. Experiments using the SVM and a variety of NLP features. Experimental results. ESWC 2006, Budva, Montenegro

Relation Extraction Two key stages for Information Extraction (IE) Identify the instances of information entities Recognise the relations among the entity instances Typical relations among named entities. Physical: located, near, part-whole Personal or social: business, family Membership: employer-stuff, member-of-group …… ESWC 2006, Budva, Montenegro

Machine Learning for Relation Extraction A number of learning algorithms have been applied on relation extraction E.g. HMM, CRF, MEM, and SVM Machine learning based system can be ported to other domains relatively easily. SVM using rich features achieved good results for relation extraction. ESWC 2006, Budva, Montenegro

Main contributions of This Paper Apply the SVM to extract relations in hierarchy. Investigate a variety of general NLP features for relation extraction. Evaluate several kernel types of the SVM for relation extraction. ESWC 2006, Budva, Montenegro

The ACE04 Corpus The ACE04 corpus was used in our experiments. It contains 452 documents annotated with named entity mentions and a set of relations among the entity mentions. There are 7 relation types and 23 subtypes. 5914 relation instances. ESWC 2006, Budva, Montenegro

ACE04 Entity Hierarchy Type Subtype Person (none) Organization Government, Commercial, Educational, Non-profit, … Facility Plant, Building, Subarea_Building, Bounded Area, … Location Address, Boundary, Celestial, Water_Body, Land_Region_natural, … Geo-political Entity Continent, Nation, State/Province, County/District, … Vehicle Air, Land, Water, Subarea_Vehicle, Other Weapon Blunt, Exploding, Sharp, Chemical, Biological, ESWC 2006, Budva, Montenegro

ACE04 Relation Hierarchy Type Subtype Physical Located, Near*, Part-Whole Personal/Social Business*, Family*, Other* Employment/Membership/Subsidiary Employ-Exec, Employ-Staff, Employ-Undetermined, Member-of-Group, Subsidiary, Partner*, Other* Agent-Artifact User/Owner, Inventor/Manufacturer, Other PER/ORG Affiliation Ethnic, Ideology, Other GPE Affiliation Citizen/Resident, Based-In, Other Discourse (none) ESWC 2006, Budva, Montenegro

Three Levels in Relation Hierarchy ESWC 2006, Budva, Montenegro

Relation Extraction Experiments Extract the pre-defined relations among the named entity mentions. Use the “true” named entity mentions and their types in the ACE04 corpus. Multi-class classification problem: whether or not any two entity mentions in a sentence have any of the predefined relations. ESWC 2006, Budva, Montenegro

Using SVM for Relation Extraction SVM is a popular learning algorithm. We used the one-against-one method to learn k(k-1)/2 binary SVM classifiers for the k-class classification. Use the max-win method to determine the class of a test example. ESWC 2006, Budva, Montenegro

NLP Features for Relation Extraction For one example (i.e. a pair of entities mentions in the same sentence) the features used were from the two mentions and the neighbouring words in the sentence. We used a variety of NLP features, produced by the GATE and third-party software (GATE plugins). All the features used were independent of the data’s domain. A total of 94 features. ESWC 2006, Budva, Montenegro

Simple Features Words. POS tags of words used. Tokens of two mentions and those surrounding them. POS tags of words used. Entity features provided in the ACE04 corpus. Including types and subtypes of two mentions. Overlap features. The relative position of the two mentions. ESWC 2006, Budva, Montenegro

Syntactic features Chunk features. whether two mentions are in the same NP or VP chunk. Dependency feature obtained from the MiniPar involving dependency relationship between the two mentions. Parse tree feature obtained from the BuChart. the lowest phrase labels of the two mentions, and the phrase labels in the path connecting two mentions in the parse tree. ESWC 2006, Budva, Montenegro

Semantic features From the SQLF produced by BuChart. WordNet. Paths of semantic labels from predicate to the heads of two mentions, respectively. Semantic labels of some important predicates. WordNet. The synsets of two entity mentions and the words surrounding them. For a word, pick up the most important synset which has a POS compatible with the word’s. ESWC 2006, Budva, Montenegro

Experimental Settings Use the SVM package LIBSVM for training and testing. Do 5-fold cross-validation on the ACE04 data for each experiment. Adopt Precision, Recall and F1 as evaluation measures. ESWC 2006, Budva, Montenegro

Contribution of Individual Features Precision(%) Recall(%) F1(%) Word 57.90 23.48 33.38 +POS Tag 57.63 26.91 36.66 +Entity 60.03 44.47 50.49 +Mention 61.03 45.60 52. 03 +Overlap 60.51 48.01 53. 52 +Chunk 61.46 48.46 54.19 +Dependency 63.07 48.26 54.67 +Parse Tree 63.57 48.58 55.06 +SQLF 63.74 48.92 55.34 + WordNet 67.53 48.98 56.78 ESWC 2006, Budva, Montenegro

Discussion Every feature we used has some contribution. Most features improved the recall of results. Word alone can achieve 0.33 F1. Entity mentions’ types and subtypes are most useful among other features. Some complicated features, such as chunk, dependency, parse tree, and SQLF, are not as useful as we expected. ESWC 2006, Budva, Montenegro

Results for Different Kernel Types Precision (%) Recall (%) F1(%) Linear 66.41 49.18 56.50 Quadratic 68.96 46.20 55.33 Cubic 71.31 42.39 53.17 Linear kernel had the best performance in F1, though not significantly better than the quadratic kernel. Linear kernel is fastest. ESWC 2006, Budva, Montenegro

Impact of the Hierarchy Hierarchical level Precision (%) Recall (%) F1(%) Relation detection (1 class) 73.87 69.50 71.59 Type classification (7 classes) 71.41 60.03 65.20 Subtype classification (23 classes) 67.53 48.98 56.78 ESWC 2006, Budva, Montenegro

Results on the Subtypes of EMP-ORG #examples Precision (%) Recall (%) F1 (%) Employ-Exec 630 71.37 63.90 67.16 Employ-Undetermined 129 68.76 43.23 51.20 Employ-Staff 694 64.39 60.97 62.25 Member-of-Group 225 62.16 38.55 46.85 Subsidiary 300 83.81 65.29 72.79 Partner 16 0.00 Other 90 33.33 5.89 9.90 ESWC 2006, Budva, Montenegro

Discussion Results were different for different subtypes. Strong connection with to the number of examples. Subtype recognition had lower results than type. Recognition for subtype is more difficult than for type (subtypes less distinct than types). Deeper features help more with fine-grained distinctions. ESWC 2006, Budva, Montenegro

Conclusions Investigated the SVM-based classification for relation extraction. Explored a diverse set of NLP features. The results obtained were encouraging. More work would be needed. Experiments of extracting relations based on the named entities recognised by IE system. Evaluate the system on large-scale ontology and data. ESWC 2006, Budva, Montenegro

Thank you! More information: http://gate.ac.uk http://nlp.shef.ac.uk ESWC 2006, Budva, Montenegro