Easy-First Coreference Resolution Veselin Stoyanov and Jason Eisner Johns Hopkins University.

Slides:



Advertisements
Similar presentations
Multi-Document Person Name Resolution Michael Ben Fleischman (MIT), Eduard Hovy (USC) From Proceedings of ACL-42 Reference Resolution workshop 2004.
Advertisements

Unsupervised Learning Clustering K-Means. Recall: Key Components of Intelligent Agents Representation Language: Graph, Bayes Nets, Linear functions Inference.
Ch. Eick: More on Machine Learning & Neural Networks Different Forms of Learning: –Learning agent receives feedback with respect to its actions (e.g. using.
Perceptron Learning Rule
Albert Gatt Corpora and Statistical Methods Lecture 13.
Integrated Instance- and Class- based Generative Modeling for Text Classification Antti PuurulaUniversity of Waikato Sung-Hyon MyaengKAIST 5/12/2013 Australasian.
A Machine Learning Approach to Coreference Resolution of Noun Phrases By W.M.Soon, H.T.Ng, D.C.Y.Lim Presented by Iman Sen.
Imbalanced data David Kauchak CS 451 – Fall 2013.
A Corpus for Cross- Document Co-Reference D. Day 1, J. Hitzeman 1, M. Wick 2, K. Crouch 1 and M. Poesio 3 1 The MITRE Corporation 2 University of Massachusetts,
Structured SVM Chen-Tse Tsai and Siddharth Gupta.
1 Relational Learning of Pattern-Match Rules for Information Extraction Presentation by Tim Chartrand of A paper bypaper Mary Elaine Califf and Raymond.
NetSci07 May 24, 2007 Entity Resolution in Network Data Lise Getoor University of Maryland, College Park.
Robust Textual Inference via Graph Matching Aria Haghighi Andrew Ng Christopher Manning.
The University of Wisconsin-Madison Universal Morphological Analysis using Structured Nearest Neighbor Prediction Young-Bum Kim, João V. Graça, and Benjamin.
Re-ranking for NP-Chunking: Maximum-Entropy Framework By: Mona Vajihollahi.
Prénom Nom Document Analysis: Linear Discrimination Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Discriminative Learning of Extraction Sets for Machine Translation John DeNero and Dan Klein UC Berkeley TexPoint fonts used in EMF. Read the TexPoint.
A Deterministic Co-reference System with Rich Syntactic Features and Semantic Knowledge Heeyoung Lee & Sudarshan Rangarajan Collaborators : Karthik Raghunathan.
1 Jun Wang, 2 Sanjiv Kumar, and 1 Shih-Fu Chang 1 Columbia University, New York, USA 2 Google Research, New York, USA Sequential Projection Learning for.
An Illustrative Example
Supervised models for coreference resolution Altaf Rahman and Vincent Ng Human Language Technology Research Institute University of Texas at Dallas 1.
Minimum Error Rate Training in Statistical Machine Translation By: Franz Och, 2003 Presented By: Anna Tinnemore, 2006.
Improving Machine Learning Approaches to Coreference Resolution Vincent Ng and Claire Cardie Cornell Univ. ACL 2002 slides prepared by Ralph Grishman.
SI485i : NLP Set 14 Reference Resolution. 2 Kraken, also called the Crab-fish, which is not that huge, for heads and tails counted, he is no larger than.
A Discriminative Latent Variable Model for Online Clustering Rajhans Samdani, Kai-Wei Chang, Dan Roth Department of Computer Science University of Illinois.
Webpage Understanding: an Integrated Approach
A Global Relaxation Labeling Approach to Coreference Resolution Coling 2010 Emili Sapena, Llu´ıs Padr´o and Jordi Turmo TALP Research Center Universitat.
Active Sampling for Entity Matching Aditya Parameswaran Stanford University Jointly with: Kedar Bellare, Suresh Iyengar, Vibhor Rastogi Yahoo! Research.
Andreea Bodnari, 1 Peter Szolovits, 1 Ozlem Uzuner 2 1 MIT, CSAIL, Cambridge, MA, USA 2 Department of Information Studies, University at Albany SUNY, Albany,
Tree Kernels for Parsing: (Collins & Duffy, 2001) Advanced Statistical Methods in NLP Ling 572 February 28, 2012.
1 Data-Driven Dependency Parsing. 2 Background: Natural Language Parsing Syntactic analysis String to (tree) structure He likes fish S NP VP NP VNPrn.
Dual Coordinate Descent Algorithms for Efficient Large Margin Structured Prediction Ming-Wei Chang and Scott Wen-tau Yih Microsoft Research 1.
Illinois-Coref: The UI System in the CoNLL-2012 Shared Task Kai-Wei Chang, Rajhans Samdani, Alla Rozovskaya, Mark Sammons, and Dan Roth Supported by ARL,
Ling 570 Day 17: Named Entity Recognition Chunking.
Automatic Detection of Tags for Political Blogs Khairun-nisa Hassanali Vasileios Hatzivassiloglou The University.
Incorporating Extra-linguistic Information into Reference Resolution in Collaborative Task Dialogue Ryu Iida Shumpei Kobayashi Takenobu Tokunaga Tokyo.
Basic Machine Learning: Clustering CS 315 – Web Search and Data Mining 1.
I2B2 Shared Task 2011 Coreference Resolution in Clinical Text David Hinote Carlos Ramirez.
A Cross-Lingual ILP Solution to Zero Anaphora Resolution Ryu Iida & Massimo Poesio (ACL-HLT 2011)
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Coarse-to-Fine Efficient Viterbi Parsing Nathan Bodenstab OGI RPE Presentation May 8, 2006.
An Entity-Mention Model for Coreference Resolution with Inductive Logic Programming Xiaofeng Yang 1 Jian Su 1 Jun Lang 2 Chew Lim Tan 3 Ting Liu 2 Sheng.
First-Order Probabilistic Models for Coreference Resolution Aron Culotta Computer Science Department University of Massachusetts Amherst Joint work with.
1 Toward Opinion Summarization: Linking the Sources Veselin Stoyanov and Claire Cardie Department of Computer Science Cornell University Ithaca, NY 14850,
Hierarchical Clustering for POS Tagging of the Indonesian Language Derry Tanti Wijaya and Stéphane Bressan.
Improving Named Entity Translation Combining Phonetic and Semantic Similarities Fei Huang, Stephan Vogel, Alex Waibel Language Technologies Institute School.
CoCQA : Co-Training Over Questions and Answers with an Application to Predicting Question Subjectivity Orientation Baoli Li, Yandong Liu, and Eugene Agichtein.
Inference Protocols for Coreference Resolution Kai-Wei Chang, Rajhans Samdani, Alla Rozovskaya, Nick Rizzolo, Mark Sammons, and Dan Roth This research.
Learning to Detect Faces A Large-Scale Application of Machine Learning (This material is not in the text: for further information see the paper by P.
Support Vector Machines and Kernel Methods for Co-Reference Resolution 2007 Summer Workshop on Human Language Technology Center for Language and Speech.
1 Minimum Error Rate Training in Statistical Machine Translation Franz Josef Och Information Sciences Institute University of Southern California ACL 2003.
Multilingual Information Retrieval using GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of Kaohsiung.
Probabilistic Equational Reasoning Arthur Kantor
Learning to Rank: From Pairwise Approach to Listwise Approach Authors: Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li Presenter: Davidson Date:
Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!
Dan Roth University of Illinois, Urbana-Champaign 7 Sequential Models Tutorial on Machine Learning in Natural.
Pattern Recognition Lecture 20: Neural Networks 3 Dr. Richard Spillman Pacific Lutheran University.
Automatically Labeled Data Generation for Large Scale Event Extraction
Simone Paolo Ponzetto University of Heidelberg Massimo Poesio
Erasmus University Rotterdam
NYU Coreference CSCI-GA.2591 Ralph Grishman.
(Entity and) Event Extraction CSCI-GA.2591
Lecture 9: Entity Resolution
Clustering Algorithms for Noun Phrase Coreference Resolution
A Machine Learning Approach to Coreference Resolution of Noun Phrases
An Interactive Approach to Collectively Resolving URI Coreference
Chapter 8: Generalization and Function Approximation
A Machine Learning Approach to Coreference Resolution of Noun Phrases
Authors: Wai Lam and Kon Fan Low Announcer: Kyu-Baek Hwang
Statistical NLP Spring 2011
Presentation transcript:

Easy-First Coreference Resolution Veselin Stoyanov and Jason Eisner Johns Hopkins University

Coreference Resolution Given a peace of text, identify all noun phrases that refer to the same real-world entity

Coreference Resolution Given a peace of text, identify all noun phrases that refer to the same real-world entity Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Coreference Resolution Given a peace of text, identify all noun phrases that refer to the same real-world entity Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Coreference Resolution Given a peace of text, identify all noun phrases that refer to the same real-world entity Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Coreference Resolution Typical pairwise approaches But, inherently global task Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Coreference Resolution Inherently global task Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Coreference Resolution Inherently global task Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Global Approaches to Coreference Resolution A difficult search problem – Exponentially many possible clusterings Recent approaches have been successful: – [Culotta et al., 2007] – train a global coherence measure; use greedy inference at test time – [Poon and Domingos, 2008] – use an unsupervised approach based on MLNs to enforce global properties

A Different Approach A Multi-Pass Sieve [Raghunathan et al., 2010] – Build coreference clusters incrementally – Applying sieves (hand-specified rules) of decreasing precision and increasing recall – Winner of the CoNLL-2011 shared task

A Multi-Pass Sieve for Coreference Resolution [Raghunathan et al. 2010] Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

A Multi-Pass Sieve for Coreference Resolution [Raghunathan et al. 2010] Sieves: 1.Exact match Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

A Multi-Pass Sieve for Coreference Resolution [Raghunathan et al. 2010] Sieves: 1.Exact match 2.Precise Constructs Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

A Multi-Pass Sieve for Coreference Resolution [Raghunathan et al. 2010] Sieves: 1.Exact match 2.Precise Constructs Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

A Multi-Pass Sieve for Coreference Resolution [Raghunathan et al. 2010] Sieves: 1.Exact match 2.Precise Constructs 3.Strict Head match Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

A Multi-Pass Sieve for Coreference Resolution [Raghunathan et al. 2010] Sieves: 1.Exact match 2.Precise Constructs 3.Strict Head match Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

A Multi-Pass Sieve for Coreference Resolution [Raghunathan et al. 2010] Sieves: 1.Exact match 2.Precise Constructs 3.Strict Head match …………. 7. Pronoun Matching Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

A Multi-Pass Sieve for Coreference Resolution [Raghunathan et al. 2010] Sieves: 1.Exact match 2.Precise Constructs 3.Strict Head match …………. 7. Pronoun Matching Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Easy-First Coreference Resolution Use the same idea: – Greedily build clusters – Using more information for the less confident decisions But learn automatically what constitutes an “easy” (confident) decision – Leads to empirical improvement over the Multi- Pass Sieve approach and a competitive baseline

Easy-First Coreference Resolution Incrementally and greedily build clusters – Using machine learning Learn policy for the test time inference procedure

Easy-First Coreference: Talk Overview Test-Time Inference Training Algorithm Experimental results

TEST-TIME INFERENCE

Test-Time Inference Begin with each NP in a separate cluster Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star. Cluster 1 SemType: uknwn Gender: uknwn Number: single Names: NIL Cluster 2 SemType: person Gender: male Number: single Names: Mike Tyson Cluster 3 SemType: person Gender: male Number: single Names: NIL Cluster 3 SemType: person Gender: male Number: single Names: NIL Begin with each NP in a separate cluster

Compute a feature vector for each pair of clusters Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star. Cluster 1 SemType: uknwn Gender: uknwn Number: single Names: NIL Cluster 2 SemType: person Gender: male Number: single Names: Mike Tyson

Compute a feature vector for each pair of clusters Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star. Cluster 1 SemType: uknwn Gender: uknwn Number: single Names: NIL Cluster 2 SemType: person Gender: male Number: single Names: Mike Tyson SemTypeSame = False

Compute a feature vector for each pair of clusters Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star. Cluster 1 SemType: uknwn Gender: uknwn Number: single Names: NIL Cluster 2 SemType: person Gender: male Number: single Names: Mike Tyson SemTypeSame = False SemTypeComp = True

Compute a feature vector for each pair of clusters Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star. Cluster 1 SemType: uknwn Gender: uknwn Number: single Names: NIL Cluster 2 SemType: person Gender: male Number: single Names: Mike Tyson SemTypeSame = False SemTypeComp = True NumberSame = True

Compute a feature vector for each pair of clusters Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star. Cluster 1 SemType: uknwn Gender: uknwn Number: single Names: NIL Cluster 2 SemType: person Gender: male Number: single Names: Mike Tyson SemTypeSame = False SemTypeComp = True NumberSame = True RoleAppositive = True

Compute a feature vector for each pair of clusters Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star. Cluster 1 SemType: uknwn Gender: uknwn Number: single Names: NIL Cluster 2 SemType: person Gender: male Number: single Names: Mike Tyson SemTypeSame = False SemTypeComp = True NumberSame = True RoleAppositive = True SentDistance = 0

Compute a feature vector for each pair of clusters Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star. Cluster 1 SemType: uknwn Gender: uknwn Number: single Names: NIL Cluster 2 SemType: person Gender: male Number: single Names: Mike Tyson SemTypeSame = False SemTypeComp = True NumberSame = True RoleAppositive = True SentDistance = 0 Global features Local features

Compute a feature vector for each pair of clusters Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star. Cluster 1 SemType: uknwn Gender: uknwn Number: single Names: NIL Cluster 2 SemType: person Gender: male Number: single Names: Mike Tyson f(join ij )

Compute a weight for each join action Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star. Cluster 1 SemType: uknwn Gender: uknwn Number: single Names: NIL Cluster 2 SemType: person Gender: male Number: single Names: Mike Tyson w.f(join ij )

Compute a weight for each join action Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star. Cluster 1 SemType: uknwn Gender: uknwn Number: single Names: NIL Cluster 2 SemType: person Gender: male Number: single Names: Mike Tyson w.f(join ij ) weight ij =

Perform the action with the highest weight Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star. Cluster 1 SemType: uknwn Gender: uknwn Number: single Names: NIL Cluster 2 SemType: person Gender: male Number: single Names: Mike Tyson w.f(join ij ) a ij =argmax ij

Perform the action with the highest weight Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star. Cluster 1 SemType: uknwn Gender: uknwn Number: single Names: NIL Cluster 2 SemType: person Gender: male Number: single Names: Mike Tyson w.f(join ij ) a 12 =argmax ij

Perform the action with the highest weight Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star. Cluster 15 SemType: person Gender: male Number: single Names: Mike Tyson w.f(join ij ) a 12 =argmax ij

Perform the action with the highest weight Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Perform the action with the highest weight Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Perform the action with the highest weight Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Perform the action with the highest weight Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Perform the action with the highest weight Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Perform the action with the highest weight Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Perform the action with the highest weight Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Perform the action with the highest weight Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Perform the action with the highest weight Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Perform the action with the highest weight Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Perform the action with the highest weight Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Until the action with the highest weight is the special action HALT Test-Time Inference Former Boxing Champion Mike Tyson revealed that he had caught Hollywood heartthrob Brad Pitt with his ex-wife Robin Givens. In an interview Tyson said that he was amidst the process of divorce from Givens in the late 1980s when he found her sleeping with the Hollywood star.

Time complexity Let n = the number of mentions in a document At most n clusters at any time s = number of joins performed by the algorithm s<=n at most n joins before HALT (usually much less)

Time complexity Each step requires: – Popping the top action -- O(logn) (priority queue) – Performing the join – O(1) – Deleting O(n) old actions – O(nlogn) – Computing scores for O(n) new actions -- O(n) – Inserting those -- O(nlogn) or O(n) Total runtime = O(n 2 logn)+O(snlogn)

TRAINING ALGORITHM

Learning Learn weight vector w that leads the Easy-First algorithm to good clusterings At each step we are given a state s and a set of possible actions {join ij } ∪ {HALT} We want to learn what is “the best” action in each state But the current state s depends on the action taken so far

Reinforcement Learning A classic reinforcement learning setting Act in the world and learn “good” actions in each state We tried several reinforcement learning algorithms – Policy gradient [Sutton et al., 2000] – Q-learning [Watkins and Dayan, 1992] – No success: distant supervision; too many possible states

Perceptron-Style Training A version of the early update structured perceptron [Collins and Roark, 2004] – Inspired by the success of easy-first parsing [Goldberg and Elhadad, 2010] The algorithm: – Keep acting in the world until the next action will result in an error (reduction in score) – Update weights away from the top action and towards the highest scoring positive action

Training algorithm Perform N iterations: – For each document d in the training set do: Initialize C so that each mention in its own cluster Initialize A = {join ij } ∪ {HALT} repeat until a max != HALT: – for k steps do: » a max =A.peekMax() » if a max.isPositive() break » a pos =A.peekMaxPositive() » w+=(feat(a pos,C)-feat(a max,C)) » Recompute scores in A using new w update weight

Training algorithm Perform N iterations: – For each document d in the training set do: Initialize C so that each mention in its own cluster Initialize A = {join ij } ∩ {HALT} repeat until a max != HALT: – Update weights – a max =A.popMax() – if a max is a join » perform a max

EXPERIMENTAL EVALUATION

Experimental Setup Five datasets: – MUC6, MUC7, ACE03, ACE04, ACE05 Two evaluation metrics: – MUC score – B^3 score Implemented in the Reconcile research platform [Stoyanov et al. 2010] – Code available upon request Two settings: – Automatic mentions – Gold (manual) mentions

Baselines A pairwise baseline [Stoyanov et al. 2009] Reimplementation of the pairwise baseline using different features Easy-First coreference resolution – Easy-First percep – weights from the pairwise baseline – Easy-First struct – weights from our training algorithm Only for the gold mentions data – Entity-centered approach [Haghighi and Klein, 2009] – Multi-pass sieve [Raghunathan et al., 2010]

Results: Automatic Mentions MUC6MUC7ACE03ACE04ACE05 B^3 MUCB^3 MUCB^3 MUCB^3 MUCB^3 MUC Reconcile

Results: Automatic Mentions MUC6MUC7ACE03ACE04ACE05 B^3 MUCB^3 MUCB^3 MUCB^3 MUCB^3 MUC Reconcile Pairwise baseline

Results: Automatic Mentions MUC6MUC7ACE03ACE04ACE05 B^3 MUCB^3 MUCB^3 MUCB^3 MUCB^3 MUC Reconcile Pairwise baseline

Results: Automatic Mentions MUC6MUC7ACE03ACE04ACE05 B^3 MUCB^3 MUCB^3 MUCB^3 MUCB^3 MUC Reconcile Pairwise baseline

Results: Automatic Mentions MUC6MUC7ACE03ACE04ACE05 B^3 MUCB^3 MUCB^3 MUCB^3 MUCB^3 MUC Reconcile Pairwise baseline Easy-First percep

Results: Automatic Mentions MUC6MUC7ACE03ACE04ACE05 B^3 MUCB^3 MUCB^3 MUCB^3 MUCB^3 MUC Reconcile Pairwise baseline Easy-First percep Easy-First struct

Results: Automatic Mentions MUC6MUC7ACE03ACE04ACE05 B^3 MUCB^3 MUCB^3 MUCB^3 MUCB^3 MUC Reconcile Pairwise baseline Easy-First percep Easy-First struct

Results: Automatic Mentions MUC6MUC7ACE03ACE04ACE05 B^3 MUCB^3 MUCB^3 MUCB^3 MUCB^3 MUC Reconcile Pairwise baseline Easy-First percep Easy-First struct

Results: Gold Mentions MUC6ACE04 B^3MUCB^3MUC Haghighi and Klein Raghunathan et al

Results: Gold Mentions MUC6ACE04 B^3MUCB^3MUC Haghighi and Klein Raghunathan et al Stoyanov et al

Results: Gold Mentions MUC6ACE04 B^3MUCB^3MUC Haghighi and Klein Raghunathan et al Stoyanov et al

Results: Gold Mentions MUC6ACE04 B^3MUCB^3MUC Haghighi and Klein Raghunathan et al Stoyanov et al

Results: Gold Mentions MUC6ACE04 B^3MUCB^3MUC Haghighi and Klein Raghunathan et al Stoyanov et al Pairwise baseline

Results: Gold Mentions MUC6ACE04 B^3MUCB^3MUC Haghighi and Klein Raghunathan et al Stoyanov et al Pairwise baseline

Results: Gold Mentions MUC6ACE04 B^3MUCB^3MUC Haghighi and Klein Raghunathan et al Stoyanov et al Pairwise baseline Easy-First percep

Results: Gold Mentions MUC6ACE04 B^3MUCB^3MUC Haghighi and Klein Raghunathan et al Stoyanov et al Pairwise baseline Easy-First percep Easy-First struct

Conclusions A new approach to coreference using greedy incremental clustering – More accurate clustering by utilizing global information as it becomes available for less confident decisions Learning policy for the test-time inference procedure leads to improvements

THANK YOU!