Presentation is loading. Please wait.

Presentation is loading. Please wait.

LIBERAl EVENT EXTRACTION, zero-shot learning and Error analysis

Similar presentations


Presentation on theme: "LIBERAl EVENT EXTRACTION, zero-shot learning and Error analysis"— Presentation transcript:

1 LIBERAl EVENT EXTRACTION, zero-shot learning and Error analysis
Heng Ji February 22, 2018

2 Basic Idea and Hypotheses
Principle: Achieve both merits of traditional IE (high quality) and Open IE (Wu and Weld, 2010; Mausam et al., 2012; Bovi et al., 2015) (high scalability) Key Idea: Bottom-up discovery instead of top-down classification Embrace Symbolic and Distributional knowledge resources and background knowledge to represent knowledge elements Cluster semantically similar knowledge elements and clusters into ontologies based on semantic representations Propose a new common semantic space to share representation, knowledge and resources across types, languages and data modalities Input: an arbitrary corpus, without any supervision, without human-defined schema or manual annotations, restriction or prior knowledge on its size, topic, or domain Output: Simultaneously discover a customizable schema and extract knowledge elements “Cold-start”: can be easily adapted to a new domain, genre or language Conservative kbp: dying.

3 How to Cluster Words? “attack” “end-position”
Distributional Semantics Theory: a word is characterized by the company it keeps (Harris, 1954)  represent each word as a vector distribution of its contextual words But let’s distinguish senses of an ambiguous word Word Sense based General Embeddings Fire-1 Score firing cannon grenades grenade gun arm explosive point-blank projectile misfired 0.829 0.774 0.767 0.760 0.757 0.755 0.742 0.740 0.748 0.729 Fire-2 Score rehired hire-1 resign-1 rehire sacked quit-1 sack-1 quits hiring dismissal 0.790 0.626 0.618 0.596 0.591 0.565 0.563 0.551 0.546 Injure-1 Score injures hurt harm maim injuring endanger dislocate kill injured ravage skyscrapers harming reinjure 0.602 0.593 0.592 0.571 0.561 0.543 0.529 0.527 0.516 0.502 0.501 0.497 “attack” “end-position” 3

4 But Local Context Matters
Borrow meanings from related concepts in local contexts  Context-specific Embeddings But not all words are related: Use Abstract Meaning Representation (AMR) (Banarescu et al., 2013) (150 semantic relations) to choose related concepts Use Tensor based Recursive Auto Encoder to compose embeddings of related concepts Life-die? No 4

5 Also Need External Knowledge
Election Leader of conservative New Democracy party Antonis Samaras is cheered by supporters after his statement on the election results in Athens June 17, 2012. Turkey’s Foreign Minister Ahmet Davutoglu greets his supports during an election rally of his ruling AK Party in Konya, central Turkey, March 28, 2014. DBPedia: Antonis Samarás Properties: Party, profession, religion, occupation, successor, residence … Type Labels: Person, Agent, Politician, President … DBPedia: New Democracy Party Properties: Country, headquarter, president, logo, founder… Type Labels: Organization, PoliticalParty, SocialGroup … DBPedia: Justice and Development Party Properties: Country, headquarter, leaderName, position … Type Labels: Organization, PoliticalParty, Agent … DBPedia: Ahmet Davutoğlu Properties: birthDate, birthPlace, deputy, party, president, successor, religion … Type Labels: Person, Agent, Politician, Leader, Writer, President, Minister … 5

6 Assign a Type to Each Mention
Event Semantics Theory (Pustejovsky, 1991): the semantics of an event structure can be generalized and mapped to event mention structures in a systematic and predictable way Borrow Zero Shot Transfer Learning from Vision (Frome et al., 2013; Norouzi et al., 2013; Socher et al., 2013) Learning a regression function between object (e.g., image, entity) semantic space and label semantic space based on annotated data for seen labels; The regression model can be used to predict the unseen labels for any given image; 6

7 Grounding instead of Classification
Modeling IE as a grounding problem instead of classification The mapping function for grounding is independent of event types, so we can re-use resources of seen types for new types, without any additional annotations Iranian state television stated that the conflict between the Iranian police and the drug smugglers took place near the town of mirjaveh. The Government of China has ruled Tibet since 1951 after dispatching troops to the Himalayan region in 1950. Learning a regression function between object (e.g., image, entity) semantic space and label semantic space based on annotated data for seen labels; The regression model can be used to predict the unseen labels for any given image; 0.1 7

8 Common Semantic Space between Mentions and Types, between Seen Types and Unseen Types
8

9 Approach Details Joint Event Mention and Type Label Embedding
Mapping between event mention structure and type structure t: a specific trigger y: the annotated event type y’: the top-ranked event type Y: the type set from the target ontology Y’: the seen type set m: margin value Ct,y: the Cosine Similarity between t and y Joint Event Argument and Role Embedding Mapping between argument and role path: Argument path: e.g., dispatch01->:Arg0->China Role path: Transport_person -> Agent Learn path representations using two weight-sharing CNN; Loss function: Consider to remove; Take all tuples as input and learn structural representation for each event mention and type based on two weight-sharing Convolutional Neural Networks

10 Comparison with Traditional Supervised Model
10 popular “seen” event types and 23 new “unseen” types Supervised Model: train and test on unseen types Zero-shot: no training data for unseen types Without any annotated mentions, our approach can achieve performance comparable to that of LSTM, which is trained on 3,000 sentences with 500 annotated event mentions (Huang et al., 2018) 10

11 Move to any New Domain Discovered schemas (Huang et al., 2016)
Move to a new domain (biomedical literature): 83.1% precision 11

12 Event Trigger Identification Missing Errors
OOV this morning in Michigan, a second straight night and into this morning, hundreds of people have been rioting [attack] in Benton harbor. Scene understanding This was the Italian ship that was taken -- that was captured [transfer- ownership] by Palestinian terrorists back in 1985 and some may remember the story of Leon clinghover, he was in a cheal chair and the terrorists shot him and pushed him over the side of the ship into the Mediterranean where he obviously, died. The gloved one claims the label has been releasing new albums and Jackson five merchandise without giving [transfer-money] him "a single dollar." Airlines are getting [transport] flyers to destinations ontime more often. Three Young Boys,ages 2, 5 and 10 survived and are in critical [injure] condition after spending in 18 hours in the cold. Two most popular previous event representations Inheritance is a way to describe the roadmap of event hierarchy; the property of the child is inherited from its parent 12 12

13 Event Trigger Identification Spurious Errors
Lara hijacking in 1995 has been captured [arrest-jail] by U.S. forces in or near Baghdad. I want to take [transport] this opportunity to stand behind the Mimi and proclaim my solidarity. But when we moved [transport] in closer and started to scan the area, then I saw the 2-year-old waving from next to the airplane. He's left [transport] a lot on the table. Barbara, Bill Bennet's Glam gambling loss [die] changed my opinion. Stewart has found the road to fortune wherever she has traveled [transport]. And it's hard to win back that sort of brand equity that she's lost [end-position]. Police measure the spot where ten months old Miana Williams landed after she was thrown [transport] from a seventh floor window at this apartment building. Reason, not result: The baby fell [die] 80 feet. These tree branches cushioned her fall [die] and saved her life. Two most popular previous event representations Inheritance is a way to describe the roadmap of event hierarchy; the property of the child is inherited from its parent 13 13

14 Event Trigger Classification Errors
a man in New York is facing attempted murder charges after allegedly throwing [transportattack] his baby seven stories to the ground below. He's also being fined [transfer-money  fine] $3 million and was ordered to pay $1.2 million in restitution to the New York State Tax Commission. ``They admitted to having nuclear capability and weapons at this moment,'' said Rep. Curt Weldon, who headed a delegation of U.S. lawmakers that visited [meettransport] Pyongyang for three days ending Sunday. Blasphemy is punishable by death [die  execute] under the Pakistan Penal Code. We discussed [phone-write  meet] the Middle East peace process. Two most popular previous event representations Inheritance is a way to describe the roadmap of event hierarchy; the property of the child is inherited from its parent 14 14

15 Event Argument Identification Missing Errors
Scene understanding: This was the Italian ship that was taken -- that was captured by Palestinian terrorists back in 1985 [die_time] and some may remember the story of Leon clinghover, he was in a cheal chair and the terrorists shot him and pushed him over the side of the ship into the Mediterranean [die_place] where he [die_victim] obviously, died. She [die_victim] was going to fight to the death. Then police say the baby's mother pulled out a kitchen knife [die_instrument] opinion on the 911 tape you can hear Williams tape say "go ahead kill me." Coreference: another story out of Belgrade [attack_place], violence at the highest form. Unusual syntactic structure: Two f-14 Tomcats struck the targets, the same area was a site [attack_place] of heavy bombing yesterday. The EU foreign ministers met hours after U.S. President George W. Bush gave Saddam [attack_target] 48 hours to leave Iraq or face invasion. Two most popular previous event representations Inheritance is a way to describe the roadmap of event hierarchy; the property of the child is inherited from its parent 15 15

16 Event Argument Identification Spurious Errors
He's been in Libya and he's been living under the protection of Saddam Hussein in Baghdad [die_place], but he is wanted for murder in Italy Italy. The EU foreign ministers met hours after U.S. President George W. Bush [meet_entity] gave Saddam 48 hours to leave Iraq or face invasion. Two most popular previous event representations Inheritance is a way to describe the roadmap of event hierarchy; the property of the child is inherited from its parent 16 16

17 Event Argument Role Classification Errors
His private plane [vehicle  artifact] arrived at Heathrow Airport. a cruise ship [instrument  target] is being searched off the Hawaii coast after two notes threatening terrorist attack were found on board. Two most popular previous event representations Inheritance is a way to describe the roadmap of event hierarchy; the property of the child is inherited from its parent 17 17


Download ppt "LIBERAl EVENT EXTRACTION, zero-shot learning and Error analysis"

Similar presentations


Ads by Google