Exploiting Timelines to Enhance Multi-document Summarization Jun-Ping Ng, Yan Chen, Min-Yen Kan and Zhoujun Li National University of Singapore Beihang.

Slides:



Advertisements
Similar presentations
Product Review Summarization Ly Duy Khang. Outline 1.Motivation 2.Problem statement 3.Related works 4.Baseline 5.Discussion.
Advertisements

Specialized models and ranking for coreference resolution Pascal Denis ALPAGE Project Team INRIA Rocquencourt F Le Chesnay, France Jason Baldridge.
Automatic summarization Dragomir R. Radev University of Michigan
Automatically Evaluating Text Coherence Using Discourse Relations Ziheng Lin, Hwee Tou Ng and Min-Yen Kan Department of Computer Science National University.
Exploiting Timelines to Enhance Multi-document Summarization Jun-Ping Ng, Yan Chen, Min-Yen Kan, Zhoujun Li DSO National Laboratories National University.
Recovering Human Body Configurations: Combining Segmentation and Recognition Greg Mori, Xiaofeng Ren, and Jitentendra Malik (UC Berkeley) Alexei A. Efros.
An Introduction of Support Vector Machine
Baselines for Recognizing Textual Entailment Ling 541 Final Project Terrence Szymanski.
NYU ANLP-00 1 Automatic Discovery of Scenario-Level Patterns for Information Extraction Roman Yangarber Ralph Grishman Pasi Tapanainen Silja Huttunen.
Patch to the Future: Unsupervised Visual Prediction
Recognizing Implicit Discourse Relations in the Penn Discourse Treebank Ziheng Lin, Min-Yen Kan, and Hwee Tou Ng Department of Computer Science National.
LEDIR : An Unsupervised Algorithm for Learning Directionality of Inference Rules Advisor: Hsin-His Chen Reporter: Chi-Hsin Yu Date: From EMNLP.
Cascaded Filtering For Biometric Identification Using Random Projection Atif Iqbal.
Spatio-Temporal Outlier Detection in Precipitation Data
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
Predicting Text Quality for Scientific Articles AAAI/SIGART-11 Doctoral Consortium Annie Louis : Louis A. and Nenkova A Automatically.
Search and Retrieval: More on Term Weighting and Document Ranking Prof. Marti Hearst SIMS 202, Lecture 22.
The use of unlabeled data to improve supervised learning for text summarization MR Amini, P Gallinari (SIGIR 2002) Slides prepared by Jon Elsas for the.
Latent Dirichlet Allocation a generative model for text
EVENT IDENTIFICATION IN SOCIAL MEDIA Hila Becker, Luis Gravano Mor Naaman Columbia University Rutgers University.
Information Retrieval in Practice
Query session guided multi- document summarization THESIS PRESENTATION BY TAL BAUMEL ADVISOR: PROF. MICHAEL ELHADAD.
Temporal Event Map Construction For Event Search Qing Li Department of Computer Science City University of Hong Kong.
Rui Yan, Yan Zhang Peking University
Slide Image Retrieval: A Preliminary Study Guo Min Liew and Min-Yen Kan National University of Singapore Web IR / NLP Group (WING)
Richard Socher Cliff Chiung-Yu Lin Andrew Y. Ng Christopher D. Manning
Learning Information Extraction Patterns Using WordNet Mark Stevenson and Mark A. Greenwood Natural Language Processing Group University of Sheffield,
Estimating Importance Features for Fact Mining (With a Case Study in Biography Mining) Sisay Fissaha Adafre School of Computing Dublin City University.
A Comparative Study of Search Result Diversification Methods Wei Zheng and Hui Fang University of Delaware, Newark DE 19716, USA
Processing of large document collections Part 7 (Text summarization: multi- document summarization, knowledge- rich approaches, current topics) Helena.
Generic text summarization using relevance measure and latent semantic analysis Gong Yihong and Xin Liu SIGIR, April 2015 Yubin Lim.
Efficiently Computed Lexical Chains As an Intermediate Representation for Automatic Text Summarization H.G. Silber and K.F. McCoy University of Delaware.
Domain-Specific Iterative Readability Computation Jin Zhao 13/05/2011.
Summarizing Conversations with Clue Words Giuseppe Carenini Raymond T. Ng Xiaodong Zhou Department of Computer Science Univ. of British Columbia.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Extracting meaningful labels for WEBSOM text archives Advisor.
Improving Web Search Results Using Affinity Graph Benyu Zhang, Hua Li, Yi Liu, Lei Ji, Wensi Xi, Weiguo Fan, Zheng Chen, Wei-Ying Ma Microsoft Research.
A Novel Local Patch Framework for Fixing Supervised Learning Models Yilei Wang 1, Bingzheng Wei 2, Jun Yan 2, Yang Hu 2, Zhi-Hong Deng 1, Zheng Chen 2.
Wikipedia as Sense Inventory to Improve Diversity in Web Search Results Celina SantamariaJulio GonzaloJavier Artiles nlp.uned.es UNED,c/Juan del Rosal,
Mining Binary Constraints in Feature Models: A Classification-based Approach Yi Li.
LOGO Summarizing Conversations with Clue Words Giuseppe Carenini, Raymond T. Ng, Xiaodong Zhou (WWW ’07) Advisor : Dr. Koh Jia-Ling Speaker : Tu.
Semantic v.s. Positions: Utilizing Balanced Proximity in Language Model Smoothing for Information Retrieval Rui Yan†, ♮, Han Jiang†, ♮, Mirella Lapata‡,
Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.
2015/12/121 Extracting Key Terms From Noisy and Multi-theme Documents Maria Grineva, Maxim Grinev and Dmitry Lizorkin Proceeding of the 18th International.
August 17, 2005Question Answering Passage Retrieval Using Dependency Parsing 1/28 Question Answering Passage Retrieval Using Dependency Parsing Hang Cui.
Inference Protocols for Coreference Resolution Kai-Wei Chang, Rajhans Samdani, Alla Rozovskaya, Nick Rizzolo, Mark Sammons, and Dan Roth This research.
Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick.
1 A Biterm Topic Model for Short Texts Xiaohui Yan, Jiafeng Guo, Yanyan Lan, Xueqi Cheng Institute of Computing Technology, Chinese Academy of Sciences.
Threshold Setting and Performance Monitoring for Novel Text Mining Wenyin Tang and Flora S. Tsai School of Electrical and Electronic Engineering Nanyang.
Semantic Search An Active Approach to Searching. Why Semantic Search? A better question is – why not? We are interested in what the document means, not.
Probabilistic Text Structuring: Experiments with Sentence Ordering Mirella Lapata Department of Computer Science University of Sheffield, UK (ACL 2003)
1 Centroid Based multi-document summarization: Efficient sentence extraction method Presenter: Chen Yi-Ting.
Acquisition of Categorized Named Entities for Web Search Marius Pasca Google Inc. from Conference on Information and Knowledge Management (CIKM) ’04.
Finding document topics for improving topic segmentation Source: ACL2007 Authors: Olivier Ferret (18 route du Panorama, BP6) Reporter:Yong-Xiang Chen.
A Novel Relational Learning-to- Rank Approach for Topic-focused Multi-Document Summarization Yadong Zhu, Yanyan Lan, Jiafeng Guo, Pan Du, Xueqi Cheng Institute.
Event-Based Extractive Summarization E. Filatova and V. Hatzivassiloglou Department of Computer Science Columbia University (ACL 2004)
The P YTHY Summarization System: Microsoft Research at DUC 2007 Kristina Toutanova, Chris Brockett, Michael Gamon, Jagadeesh Jagarlamudi, Hisami Suzuki,
Learning Event Durations from Event Descriptions Feng Pan, Rutu Mulkar, Jerry R. Hobbs University of Southern California ACL ’ 06.
Tracking a Hurricane Hurricane season spans from June – November. Maximum activity in the Atlantic Ocean is early to mid September. Hurricanes are huge.
Question Answering Passage Retrieval Using Dependency Relations (SIGIR 2005) (National University of Singapore) Hang Cui, Renxu Sun, Keya Li, Min-Yen Kan,
A Survey on Automatic Text Summarization Dipanjan Das André F. T. Martins Tolga Çekiç
Relation Extraction (RE) via Supervised Classification See: Jurafsky & Martin SLP book, Chapter 22 Exploring Various Knowledge in Relation Extraction.
Automatically Labeled Data Generation for Large Scale Event Extraction
Using Social Media to Enhance Emergency Situation Awareness
A review of audio fingerprinting (Cano et al. 2005)
Wei Wei, PhD, Zhanglong Ji, PhD, Lucila Ohno-Machado, MD, PhD
Video Summarization via Determinantal Point Processes (DPP)
Exploiting Timelines to Enhance Multi-document Summarization
Lei Sha, Jing Liu, Chin-Yew Lin, Sujian Li, Baobao Chang, Zhifang Sui
Assessing Quality of Paradata to Better Understand the Data Collection Process for CAPI Social Surveys François Laflamme Milana Karaganis European Conference.
David Pelayo #19 7A Iraqi journalist throws shoes at Bush in Baghdad
Presentation transcript:

Exploiting Timelines to Enhance Multi-document Summarization Jun-Ping Ng, Yan Chen, Min-Yen Kan and Zhoujun Li National University of Singapore Beihang University

Cyclone Sidr 2007, JTWC designation: 06B Cyclone Sidr 2007, JTWC designation: 06B Image Courtesy: Univ. Wisconsin- Madison “A fierce cyclone packing extreme winds and torrential rain smashed into Bangladesh’s southwestern coast Thursday, …” 24 Jun 2014ACL Timelines in Summarization 2

24 Jun 2014ACL Timelines in Summarization 3 Image Courtesy: US Navy / Wikipedia “… wiping out homes and trees in what officials described as the worst storm in years.”

24 Jun 2014ACL Timelines in Summarization 4 Image Courtesy: US State Department / Wikipedia “More than 100,000 coastal villagers have been evacuated before the cyclone made landfall.”

Image Courtesy: US Navy / Wikipedia 1991 Bangladesh Cyclone “The storm matched one in 1991 that sparked a tidal wave that killed an estimated 138,000 people, Karmakar told AFP.” 24 Jun 2014ACL Timelines in Summarization 5

24 Jun 2014ACL Timelines in Summarization 6 [3] “The storm matched one in 1991 that sparked a tidal wave that killed an estimated 138,000 people, Karmakar told AFP.” [2] “More than 100,000 coastal villagers have been evacuated before the cyclone made landfall.” [1] “A fierce cyclone packing extreme winds and torrential rain smashed into Bangladesh’s southwestern coast Thursday, wiping out homes and trees in what officials described as the worst storm in years.”

24 Jun 2014ACL Timelines in Summarization 7 [3] “The storm matched one in 1991 that sparked a tidal wave that killed an estimated 138,000 people, Karmakar told AFP.” [2] “More than 100,000 coastal villagers have been evacuated before the cyclone made landfall.” [1] “A fierce cyclone packing extreme winds and torrential rain smashed into Bangladesh’s southwestern coast Thursday, wiping out homes and trees in what officials described as the worst storm in years.”

Timelines from Text 24 Jun 2014ACL Timelines in Summarization 8 [3] “The storm matched one in 1991 that sparked a tidal wave that killed an estimated 138,000 people, Karmakar told AFP.” [2] “More than 100,000 coastal villagers have been evacuated before the cyclone made landfall.” [1] “A fierce cyclone packing extreme winds and torrential rain smashed into Bangladesh’s southwestern coast Thursday, wiping out homes and trees in what officials described as the worst storm in years.”

Key time spans are summary worthy 24 Jun 2014ACL Timelines in Summarization 9 [3] “The storm matched one in 1991 that sparked a tidal wave that killed an estimated 138,000 people, Karmakar told AFP.” [2] “More than 100,000 coastal villagers have been evacuated before the cyclone made landfall.” [1] “A fierce cyclone packing extreme winds and torrential rain smashed into Bangladesh’s southwestern coast Thursday, wiping out homes and trees in what officials described as the worst storm in years.”

Timelines + Summarization Timelines (per input document) Summarization System Summary Lexical and positional features Timeline- derived features Jun 2014ACL Timelines in Summarization Timelines + Summarization

Outline Goal and Motivation Timeline Generation Integrating Timelines –In Scoring: (Contextual) Importance, Density –In Re-ordering: TimeMMR Experiments Discussion Jun 2014ACL Timelines in Summarization

Timeline Generation Jun 2014ACL Timelines in Summarization

1. Event-Event Temporal Classification Jun 2014ACL Timelines in Summarization (Ng et al., 2013; EMNLP)

2. Event-Timex Temporal Classification Jun 2014ACL Timelines in Summarization (Ng and Kan, 2012; COLING)

3. Timex Normalization 15 “Today”  June 6, Jun 2014ACL Timelines in Summarization (HeidelTime; Strötgen and Gertz, 2013)

Timeline Construction 1.Map normalized timexes to timeline 2.Place events which OVERLAP with timexes onto timeline 3.Place events which OVERLAP with other events onto the timeline 4.Insert rest of events based on BEFORE/AFTER ordering Jun 2014ACL Timelines in Summarization

Integrating Timelines into SWING 17 Time Span Importance Contextual Time Span Importance Sentence Temporal Coverage Density Time MMR 24 Jun 2014ACL Timelines in Summarization Temporal Processing Summarization Pipeline SWING (Ng et al., COLING 2012, TAC 2011) State-of-the-art open-source extractive summarizer NG-NUS/SWING NG-NUS/SWING Basic, k of n sentence summaries

1. Time Span Importance (TSI) Time spans which contain many events are more salient Sentences which references events in these time spans are thus better candidates for a summary 24 Jun 2014ACL Timelines in Summarization 18

2. Contextual Time Span Importance (CTSI) Time spans near to important time spans are important Search left and right for local peaks 24 Jun 2014ACL Timelines in Summarization 19, where

3. Sentence Temporal Coverage Density (TCD) Favour sentences which –contain more events –covering a wide variety of time spans 24 Jun 2014ACL Timelines in Summarization 20

Identifying Redundancies SWING makes use of the Maximal Marginal Relevance (MMR) algorithm to identify redundancies in selected sentences MMR is based largely on surface lexical similarities Idea: Let’s use time as a basis to penalize the selection of sentences from redundant time periods Jun 2014ACL Timelines in Summarization

TimeMMR Beyond lexical similarities, identify sentences which contain substantial time span overlap. Candidate sentences which share many time spans with selected sentences are penalized. 22 (1)An official in Barisal, 120 kilometres south of Dhaka, spoke of severe destruction as the 500 kilometre-wide mass of cloud passed overhead. (2)“Many trees have been uprooted and houses and schools blown away,” Mostofa Kamal, a district relief and rehabilitation officer, told AFP by telephone. (3)“Mud huts have been damaged and the roofs of several houses blown off,” said the state’s relief minister, Mortaza Hossain. Lexically dissimilar but redundant 24 Jun 2014ACL Timelines in Summarization Proportion of overlap

Experiments Data –TAC 2010 dataset for training –TAC 2011 dataset for testing Temporal Processing Systems –HeidelTime (Strötgen and Gertz, 2013) –E-T temporal classification (Ng and Kan, 2012) –E-E temporal classification (Ng et al., 2013) Summarization baseline –SWING (Ng et al., 2012) Jun 2014ACL Timelines in Summarization

Results 24 #ConfigurationR-2 RSWING B1CLASSY SWING + Timeline Features0.1394* 2SWING + Timeline Features + TimeMMR Jun 2014ACL Timelines in Summarization Doesn’t seem very effective! * = p < 0.1, ** = p < 0.05, against R row

Analysis: Timelines contain errors Errors from underlying temporal processing systems Simplifying assumptions made in timeline construction Lack of consistency checking and validation For effective use, we must identify good timelines Identify timelines which potentially contain more errors Exclude these when performing summarization Jun 2014ACL Timelines in Summarization

Reliability Filtering Short timelines can result when the system fails to extract or relate events and timexes Features derived from short timelines are prone to have extreme values Use the length of a timeline as a gauge of its accuracy Don’t use timelines shorter than average (as computed over the whole collection) Jun 2014ACL Timelines in Summarization

With Reliability Filtering 27 #ConfigurationR-2 RSWING B1CLASSY SWING + Timeline Features0.1394* 2SWING + Timeline Features + TimeMMR SWING + Timeline Features [Filtered]0.1418** 4SWING + Timeline Features + TimeMMR [Filtered]0.1402** 24 Jun 2014ACL Timelines in Summarization TimeMMR doesn’t seem effective! Why? * = p < 0.1, ** = p < 0.05, against R row

Does TimeMMR actually help? 28 L1An Iraqi reporter threw his shoes at visiting U.S. President George W. Bush and called him a ”dog” in Arabic during a news conference with Iraqi Prime Minister Nuri al-Maliki in Baghdad R1 L2”All I can report is it is a size 10,.R2 L3Muntadhar al-Zaidi, reporter of Baghdadiya television jumped and threw his two shoes one by one at the president, who ducked and thus narrowly missed being struck, raising chaos in the hall in Baghdad’s heavily fortified green Zone. The incident occurred as Bush was appearing with Iraqi Prime Minister Nouri al-Maliki. R3 L4The president lowered his head and the first shoe hit the American and Iraqi flags behind the two leaders. Muntadhar al-Zaidi, reporter of Baghdadiya television jumped and threw his two shoes one by one at the president, who ducked and thus narrowly missed being struck, raising chaos in the hall in Baghdad’s heavily fortified green Zone. R4 L5TheThe president lowered his head and theR5 R-2: , worse by R-2R-2: , better by R-2 24 Jun 2014ACL Timelines in Summarization Possibly Redundant? = Could an (automated) evaluation metric cater for time?

Conclusion Use of automatic timeline generation Integration of timelines into summarization –Sentence scoring via timeline features –Sentence re-ordering via TimeMMR –Length based timeline filtering helps to ameliorate errors Jun 2014ACL Timelines in Summarization For details on temporal processing, see: Jun Ping’s work at COLING 2012, EMNLP 2013 and his doctoral thesis (2014) Questions? If not, ask for more detailed analysis!

ADDITIONAL SLIDES Jun 2014ACL Timelines in Summarization

Related Work For Sentence Reordering –Barzilay et al., 1999 Recency as an indicator of salience –Goldstein et al., 2000;Wan, 2007; Demartini et al., 2010 –Liu et al., 2009 (“Temporal Graph”) –Wu, 2008 (“Largest Cluster”) TREC Temporal Summarization Track –Not as relevant; about monitoring an event over time 24 Jun 2014ACL Timelines in Summarization 31 Close to our TSI

32 24 Jun 2014ACL Timelines in Summarization Baseline; worse With time features; better

TSI: A crane accident With TSI, the cause of the accident in this summary is included; the alternative R1 sentence is background information and does not occur at any key time span. 24 Jun 2014ACL Timelines in Summarization 33 With TSI; better Without TSI; worse

34 24 Jun 2014ACL Timelines in Summarization With CTSI; better Without CTSI; worse With CTSI, the “warn” and “disappear” events were promoted in importance due to their proximity with peak P CTSI: Coral Reef Preservation

Timeline Caveats Some events span a long period of time (i.e., “1999”) Events are ordered based on the start of the duration Timeline captures relative order Construction algorithm does not attempt to reconcile contradictions 24 Jun 2014ACL Timelines in Summarization 35

Timex Normalization Source:Bethard, Jun 2014ACL Timelines in Summarization 36

References Jun-Ping Ng, Interpreting Text with Time, Doctoral Thesis, National University of Singapore, 2014 Jun-Ping Ng, Min-Yen Kan, Ziheng Lin, Wei Feng, Bin Chen, Jian Su, Chew-Lim Tan, Exploiting Discourse Analysis for Article-Wide Temporal Classification, EMNLP 2013 Jun-Ping Ng, Praveen Bysani, Ziheng Lin, Min-Yen Kan, Chew-Lim Tan, Exploiting Category-Specific Information for Multi-Document Summarization, COLING 2012 Jun-Ping Ng, Min-Yen Kan, Improved Temporal Relation Classification using Dependency Parses and Selective Crowdsourced Annotations, COLING Jun 2014ACL Timelines in Summarization 37