Extractive Summarisation via Sentence Removal: Condensing Relevant Sentences into a Short Summary Marco Bonzanini, Miguel Martinez-Alvarez, and Thomas.

Extractive Summarisation via Sentence Removal: Condensing Relevant Sentences into a Short Summary Marco Bonzanini, Miguel Martinez-Alvarez, and Thomas Roelleke Queen Mary University of London {marcob,miguel,thor}@eecs.qmul.ac.uk SIGIR '13

Introduction u The main contribution of this paper is the definition of an algorithm for sentence removal, developed to maximise the score between the summary and the original document. u Instead of ranking the sentences and selecting the most important ones, the algorithm iteratively removes unimportant sentences until a desired compression rate is reached. 2

Extractive Summarisation (1/4) u The task of extractive summarisation is to select the subset of sentences and to combine them into a summary which better represents the topic. u In order to form the summary, a length limit has to be considered, based on the number of sentences or the number of words. 3

Extractive Summarisation (2/4) 4

Sentence Selection (1/3) 7

Sentence Removal 10

Sentence Removal (Cont’d) 11

SR Algorithm 12

Opinosis Dataset u The Opinosis dataset is a collection of opinion- oriented data, divided into 51 different topics. u Each topic includes a number of sentences (min. 50, max. 575, avg. 139), taken from different reviews from popular review web sites. u For each topic, 4 or 5 golden standard (human- written) summaries are provided. u The golden standard summaries hence present the pivot opinion for each topic, in a concise way (approx. 2 sentences each). u For this reason, the maximum length of the system generated summaries is fixed to two sentences. 13

ROUGE Framework u The ROUGE framework is used to provide a quantitative assessment between the candidate summaries and the golden standards. u Specifically, the results for ROUGE-1, ROUGE-2, and ROUGE-SU4 are reported. u This study also reports the results for MEAD, a state-of-the-art extractive summariser based on cluster centroids. u The best overall results are shown in bold, and the best results within the same scoring function are shown in italic. u Best results labelled with a † show that second-best results are outside their 95% confidence interval. 14

Results for Recall ROUGE-1ROUGE-2ROUGE-SU4 MEAD49.32 †10.5823.16 † 33.8006.4312.12 18.8403.9903.91 22.3205.7505.43 20.4605.5404.97 37.4609.2913.80 46.0508.6720.10 15.7802.6403.03 15.6001.4402.96 15

Results for Precision ROUGE-1ROUGE-2ROUGE-SU4 MEAD09.1601.8401.02 15.2102.7803.00 30.4206.6411.22 29.3907.7810.27 30.7008.3612.10 19.8705.1805.44 09.6401.7701.10 25.3604.6808.72 12.7001.2002.23 16

ROUGE-1ROUGE-2ROUGE-SU4 MEAD15.1503.0801.89 19.8003.6604.19 22.8404.8805.47 24.6706.3906.42 24.0306.5006.59 24.3806.2306.31 15.6402.8802.03 19.0103.2804.16 13.3301.2502.11 17

ROUGE-1ROUGE-2ROUGE-SU4 MEAD26.2705.4304.34 25.1904.7105.94 20.2004.2904.39 23.0505.9505.66 21.6805.8605.48 29.92 †07.54 †08.28 † 25.3904.7004.16 16.8802.8603.37 14.3801.3402.38 18

Conclusions 19

Extractive Summarisation via Sentence Removal: Condensing Relevant Sentences into a Short Summary Marco Bonzanini, Miguel Martinez-Alvarez, and Thomas.

Similar presentations

Presentation on theme: "Extractive Summarisation via Sentence Removal: Condensing Relevant Sentences into a Short Summary Marco Bonzanini, Miguel Martinez-Alvarez, and Thomas."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Extractive Summarisation via Sentence Removal: Condensing Relevant Sentences into a Short Summary Marco Bonzanini, Miguel Martinez-Alvarez, and Thomas.

Similar presentations

Presentation on theme: "Extractive Summarisation via Sentence Removal: Condensing Relevant Sentences into a Short Summary Marco Bonzanini, Miguel Martinez-Alvarez, and Thomas."— Presentation transcript:

Similar presentations

About project

Feedback