Word Sense and Subjectivity (Coling/ACL 2006) Janyce Wiebe Rada Mihalcea University of Pittsburgh University of North Texas Acknowledgements: This slide.

Slides:



Advertisements
Similar presentations
Using Syntax to Disambiguate Explicit Discourse Connectives in Text Source: ACL-IJCNLP 2009 Author: Emily Pitler and Ani Nenkova Reporter: Yong-Xiang Chen.
Advertisements

Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis Theresa Wilson Janyce Wiebe Paul Hoffmann University of Pittsburgh.
Automatic Metaphor Interpretation as a Paraphrasing Task Ekaterina Shutova Computer Lab, University of Cambridge NAACL 2010.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
Multi-Perspective Question Answering Using the OpQA Corpus Veselin Stoyanov Claire Cardie Janyce Wiebe Cornell University University of Pittsburgh.
Semi-supervised learning and self-training LING 572 Fei Xia 02/14/06.
CS Word Sense Disambiguation. 2 Overview A problem for semantic attachment approaches: what happens when a given lexeme has multiple ‘meanings’?
Sentiment Lexicon Creation from Lexical Resources BIS 2011 Bas Heerschop Erasmus School of Economics Erasmus University Rotterdam
Predicting the Semantic Orientation of Adjective Vasileios Hatzivassiloglou and Kathleen R. McKeown Presented By Yash Satsangi.
1 Noun Homograph Disambiguation Using Local Context in Large Text Corpora Marti A. Hearst Presented by: Heng Ji Mar. 29, 2004.
Learning Subjective Nouns using Extraction Pattern Bootstrapping Ellen Riloff, Janyce Wiebe, Theresa Wilson Presenter: Gabriel Nicolae.
Learning Subjective Adjectives from Corpora Janyce M. Wiebe Presenter: Gabriel Nicolae.
Semantic Video Classification Based on Subtitles and Domain Terminologies Polyxeni Katsiouli, Vassileios Tsetsos, Stathes Hadjiefthymiades P ervasive C.
A Framework for Named Entity Recognition in the Open Domain Richard Evans Research Group in Computational Linguistics University of Wolverhampton UK
Extracting Interest Tags from Twitter User Biographies Ying Ding, Jing Jiang School of Information Systems Singapore Management University AIRS 2014, Kuching,
Flash talk by: Aditi Garg, Xiaoran Wang Authors: Sarah Rastkar, Gail C. Murphy and Gabriel Murray.
Word Sense Disambiguation for Automatic Taxonomy Construction from Text-Based Web Corpora 12th International Conference on Web Information System Engineering.
Aiding WSD by exploiting hypo/hypernymy relations in a restricted framework MEANING project Experiment 6.H(d) Luis Villarejo and Lluís M à rquez.
Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-Language Information Retrieval Paul Clough 1 and Mark Stevenson 2 Department.
Multilingual Word Sense Disambiguation using Wikipedia Bharath Dandala (University of North Texas) Rada Mihalcea (University of North Texas) Razvan Bunescu.
COMP423: Intelligent Agent Text Representation. Menu – Bag of words – Phrase – Semantics – Bag of concepts – Semantic distance between two words.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Carmen Banea, Rada Mihalcea University of North Texas A Bootstrapping Method for Building Subjectivity Lexicons for Languages.
1 A study on automatically extracted keywords in text categorization Authors:Anette Hulth and Be´ata B. Megyesi From:ACL 2006 Reporter: 陳永祥 Date:2007/10/16.
1 Wikification CSE 6339 (Section 002) Abhijit Tendulkar.
Unsupervised Word Sense Disambiguation Rivaling Supervised Methods Oh-Woog Kwon KLE Lab. CSE POSTECH.
1 A Unified Relevance Model for Opinion Retrieval (CIKM 09’) Xuanjing Huang, W. Bruce Croft Date: 2010/02/08 Speaker: Yu-Wen, Hsu.
Word Sense Disambiguation UIUC - 06/10/2004 Word Sense Disambiguation Another NLP working problem for learning with constraints… Lluís Màrquez TALP, LSI,
Annotating Words using WordNet Semantic Glosses Julian Szymański Department of Computer Systems Architecture, Faculty of Electronics, Telecommunications.
1 Statistical NLP: Lecture 9 Word Sense Disambiguation.
Word Sense and Subjectivity Jan Wiebe Rada Mihalcea University of Pittsburgh University of North Texas.
Improving Subcategorization Acquisition using Word Sense Disambiguation Anna Korhonen and Judith Preiss University of Cambridge, Computer Laboratory 15.
SYMPOSIUM ON SEMANTICS IN SYSTEMS FOR TEXT PROCESSING September 22-24, Venice, Italy Combining Knowledge-based Methods and Supervised Learning for.
1 Co-Training for Cross-Lingual Sentiment Classification Xiaojun Wan ( 萬小軍 ) Associate Professor, Peking University ACL 2009.
An Effective Word Sense Disambiguation Model Using Automatic Sense Tagging Based on Dictionary Information Yong-Gu Lee
Learning to Link with Wikipedia David Milne and Ian H. Witten Department of Computer Science, University of Waikato CIKM 2008 (Best Paper Award) Presented.
A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources Author: Carmen Banea, Rada Mihalcea, Janyce Wiebe Source:
Subjectivity and Sentiment Analysis Jan Wiebe Department of Computer Science CERATOPS: Center for the Extraction and Summarization of Events and Opinions.
Subjectivity and Sentiment Analysis Jan Wiebe Department of Computer Science CERATOPS: Center for the Extraction and Summarization of Events and Opinions.
Learning Multilingual Subjective Language via Cross-Lingual Projections Mihalcea, Banea, and Wiebe ACL 2007 NLG Lab Seminar 4/11/2008.
Wikipedia as Sense Inventory to Improve Diversity in Web Search Results Celina SantamariaJulio GonzaloJavier Artiles nlp.uned.es UNED,c/Juan del Rosal,
A Scalable Machine Learning Approach for Semi-Structured Named Entity Recognition Utku Irmak(Yahoo! Labs) Reiner Kraft(Yahoo! Inc.) WWW 2010(Information.
Automatic Identification of Pro and Con Reasons in Online Reviews Soo-Min Kim and Eduard Hovy USC Information Sciences Institute Proceedings of the COLING/ACL.
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
Using Semantic Relatedness for Word Sense Disambiguation
Page 1 NAACL-HLT 2010 Los Angeles, CA Training Paradigms for Correcting Errors in Grammar and Usage Alla Rozovskaya and Dan Roth University of Illinois.
Named Entity Disambiguation on an Ontology Enriched by Wikipedia Hien Thanh Nguyen 1, Tru Hoang Cao 2 1 Ton Duc Thang University, Vietnam 2 Ho Chi Minh.
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏
Finding frequent and interesting triples in text Janez Brank, Dunja Mladenić, Marko Grobelnik Jožef Stefan Institute, Ljubljana, Slovenia.
Information Retrieval using Word Senses: Root Sense Tagging Approach Sang-Bum Kim, Hee-Cheol Seo and Hae-Chang Rim Natural Language Processing Lab., Department.
1 Generating Comparative Summaries of Contradictory Opinions in Text (CIKM09’)Hyun Duk Kim, ChengXiang Zhai 2010/05/24 Yu-wen,Hsu.
Learning Subjective Nouns using Extraction Pattern Bootstrapping Ellen Riloff School of Computing University of Utah Janyce Wiebe, Theresa Wilson Computing.
Automatic acquisition for low frequency lexical items Nuria Bel, Sergio Espeja, Montserrat Marimon.
Improved Video Categorization from Text Metadata and User Comments ACM SIGIR 2011:Research and development in Information Retrieval - Katja Filippova -
Subjectivity Recognition on Word Senses via Semi-supervised Mincuts Fangzhong Su and Katja Markert School of Computing, University of Leeds Human Language.
1 Gloss-based Semantic Similarity Metrics for Predominant Sense Acquisition Ryu Iida Nara Institute of Science and Technology Diana McCarthy and Rob Koeling.
1 Fine-grained and Coarse-grained Word Sense Disambiguation Jinying Chen, Hoa Trang Dang, Martha Palmer August 22, 2003.
Learning Extraction Patterns for Subjective Expressions 2007/10/09 DataMining Lab 안민영.
From Words to Senses: A Case Study of Subjectivity Recognition Author: Fangzhong Su & Katja Markert (University of Leeds, UK) Source: COLING 2008 Reporter:
Word classes and part of speech tagging. Slide 1 Outline Why part of speech tagging? Word classes Tag sets and problem definition Automatic approaches.
Learning Event Durations from Event Descriptions Feng Pan, Rutu Mulkar, Jerry R. Hobbs University of Southern California ACL ’ 06.
Subjectivity and Sentiment Analysis Jan Wiebe Department of Computer Science Intelligent Systems Program University of Pittsburgh.
Finding Predominant Word Senses in Untagged Text Diana McCarthy & Rob Koeling & Julie Weeds & Carroll Department of Indormatics, University of Sussex {dianam,
Question Classification Ling573 NLP Systems and Applications April 25, 2013.
Using lexical chains for keyword extraction
Erasmus University Rotterdam
Statistical NLP: Lecture 9
Statistical NLP : Lecture 9 Word Sense Disambiguation
Presentation transcript:

Word Sense and Subjectivity (Coling/ACL 2006) Janyce Wiebe Rada Mihalcea University of Pittsburgh University of North Texas Acknowledgements: This slide is created based on the presentation slides from

2/33 Outline Motivation and Goals Assigning Subjectivity Labels to Word Senses Manually Automatically Word Sense Disambiguation using Automatic Subjectivity Analysis Conclusions

3/33 Outline Motivation and Goals Assigning Subjectivity Labels to Word Senses Manually Automatically Word Sense Disambiguation using Automatic Subjectivity Analysis Conclusions

4/33 Motivation Growing interest in the automatic extraction of opinions, emotions, and sentiments in text  subjectivity Continuing interest in word sense Though both are concerned with text meaning, they have mainly been investigated independently

5/33 Subjectivity Labels on Senses Alarm: S: Alarm, dismay, consternation – (fear resulting from the awareness of danger) 驚慌 O: Alarm, warning device, alarm system – (a device that signals the occurrence of some undesirable event) 警報器 Interest: S: Interest, involvement -- (a sense of concern with and curiosity about someone or something; "an interest in music") 愛好 O: Interest -- (a fixed charge for borrowing money; usually a percentage of the amount borrowed; "how much interest do you pay on your mortgage?") 利息

6/33 WSD using Subjectivity Tagging Sense 4 “a sense of concern with and curiosity about someone or something” S Sense 1 “a fixed charge for borrowing money” O WSD System Sense 4 Sense 1? Sense 1 Sense 4? The notes do not pay interest. He spins a riveting plot which grabs and holds the reader’s interest.

7/33 Sense 4 “a sense of concern with and curiosity about someone or something” S Sense 1 “a fixed charge for borrowing money” O WSD using Subjectivity Tagging The notes do not pay interest. He spins a riveting plot which grabs and holds the reader’s interest. WSD System Sense 4 Sense 1? Sense 1 Sense 4? Subjectivity Classifier S O

8/33 Sense 4 “a sense of concern with and curiosity about someone or something” S Sense 1 “a fixed charge for borrowing money” O WSD using Subjectivity Tagging The notes do not pay interest. He spins a riveting plot which grabs and holds the reader’s interest. WSD System Sense 4 Sense 1? Sense 1 Sense 4? Subjectivity Classifier S O

9/33 Subjectivity Classifier Subjectivity Tagging using WSD The notes do not pay interest. He spins a riveting plot which grabs and holds the reader’s interest. O S? S O?

10/33 Subjectivity Classifier S Sense 4 “a sense of concern with and curiosity about someone or something” O Sense 1 “a fixed charge for borrowing money” Subjectivity Tagging using WSD The notes do not pay interest. He spins a riveting plot which grabs and holds the reader’s interest. WSD System Sense 4 Sense 1 O S? S O?

11/33 Subjectivity Classifier S Sense 4 “a sense of concern with and curiosity about someone or something” O Sense 1 “a fixed charge for borrowing money” Subjectivity Tagging using WSD The notes do not pay interest. He spins a riveting plot which grabs and holds the reader’s interest. WSD System Sense 4 Sense 1 O S? S O?

12/33 Goals Explore interactions between word sense and subjectivity Can subjectivity labels be assigned to word senses? Manually Automatically Can subjectivity analysis improve word sense disambiguation? Can word sense disambiguation improve subjectivity analysis?  Future work

13/33 Outline Motivation and Goals Assigning Subjectivity Labels to Word Senses Manually Automatically Word Sense Disambiguation using Automatic Subjectivity Analysis Conclusions

14/33 Subjectivity Tagging Prior art: identify words and phrases associated with subjectivity  subjectivity classification of sentences, clauses, phrases, or word instances in context Here: subjectivity labels are applied to word senses

15/33 Annotation Scheme Assigning subjectivity labels to WordNet senses S: Subjective O: Objective B: Both Annotators are given the synset and its hypernym

16/33 Senses Definitions When the sense is used in a text or conversation … S: expect it to express subjectivity, and expect the phrase/sentence containing it to be subjective. O: don ’ t expect it to express subjectivity and, if the phrase/sentence containing it is subjective, the subjectivity is due to something else. Ex: Will someone shut that darn alarm off? B: covers both subjective and objective usages

17/33 Annotated Data 64 words; 354 senses Balanced subset 32 words; 138 senses  2 judges  Agreement The ambiguous nouns of the SENSEVAL-3 English Lexical Task 20 words; 117 senses  2 judges  WSD w/ subj. info. Others 12 words; 99 senses  1 judge  Auto subjectivity annotation on word sense

18/33 Agreement Results (tag U removed  95%, : 0.90)

19/33 Outline Motivation and Goals Assigning Subjectivity Labels to Word Senses Manually Automatically Word Sense Disambiguation using Automatic Subjectivity Analysis Conclusions

20/33 Main idea Assess the subjectivity of a word sense based on information about the subjectivity of: a set of distributionally similar words (DSW) on unannotated corpus (BNC) in a corpus annotated with subjective expressions (MPQA)

21/33 Algorithm_ diagram Unannotated Corpus (BNC) dsw 1 inst 1 (S) Annotated dsw 1 inst 2 (O) Corpus dsw 2 inst 1 (S) (MPQA) word sense w i alarm w 1 : 驚慌 w 2 : 警報器 DSW = {detector, panic} [-1, 1] [highly objective, highly subjective]

22/33 Algorithm_ pseudo code

23/33 Evaluation No DSW instances for 82 senses  auto assignment of subjectivity for =272 word senses Calculate subj scores for all word senses, sort them, and assign a subj label to the top N word sense  calculate the precision of the algorithm at different points of recall Baseline: random selection of S labels Number of assigned S labels matches number of S labels in the gold standard (recall = 1.0)

24/33 PR curves use only those DSWs that lead to the highest similarity (link)link

25/33 How about the # of DSW? represents the value where precision and recall become equal

26/33 Outline Motivation and Goals Assigning Subjectivity Labels to Word Senses Manually Automatically Word Sense Disambiguation using Automatic Subjectivity Analysis Conclusions

27/33 WSD + Subj. Augment an existing WSD system with a feature reflecting the subjectivity of the context of the ambiguous word Compare the performance of original and subjectivity-aware WSD systems The 20 ambiguous nouns of the SENSEVAL-3 English Lexical Task

28/33 WSD using Subjectivity Tagging Subjectivity Classifier S, O, or B S Original WSD System Subjectivity Aware WSD System Sense 4Sense 1 Sentence i “interest” Sense 1 “a sense of concern with and curiosity about someone or something” Sense 4 “a fixed charge for borrowing money”

29/33 Results Words with Only O Senses Words with S and O Senses the most frequent sense

30/33 Outline Motivation and Goals Assigning Subjectivity Labels to Word Senses Manually Automatically Word Sense Disambiguation using Automatic Subjectivity Analysis Conclusions

31/33 Conclusion Can subjectivity labels be assigned to word senses? Manually Good agreement; Kappa=0.74 Very good when uncertain cases removed; Kappa=0.90 Automatically Method substantially outperforms baseline Showed feasibility of assigning subjectivity labels to the fine-grained level of word senses

32/33 Conclusion Can subjectivity analysis improve word sense disambiguation? Improves performance, but mainly for words with both S and O senses (4.3% error reduction; significant (p < 0.05)) Performance largely remains the same or degrades for words that don ’ t Assign subjectivity labels to WordNet; WSD system should consult WordNet tags to decide when to pay attention to the contextual subjectivity feature.

33/33 Thank you

34/33 Method – Optional Variation w 1 dsw 1 dsw 2 dsw 3 w 2 dsw 1 dsw 2 dsw 3 w 3 dsw 1 dsw 2 dsw 3 if k is in a subjective expression: subj += sim(w i,dsw j ) else: subj -= sim(w i,dsw j ) “Selected” back