Identifying Sarcasm in Twitter: A Closer Look

Slides:



Advertisements
Similar presentations
Dan Jurafsky Lecture 4: Sarcasm, Alzheimers, +Distributional Semantics Computational Extraction of Social and Interactional Meaning SSLST, Summer 2011.
Advertisements

Progress Monitoring. Progress Monitoring Steps  Monitor the intervention’s progress as directed by individual student’s RtI plan  Establish a baseline.
Instructor: Smaranda Muresan Columbia University Course Introduction.
Computational Extraction of Social and Interactional Meaning from Speech Dan Jurafsky and Mari Ostendorf Lecture 7: Dialog Acts & Sarcasm Mari Ostendorf.
Sentiment Analysis on Twitter Data
Tweet Classification for Political Sentiment Analysis Micol Marchetti-Bowick.
Distant Supervision for Emotion Classification in Twitter posts 1/17.
Problem Semi supervised sarcasm identification using SASI
Computational Models of Discourse Analysis Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Semi Supervised Recognition of Sarcastic Sentences in Twitter and Amazon Dmitry DavidovOren TsurAri Rappoport.
Sarcasm Detection on Twitter A Behavioral Modeling Approach
Pollyanna Gonçalves (UFMG, Brazil) Matheus Araújo (UFMG, Brazil) Fabrício Benevenuto (UFMG, Brazil) Meeyoung Cha (KAIST, Korea) Comparing and Combining.
Intensity of Emotions and Narrative Structure Tibor Pólya Institute for Psychology Hungarian Academy of Sciences.
CS 599: Social Media Analysis University of Southern California1 Sentiment Analysis Kristina Lerman University of Southern California.
A Survey on Text Categorization with Machine Learning Chikayama lab. Dai Saito.
Peiti Li 1, Shan Wu 2, Xiaoli Chen 1 1 Computer Science Dept. 2 Statistics Dept. Columbia University 116th Street and Broadway, New York, NY 10027, USA.
Literary Style Classification with Deep Linguistic Features Hyung Jin Kim Minjong Chung Wonhong Lee.
Extracting Social Meaning Identifying Interactional Style in Spoken Conversation Jurafsky et al ‘09 Presented by Laura Willson.
Predicting the Semantic Orientation of Adjectives
Techniques for Emotion Classification Julia Hirschberg COMS 4995/6998 Thanks to Kaushal Lahankar.
Extracting Interest Tags from Twitter User Biographies Ying Ding, Jing Jiang School of Information Systems Singapore Management University AIRS 2014, Kuching,
#title We know tweeted last summer ! Shrey Gupta & Sonali Aggarwal.
Forecasting with Twitter data Presented by : Thusitha Chandrapala MARTA ARIAS, ARGIMIRO ARRATIA, and RAMON XURIGUERA.
Lexical Differences in Autobiographical Narratives from Schizophrenic Patients and Healthy Controls Kai Hong, Christian G. Kohler, Mary E. March, Amber.
WEB FORUM MINING BASED ON USER SATISFACTION PAGE 1 WEB FORUM MINING BASED ON USER SATISFACTION By: Suresh Pokharel Information and Communications Technologies.
Unit 1 – Understanding Non-Fiction and Media Texts
Deriving Topics and Opinions from Microblogs Feng Jiang Supervisors: Jixue Liu & Jiuyong Li.
Some Advances in Transformation-Based Part of Speech Tagging
Carmen Banea, Rada Mihalcea University of North Texas A Bootstrapping Method for Building Subjectivity Lexicons for Languages.
LIWC Linguistic Inquiry & Word Count Jeff Spicer & Matthew Egizii.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
1 Emotion Classification Using Massive Examples Extracted from the Web Ryoko Tokuhisa, Kentaro Inui, Yuji Matsumoto Toyota Central R&D Labs/Nara Institute.
This work is supported by the Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior National Business Center contract number.
Hierarchical emotion classification and emotion component analysis on chinese micro-blog posts Hua Xu 1, Weiwei Yang 1, Jiushuo Wang 1, 2 1 State Key Laboratory.
A Weakly-Supervised Approach to Argumentative Zoning of Scientific Documents Yufan Guo Anna Korhonen Thierry Poibeau 1 Review By: Pranjal Singh Paper.
14/12/2009ICON Dipankar Das and Sivaji Bandyopadhyay Department of Computer Science & Engineering Jadavpur University, Kolkata , India ICON.
Prediction of Influencers from Word Use Chan Shing Hei.
Summarization Focusing on Polarity or Opinion Fragments in Blogs Yohei Seki Toyohashi University of Technology Visiting Scholar at Columbia University.
1 Computation Approaches to Emotional Speech Julia Hirschberg
Recognizing Stances in Ideological Online Debates.
Emotions from text: machine learning for text-based emotion prediction Cecilia Alm, Dan Roth, Richard Sproat UIUC, Illinois HLT/EMPNLP 2005.
CSC 594 Topics in AI – Text Mining and Analytics
Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏
Tools for Linguistic Analysis. Overview of Linguistic Tools  Dictionaries  Linguistic Inquiry and Word Count (LIWC) Linguistic Inquiry and Word Count.
Reputation Management System
Lexical Affect Sensing: Are Affect Dictionaries Necessary to Analyze Affect? Alexander Osherenko, Elisabeth André University of Augsburg.
Web Intelligence and Intelligent Agent Technology 2008.
Feelings and Emotions. Angry What does it mean to be angry? What makes you angry or mad? Make a mad or angry face.
2014 Lexicon-Based Sentiment Analysis Using the Most-Mentioned Word Tree Oct 10 th, 2014 Bo-Hyun Kim, Sr. Software Engineer With Lina Chen, Sr. Software.
Semi-Supervised Recognition of Sarcastic Sentences in Twitter and Amazon -Smit Shilu.
Twitter as a Corpus for Sentiment Analysis and Opinion Mining
Interpreting Ambiguous Emotional Expressions Speech Analysis and Interpretation Laboratory ACII 2009.
Project Deliverable-1 -Prof. Vincent Ng -Girish Ramachandran -Chen Chen -Jitendra Mohanty.
Towards Semantic Affect Sensing in Sentences Alexander Osherenko.
Finnish sentiment analysis and a tool for social media tracking
A Sentiment-Based Approach to Twitter User Recommendation BY AJAY ABDULPUR RAJARAM NIKKAM.
Homework 3 Progress Presentation -Meet Shah. Goal Identify whether tweet is sarcastic or not.
IDENTIFYING GREAT TEACHERS THROUGH THEIR ONLINE PRESENCE Evanthia Faliagka, Maria Rigou, Spiros Sirmakessis.
A Simple Approach for Author Profiling in MapReduce
Name: Sushmita Laila Khan Affiliation: Georgia Southern University
Presenter: Jia-Kuan Lin Advisor: Chung-Hsien Wu
Automatic Hedge Detection
Sentiment Analysis Study
MID-SEM REVIEW.
The Extended Cohn-Kanade Dataset (CK+): A complete dataset for action unit and emotion-specified expression By: Patrick Lucey, Jeffrey F. Cohn, Takeo.
Proportion of Original Tweets
Elham Rastegari University of Nebraska at Omaha College of IS&T
Predicting Prevalence of Influenza-Like Illness From Geo-Tagged Tweets
Introduction to Sentiment Analysis
Presentation transcript:

Identifying Sarcasm in Twitter: A Closer Look Roberto Gonzalez Smaranda Muresan Nina Wacholder

Aim of the study To construct a corpus of sarcastic utterances that have been explicitly labeled so by the composers themselves. (#sarcasm, #sarcastic) To exemplify the difficulty in distinguishing sarcastic sentences from negative/positive sentences.

Data Data for the study is divided in three sets of 900 tweets each: sarcastic, positive and negative. Each data set is culled from twitter using appropriate hash-tags. Sarcasm: #sarcasm, #sarcastic Positive: #happy, #joy, #lucky Negative: #sadness, #frustrated, #angry

Data Preprocessing Tweets tagged with #sarcasm or #sarcastic in the middle of the tweet removed. Manually checked to see if the tags were a part of the content of the tweet. Eg: “I really love #sarcasm”

Lexical features Unigrams Dictionary based Pennebaker et al (LIWC) Linguistic Processes (adverbs, pronouns) Psychological Processes (Positive, negative emotion) Personal Concerns (work, achievement) Spoken Categories ( assent, non-fluencies) WordNet Affect List of interjections and punctuations

Pragmatic Features Positive emoticons Negative emoticons ToUser smileys Negative emoticons Frowning faces ToUser @user

Comparisons and X2 rankings

Classification Logistic Regression and Support Vector Machine with SMO (sequential minimal optimization) Features used: Unigrams Dictionary features presence (LIWC+_P) Dictionary features frequency (LIWC+_F)

Classification Results

Comparison against human performance 3 judges asked to classify tweets as sarcastic, positive or negative. (90 tweets per category) S-N-P: 50% agreement (k = 0.4788) S-NS: 71.67% agreement (k = 0.5861) Emoticon based S-NS: 89% agreement (k = 0.74)

Human Comparison results