Why Watching Movie Tweets Won’t Tell the Whole Story? Felix Ming-Fai Wong, Soumya Sen, Mung Chiang EE, Princeton University 1 WOSN’12. August 17, Helsinki.

Slides:

Advertisements

Similar presentations

By Klejdi Muca & Stephen Quinn. A method used by companies like IMDB or Netlfix to turn raw data into useful information, for example It helps companies.

Advertisements

Inferring User Political Preferences from Streaming Communications Svitlana Volkova 1, Glen Coppersmith 2 and Benjamin Van Durme 1,2 1 Center for Language.

Recognizing Human Actions by Attributes CVPR2011 Jingen Liu, Benjamin Kuipers, Silvio Savarese Dept. of Electrical Engineering and Computer Science University.

Sentiment Analysis on Twitter Data

Sophomore Slumpware Predicting Album Sales with Artificial Neural Networks Matthew Wirtala ECE 539.

Bringing Order to the Web: Automatically Categorizing Search Results Hao Chen SIMS, UC Berkeley Susan Dumais Adaptive Systems & Interactions Microsoft.

Tweet Classification for Political Sentiment Analysis Micol Marchetti-Bowick.

Towards Twitter Context Summarization with User Influence Models Yi Chang et al. WSDM 2013 Hyewon Lim 21 June 2013.

Distant Supervision for Emotion Classification in Twitter posts 1/17.

Predicting Emerging Social Conventions in Online Social Networks Farshad Kooti * Winter Mason † Krishna Gummadi * Meeyoung Cha ‡ MPI-SWS * Stevens Institute.

LYRIC-BASED ARTIST NETWORK Derek Gossi CS 765 Fall 2014.

Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.

Data Mining Classification: Alternative Techniques

Problem Semi supervised sarcasm identification using SASI

Semi Supervised Recognition of Sarcastic Sentences in Twitter and Amazon Dmitry DavidovOren TsurAri Rappoport.

Data preprocessing before classification In Kennedy et al.: “Solving data mining problems”

Pollyanna Gonçalves (UFMG, Brazil) Matheus Araújo (UFMG, Brazil) Fabrício Benevenuto (UFMG, Brazil) Meeyoung Cha (KAIST, Korea) Comparing and Combining.

Made with OpenOffice.org 1 Sentiment Classification using Word Sub-Sequences and Dependency Sub-Trees Pacific-Asia Knowledge Discovery and Data Mining.

The Wisdom of the Few A Collaborative Filtering Approach Based on Expert Opinions from the Web Xavier Amatriain Telefonica Research Nuria Oliver Telefonica.

Semantic Analysis of Movie Reviews for Rating Prediction

Decision Tree Rong Jin. Determine Milage Per Gallon.

Supervised classification performance (prediction) assessment Dr. Huiru Zheng Dr. Franscisco Azuaje School of Computing and Mathematics Faculty of Engineering.

Distributed Representations of Sentences and Documents

CONTENT-BASED BOOK RECOMMENDING USING LEARNING FOR TEXT CATEGORIZATION TRIVIKRAM BHAT UNIVERSITY OF TEXAS AT ARLINGTON DATA MINING CSE6362 BASED ON PAPER.

Twitter Volume Spikes: Analysis and Application in Stock Trading Yuexin Mao, Wei Wei and Bing Wang COMP4332/RMBI4310 CHAN Chun Ting ( )

Selective Sampling on Probabilistic Labels Peng Peng, Raymond Chi-Wing Wong CSE, HKUST 1.

STUDENTLIFE PREDICTIVE MODELING Hongyu Chen Jing Li Mubing Li CS69/169 Mobile Health March 2015.

M. Sulaiman Khan Dept. of Computer Science University of Liverpool 2009 COMP527: Data Mining Classification: Evaluation February 23,

Mining the Peanut Gallery: Opinion Extraction and Semantic Classification of Product Reviews K. Dave et al, WWW 2003, citations Presented by Sarah.

Supervised Learning and k Nearest Neighbors Business Intelligence for Managers.

Lecture #32 WWW Search. Review: Data Organization Kinds of things to organize –Menu items –Text –Images –Sound –Videos –Records (I.e. a person ’ s name,

CSSE463: Image Recognition Day 27 This week This week Last night: k-means lab due. Last night: k-means lab due. Today: Classification by “boosting” Today:

 Text Representation & Text Classification for Intelligent Information Retrieval Ning Yu School of Library and Information Science Indiana University.

Feature selection LING 572 Fei Xia Week 4: 1/29/08 1.

Microblogs: Information and Social Network Huang Yuxin.

Improving Classification Accuracy Using Automatically Extracted Training Data Ariel Fuxman A. Kannan, A. Goldberg, R. Agrawal, P. Tsaparas, J. Shafer Search.

Modeling the Human Classification of Galaxy Morphology Wednesday, December 5, 2007 Mike Specian.

Prediction of Molecular Bioactivity for Drug Design Experiences from the KDD Cup 2001 competition Sunita Sarawagi, IITB

Automatic Syllabus Classification JCDL – Vancouver – 22 June 2007 Edward A. Fox (presenting co-author), Xiaoyan Yu, Manas Tungare, Weiguo Fan, Manuel Perez-Quinones,

Social Media 101 An Overview of Social Media Basics.

Combining multiple learners Usman Roshan. Bagging Randomly sample training data Determine classifier C i on sampled data Goto step 1 and repeat m times.

Prediction of Influencers from Word Use Chan Shing Hei.

Department of Electrical Engineering and Computer Science Kunpeng Zhang, Yu Cheng, Yusheng Xie, Doug Downey, Ankit Agrawal, Alok Choudhary {kzh980,ych133,

Sentiment Analysis with Incremental Human-in-the-Loop Learning and Lexical Resource Customization Shubhanshu Mishra 1, Jana Diesner 1, Jason Byrne 2, Elizabeth.

CoCQA : Co-Training Over Questions and Answers with an Application to Predicting Question Subjectivity Orientation Baoli Li, Yandong Liu, and Eugene Agichtein.

CSSE463: Image Recognition Day 33 This week This week Today: Classification by “boosting” Today: Classification by “boosting” Yoav Freund and Robert Schapire.

Measuring Behavioral Trust in Social Networks

Recognizing Stances in Online Debates Unsupervised opinion analysis method for debate-side classification. Mine the web to learn associations that are.

Germany / United Kingdom Panels, Tracking & General Findings.

Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales Bo Pang and Lillian Lee Cornell University Carnegie.

CS378 Final Project The Netflix Data Set Class Project Ideas and Guidelines.

Predicting Voice Elicited Emotions

Fabricio Benevenuto, Gabriel Magno, Tiago Rodrigues, and Virgilio Almeida Universidade Federal de Minas Gerais Belo Horizonte, Brazil ACSAC 2010 Fabricio.

2014 Lexicon-Based Sentiment Analysis Using the Most-Mentioned Word Tree Oct 10 th, 2014 Bo-Hyun Kim, Sr. Software Engineer With Lina Chen, Sr. Software.

Semi-Supervised Recognition of Sarcastic Sentences in Twitter and Amazon -Smit Shilu.

Alvin CHAN Kay CHEUNG Alex YING Relationship between Twitter Events and Real-life.

PREDICTION ON TWEET FROM DYNAMIC INTERACTION Group 19 Chan Pui Yee Wong Tsz Wing Yeung Chun Kit.

Opinion spam and Analysis 소프트웨어공학 연구실 G 최효린 1 / 35.

A Sentiment-Based Approach to Twitter User Recommendation BY AJAY ABDULPUR RAJARAM NIKKAM.

Passengers Theo warrington.

Sentiment Analysis of Twitter Messages Using Word2Vec

Name: Sushmita Laila Khan Affiliation: Georgia Southern University

Sentiment analysis tools

The Problem: Classification

Erasmus University Rotterdam

Estimating Link Signatures with Machine Learning Algorithms

iSRD Spam Review Detection with Imbalanced Data Distributions

CSCI N317 Computation for Scientific Applications Unit Weka

Predicting Loan Defaults

Outlines Introduction & Objectives Methodology & Workflow

Presentation transcript:

Why Watching Movie Tweets Won’t Tell the Whole Story? Felix Ming-Fai Wong, Soumya Sen, Mung Chiang EE, Princeton University 1 WOSN’12. August 17, Helsinki.

Twitter: A Gold Mine? 2 Apps: Mombo, TwitCritics, Fflick

Or Too Good to be True? 3

Why representativeness?  Selection bias  Gender  Age  Education  Computer skills  Ethnicity  Interests  The silent majority 4

Why Movies?  Large online interest  Relatively unambiguous  Right in Timing: Oscars 5

Key Questions  Is there a bias in Twitter movie ratings?  How do Twitters compare to other online site?  Is there a quantifiable difference across types of movies?  Oscar nominated vs. non-nominated commercial movies  Can hype ensure Box-office gains? 6

Data Stats  12 Million Tweets  1.77M valid movie tweets  February 2 – March 12, 2012  34 movie titles  New releases (20)  Oscar-nominees (14) 7

Example Movies Commercial Movies (non-nominees)Oscar-nominees The GreyThe Descendants Underworld: Awakening The Artist Contraband The Help Haywire Hugo The Woman in BlackMidnight in Paris Chronicle Moneyball The Vow The Tree of Life Journey 2: The Mysterious IslandWar Horse 8 Oscar-nominees for Best Picture or Best Animated Film

Rating Comparison  Twitter movie ratings  Binary Classification: Positivity/Negativity  IMDb, Rotten Tomatoes ratings  Rating scale: 0 – 10, 0 – 5  Benchmark definition  Rotten Tomatoes: Positive - 3.5/5  IMDb: Positive: 7/10  Movie score: weighted average  High mutual information 9

Challenges  Common Words in titles  e.g., “Thanks for the help ”, “ the grey sky”  No API support for exact matching  e.g., “a grey cat in the box”  Misrepresentations  e.g., “underworld awakening”  Non-standard vocabulary  e.g., “sick movie”  Non-English tweets  Retweets: approves original opinion 10

Tweet Classification 11

Steps  Preprocessing  Remove usernames  Conversion:  Punctuation marks (e.g., “!” to a meta-word “exclmark”)  Emoticons (e,g,, or :),  etc. ), URLs,  Feature Vector Conversion  Preprocessed tweet to binary feature vector (using MALLET)  Training & Classification  11K non-repeated, randomly sampled tweets manually labeled  Trained three classifiers 12

Classifier Comparison  SVM based classifier performance is significantly better  Numbers in brackets are balanced accuracy rates 13

Temporal Context 14 Woman in Black Chronicle The Vow Safe House  Most “current” tweets are “mentions”  LBS  “Before” or “After” tweets are much more positive Fraction of tweets in joint-classes

Sentiment: Positive Bias 15 Woman in Black Chronicle The Vow Safe House Haywire Star Wars I  Twitters are overwhelmingly more positive for almost all movies

Rotten Tomatoes vs. IMDb?  Good match between RT and IMDb (Benchmark) 16

Twitters vs. IMDb vs. RT ratings (1)  New (non-nominated) movies score more positively from Twitter users 17 IMDb vs. Twitter RT vs. Twitter

Twitters vs. IMDb vs. RT ratings (2)  Oscar-nominees rate more positively on IMDb and RT 18 IMDb vs. Twitter RT vs. Twitter

Quantification Metrics (1)  (x*, y*) are the medians of proportion of positive ratings in Twitter and IMDb/RT 19 B P Positiveness (P): Bias (B): IMDb/RT score Twitter score x* y* 1 1

Quantification Metrics (2) 20 Inferrability (I):  If one knows the average rating on Twitter, how well can he/she predict the ratings on IMDb/RT?

Summary Metrics 21  Oscar-nominees have higher positiveness  RT-IMDb have high mutual inferrability and low bias  Twitter users are more positively biased for non-nominated new releases

Hype-approval vs. Ratings 22  Higher (lower) Hype-approval ≠ Higher (lower) IMDb/RT ratings The Vow This Means War

Hype-approval vs. Box-office  High Hype + High IMDb rating: Box-office success  Other combination: Unpredictable 23 Journey 2

Summary  Tweeters aren’t representative enough  Need to be careful about tweets as online polls  Tweeters’ tend to appear more positive, have specific interests and taste  Need for quantitative metrics Thanks! 24