Cristian Danescu-Niculescu-Mizil 1, Gueorgi Kossinets 2, Jon Kleinberg 1, Lillian Lee 1 1 Dept. of Computer Science, Cornell University, 2 Google Inc.

Slides:



Advertisements
Similar presentations
Cristian Danescu-Niculescu-Mizil Dept. of Computer Science Cornell University Gueorgi Kossinets Google Inc. Jon Kleinberg Dept. of Computer Science Cornell.
Advertisements

© The McGraw-Hill Companies, Inc., 2000 CorrelationandRegression Further Mathematics - CORE.
CHAPTER 20 Psychological Research and Statistics.
Correlation Chapter 9.
The Wisdom of the Few A Collaborative Filtering Approach Based on Expert Opinions from the Web Xavier Amatriain Telefonica Research Nuria Oliver Telefonica.
Unit 2: Research Methods in Psychology
Agenda for January 25 th Administrative Items/Announcements Attendance Handouts: course enrollment, RPP instructions Course packs available for sale in.
Experimental Evaluation
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Chapter 2 Research Methods. The Scientific Approach: A Search for Laws Empiricism: testing hypothesis Basic assumption: events are governed by some lawful.
Variance and Standard Deviation. Variance: a measure of how data points differ from the mean Data Set 1: 3, 5, 7, 10, 10 Data Set 2: 7, 7, 7, 7, 7 What.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 4 Summarizing Data.
Correlation and Linear Regression
Chapter 2: The Research Enterprise in Psychology
Chapter 2: The Research Enterprise in Psychology
Chapter 2 Research Methods. The Scientific Approach: A Search for Laws Empiricism: testing hypothesis Basic assumption: events are governed by some lawful.
Chapter 2 The Research Enterprise in Psychology. n Basic assumption: events are governed by some lawful order  Goals: Measurement and description Understanding.
PowerPoint presentation to accompany Research Design Explained 6th edition ; ©2007 Mark Mitchell & Janina Jolley Chapter 7 Introduction to Descriptive.
Copyright © Allyn & Bacon 2007 Chapter 2: Research Methods.
Analyzing Reliability and Validity in Outcomes Assessment (Part 1) Robert W. Lingard and Deborah K. van Alphen California State University, Northridge.
Chapter 1: The Research Enterprise in Psychology.
The Research Enterprise in Psychology. The Scientific Method: Terminology Operational definitions are used to clarify precisely what is meant by each.
User Study Evaluation Human-Computer Interaction.
Nature of Science Science Nature of Science Scientific methods Formulation of a hypothesis Formulation of a hypothesis Survey literature/Archives.
Analyzing and Interpreting Quantitative Data
Yes….. I expect you to read it! Maybe more than once!
Measures of Variability Variability: describes the spread or dispersion of scores for a set of data.
Chapter 2 AP Psychology Outline
1 Discovering Authorities in Question Answer Communities by Using Link Analysis Pawel Jurczyk, Eugene Agichtein (CIKM 2007)
Chapter 2 The Research Enterprise in Psychology. Table of Contents The Scientific Approach: A Search for Laws Basic assumption: events are governed by.
Validity and Reliability Edgar Degas: Portraits in a New Orleans Cotton Office, 1873.
Research Process Parts of the research study Parts of the research study Aim: purpose of the study Aim: purpose of the study Target population: group whose.
Managerial Economics Demand Estimation & Forecasting.
*Erasmus University Rotterdam P.O. Box 1738, NL-3000 DR Rotterdam, the Netherlands † Teezir BV Wilhelminapark 46, NL-3581 NL, Utrecht, the Netherlands.
SW388R6 Data Analysis and Computers I Slide 1 Multiple Regression Key Points about Multiple Regression Sample Homework Problem Solving the Problem with.
The Statistical Analysis of Data. Outline I. Types of Data A. Qualitative B. Quantitative C. Independent vs Dependent variables II. Descriptive Statistics.
How Useful are Your Comments? Analyzing and Predicting YouTube Comments and Comment Ratings Stefan Siersdorfer, Sergiu Chelaru, Wolfgang Nejdl, Jose San.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Revision Analysing data. Measures of central tendency such as the mean and the median can be used to determine the location of the distribution of data.
Interpersonal Relationships in Group Interaction in CSCW Environments Yang Cao, Golha Sharifi, Yamini Upadrashta, Julita Vassileva University of Saskatchewan,
An Unsupervised Approach for the Detection of Outliers in Corpora David Guthrie Louise Guthire, Yorick Wilks The University of Sheffield.
How Opinions are Received by Online Communities A Case Study on Amazon.com Helpfulness Votes Cristian Danescu-Niculescu-Mizil 1, Gueorgi Kossinets 2, Jon.
Chapter 2 The Research Enterprise in Psychology. Table of Contents The Scientific Approach: A Search for Laws Basic assumption: events are governed by.
Chapter 6: Analyzing and Interpreting Quantitative Data
Criteria for selection of a data collection instrument. 1.Practicality of the instrument: -Concerns its cost and appropriateness for the study population.
A New Approach to Utterance Verification Based on Neighborhood Information in Model Space Author :Hui Jiang, Chin-Hui Lee Reporter : 陳燦輝.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
A Framework to Predict the Quality of Answers with Non-Textual Features Jiwoon Jeon, W. Bruce Croft(University of Massachusetts-Amherst) Joon Ho Lee (Soongsil.
Research in Psychology Chapter Two 8-10% of Exam AP Psychology.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
1 Collecting and Interpreting Quantitative Data Deborah K. van Alphen and Robert W. Lingard California State University, Northridge.
Hypothesis Testing and Statistical Significance
CS791 - Technologies of Google Spring A Web­based Kernel Function for Measuring the Similarity of Short Text Snippets By Mehran Sahami, Timothy.
Introduction Dispersion 1 Central Tendency alone does not explain the observations fully as it does reveal the degree of spread or variability of individual.
BUS 308 Entire Course (Ash Course) For more course tutorials visit BUS 308 Week 1 Assignment Problems 1.2, 1.17, 3.3 & 3.22 BUS 308.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Chapter 2 Research Methods.
Chapter 4 Basic Estimation Techniques
Lecture Slides Elementary Statistics Twelfth Edition
Discussion Discuss your results.
RESEARCH METHODS 8-10% 250$ 250$ 250$ 250$ 500$ 500$ 500$ 500$ 750$
Analyzing Reliability and Validity in Outcomes Assessment Part 1
iSRD Spam Review Detection with Imbalanced Data Distributions
Thinking critically with psychological science
Analyzing Reliability and Validity in Outcomes Assessment
Collecting and Interpreting Quantitative Data
Presentation transcript:

Cristian Danescu-Niculescu-Mizil 1, Gueorgi Kossinets 2, Jon Kleinberg 1, Lillian Lee 1 1 Dept. of Computer Science, Cornell University, 2 Google Inc. WWW 2009 Emin Sadiyev Cmpe 493

Amazon.com layout 2 Average star rating Helpfulness ratio

Outline Users’ evaluation on online reviews: Helpfulness votes Make hypothesis Proving their validity Coming up with a mathematical model that explains these behaviors

Introduction Opinion What did Y think of X? 4

Introduction Meta-Opinion What did Z think of Y’s opinion of X? 5

The Helpfulness of Reviews Widely-used web sites include not just reviews, but also evaluations of the helpfulness of the reviews The helpfulness vote “Was this review helpful to you?” Helpfulness ratio: “a out of b people found the review itself helpful” 6

Flow of Presentation HypothesizingVerifyingModeling 7

Flow of Presentation Hypothesizing Conformity Individual-bias Brilliant-but-cruel Quality-only VerifyingModeling 8

Hypotheses: Social Mechanisms Well-studied hypotheses for how social effects influence group’s reaction to an opinion The conformity hypothesis The individual-bias hypothesis The brilliant-but-cruel hypothesis The quality-only straw-man hypothesis 9

Hypotheses The conformity hypothesis Review is evaluated as more helpful when its star rating is closer to the consensus star rating Helpfulness ratio will be the highest of which reviews have star rating equal to overall average The individual-bias hypothesis When a user considers a review, he or she will rate it more highly if it expresses an opinion that he or she agrees with 10

Hypotheses (contd.) The brilliant-but-cruel hypothesis Negative reviewers are perceived as more intelligent, competent, and expert than positive reviewers The Quality-only straw-man hypothesis Helpfulness is being evaluated purely based on the textual content of reviews Non-textual factors are simply correlates of textual quality 11

Flow of Presentation Hypothesizing Verifying Absolute deviation of helpfulness ratio Signed deviation of helpfulness ratio Variance of star rating and helpfulness ratio Making use of plagiarism Modeling 12

Hypotheses Conformity A review is evaluated as more helpful when its star rating is closer to the average star rating Individual-bias A review is evaluated as more helpful when its star rating is closer to evaluator’s opinion Brilliant-but-cruel A review is evaluated as more helpful when its star rating is below to the average star rating Quality-only Only textual information affects helpfulness evaluation 13

Absolute Deviation from Average  Consistent with conformity hypothesis Strong inverse correlation between the median helpfulness ratio and the absolute deviation Reviews with star rating close to the average gets higher helpfulness ratio 14

Hypotheses Conformity A review is evaluated as more helpful when its star rating is closer to the average star rating Individual-bias A review is evaluated as more helpful when its star rating is closer to evaluator’s opinion Brilliant-but-cruel A review is evaluated as more helpful when its star rating is below to the average star rating Quality-only Only textual information affects helpfulness evaluation 15

Signed Deviation from Average Not consistent with brilliant- but-cruel hypothesis There is tendency towards positivity Black lines should not be sloped that way if it is valid hypothesis 16

Hypotheses Conformity A review is evaluated as more helpful when its star rating is closer to the average star rating Individual-bias A review is evaluated as more helpful when its star rating is closer to evaluator’s opinion Brilliant-but-cruel A review is evaluated as more helpful when its star rating is below to the average star rating Quality-only Only textual information affects helpfulness evaluation 17

Addressing Individual-bias Effects It is hard to distinguish between the conformity and the individual-bias hypothesis We need to examine cases in which individual people’s opinions do not come from exactly the same distribution Cases in which there is high variance in star ratings Otherwise conformity and individual-bias are indistinguishable Everyone has same opinion 18

Variance of Star Rating and Helpfulness Ratio 19 Helpfulness ratio is the highest with reviews of which rating is slightly- above the average Two-humped camel plots: local minimum around average Helpfulness ratio is the highest when star ratings of reviews have average value

Hypotheses Conformity A review is evaluated as more helpful when its star rating is closer to the average star rating Individual-bias A review is evaluated as more helpful when its star rating is closer to evaluator’s opinion Brilliant-but-cruel A review is evaluated as more helpful when its star rating is below to the average star rating Quality-only Only textual information affects helpfulness evaluation 20

Quality-only hypothesis Possible other methods Human annotation Could be subjective Classification using machine learning methods We cannot guarantee the accuracies of algorithms Plagiarized reviews Almost(not exact) same text same text could be considered as spam reviews Different non-textual information If the quality-only straw man hypothesis holds, helpfulness ratios of documents in each pair should be the same 21

Plagiarism Making use of plagiarism is effective way to control for the effect of review text Definition of plagiarized pair(s) of reviews Two or more reviews of different products With near-complete textual overlap Author takes %70 textual overlap as plagiarism 22

An Example Skull-splitting headache guaranteed! If you enjoy thumping, skull splitting migraine headache, then Sing N Learn is for you. As a longtime language instructor, I agree with the attempt and effort that this series makes, but it is the execution that ultimately weakens Sing N Learn Chinese. To be sure, there are much, much better ways to learn Chinese. In fact, I would recommend this title only as a last resort and after you’ve thoroughly exhausted traditional ways to learn Chinese … Migraine Headache at No Extra Charge If you enjoy a thumping, skull splitting migraine headache, then the Sing N Learn series is for you. As a longtime language instructor, I agree with the effort that this series makes, but it is the execution that ultimately weakens Sing N Learn series. To be sure, there are much, much better ways to learn a foreign language. In fact, I would recommend this title only as a last resort and after you’ve thoroughly exhausted traditional ways to learn Korean … 23

Experiments with Plagiarism Text quality is not the only explanatory factor Statistically significant difference between the helpfulness ratios of plagiarized pairs 24 The plagiarized reviews with deviation 1 is significantly more helpful than those with deviation 1.5

Hypotheses Conformity A review is evaluated as more helpful when its star rating is closer to the average star rating Individual-bias A review is evaluated as more helpful when its star rating is closer to evaluator’s opinion Brilliant-but-cruel A review is evaluated as more helpful when its star rating is below to the average star rating Quality-only Only textual information affects helpfulness evaluation 25

Flow of Presentation HypothesizingVerifying Modeling Based on individual bias and mixtures of distributions 26

Authors’ Model Based on individual bias and mixtures of distributions Two distributions: one for positive, one for negative evaluators Balance between positive and negative evaluators: Controversy level: Density function of helpfulness ratios of positive evaluators Gaussian distribution of which average is -centered Density function of helpfulness ratios of negative evaluators Gaussian distribution of which average is -centered 27

Validity of the Model Empirical observation and model generated 28

Conclusion A review’s perceived helpfulness depends not just on its content, but also the relation of its score to other scores The dependence of the score is consistent with a simple and natural model of individual-bias in the presence of a mixture of opinion distributions Directions for further research Variations in the effect can be used to form hypotheses about differences in the collective behaviors of the underlying populations It would be interesting to consider social feedback mechanisms that might be capable of modifying the effects authors observed here Considering possible outcomes of design problem for systems enabling the expression and dissemination of opinions 29

Discussions So, how can we use this? In which cases would this information be helpful? Available information is very limited Star ratings Helpfulness ratios Conclusion is rather trivial Does not present new discoveries 30