Show me the Money! Deriving the Pricing Power of Product Features by Mining Consumer Reviews. Nikolay Archak, Anindya Ghose, Panagiotis Ipeirotis New York.

Slides:



Advertisements
Similar presentations
Mobile and Internet Systems Group Reputation Premiums in Electronic Peer-to-Peer Markets: Analyzing Textual Feedback and Network Structure Presenter: Sean.
Advertisements

Farag Saad i-KNOW 2014 Graz- Austria,
Big Data Stupid Decisions The Importance Of Measuring What We Should Be Measuring Stern School of Business, New York University “A Computer.
Pattern Recognition and Machine Learning: Kernel Methods.
Learning to Cluster Web Search Results SIGIR 04. ABSTRACT Organizing Web search results into clusters facilitates users quick browsing through search.
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
Opinion Spam and Analysis Nitin Jindal and Bing Liu Department of Computer Science University of Illinois at Chicago.
Anindya Ghose Panos Ipeirotis Arun Sundararajan Stern School of Business New York University Opinion Mining using Econometrics A Case Study on Reputation.
A Novel Lexicalized HMM-based Learning Framework for Web Opinion Mining Wei Jin Department of Computer Science, North Dakota State University, USA Hung.
Product Review Summarization from a Deeper Perspective Duy Khang Ly, Kazunari Sugiyama, Ziheng Lin, Min-Yen Kan National University of Singapore.
Latent Aspect Rating Analysis without Aspect Keyword Supervision Hongning Wang, Yue Lu, ChengXiang Zhai Department of.
Sentiment Lexicon Creation from Lexical Resources BIS 2011 Bas Heerschop Erasmus School of Economics Erasmus University Rotterdam
CONTENT-BASED BOOK RECOMMENDING USING LEARNING FOR TEXT CATEGORIZATION TRIVIKRAM BHAT UNIVERSITY OF TEXAS AT ARLINGTON DATA MINING CSE6362 BASED ON PAPER.
1 Collaborative Filtering: Latent Variable Model LIU Tengfei Computer Science and Engineering Department April 13, 2011.
Applied Business Forecasting and Planning
Sentiment Analysis  Some Important Techniques  Discussions: Based on Research Papers.
Get Another Label? Using Multiple, Noisy Labelers Joint work with Victor Sheng and Foster Provost Panos Ipeirotis Stern School of Business New York University.
Panos Ipeirotis Stern School of Business New York University Analyzing User-Generated Content using Econometrics.
Panos Ipeirotis Stern School of Business New York University Structuring and querying online opinions using econometrics.
Designing Ranking Systems for Hotels on Travel Search Engines by Mining User-Generated and Crowd sourced Content Author - Anindya Ghose, Panagiotis G.
Nikolay Archak,Anindya Ghose,Panagiotis G. Ipeirotis Class Presentation By: Arunava Bhattacharya.
1 Extracting Product Feature Assessments from Reviews Ana-Maria Popescu Oren Etzioni
Mining and Summarizing Customer Reviews
Mining the Peanut Gallery: Opinion Extraction and Semantic Classification of Product Reviews K. Dave et al, WWW 2003, citations Presented by Sarah.
A Joint Model of Feature Mining and Sentiment Analysis for Product Review Rating Jorge Carrillo de Albornoz Laura Plaza Pablo Gervás Alberto Díaz Universidad.
Mining and Summarizing Customer Reviews Minqing Hu and Bing Liu University of Illinois SIGKDD 2004.
Panos Ipeirotis Stern School of Business New York University Opinion Mining using Econometrics A Case Study on Reputation Systems Join work with Anindya.
A Random Walk on the Red Carpet: Rating Movies with User Reviews and PageRank Derry Tanti Wijaya Stéphane Bressan.
1 Opinion Spam and Analysis (WSDM,08)Nitin Jindal and Bing Liu Date: 04/06/09 Speaker: Hsu, Yu-Wen Advisor: Dr. Koh, Jia-Ling.
Opinion Mining Using Econometrics: A Case Study on Reputation Systems Anindya Ghose, Panagiotis G. Ipeirotis, and Arun Sundararajan Leonard N. Stern School.
Panos Ipeirotis Stern School of Business New York University Opinion Mining Using Econometrics.
Panos Ipeirotis New York University Opinion Mining using Econometrics A Case Study on Reputation Systems Joint work with Anindya Ghose and Arun Sundararajan.
Designing Ranking Systems for Consumer Reviews: The Economic Impact of Customer Sentiment in Electronic Markets Anindya Ghose Panagiotis Ipeirotis Stern.
GAUSSIAN PROCESS FACTORIZATION MACHINES FOR CONTEXT-AWARE RECOMMENDATIONS Trung V. Nguyen, Alexandros Karatzoglou, Linas Baltrunas SIGIR 2014 Presentation:
Panos Ipeirotis New York University Opinion Mining using Econometrics A Case Study on Reputation Systems Joint work with Anindya Ghose and Arun Sundararajan.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
Panos Ipeirotis Stern School of Business New York University Text Mining of Electronic News Content for Economic Research “On the Record”: A Forum on Electronic.
Panos Ipeirotis Stern School of Business New York University Opinion Mining using Econometrics A Case Study on Reputation Systems Join work with Anindya.
Statistical Methods Statistical Methods Descriptive Inferential
Sentiment Detection Naveen Sharma( ) PrateekChoudhary( ) Yashpal Meena( ) Under guidance Of Prof. Pushpak Bhattacharya.
Opinion Mining of Customer Feedback Data on the Web Presented By Dongjoo Lee, Intelligent Databases Systems Lab. 1 Dongjoo Lee School of Computer Science.
1. 2 Traditional Income Statement LO1: Prepare a contribution margin income statement.
MODEL ADAPTATION FOR PERSONALIZED OPINION ANALYSIS MOHAMMAD AL BONI KEIRA ZHOU.
Department of Electrical Engineering and Computer Science Kunpeng Zhang, Yu Cheng, Yusheng Xie, Doug Downey, Ankit Agrawal, Alok Choudhary {kzh980,ych133,
Prepared by: Mahmoud Rafeek Al-Farra College of Science & Technology Dep. Of Computer Science & IT BCs of Information Technology Data Mining
CoCQA : Co-Training Over Questions and Answers with an Application to Predicting Question Subjectivity Orientation Baoli Li, Yandong Liu, and Eugene Agichtein.
Sentiment Analysis Introduction Data Source for Sentiment analysis
Xutao Li1, Gao Cong1, Xiao-Li Li2
Pairwise Preference Regression for Cold-start Recommendation Speaker: Yuanshuai Sun
Comparative Experiments on Sentiment Classification for Online Product Reviews Hang Cui, Vibhu Mittal, and Mayur Datar AAAI 2006.
McCormick Northwestern Engineering 1 Electrical Engineering & Computer Science Mining Millions of Reviews: A Technique to Rank Products Based on Importance.
Opinion Observer: Analyzing and Comparing Opinions on the Web
Show Me the Money! Deriving the Pricing Power of Product Features by Mining Consumer Reviews Nikolay Archak, Anindya Ghose, and Panagiotis G. Ipeirotis.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
"Estimating the Helpfulness and Economic Impact of Product Reviews: Mining Text and Reviewer Characteristics“ by Anindya Ghose and Panagiotis Ipeirotis.
Unsupervised Streaming Feature Selection in Social Media
Machine Learning in Practice Lecture 2 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Opinion Observer: Analyzing and Comparing Opinions on the Web WWW 2005, May 10-14, 2005, Chiba, Japan. Bing Liu, Minqing Hu, Junsheng Cheng.
COMP423 Summary Information retrieval and Web search  Vecter space model  Tf-idf  Cosine similarity  Evaluation: precision, recall  PageRank 1.
Regularized Least-Squares and Convex Optimization.
CMS SAS Users Group Conference Learn more about THE POWER TO KNOW ® October 17, 2011 Medicare Payment Standardization Modeling using SAS Enterprise Miner.
Opinion spam and Analysis 소프트웨어공학 연구실 G 최효린 1 / 35.
Nikolay Archak, Anindya Ghose, Panagiotis Ipeirotis
Erasmus University Rotterdam
Aspect-based sentiment analysis
Fenglong Ma1, Jing Gao1, Qiuling Suo1
RECOMMENDER SYSTEMS WITH SOCIAL REGULARIZATION
Analysis for Predicting the Selling Price of Apartments Pratik Nikte
Ontology-Enhanced Aspect-Based Sentiment Analysis
Presentation transcript:

Show me the Money! Deriving the Pricing Power of Product Features by Mining Consumer Reviews. Nikolay Archak, Anindya Ghose, Panagiotis Ipeirotis New York University Stern School of Business Information Systems Group, IOMS department

Word of “Mouse” Consumer reviews  Derived from user experience  Describe different product features  Provide subjective evaluations of product features I love virtually everything about this camera....except the lousy picture quality. The camera looks great, feels nice, is easy to use, starts up quickly, and is of course waterproof. It fits easily in a pocket and the battery lasts for a reasonably long period of time. CommentComment | Was this review helpful to you? (Report this) (Report this)Report this

Existing work Identifying product features  Hu, Liu (AAAI, 2004)  Ghani, Probst, Liu, Krema, Fano (KDD, 2006)  Scaffidi (2006) Sentiment classification  Das, Chen (2001)  Turney, Littman (ACL, 2003)  Dave, Lawrence, Pennock (WWW, 2003)  Hu, Liu (KDD, 2004)  Popescu, Etzioni, (EMNLP, 2005) Opinion Analysis  Hu, Liu, Cheng (WWW, 2005)

Research Questions How important is each product feature to customers? What is the pragmatic polarity and strength of customers’ opinions? Sales data provides valuable clues

Examine changes in demand and estimate weights of features and strength of evaluations. Overview of our Approach “poor lenses” +3% “excellent lenses” -1% “poor photos” +6% “excellent photos” -2%  Feature “photos” is twice more important than “lenses”  “Excellent” is positive, “poor” is negative  “Excellent” is three times stronger than “poor”

Economic background – Hedonic goods and hedonic regressions We are not the first to measure weights of product features. Economists are doing this for years. Hedonic goods [Rosen, 1974]:  Each good is characterized by the set of its objectively measured features  Preferences of consumers are solely determined by features of available goods  Are all goods hedonic? Hedonic regressions:  log(CameraPrice) = const + b 1 *NumMegapixels + b 2 *Zoom + b 3 *StorageSize +…

Hedonic regressions with subjectively measured features Problem: traditional hedonic regressions include only objectively measured features Our solution: introduce review evaluations into the hedonic framework. Each opinion assigns implicit subjective score to a feature [We don’t know the scores]. For example: review1 says “excellent lenses” [implicit opinion score: 0.7] and “nice lenses” [implicit opinion score: 0.3] review2 says “decent lenses” [implicit opinion score: -0.1]  Average score of the “lenses” feature is: [ ] / 3 = 0.3

Representing consumer review(s) N x – opinion phrase frequency W x – opinion phrase weight s – smoothing factor excellentpoorgood ejej lenses photos 000.3… ease of use 00.40… fifi ……… W ij Evaluations Features Matrix [tensor] representation allows us naturally estimate feature weights and evaluation scores.

Our Model log (Demand) = a + b* log (Price) + b 1 * Megapixels + b 2 * Zoom + … Ψ 11 *W[“excellent lenses”] + Ψ 12 *W[“great lenses”] Ψ 1M *W[“terrible lenses”] + Ψ 21 *W[“excellent photos”] + Ψ 22 *W[“great photos”] + … + Ψ 2M *W[“terrible photos”] + … Ψ N1 *W[“excellent size”] + Ψ N2 *W[“great size”] Ψ NM *W[“terrible size”]

Technical Challenge – Reduce the Number of Parameters Solution: place a rank constraint Special case (p = 1): independent features weights and evaluation scores

Amazon.com Dataset Product Category “Audio & Video” “Camera & Photo” Number of products Number of sales rank observations 35,14331,233 Number of reviews 2,5801,955 PeriodApril 2005 – May 2006

Results - Feature Weights for “Camera & Photo”

Results - Evaluation Coefficients for “Camera & Photo”

Partial effects for “Camera & Photo” great camera0.4235decent battery good camera0.1128decent quality great quality0.0931poor quality good quality0.0385bad camera great battery0.0138fine camera Partial effect of an opinion phrase: score of the “average review” where all evaluations of the feature f are replaced by the evaluation e minus score of the “average review”.

Predictive power of product reviews Goal: predict future sales using review text Model test: 10-fold cross validation (product holdout) Compared with model that ignores text but keeps numeric variables including average review rating Average RMSE improvement 5%, Avg. Err improvement 3%

Conclusions We provided technique for: Measuring importance of product features for consumers Identifying polarity and strength of user evaluations Alleviating problem of data sparseness

Thank you! Comments? Questions?

Related Work  Chevalier, Mayzlin (2006)  Chevalier, Goolsbee (2003)  Ghani, Probst, Liu, Krema, Fano (2006)  Hu, Liu (2004)  Hu, Liu, Cheng (2005)  Turney (2002)  Pang, Lee (2005)  Popescu, Etzioni (2005)