Data mining (KDD) process

Slides:



Advertisements
Similar presentations
Web Mining.
Advertisements

COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
CS583 – Data Mining and Text Mining
Civil and Environmental Engineering Carnegie Mellon University Sensors & Knowledge Discovery (a.k.a. Data Mining) H. Scott Matthews April 14, 2003.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Data Mining By Archana Ketkar.
Chapter 14 The Second Component: The Database.
CS 5831 CS583 – Data Mining and Text Mining Course Web Page 05/cs583.html.
CS 5941 CS583 – Data Mining and Text Mining Course Web Page 05/cs583.html.
CS583 – Data Mining and Text Mining
CS583 – Data Mining and Text Mining Course Web Page 07/cs583.html.
What have you learned about technologies from the process of constructing this product?
Mining and Searching Opinions in User-Generated Contents Bing Liu Department of Computer Science University of Illinois at Chicago.
A Holistic Lexicon-Based Approach to Opinion Mining
Chapter 11. Opinion Mining. Bing Liu, UIC ACL-07 2 Introduction – facts and opinions Two main types of information on the Web.  Facts and Opinions Current.
Mining and Summarizing Customer Reviews
CES 514 – Data Mining Spring 2010 Sonoma State University.
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
Mining and Summarizing Customer Reviews Minqing Hu and Bing Liu University of Illinois SIGKDD 2004.
Data Mining Techniques As Tools for Analysis of Customer Behavior
Sudeshna Sarkar IIT Kharagpur
A Holistic Lexicon-Based Approach to Opinion Mining Xiaowen Ding, Bing Liu and Philip Yu Department of Computer Science University of Illinois at Chicago.
1 Entity Discovery and Assignment for Opinion Mining Applications (ACM KDD 09’) Xiaowen Ding, Bing Liu, Lei Zhang Date: 09/01/09 Speaker: Hsu, Yu-Wen Advisor:
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
FEATURE BASED RECOMMENDER SYSTEMS Mehrbod Sharifi Language Technology Institute Carnegie Mellon University 1.
Designing Ranking Systems for Consumer Reviews: The Economic Impact of Customer Sentiment in Electronic Markets Anindya Ghose Panagiotis Ipeirotis Stern.
CS 5831 CS583 – Data Mining and Text Mining Course Web Page 06/cs583.html.
Visualization of Product Review. 2 Introduction To activate the on-line purchase, most of shopping mall adopts a strategy which records the “ Product.
1 Team Members: Rohan Kothari Vaibhav Mehta Vinay Rambhia Hybrid Review System.
Opinion Mining of Customer Feedback Data on the Web Presented By Dongjoo Lee, Intelligent Databases Systems Lab. 1 Dongjoo Lee School of Computer Science.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Customer Experience at. Experience Good:  Web site provided lots of information:  Technical specs  Customer reviews  Picture of what the cameras.
3-1 Data Mining Kelby Lee. 3-2 Overview ¨ Transaction Database ¨ What is Data Mining ¨ Data Mining Primitives ¨ Data Mining Objectives ¨ Predictive Modeling.
Information Extraction and Integration Bing Liu Department of Computer Science University of Illinois at Chicago (UIC)
MIS2502: Data Analytics Advanced Analytics - Introduction.
DATA MINING It is a process of extracting interesting(non trivial, implicit, previously, unknown and useful ) information from any data repository. The.
COMP423 Summary Information retrieval and Web search  Vecter space model  Tf-idf  Cosine similarity  Evaluation: precision, recall  PageRank 1.
E commerce Online Shopping Website at Rs. 7920/-.
Marketing. Kurt Foriska The Ohio State University.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
Research Progress Kieu Que Anh School of Knowledge, JAIST.
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
CS583 – Data Mining and Text Mining
Data Mining.
Data Mining – Intro.
Websites and Weblogs with Weebly
MIS2502: Data Analytics Advanced Analytics - Introduction
CS583 – Data Mining and Text Mining
MIS 451 Building Business Intelligence Systems
Top Six Web Design Tips for eCommerce Website
Memory Standardization
Introduction to Digital Photography
Introduction C.Eng 714 Spring 2010.
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Chapter 18 Buying a PC.
Information Visualization Picture worth 1000 words...
MIS5101: Data Analytics Advanced Analytics - Introduction
Data Warehousing and Data Mining
The strategy to strengthen on-line customer’s
[Your company] Business Plan [Street Address City, State & Zip Code
CS583 – Data Mining and Text Mining
PRACTICE MAKES PERFECT
T H E P U B G P R O J E C T.
Introduction to Digital Photography
Fundamentals of Marketing
Understanding Customer Behaviors with Information Technologies
MIS2502: Data Analytics Introduction to Advanced Analytics
Welcome! Knowledge Discovery and Data Mining
CSE591: Data Mining by H. Liu
PolyAnalyst™ text mining tool Allstate Insurance example
Presentation transcript:

Data mining (KDD) process Understand the application domain Identify data sources and select target data Pre-process: cleaning, attribute selection Data mining to extract patterns or models Post-process: identifying interesting or useful patterns Incorporate patterns in real world tasks CSE 321

Data mining applications Marketing, customer profiling and retention, identifying potential customers, market segmentation. Fraud detection identifying credit card fraud, intrusion detection Scientific data analysis Text and web mining Any application that involves a large amount of data … CSE 321

Web data extraction Data region1 A data record A data record CSE 321

Align and extract data items (e.g., region1) image1 EN7410 17-inch LCD Monitor Black/Dark charcoal $299.99 Add to Cart (Delivery / Pick-Up ) Penny Shopping Compare image2 17-inch LCD Monitor $249.99 image3 AL1714 17-inch LCD Monitor, Black $269.99 image4 SyncMaster 712n 17-inch LCD Monitor, Black Was: $369.99 Save $70 After: $70 mail-in-rebate(s) CSE 321

Opinion Analysis Word-of-mouth on the Web The Web has dramatically changed the way that consumers express their opinions. One can post reviews of products at merchant sites, Web forums, discussion groups, blogs Techniques are being developed to exploit these sources. Benefits of Review Analysis Potential Customer: No need to read many reviews Product manufacturer: market intelligence, product benchmarking CSE 321

Feature Based Analysis & Summarization Extracting product features (called Opinion Features) that have been commented on by customers. Identifying opinion sentences in each review and deciding whether each opinion sentence is positive or negative. Summarizing and comparing results. CSE 321

An example GREAT Camera., Jun 3, 2004 Reviewer: jprice174 from Atlanta, Ga. I did a lot of research last year before I bought this camera... It kinda hurt to leave behind my beloved nikon 35mm SLR, but I was going to Italy, and I needed something smaller, and digital. The pictures coming out of this camera are amazing. The 'auto' feature takes great pictures most of the time. And with digital, you're not wasting film if the picture doesn't come out. … …. Summary: Feature1: picture Positive: 12 The pictures coming out of this camera are amazing. Overall this is a good camera with a really good picture clarity. … Negative: 2 The pictures come out hazy if your hands shake even for a moment during the entire process of taking a picture. Focusing on a display rack about 20 feet away in a brightly lit room during day time, pictures produced by this camera were blurry and in a shade of orange. Feature2: battery life CSE 321

Visual Comparison + + _ _ Summary of reviews of Digital camera 1 Picture Battery Zoom Size Weight + Comparison of reviews of Digital camera 1 Digital camera 2 _ CSE 321