1 What is Data Mining? l Data mining is the process of automatically discovering useful information in large data repositories. l There are many other.

Slides:



Advertisements
Similar presentations
Data Analysis Santiago González. Contents  Introduction  CRISP-DM(1)  Tools  Data understanding  Data preparation  Modeling (2)  Association rules.
Advertisements

QMM 384 – Data Mining Data Mining: Introduction Introduction to Predictive Analytics.
CPS : Information Management and Mining Shivnath Babu.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Introduction Introduction to Data Mining by Tan, Steinbach, Kumar.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Introduction Lecture Notes for Chapter 1 Introduction to Data Mining by Tan,
Data Mining: Introduction
MIS 111: Computers and the Inter-networked Society Class 11: Data Mining July 25th, 2011.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Introduction Lecture Notes for Chapter 1 Introduction to Data Mining by Tan,
© Vipin Kumar CSci 8980 (Data Mining) Fall CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance Computing Research Center Department.
BUS 297D: Data Mining Professor David Mease Lecture 1 Agenda:
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Introduction Lecture Notes for Chapter 1 Introduction to Data Mining by Tan,
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Decision Support: Data Mining Introduction.
Data Mining: Introduction
Data Mining and Business Intelligence
Object Oriented Data Base. Traditional Data Models : Hierarchical, Network (since mid- 60’s), Relational (since 1970 and commercially since 1982) Object.
Data Mining: Introduction. Why Data Mining? l The Explosive Growth of Data: from terabytes to petabytes –Data collection and data availability  Automated.
Tang: Introduction to Data Mining (with modification by Ch. Eick) I: Introduction to Data Mining A.Short Preview 1.Initial Definition of Data Mining 2.Motivation.
1 Business System Analysis & Decision Making - Lecture 14 Zhangxi Lin ISQS 5340 Summer II 2006.
Data Mining CS157B Fall 04 Professor Lee By Yanhua Xue.
1 1 Slide Introduction to Data Mining and Business Intelligence.
Introduction to Data Mining. Why Mine the Data? Lots of data is being collected and warehoused – Web data, e-commerce – purchases at department/ grocery.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Introduction Lecture Notes for Chapter 1 Introduction to Data Mining by Tan,
CSE4334/5334 DATA MINING CSE 4334/5334 Data Mining, Fall 2011 Department of Computer Science and Engineering, University of Texas at Arlington Chengkai.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Introduction Lecture Notes for Chapter 1 Introduction to Data Mining by Tan,
1 Data Mining: Introduction Chapter 1 of Introduction to Data Mining by Tan, Steinbach, Kumar.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Introduction Lecture Notes for Chapter 1 Introduction to Data Mining by Minqi.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Introduction to Data Mining Jinze Liu April 8 th, 2009.
Christoph Eick Introduction to Data Mining 8/19/ Dr. Eick 2. COSC.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Introduction Lecture Notes for Chapter 1 Introduction to Data Mining by Tan,
COMSATS Institute of Information Technology Department of Computer Science Databases and Information Systems Dr. Ramzan Talib Databases and Information.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Waqas Haider Bangyal. 2 Source Materials “ Data Mining: Concepts and Techniques” by Jiawei Han & Micheline Kamber, Second Edition, Morgan Kaufmann, 2006.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
An Introduction to Data Mining
Department of Computer Science Sir Syed University of Engineering & Technology, Karachi-Pakistan. Presentation Title: DATA MINING Submitted By.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Lecture Notes for Chapter 1 Introduction to Data Mining.
Modul 1: Introduction. Topics  Definitions  Business intelligence  DW & OLAP  Data mining  Data Warehousing and Data Mining Motivation  Data mining.
Data Mining: Introduction
Introduction to Game Data Mining
MIS2502: Data Analytics Advanced Analytics - Introduction
Data Mining Introduction
Data Mining: Introduction
Statistics 202: Statistical Aspects of Data Mining
Data Mining: Introduction
Introduction to Data Mining Part 1 Knowledge Sources COSC 6335 Webpage
Introduction to Data Mining- CMPT 741 Instructor: Ke Wang
Data mining and real systems modeling
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Introduction Lecture Notes for Chapter 1 Introduction to Data Mining by Tan,
Data Mining: Introduction
Techniques for Finding Patterns in Large Amounts of Data: Applications in Biology Vipin Kumar William Norris Professor and Head, Department of Computer.
Data Mining: Introduction
Data Mining: Introduction
Data Mining: Introduction
Data Mining: Introduction
Data Mining: Introduction
Data Mining: Introduction
Data Mining: Introduction
Data Mining: Introduction
Data Mining: Introduction
Data Mining: Introduction
Data Mining: Introduction
Data Mining: Introduction
Data Mining: Introduction
Data Mining: Introduction
Data Mining: Introduction
Data Mining: Introduction
Data Mining: Introduction
Data Mining: Introduction
Presentation transcript:

1 What is Data Mining? l Data mining is the process of automatically discovering useful information in large data repositories. l There are many other definitions The problem/question of interest

2 Data Mining Examples and Non-Examples Data Mining: -Certain names are more prevalent in certain US locations (O’Brien, O’Rurke, O’Reilly… in Boston area) -Group together similar documents returned by search engine according to their context (e.g. Amazon rainforest, Amazon.com, etc.) NOT Data Mining: -Look up phone number in phone directory -Query a Web search engine for information about “Amazon”

3 Why Mine Data? Scientific Viewpoint l Data collected and stored at enormous speeds (GB/hour) – remote sensors on a satellite – telescopes scanning the skies – microarrays generating gene expression data – scientific simulations generating terabytes of data l Traditional techniques infeasible for raw data l Data mining may help scientists – in classifying and segmenting data – in hypothesis formation

4 Why Mine Data? Commercial Viewpoint l Lots of data is being collected and warehoused – Web data, e-commerce – Purchases at department/ grocery stores – Bank/credit card transactions l Computers have become cheaper and more powerful l Competitive pressure is strong – Provide better, customized services for an edge

5 In class exercise #1: Give an example of something you did yesterday or today which resulted in data which could potentially be mined to discover useful information.

6 Origins of Data Mining Draws ideas from machine learning, AI, pattern recognition and statistics lTraditional techniques may be unsuitable due to –Enormity of data –High dimensionality of data –Heterogeneous, distributed nature of data Statistics Data Mining AI/Machine Learning/ Pattern Recognition

7 2 Types of Data Mining Tasks l Prediction Methods: Use some variables to predict unknown or future values of other variables. l Description Methods: Find human-interpretable patterns that describe the data.

8 What is Data? l An attribute is a property or characteristic of an object l Examples: eye color of a person, temperature, etc. l Attribute is also known as variable, field, characteristic, or feature l A collection of attributes describe an object l Object is also known as record, point, case, sample, entity, instance, or observation Attributes Objects