Data Mining Concept Submitted TO: Mrs. MONIKA SUBMITTED BY: SHALU 4717.

Slides:



Advertisements
Similar presentations
An Introduction to Data Mining
Advertisements

Data Mining Tools Overview Business Intelligence for Managers.
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Week 9 Data Mining System (Knowledge Data Discovery)
Data Mining By Archana Ketkar.
Data Mining Adrian Tuhtan CS157A Section1.
Data Mining – Intro.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Data Mining: A Closer Look
Data Mining.
Business Intelligence
CIT 858: Data Mining and Data Warehousing Course Instructor: Bajuna Salehe Web:
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
GUHA method in Data Mining Esko Turunen Tampere University of Technology Tampere, Finland.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
Knowledge Discovery & Data Mining process of extracting previously unknown, valid, and actionable (understandable) information from large databases Data.
MAKING THE BUSINESS BETTER Presented By Mohammed Dwikat DATA MINING Presented to Faculty of IT MIS Department An Najah National University.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Tang: Introduction to Data Mining (with modification by Ch. Eick) I: Introduction to Data Mining A.Short Preview 1.Initial Definition of Data Mining 2.Motivation.
Spatial Statistics and Spatial Knowledge Discovery First law of geography [Tobler]: Everything is related to everything, but nearby things are more related.
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Data Clustering 1 – An introduction
Chapter 1 Introduction to Data Mining
Data MINING Data mining is the process of extracting previously unknown, valid and actionable information from large data and then using the information.
Data Mining By Dave Maung.
Classification and Prediction Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot Readings: Chapter 6 – Han and Kamber.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
1 Improving quality of graduate students by data mining Asst. Prof. Kitsana Waiyamai, Ph.D. Dept. of Computer Engineering Faculty of Engineering, Kasetsart.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
3-1 Data Mining Kelby Lee. 3-2 Overview ¨ Transaction Database ¨ What is Data Mining ¨ Data Mining Primitives ¨ Data Mining Objectives ¨ Predictive Modeling.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Chapter 14 Data Mining Transparencies. 2 Chapter Objectives u The concepts associated with data mining. u The main features of data mining operations,
Apache Mahout Qiaodi Zhuang Xijing Zhang.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Academic Year 2014 Spring Academic Year 2014 Spring.
Data Mining Copyright KEYSOFT Solutions.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Waqas Haider Bangyal. 2 Source Materials “ Data Mining: Concepts and Techniques” by Jiawei Han & Micheline Kamber, Second Edition, Morgan Kaufmann, 2006.
 What is a CRM  Uses of a CRM  What is Data Mining  Data Mining Tasks  How a CRM Utilizes Data Mining  Companies who Use CRM Data Mining.
DATA MINING TECHNIQUES (DECISION TREES ) Presented by: Shweta Ghate MIT College OF Engineering.
Copyright  2007 McGraw-Hill Pty Ltd PPTs t/a Marketing Research 2e by Lukas, Hair, Bush and Ortinau Slides prepared by Judy Rex 19-1 Chapter Nineteen.
Data Mining – Introduction (contd…) Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
Department of Computer Science Sir Syed University of Engineering & Technology, Karachi-Pakistan. Presentation Title: DATA MINING Submitted By.
Data Mining: Confluence of Multiple Disciplines Data Mining Database Systems Statistics Other Disciplines Algorithm Machine Learning Visualization.
Data Mining Functionalities
Data Mining.
Data Mining – Intro.
What Is Cluster Analysis?
By Arijit Chatterjee Dr
MIS2502: Data Analytics Advanced Analytics - Introduction
DATA MINING © Prentice Hall.
Introduction to Data Mining
MIS 451 Building Business Intelligence Systems
Data Mining 101 with Scikit-Learn
Data and Applications Security Introduction to Data Mining
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Adrian Tuhtan CS157A Section1
Sangeeta Devadiga CS 157B, Spring 2007
Data Analysis.
CSE591: Data Mining by H. Liu
Prepared by: Mahmoud Rafeek Al-Farra
Data Warehousing Data Mining Privacy
Data Mining: Concepts and Techniques
©Jiawei Han and Micheline Kamber
CSE591: Data Mining by H. Liu
Presentation transcript:

Data Mining Concept Submitted TO: Mrs. MONIKA SUBMITTED BY: SHALU 4717

Content oData, Information& Knowledge oWhat is data mining? oNeed of data mining oOn What Kind Of Data? oData Mining Vs Data Warehouse oKnowledge Discovery In Databases oData Mining Vs KDD oData Mining Tasks oApplications of Data Mining oData Mining Tools

Data, Information& Knowledge

What is data mining?  Data mining is a process that uses a variety of data analysis tools to discover patterns and relationships in data that may be used to make valid and accurate predictions.

Need of data mining  Data, Data everywhere … We are drowning in data but starving for knowledge !!

On What Kind Of Data? It is not specific to one type of the data. o Flat files oRelational databases oData warehouse o Multimedia Databases oSpatial Databases: oTime-Series Databases:

DATA MINING VS DATA WAREHOUSE  Data mining is the process of extracting meaningful data from that database Example : Credit Card  Data warehousing is the process of centralizing or aggregating data from multiple sources into one common repository Example : Facebook So, The data mining process relies on the data compiled in the data warehousing phase in order to detect meaningful patterns.

Knowledge Discovery In Databases  Data mining is actually one step of a larger process known as Knowledge Discovery in Databases.  The iterative process consists of the following steps:

DATA MINING VS KDD

Data MINING TASK Clustering is a process of partitioning a set of data (or objects) in a set of meaningful sub-classes, called clusters

CLUSTERING ALGORITHM oK-means clustering algorithm oInput: a database D, of m records r1,…, rm and a desired number of clusters. k oOutput: set of k clusters Begin Randomly choose k records as the centroids for the k clusters’ Repeat Assign each record, ri, to a cluster such that the distance between ri and the cluster centroid (mean) is the smallest among the k clusters; Recalculate the centroid (mean) for each cluster based on the records assigned to the cluster; Until no change; End;

EXAMPLE

 CLASSIFICATION AND PREDICTION Classification Predicts categorical class labels (discrete or nominal) " Use labels of the training data to classify new data Example : A marketing manager at a company needs to analyse a customer with a given profile, who will buy a new computer Prediction Models continuous-valued functions, i.e., predicts unknown or missing values Prediction is a short name for numeric prediction Example: A marketing manager would like to predict how much a given costumer will spend during a sale

CLASSIFICATION STEPS Step1: Model Construction Classification Classifier Algorithm Training data Age Income Class 27 28K Budget-Spender 35 36K Big-Spender 38 28K Budget-Spender 65 45K Budget-Spender 20 18k Budget-Spender 75 40k Budget-Spender 28 50k Big-Spender 40 60k Big-Spender 60 65k Big-Spender If age <30 & income <30k Then, Budget-Spender If age 30k Then, Big-Spender If 30 30k Then Big-spender If 30<age<60 and income <30k Then Budget-spender If age >60, Then Budget-spender

Step2 : Model Usage 1 - Test the classifier Age Income Class label 27 28K Budget-Spenders 25 36K Big-Spenders 70 45K Budget-Spenders 40 35k Big-Spender 2 - If acceptable accuracy Unlabelled data Age Income 18 28K 37 40K 60 45K 40 36k Test Classifier Accuracy Classified data Classifier Age Income Class label 18 28K Budget-Spenders 37 40K Big-Spenders 60 45K Budget-Spenders 40 36k Budget-Spenders

 Association : Association is the discovery of togetherness or connection of objects. Such kind of togetherness or connection is termed as association rule.  Summarization : It is the process of representing the collected data in an accurate and compact way without losing any information, it also involves getting a information from collected data. Example : Long distance calls of customer

Applications of Data Mining Data mining is widely used in diverse areas. There are a number of commercial data mining system available today and yet there are many challenges in this field. The list of areas where data mining is widely used : oRetail Industry oTelecommunication Industry oBiomedical and DNA data analysis oFinancial Data Analysis

DATA MINING TOOLS Oracle Data Miner m/odminer.html Data To Knowledge SAS Clementine Intelligent Miner