TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 1 31.03.2014 (Muscat, Oman) DATA MINING.

Slides:



Advertisements
Similar presentations
Supporting End-User Access
Advertisements

Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
1. Abstract 2 Introduction Related Work Conclusion References.
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
Week 9 Data Mining System (Knowledge Data Discovery)
Data Mining Knowledge Discovery in Databases Data 31.
Data Mining By Archana Ketkar.
Data Mining Adrian Tuhtan CS157A Section1.
Data Mining – Intro.
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Data Mining: A Closer Look
Chapter 5 Data mining : A Closer Look.
Data Mining.
Chapter 35 Data Mining Transparencies. 2 Chapter Objectives u The concepts associated with data mining. u The main features of data mining operations,
CIT 858: Data Mining and Data Warehousing Course Instructor: Bajuna Salehe Web:
Data Mining & Data Warehousing PresentedBy: Group 4 Kirk Bishop Joe Draskovich Amber Hottenroth Brandon Lee Stephen Pesavento.
Computer Science Universiteit Maastricht Institute for Knowledge and Agent Technology Data mining and the knowledge discovery process Summer Course 2005.
Data Mining CS 157B Section 2 Keng Teng Lao. Overview Definition of Data Mining Application of Data Mining.
Enterprise systems infrastructure and architecture DT211 4
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Application of SAS®! Enterprise Miner™ in Credit Risk Analytics
Dr. Awad Khalil Computer Science Department AUC
Chapter 5: Data Mining for Business Intelligence
Data Mining Techniques
MAKING THE BUSINESS BETTER Presented By Mohammed Dwikat DATA MINING Presented to Faculty of IT MIS Department An Najah National University.
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
Data Mining Dr. Chang Liu. What is Data Mining Data mining has been known by many different terms Data mining has been known by many different terms Knowledge.
1 Data Mining DT211 4 Refer to Connolly and Begg 4ed.
Data Mining Solutions (Westphal & Blaxton, 1998) Dr. K. Palaniappan Dept. of Computer Engineering & Computer Science, UMC.
Data Mining Techniques As Tools for Analysis of Customer Behavior
Data Mining Chun-Hung Chou
Understanding Data Analytics and Data Mining Introduction.
Tang: Introduction to Data Mining (with modification by Ch. Eick) I: Introduction to Data Mining A.Short Preview 1.Initial Definition of Data Mining 2.Motivation.
3 Objects (Views Synonyms Sequences) 4 PL/SQL blocks 5 Procedures Triggers 6 Enhanced SQL programming 7 SQL &.NET applications 8 OEM DB structure 9 DB.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
Banking on Analytics Dr A S Ramasastri Director, IDRBT.
Data MINING Data mining is the process of extracting previously unknown, valid and actionable information from large data and then using the information.
 Fundamentally, data mining is about processing data and identifying patterns and trends in that information so that you can decide or judge.  Data.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Guest Lecture Introduction to Data Mining Dr. Bhavani Thuraisingham September 17, 2010.
CS157B Fall 04 Introduction to Data Mining Chapter 22.3 Professor Lee Yu, Jianji (Joseph)
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
CRM - Data mining Perspective. Predicting Who will Buy Here are five primary issues that organizations need to address to satisfy demanding consumers:
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Chapter 14 Data Mining Transparencies. 2 Chapter Objectives u The concepts associated with data mining. u The main features of data mining operations,
MIS2502: Data Analytics Advanced Analytics - Introduction.
CS507 Information Systems. Lesson # 11 Online Analytical Processing.
Academic Year 2014 Spring Academic Year 2014 Spring.
Data Mining. Overview the extraction of hidden predictive information from large databases Data mining tools predict future trends and behaviors, allowing.
Data Mining Copyright KEYSOFT Solutions.
DATA MINING It is a process of extracting interesting(non trivial, implicit, previously, unknown and useful ) information from any data repository. The.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Classification Tree Interaction Detection. Use of decision trees Segmentation Stratification Prediction Data reduction and variable screening Interaction.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
Ahmed K. Ezzat, SQL Server 2008 and Data Mining Overview 1 Data Mining and Big Data.
Data Mining – Introduction (contd…) Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
Data Mining.
MIS2502: Data Analytics Advanced Analytics - Introduction
DATA MINING © Prentice Hall.
Data and Applications Security Introduction to Data Mining
Adrian Tuhtan CS157A Section1
Supporting End-User Access
CSE591: Data Mining by H. Liu
Presentation transcript:

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 2 OUTLINE Definition of Data Mining Using Areas of Data Mining Data Mining Tasks Some examples about Data Mining Methods

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 3 Data, Information and Knowledge DATA INFORMATION KNOWLEDGE (Raw facts) (Organized facts and figures)(Meaningful information, helps to make decision) Company 125 TL Company 230 TL Company310 TL Company 440 TL Example 1: Data : 25, 30, 10, 40 Information : These are prices of the st. For these companies Knowledge : the cheapest one is company3, deciding to buy Data is anything that recorded and processed by computers. Data is raw fact, may be numeric or text format. "Information" is "data" that has been processed. Information can be converted into knowledge. We make decisions using knowledge.

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 4 Problem ? : Huge and Meaningless Data Data is not valueable while querying with traditional methods How can we get knowledge to compete ? Solution : Data Mining Process

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 5 DATA MINING OVERVIEW Data Mining is the process of analyzing data from different perspectives and summarizing it into useful information Data Mining is knowledge discovery from data

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 6 DATA MINING APPLICATION AREA EXAMPLES Medical / Pharma Computer Assisted Diagnosis (expert systems learning) Characterization/prediction of patient's response to product dosage Identification of successful medical therapies (successful prescription patterns). Study of relations between dosage and potentially related adverse events Banking / Finance Detection of fraudulent credit card usage patterns. Risk management related to attribution of loans using scorecards. Find hidden correlations between different financial indicators. Identification of stocks trading rules from historical market data. Retail / Marketing Discovery of buying behavior patterns Detection of associations among customer characteristics. Prediction of the probability that clients answer to mailing.

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 7 Some Possible Questions : Banking : Does he take a new credit card or manage to pay it Stock market Will gold prices increase Shopping market: Which items are frequently purchased Does he interest new fashion shoes/hats/clothes Hospital: Which ilness may be.. How long does it take to recover from this illness ? Answers: Looking historical data or finding correlation with Data Mining

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 8 DATA MINING & DATA WAREHOUSING & OLTP OLTP system includes transactional data, unnecessary data for analysing Data Warehouses is designed for analysis Data warehouses are subject based, fast Data warehouses are designed to use in decision support systems Data Mining is performed on Data Warehouses

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 9 Data Mining Tasks 1. Defining Problem is the first task. May be it is the most important phase because, according to the purpose, using methods and rules will be changed. The goal should be clear. Some Example Goals: Is there a relationship between gender and education ? Is there a relationship between travelling expenditure and education? What type of customer does have problem with paying their loan?

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 10 Data Mining Tasks (Cont’d) 2. Preparing data : This phase takes most time. While preparing data, data is taken from different sources. Then all data is combined and cleaned. Sometimes, generating new variables is needed. Not only the size but also the quality of the data is very important. After providing data quality, modelling task will begin. 3. Modelling task is performed using different methods. Data Mining tools are used After modelling finished, it will be evaluated and used.

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 11 Prediction Methods Some variables are used to predict unknown or future values of other variables. Example : classification, regression, deviation detection, decision trees Description Methods Find human-interpretable patterns that describe the data. Example: clustering, assocation rule discovery,.. Data Mining Methods

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 12 DECISION TREE EXAMPLE CustomerLoanIncomeStatusRisk 1highbademployerbad 2high employeebad 3highlowemployeebad 4Lowlowemployeegood 5low employerbad Example : Use these data to predict the risk of a customer.

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 13 A - node B - node C - node BAD GOOD BADGOOD loan:highloan :low income:high income:low Status : employer Status : employee DECISION TREE

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 14 REGRESSION ANALYSIS The goal of the regression analysis is finding Relationship among variables Regression analysis is a process of finding releationships between Y variable and other X1,X2,….Xn variables. It includes many techniques for modelling and analyzing several variables A Regression equation example is :

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 15 Example2 : Regression Analysis A (independent variable) B (dependent variable)

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 16 Some Popular Data Mining Tools: SAS Enterprise Miner IBM SPSS Modeler ORACLE Data Mining RapidMiner (Open Source) Pentaho (Open Source)

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 17 Summary Data Mining is the process of getting information from data Before modelling, determining the goal is most important The quality of the data is also very important Data Mining is performed on Data Warehouses Data Mining has prediction and description methods Some Data Mining Tools are used for the Data Mining process

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 18 Summary of Business Intelligence Components:

TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT 19 Thank you & Questions ?