OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.

Slides:



Advertisements
Similar presentations
Advanced Data Mining: Introduction
Advertisements

modified by Marius Bulacu
Data Mining: Concepts and Techniques
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Dr. Tahar Kechadi Dr. Joe Carthy
Data Mining By Archana Ketkar.
July 13, 2015ICS426: Introduction1 DATA WAREHOUSING AND DATA MINING.
Data Mining – Intro.
Advanced Database Applications Database Indexing and Data Mining CS591-G1 -- Fall 2001 George Kollios Boston University.
Data Warehousing 資料倉儲 Min-Yuh Day 戴敏育 Assistant Professor 專任助理教授 Dept. of Information Management, Tamkang University Dept. of Information ManagementTamkang.
Data Mining.
Business Intelligence
CIT 858: Data Mining and Data Warehousing Course Instructor: Bajuna Salehe Web:
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
Data Mining Using IBM Intelligent Miner Presented by: Qiyan (Jennifer ) Huang.
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
Chapter 1. Introduction Motivation: Why data mining?
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Mining Techniques As Tools for Analysis of Customer Behavior
Data Mining: Introduction. Why Data Mining? l The Explosive Growth of Data: from terabytes to petabytes –Data collection and data availability  Automated.
Data Mining: Concepts and Techniques
Data Mining Techniques As Tools for Analysis of Customer Behavior Lecture 2:
3 Objects (Views Synonyms Sequences) 4 PL/SQL blocks 5 Procedures Triggers 6 Enhanced SQL programming 7 SQL &.NET applications 8 OEM DB structure 9 DB.
Data Warehousing/Mining 1 Data Warehousing/Mining Comp 150 DW Chapter 1. Introduction Instructor: Dan Hebert.
Chapter 1 Introduction to Data Mining
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
DATA MINING 1. 2 Data Mining Extracting or “mining” knowledge from large amounts of data Data mining is the process of autonomously retrieving useful.
Introduction Pertemuan 01 Matakuliah: M0614 / Data Mining & OLAP Tahun : Feb
2015年10月18日星期日 2015年10月18日星期日 2015年10月18日星期日 Introduction to Data Mining 1 Chapter 1 Introduction to Data Mining Chen. Chun-Hsien Department of Information.
October 18, 2015 Data Mining: Concepts and Techniques 1 DATA MINING Motivation: Why data mining? What is data mining? Data Mining: On what kind of data?
1 Introduction to Data Mining and Data Warehousing Muhammad Ali Yousuf DSC – ITM Friday, 9 th May 2003 Based on ©Jiawei Han and Micheline Kamber Intelligent.
CS690L - Lecture 6 1 CS690L Data Mining and Knowledge Discovery Overview Yugi Lee STB #555 (816) This.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
1 Improving quality of graduate students by data mining Asst. Prof. Kitsana Waiyamai, Ph.D. Dept. of Computer Engineering Faculty of Engineering, Kasetsart.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Data Mining Lecture 2. Course Syllabus Course topics: Introduction (Week1-Week2) –What is Data Mining? –Data Collection and Data Management Fundamentals.
Introduction to Data-Mining Marko Grobelnik Institut Jozef Stefan.
Data Mining: Concepts and Techniques. Overview 1.Introduction 2.Data Preprocessing 3.Data Warehouse and OLAP Technology: An Introduction 4.Advanced Data.
MIS2502: Data Analytics Advanced Analytics - Introduction.
January 17, 2016Data Mining: Concepts and Techniques 1 What Is Data Mining? Data mining (knowledge discovery from data) Extraction of interesting ( non-trivial,
Academic Year 2014 Spring Academic Year 2014 Spring.
February 13, 2016 Data Mining: Concepts and Techniques 1 1 Data Mining: Concepts and Techniques These slides have been adapted from Han, J., Kamber, M.,
DATA MINING LECTURE 1 INTRODUCTION TO DATA MINING.
LECTURE 2: DATA MINING. WHAT IS DATA MINING? 2 D ATA M INING AND D ATA W AREHOUSES ? It evolved in to being as the science of databases evolved Database.
Data Warehousing/Mining 1. 2 Chapter 1. Introduction v Motivation: Why data mining? v What is data mining? v Data Mining: On what kind of data? v Data.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
2016年6月12日星期日 2016年6月12日星期日 2016年6月12日星期日 Introduction to Data Mining 1 Chapter 1 Introduction to Data Mining Chen. Chun-Hsien Department of Information.
CENG 770. Data mining (knowledge discovery from data) – Extraction of interesting ( non-trivial, implicit, previously unknown and potentially useful)
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
July 7, 2016 Data Mining: Concepts and Techniques 1 1.
Data Mining - Introduction Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
There is an inherent meaning in everything. “Signs for people who can see.”
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
Data Mining – Intro.
Data Mining Motivation: “Necessity is the Mother of Invention”
Data warehouse & Data Mining: Concepts and Techniques
Introduction C.Eng 714 Spring 2010.
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Introduction to Data Mining
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques Course Outline
Data Warehousing and Data Mining
Data Mining Introduction
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques
Data Mining Concepts and Techniques
Data Mining Techniques As Tools for Analysis of Customer Behavior
Data Mining: Concepts and Techniques
Presentation transcript:

OLAM and Data Mining: Concepts and Techniques

Introduction Data explosion problem: –Automated data collection tools and mature database technology lead to tremendous amounts of data stored in databases, data warehouses and other information repositories We are drowning in data, but starving for knowledge! Data warehousing and data mining: –On-line analytical processing – query-driven data analysis –The efficient discovery of interesting knowledge (rules, regularities, patterns, constraints) from data in large databases

Evolution of Database Technology 1960s: –Data collection, database creation, IMS and network DBMS 1970s: –Relational data model, relational DBMS 1980s: –RDBMS, advanced data models (extended-relational, OO, deductive, etc.) and application-oriented DBMS (spatial, scientific, engineering, etc.) 1990s: –Data mining and data warehousing, multimedia databases, and Web technology

What is data mining? Data mining: the process of efficient discovery of previously unknown patterns, relationships, rules in large databases and data warehouses Goal: help the human analyst to understand the data SQL query: –How many bottles of wine did we sell in 1 st Qtr of 1999 in Poland vs Austria?

What is data mining? Data mining query: –How do the buyers of wine in Poland and Austria differ? –What else do the buyers of wine in Austria buy along with wine? –How the buyers of wine can be characterized?

What is data mining? Data mining (knowledge discovery in databases): –Extraction of interesting ( non-trivial, implicit, previously unknown and potentially useful) information from data in large databases Alternative names and their “inside stories”: –Knowledge discovery in databases (KDD: SIGKDD), knowledge extraction, data archeology, data dredging, information harvesting, business intelligence, etc. –Data mining: a misnomer? What is not data mining? –Expert systems or small statistical programs –OLAP

Data Mining: A KDD Process Steps of a KDD Process: –Learning the application domain: relevant prior knowledge and goals of application –Creating a target data set: data selection –Data cleaning and preprocessing: (may take 60% of effort!) –Data reduction and projection: –Find useful features, dimensionality/variable reduction, invariant representation. –Choosing functions of data mining summarization, classification, regression, association, clustering. –Choosing the mining algorithm(s) –Data mining: search for patterns of interest –Interpretation: analysis of results. visualization, transformation, removing redundant patterns, etc. –Use of discovered knowledge

Data Mining and Business Intelligence Increasing potential to support business decisions Data Sources Paper, Files, Database systems, OLTP, WWW Data Warehouses/Data Marts OLAP, MDA Data Exploration Statistical Analysis, Reporting Data Mining Information Discovery Data Presentation Visualization Making Decisions End User DBA Business Analyst Data Analyst

Data Warehouse Meta Data MDDB OLAM Engine OLAP Engine User GUI API Data Cube API Database API Data cleaning Data integration Filtering Databases Filtering&Integration Mining queryMining result An OLAM Architecture

Data Mining: Confluence of Multiple Disciplines Database systems, data warehouse and OLAP Statistics Machine learning Visualization Information science High performance computing Other disciplines: –Neural networks, mathematical modeling, information retrieval, pattern recognition, etc.

Data Mining: On What Kind of Data? Relational databases Data warehouses Transactional databases Advanced DB systems and information repositories –Object-oriented and object-relational databases –Spatial databases –Time-series data and temporal data –Text databases and multimedia databases –Heterogeneous and legacy databases –WWW

Data Mining Functionality Data mining methods may be classified onto 6 basic classes: Associations –Finding rules like “if the customer buys mustard, sausage, and beer, then the probability that he/she buys chips is 50%” Classifications –Classify data based on the values of the decision attribute, e.g. classify patients based on their “state” Clustering –Group data to form new classes, cluster customers based on their behavior to find common patterns

Data Mining Functionality Sequential patterns –Finding rules like “if the customer buys TV, then, few days later, he/she buys camera, then the probability that he/she will buy within 1 month video is 50%” Time-Series similarities –Finding similar sequences (or subsequences) in time- series (e.g. stock analysis) Outlier detection –Finding anomalies/exceptions/deviations in data