Introduction.  Instructor: Cengiz Örencik   Course materials:  myweb.sabanciuniv.edu/cengizo/courses.

Slides:



Advertisements
Similar presentations
2015/6/1Course Introduction1 Welcome! MSCIT 521: Knowledge Discovery and Data Mining Qiang Yang Hong Kong University of Science and Technology
Advertisements

1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Data Mining By Archana Ketkar.
Data Mining Adrian Tuhtan CS157A Section1.
Data Mining – Intro.
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
Data mining By Aung Oo.
Advanced Database Applications Database Indexing and Data Mining CS591-G1 -- Fall 2001 George Kollios Boston University.
CS 5941 CS583 – Data Mining and Text Mining Course Web Page 05/cs583.html.
Data Warehousing 資料倉儲 Min-Yuh Day 戴敏育 Assistant Professor 專任助理教授 Dept. of Information Management, Tamkang University Dept. of Information ManagementTamkang.
Data Mining: A Closer Look
Data Mining.
CIT 858: Data Mining and Data Warehousing Course Instructor: Bajuna Salehe Web:
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
Enterprise systems infrastructure and architecture DT211 4
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
1 © Goharian & Grossman 2003 Introduction to Data Mining (CS 422) Fall 2010.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
『 Data Mining 』 By Jung, hae-sun. 1.Introduction 2.Definition 3.Data Mining Applications 4.Data Mining Tasks 5. Overview of the System 6. Data Mining.
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
Data Mining Techniques As Tools for Analysis of Customer Behavior
Data Mining: Introduction. Why Data Mining? l The Explosive Growth of Data: from terabytes to petabytes –Data collection and data availability  Automated.
1 Data Mining Books: 1.Data Mining, 1996 Pieter Adriaans and Dolf Zantinge Addison-Wesley 2.Discovering Data Mining, 1997 From Concept to Implementation.
Chapter 1 Introduction to Data Mining
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
1 1 Slide Introduction to Data Mining and Business Intelligence.
Course Title Database Technologies Instructor: Dr ALI DAUD Course Credits: 3 with Lab Total Hours: 45 approximately.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Data MINING Data mining is the process of extracting previously unknown, valid and actionable information from large data and then using the information.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
CS157B Fall 04 Introduction to Data Mining Chapter 22.3 Professor Lee Yu, Jianji (Joseph)
Introduction of Data Mining and Association Rules cs157 Spring 2009 Instructor: Dr. Sin-Min Lee Student: Dongyi Jia.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Part I Data Mining Fundamentals Chapter 1 Data Mining: A First View Jason C. H. Chen, Ph.D. Professor.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Introduction to Data-Mining Marko Grobelnik Institut Jozef Stefan.
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Academic Year 2014 Spring Academic Year 2014 Spring.
February 13, 2016 Data Mining: Concepts and Techniques 1 1 Data Mining: Concepts and Techniques These slides have been adapted from Han, J., Kamber, M.,
Waqas Haider Bangyal. 2 Source Materials “ Data Mining: Concepts and Techniques” by Jiawei Han & Micheline Kamber, Second Edition, Morgan Kaufmann, 2006.
DATA MINING It is a process of extracting interesting(non trivial, implicit, previously, unknown and useful ) information from any data repository. The.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
Data Mining – Intro.
Data Mining Motivation: “Necessity is the Mother of Invention”
MIS2502: Data Analytics Advanced Analytics - Introduction
DATA MINING © Prentice Hall.
MIS 451 Building Business Intelligence Systems
Introduction C.Eng 714 Spring 2010.
Social Media Data Mining
Adrian Tuhtan CS157A Section1
Data Mining: Concepts and Techniques Course Outline
כריית מידע -- מבוא ד"ר אבי רוזנפלד.
Data Warehousing and Data Mining
Smart Portal To Protect Child Online
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques
Data Mining Concepts and Techniques
Data Mining: Concepts and Techniques
Welcome! Knowledge Discovery and Data Mining
CSE591: Data Mining by H. Liu
Presentation transcript:

Introduction

 Instructor: Cengiz Örencik   Course materials:  myweb.sabanciuniv.edu/cengizo/courses

 Reference Books ◦ Veri Madenciliği: Kavram ve Algoritmaları, Doç. Dr. Gökhan Silahtaroğlu, 2013 ◦ Data Mining: Concepts and Techniques, Jiawei Han and Micheline Kamber, 2010

 1 midterm%30  2 inclass quiz %20  1 final %50  HW ?

 Fundamental data mining tools / concepts  Classification, clustering, associations and correlations algorithms  Real life examples and implementations

 Data preprocess  Data Warehouses ◦ Data from different sources/different structure  unified schema, reside at a single site ◦ Periodic data  summary  Associations and correlations ◦ Market basket analysis, etc.  Classification and prediction ◦ E.g. is he trustable for credit application?

 Cluster Analysis ◦ People with similar spending patterns  Text and WEB mining  Privacy preserving data mining ◦ Protect personal information

 “Necessity is the mother of invention” Plato

 Continuously petabytes of new data is produced ◦ 90% of world's data generated over last two years ◦ Twitter, facebook, online shopping, mobese cams etc.  Easy to access and store data  e.g. customer voice records  Web Crawler  e.g. twits that contain “election” and “party” terms  Hard part is getting knowledge from the data

 Data mining is extracting non-trivial (previously unknown) and valid knowledge from large amounts of data that can be used in decision making  Non-trivial ◦ Huge cost to get predictable info ◦ Not to prove sth you already know  Diaper – beer correlation  Large data ◦ Validity  Decision making

DatabasesData Mining  Query ◦ Suitable  SQL – relational DB  Data ◦ Dynamic  Output ◦ known ◦ Subset of data  Query ◦ Not suitable ◦ No common language  Data ◦ Static  Output ◦ Not known ◦ Not subset of data

 Database queries ◦ List of the people that has a boat at Kalamış marine and has the name “Ahmet” ◦ Credit card owners under 30 that has >5000 TL/m spending  Data Mining Queries ◦ Credit application with low risk (classification) ◦ Card owners with similar buying patterns (clustering) ◦ Products purchased together with PS4 games (association rules)

Databases Data Warehouse Data Mining patterns Knowledge CleaningSelection transformation Evaluation Presentation

14 Increasing potential to support business decisions End User Business Analyst Data Analyst DBA Decision Making Data Presentation Visualization Techniques Data Mining Information Discovery Data Exploration Statistical Summary, Querying, and Reporting Data Preprocessing/Integration, Data Warehouses Data Sources Paper, Files, Web documents, Scientific experiments, Database Systems

 Market analysis ◦ Target audience, customer relations  Risk analysis ◦ Resource management, check competitive enterprise  Fraud detection ◦ Insurance, banking ◦ Modeling using history data  Document similarity ◦ plagiarism

 Want to fit data into a model  Predictive mining ◦ Classify people that may not pay mortgage payments ◦ Predict people that leave your company for another ◦ Predict exchange market (borsa)  Descriptive mining ◦ Shows hidden information ◦ Shows your best customers ◦ Which products sell together ◦ Which customers have similar shopping trends

 Classification [Predictive]  Clustering [Descriptive]  Association Rules [Descriptive]