Data Mining Database Systems Timothy Vu. 2 Mining Mining is the extraction of valuable minerals or other geological materials from the earth, usually.

Slides:



Advertisements
Similar presentations
Data Mining Tri Nguyen. Agenda Data Mining As Part of KDD Decision Tree Association Rules Clustering Amazon Data Mining Examples.
Advertisements

Data Mining Sangeeta Devadiga CS 157B, Spring 2007.
CIS 678 Artificial Intelligence problems deduction, reasoning knowledge representation planning learning natural language processing motion and manipulation.
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
Week 9 Data Mining System (Knowledge Data Discovery)
Data Mining with Decision Trees Lutz Hamel Dept. of Computer Science and Statistics University of Rhode Island.
Data Mining Adrian Tuhtan CS157A Section1.
Data Mining By: Thai Hoa Nguyen Pham. Data Mining  Define Data Mining  Classification  Association  Clustering.
© Prentice Hall1 DATA MINING Introductory and Advanced Topics Part II Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist.
Data Mining – Intro.
Advanced Database Applications Database Indexing and Data Mining CS591-G1 -- Fall 2001 George Kollios Boston University.
Data Mining: A Closer Look
Enterprise systems infrastructure and architecture DT211 4
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Database System Concepts, 6 th Ed. ©Silberschatz, Korth and Sudarshan See for conditions on re-usewww.db-book.com Chapter 20: Data Analysis.
Basic Data Mining Techniques
Knowledge Discovery & Data Mining process of extracting previously unknown, valid, and actionable (understandable) information from large databases Data.
MAKING THE BUSINESS BETTER Presented By Mohammed Dwikat DATA MINING Presented to Faculty of IT MIS Department An Najah National University.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
1 An Introduction to Data Mining Hosein Rostani Alireza Zohdi Report 1 for “advance data base” course Supervisor: Dr. Masoud Rahgozar December 2007.
Spatial Statistics and Spatial Knowledge Discovery First law of geography [Tobler]: Everything is related to everything, but nearby things are more related.
COMP3503 Intro to Inductive Modeling
Copyright: Silberschatz, Korth and Sudarshan 1 Data Mining.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
Data Mining By Fu-Chun (Tracy) Juang. What is Data Mining? ► The process of analyzing LARGE databases to find useful patterns. ► Attempts to discover.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Knowledge Discovery and Data Mining Evgueni Smirnov.
DATA MINING 1. 2 Data Mining Extracting or “mining” knowledge from large amounts of data Data mining is the process of autonomously retrieving useful.
Database System Concepts, 6 th Ed. ©Silberschatz, Korth and Sudarshan See for conditions on re-usewww.db-book.com Other Topics 2: Warehousing,
Data Mining By Dave Maung.
Computing & Information Sciences Kansas State University Friday. 30 Nov 2007CIS 560: Database System Concepts Lecture 39 of 42 Friday, 30 November 2007.
Data Mining: Association Rule By: Thanh Truong. Association Rules In Association Rules, we look at the associations between different items to draw conclusions.
Copyright © 2004 Pearson Education, Inc.. Chapter 27 Data Mining Concepts.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
27-18 września Data Mining dr Iwona Schab. 2 Semester timetable ORGANIZATIONAL ISSUES, INDTRODUCTION TO DATA MINING 1 Sources of data in business,
Database System Concepts, 6 th Ed. ©Silberschatz, Korth and Sudarshan See for conditions on re-usewww.db-book.com Chapter 20: Data Analysis.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
DATA MINING WITH CLUSTERING AND CLASSIFICATION Spring 2007, SJSU Benjamin Lam.
Data Mining Brandon Leonardo CS157B (Spring 2006).
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College Bio Informatics January
Academic Year 2014 Spring Academic Year 2014 Spring.
Data Mining Copyright KEYSOFT Solutions.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Data Warehousing and Data Mining. Data Warehousing Data Mining Classification Association Rules Clustering.
Data Mining – Introduction (contd…) Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
Chapter 20 Data Warehousing and Mining 1 st Semester, 2016 Sanghyun Park.
Knowledge Discovery in a DBMS Data Mining Computing models and finding patterns in large databases current major challenge in database systems & large.
Chapter 20 Data Warehousing and Mining 1 st Semester, 2016 Sanghyun Park.
DATA MINING and VISUALIZATION Instructor: Dr. Matthew Iklé, Adams State University Remote Instructor: Dr. Hong Liu, Embry-Riddle Aeronautical University.
Oracle Advanced Analytics
Data Mining Functionalities
Data Mining.
Data Mining – Intro.
By Arijit Chatterjee Dr
Data Mining Motivation: “Necessity is the Mother of Invention”
DATA MINING © Prentice Hall.
Chapter 20: Data Analysis
Data Mining 101 with Scikit-Learn
Waikato Environment for Knowledge Analysis
Adrian Tuhtan CS157A Section1
What is Pattern Recognition?
Sangeeta Devadiga CS 157B, Spring 2007
Data Analysis.
T H E P U B G P R O J E C T.
Presentation transcript:

Data Mining Database Systems Timothy Vu

2 Mining Mining is the extraction of valuable minerals or other geological materials from the earth, usually bauxite, coal, diamonds, iron, precious metals, lead, limestone, nickel, phosphate, rock salt, tin, and uranium, petroleum, natural gas, and even water. Often something that is valuable, rare, or useful.

3 What is Data Mining Data Mining, also known as Knowledge-Discovery in Databases (KDD), is the process of automatically searching large volumes of data for patterns. In order to achieve this, data mining uses computational techniques from statistics, machine learning and pattern recognition. Machine learning - a method for creating computer programs by the analysis of data sets. Pattern recognition - classify data (patterns) based on either a priori knowledge or on statistical information extracted from the patterns.

4 Why Data Mining Data mining is a technique that helps individuals or companies find useful information to make better decisions from large amounts of data. - Reduce risks - Find problems and issues - Save money - High confidence predictions - Simplifying information

5 Discussion Topics 1 ) Classification 2 )Regression 3) Association 4) Clustering

6 Classifiers Decision-Tree Classifiers – each node has an associated class and each internal node has a predicate. Bayesian Classifiers – find the distribution of attribute values for each class in the training data ( the maximum probability predicted ). Nuro Net Classifiers – Use the training data to train artificial nuro nets.

7 Regression Regression – Deals with the prediction of a value rather than a class. Linear Regression – Predict values using a polynomial by finding the curve fitting, meaning finding coefficients that give the best answer.

8 Associations Finding the association or relationship between two or more items. Support – measure of what fractions of the pupulation satisifies both the antecedent and the consequent of the rule. MILK => Screwdrivers Confidence – how often the consequent is true when the antecedent is true. MILK => Bread

9 Clustering Clustering is the classification of similar objects into different groups, or more precisely, the partitioning of a data set into subsets (clusters), so that the data in each subset (ideally) share some common trait - often proximity according to some defined distance measure.

10 Applications of Data Mining 1. Predictions - Stock Market - Earth Quakes - NBA games 2. Association - Store Inventory - Fashion Trends 3. Descriptive Patterns - Disease Analysis - Image Recognition - Fraud Detection

11 Gather Data

12 Electrocardiogram

13 Disease Analysis

14 References Silberschatz, H.F. Korth, S. Sudershan: Database System Concepts, 5th ed., McGraw-Hill, 2006 Runge, Marschall, Magnus Ohman, and Frank Netter. Netter's Cardiology (Netter Clinical Science). W.B. Saunders Company, "Data mining". Wikipedia. 4/1/2006.