HW#2: A Strategy for Mining Association Rules Continuously in POS Scanner Data.

Slides:



Advertisements
Similar presentations
Web Mining.
Advertisements

An Introduction to Data Mining
Rule Discovery from Time Series Presented by: Murali K. Kadimi.
Association Rules Spring Data Mining: What is it?  Two definitions:  The first one, classic and well-known, says that data mining is the nontrivial.
IT 433 Data Warehousing and Data Mining Association Rules Assist.Prof.Songül Albayrak Yıldız Technical University Computer Engineering Department
10 -1 Lecture 10 Association Rules Mining Topics –Basics –Mining Frequent Patterns –Mining Frequent Sequential Patterns –Applications.
Nadia Andreani Dwiyono DESIGN AND MAKE OF DATA MINING MARKET BASKET ANALYSIS APLICATION AT DE JOGLO RESTAURANT.
Chase Repp.  knowledge discovery  searching, analyzing, and sifting through large data sets to find new patterns, trends, and relationships contained.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Week 9 Data Mining System (Knowledge Data Discovery)
Data Mining By Archana Ketkar.
Lecture14: Association Rules
1 Synthesizing High-Frequency Rules from Different Data Sources Xindong Wu and Shichao Zhang IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL.
Data Mining – Intro.
Data mining By Aung Oo.
MIS 451 Building Business Intelligence Systems Association Rule Mining (1)
Data Mining CS 157B Section 2 Keng Teng Lao. Overview Definition of Data Mining Application of Data Mining.
GUHA method in Data Mining Esko Turunen Tampere University of Technology Tampere, Finland.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Data Mining : Introduction Chapter 1. 2 Index 1. What is Data Mining? 2. Data Mining Functionalities 1. Characterization and Discrimination 2. MIning.
『 Data Mining 』 By Jung, hae-sun. 1.Introduction 2.Definition 3.Data Mining Applications 4.Data Mining Tasks 5. Overview of the System 6. Data Mining.
Knowledge Discovery & Data Mining process of extracting previously unknown, valid, and actionable (understandable) information from large databases Data.
Chapter 5: Data Mining for Business Intelligence
MAKING THE BUSINESS BETTER Presented By Mohammed Dwikat DATA MINING Presented to Faculty of IT MIS Department An Najah National University.
 BA_EM Electronic Marketing – Pavel
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Mining Techniques As Tools for Analysis of Customer Behavior
Tang: Introduction to Data Mining (with modification by Ch. Eick) I: Introduction to Data Mining A.Short Preview 1.Initial Definition of Data Mining 2.Motivation.
Association Rules. 2 Customer buying habits by finding associations and correlations between the different items that customers place in their “shopping.
Data mining: some basic ideas Francisco Moreno Excerpts from Fundamentals of DB Systems, Elmasri & Navathe and other sources.
Mining High Utility Itemsets without Candidate Generation Date: 2013/05/13 Author: Mengchi Liu, Junfeng Qu Source: CIKM "12 Advisor: Jia-ling Koh Speaker:
Data Mining CS157B Fall 04 Professor Lee By Yanhua Xue.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
1 1 Slide Introduction to Data Mining and Business Intelligence.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Using Data Mining Technologies to find Currency Trading Rules A. G. Malliaris M. E. Malliaris Loyola University Chicago Multinational Finance Society,
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
Data Mining By : Tung, Sze Ming ( Leo ) CS 157B. Definition A class of database application that analyze data in a database using tools which look for.
 Fundamentally, data mining is about processing data and identifying patterns and trends in that information so that you can decide or judge.  Data.
Data Mining By Dave Maung.
Data Mining Jim King. What is Data Mining?  A.k.a. knowledge discovery The search for previously unknown relationships in large data setsThe search for.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Data mining. Data mining, at its core, is the transformation of large amounts of data into meaningful patterns and rules.
3-1 Data Mining Kelby Lee. 3-2 Overview ¨ Transaction Database ¨ What is Data Mining ¨ Data Mining Primitives ¨ Data Mining Objectives ¨ Predictive Modeling.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Data Warehousing Lecture-30 What can Data Mining do? Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research.
Frequent-Itemset Mining. Market-Basket Model A large set of items, e.g., things sold in a supermarket. A large set of baskets, each of which is a small.
Association Rule Mining
DATA MINING By Cecilia Parng CS 157B.
CURE Clustering Using Representatives Handles outliers well. Hierarchical, partition First a constant number of points c, are chosen from each cluster.
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
DATA MINING Using Association Rules by Andrew Williamson.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
DATA MINING PREPARED BY RAJNIKANT MODI REFERENCE:DOUG ALEXANDER.
Academic Year 2014 Spring Academic Year 2014 Spring.
Chapter 8 Association Rules. Data Warehouse and Data Mining Chapter 10 2 Content Association rule mining Mining single-dimensional Boolean association.
Chap 6: Association Rules. Rule Rules!  Motivation ~ recent progress in data mining + warehousing have made it possible to collect HUGE amount of data.
DATA MINING It is a process of extracting interesting(non trivial, implicit, previously, unknown and useful ) information from any data repository. The.
Chapter 3 Data Mining: Classification & Association Chapter 4 in the text box Section: 4.3 (4.3.1),
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Introduction BIM Data Mining.
By Arijit Chatterjee Dr
DATA MINING © Prentice Hall.
Data Mining Jim King.
Market Basket Analysis and Association Rules
Data Mining: Introduction
I will explain the case, and application for asscoiation rules
Presentation transcript:

HW#2: A Strategy for Mining Association Rules Continuously in POS Scanner Data

Contents I.Introduction II.Data Mining and Association Rules III.Point of Scanner Data IV.A Strategy for Data Mining in POS Scanner Data V.Case Study: Divide and Conquer strategy in Practice VI. Conclusion

I. Introduction Section 2 presents KDD and Data Mining aspects with particular focus on Association Rules. Section 3 describes POS common operations and information. Section 4 our strategy is presented, based on previously described POS characteristics.

II. Data Mining and Association Rules Data Mining in Knowledge Discovery in Databases(KDD) Process which is the “non-trivial process of extracting patens from data that are useful, novel and comprehensive.” KDD process can be viewed in 3 parts: 1) Goal definition and data preparing, where existing data, related to the goal is collected, cleaned and enriched as much as possible. 2) Analysis, where data is transformed and applied to one or more Data Mining algorithms. 3) Results Interpretation, validation and deployment, where useful, novelty process output is used as a benefit to the organization.

There exist many distinct Data Mining techniques, each one with many algorithms. gives a first taxonomy to these techniques depending on the author’s point of view, calling them: Class problems A) Association Rules, where events are verified in order to determine if their behaviors, events, or items are linked B) Classification, where events in business are labeled within some classes and then a description is searched for them, so new events can be pre-classified. C) Sequences, where events are studied to check if their occurrences are sub-sequents, or if there exists sequence patterns in business events.

III. Point of Scanner Data POS: Point of Sale Each time a transaction is made in a POS (or a basket) By the time new data is generated, operational procedures are made In each transaction recorded by a system for reading barcodes. There is information about product purchased, quantity and price, and there could be one or more products in one transaction.

IV. A Strategy for Data Mining in POS Scanner Data A. POS Scanner facts The number of daily transactions can easily reach ten thousand even in small stores. Here, transactions are accumulated in a period of one day, and the daily transactions are analyzed.

B. Divide and Conquer Strategy In market basket, the whole problem is to find Association Rules in a set of all transaction within a period, a day here. This problem can be divided in time intervals (one day here in this case). Association Rules are discovered in all resulting sets.

V. Case Study: Divide and Conquer strategy in Practice A. Strategy Parameters and Choices 2 supermarkets in Brazil are equipped with scanning systems for reading barcodes and will be identified. The associations were generated on a daily basis, being accumulated in a general rules based. Association rules were induced and analyzed to see if they are or not present in transactions. Support and Confidence.

To limit daily rules, the extraction support parameter was to set a minimum limit of 1% of total transactions. Therefore, products that were not within 1% of the buying tickets(baskets) in a given day were search excluded and were not considered for the rule formation. Association rules are useful, meaningful, and significant when its stability value is over 95% and confidence value is over 45%.

B. Extracted Results (Stability: if the rule happens/found every day in the period analyzed, stability will be 100%) stability presence above 95% of the observed period the usual associations(meaningful) are stable in the two stores. Ex) Ham & Bread

Non-usual associations (not meaningful) The non-usual associations in store A were stable over the analyzed period. However, in store B, these associations are not stable, meaning they are not significant because of inconsistency in two stores. Ex) determent & long-life package milk Applications of Association rules: Products would be arrange by using Association Rules. The arrangement could be tested for validity and increasing sale.