Download presentation

Presentation is loading. Please wait.

Published byArianna Lank Modified over 2 years ago

1
Pertemuan XIV FUNGSI MAYOR Assosiation

2
What Is Association Mining? Association rule mining: –Finding frequent patterns, associations, correlations, or causal structures among sets of items or objects in transaction databases, relational databases, and other information repositories. Applications: –Basket data analysis, cross-marketing, catalog design, loss-leader analysis, clustering, classification, etc. Examples. –Rule form: “Body ead [support, confidence]”. –buys(x, “diapers”) buys(x, “beers”) [0.5%, 60%] –major(x, “CS”) ^ takes(x, “DB”) grade(x, “A”) [1%, 75%]

3
Association Rules Wal-Mart customers who purchase Barbie dolls have a 60% likelihood of also purchasing one of three types of candy bars [Forbes, Sept 8, 1997] Customers who purchase maintenance agreements are very likely to purchase large appliances (author experience) When a new hardware store opens, one of the most commonly sold items is toilet bowl cleaners (author experience) So what…

4
Tugas asosiasi data mining adalah menemukan atribut yang muncul dalam satu waktu.

5
Rule Measures: Support and Confidence Find all the rules X & Y Z with minimum confidence and support –support, s, probability that a transaction contains {X Y Z} –confidence, c, conditional probability that a transaction having {X Y} also contains Z Let minimum support 50%, and minimum confidence 50%, we have A C (50%, 66.6%) C A (50%, 100%) Customer buys diaper Customer buys both Customer buys beer

6
Association Rule Mining Given a set of transactions, find rules that will predict the occurrence of an item based on the occurrences of other items in the transaction Market-Basket transactions Example of Association Rules {Diaper} {Beer}, {Milk, Bread} {Eggs,Coke}, {Beer, Bread} {Milk}, Implication means co-occurrence, not causality!

7
Definition: Frequent Itemset Itemset –A collection of one or more items Example: {Milk, Bread, Diaper} –k-itemset An itemset that contains k items Support count ( ) –Frequency of occurrence of an itemset –E.g. ({Milk, Bread,Diaper}) = 2 Support –Fraction of transactions that contain an itemset –E.g. s({Milk, Bread, Diaper}) = 2/5 Frequent Itemset –An itemset whose support is greater than or equal to a minsup threshold

8
Definition: Association Rule Example: l Association Rule –An implication expression of the form X Y, where X and Y are itemsets –Example: {Milk, Diaper} {Beer} l Rule Evaluation Metrics –Support (s) Fraction of transactions that contain both X and Y –Confidence (c) Measures how often items in Y appear in transactions that contain X

9
Mining Association Rules Example of Rules: {Milk,Diaper} {Beer} (s=0.4, c=0.67) {Milk,Beer} {Diaper} (s=0.4, c=1.0) {Diaper,Beer} {Milk} (s=0.4, c=0.67) {Beer} {Milk,Diaper} (s=0.4, c=0.67) {Diaper} {Milk,Beer} (s=0.4, c=0.5) {Milk} {Diaper,Beer} (s=0.4, c=0.5) Observations: All the above rules are binary partitions of the same itemset: {Milk, Diaper, Beer} Rules originating from the same itemset have identical support but can have different confidence Thus, we may decouple the support and confidence requirements

10
The Apriori Algorithm — Example Database D Scan D C1C1 L1L1 L2L2 C2C2 C2C2 C3C3 L3L3

11
Asosiasi dengan Business Intelligence pada SQL Server

21
Algoritma Asosiasi MBA (Market Basket Analysis) Langkah-langkah algoritma MBA: 1.Tetapkan besaran dari konsep itemset sering, nilai minimum besaran support dan besaran confidence yang diinginkan. 2.Menetapkan semua itemset sering, yaitu itemset yang memiliki frekuensi itemset minimal sebesar bilangan sebelumnya. 3.Dari semua itemset sering, hasilkan aturan asosiasi yang memenuhi nilai minimum support dan confidence

22
Support (A B) = P(A B) Confidence(A B) = P(B|A)

Similar presentations

OK

Association Rules Spring 2010. Data Mining: What is it? Two definitions: The first one, classic and well-known, says that data mining is the nontrivial.

Association Rules Spring 2010. Data Mining: What is it? Two definitions: The first one, classic and well-known, says that data mining is the nontrivial.

© 2018 SlidePlayer.com Inc.

All rights reserved.

Ads by Google

Ppt on computer malware scanner Ppt on moles concept Ppt on trans-siberian railway route Human body parts for kids ppt on batteries Ppt on the road not taken symbolism Ppt on bionics lab Ppt on two point perspective grid Pharynx anatomy and physiology ppt on cells Ppt on instant messaging and chat Ppt on instrument landing system cost