Discovering Frequent Arrangements of Temporal Intervals Papapetrou, P. ; Kollios, G. ; Sclaroff, S. ; Gunopulos, D. ICDM 2005.

Slides:



Advertisements
Similar presentations
Association Analysis (2). Example TIDList of item ID’s T1I1, I2, I5 T2I2, I4 T3I2, I3 T4I1, I2, I4 T5I1, I3 T6I2, I3 T7I1, I3 T8I1, I2, I3, I5 T9I1, I2,
Advertisements

Association Rule Mining. 2 The Task Two ways of defining the task General –Input: A collection of instances –Output: rules to predict the values of any.
STAGGER: Periodicity Mining of Data Streams using Expanding Sliding Windows Mohamed G. Elfeky Walid G.Aref Ahmed K. Elmagarmid ICDM /10/021Chen.
1 Department of Information & Computer Education, NTNU SmartMiner: A Depth First Algorithm Guided by Tail Information for Mining Maximal Frequent Itemsets.
Edi Winarko, John F. Roddick
FP-Growth algorithm Vasiljevic Vladica,
Sampling Large Databases for Association Rules ( Toivenon’s Approach, 1996) Farzaneh Mirzazadeh Fall 2007.
Carson Kai-Sang Leung, Mark Anthony F. Mateo, and Dale A. Brajczuk PAKDD 2008 A Tree-based Approach for Frequent Pattern Mining from Uncertain Data.
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Data Mining Association Analysis: Basic Concepts and Algorithms
Association Analysis (2). Example TIDList of item ID’s T1I1, I2, I5 T2I2, I4 T3I2, I3 T4I1, I2, I4 T5I1, I3 T6I2, I3 T7I1, I3 T8I1, I2, I3, I5 T9I1, I2,
Data Mining Association Analysis: Basic Concepts and Algorithms
Mining Time-Series Databases Mohamed G. Elfeky. Introduction A Time-Series Database is a database that contains data for each point in time. Examples:
Temporal Pattern Matching of Moving Objects for Location-Based Service GDM Ronald Treur14 October 2003.
1 Mining Quantitative Association Rules in Large Relational Database Presented by Jin Jin April 1, 2004.
Pattern Lattice Traversal by Selective Jumps Osmar R. Zaïane and Mohammad El-Hajj Department of Computing Science, University of Alberta Edmonton, AB,
Panagiotis Papapetrou Department of Computer Science Boston University Constraint-based Mining of Frequent Arrangements of Temporal Intervals Master Thesis.
1 Efficiently Mining Frequent Trees in a Forest Mohammed J. Zaki.
Sequential PAttern Mining using A Bitmap Representation
Approximate Frequency Counts over Data Streams Gurmeet Singh Manku, Rajeev Motwani Standford University VLDB2002.
Data Mining Association Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 6 Introduction to Data Mining By Tan, Steinbach, Kumar Lecture.
Modul 7: Association Analysis. 2 Association Rule Mining  Given a set of transactions, find rules that will predict the occurrence of an item based on.
Efficient Data Mining for Calling Path Patterns in GSM Networks Information Systems, accepted 5 December 2002 SPEAKER: YAO-TE WANG ( 王耀德 )
Graph Indexing: A Frequent Structure- based Approach Alicia Cosenza November 26 th, 2007.
CEMiner – An Efficient Algorithm for Mining Closed Patterns from Time Interval-based Data Yi-Cheng Chen, Wen-Chih Peng and Suh-Yin Lee ICDM 2011.
CSE4334/5334 DATA MINING CSE4334/5334 Data Mining, Fall 2014 Department of Computer Science and Engineering, University of Texas at Arlington Chengkai.
1 On Mining General Temporal Association Rules in a Publication Database Chang-Hung Lee, Cheng-Ru Lin and Ming-Syan Chen, Proceedings of the 2001 IEEE.
Panagiotis Papapetrou, George Kollios, Stan Sclaroff, Dimitrios Gunopulos Department of Computer Science Boston University University of California, Riverside.
CloSpan: Mining Closed Sequential Patterns in Large Datasets Xifeng Yan, Jiawei Han and Ramin Afshar Proceedings of 2003 SIAM International Conference.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Association Analysis This lecture node is modified based on Lecture Notes for.
1 AC-Close: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery Advisor : Dr. Koh Jia-Ling Speaker : Tu Yi-Lang Date : Hong.
Mining Document Collections to Facilitate Accurate Approximate Entity Matching Presented By Harshda Vabale.
Discovering multi-label temporal patterns in sequence databases Yen-Liang Chen, Shin-Yi Wu, Yu-Cheng Wang IS (Information Sciences)
Intelligent Database Systems Lab Advisor : Dr.Hsu Graduate : Keng-Wei Chang Author : Salvatore Orlando Raffaele Perego Claudio Silvestri 國立雲林科技大學 National.
CMU SCS : Multimedia Databases and Data Mining Lecture #30: Data Mining - assoc. rules C. Faloutsos.
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Indexing and Mining Free Trees Yun Chi, Yirong Yang, Richard R. Muntz Department of Computer Science University of California, Los Angeles, CA {
1 Data Mining Lecture 6: Association Analysis. 2 Association Rule Mining l Given a set of transactions, find rules that will predict the occurrence of.
黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Exploring Spatial-Temporal Trajectory Model for Location.
1 Mining the Smallest Association Rule Set for Predictions Jiuyong Li, Hong Shen, and Rodney Topor Proceedings of the 2001 IEEE International Conference.
Mining General Temporal Association Rules for Items with Different Exhibition Cheng-Yue Chang, Ming-Syan Chen, Chang-Hung Lee, Proc. of the 2002 IEEE international.
On the Discovery of Interesting Patterns in Association Rules
CSC317 Graph algorithms Why bother?
Frequent Pattern Mining
Association Rules.
TT-Join: Efficient Set Containment Join
Chang-Hung Lee, Jian Chih Ou, and Ming Syan Chen, Proc
Data Mining Association Analysis: Basic Concepts and Algorithms
Association Rule Mining
Data Mining Association Analysis: Basic Concepts and Algorithms
Vasiljevic Vladica, FP-Growth algorithm Vasiljevic Vladica,
Association Rule Mining
A Parameterised Algorithm for Mining Association Rules
Mining Association Rules from Stars
Data Mining Association Analysis: Basic Concepts and Algorithms
Association Rule Mining
Presented by: Prof. Ali Jaoua
Amer Zaheer PC Mohammad Ali Jinnah University, Islamabad
Farzaneh Mirzazadeh Fall 2007
Graphs.
Association Analysis: Basic Concepts and Algorithms
732A02 Data Mining - Clustering and Association Analysis
Data Warehousing Mining & BI
Frequent-Pattern Tree
Geometrically Inspired Itemset Mining*
An Efficient Method for Projected Clustering
15-826: Multimedia Databases and Data Mining
Association Analysis: Basic Concepts
Discovering Frequent Poly-Regions in DNA Sequences
Presentation transcript:

Discovering Frequent Arrangements of Temporal Intervals Papapetrou, P. ; Kollios, G. ; Sclaroff, S. ; Gunopulos, D. ICDM 2005

Outlines 2  Introduction  Definition  The Arrangement Enumeration Tree  BFS-based Approach  DFS-based Approach  Hybrid DFS-based Approach  Experimental Evaluation  Conclusion

Introduction 3  In this paper the goal is to develop methods that discover temporal arrangements of correlated event intervals which occur frequently in a database.  BFS-based Approach  DFS-based Approach  Hybrid DFS-based Approach

Definition 4  Event Interval Temporal Relations

Cont. 5  event interval sequence or e-sequence : .  k-e-sequence  E.g.  5-e-sequence  {(A,1,7), (B,3,19), (D,4,30), (C,7,15), (C,23,42)}  an e-sequence database

Cont. 6  E.g.  This can be done by using the “AND” operand denoted by *.  (b) A|B * A|C * B>C  R = { |, ||, >, → } and *. A|B → CA|B>C

The Arrangement Enumeration Tree 7 N(1) N(2) N(k)

BFS-based Approach 8  The BFS-based approach uses an arrangement enumeration tree to discover the set of frequent arrangements.  Definition :  F k denote the complete set of frequent k -arrangements.  C k the set of candidate frequent k -arrangements.

9  Scanning D and filtering with min_sup = 2.  /  F 1 = {{A}, {B}, {C}}

10  Based on F 1 and the enumeration tree, N 2 is generated.  / N2N2

11  For every pair of events in the arrangements, D is scanned to get all the types of relations between them, i.e. IM 2.  / IM 2

12  If these relations satisfy the support threshold they are added to F 2.  F 2 :

13  F 3 :  /  Output : F = A > B * A > C * B > C

DFS-based Approach 14  We must completely explore all the sub-arrangements on a path before moving to another one.  One more step is added to BFS-based Algorithm :  When a node is found to contain a frequent arrangement, each sub-arrangement is added to F and the corresponding expansions are made on the tree.  We can skip those expansions in the enumeration tree reducing the cost of computation.

Hybrid DFS-based Approach 15  We use a hybrid DFS approach that generates the first two levels of the tree using BFS and then follows DFS for the rest of the tree.  This would compensate for the multiple database scans.

Experimental Evaluation 16 medium densitysparse dense medium density

Conclusion 17  The BFS-based approach uses an arrangement enumeration tree to discover the set of frequent arrangements.  The DFS-based method further improves performance over BFS by reaching longer arrangements faster and hence eliminating the need for examining smaller subsets of these arrangements.