Mining temporal interval relational rules from temporal data Yong Joon Lee, Jun Wook Lee, Duck Jin Chai, Bu Hyun Hwang, Keun Ho Ryu JSS (The Journal of.

Slides:



Advertisements
Similar presentations
An Inductive Database for Mining Temporal Patterns in Event Sequences Alexandre Vautier, Marie-Odile Cordier and René Quiniou
Advertisements

An Efficient Algorithm for Mining Time Interval-based Patterns in Large Databases Yi-Cheng Chen, Ji-Chiang Jiang, Wen-Chih Peng and Suh-Yin Lee Department.
Mining Frequent Patterns Using FP-Growth Method Ivan Tanasić Department of Computer Engineering and Computer Science, School of Electrical.
A distributed method for mining association rules
Data Mining (Apriori Algorithm)DCS 802, Spring DCS 802 Data Mining Apriori Algorithm Spring of 2002 Prof. Sung-Hyuk Cha School of Computer Science.
Temporal Constraints Time and Time Again: The Many Ways to Represent Time James F Allen.
gSpan: Graph-based substructure pattern mining
PREFIXSPAN ALGORITHM Mining Sequential Patterns Efficiently by Prefix- Projected Pattern Growth
Correlation Search in Graph Databases Yiping Ke James Cheng Wilfred Ng Presented By Phani Yarlagadda.
Mining Mobile Group Patterns: A Trajectory-based Approach San-Yih Hwang, Ying-Han Liu, Jeng-Kuen Chiu NSYSU, Taiwan Ee-Peng Lim NTU, Singapore.
Minimum Spanning Trees (MSTs) Prim's Algorithm For each vertex not in the tree, keep track of the lowest cost edge that would connect it to the tree This.
Edi Winarko, John F. Roddick
FP (FREQUENT PATTERN)-GROWTH ALGORITHM ERTAN LJAJIĆ, 3392/2013 Elektrotehnički fakultet Univerziteta u Beogradu.
Temporal Pattern Matching of Moving Objects for Location-Based Service GDM Ronald Treur14 October 2003.
SSCP: Mining Statistically Significant Co-location Patterns Sajib Barua and Jörg Sander Dept. of Computing Science University of Alberta, Canada.
DATA MINING -ASSOCIATION RULES-
ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis)
Computability and Complexity 24-1 Computability and Complexity Andrei Bulatov Approximation.
2/8/00CSE 711 data mining: Apriori Algorithm by S. Cha 1 CSE 711 Seminar on Data Mining: Apriori Algorithm By Sung-Hyuk Cha.
1 A DATA MINING APPROACH FOR LOCATION PREDICTION IN MOBILE ENVIRONMENTS* by Gökhan Yavaş Feb 22, 2005 *: To appear in Data and Knowledge Engineering, Elsevier.
Mining Long Sequential Patterns in a Noisy Environment Jiong Yang, Wei Wang, Philip S. Yu, Jiawei Han SIGMOD 2002.
1 Synthesizing High-Frequency Rules from Different Data Sources Xindong Wu and Shichao Zhang IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL.
Pattern Recognition Lecture 20: Data Mining 3 Dr. Richard Spillman Pacific Lutheran University.
Chapter 5 Mining Association Rules with FP Tree Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2010.
Mining Sequential Patterns: Generalizations and Performance Improvements R. Srikant R. Agrawal IBM Almaden Research Center Advisor: Dr. Hsu Presented by:
Mining Preferences from Superior and Inferior Examples KDD’08 1.
Ch5 Mining Frequent Patterns, Associations, and Correlations
Sequential PAttern Mining using A Bitmap Representation
Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Keng-Wei Chang Author : Anthony K.H. Tung Hongjun Lu Jiawei Han Ling Feng 國立雲林科技大學 National.
Similarity based Retrieval from Sequence Databases using Automata as Queries 作者 : A. Prasad Sistla, Tao Hu, Vikas howdhry 出處 :CIKM 2002 ACM 指導教授 : 郭煌政老師.
Efficient Data Mining for Calling Path Patterns in GSM Networks Information Systems, accepted 5 December 2002 SPEAKER: YAO-TE WANG ( 王耀德 )
Mining Multidimensional Sequential Patterns over Data Streams Chedy Raїssi and Marc Plantevit DaWak_2008.
MINING FREQUENT ITEMSETS IN A STREAM TOON CALDERS, NELE DEXTERS, BART GOETHALS ICDM2007 Date: 5 June 2008 Speaker: Li, Huei-Jyun Advisor: Dr. Koh, Jia-Ling.
Data Mining Association Rules: Advanced Concepts and Algorithms
CEMiner – An Efficient Algorithm for Mining Closed Patterns from Time Interval-based Data Yi-Cheng Chen, Wen-Chih Peng and Suh-Yin Lee ICDM 2011.
1 Discovering Robust Knowledge from Databases that Change Chun-Nan HsuCraig A. Knoblock Arizona State UniversityUniversity of Southern California Journal.
Expert Systems with Applications 34 (2008) 459–468 Multi-level fuzzy mining with multiple minimum supports Yeong-Chyi Lee, Tzung-Pei Hong, Tien-Chin Wang.
Pattern Discovery of Fuzzy Time Series for Financial Prediction -IEEE Transaction of Knowledge and Data Engineering Presented by Hong Yancheng For COMP630P,
Generalized Sequential Pattern Mining with Item Intervals Yu Hirate Hayato Yamana PAKDD2006.
Frequent Sequential Attack Patterns of Malware in Botnets Nur Rohman Rosyid.
Practical Considerations When t , the supp, then there will be a value of t when supp folds, it becomes multi-w-to-one-x mapping.
Gao Cong, Long Wang, Chin-Yew Lin, Young-In Song, Yueheng Sun SIGIR’08 Speaker: Yi-Ling Tai Date: 2009/02/09 Finding Question-Answer Pairs from Online.
1 Inverted Matrix: Efficient Discovery of Frequent Items in Large Datasets in the Context of Interactive Mining -SIGKDD’03 Mohammad El-Hajj, Osmar R. Zaïane.
DATA MINING By Cecilia Parng CS 157B.
Mining Frequent Patterns, Associations, and Correlations Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
A Optimal On-line Algorithm for k Servers on Trees Author : Marek Chrobak Lawrence L. Larmore 報告人:羅正偉.
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
1 Efficient Discovery of Frequent Approximate Sequential Patterns Feida Zhu, Xifeng Yan, Jiawei Han, Philip S. Yu ICDM 2007.
Intelligent Database Systems Lab Advisor : Dr.Hsu Graduate : Keng-Wei Chang Author : Salvatore Orlando Raffaele Perego Claudio Silvestri 國立雲林科技大學 National.
SeqStream: Mining Closed Sequential Pattern over Stream Sliding Windows Lei Chang Tengjiao Wang Dongqing Yang Hua Luan ICDM’08 Lei Chang Tengjiao Wang.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Mining Progressive Confident Rules M. Zhang, W. Hsu and M.L. Lee Int'l Conf on Data Mining (ICDM),2006 IEEE Advisor : Jia-Ling Koh Speaker : Tsui-Feng.
Graph Indexing From managing and mining graph data.
Indexing and Mining Free Trees Yun Chi, Yirong Yang, Richard R. Muntz Department of Computer Science University of California, Los Angeles, CA {
An unsupervised conditional random fields approach for clustering gene expression time series Chang-Tsun Li, Yinyin Yuan and Roland Wilson Bioinformatics,
Mining Sequential Patterns With Item Constraints
Discovering Frequent Arrangements of Temporal Intervals Papapetrou, P. ; Kollios, G. ; Sclaroff, S. ; Gunopulos, D. ICDM 2005.
Data Mining: Confluence of Multiple Disciplines Data Mining Database Systems Statistics Other Disciplines Algorithm Machine Learning Visualization.
On the Discovery of Interesting Patterns in Association Rules
DATA MINING © Prentice Hall.
When to Update the Sequential Patterns of Stream Data?
Advanced Pattern Mining 02
On Efficient Graph Substructure Selection
A Parameterised Algorithm for Mining Association Rules
Amer Zaheer PC Mohammad Ali Jinnah University, Islamabad
Mining Sequential Patterns
Lecture 2 Basic Concepts on Probability (Section 0.2)
Discovering Frequent Poly-Regions in DNA Sequences
Presentation transcript:

Mining temporal interval relational rules from temporal data Yong Joon Lee, Jun Wook Lee, Duck Jin Chai, Bu Hyun Hwang, Keun Ho Ryu JSS (The Journal of Systems and Software )

OUTLINE 1. Introduction 2. Related works 3. Problem definition 4. Technique for mining temporal Interval relation rules 5. Experimental results 6. Conclusions and future research 2

1. Introduction Lots of studies did not consider temporal interval data and used only data stamped with time points. The preprocessing algorithm The temporal interval relation discovery algorithm 3

2. Related works Sequential patterns Similar time sequences Temporal rules These studies are limited to data stamped with time points and do not consider temporal interval data. Allen’s interval algebra 4

3. Problem definition TS : a set of primitive time points. An event e, e = (E,t) E : event type t : a time-point at which an event e has occurred. t ∈ TS 5 Event

3. Problem definition A transaction is a set of events such as e = (Cid,E,t), Cid : a customer identifier E : an event type t : a time-point that the event has occurred. 6 Transaction

3. Problem definition A customer C is a sequence of transactions. C =, where ts(T i ) < ts(T j ) if i < j. ts(T i ) : the time-point at which T i has been issued. 7 Customer

3. Problem definition A sequence of events for a customer Cid and an event type E. Event sequence, ES(Cid,E), is represented by, where e i = (E, t i ), e i ∈ T i, t i ∈ TS, and t i ≦ t i+1 for each i=1,…,n-1 A time interval between the first event e1 and the last event en of ES is expressed as [t 1, t n ]. 8 Sequence of events

3. Problem definition A time interval between the first event e 1 and the last event e n of ES is expressed as [t 1, t n ]. An event e’ with a temporal interval is denoted as e’= (E,[vs,ve]), e’.vs = event e’ start time-point e’.ve = event e’ end time-point A sequence of events with the same event type can be converted into one generalized event with a time interval. 9

3. Problem definition A sequence ES’ of events with a time interval is denoted by,Where e j ’ = (E x,[vs j, ve j ]) and ve j ≦ vs j+1 for each j = 1,...,n-1. ES’ is a sequence of events for an event type E x. 10 Sequence of events with a time interval

3. Problem definition Suppose that a database of transactions DB is as follows: 11 EXAMPLE SS(Cid = 101) = {,,,, } SS means a set of sequences DB = { T 1 = (101, {D,B}, 1), T 2 = (101, {D,A,E}, 2), T 3 = (101, {A,E,B}, 3), T 4 = (101, {C}, 5, T 5 = (101, {E,C}, 7)} ES(Cid = 101,E) = can be converted into a generalized event (E, [2, 7]). Similary, (D,[1,2]) (B,[1,3]) (A,[2,3]) (C,[5,7])

3. Problem definition A temporal interval relation is defined as R(x,y) = {P(x,y)|(x,y) ∈ Ω,P ∈ IO} The set of temporal interval operators is IO = {before, equals, meets, overlaps, during} R(x,y) : 12 Temporal interval relation Let IE be {(D, [1,2]), (B, [1,3]), (A, [2,3]). There are three temporal interval relations such as meets(D,A), during(D,B), and during(A,B).

3. Problem definition 13

3. Problem definition 14

3. Problem definition The support of E is denoted by Supp(E) and it is the number of customers supporting E in DB. If Supp(E)/ N cust ≧ Supp min (the minimum support assigned by a user), then E is a large event type and an event e having E is a large event. N cust is the number of customers in DB 15 For instance, assume that there are 500 customers and Supp min is 40%. If Supp(E) is 200, E is a large event type since Supp(E)/ N cust = 200/500 = 0.4.

3. Problem definition For instance, suppose that 10 windows with a window size five exist in the lifespan [1, 50] of E i. WFreq min is 50%, WFreq(E i ) is 6, E i is a uniform event type since WFreq(E 1 )/ W num = 6/10 = Window size

3. Problem definition 17 Candidate temporal interval relations

3. Problem definition A temporal interval relation rule TR is defined as TR(e 1 ’,e 2 ’,e 3 ’) = (R 1 (e 1 ’,e 2 ’)|Supp(R 1 )) Λ (R 2 (e 2 ’,e 3 ’)|Supp(R 2 )) Λ (R 3 (e 1 ’,e 3 ’)|Supp(R 3 )) 18

4. Technique for mining temporal Interval relation rules (D,1)(D,2) (B,1)(B,3) (A,2)(A,3) (E,2)(E,3)(E,7) (C,5)  [D(1,2)]  [B(1,3)]  [A(2,3)]  [E(2,7)]  [C(5)] 101 A B C D E 102 B C E G 103 B C D E F 104 A C D E F

4. Technique for mining temporal Interval relation rules 20 Window size = 2 Window length = 12/2 = 6

4. Technique for mining temporal Interval relation rules 21

4. Technique for mining temporal Interval relation rules 22

4. Technique for mining temporal Interval relation rules DAG TR (B, C, E) for TR(B, C, E) is defined as (Vertex, Edge), where Vertex = {B, C, E} Edge = {,,. 23 Event type Events relation Relation support

4. Technique for mining temporal Interval relation rules 24

5. Experimental results 25

5. Experimental results 26

6. Conclusions and future research 27