Mining Frequent Spatio-temporal Sequential Patterns

Slides:



Advertisements
Similar presentations
On the Effect of Trajectory Compression in Spatio-temporal Querying Elias Frentzos, and Yannis Theodoridis Data Management Group, University of Piraeus.
Advertisements

Mining User Similarity Based on Location History Yu Zheng, Quannan Li, Xing Xie Microsoft Research Asia.
An Interactive-Voting Based Map Matching Algorithm
Mining Compressed Frequent- Pattern Sets Dong Xin, Jiawei Han, Xifeng Yan, Hong Cheng Department of Computer Science University of Illinois at Urbana-Champaign.
gSpan: Graph-based substructure pattern mining
ANTLR in SSP Xingzhong Xu Hong Man Aug Outline ANTLR Abstract Syntax Tree Code Equivalence (Code Re-hosting) Future Work.
Rule Discovery from Time Series Presented by: Murali K. Kadimi.
Mining Mobile Group Patterns: A Trajectory-based Approach San-Yih Hwang, Ying-Han Liu, Jeng-Kuen Chiu NSYSU, Taiwan Ee-Peng Lim NTU, Singapore.
Patch to the Future: Unsupervised Visual Prediction
Constructing Popular Routes from Uncertain Trajectories Ling-Yin Wei 1, Yu Zheng 2, Wen-Chih Peng 1 1 National Chiao Tung University, Taiwan 2 Microsoft.
Critical Analysis Presentation: T-Drive: Driving Directions based on Taxi Trajectories Authors of Paper: Jing Yuan, Yu Zheng, Chengyang Zhang, Weilei Xie,
Trajectory Pattern Mining NTU IM Hsieh, Hsun-Ping Trajectory Pattern Mining Reporter : Hsieh, Hsun-Ping 解巽評 (R ) Fosca Giannotti Mirco Nanni Dino.
Transfer Learning for WiFi-based Indoor Localization
Cascading Spatio-Temporal Pattern Discovery P. Mohan, S.Shekhar, J. Shine, J. Rogers CSci 8715 Presented by: Atanu Roy Akash Agrawal.
Mining Sequence Patterns from Wind Tunnel Experimental Data Zhenyu Liu †, Wesley W. Chu †, Adam Huang ‡, Chris Folk ‡, Chih-Ming Ho ‡
Temporal Pattern Matching of Moving Objects for Location-Based Service GDM Ronald Treur14 October 2003.
On Discovering Moving Clusters in Spatio-temporal Data Panos Kalnis National University of Singapore Nikos Mamoulis University of Hong Kong Spiridon Bakiras.
1 SINA: Scalable Incremental Processing of Continuous Queries in Spatio-temporal Databases Mohamed F. Mokbel, Xiaopeng Xiong, Walid G. Aref Presented by.
1 Convolution and Its Applications to Sequence Analysis Student: Bo-Hung Wu Advisor: Professor Herng-Yow Chen & R. C. T. Lee Department of Computer Science.
Fast Algorithms for Association Rule Mining
Detecting Time Series Motifs Under
Mining Sequences. Examples of Sequence Web sequence:  {Homepage} {Electronics} {Digital Cameras} {Canon Digital Camera} {Shopping Cart} {Order Confirmation}
Learning Transportation Mode from Raw GPS Data for Geographic Applications on the Web Yu Zheng, Like Liu, Xing Xie Microsoft Research.
Statistical Critical Path Selection for Timing Validation Kai Yang, Kwang-Ting Cheng, and Li-C Wang Department of Electrical and Computer Engineering University.
GeoPKDD Geographic Privacy-aware Knowledge Discovery and Delivery Kick-off meeting Pisa, March 14, 2005.
Friends and Locations Recommendation with the use of LBSN
1 Contact Prediction, Routing and Fast Information Spreading in Social Networks Kazem Jahanbakhsh Computer Science Department University of Victoria August.
Discovering the Intrinsic Cardinality and Dimensionality of Time Series using MDL BING HU THANAWIN RAKTHANMANON YUAN HAO SCOTT EVANS1 STEFANO LONARDI EAMONN.
Sequential PAttern Mining using A Bitmap Representation
Trajectory Pattern Mining
Energy-Aware Scheduling with Quality of Surveillance Guarantee in Wireless Sensor Networks Jaehoon Jeong, Sarah Sharafkandi and David H.C. Du Dept. of.
Spatial-Temporal Models in Location Prediction Jingjing Wang 03/29/12.
Friends and Locations Recommendation with the use of LBSN By EKUNDAYO OLUFEMI ADEOLA
Stylization and Abstraction of Photographs Doug Decarlo and Anthony Santella.
Shape-based Similarity Query for Trajectory of Mobile Object NTT Communication Science Laboratories, NTT Corporation, JAPAN. Yutaka Yanagisawa Jun-ichi.
2131 Structured System Analysis and Design By Germaine Cheung Hong Kong Computer Institute Lecture 8 (Chapter 7) MODELING SYSTEM REQUIREMENTS WITH USE.
INFERRING HUMAN ACTIVITY FROM GPS TRACKS Sun Simiao.
A data-mining approach for multiple structural alignment of proteins WY Siu, N Mamoulis, SM Yiu, HL Chan The University of Hong Kong Sep 9, 2009.
ICDE, San Jose, CA, 2002 Discovering Similar Multidimensional Trajectories Michail VlachosGeorge KolliosDimitrios Gunopulos UC RiversideBoston UniversityUC.
Algorithms For Time Series Knowledge Mining Fabian Moerchen 沈奕聰.
MINING COLOSSAL FREQUENT PATTERNS BY CORE PATTERN FUSION FEIDA ZHU, XIFENG YAN, JIAWEI HAN, PHILIP S. YU, HONG CHENG ICDE07 Advisor: Koh JiaLing Speaker:
1 AC-Close: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery Advisor : Dr. Koh Jia-Ling Speaker : Tu Yi-Lang Date : Hong.
Intelligent DataBase System Lab, NCKU, Taiwan Josh Jia-Ching Ying 1, Wang-Chien Lee 2, Tz-Chiao Weng 1 and Vincent S. Tseng 1 1 Department of Computer.
Mining Trajectory Profiles for Discovering User Communities Speaker : Chih-Wen Chang National Chiao Tung University, Taiwan Chih-Chieh Hung,
LogTree: A Framework for Generating System Events from Raw Textual Logs Liang Tang and Tao Li School of Computing and Information Sciences Florida International.
Efficient Data Compression in Location Based Services Yuni Xia, Yicheng Tu, Mikhail Atallah, Sunil Prabhakar.
Intelligent Database Systems Lab Advisor : Dr.Hsu Graduate : Keng-Wei Chang Author : Salvatore Orlando Raffaele Perego Claudio Silvestri 國立雲林科技大學 National.
Trajectory Data Mining Dr. Yu Zheng Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Editor-in-Chief of ACM Trans.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 A personal route prediction system base on trajectory.
Chaoyang University of Technology Clustering web transactions using rough approximation Source : Fuzzy Sets and Systems 148 (2004) 131–138 Author : Supriya.
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Engine measurement dataset Engine Sub-system Measurements from Metro-Transit Buses in Twin-Cities Sample Engine Variables:  GPS Speed and Position  Vehicle.
Extracting stay regions with uncertain boundaries from GPS trajectories a case study in animal ecology Haidong Wang.
An Energy-Efficient Approach for Real-Time Tracking of Moving Objects in Multi-Level Sensor Networks Vincent S. Tseng, Eric H. C. Lu, & Kawuu W. Lin Institute.
黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Exploring Spatial-Temporal Trajectory Model for Location.
Density-based Place Clustering in Geo-Social Networks Jieming Shi, Nikos Mamoulis, Dingming Wu, David W. Cheung Department of Computer Science, The University.
Gspan: Graph-based Substructure Pattern Mining
DATA MINING © Prentice Hall.
A Melody Composer for both Tonal and Non-Tonal Languages
G10 Anuj Karpatne Vijay Borra
SMART APPLICATIONS FOR SMART CITY: A CONTRIBUTION TO INNOVATION
Video Summarization by Spatial-Temporal Graph Optimization
Lin Lu, Margaret Dunham, and Yu Meng
Spatial Online Sampling and Aggregation
CS 548 Sequence Mining Showcase By Bian Du, Wa Gao, and Cam Jones
Continuous Motion Pattern Query
Mining Long, Sharable Patterns in Trajectories of Moving Objects
Discovery of Significant Usage Patterns from Clickstream Data
Discovering Frequent Poly-Regions in DNA Sequences
Presentation transcript:

Mining Frequent Spatio-temporal Sequential Patterns Authors: Huiping Cao, Nikos Mamoulis, David W. Cheung Department of Computer Science, University of Hong Kong Accepted to ICDM 2005 Abhinaya Sinha Vijay Gandhi CSCI 8715 Date: 4/3/06

Outline Introduction Problem Definition Contribution Approach Experimental Evaluation Assumptions Rewrite today

Spatio-Temporal Pattern Spatio-temporal sequence is a list of time-sampled locations. (x1,y1,t1), (x2,y2,t2),…,(xn,yn,tn) A pattern is a subset of the sequence A pattern is frequent if its support >= threshold value Bus routes across 3 runs

Motivation Predict trajectory of an object Detect frequent routes followed by an object Ecology CHIMPS project at UofM (http://www.discoverchimps.org) Tracking chimpanzee movements and analyzing their behavior Location based services Where a user may go next and thus, provide customized services

Problem Definition Input: Given spatio-temporal sequence S and support threshold min_sup Output: Frequent spatio-temporal patterns Objective: Lower computation time and space Constraints: Use of a device such as GPS to sample locations

Contributions Model for spatio-temporal sequential pattern mining Algorithm for extracting frequent singular spatial pattern elements a.k.a. spatial regions Algorithm for combining single regions to patterns of longer length

Methodology Convert spatio-temporal sequence to set of line segments i.e. abstract the trajectory Detect frequent, singular regions Combine singular patterns to longer sequences

Modeling the problem Aim: abstract (approximate) the trajectory Input: original spatio-temporal sequence Output: sequence segments Need locations are not repeated exactly Derived line segments are used for defining spatial regions Compress original data and decrease effort required in mining Uses Douglas-Pecker (DP) algorithm for performing this step

Concepts in Modeling Spatio-temporal Segment Representative line segment of a segment Similarity between line segments | l1.angle – l2.angle | <= θ | l1.len – l2.len | <= length factor * max ( l1.len, l2.len) Closeness between line segments Distance (points belonging to line1, line2) <= ε Mean line segment of a set of line segments Spatial pattern element/ spatial region Sides determined by mean line segment and average orthogonal distance Spatio-temporal sequential pattern Ordered sequence of spatial pattern elements Support of a pattern number of sequences supporting it

Discovering frequent singular patterns Input: All segments representing the spatio-temporal sequence, support threshold Output: All segments have been assigned to regions or been found as outliers. This step constructs frequent single spatial regions Uses similarity and closeness to assign segments to regions

Deriving longer patterns Goal is to derive longer patterns from the singular frequent patterns obtained in previous step. Input is series SR of spatial regions. Output: Frequent patterns. Two algorithms Level-wise mining Mining using substring tree

Example Input: r1-r2-r1-r2-r3 minsup = 2 Output candidates r1-r2 #2; r2-r1 #1; r2-r3 #1 r1 #2 r2 #2; r3 #1 Frequent Patterns: r1-r2; r1; r2

Level-wise pattern mining Apriori Algorithm e.g. input: r1-r2-r1-r2 r1(2) r2(2) r1-r2(2) r2-r1(1) Output: r1; r2; r1-r2

Contribution: Improvements Connectivity constraint Closeness property Input: r1-r2-r1-r3-r1-r2 minsup: 2 r1(3) r2(2) r3(1) r1-r2 r2-r1 r1-r3 r3-r1 r2r3 r3r2

Mining using substring tree Tree structure with each node is pattern element The substring tree is used to help in counting long substrings Depth first search gives substring e.g. r1-r2-r1-r3-r1-r2

Validation Methodology Computer simulations Candidate Algorithms: Level-wise, Grid I, Grid II, Substring tree Evaluation done on Real-world dataset Synthetic dataset

Experimental Evaluation Substring takes least time

Experimental Evaluation Extracted patterns Comparison with grid-based approach

Assumptions Spatio-temporal sequence is a list of (x,y) values Trajectory is abstracted by line segments Closeness and similarity of line segments is defined using 2-D data constructs Hence is only applicable to 2-D data

Rewrite Today Experimental evaluation on more real-life datasets Account for noise in input data (inaccuracy of GPS data) Explore techniques that abstract the spatial trajectory with higher accuracy

Any questions??