GPS Trajectories Analysis in MOPSI Project Minjie Chen SIPU group Univ. of Eastern Finland.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

On the Effect of Trajectory Compression in Spatio-temporal Querying Elias Frentzos, and Yannis Theodoridis Data Management Group, University of Piraeus.
Aggregating local image descriptors into compact codes
Clustering Basic Concepts and Algorithms
Learning Trajectory Patterns by Clustering: Comparative Evaluation Group D.
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
On Map-Matching Vehicle Tracking Data
Compression of GPS Trajectories Minjie Chen, Mantao Xu and Pasi Fränti Speech and Image Processing Unit (SIPU) School of Computing University of Eastern.
The Evolution of Spatial Outlier Detection Algorithms - An Analysis of Design CSci 8715 Spatial Databases Ryan Stello Kriti Mehra.
Patch to the Future: Unsupervised Visual Prediction
Constructing Popular Routes from Uncertain Trajectories Ling-Yin Wei 1, Yu Zheng 2, Wen-Chih Peng 1 1 National Chiao Tung University, Taiwan 2 Microsoft.
T-Drive : Driving Directions Based on Taxi Trajectories Microsoft Research Asia University of North Texas Jing Yuan, Yu Zheng, Chengyang Zhang, Xing Xie,
DIMENSIONALITY REDUCTION BY RANDOM PROJECTION AND LATENT SEMANTIC INDEXING Jessica Lin and Dimitrios Gunopulos Ângelo Cardoso IST/UTL December
Trajectories Simplification Method for Location-Based Social Networking Services Presenter: Yu Zheng on behalf of Yukun Cheng, Kai Jiang, Xing Xie Microsoft.
Geographic Information Systems
1 Techniques for Efficient Road- Network-Based Tracking of Moving Objects Speaker : Jia-Hui Huang Date : 2006/10/23.
Tracking Moving Objects in Anonymized Trajectories Nikolay Vyahhi 1, Spiridon Bakiras 2, Panos Kalnis 3, and Gabriel Ghinita 3 1 St. Petersburg State University.
The Tornado Model: Uncertainty Model for Continuously Changing Data Byunggu Yu 1, Seon Ho Kim 2, Shayma Alkobaisi 2, Wan Bae 2, Thomas Bailey 3 Department.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Learning Transportation Mode from Raw GPS Data for Geographic Applications on the Web Yu Zheng, Like Liu, Xing Xie Microsoft Research.
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
Tokyo Research Laboratory © Copyright IBM Corporation 2009 | 2009/04/03 | SDM 09 / Travel-Time Prediction Travel-Time Prediction using Gaussian Process.
Context Awareness System and Service SCENE JS Lee 1 Energy-Efficient Rate-Adaptive GPS-based Positioning for Smartphones.
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
Clustering methods Course code: Pasi Fränti Speech & Image Processing Unit School of Computing University of Eastern Finland Joensuu,
Detecting Movement Type by Route Segmentation and Classification Karol Waga, Andrei Tabarcea, Minjie Chen and Pasi Fränti.
Bei Pan (Penny), University of Southern California
Mining Interesting Locations and Travel Sequences from GPS Trajectories IDB & IDS Lab. Seminar Summer 2009 강 민 석강 민 석 July 23 rd,
Wen He Tsinhua University, Beijing, China and Xi'an Communication Institute, Xi'an, China Deyi Li Tsinhua University, Beijing, China and Chinese.
Knowledge Discovery and Delivery Lab (ISTI-CNR & Univ. Pisa)‏ www-kdd.isti.cnr.it Anna Monreale Fabio Pinelli Roberto Trasarti Fosca Giannotti A. Monreale,
Hierarchical Distributed Genetic Algorithm for Image Segmentation Hanchuan Peng, Fuhui Long*, Zheru Chi, and Wanshi Siu {fhlong, phc,
COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.
 Larger datasets are becoming available from GPS, GSM, RFID, and other sensors.  Interest in movement has shifted from raw movement data analysis to.
Travel Speed Study of Urban Streets Using GPS &GIS Tom E. Sellsted City of Yakima, Washington Information Systems and Traffic.
Time Series Data Analysis - I Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What are Time Series? How to.
Dynamic Programming.
CSE 185 Introduction to Computer Vision Pattern Recognition 2.
FAST DYNAMIC QUANTIZATION ALGORITHM FOR VECTOR MAP COMPRESSION Minjie Chen, Mantao Xu and Pasi Fränti University of Eastern Finland.
Learning and Inferring Transportation Routines By: Lin Liao, Dieter Fox and Henry Kautz Best Paper award AAAI’04.
BING: Binarized Normed Gradients for Objectness Estimation at 300fps
Cluster-specific Named Entity Transliteration Fei Huang HLT/EMNLP 2005.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
Advanced Analytics on Hadoop Spring 2014 WPI, Mohamed Eltabakh 1.
August 30, 2004STDBM 2004 at Toronto Extracting Mobility Statistics from Indexed Spatio-Temporal Datasets Yoshiharu Ishikawa Yuichi Tsukamoto Hiroyuki.
Network Community Behavior to Infer Human Activities.
Feature Selection in k-Median Clustering Olvi Mangasarian and Edward Wild University of Wisconsin - Madison.
Trajectory Data Mining Dr. Yu Zheng Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Editor-in-Chief of ACM Trans.
Trajectory Data Mining Dr. Yu Zheng Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Editor-in-Chief of ACM Trans.
Trajectory Data Mining
Trajectory Simplification: On Minimizing the Direction-based Error
Trajectory Data Mining
Week Aug-24 – Aug-29 Introduction to Spatial Computing CSE 5ISC Some slides adapted from the book Computing with Spatial Trajectories, Yu Zheng and Xiaofang.
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Indexing Time Series. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases Time Series databases Text databases.
1 Travel Times from Mobile Sensors Ram Rajagopal, Raffi Sevlian and Pravin Varaiya University of California, Berkeley Singapore Road Traffic Control TexPoint.
黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Exploring Spatial-Temporal Trajectory Model for Location.
Location-based Social Networks 6/11/20161 CENG 770.
南台科技大學 資訊工程系 Region partition and feature matching based color recognition of tongue image 指導教授:李育強 報告者 :楊智雁 日期 : 2010/04/19 Pattern Recognition Letters,
Chapter 6 Activity Recognition from Trajectory Data Yin Zhu, Vincent Zheng and Qiang Yang HKUST November 2011.
Semi-Supervised Clustering
Transport mode detection in the city of Lyon using mobile phone sensors Jorge Chong Internship for MLDM M1 Jean Monnet University
Final Project – Anomalies Detection
Radu Mariescu-Istodor
Image Segmentation Techniques
Fuzzy Clustering.
Data Transformations targeted at minimizing experimental variance
Radu Mariescu-Istodor
Topological Signatures For Fast Mobility Analysis
Using Clustering to Make Prediction Intervals For Neural Networks
Lecture 16. Classification (II): Practical Considerations
Presentation transcript:

GPS Trajectories Analysis in MOPSI Project Minjie Chen SIPU group Univ. of Eastern Finland

Introduction A number of trajectories/routes are collected of users’ position and time information uses a mobile phone with built-in GPS receiver. The focus of this work is to design efficient algorithm (analysis, compression, etc) on the collected GPS data.

Outline Route reduction Route segmentation and classification Other topics GPS trajectory compression

To save the time cost of route rendering, we propose a multi- resolution polygonal approximation algorithm for estimating approximated route in each scale with linear time complexity and space complexity For one route, we give its corresponding approximated route in five different scale in our system

An example of polygonal approximation for the 5004 points route Initial approximated route with M’ =78 Approximated route (M =73) after reduction ⊿ ISE (P’) = 1.1*10 5 Approximated route after fine-tune step ⊿ ISE (P’) = 6.6*10 4

5004 points, original route

294 points, scale 1

78 points, scale 2

42 points, scale 3

44 points13 points6 points The original route has 575 points in this example

pointsRead file Segment routes Wgs- >utm MRPA Output file Total Sadjad Karol Andrei Pasi Ilkka

3s processing time even for a curve with 2,560,000 points

Route segmentation and classification The focus of this work is to analyze the human behaviour based on the collected GPS data. The collected routes are divided into several segments with different properties (transportation modes), such as stationary, walking, biking, running, or car driving.

Methodology Our approach consists of three parts: GPS signal pre-filtering A change-point-detection for route segmentation An inference algorithm for classification the properties of each segments.

GPS signal pre-filtering GPS signal has an accuracy around 10m, design efficient filtering algorithm is important for route analysis task Our proposed algorithm has two steps: outlier removal and route smooth No prior information is needed (e.g. road network)

Outlier removal Points with impossible speed and variance are detected and removed. Outlier point is removed after filtering

Before After filtering

Considered as a change-point detection problem Our solution has two steps: initialization and merging. We minimize the sum of speed variance for all segments by dynamic programming. Adjacent segments with similar properties are merged together by a pre-trained classifier.

Route 3:Non-moving Route 4: Jogging and running with non-moving interval Route 1: ski Route 2: Jogging and running with non-moving interval

In classification step, we want to classify each segments as stationary, walking, biking, running, or car driving Training a classifier on a number of features (speed, acceleration, time, distance) directly is inaccurate. We also consider the dependency of the properties in neighbor segments by minimizing:

Highway? detect some speed change

Detecting stopping area

Speed slow down in city center

Other info, Parking place?

Karol come to office by bicycle every day?

Future work Route analysis Similarity search

We extend the Longest Common Subsequence Similarity (LCSS) criterion for similarity calculation of two GPS trajectories. LCSS is defined as the time percentage of the overlap segments for two GPS trajectories.

Similar travel interests are found for different users

Cluster A Cluster B A → B 2 routes Starting Time: 16:30-17:00 B → A 6 routes Starting Time: 7:50-8:50 We can guess: A is office B is home nonmoving part in Karol’s routes, maybe his favorite shops

There are some lanes Karol goes frequently, but it doesn’t exist on Google map, road network can be updated in this way. Common stop points (Food shops) Commonly used route which is not existing in the street map Start points (Home of the user)

GPS trajectory compression GPS trajectories include Latitude, Longitude and Timestamp. Storage cost is around 120KB/hour if the data is collected at 1 second interval. For 10,000 users, the storage cost is 30GB/day, 10TB/year. Compression algorithm can save the storage cost significantly

Simple algorithms for GPS trajectory compression Reduce the number of points of the trajectory data, with no further compression process for the reduced data. Difference criterions are used, such as TD-TR, Open Window, STTrace. Synchronous Euclidean distance (SED) is used as the error metrics.

Performance of existing algorithms

Our algorithm Optimizes both for the reduction approximation and the quantization. Dataset: Microsoft Geolife dataset, 640 trajectories, 4,526,030 points Sampling rate: 1s,2s,5s Transportation mode: walking, bus, car and plane or multimodal. The size of uncompressed file : 43KB/hour(binary), 120KB/hour(txt), 300+KB/hour(GPX)

Result

Result

Result

Result: Compression performance Uncompressed (KB) Max SED = 1m (KB) Max SED = 3m (KB) Max SED =10m (KB) 1 Hour Day1, Month31, Year378,4326,5703,4161,664 Compression Ratio

Result: Time cost and average SED Max SED = 1mMax SED = 3mMax SED = 10m Ave_SED(m)0.43± ± ±0.36 Encoding time (second/10000 points) 3.44± ± ±0.45 Decoding time (second/10000 points) 3.44± ± ±0.47

Comparison We also compare the performance of proposed method with the state-of- the-art method TD-TR 1. Compression performance (KB/hour) TD-TR + WinZipProposed Max SED = 1m2.04± ±0.42 Max SED = 3m1.16± ±0.21 Max SED = 10m0.61± ± N. Meratnia and R. A. de By. "Spatiotemporal Compression Techniques for Moving Point Objects", Advances in Database Technology, vol. 2992, pp. 551 – 562, 2004.

42 Trajectory Pattern (Giannotti et al. 07) A trajectory pattern should describe the movements of objects both in space and in time

43 Sample T-Patterns Data Source: Trucks in Athens – 273 trajectories)

44 Trajectory Clustering (Lee et al. 07) 7 Clusters from Hurricane Data 570 Hurricanes (1950~2004) A red line: a representative trajectory

45 Data (Three Classes) Features: 10 Region-Based Clusters 37 Trajectory-Based Clusters Accuracy = 83.3%

Find users with similar behavior (Yu et al. 10) Estimate the similarity between users: semantic location history (SLH) The similarity can include : Geographic overlaps(same place), Semantic overlaps(same type of place), Location sequence.