Reducing Uncertainty of Low-sampling-rate Trajectories Kai Zheng, Yu Zheng, Xing Xie, Xiaofang Zhou University of Queensland & Microsoft Research Asia.

Slides:



Advertisements
Similar presentations
Xiaolei Li, Zhenhui Li, Jiawei Han, Jae-Gil Lee. 1. Motivation 2. Anomaly Definitions 3. Algorithm 4. Experiments 5. Conclusion.
Advertisements

Incremental Clustering for Trajectories
Mining User Similarity Based on Location History Yu Zheng, Quannan Li, Xing Xie Microsoft Research Asia.
An Interactive-Voting Based Map Matching Algorithm
Driving with Knowledge from the Physical World Jing Yuan, Yu Zheng Microsoft Research Asia.
PRESS: A Novel Framework of Trajectory Compression in Road Networks
Urban Computing with Taxicabs
Learning Trajectory Patterns by Clustering: Comparative Evaluation Group D.
On Map-Matching Vehicle Tracking Data
VTrack: Accurate, Energy-Aware Road Traffic Delay Estimation Using Mobile Phones Arvind Thiagarajan, Lenin Ravindranath, Katrina LaCurts, Sivan Toledo,
Retrieving k-Nearest Neighboring Trajectories by a Set of Point Locations Lu-An Tang, Yu Zheng, Xing Xie, Jing Yuan, Xiao Yu, Jiawei Han University of.
Patch to the Future: Unsupervised Visual Prediction
Yoshiharu Ishikawa (Nagoya University) Yoji Machida (University of Tsukuba) Hiroyuki Kitagawa (University of Tsukuba) A Dynamic Mobility Histogram Construction.
4/15/2017 Using Gaussian Process Regression for Efficient Motion Planning in Environments with Deformable Objects Barbara Frank, Cyrill Stachniss, Nichola.
Constructing Popular Routes from Uncertain Trajectories Authors of Paper: Ling-Yin Wei (National Chiao Tung University, Hsinchu) Yu Zheng (Microsoft Research.
Constructing Popular Routes from Uncertain Trajectories Ling-Yin Wei 1, Yu Zheng 2, Wen-Chih Peng 1 1 National Chiao Tung University, Taiwan 2 Microsoft.
Travel Time Estimation of a Path using Sparse Trajectories
Critical Analysis Presentation: T-Drive: Driving Directions based on Taxi Trajectories Authors of Paper: Jing Yuan, Yu Zheng, Chengyang Zhang, Weilei Xie,
A Mobile Infrastructure Based VANET Routing Protocol in the Urban Environment School of Electronics Engineering and Computer Science, PKU, Beijing, China.
T-Drive : Driving Directions Based on Taxi Trajectories Microsoft Research Asia University of North Texas Jing Yuan, Yu Zheng, Chengyang Zhang, Xing Xie,
--Presented By Sudheer Chelluboina. Professor: Dr.Maggie Dunham.
Trajectories Simplification Method for Location-Based Social Networking Services Presenter: Yu Zheng on behalf of Yukun Cheng, Kai Jiang, Xing Xie Microsoft.
The Fourth WIM Meeting 1 Active Nearest Neighbor Queries for Moving Objects Jan Kolar, Igor Timko.
Tracking Moving Objects in Anonymized Trajectories Nikolay Vyahhi 1, Spiridon Bakiras 2, Panos Kalnis 3, and Gabriel Ghinita 3 1 St. Petersburg State University.
Quickest path and Quickest routing: A dynamic routing method Research Topic: Jiang, XidongMS candidate in computer science at California State University,
Scalable Network Distance Browsing in Spatial Database Samet, H., Sankaranarayanan, J., and Alborzi H. Proceedings of the 2008 ACM SIGMOD international.
Route Planning Vehicle navigation systems, Dijkstra’s algorithm, bidirectional search, transit-node routing.
Trip Planning Queries F. Li, D. Cheng, M. Hadjieleftheriou, G. Kollios, S.-H. Teng Boston University.
1 Vehicular Sensor Networks for Traffic Monitoring In proceedings of 17th International Conference on Computer Communications and Networks (ICCCN 2008)
Learning Transportation Mode from Raw GPS Data for Geographic Applications on the Web Yu Zheng, Like Liu, Xing Xie Microsoft Research.
Exploration of Ground Truth from Raw GPS Data National University of Defense Technology & Hong Kong University of Science and Technology Exploration of.
Distance Indexing on Road Networks A summary Andrew Chiang CS 4440.
Sensys 2009 Speaker:Lawrence.  Introduction  Overview & Challenges  Algorithm  Travel Time Estimation  Evaluation  Conclusion.
Performance Evaluation of Vehicular DTN Routing under Realistic Mobility Models Pei’en LUO.
Mining Interesting Locations and Travel Sequences from GPS Trajectories IDB & IDS Lab. Seminar Summer 2009 강 민 석강 민 석 July 23 rd,
1 A Bayesian Method for Guessing the Extreme Values in a Data Set Mingxi Wu, Chris Jermaine University of Florida September 2007.
On Graph Query Optimization in Large Networks Alice Leung ICS 624 4/14/2011.
Scientific Writing Abstract Writing. Why ? Most important part of the paper Number of Readers ! Make people read your work. Sell your work. Make your.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Relay Placement Problem in Smart Grid Deployment Wei-Lun Wang and Quincy Wu Department of Computer Science and Information Engineering, National Chi Nan.
Easiest-to-Reach Neighbor Search Fatimah Aldubaisi.
Group 8: Denial Hess, Yun Zhang Project presentation.
University “Ss. Cyril and Methodus” SKOPJE Cluster-based MDS Algorithm for Nodes Localization in Wireless Sensor Networks Ass. Biljana Stojkoska.
Robotics Club: 5:30 this evening
Performance of Adaptive Beam Nulling in Multihop Ad Hoc Networks Under Jamming Suman Bhunia, Vahid Behzadan, Paulo Alexandre Regis, Shamik Sengupta.
Trajectory Data Mining Dr. Yu Zheng Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Editor-in-Chief of ACM Trans.
Real-Time Trip Information Service for a Large Taxi Fleet
Trends: Spatio-temporal graphs Introduction to Spatial Computing.
Week Aug-24 – Aug-29 Introduction to Spatial Computing CSE 5ISC Some slides adapted from the book Computing with Spatial Trajectories, Yu Zheng and Xiaofang.
U of Minnesota DIWANS'061 Energy-Aware Scheduling with Quality of Surveillance Guarantee in Wireless Sensor Networks Jaehoon Jeong, Sarah Sharafkandi and.
TÜBİTAK An Optimization Approach for Airport Ground Operations with A Shortest Path Algorithm 12 November 2015 Orhan Eroglu - TUBITAK BILGEM, Turkey Zafer.
Urban Traffic Simulated From A Dual Perspective Hu Mao-Bin University of Science and Technology of China Hefei, P.R. China
1 Travel Times from Mobile Sensors Ram Rajagopal, Raffi Sevlian and Pravin Varaiya University of California, Berkeley Singapore Road Traffic Control TexPoint.
C ROWD P LANNER : A C ROWD -B ASED R OUTE R ECOMMENDATION S YSTEM Han Su, Kai Zheng, Jiamin Huang, Hoyoung Jeung, Lei Chen, Xiaofang Zhou.
Incrementally Improving Lookup Latency in Distributed Hash Table Systems Hui Zhang 1, Ashish Goel 2, Ramesh Govindan 1 1 University of Southern California.
VADD: Vehicle-Assisted Data Delivery in Vehicular Ad Hoc Networks Zhao, J.; Cao, G. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 鄭宇辰
Presented by: Siddhant Kulkarni Spring Authors: Publication:  ICDE 2015 Type:  Research Paper 2.
ParkNet: Drive-by Sensing of Road-Side Parking Statistics Irfan Ullah Department of Information and Communication Engineering Myongji university, Yongin,
Shape2Pose: Human Centric Shape Analysis CMPT888 Vladimir G. Kim Siddhartha Chaudhuri Leonidas Guibas Thomas Funkhouser Stanford University Princeton University.
Modeling Perspective Effects in Photographic Composition Zihan Zhou, Siqiong He, Jia Li, and James Z. Wang The Pennsylvania State University.
A Flexible Spatio-temporal indexing Scheme for Large Scale GPS Tracks Retrieval Yu Zheng, Longhao Wang, Xing Xie Microsoft Research.
Presented by: Mi Tian, Deepan Sanghavi, Dhaval Dholakia
T-Share: A Large-Scale Dynamic Taxi Ridesharing Service
Urban Sensing Based on Human Mobility
Probabilistic Data Management
Introduction Secondary Users (SUs) Primary Users (PUs)
Finding Fastest Paths on A Road Network with Speed Patterns
Showcasing work by Jing Yuan, Yu Zheng, Xing Xie, Guangzhou Sun
Continuous Density Queries for Moving Objects
Topological Signatures For Fast Mobility Analysis
Presentation transcript:

Reducing Uncertainty of Low-sampling-rate Trajectories Kai Zheng, Yu Zheng, Xing Xie, Xiaofang Zhou University of Queensland & Microsoft Research Asia ICDE 2012, Washington D.C.

Outline Introduction Problem Methodologies Evaluation

Trajectories in mathematical and real worlds A location trajectory is a record of the path of a variety of moving objects, such as people, vehicles, animals and nature phenomena From mathematics point, a trajectory is a continuous mapping from time to space In real world, GPS devices can only report their locations on discrete time instants. Essentially, a real world trajectory is a sample of its counterpart in mathematical world.

Trajectories in mathematical and real worlds

Low-sampling-rate Issues Since we always use a sample to approximate the original trajectory of the moving object, higher sampling rate results in better approximation However, huge amount of low-sampling-rate trajectories exist in many scenarios

Low-sampling-rate Issues (Cont.) GPS devices report their location at low frequency to save battery and communication cost Less than 17% of trajectories with sampling rate > every 2 mins, based on taxicabs of Beijing Tourists can upload their photos with geo-tags to photo sharing services (Flickr etc), which also form trajectories of their travel routes

Impact of low-sampling-rate Detailed travel information is lost Uncertainty arise when querying against such kind of data Making decision solely based on these data can be unhelpful (e.g. traffic management, urban planning)

Traditional methodologies Just ignore this issue, and process as usual Uncertainty-awareness trajectory models, indexes, and queries Space-time prism model, necklace model Probabilistic queries (range and NN)

Our idea Can we reduce the uncertainty caused by the low- sampling-rate before the trajectories undergo further processing? To be more specific, can we estimate its original route from the samples? Our basic idea is to leverage the historical trajectory data as well as the following two observations.

Key Observation – 1 Travel patterns between certain locations are often highly skewed we can find some popular routes between certain locations Limitation: we need a reasonably large set of quality trajectories with high-sampling-rate, so that we can know their routes

A B C

Key Observation – 2 Trajectories sharing the same/similar routes can often complement each other to make themselves more complete In other words, it’s possible to interpolate a low-sampling-rate trajectory by cross-referring other trajectories on the same/similar route, so that they all become high-sampling- rate

Challenges on real data Data sparseness Trajectories are sparse compared with the space A query can be given with any origin and destination, which may not exist in historical dataset Data quality The trajectory dataset is mixed with high- and low-sampling-rate trajectories GPS locations can be off-road (in most case they are!) Outlier

Outline Introduction Problem Methodologies Evaluation

Problem statement Input A set of historical trajectories (various qualities) A road network A user-given query trajectory with low-sampling-rate Output A few possible routes of this query trajectory

Main contributions Propose a new idea and framework on how to deal with low-sampling-rate trajectories Develop a system based on real-world large trajectory dataset Trajectories of taxicabs in Beijing

Outline Introduction Problem Methodologies Evaluation

System Overview

Outline Introduction Problem Methodologies Pre-processing Reference trajectory search Local route inference Global route inference Evaluation

Preprocessing (on historical data) Trip partition A GPS log contains the record of movement for a long period Partition a long trajectory into meaningful trips Concept: stay point [zheng2009mining] Map matching for GPS points Candidate edges Indexing all the GPS points

Route inference Search for reference trajectories Select the relevant historical trajectories that may be helpful in inferring the route of the query Local route inference Inferring the routes between consecutive samples of query Global route inference Inferring the whole routes by connecting the local routes

Outline Introduction Problem Methodologies Pre-processing Reference trajectory search Local route inference Global route inference Evaluation

Reference trajectory search Intuitively, we only need to utilize the ones in the surrounding area of the query since the relationship between two trajectories faraway from each other is usually Simple and spliced reference trajectory

Reference trajectory search (cont.) Simple reference trajectory They natively exist in the trajectory archive

T1, T2 -- yes T3, T4 – no Reference trajectory search (cont.)

Spliced reference trajectory They don’t exist in the trajectory archive by nature Formed by splicing two parts of trajectories

T1, T2, T4 – not simple reference trajectory Parts of T1 and T2 can form a reference trajectory Reference trajectory search (cont.)

Why we only consider two consecutive points? Why we propose spliced reference trajectory? Reference trajectory search (cont.) Data sparseness!

Outline Introduction Problem Methodologies Pre-processing Reference trajectory search Local route inference Global route inference Evaluation

Local route inference Basic idea is to treat all the reference trajectories collectively Using the points from reference trajectories as the evidence of popularity of each road Traverse graph based approach Nearest neighbor based approach

Traverse graph based approach Intuition: if a road segment is not travelled by any reference, there is a high chance that the query object did not pass by it either Focus on the road segments traversed by some reference trajectories rather than all the edges in the road network

Traverse graph based approach (cont.) Essentially, the traverse graph is a conceptual graph that incorporates the topological structure of the underlying road network as well as the distribution of reference trajectories

Traverse graph based approach (cont.)

Nearest neighbor based approach Consider all the reference points in Euclidean space Try to find a continuous hops with shortest Euclidean distance from origin to destination via the reference points Recursively search for kNN of the current position and jump to one of the kNNs

Nearest neighbor based approach (cont.)

We will keep track of each path that has been built. So if another recursion hits any node of this path, we can re-use them

Nearest neighbor based approach (cont.) Pros: more adaptive to the distribution of the reference trajectories Cons: not as reliable as the traverse graph not efficient when the number of reference points increase

Hybrid approach Combine the advantage of both approaches Detect the density of reference points in surrounding area High density: traverse graph based Low density: nearest neighbor based

Outline Introduction Problem Methodologies Pre-processing Reference trajectory search Local route inference Global route inference Evaluation

Global route inference

Global route inference (cont.)

The quality of a global route depends on The quality of each local route The quality of the connections between local routes Correspondingly, popularity function for each local route transition confidence function for the connections

Global route inference (cont.) Popularity of a local route How many traffic on the route The distribution of the traffic on each road of the route

Global route inference (cont.)

We try to find the subset of global routes that maximize the global route score Downward closure property holds: an optimal route implies an optimal sub-route Can be solved by Dynamic Programming method

Outline Introduction Problem Methodologies Evaluation

Experiment setup Historical dataset: 100K raw trajectories of 33,000+ Beijing taxicabs over 3 months as the historical trajectory set (about 10% have at least one sample point in every 2 minutes) Beijing digital map with 106,579 road nodes and 141,380 road segments Query trajectories are from Geolife project

Evaluation approach Ground truth: query trajectories from Geolife are of high- sampling-rate, so we know their original routes We re-sample the queries using low-sampling-rate as the input of our system for test purpose Compare the route recovered by our methods against the original one

Evaluation approach As comparison, we use three map-matching algorithm to align the samples onto the road and interpolate by shortest path Incremental method [Greenfeld2002matching] ST-matching [lou2009map] IVMM algorithm [yuan2010interactive]

Results summary (sample/minute) Accuracy w.r.t. sampling rate

Results summary (cont.) Accuracy w.r.t. query length

Results summary (cont.) Effect of search radius for reference trajectories

Results summary (cont.) Effect of density of reference points

Results summary (cont.)

Conclusion and future work Adopt a new perspective to deal with the data quality issue in real trajectory base Develop a systematic framework based on real historical taxi data to demonstrate the feasibility of our proposals We haven’t considered personalization so far, which may be another interesting direction It may be helpful to incorporate more environmental factors into the system, such as the weather, time, real-time traffic condition, etc.

Thank you & welcome to Brisbane for ICDE’13!