PRESS: A Novel Framework of Trajectory Compression in Road Networks

Slides:



Advertisements
Similar presentations
Introduction to Algorithms
Advertisements

Multimedia Data Compression
15th CTI Workshop, July 26, Smart Itinerary Recommendation based on User-Generated GPS Trajectories Hyoseok Yoon 1, Y. Zheng 2, X. Xie 2 and W.
and 6.855J Cycle Canceling Algorithm. 2 A minimum cost flow problem , $4 20, $1 20, $2 25, $2 25, $5 20, $6 30, $
Scalable Routing In Delay Tolerant Networks
0 - 0.
Improving Compression Ratio, Area Overhead, and Test Application Time in System-on-a-chip Test Data Compression/Decompression Paul Theo Gonciari*, Bashir.
On the Effect of Trajectory Compression in Spatio-temporal Querying Elias Frentzos, and Yannis Theodoridis Data Management Group, University of Piraeus.
Information Retrieval in Practice
Multi-Guarded Safe Zone: An Effective Technique to Monitor Moving Circular Range Queries Presented By: Muhammad Aamir Cheema 1 Joint work with Ljiljana.
Research Challenges in the CarTel Mobile Sensor System Samuel Madden Associate Professor, MIT.
Capacity-Approaching Codes for Reversible Data Hiding Weiming Zhang, Biao Chen, and Nenghai Yu Department of Electrical Engineering & Information Science.
An Interactive-Voting Based Map Matching Algorithm
February 12, 2007 WALCOM '2007 1/22 DiskTrie: An Efficient Data Structure Using Flash Memory for Mobile Devices N. M. Mosharaf Kabir Chowdhury Md. Mostofa.
International Data Encryption Algorithm
CO-AUTHOR RELATIONSHIP PREDICTION IN HETEROGENEOUS BIBLIOGRAPHIC NETWORKS Yizhou Sun, Rick Barber, Manish Gupta, Charu C. Aggarwal, Jiawei Han 1.
Driving with Knowledge from the Physical World Jing Yuan, Yu Zheng Microsoft Research Asia.
Succinct Data Structures for Permutations, Functions and Suffix Arrays
1 ECE734 VLSI Arrays for Digital Signal Processing Loop Transformation.
Combinatorial Pattern Matching CS 466 Saurabh Sinha.
Urban Computing with Taxicabs
Introduction to Computer Science 2 Lecture 7: Extended binary trees
Applied Algorithmics - week7
Michael Alves, Patrick Dugan, Robert Daniels, Carlos Vicuna
On Map-Matching Vehicle Tracking Data
Greedy Algorithms (Huffman Coding)
Yoshiharu Ishikawa (Nagoya University) Yoji Machida (University of Tsukuba) Hiroyuki Kitagawa (University of Tsukuba) A Dynamic Mobility Histogram Construction.
Constructing Popular Routes from Uncertain Trajectories Authors of Paper: Ling-Yin Wei (National Chiao Tung University, Hsinchu) Yu Zheng (Microsoft Research.
Constructing Popular Routes from Uncertain Trajectories Ling-Yin Wei 1, Yu Zheng 2, Wen-Chih Peng 1 1 National Chiao Tung University, Taiwan 2 Microsoft.
Critical Analysis Presentation: T-Drive: Driving Directions based on Taxi Trajectories Authors of Paper: Jing Yuan, Yu Zheng, Chengyang Zhang, Weilei Xie,
Data Compression Michael J. Watts
Compression & Huffman Codes
T-Drive : Driving Directions Based on Taxi Trajectories Microsoft Research Asia University of North Texas Jing Yuan, Yu Zheng, Chengyang Zhang, Xing Xie,
SWE 423: Multimedia Systems
T.Sharon-A.Frank 1 Multimedia Size of Data Frame.
Compression & Huffman Codes Fawzi Emad Chau-Wen Tseng Department of Computer Science University of Maryland, College Park.
Tracking Moving Objects in Anonymized Trajectories Nikolay Vyahhi 1, Spiridon Bakiras 2, Panos Kalnis 3, and Gabriel Ghinita 3 1 St. Petersburg State University.
Document and Query Forms Chapter 2. 2 Document & Query Forms Q 1. What is a document? A document is a stored data record in any form A document is a stored.
Scalable Network Distance Browsing in Spatial Database Samet, H., Sankaranarayanan, J., and Alborzi H. Proceedings of the 2008 ACM SIGMOD international.
Lossless Data Compression Using run-length and Huffman Compression pages
CSE Lectures 22 – Huffman codes
Reducing Uncertainty of Low-sampling-rate Trajectories Kai Zheng, Yu Zheng, Xing Xie, Xiaofang Zhou University of Queensland & Microsoft Research Asia.
VLDB '2006 Haibo Hu (Hong Kong Baptist University, Hong Kong) Dik Lun Lee (Hong Kong University of Science and Technology, Hong Kong) Victor.
A Summary of XISS and Index Fabric Ho Wai Shing. Contents Definition of Terms XISS (Li and Moon, VLDB2001) Numbering Scheme Indices Stored Join Algorithms.
PMLAB Finding Similar Image Quickly Using Object Shapes Heng Tao Shen Dept. of Computer Science National University of Singapore Presented by Chin-Yi Tsai.
Image Compression (Chapter 8) CSC 446 Lecturer: Nada ALZaben.
Lossless Compression CIS 465 Multimedia. Compression Compression: the process of coding that will effectively reduce the total number of bits needed to.
COMPRESSION. Compression in General: Why Compress? So Many Bits, So Little Time (Space) CD audio rate: 2 * 2 * 8 * = 1,411,200 bps CD audio storage:
Mingyang Zhu, Huaijiang Sun, Zhigang Deng Quaternion Space Sparse Decomposition for Motion Compression and Retrieval SCA 2012.
Huffman Code and Data Decomposition Pranav Shah CS157B.
AAAI 2011, San Francisco Trajectory Regression on Road Networks Tsuyoshi Idé (IBM Research – Tokyo) Masashi Sugiyama (Tokyo Institute of Technology)
August 30, 2004STDBM 2004 at Toronto Extracting Mobility Statistics from Indexed Spatio-Temporal Datasets Yoshiharu Ishikawa Yuichi Tsukamoto Hiroyuki.
CS654: Digital Image Analysis Lecture 34: Different Coding Techniques.
Trajectory Data Mining Dr. Yu Zheng Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Editor-in-Chief of ACM Trans.
Trajectory Data Mining
U of Minnesota DIWANS'061 Energy-Aware Scheduling with Quality of Surveillance Guarantee in Wireless Sensor Networks Jaehoon Jeong, Sarah Sharafkandi and.
Lossless Decomposition and Huffman Codes Sophia Soohoo CS 157B.
Computer Sciences Department1. 2 Data Compression and techniques.
Data Compression Michael J. Watts
Managing Massive Trajectories on the Cloud
Design & Analysis of Algorithm Huffman Coding
Compression & Huffman Codes
Data Compression.
Tries 07/28/16 11:04 Text Compression
T-Share: A Large-Scale Dynamic Taxi Ridesharing Service
Pagerank and Betweenness centrality on Big Taxi Trajectory Graph
Traffic Sign Recognition Using Discriminative Local Features Andrzej Ruta, Yongmin Li, Xiaohui Liu School of Information Systems, Computing and Mathematics.
Mining Spatio-Temporal Reachable Regions over Massive Trajectory Data
Dr. Clincy Professor of CS
Topological Signatures For Fast Mobility Analysis
Presentation transcript:

PRESS: A Novel Framework of Trajectory Compression in Road Networks Renchu Song, Weiwei Sun, Fudan University Baihua Zheng, Singapore Management University Yu Zheng, Microsoft Research, Beijing

Background Big Data Huge volume of spatial trajectories cause heavy burden to data storage and data process Trajectories contain redundant parts that contribute very limited to spatial and temporal information Solution: Trajectory Compression

PRESS: Paralleled Road-Network-based Trajectory Compression Map trajectory Map matcher Trajectory re-formatter Temporal sequence Spatial path Spatial compressor Temporal compressor GPS trajectory Location-based services Compressed spatial path Compressed temporal sequence Query processor

PRESS (cont’d) Key highlights Separate the spatial path from the temporal information when presenting a trajectory Propose a lossless spatial compression algorithm HSC Propose an error-bounded temporal compression algorithm BTC Support multiple popular location-based services without fully decompressing the trajectories

Trajectory Representation Traditional representation (x1, y1, t1), (x2, y2, t1) … Spatial path The sequence of road segments passed by a trajectory Temporal sequence The sequence of (di, ti) vectors di refers to the distance travelled from the start of the trajectory until time stamp ti

HSC: Spatial Compression Hybrid Spatial Compression (HSC) is lossless, and it consists of two stages STAGE 1 Shortest Path Compression Input: spatial path (consecutive edge sequence) Output: non-consecutive edge sequence Input: non-consecutive edge sequence Output binary code STAGE 2 Frequent Sub-Tra. Compression

HSC Stage 1: Shortest Path Compression Observation: given a source s and a destination d, most of the time we take the shortest path between s and d if all the edges roughly share the similar traffic condition Given an edge sequence If the sequence refers to the shortest path from to , we will replace the sequence with

HSC Stage 2: Frequent Sub-trajectory Compression Observation: certain road segments are much more popular than others Basic idea: We can treat the sequence of edges as a string, and can employ suitable coding techniques to use fewer bits to represent more common sub-strings Main approach Identify the frequent sub-trajectories (FSTs) using a training set Decompose a trajectory into a sequence of FSTs Use Huffman coding to represent the decomposed trajectory

HSC Stage 2: Frequent Sub-trajectory Compression (cont’d) Training Trajectory Set All the sub-trajectories with length   Aho-Corasick Automaton: facilitate trajectory decomposition Trie: capture sub-trajectories and their frequency Huffman tree: code each node in Trie

HSC Stage 2: Frequent Sub-trajectory Compression (cont’d) Aho-Corasick Automaton: facilitate trajectory decomposition Huffman tree: code each node in Trie

BTC: Temporal Compression Temporal info: TSND (Time Synchronized Network Distance): Given a trajectory T and its compressed one T′, TSND measures the maximum difference between the distance object travels via trajectory T and that via trajectory T′ at any time slot with TSND(T, T′) = Maxtx(|Dis(T, tx)−Dis(T′, tx)|). NSTD (Network Synchronized Time Difference) defines the maximum time difference between a trajectory T and its compressed form T′ while traveling any same distance with NSTD(T, T′) = Maxdx (|Tim(T, dx)− Tim(T′, dx)|).

Experiments The experiments are based on real trajectory data from one major taxi company in Singapore. Each taxi has installed GPS, and it reports its locations regularly. In our studies, we use the trajectories reported within January 2011, in total 465,000 trajectories generated by about 15,000 taxis. The original storage cost of this dataset is 13.2GB.

Experiment (cont’d) Compression ratio of HSC (spatial compression algorithm)

Experiment (cont’d) Compression ratio of BTC (temporal compression algorithm)

Experiment (cont’d) Compression ratio of PRESS framework

Experiment (cont’d) Comparison of PRESS and its competitors (note both competitors are not bounded by TSND and NSTD but TSED only) MMTC: Georgios Kellaris, Nikos Pelekis, and Yannis Theodoridis. Map-matched trajectory compression. JSS, 86(6):1566–1579, 2013. Nonmaterial: Hu Cao and Ouri Wolfson. Nonmaterialized motion information in transport networks. In ICDT’05, pages 173–188, 2005. Compression ratio of commercial compressors RAR: 3.78 ZIP: 2.09

Q & A