Lab: Advanced Network Technologies and Services Lab (ANTS Lab). Presenter: 黃福銘 (Angus F.M. Huang). Adaptive Fastest Path Computation on a Road Network: A Traffic Mining Approach. TMSG, 2013.10.02.

Presentation transcript:

Publication
Conference: VLDB '07, September 23-28, 2007, Vienna, Austria
Authors: Hector Gonzalez, Jiawei Han, Xiaolei Li, Margaret Myslinska, John Paul Sondag (Department of Computer Science, University of Illinois at Urbana-Champaign)

Outline: Introduction; Problem Definition; Traffic Database; Road Network Partitioning; Traffic Mining; Pre-computation and Upgrades; Fastest Path Computation; Experimental Evaluation; Conclusions

Introduction
MapQuest, MapPoint, Google Maps
  – Route planning systems
  – MapQuest served 10 billion route queries from 1996 to 2006
Current speed conditions alone are not enough to find the fastest route
  – Road speed limits, average speed, …
Example 1: Importance of driving patterns
  – Local experts consider a multitude of important factors that are difficult to incorporate explicitly into a path-finding algorithm
Example 2: Importance of speed patterns
  – Time of departure, weather conditions, carpool lanes, etc.

Introduction
Solution
  – Traffic-mining-based path-finding method
  – Speed and driving patterns mined from historical traffic data
Technical contributions
  – Road hierarchy-based partitioning
  – Speed rule mining
  – Driving pattern mining
  – Adaptive pre-computation
  – Road upgrading
  – Adaptive fastest path algorithm

Problem Definition
Def.: Road network
  – G(V, E)
Def.: Speed pattern
  – d_i is a value for speed factor D_i
  – m is an aggregate function computed on edge speed
Def.: Driving pattern
  – A sequence s of edges e(1), e(2), …, e(l) that appears more than min_sup times
  – support(s): the number of paths containing the sequence
  – length(s): the number of edges that it contains
Def.: Edge forecast model
  – F(edge_id, t)
  – Returns a tuple (d_1, d_2, …, d_k) with the expected driving conditions for edge edge_id at time t

Road network figure: 18,496 nodes and 24,123 edges, built from TIGER line files; larger roads are shown in bold
Speed factors: time-of-day, D_1 = weather, D_2 = vehicle-type
Forecast function example
  – At 5 pm [time], for highway 74 between Champaign and Normal [edge]: Weather = rain and Construction = no [conditions]

Problem Statement
Given a road network G(V, E), a set of speed patterns S, an edge forecast model F, and a query q ← (s, e, start_time)
Compute a fast route q_r between nodes s and e, starting from s at time start_time, such that q_r contains a large number of frequent driving patterns

Traffic Database
(edge_id, time, speed)
  – Basic traffic observation
(car_id, edge_id, time, speed)
  – Observation collected via radio-frequency tags
(edge_id, start_time, end_time, (d_1, d_2, …, d_k) : m)
  – Each traffic observation augmented with the driving factors
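The augmented record format above can be illustrated with a small sketch. It assumes the measure m is the mean observed speed per edge and factor combination; the `Observation` class, its field units, and `aggregate_speed` are hypothetical names chosen for illustration, not taken from the paper.

```python
from collections import defaultdict
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class Observation:
    """One augmented traffic observation (field names follow the slide)."""
    edge_id: int
    start_time: int   # assumed unit: minutes since midnight
    end_time: int
    factors: tuple    # (d_1, ..., d_k), e.g. ("rain", "car")
    speed: float      # observed speed on the edge


def aggregate_speed(observations):
    """Group observations by (edge_id, factors) and average their speeds.

    Assumption: the aggregate function m is the mean; the source only says
    that m is an aggregate computed on edge speed.
    """
    groups = defaultdict(list)
    for obs in observations:
        groups[(obs.edge_id, obs.factors)].append(obs.speed)
    return {key: mean(speeds) for key, speeds in groups.items()}
```

Keying on the factor tuple keeps one aggregate per driving condition, which is the shape a speed pattern lookup would need later.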

Road Network Partitioning
Road hierarchy
  – Highway, interstate road, multi-lane road, small road, …
Grid-based partitioning works poorly here
The natural partition induced by the road hierarchy itself can be used to divide the network into semantically meaningful areas
  – With well-defined driving and speed patterns
Given a road hierarchy with l levels, we can construct a hierarchy of areas as a tree of depth l-1
  – Road class 1 is the largest, and road class l the smallest

San Joaquin Partitioned Map
Area labels have the form a:b
  – a is the area number when roads of level 1 are used
  – b is the subarea of a when roads of level 2 are used to subdivide a
Note the upper left of the map
  – Quite a few strong connections

Area Partitioning Algorithm
Generate semantically meaningful partitions
  – By using road hierarchy information
Flood-filling technique
  – Identify strongly connected components
  – Among nodes with class(n) > k
It automatically identifies
  – Interior nodes: those with a single area in their area set
  – Border nodes: those with multiple areas in their area set
Complexity: O(n)
  – O(1) per interior node, over n nodes
  – O(|a|) per border node, over a areas
  – O(n × |a|) in the worst case
  – |a| << n
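The flood-filling idea can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: the graph is an undirected adjacency dict, `node_class` gives the class of the largest road touching each node, and the fill expands only through nodes with class(n) > k while labeling (but not crossing) nodes on larger roads. The function name `partition` and the input layout are hypothetical.

```python
from collections import deque


def partition(adjacency, node_class, k):
    """Assign an area set to every node by flood filling.

    adjacency:  dict node -> iterable of neighbouring nodes (assumed undirected)
    node_class: dict node -> road class of the node (1 = largest road)
    k:          roads of class <= k act as area borders
    """
    areas = {node: set() for node in adjacency}
    next_area = 0
    for seed in adjacency:
        # Start a new area only from an unlabeled node on a small road.
        if node_class[seed] <= k or areas[seed]:
            continue
        areas[seed].add(next_area)
        queue = deque([seed])
        while queue:
            node = queue.popleft()
            for neighbour in adjacency[node]:
                if next_area in areas[neighbour]:
                    continue
                areas[neighbour].add(next_area)
                # Border nodes are labeled but the fill does not cross them.
                if node_class[neighbour] > k:
                    queue.append(neighbour)
        next_area += 1
    return areas
```

After the fill, nodes whose area set has one element are interior nodes, and nodes with several elements are border nodes, matching the classification on the slide.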

Traffic Mining
Speed pattern mining
  – Treat the mining as a classification problem: predict edge speed from time and feature values d_1, …, d_k
  – Example rule: "if area = a_1 and weather = icy and time = rush hour then speed = 1/4 × base speed"
Abstraction level, general representation
  – Run a preprocessing step to discretize the speed values, which are then treated as the class label
  – Use decision tree induction to perform rule induction
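To make the discretize-then-classify step concrete, here is a rough sketch. It swaps decision tree induction for a much simpler majority vote per condition, purely to show how discretized speed serves as the class label; the bin edges, label strings, and the function names `discretize` and `induce_rules` are all assumptions for illustration.

```python
from collections import Counter, defaultdict


def discretize(speed, base_speed):
    """Map a raw speed to a coarse class label (bin edges are assumptions)."""
    ratio = speed / base_speed
    if ratio <= 0.25:
        return "1/4 base"
    if ratio <= 0.5:
        return "1/2 base"
    return "base"


def induce_rules(observations):
    """Pick the most common speed class for each (area, weather, time_slot).

    observations: iterable of (area, weather, time_slot, speed, base_speed).
    This majority vote stands in for decision tree induction.
    """
    votes = defaultdict(Counter)
    for area, weather, time_slot, speed, base_speed in observations:
        votes[(area, weather, time_slot)][discretize(speed, base_speed)] += 1
    return {cond: counts.most_common(1)[0][0] for cond, counts in votes.items()}
```

A real decision tree would also generalize across conditions (for example, ignoring weather where it does not matter), which a plain lookup cannot do.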

Traffic Mining
Driving pattern mining
  – Like asking local people for route tips in an unfamiliar area
  – Frequent pattern mining
Minimum support level
  – A uniform minimum support level is difficult to define: it may filter out many important local roads, or keep infrequently traveled high-level roads
  – Use a frequent pattern mining method guided by the area and road hierarchies: frequent edges are mined at different area levels, each with its own support threshold
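A sketch of level-dependent support follows. It counts contiguous edge sequences of a fixed length and applies a threshold chosen by the smallest road in the sequence. That threshold selection rule, the function name `frequent_sequences`, and the input layout are assumptions made for illustration; the source only states that different levels use different supports.

```python
from collections import Counter


def frequent_sequences(paths, length, edge_class, min_sup_by_class):
    """Return contiguous edge sequences whose support meets a per-level threshold.

    paths:            list of paths, each a list of edge ids
    length:           number of edges per candidate sequence
    edge_class:       dict edge id -> road class (1 = largest road)
    min_sup_by_class: dict road class -> minimum support (assumed per level)
    """
    support = Counter()
    for path in paths:
        # A set, so each path contributes at most once to a sequence's support.
        seen = {tuple(path[i:i + length]) for i in range(len(path) - length + 1)}
        support.update(seen)

    result = {}
    for seq, count in support.items():
        # Assumption: the smallest road in the sequence decides the threshold.
        level = max(edge_class[edge] for edge in seq)
        if count >= min_sup_by_class[level]:
            result[seq] = count
    return result
```

Giving small roads a lower threshold lets locally important shortcuts survive, while rarely used sequences on large roads can still be filtered.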

Pre-computation and Upgrades
Goal: improve performance in terms of both run time and path accuracy
Area-level pre-computation
  – A*, Floyd-Warshall, …
  – When edge speed is a function of factors, the fastest path between two nodes may differ across times and conditions
  – Two checks determine whether pre-computing pays off: how many fastest path queries will go through nodes of the pre-computed path, and how stable the path is
  – Compute certain fastest paths only among the nodes inside the area
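As a sketch of area-level pre-computation, the following runs Floyd-Warshall restricted to the nodes of a single area. It assumes fixed travel times for one set of driving conditions, so a real system would repeat it per condition of interest; `precompute_area` and the `travel_time` dict layout are hypothetical.

```python
import math


def precompute_area(nodes, travel_time):
    """All-pairs fastest travel times among the nodes of one area.

    nodes:       list of node ids inside the area
    travel_time: dict (u, v) -> travel time of the directed edge u -> v,
                 assumed fixed for one set of driving conditions
    """
    dist = {
        u: {v: 0.0 if u == v else travel_time.get((u, v), math.inf) for v in nodes}
        for u in nodes
    }
    # Standard Floyd-Warshall relaxation over intermediate nodes.
    for mid in nodes:
        for src in nodes:
            for dst in nodes:
                via = dist[src][mid] + dist[mid][dst]
                if via < dist[src][dst]:
                    dist[src][dst] = via
    return dist
```

Restricting the node list to one area keeps the cubic cost tied to the area size rather than the size of the whole network.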

Pre-computation and Upgrades
Assumption: drivers take the largest road available to reach their destination
Exception: a small road may be faster than a large road
Small road upgrades
  – Apply when, under some driving conditions, small roads have a significantly higher speed
  – Upgrade such internal edges to the upper level
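A small sketch of the upgrade rule: an interior edge is promoted one level when it is fast enough relative to the border roads. The default ratio of 0.8 follows the 80% figure quoted in the Upgraded Paths slide; promoting by exactly one level, the function name `upgrade_edges`, and the input layout are assumptions for illustration.

```python
def upgrade_edges(edge_class, edge_speed, interior_edges, border_speed, ratio=0.8):
    """Return a copy of edge_class with fast interior edges promoted one level.

    edge_class:     dict edge id -> road class (1 = largest road)
    edge_speed:     dict edge id -> speed under the conditions being considered
    interior_edges: edge ids lying inside the area
    border_speed:   speed on the area's border roads
    ratio:          an interior edge qualifies at ratio * border_speed or above
    """
    upgraded = dict(edge_class)
    for edge in interior_edges:
        fast_enough = edge_speed[edge] >= ratio * border_speed
        if fast_enough and upgraded[edge] > 1:
            # Class 1 is the largest road, so a smaller number is a higher level.
            upgraded[edge] -= 1
    return upgraded
```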

Fastest Path Computation
Properties of the (approximate) fastest routes
  – Well supported by historical driver behavior
  – Larger roads first, significant smaller roads second
  – Account for all relevant factors affecting driving speed
Before computation
  – The road network is partitioned
  – Speed patterns are mined: get_edge_speed(edge_id, t, (d_1, …, d_k))
  – Driving patterns are mined: is_frequent(edge_seq, t, (d_1, …, d_n))
  – Area-level paths are pre-computed
  – Internal roads are upgraded: get_edge_class(edge_id, t, (d_1, …, d_k))

Fastest Path Algorithm
It is a variation of A*
Algorithm strategies
  1. Priority queue of expanded paths
  2. Pick the frequent node with the lowest g(n) + h(n), where g(n) is the current travel time cost and h(n) is the expected travel time cost
  3. Ascending search to find the bigger road
  4. Descending search to find the smaller road
  5. Simple estimation policy: h(n) = distance(n, end) / max_speed
  6. Online path re-computation
Lemma
  – When computing a path between (start, end) nodes in areas a_i and a_j respectively, the adaptive fastest path algorithm considers at most O(|a_i| + |a_j| + |bn| + |un|) distinct nodes
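The core of strategies 1, 2, and 5 can be sketched as a plain time-dependent A*. This is a simplified illustration, not the adaptive algorithm itself: it omits the frequent-pattern check, the ascending and descending hierarchy search, and online re-computation. The graph layout, the `coords` dict, and the two-argument `get_edge_speed(edge_id, t)` callback are assumptions (the slide's version also takes the factor tuple). The heuristic stays admissible only if `max_speed` bounds every edge speed and edge lengths are at least the straight-line distance.

```python
import heapq
import math


def fastest_path(graph, coords, start, end, start_time, get_edge_speed, max_speed):
    """Time-dependent A* returning (travel_time, path).

    graph:          dict node -> list of (neighbour, edge_id, length)
    coords:         dict node -> (x, y), used for the straight-line estimate
    get_edge_speed: callable (edge_id, t) -> speed at time t (hypothetical,
                    simplified from the slide's three-argument version)
    """
    def h(node):
        # Strategy 5: h(n) = distance(n, end) / max_speed
        return math.dist(coords[node], coords[end]) / max_speed

    # Strategy 1: priority queue of partial paths ordered by g(n) + h(n).
    frontier = [(h(start), 0.0, start, [start])]
    best = {start: 0.0}
    while frontier:
        _, g, node, path = heapq.heappop(frontier)
        if node == end:
            return g, path
        if g > best.get(node, math.inf):
            continue  # a faster way to this node was already found
        for neighbour, edge_id, length in graph.get(node, []):
            # Edge speed depends on the time at which the edge is entered.
            new_g = g + length / get_edge_speed(edge_id, start_time + g)
            if new_g < best.get(neighbour, math.inf):
                best[neighbour] = new_g
                heapq.heappush(
                    frontier, (new_g + h(neighbour), new_g, neighbour, path + [neighbour])
                )
    return math.inf, []
```

Because edge speed is evaluated at the arrival time, the same query can return different routes for different departure times, which is the behavior the speed patterns are meant to capture.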

Experimental Evaluation
Comparisons
  – A*: basic A*
  – Hier: the algorithm without area pre-computation
  – Adapt: the proposed algorithm
Data synthesis
  – San Joaquin, 18,496 nodes, 24,123 edges
  – San Francisco Bay area, 175,343 nodes, 223,606 edges
  – Illinois, 831,524 nodes, 1,048,080 edges
Traffic simulator
  – Network-based Generator of Moving Objects by Thomas Brinkhoff
  – Rush hour: 10,000 objects; non-rush: 1,000 objects
  – A weather factor is included to slow down speeds
  – Two vehicle classes: cars with faster speeds, trucks with slower speeds
Simulation output was a list of edge observations
  – The speed patterns for each edge are then mined from it

Network-based Generator of Moving Objects, Thomas Brinkhoff, Institut für Angewandte Photogrammetrie und Geoinformatik (IAPG)

Query Length
We varied the average distance between the starting and ending nodes
  – The longer the distance, the larger the search space
  – Distance is expressed as a percentage of the map diameter
  – 20% upgraded roads
  – Fastest paths pre-computed in 30% of the lowest-level areas
Figure 5: Adapt expands only slightly more nodes than Hier
Figure 6: Adapt's path is as good as A*'s fastest path, in both efficiency and accuracy
Figure 7: the same pattern as in the expanded nodes

Upgraded Paths
Vary the percentage of lowest-level areas that contain a path that is faster than the border paths and thus needs to be upgraded
Figures 8 and 10: A* and Hier are significantly affected (???)
Figures 8 and 10: Adapt slows down as more edges are upgraded, but only gradually
Figure 9: when no edges are upgraded, Hier and Adapt perform equally; as the number of upgraded edges increases, Adapt starts closing the gap with A*
We can use a fairly aggressive edge upgrading strategy to improve path quality without incurring any significant performance penalty
  – Upgrade interior edges as long as they are 80% as fast as border edges

Area Pre-computation
Examine the performance gain for different levels of pre-computation
  – Adapt vs. Adapt_nopre, the same algorithm but without using pre-computed areas
  – Select a percentage of the lowest-level areas, from 0% to 100%, in which to pre-compute fastest paths
The performance improvement is very significant
Had higher-level areas been used, the improvement would have been more noticeable

Road Network Size
Compare query processing efficiency for 3 road network sizes
  – sj, sf, and il, ordered by number of nodes and edges: sj < sf < il
Adapt has excellent scalability in terms of road network size
  – The number of nodes usually grows much more slowly than the number of small roads

Conclusion
We developed an adaptive fastest path algorithm that bases routing decisions on driving and speed patterns mined from historical data.
The partitioning algorithm yields very natural partitions: larger areas are observed in regions with low road density, and much finer areas are observed in dense regions such as big cities.

Angus Comments
If the road hierarchy is poorly planned, would this approach still perform well?
The paper also does not solve the problem of sparse historical data.
The power of Number & Trajectory
