Mining Interesting Locations and Travel Sequences from GPS Trajectories IDB & IDS Lab. Seminar Summer 2009 강 민 석강 민 석 July 23 rd,

Slides:



Advertisements
Similar presentations
15th CTI Workshop, July 26, Smart Itinerary Recommendation based on User-Generated GPS Trajectories Hyoseok Yoon 1, Y. Zheng 2, X. Xie 2 and W.
Advertisements

Vincent W. Zheng, Yu Zheng, Xing Xie, Qiang Yang Hong Kong University of Science and Technology Microsoft Research Asia This work was done when Vincent.
Location-Based Social Networks Yu Zheng and Xing Xie Microsoft Research Asia Chapter 8 and 9 of the book Computing with Spatial Trajectories.
University of Minnesota Location-based & Preference-Aware Recommendation Using Sparse Geo-Social Networking Data Location-based & Preference-Aware Recommendation.
Mining User Similarity Based on Location History Yu Zheng, Quannan Li, Xing Xie Microsoft Research Asia.
An Interactive-Voting Based Map Matching Algorithm
Urban Computing with Taxicabs
Mining di Dati Web Web Community Mining and Web log Mining : Commody Cluster based execution Romeo Zitarosa.
Experiments on Query Expansion for Internet Yellow Page Services Using Log Mining Summarized by Dongmin Shin Presented by Dongmin Shin User Log Analysis.
Learning Location Correlation From GPS Trajectories Yu Zheng Microsoft Research Asia March 16, 2010.
Detecting Nearly Duplicated Records in Location Datasets Microsoft Research Asia Search Technology Center Yu Zheng Xing Xie, Shuang Peng, James Fu.
Constructing Popular Routes from Uncertain Trajectories Authors of Paper: Ling-Yin Wei (National Chiao Tung University, Hsinchu) Yu Zheng (Microsoft Research.
Constructing Popular Routes from Uncertain Trajectories Ling-Yin Wei 1, Yu Zheng 2, Wen-Chih Peng 1 1 National Chiao Tung University, Taiwan 2 Microsoft.
George Lee User Context-based Service Control Group
6/2/ An Automatic Personalized Context- Aware Event Notification System for Mobile Users George Lee User Context-based Service Control Group Network.
T-Drive : Driving Directions Based on Taxi Trajectories Microsoft Research Asia University of North Texas Jing Yuan, Yu Zheng, Chengyang Zhang, Xing Xie,
Yu Zheng, Lizhu Zhang, Xing Xie, Wei-Ying Ma Microsoft Research Asia
A reactive location-based service for geo-referenced individual data collection and analysis Xiujun Ma Department of Machine Intelligence, Peking University.
Trajectories Simplification Method for Location-Based Social Networking Services Presenter: Yu Zheng on behalf of Yukun Cheng, Kai Jiang, Xing Xie Microsoft.
LinkSelector: A Web Mining Approach to Hyperlink Selection for Web Portals Xiao Fang University of Arizona 10/18/2002.
Mining Interesting Locations and Travel Sequences from GPS Trajectories defense by Alok Rakkhit.
Affinity Rank Yi Liu, Benyu Zhang, Zheng Chen MSRA.
Vincent W. Zheng †, Bin Cao †, Yu Zheng ‡, Xing Xie ‡, Qiang Yang † † Hong Kong University of Science and Technology ‡ Microsoft Research Asia This work.
Learning Transportation Mode from Raw GPS Data for Geographic Applications on the Web Yu Zheng, Like Liu, Xing Xie Microsoft Research.
Mining Interesting Locations and Travel Sequences From GPS Trajectories Yu Zheng and Xing Xie Microsoft Research Asia March 16, 2009.
Temporal Event Map Construction For Event Search Qing Li Department of Computer Science City University of Hong Kong.
Research Meeting Seungseok Kang Center for E-Business Technology Seoul National University Seoul, Korea.
«Tag-based Social Interest Discovery» Proceedings of the 17th International World Wide Web Conference (WWW2008) Xin Li, Lei Guo, Yihong Zhao Yahoo! Inc.,
Friends and Locations Recommendation with the use of LBSN
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
Preventing Denial-of-request Inference Attacks in Location- sharing Services Kazuhiro Minami Institute of Statistical Mathematics ICMU 2014.
Social scope: Enabling Information Discovery On Social Content Sites
Generating Intelligent Links to Web Pages by Mining Access Patterns of Individuals and the Community Benjamin Lambert Omid Fatemieh CS598CXZ Spring 2005.
Wen He Tsinhua University, Beijing, China and Xi'an Communication Institute, Xi'an, China Deyi Li Tsinhua University, Beijing, China and Chinese.
Recommendation system MOPSI project KAROL WAGA
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Presented by: Apeksha Khabia Guided by: Dr. M. B. Chandak
Classical Music for Rock Fans?: Novel Recommendations for Expanding User Interests Makoto Nakatsuji, Yasuhiro Fujiwara, Akimichi Tanaka, Toshio Uchiyama,
Web Intelligence Web Communities and Dissemination of Information and Culture on the www.
Wang-Chien Lee i Pervasive Data Access ( i PDA) Group Pennsylvania State University Mining Social Network Big Data Intelligent.
Center for E-Business Technology Seoul National University Seoul, Korea BrowseRank: letting the web users vote for page importance Yuting Liu, Bin Gao,
Friends and Locations Recommendation with the use of LBSN By EKUNDAYO OLUFEMI ADEOLA
Collaborative Filtering versus Personal Log based Filtering: Experimental Comparison for Hotel Room Selection Ryosuke Saga and Hiroshi Tsuji Osaka Prefecture.
User Behavior Analysis of Location Aware Search Engine Third international Conference of MDM, 2002 Takahiko Shintani, Iko Pramudiono NTT Information Sharing.
Improving Web Search Results Using Affinity Graph Benyu Zhang, Hua Li, Yi Liu, Lei Ji, Wensi Xi, Weiguo Fan, Zheng Chen, Wei-Ying Ma Microsoft Research.
Center for E-Business Technology Seoul National University Seoul, Korea Social Ranking: Uncovering Relevant Content Using Tag-based Recommender Systems.
Algorithmic Detection of Semantic Similarity WWW 2005.
Jiafeng Guo(ICT) Xueqi Cheng(ICT) Hua-Wei Shen(ICT) Gu Xu (MSRA) Speaker: Rui-Rui Li Supervisor: Prof. Ben Kao.
Finding Experts Using Social Network Analysis 2007 IEEE/WIC/ACM International Conference on Web Intelligence Yupeng Fu, Rongjing Xiang, Yong Wang, Min.
Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.
Intelligent DataBase System Lab, NCKU, Taiwan Josh Jia-Ching Ying 1, Wang-Chien Lee 2, Tz-Chiao Weng 1 and Vincent S. Tseng 1 1 Department of Computer.
Mining Trajectory Profiles for Discovering User Communities Speaker : Chih-Wen Chang National Chiao Tung University, Taiwan Chih-Chieh Hung,
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
Trajectory Data Mining Dr. Yu Zheng Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Editor-in-Chief of ACM Trans.
Trajectory Data Mining Dr. Yu Zheng Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Editor-in-Chief of ACM Trans.
Trajectory Data Mining Dr. Yu Zheng Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Editor-in-Chief of ACM Trans.
1 Jong Hee Kang, William Welbourne, Benjamin Stewart, Gaetano Borriello, October 2004, Proceedings of the 2nd ACM international workshop on Wireless mobile.
Intelligent Database Systems Lab Presenter: CHANG, SHIH-JIE Authors: Longzhuang Li, Yi Shang, Wei Zhang 2002.ACM. Improvement of HITS-based Algorithms.
Context-Aware Query Classification Huanhuan Cao, Derek Hao Hu, Dou Shen, Daxin Jiang, Jian-Tao Sun, Enhong Chen, Qiang Yang Microsoft Research Asia SIGIR.
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.
Semantic Web in Context Broker Architecture Presented by Harry Chen, Tim Finin, Anupan Joshi At PerCom ‘04 Summarized by Sungchan Park
黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Exploring Spatial-Temporal Trajectory Model for Location.
Location-based Social Networks 6/11/20161 CENG 770.
TRACE ANALYSIS AND MINING FOR SMART CITIES By G. Pan Zhejiang Univ., Hangzhou, China G. Qi ; W. Zhang ; S. Li ; Z. Wu ; L. T. Yang.
Diversified Trajectory Pattern Ranking in Geo-Tagged Social Media
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Location Recommendation — for Out-of-Town Users in Location-Based Social Network Yina Meng.
Presentation transcript:

Mining Interesting Locations and Travel Sequences from GPS Trajectories IDB & IDS Lab. Seminar Summer 2009 강 민 석강 민 석 July 23 rd, 2009 Yu Zheng, Lizhu Zhang, Xing Xie, Wei-Ying Ma WWW 2009 Center for E-Business Technology Seoul National University Seoul, Korea Microsoft Research Asia Intelligent Database Systems Lab.

Copyright  2009 by CEBT Abstract  Mining Interesting Locations and Travel Sequences from GPS Trajectories GPS log : record users’ outdoor movements with GPS By mining multiple users’ location histories, discover interesting locations and travel sequences in a given region  Problem How to model multiple users’ location history from GPS log How to infer the interest level of a location Location interest not only depend on the number of visiting, but also users’ travel experiences. How to detect classical sequences in a given region 2 timestampLatitudelongitude :30:00N 33º 30’ 19.5”E 126º 29’ 35.3” :30:30N 33º 30’ 19.4”E 126º 29’ 35.2” :31:00N 33º 30’ 19.2”E 126º 29’ 35.3” :31:30N 33º 30’ 19.1”E 126º 29’ 35.3” :32:00N 33º 30’ 19.1”E 126º 29’ 35.4” timestampLatitudelongitude :30:00N 33º 30’ 19.5”E 126º 29’ 35.3” :30:30N 33º 30’ 19.4”E 126º 29’ 35.2” :31:00N 33º 30’ 19.2”E 126º 29’ 35.3” :31:30N 33º 30’ 19.1”E 126º 29’ 35.3” :32:00N 33º 30’ 19.1”E 126º 29’ 35.4” timestampLatitudelongitude :30:00N 33º 30’ 19.5”E 126º 29’ 35.3” :30:30N 33º 30’ 19.4”E 126º 29’ 35.2” :31:00N 33º 30’ 19.2”E 126º 29’ 35.3” :31:30N 33º 30’ 19.1”E 126º 29’ 35.3” :32:00N 33º 30’ 19.1”E 126º 29’ 35.4”

Contents  Introduction  Modeling Location History  Location Interest Inference  Experiments  Related Work  Conclusions 3

Copyright  2009 by CEBT Introduction  GPS log Recently, many users record their outdoor movements with GPS. Travel experience sharing, Life Logging, Sports activity GPS devices are changing the way people interact with the Web by using locations as contexts. 4

Copyright  2009 by CEBT Introduction  GPS log Let’s look at my GPS Trajectories! 5

removed some photos for privacy 6

Copyright  2009 by CEBT 7

Introduction  Architecture System comprises of three parts Location history modeling, location interest & sequence mining, recommendation 8 Tree-Based Hierarchical Graph HITS-Based Inference Model User Travel Experience Location Interest Location History Modeling Location Interest and Sequence Mining Recommendation Modeling Location History GPS Logs Experienced Users Interesting Locations Travel Sequences Mining Travel Sequences Location Recommender

Contents  Introduction  Modeling Location History GPS Trajectory & Stay Point Location History Tree-Based Hierarchical Graph (TBHG)  Location Interest Inference  Experiments  Related Work  Conclusions 9

Copyright  2009 by CEBT Modeling Location History  GPS Trajectory GPS point : contain (timestamp, latitude, longitude) GPS log : a collection of GPS points GPS trajectory : sequentially connect GPS points  Stay Point geographic region where a user stayed over a certain period time interval Time threshold T : stay over T (e.g. 20 min) Distance threshold D : distance between two points is less than D (e.g. 200 m) 10 timestampLatitudelongitude :30:00N 33º 30’ 19.5”E 126º 29’ 35.3” :30:30N 33º 30’ 19.4”E 126º 29’ 35.2” :31:00N 33º 30’ 19.2”E 126º 29’ 35.3” :31:30N 33º 30’ 19.1”E 126º 29’ 35.3” :32:00N 33º 30’ 19.1”E 126º 29’ 35.4” :32:30N 33º 30’ 19.1”E 126º 29’ 35.4” :33:00N 33º 30’ 19.2”E 126º 29’ 35.4”

Copyright  2009 by CEBT Modeling Location History  Location History represented as a sequence of stay points with corresponding arrival and leaving times 11 S1 S2 S3 S4 S5 S6 S7 Home Supermarket Company Restaurant S8 S9 S1 0

Copyright  2009 by CEBT Modeling Location History  Model multiple users’ location histories Location history of various people are inconsistent and incomparable stay points of different individuals are not identical  Considering the scale of location 12 A B S1 S2 S3 S4 S5 S6 S7 Home Supermarket Company Restaurant S8 S9 S1 0 C1 C2 C3 C4

Copyright  2009 by CEBT Modeling Location History  Tree-Based Hierarchy Build a tree using a hierarchical clustering algorithm Density-based clustering algorithm OPTICS (Ordering Points to Identify the Clustering Structure) Hierarchically cluster stay points into some geospatial regions Different levels denote different geospatial granularity 13

Copyright  2009 by CEBT Modeling Location History  Tree-Based Hierarchical Graph (TBHG) 1.Formulate a Tree-based Hierarchy Hierarchically cluster stay points 2.Build Graphs on each Level Link is generated when consecutive stay points are contained in two clusters 14

Copyright  2009 by CEBT Modeling Location History  Tree-Based Hierarchical Graph (TBHG) location history can be represented by a sequence of stay point clusters with transition time between two clusters on different geospatial scales 15 S1 S2 S3 S4 S5 S6 S7 Home Supermarket Company Restaurant S8 S9 S1 0 C1 C2 C3 C4 S1 S2 S3 S4 S5 S6 S7 S8 S9 S1 0 A B

Contents  Introduction  Modeling Location History  Location Interest Inference HITS-Based Inference Model Mining Classical Travel Sequences  Experiments  Related Work  Conclusions 16

Copyright  2009 by CEBT Location Interest Inference  HITS (Hypertext Induced Topic Search) search query dependent ranking algorithm for Web IR produce two rankings Hub : web page with many out-links Authority : web page with many in-links Hub and Authority have a mutual reinforcement relationship 17

Copyright  2009 by CEBT Location Interest Inference  HITS-Based Inference Model regard an user’s visit to a location as an implicitly directed link from the user to that location Hub and Authority Hub : a user who has accessed many places → users’ travel experiences Authority : a location which has been visited by many users → location interest mutual reinforcement relationship Users’ travel experiences (hub scores) & interest of locations (authority scores) 18

Copyright  2009 by CEBT Location Interest Inference  Data Selection Strategy Motivation User’s travel experience is region-related. need to specify a geospatial region before conducting HITS-based inference Strategy calculate scores using regions specified by their ascendant clusters can have multiple authority and hub scores based on the different region scales 19

Copyright  2009 by CEBT Location Interest Inference  Inference Build adjacent matrix between users and locations mutual reinforcement relationship of user travel experience and location interest Iterative process for generating the final results Calculate authority and hub scores using the power iteration method 20

Copyright  2009 by CEBT Mining Classical Travel Sequences  calculate Score for each Location Sequence the Travel Experiences of Users taking this sequence Hub scores of the user the Interests of the Locations contained in the sequence Authority scores of the locations in this sequence 21 5 users have taken A→C We know each user’s hub score. What is the classical score of sequence A→C→D TBHG We know location C’s authority score.

Copyright  2009 by CEBT Mining Classical Travel Sequences  calculate Score for each Location Sequence the Travel Experiences of Users taking this sequence Hub scores of the user the Interests of the Locations contained in the sequence Authority scores of the locations in this sequence Authority scores are weighted based on the probability to take sequence 22 What is the classical score of sequence A→C→D Authority score of location A Hub score of Users Probability of moving out from A to this sequence

Contents  Introduction  Modeling Location History  Location Interest Inference  Experiments  Related Work  Conclusions 23

Copyright  2009 by CEBT Experimental Settings  GPS Data GPS devices to collect data Users 107 users record their outdoor movements get payments based on the distance of GPS log Data mostly in China, some in the USA, Korea, Japan 1 year (from May 2007 to Oct. 2008) 5 million GPS points (166,372 km)  Parameter Stay Point extracted 10,354 stay points Clustering 159 clusters (4 th level TBHG) 24

Copyright  2009 by CEBT Evaluation Approaches  Evaluation Explore effectiveness of location & travel recommendation by a user study 29 subjects who have been in Beijing for more that 6 years  Two Aspects of Evaluation Presentation the ability of the retrieved interesting locations in presenting a given region Representative, Comprehensive, Novelty Rank The ranking performance of the retrieved locations based on relative interests User Desirability Rating on each location & each sequence employ two criteria – nDCG and MAP  Baseline Interesting Locations rank-by-count, rank-by-frequency Classical Travel Sequences rank-by-count, rank-by-interests, rank-by-experience 25

Copyright  2009 by CEBT Experimental Results  Results outperformed baseline approaches  Investigations Advantages of the hierarchy of the TBHG Help users understand the region step-by-step (level-by-level) can be used to specify users’ travel experiences in different regions 26

Contents  Introduction  Modeling Location History  Location Interest Inference  Experiments  Related Work Mining Location History Location Recommenders  Conclusions 27

Copyright  2009 by CEBT Related Work  Mining Location History Individual location history Detect significant locations of a user Predict user’s movement Recognize user-specific activities at each location Multiple users’ location history Mining similar sequences Predict where a driver may be going Recognize the social pattern in daily user activity 28

Copyright  2009 by CEBT Related Work  Location Recommenders Recommenders based on real-time location Mobile Tourist Guide System Recommenders based on location history More Personalized recommendation using location history Recommend geographic locations like shops or restaurants Enhance collaborative filtering solution 29

Contents  Introduction  Modeling Location History  Location Interest Inference  Experiments  Related Work  Conclusions 30

Copyright  2009 by CEBT Conclusion  Mining Interesting Locations and Travel Sequences from GPS propose a tree-based hierarchical graph (TBHG), which can model multiple users’ location history propose a HITS-based model to infer users’ travel experiences and interest of a location within a region consider users’ travel experiences and location interests, and mine travel sequences evaluate methodology using large GPS dataset 31 Tree-Based Hierarchical Graph HITS-Based Inference Model User Travel Experience Location Interest Location History ModelingLocation Interest and Sequence Mining Recommendation Modeling Location History GPS Logs Experienced Users Interesting Locations Travel Sequences Mining Travel Sequences Location Recommender

Copyright  2009 by CEBT Conclusion  Implications Help understand the correlation between users and locations Enable location and travel recommendation Step towards enhancing mobile Web from multiple users’ location histories Improve location-based services by integrating social networking into mobile Web  GeoLife project Building social networks using human location history a location-based social-networking service on Microsoft Virtual Earth. enables users to share life experiences and build connections among each other using human location history. 32

Copyright  2009 by CEBT Discussion  Discussion about this paper (talked with Sungchan) Modeling Location History Stay point detection is simple and easy to apply Hierarchy model is appropriate to zoom in/out map HITS-based Location Interest Inference Pretty Reasonable : consider user’s travel experience is better than rank-by-count But, try another way to find location interest and user travel experience Travel Sequence too naïve for calculating sequence score  Motivation Context-aware Service Time + Location 33

Copyright  2009 by CEBT References  This Slide Some Images from GeoLife : Building social networks using human location history, Microsoft Research Y. Zheng, Mining Individual Life Pattern Based on Location History: A Paradigm and Framework, Slide, 2009 References [5], [7], [14], [18]  GeoLife Project Paper Yu Zheng and Xing Xie, Mining Individual Life Pattern Based on Location History, IEEE, 2009 Yu Zheng, Xing Xie, and Wei-Ying Ma, GeoLife2.0: A Location-Based Social Networking Service, IEEE, 2009 Yu Zheng, Xing Xie, and Wei-Ying Ma, Mining Interesting Locations and Travel Sequences From GPS Trajectories, ACM, 2009 Quannan Li, Yu Zheng, Xing Xie, and Wei-Ying Ma, Mining user similarity based on location history, ACM, 2008 Yu Zheng, Xing Xie, and Wei-Ying Ma, Understanding mobility based on GPS data, ACM, 2008 Yu Zheng and Xing Xie, Learning Transportation Mode from Raw GPS Data for Geographic Application on the Web, ACM,

35 Clustering the Tagged Web  Thank you~