Mining User Similarity Based on Location History Yu Zheng, Quannan Li, Xing Xie Microsoft Research Asia.

Slides:



Advertisements
Similar presentations
15th CTI Workshop, July 26, Smart Itinerary Recommendation based on User-Generated GPS Trajectories Hyoseok Yoon 1, Y. Zheng 2, X. Xie 2 and W.
Advertisements

Image Retrieval With Relevant Feedback Hayati Cam & Ozge Cavus IMAGE RETRIEVAL WITH RELEVANCE FEEDBACK Hayati CAM Ozge CAVUS.
Vincent W. Zheng, Yu Zheng, Xing Xie, Qiang Yang Hong Kong University of Science and Technology Microsoft Research Asia This work was done when Vincent.
Location-Based Social Networks Yu Zheng and Xing Xie Microsoft Research Asia Chapter 8 and 9 of the book Computing with Spatial Trajectories.
University of Minnesota Location-based & Preference-Aware Recommendation Using Sparse Geo-Social Networking Data Location-based & Preference-Aware Recommendation.
An Interactive-Voting Based Map Matching Algorithm
Driving with Knowledge from the Physical World Jing Yuan, Yu Zheng Microsoft Research Asia.
Collaborative QoS Prediction in Cloud Computing Department of Computer Science & Engineering The Chinese University of Hong Kong Hong Kong, China Rocky.
Towards Twitter Context Summarization with User Influence Models Yi Chang et al. WSDM 2013 Hyewon Lim 21 June 2013.
A Machine Learning Approach for Improved BM25 Retrieval
Dong Liu Xian-Sheng Hua Linjun Yang Meng Weng Hong-Jian Zhang.
Learning Location Correlation From GPS Trajectories Yu Zheng Microsoft Research Asia March 16, 2010.
Retrieving k-Nearest Neighboring Trajectories by a Set of Point Locations Lu-An Tang, Yu Zheng, Xing Xie, Jing Yuan, Xiao Yu, Jiawei Han University of.
Detecting Nearly Duplicated Records in Location Datasets Microsoft Research Asia Search Technology Center Yu Zheng Xing Xie, Shuang Peng, James Fu.
Geographical and Temporal Similarity Measurement in Location-based Social Networks Chongqing University of Posts and Telecommunications KTH – Royal Institute.
Constructing Popular Routes from Uncertain Trajectories Authors of Paper: Ling-Yin Wei (National Chiao Tung University, Hsinchu) Yu Zheng (Microsoft Research.
Constructing Popular Routes from Uncertain Trajectories Ling-Yin Wei 1, Yu Zheng 2, Wen-Chih Peng 1 1 National Chiao Tung University, Taiwan 2 Microsoft.
Critical Analysis Presentation: T-Drive: Driving Directions based on Taxi Trajectories Authors of Paper: Jing Yuan, Yu Zheng, Chengyang Zhang, Weilei Xie,
T-Drive : Driving Directions Based on Taxi Trajectories Microsoft Research Asia University of North Texas Jing Yuan, Yu Zheng, Chengyang Zhang, Xing Xie,
Yu Zheng, Lizhu Zhang, Xing Xie, Wei-Ying Ma Microsoft Research Asia
Trajectories Simplification Method for Location-Based Social Networking Services Presenter: Yu Zheng on behalf of Yukun Cheng, Kai Jiang, Xing Xie Microsoft.
Mining Interesting Locations and Travel Sequences from GPS Trajectories defense by Alok Rakkhit.
Vincent W. Zheng †, Bin Cao †, Yu Zheng ‡, Xing Xie ‡, Qiang Yang † † Hong Kong University of Science and Technology ‡ Microsoft Research Asia This work.
Learning Transportation Mode from Raw GPS Data for Geographic Applications on the Web Yu Zheng, Like Liu, Xing Xie Microsoft Research.
Understanding User Mobility Based on GPS Data Yu Zheng Microsoft Research Asia.
Exploration of Ground Truth from Raw GPS Data National University of Defense Technology & Hong Kong University of Science and Technology Exploration of.
Mining Interesting Locations and Travel Sequences From GPS Trajectories Yu Zheng and Xing Xie Microsoft Research Asia March 16, 2009.
Friends and Locations Recommendation with the use of LBSN
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
Bei Pan (Penny), University of Southern California
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
Mining Interesting Locations and Travel Sequences from GPS Trajectories IDB & IDS Lab. Seminar Summer 2009 강 민 석강 민 석 July 23 rd,
Wen He Tsinhua University, Beijing, China and Xi'an Communication Institute, Xi'an, China Deyi Li Tsinhua University, Beijing, China and Chinese.
Beyond Co-occurrence: Discovering and Visualizing Tag Relationships from Geo-spatial and Temporal Similarities Date : 2012/8/6 Resource : WSDM’12 Advisor.
Streaming Predictions of User Behavior in Real- Time Ethan DereszynskiEthan Dereszynski (Webtrends) Eric ButlerEric Butler (Cedexis) OSCON 2014.
WALKING IN FACEBOOK: A CASE STUDY OF UNBIASED SAMPLING OF OSNS junction.
Intelligent DataBase System Lab, NCKU, Taiwan Josh Jia-Ching Ying 1, Eric Hsueh-Chan Lu 2 and Vincent S. Tseng 1 1 Institute of Computer Science and Information.
QUANNAN LI 1,2, YU ZHENG 2, XING XIE 2, YUKUN CHEN 2, WENYU LIU 1, WEI-YING MA 2 1 DEPT. ELECTRONICS AND INFORMATION ENGINEERING, HUAZHONG UNIVERSITY OF.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Wang-Chien Lee i Pervasive Data Access ( i PDA) Group Pennsylvania State University Mining Social Network Big Data Intelligent.
Google News Personalization: Scalable Online Collaborative Filtering
INTERACTIVE ANALYSIS OF COMPUTER CRIMES PRESENTED FOR CS-689 ON 10/12/2000 BY NAGAKALYANA ESKALA.
Developing Trust Networks based on User Tagging Information for Recommendation Making Touhid Bhuiyan et al. WISE May 2012 SNU IDB Lab. Hyunwoo Kim.
Friends and Locations Recommendation with the use of LBSN By EKUNDAYO OLUFEMI ADEOLA
Learning Geographical Preferences for Point-of-Interest Recommendation Author(s): Bin Liu Yanjie Fu, Zijun Yao, Hui Xiong [KDD-2013]
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Improving Web Search Results Using Affinity Graph Benyu Zhang, Hua Li, Yi Liu, Lei Ji, Wensi Xi, Weiguo Fan, Zheng Chen, Wei-Ying Ma Microsoft Research.
Jiafeng Guo(ICT) Xueqi Cheng(ICT) Hua-Wei Shen(ICT) Gu Xu (MSRA) Speaker: Rui-Rui Li Supervisor: Prof. Ben Kao.
Network Community Behavior to Infer Human Activities.
Intelligent DataBase System Lab, NCKU, Taiwan Josh Jia-Ching Ying, Eric Hsueh-Chan Lu, Wen-Ning Kuo and Vincent S. Tseng Institute of Computer Science.
Intelligent DataBase System Lab, NCKU, Taiwan Josh Jia-Ching Ying 1, Wang-Chien Lee 2, Tz-Chiao Weng 1 and Vincent S. Tseng 1 1 Department of Computer.
Mining Trajectory Profiles for Discovering User Communities Speaker : Chih-Wen Chang National Chiao Tung University, Taiwan Chih-Chieh Hung,
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
Trajectory Data Mining Dr. Yu Zheng Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Editor-in-Chief of ACM Trans.
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
A Framework to Predict the Quality of Answers with Non-Textual Features Jiwoon Jeon, W. Bruce Croft(University of Massachusetts-Amherst) Joon Ho Lee (Soongsil.
1 Random Walks on the Click Graph Nick Craswell and Martin Szummer Microsoft Research Cambridge SIGIR 2007.
Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.
黃福銘 (Angus F.M. Huang) ANTS Lab, IIS, Academia Sinica Exploring Spatial-Temporal Trajectory Model for Location.
Location-based Social Networks 6/11/20161 CENG 770.
A Recommender System based on Tag and Time Information for Social Tagging Systems Nan Zheng and Qiudan Li (Chinese Academy of Sciences) Expert Systems.
A Flexible Spatio-temporal indexing Scheme for Large Scale GPS Tracks Retrieval Yu Zheng, Longhao Wang, Xing Xie Microsoft Research.
Diversified Trajectory Pattern Ranking in Geo-Tagged Social Media
Urban Sensing Based on Human Mobility
WSRec: A Collaborative Filtering Based Web Service Recommender System
Location Recommendation — for Out-of-Town Users in Location-Based Social Network Yina Meng.
Liang Zheng and Yuzhong Qu
Mole: Motion Leaks through Smartwatch Sensors
Presentation transcript:

Mining User Similarity Based on Location History Yu Zheng, Quannan Li, Xing Xie Microsoft Research Asia

Outline Introduction Architecture – Modeling Location History – Measuring User Similarity Experimental Results Conclusion

Introduction (1) Goals – Inferring the similarity (correlations ) between users from their location histories – Enable friend recommendation  Personalized location recommendation Motivation – The increasing availability of user-generated trajectories Life logging, Travel experience sharing Sports activity analysis, Multimedia content management,… – People’s outdoor movements in the real world imply their interests Like sports: if frequently visit gyms and stadiums Like Travel: if usually access mountains and lakes – According to the first law of the geography Everything is related to everything else, but near things are more related than distant things. People with similar location histories might share similar interests and preferences. – Significance of user similarity in Web communities Generally, it help users find more relevant information from a large-scale dataset In GIS community: friend discovering and location recommendation

Introduction (2) Difficulty & Challenges – How to model different users’ location history uniformly Various users’ location histories are inconsistent and incomparable What’s a shared location? By distance ?? X – How to measure the similarity between users By counting the number of shared locations ?? The Pearson correlation and the cosine correlation ?? They do not take into account two important properties of people’s outdoor movements. Contribution and insights – A step towards integrating social networking into GIS – A hierarchical-graph Uniformly modeling different users’ location histories on a various scales of geo-spaces – A similarity measure considering Sequence property of users’ movement behavior Hierarchy property of geographic spaces

Preliminary GPS logs P and GPS trajectory Stay points S={s 1, s 2,…, s n }. – Stands for a geo-region where a user has stayed for a while – E.g., if a user spent more 20 minutes within a distance of 200 meters – Carry a semantic meaning beyond a raw GPS point Location history: – represented by a sequence of stay points – with transition intervals

Architecture (1) Modeling Location History Measuring Similarity A similarity score Sij for each pair of users A Hierarchical Graph for each individual

Modeling Location History (1) 1. Stay point detection 2. Hierarchical clustering 3. Individual graph building Measuring Similarity A similarity score Sij for each pair of users A Hierarchical Graph for each individual Modeling Location History

3. Individual graph building Modeling Location History (2) 1. Stay point detection 2. Hierarchical clustering

Measuring User Similarity (1) 1. Sequence Extraction 2. Sequence Matching 3. Similarity Score Calculating Measuring Similarity A similarity score Sij for each pair of users A Hierarchical Graph for each individual Modeling Location History

Measuring Similarity (2) Similar sequence Extraction,,

Measuring Similarity (3) Sequence matching – We aim to find out the maximum-length similar sequence – A pair of similar sequence: two individuals share the property of visiting the same sequence of places with a similar time interval ACAC A  B  C √ Same visiting order: a i == b i Similar transition time: ABAB B  A X X

Measuring Similarity (4) Similarity Calculating – Two factors The length of the matched similar sequence The level of the matched similar sequence – Calculation,, 1. Calculating similarity score for each sequence (weighted by its length) 2. Adding up similarity score of each sequence found on a level 3. Weighted Summing up the score of multiple levels

Measuring Similarity (5) User 2: b  d User 1: A  B User 1: a  c  e User 1: A  B User 3: A  B A  B c  e A  B User 1: a  c  e User 2: A  B User 3: b  c  e User 1: User3> User 2

Experiments (1) GPS Devices and Users – 112 users collecting the data in the past year

Experiments (2) GPS dataset – > 6 million GPS points – > 170,000 kilometers – 36 cities in China and a few city in the USA, Korea and Japan

Experiments (3) Relevance levelRelationships suggestion 4Strongly similarFamily members/intimate lovers/roommate 3SimilarGood friends/workmates/classmates 2Weakly similarOrdinary friends, neighbors in a community 1DifferentStrangers in the same city 0Quite differentStrangers in other cities Evaluation approach – Evaluated as an information retrieval problem – Ground truth: Users label the relationship with a ratings show in this Table

Experiments (4) Comparing with baselines – The Pearson Correlation – Cosine Similarity

Experiments (5) NDCG comparison

Conclusion A hierarchical graph – A uniform framework to measure various users’ location histories – Effectively modeling users’ outdoor movements Sequentially Hierarchically Our similarity measurement outperformed existing methods – The Person measurement and – Cosine similarity measurement – Hierarchy + Sequence achieved the best performance

Thanks! Microsoft Research Asia