Friends and Locations Recommendation with the use of LBSN

Slides:



Advertisements
Similar presentations
15th CTI Workshop, July 26, Smart Itinerary Recommendation based on User-Generated GPS Trajectories Hyoseok Yoon 1, Y. Zheng 2, X. Xie 2 and W.
Advertisements

Location-Based Social Networks Yu Zheng and Xing Xie Microsoft Research Asia Chapter 8 and 9 of the book Computing with Spatial Trajectories.
Mining User Similarity Based on Location History Yu Zheng, Quannan Li, Xing Xie Microsoft Research Asia.
Indexing DNA Sequences Using q-Grams
Learning Location Correlation From GPS Trajectories Yu Zheng Microsoft Research Asia March 16, 2010.
Social Media Mining Chapter 5 1 Chapter 5, Community Detection and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool, September, 2010.
Data Mining Techniques: Clustering
Geographical and Temporal Similarity Measurement in Location-based Social Networks Chongqing University of Posts and Telecommunications KTH – Royal Institute.
Constructing Popular Routes from Uncertain Trajectories Ling-Yin Wei 1, Yu Zheng 2, Wen-Chih Peng 1 1 National Chiao Tung University, Taiwan 2 Microsoft.
Identity and search in social networks Presented by Pooja Deodhar Duncan J. Watts, Peter Sheridan Dodds and M. E. J. Newman.
Company LOGO 1 Identity and Search in Social Networks D.J.Watts, P.S. Dodds, M.E.J. Newman Maryam Fazel-Zarandi.
Context-aware Query Suggestion by Mining Click-through and Session Data Authors: H. Cao et.al KDD 08 Presented by Shize Su 1.
Yu Zheng, Lizhu Zhang, Xing Xie, Wei-Ying Ma Microsoft Research Asia
Chen Cheng1, Haiqin Yang1, Irwin King1,2 and Michael R. Lyu1
Temporal Pattern Matching of Moving Objects for Location-Based Service GDM Ronald Treur14 October 2003.
Chapter 2Modeling 資工 4B 陳建勳. Introduction.  Traditional information retrieval systems usually adopt index terms to index and retrieve documents.
Mining Interesting Locations and Travel Sequences from GPS Trajectories defense by Alok Rakkhit.
1 CS 430 / INFO 430 Information Retrieval Lecture 3 Vector Methods 1.
1 A DATA MINING APPROACH FOR LOCATION PREDICTION IN MOBILE ENVIRONMENTS* by Gökhan Yavaş Feb 22, 2005 *: To appear in Data and Knowledge Engineering, Elsevier.
Similar Sequence Similar Function Charles Yan Spring 2006.
Recommender systems Ram Akella November 26 th 2008.
Mining Interesting Locations and Travel Sequences From GPS Trajectories Yu Zheng and Xing Xie Microsoft Research Asia March 16, 2009.
1 Shortest Path Calculations in Graphs Prof. S. M. Lee Department of Computer Science.
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα
 Clustering of Web Documents Jinfeng Chen. Zhong Su, Qiang Yang, HongHiang Zhang, Xiaowei Xu and Yuhen Hu, Correlation- based Document Clustering using.
Guiding Motif Discovery by Iterative Pattern Refinement Zhiping Wang, Mehmet Dalkilic, Sun Kim School of Informatics, Indiana University.
Mining Interesting Locations and Travel Sequences from GPS Trajectories IDB & IDS Lab. Seminar Summer 2009 강 민 석강 민 석 July 23 rd,
Kumar Srijan ( ) Syed Ahsan( ). Problem Statement To create a Neural Networks based multiclass object classifier which can do rotation,
1 CS 430: Information Discovery Lecture 9 Term Weighting and Ranking.
Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.
1 University of Palestine Topics In CIS ITBS 3202 Ms. Eman Alajrami 2 nd Semester
Wang-Chien Lee i Pervasive Data Access ( i PDA) Group Pennsylvania State University Mining Social Network Big Data Intelligent.
Spatial-Temporal Models in Location Prediction Jingjing Wang 03/29/12.
Friends and Locations Recommendation with the use of LBSN By EKUNDAYO OLUFEMI ADEOLA
Learning Geographical Preferences for Point-of-Interest Recommendation Author(s): Bin Liu Yanjie Fu, Zijun Yao, Hui Xiong [KDD-2013]
Distributed Information Retrieval Server Ranking for Distributed Text Retrieval Systems on the Internet B. Yuwono and D. Lee Siemens TREC-4 Report: Further.
Improving Web Search Results Using Affinity Graph Benyu Zhang, Hua Li, Yi Liu, Lei Ji, Wensi Xi, Weiguo Fan, Zheng Chen, Wei-Ying Ma Microsoft Research.
Query Suggestion Naama Kraus Slides are based on the papers: Baeza-Yates, Hurtado, Mendoza, Improving search engines by query clustering Boldi, Bonchi,
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
Answering Similar Region Search Queries Chang Sheng, Yu Zheng.
1 CSC 594 Topics in AI – Text Mining and Analytics Fall 2015/16 6. Dimensionality Reduction.
Algorithmic Detection of Semantic Similarity WWW 2005.
Spatial Analysis of Crime Data: A Case Study Mike Tischler Presented by Arnold Boedihardjo.
DOCUMENT UPDATE SUMMARIZATION USING INCREMENTAL HIERARCHICAL CLUSTERING CIKM’10 (DINGDING WANG, TAO LI) Advisor: Koh, Jia-Ling Presenter: Nonhlanhla Shongwe.
Ranking Link-based Ranking (2° generation) Reading 21.
1 A Web Search Engine-Based Approach to Measure Semantic Similarity between Words Presenter: Guan-Yu Chen IEEE Trans. on Knowledge & Data Engineering,
Intelligent DataBase System Lab, NCKU, Taiwan Josh Jia-Ching Ying 1, Wang-Chien Lee 2, Tz-Chiao Weng 1 and Vincent S. Tseng 1 1 Department of Computer.
Mining Trajectory Profiles for Discovering User Communities Speaker : Chih-Wen Chang National Chiao Tung University, Taiwan Chih-Chieh Hung,
Self-Organized Web Usage Regularities. Problems of foraging information on WWW Slow accession Difficulty in finding useful information is related to balkanization.
Trajectory Data Mining Dr. Yu Zheng Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Editor-in-Chief of ACM Trans.
A Supervised Machine Learning Algorithm for Research Articles Leonidas Akritidis, Panayiotis Bozanis Dept. of Computer & Communication Engineering, University.
1 CS 430: Information Discovery Lecture 5 Ranking.
1 CSC 594 Topics in AI – Text Mining and Analytics Fall 2015/16 8. Text Clustering.
Computational Biology, Part 3 Representing and Finding Sequence Features using Frequency Matrices Robert F. Murphy Copyright  All rights reserved.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Mining Advisor-Advisee Relationships from Research Publication.
Detecting Remote Evolutionary Relationships among Proteins by Large-Scale Semantic Embedding Xu Linhe 14S
© Prentice Hall1 DATA MINING Web Mining Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University Companion slides.
Location-based Social Networks 6/11/20161 CENG 770.
Cluster Analysis What is Cluster Analysis? Types of Data in Cluster Analysis A Categorization of Major Clustering Methods Partitioning Methods.
IR 6 Scoring, term weighting and the vector space model.
WEB STRUCTURE MINING SUBMITTED BY: BLESSY JOHN R7A ROLL NO:18.
Diversified Trajectory Pattern Ranking in Geo-Tagged Social Media
Mining User Similarity from Semantic Trajectories
Clustering of Web pages
Spatio-temporal Segmentation of Video by Hierarchical Mean Shift Analysis Daniel DeMenthon SMVP 2002.
Topic 3: Cluster Analysis
Location Recommendation — for Out-of-Town Users in Location-Based Social Network Yina Meng.
CS 430: Information Discovery
Topic 5: Cluster Analysis
Topological Signatures For Fast Mobility Analysis
Presentation transcript:

Friends and Locations Recommendation with the use of LBSN By EKUNDAYO OLUFEMI ADEOLA 16.02.2014

TERMINOLOGIES LBSN (Location Base Social Networks) POI (Points of Interest)Specific point on earth User Trajectory P1->P2->P3->…Pn Stay Point Time threshold (tout – tin >∂time) More of these will be defined as we proceed.

Are we really just scattered around the world with random behaviors?

Law of Geography Everything is related to everything else, but near things are more related than distant things. Users and Locations are the two things to be considered here.

Here we have different kinds of graphs; User - Location graph User - User graph Location - Location graph

Understanding Users How best can we understand users behavior, so that we can recommend for them some friend or locations. Here are some tips;

One day Activity; Location history in one day Routine Activity; People with rich knowledge about a region Visit Point/Reference Point; Visit point(P) (tout – tin > time threshold(∂time)). Reference point eg House, School One day Activities;

Routine activities mining Discovering activities patterns from multiple 1-day activities. Find groups of similar 1-day activities Represent 1-day activities within the same group with a routine activity model.

Time based clustering algorithms for visit point extractions

One-day activity clustering algorithm for routine activity mining.

Modeling human location history in semantic spaces Stay Point Representation Stay region Using IT-IDF (Term Frequency Inverse Document Frequency) the semantic meaning of a region could be gotten. Regards POIs as word and treat stay region as documents.

Feature Vector; The feature of a stay region r in a collection of regions R is fr = <w1,w2 ...,wK >, where K is the number of unique POI categories in a POI database and wi is the weight of POI category i in the region r. The value of wi is calculated as ;

Example; Suppose thats1 contains two restaurants and one museum, and s2 only has four restaurants. The total number of stay regions created by all the users is 100, in which 50 have restaurants and two contain museums. So, the feature vectors of s1 and s2 are f1 and f2 respectively: PS: Although we still cannot identify the exact POI category visited by an individual, this feature vector determines the interests of a user to some extent by extracting the semantic meaning of a region accessed by the individual

Using IT-IDF the semantic meaning of a region could be gotten Using IT-IDF the semantic meaning of a region could be gotten. Regards POIs as word and treat stay region as documents.

User Similarities Calculations Popularity of different locations Hierarchical property of geographic spaces Sequential property of users’ movements; Loc A -> LocB -> LocC ->…-> LocN. Here we consider Travel Match

Travel Match The locations in a travel match do not have to be consecutive in the user’s original location history What we need to detect for the calculating of user similarity are the maximum travel matches

Similarity Score N = Total number of users in the database, n = number of users visiting location c. fw(l) is employed to assign a bigger weight to the similarity of sequences occurring at a lower layer, where l = depth of a layer in the hierarchy.

Hierarachical User clustering

Generic Travel Recommendations Interesting Places Popular Travel Sequence  Itinerary Planning  Activities Recommendation.

Mining Interesting Locations 1) Formulate a shared hierarchical framework F, using the stay points detected from user's GPS logs, which are then clustered into hierarchy geospatial region. 2) Build location graphs on each layer based on shared framework F and user’s location histories

Factor considered Location Interest  Uses HIT (Hypertext Induced Topic Search) to give Authority score and Hub score to places. Location interest (cij) is represented by; where Iij = authority score of cluster cij conditioned by its ascendant nodes on level l, where 1 ≤ l < i.

Travel Experience; In our system, a user’s (e. g Travel Experience; In our system, a user’s (e.g., uk) travel experience is represented by a set of hub scores: Where denoted uk’s hub score conditioned by region cij.

Detecting travel sequences Example: Demonstrates the calculation of the popularity score for a 2-length sequence (i.e., a sequence containing two locations),A→C. First we consider the authority score of location A(IA), weighted by the Probability of people leaving by the sequence (Out AC) = 5/7. Secondly authority score of location C(Ic) Then the hub score of users (UAC) who have taken this sequence.

Following this method, the popularity score of sequence C → D is calculated as follows: Thus, the popularity score of sequence A→C→D equals: PS: This calculation will continue until we get the K-length most travel sequence.

Itinerary Recommendation

Factors considerd Query Verification; Check the feasibility of query according to spatial and temporal constraints. Trip Candidate Selection; Searches a location-interest graph for candidate itineraries satisfying a user’s query Trip Candidate Ranking; It ranks candidate itineraries according to three factor namely;

Elapsed Time ratio (ETR): Ratio between the time length of a recommended itinerary and that given by a user. Stay Time Ratio (STR): Ratio between time a user could spend in a location and that for traveling between locations. Interest Density Ratio (IDR): Sum of the interest values of the locations contained in an itinerary. Itineraries is ranked by the euclidean distance in these three dimensions: NB: Popular travel sequence could also be considered here.

Location-Activity Recommendation Location – feature Matrix; It uses IT-IDF to assign a feature to a location (e.g. Restaurants, movies theatre). Activities – Activities: It models the correlation between two diifferents activities that can be performed at a location.

This is my first draft Sir, your comment will be well appreciated This is my first draft Sir, your comment will be well appreciated. Thanks you.