Dr. Guandong Xu Intelligent Web & Information Systems (IWIS) Department of Computer Science, Aalborg University Web Usage Mining & Personalization.

Slides:



Advertisements
Similar presentations
Recommender System A Brief Survey.
Advertisements

Web Mining.
Recommender Systems & Collaborative Filtering
Data Mining for Web Personalization
Prediction Modeling for Personalization & Recommender Systems Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
Mining Frequent Patterns II: Mining Sequential & Navigational Patterns Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
Collaborative Filtering Sue Yeon Syn September 21, 2005.
1 RegionKNN: A Scalable Hybrid Collaborative Filtering Algorithm for Personalized Web Service Recommendation Xi Chen, Xudong Liu, Zicheng Huang, and Hailong.
Recommender Systems Aalap Kohojkar Yang Liu Zhan Shi March 31, 2008.
Chapter 12: Web Usage Mining - An introduction
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Web Mining Research: A Survey
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
LinkSelector: A Web Mining Approach to Hyperlink Selection for Web Portals Xiao Fang University of Arizona 10/18/2002.
Web Usage Mining: Processes and Applications
The Web is perhaps the single largest data source in the world. Due to the heterogeneity and lack of structure, mining and integration are challenging.
Web Mining Research: A Survey
Web Mining Research: A Survey
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
Discovery of Aggregate Usage Profiles for Web Personalization
Recommender Systems; Social Information Filtering.
Recommender systems Ram Akella November 26 th 2008.
Overview of Web Data Mining and Applications Part I
Data Mining for Web Personalization
Chapter 12 (Section 12.4) : Recommender Systems Second edition of the book, coming soon.
Personalizing & Recommender Systems
Overview of Distributed Data Mining Xiaoling Wang March 11, 2003.
1.Understand the decision-making process of consumer purchasing online. 2.Describe how companies are building one-to-one relationships with customers.
Data Mining : Introduction Chapter 1. 2 Index 1. What is Data Mining? 2. Data Mining Functionalities 1. Characterization and Discrimination 2. MIning.
Data Mining Techniques
+ Social Bookmarking and Collaborative Filtering Christopher G. Wagner.
FALL 2012 DSCI5240 Graduate Presentation By Xxxxxxx.
The 2nd International Conference of e-Learning and Distance Education, 21 to 23 February 2011, Riyadh, Saudi Arabia Prof. Dr. Torky Sultan Faculty of Computers.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Chapter 7 DATA, TEXT, AND WEB MINING Pages , 311, Sections 7.3, 7.5, 7.6.
Copyright © 2009 Pearson Education, Inc. Slide 6-1 Chapter 6 E-commerce Marketing Concepts.
Distributed Networks & Systems Lab. Introduction Collaborative filtering Characteristics and challenges Memory-based CF Model-based CF Hybrid CF Recent.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Lecture 9: Knowledge Discovery Systems Md. Mahbubul Alam, PhD Associate Professor Dept. of AEIS Sher-e-Bangla Agricultural University.
EMIS 8381 – Spring Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.
1 Business System Analysis & Decision Making – Data Mining and Web Mining Zhangxi Lin ISQS 5340 Summer II 2006.
Data Mining By Dave Maung.
Discovery of Aggregate Usage Profiles for Web Personalization Bamshad Mobasher, Honghua Dai, Tao Luo, Miki Nakagawa, Yuqing Sun, Jim Wiltshire WebKDD 2000.
Chapter 12: Web Usage Mining - An introduction Chapter written by Bamshad Mobasher Many slides are from a tutorial given by B. Berendt, B. Mobasher, M.
Srivastava J., Cooley R., Deshpande M, Tan P.N.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Recommender Systems Debapriyo Majumdar Information Retrieval – Spring 2015 Indian Statistical Institute Kolkata Credits to Bing Liu (UIC) and Angshul Majumdar.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
Recommender Systems. Recommender Systems (RSs) n RSs are software tools providing suggestions for items to be of use to users, such as what items to buy,
Web Mining Issues Size Size –>350 million pages –Grows at about 1 million pages a day Diverse types of data Diverse types of data.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
User Modeling and Recommender Systems: Introduction to recommender systems Adolfo Ruiz Calleja 06/09/2014.
Data Mining and Decision Support
Information Design Trends Unit Five: Delivery Channels Lecture 2: Portals and Personalization Part 2.
Personalization Services in CADAL Zhang yin Zhuang Yuting Wu Jiangqin College of Computer Science, Zhejiang University November 19,2006.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
© Prentice Hall1 DATA MINING Web Mining Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University Companion slides.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Chapter 8: Web Analytics, Web Mining, and Social Analytics
Collaborative Filtering - Pooja Hegde. The Problem : OVERLOAD Too much stuff!!!! Too many books! Too many journals! Too many movies! Too much content!
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
Data Mining: Concepts and Techniques
Recommender Systems & Collaborative Filtering
Adopted from Bin UIC Recommender Systems Adopted from Bin UIC.
Data Warehousing Data Mining Privacy
Discovery of Significant Usage Patterns from Clickstream Data
Web Mining Research: A Survey
Presentation transcript:

Dr. Guandong Xu Intelligent Web & Information Systems (IWIS) Department of Computer Science, Aalborg University Web Usage Mining & Personalization

References MOBASHAR, B., Chap4: Web Usage Mining and Personalization. In Practical Handbook of Internet Computing, Munindar P. Singh (ed.), CRC Press XU, G., ZHANG, Y., & LI, L., Web Mining and Social Networking : Techniques and Applications, Springer, Nov, 2010, http://

Personalization Web personalization can be described as any action that makes the Web experience of a user customized to the user’s taste or preferences. Principal elements of Web personalization include modeling of Web objects (such as pages or products) and subjects (such as users or customers), categorization of objects and subjects, matching between and across objects and/or subjects, and determination of the set of actions to be recommended for personalization

Personalization categories three general groups: manual decision rule systems - rules based on user demographics or static profiles (collected through a registration process); content-based filtering agents - rely on personal profiles and the content similarity of Web documents collaborative filtering systems- user ratings or preferences, and through a correlation engine.

Drawbacks of current systems Rule-based filtering Subjective description of users Prone to biased Static user profile – degrade the performance after time Content-based filtering Rely on content similarity, not feasible for non-textual resource Missing the consideration of user preference Collaborative filtering – a predominant approach in most commercial e-commerce systems Instead of the content features, it involves in matching the ratings of like-minded users for objects (e.g. movies or products) Most commonly used KNN algorithm

Limitations of collaborative filtering The high dimensionality of items in the system The sparsity of rating data – decrease the likelihood of a significant overlap of ratings The high computational cost of real-time prediction the necessity of integrating the explicit user ratings and implicit content or product-oriented features

Improvement on these concerns Optimization strategies, such as Similarity indexing and dimensionality reduction Offline clustering of user records – search only within a matching cluster Integration of content and user demographics A promising technique – web usage mining Goal is to capture and model the patterns and profiles of users interacting with a Web site

The aims of web usage mining The discovered patterns are usually represented as collections of pages or items that are frequently accessed by groups of users with common needs or interests. Such patterns can be used to better understand behavioral characteristics of visitors or user segments, improve the organization and structure of the site, and create a personalized experience for visitors by providing dynamic recommendations.

Aspects of web usage mining Enhance the above discussed approaches and remedy the shortcomings Benefits from the advance of data mining – employing data mining algorithms on the offline pattern discovery from user transaction – improve the scalability of CF Clustering Association rule mining Navigation pattern mining and so on

Personalization based on WUM Goal: to recommend a set of objects, e.g. links, ads, text, products or service tailored to the user’s preference Personalization process Derive the navigational patterns of users – “long-term” (the user activity history) or “short-term” (single sessions) – web log mining or analysis Match the target active user with the derived patterns Make recommendations via the matched pattern (recommendation engine)

The overview of web personalization based on WUM Consists of three phases data preparation and transformation, pattern discovery, and recommendation ( real-time)

The offline data preparation and pattern discovery components

The online personalization component

Data Preparation and Modeling the most time consuming and computationally intensive step in the knowledge discovery process requires the use of especial algorithms and heuristics not commonly employed in other domains critical to the successful extraction of useful patterns from the data

Sources and Types of Data Usage data: log file Content data: textual Structure data: linkage map User data: demographic, domain know…

Usage data preparation data preprocessing include data cleaning, pageview identification, user identification, session identification (or sessionization), the inference of missing references due to caching, and transaction (episode) identification SESSION #925 (USER_ID = 338) /news/default.asp /admissions/ >/news/default.asp /admissions/requirements.asp > /admissions/ /programs/ >/admissions/requirements.asp /programs/2002/gradect2002.asp > /programs/ /pdf/promos/2002/ect2002.pdf > /programs/2002/gradect2002.asp PageurlDuration(s)weight(%) /news/default.asp: /admissions/ /admissions/requirements.asp /programs/61.28 /programs/2002/gradect2002.asp /pdf/promos/2002/ect2002.pdf……

Web usage data model The data preprocessing results in User sessions, S={S i |i=1,…,m} Pages corpus, P={P j |j=1,…,n} Each user session s i ={,…, }, or simply s i ={w i1, w i2 …,w in }

Data integration from multiple source

Data integration example TP matrix: transaction-pageview ∈ R m×n PF matrix: pageview-feature ∈ R n×k Multiply TP with PF : TP ×PF=TP={t 1 ’,t 2 ’,…,t m ’} ∈ R m×k t j ’ is a k-dimensional vector over the feature space a user transaction can be represented as a content feature vector, reflecting that user’s interests in particular concepts or topics

Pattern discovery from usage data Employ various DM & ML approaches Standard statistical techniques for user behavior On more integrated data warehouse

Data mining approaches for usage data Given a set of n pageviews, P = {p 1, p 2, · · ·, p n }, and a set of m user transactions, T = {t 1, t 2, · · ·, t m }, where each t i ∈ T is a subset of P. Each transaction t as an l-length sequence of ordered pairs: t =, where each p t i = p j for some j ∈ {1, · · ·, n}, and w(p t i ) is the weight associated with pageview p t i in the transaction t.

Association rule mining Given a transaction T and a set I = {I 1, I 2,..., I k } of frequent itemsets over T. The support of an itemset I i ∈ I is defined as An association rule r is an expression of the form X ⇒ Y ( σ r, α r ), where X and Y are itemsets, σ r = σ (X ∪ Y ) is the support of X ∪ Y representing the probability that X and Y occur together in a transaction. The confidence for the rule r, α r, is given by σ (X ∪ Y ) / σ (X) and represents the conditional probability that Y occurs in a transaction given that X has occurred in that transaction.

An association rule example For example, a high-confidence rule such as {special-offers/, /products/software/} ⇒ {shopping-cart/} might provide some indication that a promotional campaign on software products is positively affecting online sales. Such rules can also be used to optimize the structure of the site. For example, if a site does not provide direct linkage between two pages A and B, the discovery of a rule {A} ⇒ {B} would indicate that providing a direct hyperlink might aid users in finding the intended information.

Sequential pattern mining Given a transaction set T and a set S = {S 1, S 2,..., S n } of frequent sequential (respectively, contiguous sequential) pattern over T, the support of each S i is defined as follows: The confidence of the rule X ⇒ Y, where X and Y are (contiguous) sequential patterns, is defined as where ◦ denotes the concatenation operator

Clustering approaches A process to assign data objects into various data groups or categories based on the similarity or distance between the objects such that the intra-group similarity within one group is maximized but the inter-group similarity is minimized. A unsupervised approach only relying on the mutual similarity, in contrast, classification is supervised learning In the context of usage data, two types of clustering : clustering the transactions (or users), or clustering pageviews Applications of clustering in Web usage mining, e-marketing, personalization, and collaborative filtering

Clustering in user profiling An example of deriving aggregate usage profiles from transaction clusters

Steps of clustering in user profiling Given the mapping of user transactions into a multi-dimensional space as vectors of pageviews (i.e., the matrix TP)

Steps of clustering in user profiling Given the mapping of user transactions into a multi-dimensional space as vectors of pageviews (i.e., the matrix TP) Employ standard clustering algorithms, such as k-means, generally partition this space into subgroups Obtain user segments, but not capturing an aggregated view of common user patterns

Steps of clustering in user profiling Given the mapping of user transactions into a multi-dimensional space as vectors of pageviews (i.e., the matrix TP) Employ standard clustering algorithms, such as k-means, generally partition this space into subgroups Obtain user segments, but not capturing an aggregated view of common user patterns Utilize the centroid (or the mean vector) of each cluster to represent the aggregated view of user pattern

Discovered Patterns for Personalization several effective recommendation algorithms based on clustering (which can be seen as an extension of standard kNN- based collaborative filtering), association rule mining (AR), and sequential pattern (SP) or contiguous sequential pattern (CSP) discovery.

kNN-Based Approach k-Nearest-Neighbor (kNN) approach involves comparing the activity record for a target user with the historical records of other users in order to find the top k users who have similar tastes or interests. Measuring the similarity or correlation between the active session s and each transaction vector t (where t ∈ T ). The top k most similar transactions to s are considered to be the neighborhood for the session s, which denoted by NB(s) (taking the size k of the neighborhood to be implicit) NB(s) = {t s 1,t s 2, · · ·,t s k }. Various similarity measure – Pearson correlation coefficient; cosine coefficient; Jaccard distance and so on.

Recommendation score Given the active use session s and the nearest neighbors NB(s), the recommendation score of an item is where weight(p,NB(s)) is the mean weight for pageview p in the neighborhood as expressed in the centroid vector

Using Clustering For Personalization A method PACT (Profile Aggregation Based on Clustering Transactions) [Mobasher et al., 2002] an aggregate usage profile pr c as a set of pageview-weight pairs: pr c = { | p ∈ P, weight(p, pr c ) ≥ μ }, where the significance weight, weight(p, pr c ), of the pageview p within the usage profile pr c Recommendation score

Association Rules for Personalization Sample Web Transactions involving pageviews A, B, C, D and E Example of discovered frequent itemsets

An example of a Frequent Itemsets Graph (frequency threshold of 4) Now, given user active session window B,E, the recommendation generation algorithm finds items A and C as candidate recommendations. The recommendation scores of item A and C are 1 and 4/5, corresponding to the confidences of the rules {B,E} → {A} and {B,E} → {C}, respectively.

Sequential Patterns for Personalization Example of discovered sequential patterns

Example of discovered contiguous sequential patterns

Recommendation example Give a user’s active session window A,B, the recommendation engine using sequential patterns finds item E as a candidate recommendation. The recommendation score of item E is 1, corresponding to the rule A,B ⇒ E. On the other hand, the recommendation engine using contiguous sequential patterns will, in this case, fails to give any recommendations

Thanks!