COMP423 Intelligent Agents

Recommender systems
Two approaches
– Collaborative filtering: based on feedback from other users who have rated a similar set of items in the past
– Content-based filtering (e.g. SmartMuseum): based on how well the content of the target item matches the user's preferred content pattern, which is learnt from the user's own past ratings and the content pattern of the rated items
– Hybrid

User-based collaborative filtering
Nearest-neighbor collaborative filtering
– Calculate user similarities: Pearson's correlation
– Define the effective neighborhood
– Compute the predicted ratings
The correlation of two users, Ken and Lee, who have both rated n items: K(1..n), L(1..n)
Prediction of Ken's rating for item m
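The formulas for this slide are not present in the transcript. As a minimal sketch, assuming the standard Pearson correlation over co-rated items and the common mean-centred, similarity-weighted prediction, the three steps could look like the code below. The function names, the `ratings` dictionary layout, the neighborhood rule (top `k` positively correlated users), and the sample data are all hypothetical choices for illustration, not taken from the slides.

```python
from math import sqrt

def pearson(a, b):
    """Pearson correlation of two users over the items both have rated."""
    common = [i for i in a if i in b]
    n = len(common)
    if n < 2:
        return 0.0
    mean_a = sum(a[i] for i in common) / n
    mean_b = sum(b[i] for i in common) / n
    num = sum((a[i] - mean_a) * (b[i] - mean_b) for i in common)
    den = (sqrt(sum((a[i] - mean_a) ** 2 for i in common))
           * sqrt(sum((b[i] - mean_b) ** 2 for i in common)))
    return num / den if den else 0.0

def predict(ratings, user, item, k=2):
    """Predict a rating from the k most similar users who rated the item."""
    target = ratings[user]
    mean_u = sum(target.values()) / len(target)
    # Step 1: similarity to every other user who has rated the item.
    candidates = [(pearson(target, r), r)
                  for other, r in ratings.items()
                  if other != user and item in r]
    # Step 2: effective neighborhood (assumption: top k with positive similarity).
    neighbours = sorted((c for c in candidates if c[0] > 0),
                        key=lambda c: c[0], reverse=True)[:k]
    # Step 3: the user's mean plus the weighted deviation of each neighbour
    # from that neighbour's own mean rating.
    num = sum(sim * (r[item] - sum(r.values()) / len(r)) for sim, r in neighbours)
    den = sum(abs(sim) for sim, _ in neighbours)
    return mean_u + num / den if den else mean_u

# Hypothetical ratings: Ken has not rated item "m" yet.
ratings = {
    "ken": {"a": 5, "b": 3, "c": 4},
    "lee": {"a": 4, "b": 2, "c": 3, "m": 4},
    "meg": {"a": 1, "b": 5, "c": 2, "m": 1},
}
print(pearson(ratings["ken"], ratings["lee"]))
print(predict(ratings, "ken", "m"))
```

In this toy data Ken and Lee agree on the relative order of items a, b, c, so Lee dominates the prediction, while Meg is negatively correlated with Ken and falls outside the neighborhood.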

Item-based collaborative filtering
Item rating matrix
Item vectors: the columns
Item similarity
– Pearson's correlation
– Cosine similarity
– Adjusted cosine similarity
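The three item similarity measures can be compared on a small example. This is a sketch under simplifying assumptions: the matrix is dense (every user has rated every item), rows are users and columns are items, and the helper names and sample matrix `R` are hypothetical. Pearson's correlation centres each item vector on the item's mean, whereas adjusted cosine centres each rating on the user's mean before comparing columns.

```python
from math import sqrt

def column(matrix, j):
    """Item vector: column j of the user-by-item rating matrix."""
    return [row[j] for row in matrix]

def cosine(x, y):
    """Cosine of the angle between two equal-length vectors."""
    num = sum(a * b for a, b in zip(x, y))
    den = sqrt(sum(a * a for a in x)) * sqrt(sum(b * b for b in y))
    return num / den if den else 0.0

def pearson(x, y):
    """Pearson's correlation: cosine after subtracting each item's mean."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    return cosine([a - mean_x for a in x], [b - mean_y for b in y])

def adjusted_cosine(matrix, i, j):
    """Adjusted cosine: cosine after subtracting each user's mean rating."""
    xi, xj = [], []
    for row in matrix:
        user_mean = sum(row) / len(row)
        xi.append(row[i] - user_mean)
        xj.append(row[j] - user_mean)
    return cosine(xi, xj)

# Hypothetical dense matrix: 3 users (rows) by 3 items (columns).
R = [
    [5, 4, 1],
    [4, 5, 2],
    [1, 2, 5],
]
print(cosine(column(R, 0), column(R, 1)))
print(pearson(column(R, 0), column(R, 2)))
print(adjusted_cosine(R, 0, 2))
```

Plain cosine on raw ratings is always non-negative here because ratings are positive; the two centred variants can go negative, which is what lets them express that items 0 and 2 are liked by different users.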

Typical collaborative filtering
Memory-based collaborative filtering
– Nearest-neighbor based
– User similarity
– Item similarity
Clustering for collaborative filtering
– K-means
– HAC
– Naïve Bayes clustering
– Group oriented, less personalized; can be addressed by reducing cluster size

Content-based filtering
Content
– Features:
  Movie: directors, actors/actresses, producers, editors, distributors, keywords, reviews, ...
  Text recommendation: a set of extracted keywords
Classification problem

Hybrid
Collaborative filtering:
– Requires other users' rating data (cold start problem)
– Can work across domains
– Non-transitive association problem: users are linked by common items and items are linked by common users
Content based:
– Requires one user's rating data
– Requires the item's content data
– Not cross domain
Sequential hybridization
Combinational hybridization
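The slide names the two hybridization styles without defining them. One possible reading, offered only as an assumption, is that combinational hybridization blends the scores of both recommenders, while sequential hybridization lets one recommender shortlist items and the other rank the shortlist. The function names, the `weight` parameter, and the scores below are hypothetical.

```python
def combinational(cf_scores, cb_scores, weight=0.5):
    """Blend collaborative (cf) and content-based (cb) scores per item."""
    items = set(cf_scores) | set(cb_scores)
    return {i: weight * cf_scores.get(i, 0.0)
               + (1 - weight) * cb_scores.get(i, 0.0)
            for i in items}

def sequential(cb_scores, cf_scores, shortlist_size):
    """Shortlist by content-based score, then re-rank by collaborative score."""
    shortlist = sorted(cb_scores, key=cb_scores.get, reverse=True)[:shortlist_size]
    return sorted(shortlist, key=lambda i: cf_scores.get(i, 0.0), reverse=True)

# Hypothetical scores in [0, 1] from the two recommenders.
cf = {"a": 0.9, "b": 0.2, "c": 0.6}
cb = {"a": 0.1, "b": 0.8, "c": 0.7}
print(combinational(cf, cb))
print(sequential(cb, cf, shortlist_size=2))
```

An item missing from one recommender is scored 0.0 by it here, which is a simple way to let the content-based side cover items that have no ratings yet.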

Evaluation
Binary: convert ratings to positive or negative
– Precision
– Top-N precision
– Recall
– F-measure
– MAP: considers ranking, precision, and recall
  Mean of the average precision over all queries
  Average precision: the mean of the precision values at the points where each relevant document is retrieved (M: number of relevant documents)
  Average precision is roughly the area under the precision-recall curve
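The binary measures above can be sketched in a few lines. The function names and sample data are hypothetical, and the F-measure is taken to be the balanced F1. Average precision divides by M, the number of relevant documents, so a relevant document that is never retrieved contributes zero.

```python
def precision_recall_f1(recommended, relevant):
    """Precision, recall, and balanced F-measure for one recommendation list."""
    hits = len(set(recommended) & set(relevant))
    precision = hits / len(recommended) if recommended else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1

def precision_at_n(ranked, relevant, n):
    """Top-N precision: fraction of the first n ranked items that are relevant."""
    return sum(1 for item in ranked[:n] if item in relevant) / n

def average_precision(ranked, relevant):
    """Mean of the precision values at each rank where a relevant item appears."""
    hits = 0
    total = 0.0
    for rank, item in enumerate(ranked, start=1):
        if item in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant) if relevant else 0.0

def mean_average_precision(runs):
    """MAP over a list of (ranked list, relevant set) pairs, one per query."""
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)

# Hypothetical query: five ranked results, three relevant items, one never retrieved.
ranked = ["a", "b", "c", "d", "e"]
relevant = {"a", "c", "f"}
print(precision_recall_f1(ranked, relevant))
print(precision_at_n(ranked, relevant, 2))
print(average_precision(ranked, relevant))
print(mean_average_precision([(ranked, relevant), (["x", "y"], {"y"})]))
```

For the sample query, relevant items appear at ranks 1 and 3, so the precision values averaged are 1/1 and 2/3, divided by M = 3.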

Evaluation
Consider the rating score
– MAE: mean absolute error
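MAE averages the absolute difference between each predicted score and the score the user actually gave. A minimal sketch, with a hypothetical function name and sample values:

```python
def mean_absolute_error(predicted, actual):
    """Average of |predicted - actual| over paired scores."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("need two non-empty lists of equal length")
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)

# Hypothetical predicted and true scores for three items.
predicted = [4.5, 3.0, 2.0]
actual = [5, 3, 1]
print(mean_absolute_error(predicted, actual))  # (0.5 + 0 + 1) / 3 = 0.5
```

Lower is better; unlike the binary measures, MAE ignores the order of the recommendations and only measures how far the predicted scores are from the true ones.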

Research projects
Recommender systems combined with personalized search
– Building a profile from click-through data
– Query expansion based on the profile
Two-way recommendation
– Online dating systems
Knowledge-based, personalized recommendation

Opinion mining
Document level
Sentence level
Feature level

Bing Liu, UIC ACL
Feature-based summary (Hu and Liu, KDD-04)
GREAT Camera., Jun 3, 2004
Reviewer: jprice174 from Atlanta, Ga.
I did a lot of research last year before I bought this camera... It kinda hurt to leave behind my beloved nikon 35mm SLR, but I was going to Italy, and I needed something smaller, and digital. The pictures coming out of this camera are amazing. The 'auto' feature takes great pictures most of the time. And with digital, you're not wasting film if the picture doesn't come out. … ….
Feature-based summary:
Feature 1: picture
Positive: 12
– The pictures coming out of this camera are amazing.
– Overall this is a good camera with a really good picture clarity.
…
Negative: 2
– The pictures come out hazy if your hands shake even for a moment during the entire process of taking a picture.
– Focusing on a display rack about 20 feet away in a brightly lit room during day time, pictures produced by this camera were blurry and in a shade of orange.
Feature 2: battery life
…

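Visual summarization & comparison
[Figure: summary of reviews of Digital camera 1; features shown: Picture, Battery, Size, Weight, Zoom; + and _ mark the two sides of the chart]
[Figure: comparison of reviews of Digital camera 1 and Digital camera 2; + and _ mark the two sides of the chart]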

Opinion mining and sentiment analysis
Classification
Extraction
Summarization
Supervised, unsupervised
Corpus based, dictionary based

Opinion mining
Opinion holder, object, and opinions (P, N)
Comparative relations
– A is cheaper than B
Temporal opinion mining and summarization

Projects
Web of things
Hardware and software
Cross domain learning
Personalized search
Learning large knowledge base
Cross checking with Cyc, WordNet
Privacy

Web data mining
Web content mining
Web structure mining
Web usage mining

Two projects on security
Intrusion detection by clustering Web log files
– New similarity measure
Malicious Web pages
Automatic detection