Collaborative Filtering in Social Tagging System on Joint Item-Tag Recommendations Date : 2011/11/7 Source : Jing Peng et. al (CIKM’10) Speaker : Chiu.

Slides:



Advertisements
Similar presentations
Recommender System A Brief Survey.
Advertisements

Music Recommendation by Unified Hypergraph: Music Recommendation by Unified Hypergraph: Combining Social Media Information and Music Content Jiajun Bu,
Aggregating local image descriptors into compact codes
Latent Semantic Analysis
Collaborative QoS Prediction in Cloud Computing Department of Computer Science & Engineering The Chinese University of Hong Kong Hong Kong, China Rocky.
1 Latent Semantic Mapping: Dimensionality Reduction via Globally Optimal Continuous Parameter Modeling Jerome R. Bellegarda.
CUBELSI : AN EFFECTIVE AND EFFICIENT METHOD FOR SEARCHING RESOURCES IN SOCIAL TAGGING SYSTEMS Bin Bi, Sau Dan Lee, Ben Kao, Reynold Cheng The University.
A Semantic Clustering-based Approach For Searching And Browsing Tag Spaces Date: 2011/10/17 Source:Damir Vandic et. al (SAC’11) Speaker:Chiang,guang-ting.
Chapter 2 Random Vectors 與他們之間的性質 (Random vectors and their properties)
Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan Susan T.Dumains Eric Horvitz MIT,CSAILMicrosoft Researcher Microsoft.
DIMENSIONALITY REDUCTION BY RANDOM PROJECTION AND LATENT SEMANTIC INDEXING Jessica Lin and Dimitrios Gunopulos Ângelo Cardoso IST/UTL December
:New Land ★★★★☆ 題組: Problem Set Archive with Online Judge 題號: 11871: New Land 解題者:施博修 解題日期: 2011 年 6 月 8 日 題意:國王有一個懶兒子,為了勞動兒子,他想了一個 辦法,令他在某天早上開始走路,直到太陽下山前,靠.
: OPENING DOORS ? 題組: Problem Set Archive with Online Judge 題號: 10606: OPENING DOORS 解題者:侯沛彣 解題日期: 2006 年 6 月 11 日 題意: - 某間學校有 N 個學生,每個學生都有自己的衣物櫃.
JAVA 程式設計與資料結構 第十四章 Linked List. Introduction Linked List 的結構就是將物件排成一列, 有點像是 Array ,但是我們卻無法直接經 由 index 得到其中的物件 在 Linked List 中,每一個點我們稱之為 node ,第一個 node.
程式註解說明. 2 程式註解格式 塊狀註解 對檔案、 class 、 method 、資料結構、一段程式 …. 等程式區塊 做說明。 第一行的開頭必需為 “/*” 且沒有其他文字,最後一行的開頭 必需以 “*/” 做為結束,在中間每一行的開頭都必需是一個 “*” 。 單行註解 佔據一整行的說明。 以.
: The Playboy Chimp ★★☆☆☆ 題組: Problem Set Archive with Online Judge 題號: 10611: The Playboy Chimp 解題者:蔡昇宇 解題日期: 2010 年 2 月 28 日 題意:給一已排序的數列 S( 升冪.
1 單元三 查詢結果的引用分析 Web of Science 利用指引 查看出版及被引用情況 在查詢結果的清單中,可以瀏覽近 20 年來查詢主題出版和被引用的情況。
圖片索引專題 指導教授:陳淑媛 教授 黃伯偉 林育瑄. 動機 & 理念  目前圖像檢索系統中使用的大多都為利用文字 標籤圖像或是圖像輪廓特徵來進行搜尋,然而 輪廓特徵的缺點卻是所有組成圖像的線條都要 逐一處理相當耗時。  所以本研究的目標在於,提出一個以像素點為 特徵的有效率與正確率的圖像檢索演算法實作。
: Problem A : MiniMice ★★★★☆ 題組: Contest Archive with Online Judge 題號: 11411: Problem A : MiniMice 解題者:李重儀 解題日期: 2008 年 9 月 3 日 題意:簡單的說,題目中每一隻老鼠有一個編號.
: Ahoy, Pirates! ★★★★☆ 題組: Contest Archive with Online Judge 題號: 11402: Ahoy, Pirates! 解題者:李重儀 解題日期: 2008 年 8 月 26 日 題意:有一個海盜島有 N 個海盜,他們的編號 (id)
Singular Value Decomposition in Text Mining Ram Akella University of California Berkeley Silicon Valley Center/SC Lecture 4b February 9, 2011.
資料結構實習-一 參數傳遞.
IR Models: Latent Semantic Analysis. IR Model Taxonomy Non-Overlapping Lists Proximal Nodes Structured Models U s e r T a s k Set Theoretic Fuzzy Extended.
Queensland University of Technology An Ontology-based Mining Approach for User Search Intent Discovery Yan Shen, Yuefeng Li, Yue Xu, Renato Iannella, Abdulmohsen.
845: Gas Station Numbers ★★★ 題組: Problem Set Archive with Online Judge 題號: 845: Gas Station Numbers. 解題者:張維珊 解題日期: 2006 年 2 月 題意: 將輸入的數字,經過重新排列組合或旋轉數字,得到比原先的數字大,
: Finding Paths in Grid ★★★★☆ 題組: Contest Archive with Online Judge 題號: 11486: Finding Paths in Grid 解題者:李重儀 解題日期: 2008 年 10 月 14 日 題意:給一個 7 個 column.
A Privacy Preserving Efficient Protocol for Semantic Similarity Join Using Long String Attributes Bilal Hawashin, Farshad Fotouhi Traian Marius Truta Department.
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Predicting Missing Provenance Using Semantic Associations in Reservoir Engineering Jing Zhao University of Southern California Sep 19 th,
Recommender Systems on the Web: A Model-Driven Approach Gonzalo Rojas – Francisco Domínguez – Stefano Salvatori Department of Computer Science University.
Tag Clouds Revisited Date : 2011/12/12 Source : CIKM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh. Jia-ling 1.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
Wancai Zhang, Hailong Sun, Xudong Liu, Xiaohui Guo.
Beyond Co-occurrence: Discovering and Visualizing Tag Relationships from Geo-spatial and Temporal Similarities Date : 2012/8/6 Resource : WSDM’12 Advisor.
Andriy Shepitsen, Jonathan Gemmell, Bamshad Mobasher, and Robin Burke
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
Exploring Online Social Activities for Adaptive Search Personalization CIKM’10 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
Thwarting Passive Privacy Attacks in Collaborative Filtering Rui Chen Min Xie Laks V.S. Lakshmanan HKBU, Hong Kong UBC, Canada UBC, Canada Introduction.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
Retrieval Models for Question and Answer Archives Xiaobing Xue, Jiwoon Jeon, W. Bruce Croft Computer Science Department University of Massachusetts, Google,
Document Clustering 文件分類 林頌堅 世新大學圖書資訊學系 Sung-Chien Lin Department of Library and Information Studies Shih-Hsin University.
Authors: Rosario Sotomayor, Joe Carthy and John Dunnion Speaker: Rosario Sotomayor Intelligent Information Retrieval Group (IIRG) UCD School of Computer.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
SINGULAR VALUE DECOMPOSITION (SVD)
Mingyang Zhu, Huaijiang Sun, Zhigang Deng Quaternion Space Sparse Decomposition for Motion Compression and Retrieval SCA 2012.
Face Recognition by Support Vector Machines 指導教授 : 王啟州 教授 學生 : 陳桂華 Guodong Guo, Stan Z. Li, and Kapluk Chan School of Electrical and Electronic Engineering.
LOGO Identifying Opinion Leaders in the Blogosphere Xiaodan Song, Yun Chi, Koji Hino, Belle L. Tseng CIKM 2007 Advisor : Dr. Koh Jia-Ling Speaker : Tu.
Modern information retreival Chapter. 02: Modeling (Latent Semantic Indexing)
Pairwise Preference Regression for Cold-start Recommendation Speaker: Yuanshuai Sun
Web Search and Text Mining Lecture 5. Outline Review of VSM More on LSI through SVD Term relatedness Probabilistic LSI.
Automating Readers’ Advisory to Make Book Recommendations for K-12 Readers by Alicia Wood.
CONTEXTUAL SEARCH AND NAME DISAMBIGUATION IN USING GRAPHS EINAT MINKOV, WILLIAM W. COHEN, ANDREW Y. NG SIGIR’06 Date: 2008/7/17 Advisor: Dr. Koh,
A DDING S TRUCTURE TO T OP -K: F ORM I TEMS TO E XPANSIONS Date : Source : CIKM’ 11 Speaker : I-Chih Chiu Advisor : Dr. Jia-Ling Koh 1.
MMM2005The Chinese University of Hong Kong MMM2005 The Chinese University of Hong Kong 1 Video Summarization Using Mutual Reinforcement Principle and Shot.
Contextual Search and Name Disambiguation in Using Graphs Einat Minkov, William W. Cohen, Andrew Y. Ng Carnegie Mellon University and Stanford University.
Mining Tag Semantics for Social Tag Recommendation Hsin-Chang Yang Department of Information Management National University of Kaohsiung.
2013/06/13 Page 1 複變數邊界積分方程推導 National Taiwan Ocean University MSVLAB Department of Harbor and River Engineering Date: June, 13, 2013 Student: Jia-Wei.
CiteData: A New Multi-Faceted Dataset for Evaluating Personalized Search Performance CIKM’10 Advisor : Jia-Ling, Koh Speaker : Po-Hsien, Shih.
A Self-organizing Semantic Map for Information Retrieval Xia Lin, Dagobert Soergel, Gary Marchionini presented by Yi-Ting.
1 Dongheng Sun 04/26/2011 Learning with Matrix Factorizations By Nathan Srebro.
Customized of Social Media Contents using Focused Topic Hierarchy
Where Did You Go: Personalized Annotation of Mobility Records
Sourse: WSDM 2017 Advisor: Jia-Ling Koh Speaker: Hsiu-Yi,Chu
Asymmetric Correlation Regularized Matrix Factorization for Web Service Recommendation Qi Xie1, Shenglin Zhao2, Zibin Zheng3, Jieming Zhu2 and Michael.
Postdoc, School of Information, University of Arizona
Enriching Taxonomies With Functional Domain Knowledge
Date: 2012/11/15 Author: Jin Young Kim, Kevyn Collins-Thompson,
WSExpress: A QoS-Aware Search Engine for Web Services
Bug Localization with Combination of Deep Learning and Information Retrieval A. N. Lam et al. International Conference on Program Comprehension 2017.
An Image Quality Evaluation Method Based on Digital Watermarking
Presentation transcript:

Collaborative Filtering in Social Tagging System on Joint Item-Tag Recommendations Date : 2011/11/7 Source : Jing Peng et. al (CIKM’10) Speaker : Chiu I- Chih Advisor : Dr. Koh. Jia-ling 1

Outline Introduction Unified User Profiling Joint Item-Tag Recommendation Experiment Conclusion 2

Introduction In recent years, social tagging has been gaining wide-spread popularity in a variety of applications. Enabling automated recommendation of various kinds in social tagging systems can further enhance this important social information discovery mechanism. All of the previous research focuses on recommendations of either items or tags. 3

Motivation: How to improve item recommendation leveraging tagging information. Goal: Propose a unified user profiling scheme. Generate joint item-tag recommendations.  A novel framework for collaborative filtering in social tagging systems. Introduction 4

Unified User Profiling 1. Integrated structure of social tagging 2. Unified user profiling scheme 3. Weighting of user profile matrix 4. Dimensionality reduction 5

Integrated Structure of Social Tagging B-Select B-Use T-Annotate Proposed structure has integrated all possible co-occurrence information among the three entities into one framework. 6 B-Associated

Unified User Profiling Scheme There are three methods to profile users on the tagging data: 1. The item vector of the user’s historical records. 2. A tag vector of use frequencies. 3. An extended 0-1 valued item-plus-tag vector. The similarity between users is usually judged based on some of the following criteria: 1. Users have saved common items — B-Select similarity. 2. Users have used common tags — B-Use similarity. 3. Users have used common tags on the same item — T-Annotate similarity. 7

User A T1T2T3T4 I10011 I21000 I30110 I40000 User B T1T2T3T4 I10110 I20100 I31001 I

When the item is being saved, this Hidden Tag will be used automatically. Users will be considered to be similar to a certain extent through the Hidden Tag once they have saved the same item, even if they have assigned completely different sets of tags to the item. Laziness, use of non-descriptive tags for personal uses only, and spelling error. User A T1T2T3T4 I10011 I21000 I30110 I40000 User B T1T2T3T4 I10110 I20100 I31001 I

Weighting of User Profile Matrix T0T1T2T3 I I I I Hidden Tag Hidden Item Tag Item k : ranking index of the tag. K : total number of tags. : empirical parameter. T1T2T3 I1101 I2010 I3001 k T1=3,T2=2,T3=1, if they exist in the same item K I1=2,I2=4,I3=8 = 2 p 01 =0.25/3 p 02 =0.125/3 p 03 =0.75/3 10

Dimensionality Reduction Profile matrix It can be efficiently stored in a sparse form.  Calculating similarities → Time-consuming. Solution : Latent Semantic AnalysisLatent Semantic Analysis The lower dimensional representation of users is obtained  Compute the cosine similarity between users a and b 11 : A user’s feature matrix in the item × tag subspace. : entry of feature matrix.

Joint Item-Tag Recommendation 1. Problem Definition 2. From Joint Recommendation to Item Recommendation 12

Problem Definition Propose to recommend a joint item-tag matrix to each user, with the tags representing the topics of the target item that might attract the user. 13 a : the recommended profile matrix for user a. : the similarity between users a and b, sim(F a,F b ) b : the profile matrix (not feature matrix) of user b. Users Profile matrix a P a b P b c P c d P d e P e Similarity S ab = 0.5 S ac = 0.4 S ad = 0.3 S ae = 0.6

The recommended profile matrix for each user consists of four blocks. 14 () : Tag t in the Hidden Item Row of the recommended (a user’s potential interest in tags.) () : Initial profile matrices. (a user’s current interest in tags.) Problem Definition

From Joint Recommendation to Item Recommendation Joint real item-tag recommendations explicitly consider a user’s possible interest in each item with respect to each tag (topic). An intuitive solution would be generating a denser type of joint recommendations based on the two denser pure recommendations and then fuse it with the joint real item-tag recommendation. 15 (B-Associated information )

The final joint item-tag recommendation result can then be computed as a weighted average of the two joint recommendations. A refined item recommendation can be obtained by marginalizing the final joint recommendation result. 16 From Joint Recommendation to Item Recommendation

Experiment Datasets 1. Delicious 2. CiteULike 3. Bibsonomy 17

Evaluation Protocols and Metrics 18 N rec : Total number of recommendations N hit : Number of correct recommendations N test : Number of items in the test set : j-th item is saved by user → 1 otherwise → 0 j : index of an item in the predicted ranked list h : the viewing half-life of users

Results 19

20 = 0.5 = 0.7 = 0.6 = 0.4 = 0.8 = 1.0 Sensitivity Analysis Gain a truthful representation of a user’s interest in tags It’s very stable when approach 1

Conclusion Joint item-tag recommendation framework is able to utilize complete information in the tagging data, to produce high-quality item recommendations. Find some theoretical foundations for the unified user profiling scheme and to develop more systematic weighting methods for it. Explore effective alternative approaches to measuring the association between items and tags. 21

Thanks for your listening 22

Latent Semantic Analysis LSA( 隱含語意分析 ) 是以統計的方式去解析某個字詞在文件間的接近程度, 使用 LSA 來分 析而成的索引就是 LSI(Latent Semantic Indexing) SVD(Singular Value Decomposition) LSI 對 SVD 做了一點改變,就是對Ʃ的 r 個對角線元素進行了排序, 並只保留前 k 個值 ( k < r ) , 後面 r - k 個設 0 。 23

Singular Value Decomposition 24

25 q k is then compared with every document vector in V k using the cosine similarity., k=2 query q = “user interface”