MusicSense: Contextual Music Recommendation using Emotional Allocation Modeling Rui Cai, Chao Zhang, Chong Wang, Lei Zhang, and Wei-Ying Ma Proceedings.

Slides:



Advertisements
Similar presentations
Recommender System A Brief Survey.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
A Music Search Engine Built upon Audio-based and Web-based Similarity Measures P. Knees, T., Pohle, M. Schedl, G. Widmer SIGIR 2007.
1 / 22 Issues in Text Similarity and Categorization Jordan Smith – MUMT 611 – 27 March 2008.
LYRIC-BASED ARTIST NETWORK METHODOLOGY Derek Gossi CS 765 Fall 2014.
Bag-of-Words Methods for Text Mining CSCI-GA.2590 – Lecture 2A
Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
Context-Aware Query Classification Huanhuan Cao 1, Derek Hao Hu 2, Dou Shen 3, Daxin Jiang 4, Jian-Tao Sun 4, Enhong Chen 1 and Qiang Yang 2 1 University.
Relevance Feedback based on Parameter Estimation of Target Distribution K. C. Sia and Irwin King Department of Computer Science & Engineering The Chinese.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Online Learning for Web Query Generation: Finding Documents Matching a Minority Concept on the Web Rayid Ghani Accenture Technology Labs, USA Rosie Jones.
Scalable Text Mining with Sparse Generative Models
Chapter 5: Information Retrieval and Web Search
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 30, (2014) BERLIN CHEN, YI-WEN CHEN, KUAN-YU CHEN, HSIN-MIN WANG2 AND KUEN-TYNG YU Department of Computer.
Finding Advertising Keywords on Web Pages Scott Wen-tau YihJoshua Goodman Microsoft Research Vitor R. Carvalho Carnegie Mellon University.
Query session guided multi- document summarization THESIS PRESENTATION BY TAL BAUMEL ADVISOR: PROF. MICHAEL ELHADAD.
CS344: Introduction to Artificial Intelligence Vishal Vachhani M.Tech, CSE Lecture 34-35: CLIR and Ranking in IR.
Modeling (Chap. 2) Modern Information Retrieval Spring 2000.
«Tag-based Social Interest Discovery» Proceedings of the 17th International World Wide Web Conference (WWW2008) Xin Li, Lei Guo, Yihong Zhao Yahoo! Inc.,
MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.
Citation Recommendation 1 Web Technology Laboratory Ferdowsi University of Mashhad.
1 A study on automatically extracted keywords in text categorization Authors:Anette Hulth and Be´ata B. Megyesi From:ACL 2006 Reporter: 陳永祥 Date:2007/10/16.
Topic Models in Text Processing IR Group Meeting Presented by Qiaozhu Mei.
Aardvark Anatomy of a Large-Scale Social Search Engine.
1 Formal Models for Expert Finding on DBLP Bibliography Data Presented by: Hongbo Deng Co-worked with: Irwin King and Michael R. Lyu Department of Computer.
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
Improving Web Spam Classification using Rank-time Features September 25, 2008 TaeSeob,Yun KAIST DATABASE & MULTIMEDIA LAB.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Modern Information Retrieval: A Brief Overview By Amit Singhal Ranjan Dash.
Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.
Mining the Web to Create Minority Language Corpora Rayid Ghani Accenture Technology Labs - Research Rosie Jones Carnegie Mellon University Dunja Mladenic.
11 A Hybrid Phish Detection Approach by Identity Discovery and Keywords Retrieval Reporter: 林佳宜 /10/17.
TEMPLATE DESIGN © Zhiyao Duan 1,2, Lie Lu 1, and Changshui Zhang 2 1. Microsoft Research Asia (MSRA), Beijing, China.2.
Chapter 6: Information Retrieval and Web Search
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
Detecting Dominant Locations from Search Queries Lee Wang, Chuang Wang, Xing Xie, Josh Forman, Yansheng Lu, Wei-Ying Ma, Ying Li SIGIR 2005.
Web Image Retrieval Re-Ranking with Relevance Model Wei-Hao Lin, Rong Jin, Alexander Hauptmann Language Technologies Institute School of Computer Science.
Music Information Retrieval Information Universe Seongmin Lim Dept. of Industrial Engineering Seoul National University.
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
Combining Audio Content and Social Context for Semantic Music Discovery José Carlos Delgado Ramos Universidad Católica San Pablo.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
A Statistical Comparison of Tag and Query Logs Mark J. Carman, Robert Gwadera, Fabio Crestani, and Mark Baillie SIGIR 2009 June 4, 2010 Hyunwoo Kim.
Bag-of-Words Methods for Text Mining CSCI-GA.2590 – Lecture 2A Ralph Grishman NYU.
Introduction to LDA Jinyang Gao. Outline Bayesian Analysis Dirichlet Distribution Evolution of Topic Model Gibbs Sampling Intuition Analysis of Parameter.
Flickr Tag Recommendation based on Collective Knowledge BÖrkur SigurbjÖnsson, Roelof van Zwol Yahoo! Research WWW Summarized and presented.
Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.
AN EFFECTIVE STATISTICAL APPROACH TO BLOG POST OPINION RETRIEVAL Ben He Craig Macdonald Iadh Ounis University of Glasgow Jiyin He University of Amsterdam.
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
Iterative similarity based adaptation technique for Cross Domain text classification Under: Prof. Amitabha Mukherjee By: Narendra Roy Roll no: Group:
Post-Ranking query suggestion by diversifying search Chao Wang.
Using Social Annotations to Improve Language Model for Information Retrieval Shengliang Xu, Shenghua Bao, Yong Yu Shanghai Jiao Tong University Yunbo Cao.
Advisor: Koh Jia-Ling Nonhlanhla Shongwe EFFICIENT QUERY EXPANSION FOR ADVERTISEMENT SEARCH WANG.H, LIANG.Y, FU.L, XUE.G, YU.Y SIGIR’09.
Concept-based Short Text Classification and Ranking
Virtual Examples for Text Classification with Support Vector Machines Manabu Sassano Proceedings of the 2003 Conference on Emprical Methods in Natural.
Personalization Services in CADAL Zhang yin Zhuang Yuting Wu Jiangqin College of Computer Science, Zhejiang University November 19,2006.
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.
Learning to Rank: From Pairwise Approach to Listwise Approach Authors: Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li Presenter: Davidson Date:
Meta-Path-Based Ranking with Pseudo Relevance Feedback on Heterogeneous Graph for Citation Recommendation By: Xiaozhong Liu, Yingying Yu, Chun Guo, Yizhou.
CS791 - Technologies of Google Spring A Web­based Kernel Function for Measuring the Similarity of Short Text Snippets By Mehran Sahami, Timothy.
1 Text Categorization  Assigning documents to a fixed set of categories  Applications:  Web pages  Recommending pages  Yahoo-like classification hierarchies.
A content-based System for Music Recommendation and Visualization of User Preference Working on Semantic Notions Dmitry Bogdanov, Martin Haro, Ferdinand.
University Of Seoul Ubiquitous Sensor Network Lab Query Dependent Pseudo-Relevance Feedback based on Wikipedia 전자전기컴퓨터공학 부 USN 연구실 G
Topic Modeling for Short Texts with Auxiliary Word Embeddings
Large-Scale Content-Based Audio Retrieval from Text Queries
Text Categorization Assigning documents to a fixed set of categories
Unsupervised learning of visual sense models for Polysemous words
Presentation transcript:

MusicSense: Contextual Music Recommendation using Emotional Allocation Modeling Rui Cai, Chao Zhang, Chong Wang, Lei Zhang, and Wei-Ying Ma Proceedings of the 15th international conference on Multimedia, MULTIMEDIA '07 Keywords: MusicSense, contextual music recommendation, emotional allocation modeling, moods Short papers poster session 2 - arts, content, applications

2 劇情 Introduction (1/2) automatically recommendation now has becoming an important role in music sales –Last.fm - collaborative filtering –Pandora - content similarity passive mode – queries -> recommendation active way –Google AdSense deliver contextual advertises matched to websites’ content

3 劇情 Introduction (2/2) MusicSense –contextual music recommendation –automatically deliver music pieces (or their thumbnails) which are relevant to the context of a Web page when users read it –measure the context relevance between music songs and Web pages –emotion as the bridge for such a relevance matching –music is all about conveying composers’ emotions –lots of Web pages such as Weblogs also express sentiments of writers

4 小小的 paper review –Lu et al. utilized Gaussian mixture models to classify music songs into four emotion categories, using acoustic content features like intensity, timbre, and rhythm –Cui et al. have comparatively studied various supervised classifiers to classify online product reviews into positive and negative opinions –Leshed et al. tried to categorize Weblogs into ten most frequently used moods with SVM most works can handle only a few mood categories difficult to collect enough high quality training data for all possible emotions cross-modal emotion mapping is still an open problem

5 MusicSense 怎麼辦 A probabilistic modeling called Emotional Allocation to characterize songs and Web pages as distributions in a common emotion space –leverages both the statistics from a large-scale Web corpus and guidance from psychological studies –keeps the inference ability of generative models relevance matching –songs and Web documents are respectively represented by a collection of word terms, based on which their emotion distribution parameters are optimized in an iterative way –Kullback-Liebler divergence

6 Framework

7 Three steps –emotional allocation modeling –Web-based music description generation and Web document analysis –probability inference and relevance matching refer to knowledge of psychology, to get a relatively complete and well-structured emotion vocabulary (40 basic emotion vocabulary) –R. Cowie, E. Douglas-Cowie, and et al. Emotion recognition in human computer interaction. IEEE Signal Processing Magazine, 18(1):33–80, 2001

8 content-based music analysis lyrics and reviews to describe the semantic of a song use search engines to retrieve more information from the Web to characterize songs –we just prepare two queries in the form of “title + lyrics” and “title + reviews”, respectively –first page returned by the first query, and the top 20 pages returned by the second query –retrieved pages are merged as a virtual document after removing HTML tags and stop words –tf × idf is computed as its weight –top N terms with the highest weights are selected out as the description (N = 100 the most informative terms) Web documents – Blog – 100 highest tf × idf are kept as salient words

9 Emotional Allocation Modeling Assume that –given a language and its vocabulary –different emotions should have different distributions over the terms in this vocabulary –the frequencies of a term under different emotions are also different Given a collection of terms (e.g. a document), –we can suppose it is generated by sampling a mixture of various emotions, –as terms in this collection can be considered as controlled by different emotions. The parameters of such a sampling can be computed in a maximum likelihood manner In such a way, a term collection would have a certain allocation of emotions, in form of a probability distribution.

10 Model LDA 模型 k 種 emotion, m 種字 latent Dirichlet allocation

11 β

12 Relevance Matching Distance 越短越好

13 實驗 Evaluation collected 100 songs and 50 Weblogs Five college students were then invited to label the ground truth –Each labeler was asked to listen each song and then tag it with one or more words from the forty emotions in the Basic English Emotion Vocabulary [8]. The Weblog posts were also tagged in the same way. –For each Weblog, each labeler was asked to find out 3 ∼ 5 songs, which are the most ideal candidates in his (her) mind for listening when reading that blog post, from all the 100 songs.

14 關於 Emotion Allocation 的結果 0.71 on this song. The average correlation coefficient over the whole 100 songs is about 0.48

15 關於 Music Recommendation 的結果 Merge first - on average, there are around 5.75 such suggestions for each post top N ranked songs are selected as recommendations

16 Conclusion music recommendation (MusicSense) –new probabilistic model called Emotional Allocation Modeling –each song (or a Weblog) is generated with a distribution over the mixture of emotions –Emotion acts as a bridge for the relevance matching between blogs and songs –Preliminary experiments Future –deeply investigate some current implementation details to improve the performance –also try to utilize more information besides emotion to measure the relevance between music and documents –carry out more user study to design an idea UI to deliver the contextual music recommendation