Nonnegative Shared Subspace Learning and Its Application to Social Media Retrieval Presenter: Andy Lim.

Slides:



Advertisements
Similar presentations
A Comparison of Implicit and Explicit Links for Web Page Classification Dou Shen 1 Jian-Tao Sun 2 Qiang Yang 1 Zheng Chen 2 1 Department of Computer Science.
Advertisements

Using Large-Scale Web Data to Facilitate Textual Query Based Retrieval of Consumer Photos.
Yansong Feng and Mirella Lapata
Location Recognition Given: A query image A database of images with known locations Two types of approaches: Direct matching: directly match image features.
Multimedia Database Systems
Diversified Retrieval as Structured Prediction Redundancy, Diversity, and Interdependent Document Relevance (IDR ’09) SIGIR 2009 Workshop Yisong Yue Cornell.
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
Precision and Recall.
Evaluating Search Engine
Hinrich Schütze and Christina Lioma
 Users annotate things (resources) with labels (tags)  These annotations are shared, creating a collaborative dataset called a folksonomy coffee java.
Automatic Image Annotation and Retrieval using Cross-Media Relevance Models J. Jeon, V. Lavrenko and R. Manmathat Computer Science Department University.
1 Statistical correlation analysis in image retrieval Reporter : Erica Li 2004/9/30.
Commentary-based Video Categorization and Concept Discovery By Janice Leung.
Reference Collections: Task Characteristics. TREC Collection Text REtrieval Conference (TREC) –sponsored by NIST and DARPA (1992-?) Comparing approaches.
MANISHA VERMA, VASUDEVA VARMA PATENT SEARCH USING IPC CLASSIFICATION VECTORS.
Probabilistic Latent Semantic Analysis
Introduction to Machine Learning Approach Lecture 5.
Jinhui Tang †, Shuicheng Yan †, Richang Hong †, Guo-Jun Qi ‡, Tat-Seng Chua † † National University of Singapore ‡ University of Illinois at Urbana-Champaign.
School of something FACULTY OF OTHER School of Computing FACULTY OF ENGINEERING Sharing of Community Practice through Semantics: A Case Study in Academic.
Cao et al. ICML 2010 Presented by Danushka Bollegala.
Title Extraction from Bodies of HTML Documents and its Application to Web Page Retrieval Microsoft Research Asia Yunhua Hu, Guomao Xin, Ruihua Song, Guoping.
MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Classifying Tags Using Open Content Resources Simon Overell, Borkur Sigurbjornsson & Roelof van Zwol WSDM ‘09.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
Mining Discriminative Components With Low-Rank and Sparsity Constraints for Face Recognition Qiang Zhang, Baoxin Li Computer Science and Engineering Arizona.
Philosophy of IR Evaluation Ellen Voorhees. NIST Evaluation: How well does system meet information need? System evaluation: how good are document rankings?
Review of the web page classification approaches and applications Luu-Ngoc Do Quang-Nhat Vo.
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
Andriy Shepitsen, Jonathan Gemmell, Bamshad Mobasher, and Robin Burke
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
User Profiling based on Folksonomy Information in Web 2.0 for Personalized Recommender Systems Huizhi (Elly) Liang Supervisors: Yue Xu, Yuefeng Li, Richi.
A Personalized Recommender System Based on Users’ Information In Folksonomies Date: 2013/12/18 Author: Mohamed Nader Jelassi, Sadok Ben Yahia, Engelbert.
ON INCENTIVE-BASED TAGGING Xuan S. Yang, Reynold Cheng, Luyi Mo, Ben Kao, David W. Cheung {xyang2, ckcheng, lymo, kao, The University.
Nonnegative Shared Subspace Learning and Its Application to Social Media Retrieval Sunil Kumar Gupta, Dinh Phung, Brett Adams, Tran The Truyen, Svetha.
Chapter 6: Information Retrieval and Web Search
Flickr the framework of Flickr. Observe them  How many photos does each user offer?  How many tags does each photo have?  The tag hot-list  How many.
Mining Topic-Specific Concepts and Definitions on the Web Bing Liu, etc KDD03 CS591CXZ CS591CXZ Web mining: Lexical relationship mining.
Web Image Retrieval Re-Ranking with Relevance Model Wei-Hao Lin, Rong Jin, Alexander Hauptmann Language Technologies Institute School of Computer Science.
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.
Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.
Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.
Effective Automatic Image Annotation Via A Coherent Language Model and Active Learning Rong Jin, Joyce Y. Chai Michigan State University Luo Si Carnegie.
Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval O. Chum, et al. Presented by Brandon Smith Computer Vision.
Linked Data Profiling Andrejs Abele National University of Ireland, Galway Supervisor: Paul Buitelaar.
Improved Video Categorization from Text Metadata and User Comments ACM SIGIR 2011:Research and development in Information Retrieval - Katja Filippova -
Final Project Mei-Chen Yeh May 15, General In-class presentation – June 12 and June 19, 2012 – 15 minutes, in English 30% of the overall grade In-class.
Topic by Topic Performance of Information Retrieval Systems Walter Liggett National Institute of Standards and Technology TREC-7 (1999)
Virtual Examples for Text Classification with Support Vector Machines Manabu Sassano Proceedings of the 2003 Conference on Emprical Methods in Natural.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Information Retrieval Lecture 3 Introduction to Information Retrieval (Manning et al. 2007) Chapter 8 For the MSc Computer Science Programme Dell Zhang.
Finding similar items by leveraging social tag clouds Speaker: Po-Hsien Shih Advisor: Jia-Ling Koh Source: SAC 2012’ Date: October 4, 2012.
Cross-modal Hashing Through Ranking Subspace Learning
Image Retrieval and Ranking using L.S.I and Cross View Learning Sumit Kumar Vivek Gupta
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Large-Scale Content-Based Audio Retrieval from Text Queries
Information Retrieval and Web Search
Multimodal Learning with Deep Boltzmann Machines
Project Implementation for ITCS4122
Ying Dai Faculty of software and information science,
Ying Dai Faculty of software and information science,
Ying Dai Faculty of software and information science,
CS246: Information Retrieval
Ying Dai Faculty of software and information science,
Word embeddings (continued)
Precision and Recall.
Privacy-Aware Tag Recommendation for Image Sharing
Presentation transcript:

Nonnegative Shared Subspace Learning and Its Application to Social Media Retrieval Presenter: Andy Lim

Paper Topic Folksonomy Social media sharing platforms

The Problem Rise in popularity of social image and video sharing platforms Precision of tag-based media retrieval Tags are Noisy Ambiguous Incomplete Subjective Lack of constraints Free-text tags (i.e. “djfja;sldfkj”) Tags: hotdog, chinese, trololol, aidjishi, sandwich, bread

Previous Research (Internal) Improving tag relevance Sigurbjornsson and Zwol Developed a method of recommending a set of relevant tags based on tag popularity Li et al. List all images for a given tag and determine tag relevance from visual similarity All are confined to noisy tags within the primary dataset

The Approach Internal vs. External Leverage external auxiliary sources of information to improve target tagging systems (presumably much noisier) Exploit disparate characteristics of target domain using auxiliary source Note: What is the optimal level of joint modeling such that the target domain still benefits from the auxiliary source?

Assumptions There is a common underlying subspace shared by the primary and secondary domains The primary domain is much nosier than the secondary domains

Nonnegative Matrix Factorization X (M x N data matrix) where N = documents in terms of M vocabulary words F (M x R nonnegative matrix) represents R basis vectors H (R x N nonnegative matrix) contains coordinates of each document

Joint Shared Nonnegative Matrix Factorization (JSNMF) Input: X (target domain), Y (auxiliary domain), R 1 and R 2 (dimensionality of underlying subspaces of X and Y), K (basis vectors) Output: W (joint shared subspace), U (remaining subspace in target domain), V (remaining subspace in auxiliary domain), H (coordinate matrix for target domain), L (coordinate matrix for auxiliary domain)

Retrieval using JSNMF Input: W, U, H, query sentence S Q, number of images (or videos) to be retrieved N and image (or video) dataset Output: Return top N retrieved images (or videos)

Experiment Use LabelMe tags (auxiliary) to improve Image retrieval in Flickr Video retrieval in Youtube Why LabelMe? Object image tagging Controlled vocabulary

Flickr Dataset Downloaded 50,000 images from Flickr Average number of distinct tags = 8 Removed Rare tags (appears less than 5 times) Images with no tags and non-English tags Obtained 20,000 labeled images 7,000 examples are kept for investigating internal auxiliary dataset

YouTube Dataset Downloaded 18,000 videos’ metadata (tags, URL, category, title, comments, etc.) Average number of distinct tags = 7 Removed Rare tags (appearing less than 2 times) Videos with no tags or non-English tags Obtained dataset corresponding to 12,000 videos Again, kept 7,000 examples to be used as an internal auxiliary dataset

LabelMe Dataset Added 7,000 images with tags from LabelMe Average number of distinct tags = 32 Removed Rare tags (appearing less than 2 times) Cleanup does not reduce dataset

Evaluation Measures Defined query set Q {cloud, man, street, water, road, leg, table, plant, girl, drawer, lamp, bed, cable, bus, pole, laptop, plate, kitchen, river, pool, flower} Manually annotated the two datasets (Flickr and YouTube) with respect to the query set (no benchmark dataset available) Query term and an image is relevant if the concept is clearly visible in the image (or video)

Results with JSNMF Precision-Scope Curve Fix recall at 0.1 Users are usually only interested in first few results 10% improvement

Results with JSNMF Under-representation Shares very few basis vectors Over-representation Forces many basis vectors to represent both datasets Appropriate level of representation

Flickr Retrieval Results Results are better with LabelMe As recall increases, precision decreases When K=0 (no sharing) or K=40 (fully sharing), precision is lower compared to K=15

YouTube Retrieval Results Similar to Flickr Results

Extra Notes & Questions? Can be extended to multiple datasets (not just 2) Can use generic model to apply to other data mining tasks