Using Large-Scale Web Data to Facilitate Textual Query Based Retrieval of Consumer Photos.

Slides:

Advertisements

Similar presentations

Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London.

Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki

Improvements and extras Paul Thomas CSIRO. Overview of the lectures 1.Introduction to information retrieval (IR) 2.Ranked retrieval 3.Probabilistic retrieval.

Introduction to Information Retrieval

Multimedia Database Systems

Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?

Content-Based Image Retrieval

TI: An Efficient Indexing Mechanism for Real-Time Search on Tweets Chun Chen 1, Feng Li 2, Beng Chin Ooi 2, and Sai Wu 2 1 Zhejiang University, 2 National.

Bring Order to Your Photos: Event-Driven Classification of Flickr Images Based on Social Knowledge Date: 2011/11/21 Source: Claudiu S. Firan (CIKM’10)

GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.

Stephan Gammeter, Lukas Bossard, Till Quack, Luc Van Gool.

Metric Inverted - An efficient inverted indexing method for metric spaces Benjamin Sznajder Jonathan Mamou Yosi Mass Michal Shmueli-Scheuer IBM Research.

Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.

Bag of Features Approach: recent work, using geometric information.

ACM Multimedia th Annual Conference, October , 2004

Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.

1 Integrating User Feedback Log into Relevance Feedback by Coupled SVM for Content-Based Image Retrieval 9-April, 2005 Steven C. H. Hoi *, Michael R. Lyu.

A novel log-based relevance feedback technique in content- based image retrieval Reporter: Francis 2005/6/2.

Presentation in IJCNN 2004 Biased Support Vector Machine for Relevance Feedback in Image Retrieval Hoi, Chu-Hong Steven Department of Computer Science.

Presented by Zeehasham Rasheed

1 Web Query Classification Query Classification Task: map queries to concepts Application: Paid advertisement 问题：百度 /Google 怎么赚钱？

A structured learning framework for content- based image indexing and visual Query (Joo-Hwee, Jesse S. Jin) Presentation By: Salman Ahmad (270279)

Finding Advertising Keywords on Web Pages Scott Wen-tau YihJoshua Goodman Microsoft Research Vitor R. Carvalho Carnegie Mellon University.

SIEVE—Search Images Effectively through Visual Elimination Ying Liu, Dengsheng Zhang and Guojun Lu Gippsland School of Info Tech,

Improving web image search results using query-relative classifiers Josip Krapacy Moray Allanyy Jakob Verbeeky Fr´ed´eric Jurieyy.

Information Retrieval in Practice

DOG I : an Annotation System for Images of Dog Breeds Antonis Dimas Pyrros Koletsis Euripides Petrakis Intelligent Systems Laboratory Technical University.

Jinhui Tang †, Shuicheng Yan †, Richang Hong †, Guo-Jun Qi ‡, Tat-Seng Chua † † National University of Singapore ‡ University of Illinois at Urbana-Champaign.

Slide Image Retrieval: A Preliminary Study Guo Min Liew and Min-Yen Kan National University of Singapore Web IR / NLP Group (WING)

MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.

Bridge Semantic Gap: A Large Scale Concept Ontology for Multimedia (LSCOM) Guo-Jun Qi Beckman Institute University of Illinois at Urbana-Champaign.

Multimedia Databases (MMDB)

©2008 Srikanth Kallurkar, Quantum Leap Innovations, Inc. All rights reserved. Apollo – Automated Content Management System Srikanth Kallurkar Quantum Leap.

Using Large-Scale Web Data to Facilitate Textual Query Based Retrieval of Consumer Photos Yiming Liu, Dong Xu, Ivor W. Tsang, Jiebo Luo Nanyang Technological.

Content-Based Image Retrieval

A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.

Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.

Improving Web Spam Classification using Rank-time Features September 25, 2008 TaeSeob,Yun KAIST DATABASE & MULTIMEDIA LAB.

Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.

Topical Crawlers for Building Digital Library Collections Presenter: Qiaozhu Mei.

Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.

INTELLIGENT ORACLE CEMNET, SCE, NTU Speaker: Zeng Zinan

Introduction to Digital Libraries hussein suleman uct cs honours 2003.

Web Image Retrieval Re-Ranking with Relevance Model Wei-Hao Lin, Rong Jin, Alexander Hauptmann Language Technologies Institute School of Computer Science.

Combining multiple learners Usman Roshan. Bagging Randomly sample training data Determine classifier C i on sampled data Goto step 1 and repeat m times.

The Anatomy of a Large-Scale Hyper textual Web Search Engine S. Brin, L. Page Presenter :- Abhishek Taneja.

Competence Centre on Information Extraction and Image Understanding for Earth Observation 29th March 2007 Category - based Semantic Search Engine 1 Mihai.

Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.

Flickr Tag Recommendation based on Collective Knowledge BÖrkur SigurbjÖnsson, Roelof van Zwol Yahoo! Research WWW Summarized and presented.

Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.

Semi-Automatic Image Annotation Liu Wenyin, Susan Dumais, Yanfeng Sun, HongJiang Zhang, Mary Czerwinski and Brent Field Microsoft Research.

Data Mining, ICDM '08. Eighth IEEE International Conference on Duy-Dinh Le National Institute of Informatics Hitotsubashi, Chiyoda-ku Tokyo,

Content-Based Image Retrieval (CBIR) By: Victor Makarenkov Michael Marcovich Noam Shemesh.

Image Classification over Visual Tree Jianping Fan Dept of Computer Science UNC-Charlotte, NC

Unsupervised Auxiliary Visual Words Discovery for Large-Scale Image Object Retrieval Yin-Hsi Kuo1,2, Hsuan-Tien Lin 1, Wen-Huang Cheng 2, Yi-Hsuan Yang.

The Cross Language Image Retrieval Track: ImageCLEF Breakout session discussion.

Chapter. 3: Retrieval Evaluation 1/2/2016Dr. Almetwally Mostafa 1.

Combining Text and Image Queries at ImageCLEF2005: A Corpus-Based Relevance-Feedback Approach Yih-Cheng Chang Department of Computer Science and Information.

Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:

Relevance Feedback in Image Retrieval System: A Survey Tao Huang Lin Luo Chengcui Zhang.

Semantic search-based image annotation Petra Budíková, FI MU CEMI meeting, Plzeň,

Cross-modal Hashing Through Ranking Subspace Learning

Large-Scale Content-Based Audio Retrieval from Text Queries

Text Based Information Retrieval

Multimedia Content-Based Retrieval

Multimedia Information Retrieval

Multimedia Information Retrieval

CSE 635 Multimedia Information Retrieval

Information Retrieval and Web Design

Introduction to Search Engines

Presentation transcript:

Using Large-Scale Web Data to Facilitate Textual Query Based Retrieval of Consumer Photos

Motivation Digital cameras and mobile phone cameras popularize rapidly: –More and more personal photos; –Retrieving images from enormous collections of personal photos becomes a more and more important topic. ? How to retrieve?

Prior Work: CBIR Content-Based Image Retrieval (CBIR) –Users provide images as queries to retrieve personal photos. The paramount challenge -- semantic gap: –The gap between the low-level visual features and the high-level semantic concepts. … Low-level Feature vector Image with high- level concept queryresult … … Feature vectors in DB compare Semantic Gap

Prior Work: Image Annotation It is more convenient for the user to retrieve the desirable personal photos using textual queries. Image annotation is used to classify images w.r.t. high-level semantic concepts. –Semantic concepts are analogous to the textual terms describing document contents. An intermediate stage for textual query based image retrieval. query Sunset Annotation Result: high-level concepts Annotation Result: high-level conceptsannotate compare …database …result Retrieve

Idea Web images are accompanied by tags, categories and titles. –Google and Flickr exploit them to index web images. … building people, family people, wedding sunset … WebImagesContextualInformation But raw consumer photos from digital cameras do not contain such semantic textual descriptions Web Images Consumer Photos Leverage information from web images to retrieve consumer photos in personal photo collection. information No intermediate image annotation process.

Framework

When user provides a textual query, Textual Query Classifier Automatic Web Image Retrieval Automatic Web Image Retrieval Large Collection of Web images (with descriptive words) Relevant/ Irrelevant Images WordNet Relevance Feedback Relevance Feedback Refined Top-Ranked Consumer Photos Consumer Photo Retrieval Consumer Photo Retrieval Raw Consumer Photos Top-Ranked Consumer Photos It would be used to find relevant/irrelevant images in web image collections. Then, a classifier is trained based on these web images. And then consumer photos can be ranked based on the classifiers decision value. The user can also use relevance feedback to refine the retrieval results.

Automatic Web Image Retrieval

boat Inverted File Inverted File Relevant Web Images Irrelevant Web Images boat ark barge dredgerhouseboat … Semantic Word Trees Based on WordNet For users textual query, first search it in the semantic word trees. The web images containing the query word are considered as relevant web images. The web images which do not containing the query word and its two-level descendants are considered as irrelevant web images.

Classifier Training

Relevant Web Images Irrelevant Web Images … sample1 sample2sample3sample4 ds … Construct 100 smaller training sets: –Negative Samples: Randomly sample a fixed number of irrelevant web images for 100 times; –Positive Samples: The relevant web images. Based on each training set, train decision stumps on each dimension. Classifier f s (x) Finally, linearly combine all decision stumps based on their training errors.

Relevance Feedback via Cross-Domain Regularized Regression

Other images f T (x) should be close to f s (x) Design a target linear classifier f T (x) = w T x. User-labeled images x 1,…,x l f T (x) should be close to +1 (labeled as positive) 1 (labeled as negative) A regularizer to control the complexity of the target classifier f T (x) This problem can be solved with least square solver.

Source Classifiers Decision Stump Ensemble: –Trained on each dimension for each bag; –Decision values are fused after a sigmoid mapping: f d (x) = i γ id h(s id (x d -θ id )); –Pros: Non-linear; Easy to be parallelized; –Cons: Testing is time-consuming;

Accelerating Source Classifiers One possible solution: –Remove sigmoid mapping: f d (x) = i γ id s id (x id -θ id ) = ( i γ id s id )x i -( i γ id s id θ id ); Assume there are N bags, D dims: –Testing Complexity: O(ND) --> O(D) –Cons: Become linear; –Too weak.

Accelerating Source Classifiers Another possible solution: –Use linear svm instead of decision stump ensemble. Train 1 linear svm classifier for each bag; Fuse the decision values with a sigmoid mapping; –Pros: It is hopeful to use less bags to achieve a satisfying retrieval precision; Although testing complexity is still O(ND), there are much less ``exp'' function calls (ND --> N); Individual classifiers are computed with just a vector dot product, which can be efficiently computed with SIMD instructions.

Comparison on Time Cost

Performance Comparison

Relevance Feedback +1 positive -0.1 negative

Error Rate Refinement during RF Assume that there are M training data, in which E instances are incorrectly classified. –err_rate = E / M; For f s (x), when user labels one instance x as y \in (-1, 1): –If f s (x) = y, then err_rate = E / (M + α) –If f s (x) = -y, then err_rate = (E + α) / (M + α)

The End