Large-Scale Image Retrieval From Your Sketches Daniel Brooks 1,Loren Lin 2,Yijuan Lu 1 1 Department of Computer Science, Texas State University, TX, USA.

Slides:



Advertisements
Similar presentations
Chapter 2: Marr’s theory of vision. Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Overview Introduce Marr’s distinction between.
Advertisements

Content-Based Image Retrieval
Part 1: Bag-of-words models by Li Fei-Fei (Princeton)
FATIH CAKIR MELIHCAN TURK F. SUKRU TORUN AHMET CAGRI SIMSEK Content-Based Image Retrieval using the Bag-of-Words Concept.
Outline SIFT Background SIFT Extraction Application in Content Based Image Search Conclusion.
Marco Cristani Teorie e Tecniche del Riconoscimento1 Teoria e Tecniche del Riconoscimento Estrazione delle feature: Bag of words Facoltà di Scienze MM.
TP14 - Indexing local features
Image Information Retrieval Shaw-Ming Yang IST 497E 12/05/02.
CS4670 / 5670: Computer Vision Bag-of-words models Noah Snavely Object
PHP-based Image Recognition and Retrieval of Late 18th Century Artwork Ben Goodwin Handouts are available for students writing summaries for class assignments.
1 Image Retrieval Hao Jiang Computer Science Department 2009.
1 P. Arbelaez, M. Maire, C. Fowlkes, J. Malik. Contour Detection and Hierarchical image Segmentation. IEEE Trans. on PAMI, Student: Hsin-Min Cheng.
Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA
On Constrained Optimization Approach To Object Segmentation Chia Han, Xun Wang, Feng Gao, Zhigang Peng, Xiaokun Li, Lei He, William Wee Artificial Intelligence.
Sketch Tokens: A Learned Mid-level Representation for Contour and Object Detection CVPR2013 POSTER.
Object Recognition. So what does object recognition involve?
Effective Image Database Search via Dimensionality Reduction Anders Bjorholm Dahl and Henrik Aanæs IEEE Computer Society Conference on Computer Vision.
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
Tour the World: building a web-scale landmark recognition engine ICCV 2009 Yan-Tao Zheng1, Ming Zhao2, Yang Song2, Hartwig Adam2 Ulrich Buddemeier2, Alessandro.
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
1 Content Based Image Retrieval Using MPEG-7 Dominant Color Descriptor Student: Mr. Ka-Man Wong Supervisor: Dr. Lai-Man Po MPhil Examination Department.
What is Texture? Texture depicts spatially repeating patterns Many natural phenomena are textures radishesrocksyogurt.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
ADVISE: Advanced Digital Video Information Segmentation Engine
Agenda Introduction Bag-of-words model Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
“Bag of Words”: when is object recognition, just texture recognition? : Advanced Machine Perception A. Efros, CMU, Spring 2009 Adopted from Fei-Fei.
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
Content-Based Image Retrieval (CBIR) Student: Mihaela David Professor: Michael Eckmann Most of the database images in this presentation are from the Annotated.
1 Learning to Detect Natural Image Boundaries David Martin, Charless Fowlkes, Jitendra Malik Computer Science Division University of California at Berkeley.
Object Recognition. So what does object recognition involve?
ART: Augmented Reality Table for Interactive Trading Card Game Albert H.T. Lam, Kevin C. H. Chow, Edward H. H. Yau and Michael R. Lyu Department of Computer.
“Bag of Words”: recognition using texture : Advanced Machine Perception A. Efros, CMU, Spring 2006 Adopted from Fei-Fei Li, with some slides from.
SIEVE—Search Images Effectively through Visual Elimination Ying Liu, Dengsheng Zhang and Guojun Lu Gippsland School of Info Tech,
Presenting by, Prashanth B R 1AR08CS035 Dept.Of CSE. AIeMS-Bidadi. Sketch4Match – Content-based Image Retrieval System Using Sketches Under the Guidance.
Wang, Z., et al. Presented by: Kayla Henneman October 27, 2014 WHO IS HERE: LOCATION AWARE FACE RECOGNITION.
Mobile Mapping Systems (MMS) for infrastructural monitoring and mapping are becoming more prevalent as the availability and affordability of solutions.
Multiclass object recognition
Project 4 Image Search based on BoW model with Inverted File System
Linked Edges as Stable Region Boundaries* Michael Donoser, Hayko Riemenschneider and Horst Bischof This work introduces an unsupervised method to detect.
MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.
CSE 473/573 Computer Vision and Image Processing (CVIP)
Special Topic on Image Retrieval
JASS 2005 Next-Generation User-Centered Information Management Information visualization Alexander S. Babaev Faculty of Applied Mathematics.
3D Sketch-based 3D Model Retrieval With Kinect Natacha Feola 1, Azeem Ghumman 2, Scott Forster 3, Dr. Yijuan Lu 4 1 Department of Computer Science, University.
Neural Information in the Visual System By Paul Ruvolo Bryn Mawr College Fall 2012.
Proposal for Term Project J. H. Wang Mar. 2, 2015.
Real-Time Cyber Physical Systems Application on MobilityFirst Winlab Summer Internship 2015 Karthikeyan Ganesan, Wuyang Zhang, Zihong Zheng.
Design of a Search Engine for Metadata Search Based on Metalogy Ing-Xiang Chen, Che-Min Chen,and Cheng-Zen Yang Dept. of Computer Engineering and Science.
Computer Science Department Pacific University Artificial Intelligence -- Computer Vision.
Stylization and Abstraction of Photographs Doug Decarlo and Anthony Santella.
IEEE Int'l Symposium on Signal Processing and its Applications 1 An Unsupervised Learning Approach to Content-Based Image Retrieval Yixin Chen & James.
IBM QBIC: Query by Image and Video Content Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC 28223
GUIDED BY DR. A. J. AGRAWAL Search Engine By Chetan R. Rathod.
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.
March 31, 1998NSF IDM 98, Group F1 Group F Multi-modal Issues, Systems and Applications.
Chittampally Vasanth Raja 10IT05F vasanthexperiments.wordpress.com.
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Yixin Chen and James Z. Wang The Pennsylvania State University
Scalable Person Re-identification on Supervised Smoothed Manifold
Deeply learned face representations are sparse, selective, and robust
CS 2770: Computer Vision Feature Matching and Indexing
Multimedia Content-Based Retrieval
Color-Texture Analysis for Content-Based Image Retrieval
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Text Detection in Images and Video
Multimedia Information Retrieval
Part 1: Bag-of-words models
Presentation transcript:

Large-Scale Image Retrieval From Your Sketches Daniel Brooks 1,Loren Lin 2,Yijuan Lu 1 1 Department of Computer Science, Texas State University, TX, USA 2 Department of Computer Science, University of Texas at Austin, TX, USA Large-Scale Image Retrieval From Your Sketches Daniel Brooks 1,Loren Lin 2,Yijuan Lu 1 1 Department of Computer Science, Texas State University, TX, USA 2 Department of Computer Science, University of Texas at Austin, TX, USA Introduction Framework Demo System Conclusions and Future Work References  Hit Map Generation ■The input sketch is translated in to a hit map with added tolerance.  Image Indexing ■All pixels on the detected contours of the database are indexed in to an inverted index file.  Image Search ■Compare the expanded query sketch to the inverted index file and return a short list of matched images.  Hit Map Generation ■The input sketch is translated in to a hit map with added tolerance.  Image Indexing ■All pixels on the detected contours of the database are indexed in to an inverted index file.  Image Search ■Compare the expanded query sketch to the inverted index file and return a short list of matched images. Experiments & Results In this project, we investigate the problems of large- scale sketch-based image search, and try to overcome two major challenges, indexing and matching. An effective algorithm [1] is explored to describe images as a “bag of shape words” and an efficient indexing framework is utilized to make sketch-based image retrieval scalable to millions of images. Our real-time sketch-based image search demo system is built on two popular datasets: Caltech 256 and ImageNet. Experimental results on various retrieval tasks (basic sketch search & shape based similar image search) show great promise for both accuracy and speed. In this project, we investigate the problems of large- scale sketch-based image search, and try to overcome two major challenges, indexing and matching. An effective algorithm [1] is explored to describe images as a “bag of shape words” and an efficient indexing framework is utilized to make sketch-based image retrieval scalable to millions of images. Our real-time sketch-based image search demo system is built on two popular datasets: Caltech 256 and ImageNet. Experimental results on various retrieval tasks (basic sketch search & shape based similar image search) show great promise for both accuracy and speed. Problem ■Pair-wise shape based matching is limited to small scale databases. ■ How to represent, index, and match shape to make shape based search scalable to a large dataset. ■ Hint: Text Words in Information Retrieval (IR)  Compactness  Descriptiveness  Document search is scalable ■Pair-wise shape based matching is limited to small scale databases. ■ How to represent, index, and match shape to make shape based search scalable to a large dataset. ■ Hint: Text Words in Information Retrieval (IR)  Compactness  Descriptiveness  Document search is scalable Retrieve Bag-of-Word model Of all the sensory impressions proceeding to the brain, the visual experiences are the dominant ones. Our perception of the world around us is based essentially on the messages that reach the brain from our eyes. For a long time it was thought that the retinal image was transmitted point by point to visual centers in the brain; the cerebral cortex was a movie screen, so to speak, upon which the image in the eye was projected. sensory, brain, visual, perception, retinal, cerebral cortex, eye, cell, optical nerve, image Hubel, Wiesel China is forecasting a trade surplus of $90bn (£51bn) to $100bn this year, a threefold increase on 2004's $32bn. The Commerce Ministry said the surplus would be created by a predicted 30% jump in exports to $750bn, compared with a 18% rise in imports to $660bn. The figures are likely to further annoy the US, which has long argued that China's exports are unfairly helped by a deliberately undervalued yuan. China, trade, surplus, commerce, exports, imports, US, yuan, bank, domestic, foreign, increase, trade, value Image Bag of ‘shape words’  Can an image’s shape be transformed into Bag-of-Shape- Words?  Image Collection  Caltech256 [2] ■256 Categories: American Flag, Backpack, Bat, Base Ball, etc. ■ 29,780 images.  ImageNet [3] ■100 Categories Selected: Acoustic Guitar, Cab, Mountain, Mobile Home, etc. ■ 100,843 images  Feature Extraction ■Down-sample images to a maximal side length of 200 (a good balance between structure information preservation and storage cost). ■Detect object contours for each image (Berkeley Detector [4]) ■User sketch input window design (200 × 200) ■Extract the location (x, y) and orientation (θ) of each pixel on the input sketch and contours of each image in the database.  Shape Dictionary Construction ■Shape Word Representation: (x, y, θ) ■Location x, y: 1~200 ■Orientation θ: 1 ~ 6 (-15˚~15˚: 1; 15˚~45˚: 2; 45˚~75˚: 3; 75˚~105˚: 4; 105˚ ~135˚: 5; 135˚~165˚: 6) ■Dictionary Size: 200*200*6=240,000 shape words  Image Collection  Caltech256 [2] ■256 Categories: American Flag, Backpack, Bat, Base Ball, etc. ■ 29,780 images.  ImageNet [3] ■100 Categories Selected: Acoustic Guitar, Cab, Mountain, Mobile Home, etc. ■ 100,843 images  Feature Extraction ■Down-sample images to a maximal side length of 200 (a good balance between structure information preservation and storage cost). ■Detect object contours for each image (Berkeley Detector [4]) ■User sketch input window design (200 × 200) ■Extract the location (x, y) and orientation (θ) of each pixel on the input sketch and contours of each image in the database.  Shape Dictionary Construction ■Shape Word Representation: (x, y, θ) ■Location x, y: 1~200 ■Orientation θ: 1 ~ 6 (-15˚~15˚: 1; 15˚~45˚: 2; 45˚~75˚: 3; 75˚~105˚: 4; 105˚ ~135˚: 5; 135˚~165˚: 6) ■Dictionary Size: 200*200*6=240,000 shape words Successful implementation of an edge pixel based inverted index algorithm for sketch based and similar image searching. Plans to improve performance by enhancing our implementation to a two-way search. Use this implementation of a state of the art solution as a benchmark when developing and testing our own original algorithm. Successful implementation of an edge pixel based inverted index algorithm for sketch based and similar image searching. Plans to improve performance by enhancing our implementation to a two-way search. Use this implementation of a state of the art solution as a benchmark when developing and testing our own original algorithm. [1] Yang Cao, Changhu Wang, Liqing Zhang, Lei Zhang, “Edgel Index for Large-Scale Sketch-based Image Search,” International Conference of Computer Vision and Pattern Recognition, [2] Caltech 256, [3] ImageNet, [4] D. R. Martin, C. C. Fowlkes, and J. Malik. “Learning to detect natural image boundaries using local brightness, color, and texture cues”. PAMI, [1] Yang Cao, Changhu Wang, Liqing Zhang, Lei Zhang, “Edgel Index for Large-Scale Sketch-based Image Search,” International Conference of Computer Vision and Pattern Recognition, [2] Caltech 256, [3] ImageNet, [4] D. R. Martin, C. C. Fowlkes, and J. Malik. “Learning to detect natural image boundaries using local brightness, color, and texture cues”. PAMI,  A simple interface with a window to sketch or upload, with results displayed to the right.  Good results with simple shapes & sketches.  Excellent speed scalable in to the millions. Acknowledgements This research is supported by the NSF REU Program and Texas State University.