Opportunities of Scale, Part 2 Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.

Slides:



Advertisements
Similar presentations
James Hays and Alexei A. Efros Carnegie Mellon University CVPR IM2GPS: estimating geographic information from a single image Wen-Tsai Huang.
Advertisements

1 Recognition by Association: ask not “What is it?” ask “What is it like?” Tomasz Malisiewicz and Alyosha Efros CMU CVPR’08.
Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 4 – Digital Image Representation Klara Nahrstedt Spring 2009.
Shape Sharing for Object Segmentation
Lecture 31: Modern object recognition
Hidden Variables, the EM Algorithm, and Mixtures of Gaussians Computer Vision CS 143, Brown James Hays 02/22/11 Many slides from Derek Hoiem.
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
IMAGE RESTORATION AND REALISM MILLIONS OF IMAGES SEMINAR YUVAL RADO.
Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA
Large dataset for object and scene recognition A. Torralba, R. Fergus, W. T. Freeman 80 million tiny images Ron Yanovich Guy Peled.
Discriminative and generative methods for bags of features
Beyond bags of features: Part-based models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Fitting: The Hough transform
Recognition: A machine learning approach
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
Event prediction CS 590v. Applications Video search Surveillance – Detecting suspicious activities – Illegally parked cars – Abandoned bags Intelligent.
Small Codes and Large Image Databases for Recognition CVPR 2008 Antonio Torralba, MIT Rob Fergus, NYU Yair Weiss, Hebrew University.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Fast and Compact Retrieval Methods in Computer Vision Part II A. Torralba, R. Fergus and Y. Weiss. Small Codes and Large Image Databases for Recognition.
Statistical Recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Kristen Grauman.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Let Computer Draw Qingyuan Kong. Goal Give me a picture “Obama stands in front of a pyramid”
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
1 Segmentation with Scene and Sub-Scene Categories Joseph Djugash Input Image Scene/Sub-Scene Classification Segmentation.
Highlights Lecture on the image part (10) Automatic Perception 16
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Visual Object Recognition Rob Fergus Courant Institute, New York University
Extreme, Non-parametric Object Recognition 80 million tiny images (Torralba et al)
Opportunities of Scale Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.
Presented By : Murad Tukan
Understanding Faces Computational Photography
Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.
Unsupervised Learning of Categories from Sets of Partially Matching Image Features Kristen Grauman and Trevor Darrel CVPR 2006 Presented By Sovan Biswas.
Computer Vision James Hays, Brown
Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.
Human abilities Presented By Mahmoud Awadallah 1.
Computer Vision CS 776 Spring 2014 Recognition Machine Learning Prof. Alex Berg.
Nonparametric Part Transfer for Fine-grained Recognition Presenter Byungju Kim.
Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,
CSE 185 Introduction to Computer Vision Pattern Recognition 2.
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao.
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
Scene Completion Using Millions of Photographs James Hays, Alexei A. Efros Carnegie Mellon University ACM SIGGRAPH 2007.
Epitomic Location Recognition A generative approach for location recognition K. Ni, A. Kannan, A. Criminisi and J. Winn In proc. CVPR Anchorage,
Visual Data on the Internet With slides from Alexei Efros, James Hays, Antonio Torralba, and Frederic Heger : Computational Photography Jean-Francois.
CS 1699: Intro to Computer Vision Support Vector Machines Prof. Adriana Kovashka University of Pittsburgh October 29, 2015.
Category Independent Region Proposals Ian Endres and Derek Hoiem University of Illinois at Urbana-Champaign.
Context Neelima Chavali ECE /21/2013. Roadmap Introduction Paper1 – Motivation – Problem statement – Approach – Experiments & Results Paper 2 Experiments.
Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.
Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.
SUN Database: Large-scale Scene Recognition from Abbey to Zoo Jianxiong Xiao *James Haysy Krista A. Ehinger Aude Oliva Antonio Torralba Massachusetts Institute.
Tentative Future Courses Fall `11 : Computer Vision – emphasis on recognition Spring `11 : Graduate seminar Fall `12 : Computational Photography.
Data Driven Attributes for Action Detection
Lecture 25: Introduction to Recognition
Nonparametric Semantic Segmentation
Saliency detection Donghun Yeo CV Lab..
Opportunities of Scale, Part 2
Lecture 14: Introduction to Recognition
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT
An opposition to: Context-Based Vision System for Place and Object Recognition Contextual Models for Object Detection Using BRFs.
Object detection as supervised classification
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Lecture 25: Introduction to Recognition
Rob Fergus Computer Vision
RGB-D Image for Scene Recognition by Jiaqi Guo
Brief Review of Recognition + Context
Presentation transcript:

Opportunities of Scale, Part 2 Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba

Recap Opportunities of Scale: Data-driven methods Monday – Scene completion – Im2gps Today – Recognition via Tiny Images – More recognition by association

General Principal Input Image Images Associated Info Huge Dataset Info from Most Similar Images image matching Hopefully, If you have enough images, the dataset will contain very similar images that you can find with simple matching methods.

… 200 total

Graph cut + Poisson blending

im2gpsim2gps (Hays & Efros, CVPR 2008) 6 million geo-tagged Flickr images

Tiny Images 80 million tiny images: a large dataset for non- parametric object and scene recognition Antonio Torralba, Rob Fergus and William T. Freeman. PAMI

Human Scene Recognition

Humans vs. Computers: Car-Image Classification Humans for 32 pixel tall images Various computer vision algorithms for full resolution images

Powers of 10 Number of images on my hard drive: 10 4 Number of images seen during my first 10 years:10 8 (3 images/second * 60 * 60 * 16 * 365 * 10 = ) Number of images seen by all humanity: ,456,367,669 humans 1 * 60 years * 3 images/second * 60 * 60 * 16 * 365 = 1 from Number of photons in the universe: Number of all 32x32 images: *32*3 ~

Scenes are unique

But not all scenes are so original

Lots Of Images A. Torralba, R. Fergus, W.T.Freeman. PAMI 2008

Lots Of Images A. Torralba, R. Fergus, W.T.Freeman. PAMI 2008

Lots Of Images

Application: Automatic Colorization Input Matches (gray)Matches (w/ color)Avg Color of Match Color Transfer

Application: Automatic Colorization Input Matches (gray)Matches (w/ color)Avg Color of Match Color Transfer

Application: Person Detection 80 million “tiny images” downloaded by keyword search. 80 nearest neighbors vote for image category.

Re-ranking Altavista search for “person”

Automatic Orientation Examples A. Torralba, R. Fergus, W.T.Freeman. 2008

Recognition by Association Malisiewicz and Efros (NIPS 2009Malisiewicz and Efros (NIPS 2009) Rather than categorizing objects, associate them to stored examples of objects and transfer the associated labels.

Recognition by Association Malisiewicz and Efros (CVPR 200Malisiewicz and Efros (CVPR 2008) Rather than categorizing objects, associate them with stored examples of objects and transfer the associated labels.

Training procedure Learn a region similarity measure from hand-segmented objects in LabelMe Similarity features – Shape: region mask, pixel area, bounding box size – Texture: normalized texton histogram – Color: mean RGB, std RGB, color histogram – Position: coarse 8x8 image mask, coords of top/bottom pixels

Training procedure Learn a distance/similarity measure for each region – Minimize distance to K most similar examples from same category – Maximize distance to examples from other categories distance weights distance measures Set to 1 for K nearest examples Hinge Loss

Learned Similarity Measure Learned Distance Texton Distance

Learned Similarity Measure

Testing procedure Create multiple segmentations (MeanShift + Ncuts) Find similar object regions in training set; each votes for the object label What about bad segments? – Most of the time, they don’t match any objects in the training set – Consider only associations with distance < 1

Automatic Parses

Bush’s Memex (1945) Store recorded knowledge on microfilm

Summary With billions of images on the web, it’s often possible to find a close nearest neighbor In such cases, we can shortcut hard problems by “looking up” the answer, stealing the labels from our nearest neighbor For example, simple (or learned) associations can be used to synthesize background regions, colorize, or recognize objects