Kapitel 14 Recognition Scene understanding / visual object categorization Pose clustering Object recognition by local features Image categorization Bag-of-features.

Slides:

Advertisements

Similar presentations

Advertisements

1 Vorlesung Informatik 2 Algorithmen und Datenstrukturen (Parallel Algorithms) Robin Pomplun.

Kapitel 14 Recognition – p. 1 Recognition Scene understanding / visual object categorization Pose clustering Object recognition by local features Image.

© 2008 Pearson Addison Wesley. All rights reserved Chapter Seven Costs.

Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.

Chapter 1 The Study of Body Function Image PowerPoint

Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 6 Author: Julia Richards and R. Scott Hawley.

Author: Julia Richards and R. Scott Hawley

Properties Use, share, or modify this drill on mathematic properties. There is too much material for a single class, so you’ll have to select for your.

Objectives: Generate and describe sequences. Vocabulary:

UNITED NATIONS Shipment Details Report – January 2006.

David Burdett May 11, 2004 Package Binding for WS CDL.

1 RA I Sub-Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Casablanca, Morocco, 20 – 22 December 2005 Status of observing programmes in RA I.

Properties of Real Numbers CommutativeAssociativeDistributive Identity + × Inverse + ×

Custom Statutory Programs Chapter 3. Customary Statutory Programs and Titles 3-2 Objectives Add Local Statutory Programs Create Customer Application For.

FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.

1 Hierarchical Part-Based Human Body Pose Estimation * Ramanan Navaratnam * Arasanathan Thayananthan Prof. Phil Torr * Prof. Roberto Cipolla * University.

1. 2 Objectives Become familiar with the purpose and features of Epsilen Learn to navigate the Epsilen environment Develop a professional ePortfolio on.

REVIEW: Arthropod ID. 1. Name the subphylum. 2. Name the subphylum. 3. Name the order.

PP Test Review Sections 6-1 to 6-6

EU market situation for eggs and poultry Management Committee 20 October 2011.

EU Market Situation for Eggs and Poultry Management Committee 21 June 2012.

Bright Futures Guidelines Priorities and Screening Tables

Bellwork Do the following problem on a ½ sheet of paper and turn in.

2 |SharePoint Saturday New York City

Computer vision: models, learning and inference

Distinctive Image Features from Scale-Invariant Keypoints David Lowe.

Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.

1 RA III - Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Buenos Aires, Argentina, 25 – 27 October 2006 Status of observing programmes in RA.

Factor P 16 8(8-5ab) 4(d² + 4) 3rs(2r – s) 15cd(1 + 2cd) 8(4a² + 3b²)

Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.

CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.

© 2012 National Heart Foundation of Australia. Slide 2.

Adding Up In Chunks.

Understanding Generalist Practice, 5e, Kirst-Ashman/Hull

Model and Relationships 6 M 1 M M M M M M M M M M M M M M M M

25 seconds left…...

Subtraction: Adding UP

Analyzing Genes and Genomes

DTU Informatics Introduction to Medical Image Analysis Rasmus R. Paulsen DTU Informatics TexPoint fonts.

©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.

Essential Cell Biology

Intracellular Compartments and Transport

PSSA Preparation.

Essential Cell Biology

Immunobiology: The Immune System in Health & Disease Sixth Edition

Chapter 13 Web Page Design Studio

Energy Generation in Mitochondria and Chlorplasts

Instructor: Mircea Nicolescu Lecture 17

CS4670 / 5670: Computer Vision Bag-of-words models Noah Snavely Object

Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,

Bag-of-features models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Lecture 28: Bag-of-words models

Bag-of-features models

Keypoint-based Recognition and Object Search

10/31/13 Object Recognition and Augmented Reality Computational Photography Derek Hoiem, University of Illinois Dali, Swans Reflecting Elephants.

Object Recognition and Augmented Reality

Review: Intro to recognition Recognition tasks Machine learning approach: training, testing, generalization Example classifiers Nearest neighbor Linear.

Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,

Keypoint-based Recognition Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/04/10.

Final Exam Review CS485/685 Computer Vision Prof. Bebis.

Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.

Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,

Perceptual and Sensory Augmented Computing Visual Object Recognition Tutorial Visual Object Recognition Bastian Leibe & Computer Vision Laboratory ETH.

10/31/13 Object Recognition and Augmented Reality Computational Photography Derek Hoiem, University of Illinois Dali, Swans Reflecting Elephants.

CS654: Digital Image Analysis

Lecture IX: Object Recognition (2)

By Suren Manvelyan, Crocodile (nile crocodile?) By Suren Manvelyan,

Presentation transcript:

Kapitel 14 Recognition Scene understanding / visual object categorization Pose clustering Object recognition by local features Image categorization Bag-of-features models Large-scale image search TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA

Scene understanding (1) Scene categorization outdoor/indoor city/forest/factory/etc.

Scene understanding (2) Object detection find pedestrians

Scene understanding (3) sky mountain building tree building banner street lamp market people

Visual object categorization (1) ~10,000 to 30,000

Visual object categorization (2)

Visual object categorization (3)

Visual object categorization (4) Recognition is all about modeling variability camera position illumination shape variations within-class variations

Visual object categorization (5) Within-class variations (why are they chairs?)

Pose clustering (1) Working in transformation space - main ideas: Generate many hypotheses of transformation image vs. model, each built by a tuple of image and model features Correct transformation hypotheses appear many times Main steps: Quantize the space of possible transformations For each tuple of image and model features, solve for the optimal transformation that aligns the matched features Record a “vote” in the corresponding transformation space bin Find "peak" in transformation space

Pose clustering (2) Example: Rotation only. A pair of one scene segment and one model segment suffices to generate a transformation hypothesis.

Pose clustering (3) C.F. Olson: Efficient pose clustering using a randomized algorithm. IJCV, 23(2): 131-147, 1997. 2-tuples of image and model corner points are used to generate hypotheses. (a) The corners found in an image. (b) The four best hypotheses found with the edges drawn in. The nose of the plane and the head of the person do not appear because they were not in the models.

Object recognition by local features (1) D.G. Lowe: Distinctive image features from scale-invariant keypoints. IJCV, 60(2): 91-110, 2004 The SIFT features of training images are extracted and stored For a query image Extract SIFT features Efficient nearest neighbor indexing Pose clustering by Hough transform For clusters with >2 keypoints (object hypotheses): determine the optimal affine transformation parameters by least squares method; geometry verification Input Image Stored Image

Object recognition by local features (2) Cluster of 3 corresponding feature pairs

Object recognition by local features (3)

Object recognition by local features (4)

Image categorization (1) Training Labels Training Images Training Image Features Classifier Training Trained Classifier Testing Image Features Trained Classifier Prediction Outdoor Test Image

Image categorization (2) Global histogram (distribution): color, texture, motion, … histogram matching distance

Image categorization (3) Cars found by color histogram matching See Chapter “Inhaltsbasierte Suche in Bilddatenbanken

Bag-of-features models (1) Bag of ‘words’ Object

Bag-of-features models (2) Origin 1: Text recognition by orderless document representation (frequencies of words from a dictionary), Salton & McGill (1983) US Presidential Speeches Tag Cloud http://chir.ag/phernalia/preztags/

Bag-of-features models (3) Origin 2: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures, it is the identity of the textons, not their spatial arrangement, that matters

Bag-of-features models (4) histogram Universal texton dictionary

Bag-of-features models (5) We need to build a “visual” dictionary! G. Cruska et al.: Visual categorization with bags of keypoints. Proc. of ECCV, 2004.

Bag-of-features models (6) Main step 1: Extract features (e.g. SIFT) …

Bag-of-features models (7) Main step 2: Learn visual vocabulary (e.g. using clustering) clustering

Bag-of-features models (8) visual vocabulary (cluster centers)

Bag-of-features models (9) Example: learned codebook … Appearance codebook Source: B. Leibe

Bag-of-features models (10) Main step 3: Quantize features using “visual vocabulary”

Bag-of-features models (11) Main step 4: Represent images by frequencies of “visual words” (i.e., bags of features)

Bag-of-features models (12) Example: representation based on learned codebook

Bag-of-features models (13) Main step 5: Apply any classifier to the histogram feature vectors

Bag-of-features models (14) Example:

Bag-of-features models (15) Caltech6 dataset Dictionary quality and size are very important parameters!

Bag-of-features models (16) Caltech6 dataset Action recognition J.C. Niebles, H. Wang, L. Fei-Fei: Unsupervised learning of human action categories using spatial-tempotal words. IJCV, 79(3): 299-318, 2008.

Large-scale image search (1) J. Philbin, et al.: Object retrieval with large vocabularies and fast spatial matching. Proc. of CVPR, 2007 Query Results from 5k Flickr images

Large-scale image search (2) Aachen Cathedral [Quack, Leibe, Van Gool, CIVR’08] Mobile tourist guide self-localization object/building recognition photo/video augmentation

Large-scale image search (3)

Large-scale image search (4) Moulin Rouge Old Town Square (Prague) Tour Montparnasse Colosseum Viktualienmarkt Maypole Application: Image auto-annotation Left: Wikipedia image Right: closest match from Flickr

Sources K. Grauman, B. Leibe: Visual Object Recognition. Morgen & Claypool Publishers, 2011 R. Szeliski: Computer Vision: Algorithms and Applications. Springer, 2010. (Chapter 14 “Recognition”) Course materials from others (G. Bebis, J. Hays, S. Lazebnik, …)