Object Recognition Based on Shape Similarity Longin Jan Latecki Computer and Information Sciences Dept. Temple Univ.,

Slides:



Advertisements
Similar presentations
Project title: Image Recognition and Classification Based on Object Parts PI: Longin Jan Latecki (Temple Univ.) Co-PI: Rolf Lakaemper (Temple Univ.) Co-PI:
Advertisements

Recovering Human Body Configurations: Combining Segmentation and Recognition Greg Mori, Xiaofeng Ren, and Jitentendra Malik (UC Berkeley) Alexei A. Efros.
NUS CS5247 Motion Planning for Camera Movements in Virtual Environments By Dennis Nieuwenhuisen and Mark H. Overmars In Proc. IEEE Int. Conf. on Robotics.
1 Approximated tracking of multiple non-rigid objects using adaptive quantization and resampling techniques. J. M. Sotoca 1, F.J. Ferri 1, J. Gutierrez.
Proposed concepts illustrated well on sets of face images extracted from video: Face texture and surface are smooth, constraining them to a manifold Recognition.
Patch to the Future: Unsupervised Visual Prediction
Object Recognition using Invariant Local Features Applications l Mobile robots, driver assistance l Cell phone location or object recognition l Panoramas,
Probabilistic Robotics
(Includes references to Brian Clipp
Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee.
Lab 2 Lab 3 Homework Labs 4-6 Final Project Late No Videos Write up
Robust Object Tracking via Sparsity-based Collaborative Model
Shape Classification Based on Skeleton Path Similarity Xingwei Yang, Xiang Bai, Deguang Yu, and Longin Jan Latecki.
Uncertainty Representation. Gaussian Distribution variance Standard deviation.
Instructor: Mircea Nicolescu Lecture 13 CS 485 / 685 Computer Vision.
Object Detection by Matching Longin Jan Latecki. Contour-based object detection Database shapes: …..
Hidden Markov Model based 2D Shape Classification Ninad Thakoor 1 and Jean Gao 2 1 Electrical Engineering, University of Texas at Arlington, TX-76013,
Randomized Motion Planning for Car-like Robots with C-PRM Guang Song and Nancy M. Amato Department of Computer Science Texas A&M University College Station,
Visual Recognition Tutorial
Shape and Dynamics in Human Movement Analysis Ashok Veeraraghavan.
ICIP 2000, Vancouver, Canada IVML, ECE, NTUA Face Detection: Is it only for Face Recognition?  A few years earlier  Face Detection Face Recognition 
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
A Study of Approaches for Object Recognition
Adaptive Rao-Blackwellized Particle Filter and It’s Evaluation for Tracking in Surveillance Xinyu Xu and Baoxin Li, Senior Member, IEEE.
Detecting and Tracking Moving Objects for Video Surveillance Isaac Cohen and Gerard Medioni University of Southern California.
SLAM: Simultaneous Localization and Mapping: Part I Chang Young Kim These slides are based on: Probabilistic Robotics, S. Thrun, W. Burgard, D. Fox, MIT.
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Snakes Goes from edges to boundaries. Edge is strong change in intensity. Boundary is boundary of an object. –Smooth (more or less) –Closed. –…
Probabilistic Robotics
Based on Slides by D. Gunopulos (UCR)
Randomized Motion Planning for Car-like Robots with C-PRM Guang Song, Nancy M. Amato Department of Computer Science Texas A&M University College Station,
Automatic Camera Calibration for Image Sequences of a Football Match Flávio Szenberg (PUC-Rio) Paulo Cezar P. Carvalho (IMPA) Marcelo Gattass (PUC-Rio)
A Bidirectional Matching Algorithm for Deformable Pattern Detection with Application to Handwritten Word Retrieval by K.W. Cheung, D.Y. Yeung, R.T. Chin.
1 Activity and Motion Detection in Videos Longin Jan Latecki and Roland Miezianko, Temple University Dragoljub Pokrajac, Delaware State University Dover,
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Computer Vision and Data Mining Research Projects Longin Jan Latecki Computer and Information Sciences Dept. Temple University
Distinctive Image Features from Scale-Invariant Keypoints By David G. Lowe, University of British Columbia Presented by: Tim Havinga, Joël van Neerbos.
Circular Augmented Rotational Trajectory (CART) Shape Recognition & Curvature Estimation Presentation for 3IA 2007 Russel Ahmed Apu & Dr. Marina Gavrilova.
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
Gwangju Institute of Science and Technology Intelligent Design and Graphics Laboratory Multi-scale tensor voting for feature extraction from unstructured.
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
1 A Bayesian Method for Guessing the Extreme Values in a Data Set Mingxi Wu, Chris Jermaine University of Florida September 2007.
Mapping and Localization with RFID Technology Matthai Philipose, Kenneth P Fishkin, Dieter Fox, Dirk Hahnel, Wolfram Burgard Presenter: Aniket Shah.
Intelligent Vision Systems ENT 496 Object Shape Identification and Representation Hema C.R. Lecture 7.
Particle Filters for Shape Correspondence Presenter: Jingting Zeng.
Young Ki Baik, Computer Vision Lab.
Particle Filters.
Image Segmentation & Template Matching Multimedia Signal Processing lecture on Petri Hirvonen.
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
Ground Truth Free Evaluation of Segment Based Maps Rolf Lakaemper Temple University, Philadelphia,PA,USA.
In Defense of Nearest-Neighbor Based Image Classification Oren Boiman The Weizmann Institute of Science Rehovot, ISRAEL Eli Shechtman Adobe Systems Inc.
CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Supervised Learning Resources: AG: Conditional Maximum Likelihood DP:
Autonomous Robots Vision © Manfred Huber 2014.
CSE 140: Computer Vision Camillo J. Taylor Assistant Professor CIS Dept, UPenn.
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
A Tutorial on using SIFT Presented by Jimmy Huff (Slightly modified by Josiah Yoder for Winter )
Course14 Dynamic Vision. Biological vision can cope with changing world Moving and changing objects Change illumination Change View-point.
Shape similarity and Visual Parts Longin Jan Latecki Temple Univ., In cooperation with Rolf Lakamper (Temple Univ.),
Discrete Approach to Curve and Surface Evolution
Using the Particle Filter Approach to Building Partial Correspondences between Shapes Rolf Lakaemper, Marc Sobel Temple University, Philadelphia,PA,USA.
Recognizing specific objects Matching with SIFT Original suggestion Lowe, 1999,2004.
11/25/03 3D Model Acquisition by Tracking 2D Wireframes Presenter: Jing Han Shiau M. Brown, T. Drummond and R. Cipolla Department of Engineering University.
Traffic Sign Recognition Using Discriminative Local Features Andrzej Ruta, Yongmin Li, Xiaohui Liu School of Information Systems, Computing and Mathematics.
Supervised Time Series Pattern Discovery through Local Importance
Camera Calibration class 9
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Probabilistic Robotics Bayes Filter Implementations FastSLAM
Presentation transcript:

Object Recognition Based on Shape Similarity Longin Jan Latecki Computer and Information Sciences Dept. Temple Univ., Collaborators: Zygmunt Pizlo, Psychological Sciences Dept., Purdue Univ., Nagesh Adluru, Suzan Köknar-Tezel, Rolf Lakaemper, Thomas Young, Temple Univ., Xiang Bai, Huazhong Univ. of Sci. & Tech. Wuhan, China

2 Object Recognition Process: Source: 2D image of a 3D object Matching: Correspondence of Visual Parts Contour Segmentation Contour Extraction Object Segmentation Contour Cleaning, e.g., Evolution

3 Motivation Once a significant visual part is recognized the whole recognition process is strongly constrained in possible top- down object models. (H1) object recognition is preceded by, and based on recognition of visual parts. (H2): contour extraction is driven by shape similarity to a known shape.

4 What do you see?

5 With grouping constraints we can see (i.e., recognize the object).

6 Object contours Psychophysical and neurophysiological studies provide an abundance of evidence that contours of objects are extracted in early processing stages of human visual perception. Contours play a central role in the Gestalt-theory.

7 Salient visual parts can influence the object recognition (Singh and Hoffman 2001)

8

9

10 Visual parts and shape similarity (H1) object recognition is preceded by, and based on shape recognition of visual parts. (H2): contour extraction is driven by shape similarity to a known shape. becomes: (H2) Contour extraction is based on grouping of contour parts to larger contour parts with grouping assignments driven by shape familiarity.

11 Contour detection is a difficult inverse problem A given image could be produced by infinitely many possible 3D scenes. In order to produce a unique, stable and accurate interpretation, the visual system must use a priori constraints (see Pizlo, 2001 for a review). The solution is obtained by optimizing a cost function which consists of two general terms: 1. how close the solution is to the visual data 2. how well the constraints are satisfied

12 Partial shape similarity

13 Partial shape similarity (1) length problem, (2) scale problem, (3) distortion problem Query ShapeTarget Shape Given only a part (of a shape ), find similar shapes

14 Partial Shape Similarity We reduce partial shape similarity to subsequence matching: This is done by computing a curvature like value at every contour point. We do this for complete contours of known objects in our database and for query contour parts extracted from edge images

15 Subsequence Matching (shape similarity) Database contours Query contours

16 Motivation for subsequence matching The top (red) and bottom (blue) sequences represent parts of contours of two different but very similar bone shapes

17 Motivation(2) Example sequences: a = {1, 2, 8, 6, 8} b = {1, 2, 9, 15, 3, 5, 9}

18 OSB Algorithm Goal: given two real-valued sequences a and b, find subsequences a’ of a and b’ of b such that a’ best matches b’ Possible to skip elements in both a and b The ability to exclude outliers Preserve the order of the elements A one-to-one correspondence

19 OSB Algorithm (2) Create a dissimilarity matrix No restrictions on the distance function d We used d(a i,b j ) = (a i – b j ) 2 To find the optimal correspondence, use a shortest path algorithm on a DAG

20 OSB Algorithm (3) The nodes of the DAG are all the index pairs of the matrix: (i,j){1,…,m}{1,…,n} The edge weights w are defined by C is the jump cost (the penalty for skipping an element)

21 OSB Algorithm (4) The edge cost may be extended to impose a warping window Set a maximal value for k – i – 1 and l – j - 1 This definition of the edge weights is our main contribution

22 A Simple Example a = {1, 2, 8, 6, 8} b = {1, 2, 9, 15, 3, 5, 9} b a d(a i,b j ) = (a i – b j ) 2 The dissimilarity matrix

23 (indices) elements distance Key (1,1) (1,2) (1,3) (1,4) (1,5) (1,6) (1,7) (2,1) (2,2) (2,3) (2,4) (2,5) (2,6) (2,7) (3,1) (3,2) (3,3) (3,4) (3,5) (3,6) (3,7) (4,1) (4,2) (4,3) (4,4) (4,5) (4,6) (4,7) (5,1) (5,2) (5,3) (5,4) (5,5) (5,6) (5,7) The DAG

24 Experimental results on MPEG 7 dataset, 1400 targets in 70 classes

25 Humans group contours automatically An adaptive, probabilistic process to perform grouping All shapes contain local symmetry  exploit local symmetry How to find contour parts in images ?

26 Shape model

27

28

29 Play moviemovie

30 Rao-Blackwellized particle filter is an adaptive, probabilistic approach Frequently utilized in SLAM approaches to Robot Mapping Each particle’s successor is its most likely successor Particles are resampled to eliminate poorly performing particles Contour Grouping as Robot Mapping

31 Traversal space generated as discrete “center points”

32 Center Points Center points act as center points for maximal radius disk between the two sample points Full set of center points gives full set of maximal radius disks Entire set of potential skeletal points Want to generate a skeletal path that best groups the segments for a given shape model

33 Center points and particle paths

34 Shape model System needs to utilize reference model Some a priori knowledge to discover the proper shape Model is a sequence of radii at sample skeleton points Position in reference model determined by triangulation

35 Contour Smoothness Smoothness as a criterion for segment selection Smoothness is the measure of the amount of turn and the distance between segments Use least sum of distance to determine both distance and the segment pairing Smoothness as Gaussian mixture of distance and angle

36 SLAM framework Obtain particles by sampling from the maximum posterior probability x is the path traversal m is the contour grouping model z is the observations u is the reference model

37 Particle filter 1)Sampling: The next generation of particles x (i) t is obtained from the current generation x (i) t-1 by sampling from a proposal distribution for

38 2) Importance Weighting: An individual importance weight w (i) is assigned to each particle, according to

39 log(M(c 2 )) – log of pdf that a given pixel is a center point of radius 10

40 log(M(c 2 )) – log of pdf that a given pixel is a center point of radius 10

41 4) Contour Estimating: For each pose sample x (i) t, the corresponding contour estimate m (i) t is computed based on the trajectory and the history of observations according to 3) Resampling: Particles with a low importance weight w (i) are typically replaced by samples with a high weight. Residual resampling was used

42 Results Grouping performed on several pictures Useful groupings on many images Little or no noise grouped Few structural particles missed

43

44

45 Future Work Integration of the shape similarity of parts and the contour grouping Learning good contour parts Further improvements to the contour grouping to make it more robust