A Two-level Pose Estimation Framework Using Majority Voting of Gabor Wavelets and Bunch Graph Analysis J. Wu, J. M. Pedersen, D. Putthividhya, D. Norgaard,

Slides:

Advertisements

Similar presentations

Complex Networks for Representation and Characterization of Images For CS790g Project Bingdong Li 9/23/2009.

Advertisements

Image Registration  Mapping of Evolution. Registration Goals Assume the correspondences are known Find such f() and g() such that the images are best.

Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.

Graph Embedding and Extensions: A General Framework for Dimensionality Reduction Keywords: Dimensionality reduction, manifold learning, subspace learning,

Face Recognition and Biometric Systems Elastic Bunch Graph Matching.

ECE738 Advanced Image Processing Face Recognition by Elastic Bunch Graph Matching IEEE Trans. PAMI, July 1997.

Face Recognition By Sunny Tang.

Face Description with Local Binary Patterns:

AGE ESTIMATION: A CLASSIFICATION PROBLEM HANDE ALEMDAR, BERNA ALTINEL, NEŞE ALYÜZ, SERHAN DANİŞ.

Robust 3D Head Pose Classification using Wavelets by Mukesh C. Motwani Dr. Frederick C. Harris, Jr., Thesis Advisor December 5 th, 2002 A thesis submitted.

Automatic Feature Extraction for Multi-view 3D Face Recognition

Amir Hosein Omidvarnia Spring 2007 Principles of 3D Face Recognition.

Uncertainty Representation. Gaussian Distribution variance Standard deviation.

Face Recognition & Biometric Systems, 2005/2006 Face recognition process.

Face Recognition Committee Machine Presented by Sunny Tang.

ICIP 2000, Vancouver, Canada IVML, ECE, NTUA Face Detection: Is it only for Face Recognition?  A few years earlier  Face Detection Face Recognition 

One-Shot Multi-Set Non-rigid Feature-Spatial Matching

Dimensionality Reduction Chapter 3 (Duda et al.) – Section 3.8

1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.

A Study of Approaches for Object Recognition

CS 790Q Biometrics Face Recognition Using Dimensionality Reduction PCA and LDA M. Turk, A. Pentland, "Eigenfaces for Recognition", Journal of Cognitive.

Three Algorithms for Nonlinear Dimensionality Reduction Haixuan Yang Group Meeting Jan. 011, 2005.

Image Matching via Saliency Region Correspondences Alexander Toshev Jianbo Shi Kostas Daniilidis IEEE Conference on Computer Vision and Pattern Recognition.

Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.

Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.

Statistical Shape Models Eigenpatches model regions –Assume shape is fixed –What if it isn’t? Faces with expression changes, organs in medical images etc.

Oral Defense by Sunny Tang 15 Aug 2003

CS 485/685 Computer Vision Face Recognition Using Principal Components Analysis (PCA) M. Turk, A. Pentland, "Eigenfaces for Recognition", Journal of Cognitive.

Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.

Representative Previous Work

Multiclass object recognition

Driver’s View and Vehicle Surround Estimation using Omnidirectional Video Stream Abstract Our research is focused on the development of novel machine vision.

1 Graph Embedding (GE) & Marginal Fisher Analysis (MFA) 吳沛勳劉冠成韓仁智

General Tensor Discriminant Analysis and Gabor Features for Gait Recognition by D. Tao, X. Li, and J. Maybank, TPAMI 2007 Presented by Iulian Pruteanu.

Graph Embedding: A General Framework for Dimensionality Reduction Dong XU School of Computer Engineering Nanyang Technological University

Access Control Via Face Recognition Progress Review.

IEEE TRANSSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

Multimodal Information Analysis for Emotion Recognition

Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao.

Computer Vision Lab. SNU Young Ki Baik Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

Face Recognition: An Introduction

A Statistical Method for 3D Object Detection Applied to Face and Cars CVPR 2000 Henry Schneiderman and Takeo Kanade Robotics Institute, Carnegie Mellon.

ISOMAP TRACKING WITH PARTICLE FILTER Presented by Nikhil Rane.

GRASP Learning a Kernel Matrix for Nonlinear Dimensionality Reduction Kilian Q. Weinberger, Fei Sha and Lawrence K. Saul ICML’04 Department of Computer.

Manifold learning: MDS and Isomap

Nonlinear Dimensionality Reduction Approach (ISOMAP)

EE4-62 MLCV Lecture Face Recognition – Subspace/Manifold Learning Tae-Kyun Kim 1 EE4-62 MLCV.

Speech Communication Lab, State University of New York at Binghamton Dimensionality Reduction Methods for HMM Phonetic Recognition Hongbing Hu, Stephen.

A NOVEL METHOD FOR COLOR FACE RECOGNITION USING KNN CLASSIFIER

Face Detection Using Skin Color and Gabor Wavelet Representation Information and Communication Theory Group Faculty of Information Technology and System.

Speech Lab, ECE, State University of New York at Binghamton  Classification accuracies of neural network (left) and MXL (right) classifiers with various.

MIT AI Lab / LIDS Laboatory for Information and Decision Systems & Artificial Intelligence Laboratory Massachusetts Institute of Technology A Unified Multiresolution.

2D-LDA: A statistical linear discriminant analysis for image matrix

Statistical Models of Appearance for Computer Vision 主講人：虞台文.

Computer vision. Applications and Algorithms in CV Tutorial 3: Multi scale signal representation Pyramids DFT - Discrete Fourier transform.

Computer Vision Lecture 7 Classifiers. Computer Vision, Lecture 6 Oleh Tretiak © 2005Slide 1 This Lecture Bayesian decision theory (22.1, 22.2) –General.

Face Detection & Recognition

Part 3: Estimation of Parameters. Estimation of Parameters Most of the time, we have random samples but not the densities given. If the parametric form.

1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.

Spectral Methods for Dimensionality

Deeply learned face representations are sparse, selective, and robust

Shuang Hong Yang College of Computing, Georgia Tech, USA Hongyuan Zha

Bag-of-Visual-Words Based Feature Extraction

Recognizing Deformable Shapes

Unsupervised Riemannian Clustering of Probability Density Functions

Can Computer Algorithms Guess Your Age and Gender?

Object Recognition in the Dynamic Link Architecture

Learning with information of features

Brief Review of Recognition + Context

Paper Reading Dalong Du April.08, 2011.

Presentation transcript:

A Two-level Pose Estimation Framework Using Majority Voting of Gabor Wavelets and Bunch Graph Analysis J. Wu, J. M. Pedersen, D. Putthividhya, D. Norgaard, T. Moeslund and M. M. Trivedi Computer Vision and Robotics Lab University of California, San Diego La Jolla, CA, U.S.A.

INTRODUCTION Applications: Intelligent meeting room Driver’s focus analysis Problem setup: Pose is determined by both pan angle (ß) and tilt angle (α) For attention focus problem, both angles need to be determined in a fine scale

PROBLEM DESCRIPTION 93 poses in total Pan angle goes from –90 0 to with discrete interval of 15 0 Tilt angle goes from –90 0 to with interval steps of either 15 0 or 30 0 For every pose, 15 images from 15 subjects are used as training samples while another 15 images from the same 15 subjects are used as the testing samples

POSE ESTIMATION FRAMEWORK Two level structure---coarse to fine: First level: pose estimation is accurate up to a 3x3 neighborhood Second level: accurate pose is determined in the 3x3 neighborhood First level output Second level output

FIRST LEVEL: Multi-resolution subspace classification by majority voting Motivation: –Features from one single resolution are not sufficient –Finer resolution: salient features are less addressed –Coarser resolution: loss of information –For features from different resolutions, different sets of salient features are addressed –They are equally important for classification Algorithm details: –Multi-resolution feature extraction –Gabor wavelets----multi-scale and multi-orientation analysis –Subspace feature extraction –PCA subspace feature extraction –KDA subspace feature extraction –Nearest prototype classification in every resolution and majority voting for classification results from different resolutions

GABOR WAVELETS ANALYSIS Why Gabor wavelets: –A joint spatial frequency representation –Extract the position and orientation of both global and local features as well as preserving frequency information. What is Gabor wavelets: –A convolution of the image with a family of Gabor kernels –All Gabor kernels are generated by a mother wavelet by dilation and rotation –Mother wavelet: a plane wave generated from a complex exponential and restricted by a Gaussian envelope ;

PCA V.S. KDA Why subspace analysis: –Extract the most discriminating information –Reduce the dimensionality PCA: –Linear transformation –The first M eigenvectors of the samples’ covariance matrix – Selects the directions that have most variance Why not PCA: –Not capable of extracting the non-linear structure –Not necessarily the best discriminating features for classification KDA: –Non-linear variant of LDA –Finds the projection according to the Fisher’s criterion, which maximizes the Rayleigh coefficient

PCA V.S. KDA (Contd.) Rayleigh coefficient: Introduce kernel: ; where: is called kernel. (Gaussian kernel is used here)

SECOND LEVEL: Structural landmark analysis by bunch graph template matching Motivation: –To refine the estimate from the first level –Geometric structure is able to catch the small difference between neighboring poses Bunch graph: –Geometric relationship between salient facial points is used –For each pose, a model bunch graph is constructed Nodes: salient facial points Edges: distance information between nodes –The bunch graph for the testing image is compared with a subset of the model bunch graphs –The model template that results the highest similarity score determines the final pose estimate ;

EXPERIMENTAL EVALUATION 3X3 5X5 PCA Subspace

EXPERIMENTAL EVALUATION 3X3 5X5 KDA Subspace PCA: 85.16%PCA: 97.71%

SECOND LEVEL EVALUATION nement #%#%#%#%#%#%

SECOND LEVEL EVALUATION 58.02%

CONCLUSION AND DISCUSSION Visual cues characterizing facial pose have unique multi-resolution spatial frequency and structural signatures In the first level, the statistical multi-resolution subspace analysis gives the pose estimation with an uncertainty of ±15 degree, 90.32% accuracy is achieved In the second level, the structural details are exploited to eliminate the uncertainty, 58.02% accuracy is achieved In the first level, the face registration is done manually, automatic face registration by facial landmark detection algorithm is under investigation and some promising preliminary results have been obtained

THANK YOU!