Face Detection and Head Tracking Ying Wu Electrical Engineering & Computer Science Northwestern University, Evanston, IL

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Active Appearance Models
CSCE643: Computer Vision Bayesian Tracking & Particle Filtering Jinxiang Chai Some slides from Stephen Roth.
Detecting Faces in Images: A Survey
Rapid Object Detection using a Boosted Cascade of Simple Features Paul Viola, Michael Jones Conference on Computer Vision and Pattern Recognition 2001.
Rapid Object Detection using a Boosted Cascade of Simple Features Paul Viola, Michael Jones Conference on Computer Vision and Pattern Recognition 2001.
Face detection Behold a state-of-the-art face detector! (Courtesy Boris Babenko)Boris Babenko.
Computer Vision for Human-Computer InteractionResearch Group, Universität Karlsruhe (TH) cv:hci Dr. Edgar Seemann 1 Computer Vision: Histograms of Oriented.
AdaBoost & Its Applications
Face detection Many slides adapted from P. Viola.
Cos 429: Face Detection (Part 2) Viola-Jones and AdaBoost Guest Instructor: Andras Ferencz (Your Regular Instructor: Fei-Fei Li) Thanks to Fei-Fei Li,
EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.
Detecting Pedestrians by Learning Shapelet Features
Challenges in Learning the Appearance of Faces for Automated Image Analysis: part I alessandro verri DISI – università di genova
The Viola/Jones Face Detector Prepared with figures taken from “Robust real-time object detection” CRL 2001/01, February 2001.
The Viola/Jones Face Detector (2001)
Robust Moving Object Detection & Categorization using self- improving classifiers Omar Javed, Saad Ali & Mubarak Shah.
Lecture 5 Template matching
Tracking Objects with Dynamics Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/21/15 some slides from Amin Sadeghi, Lana Lazebnik,
HCI Final Project Robust Real Time Face Detection Paul Viola, Michael Jones, Robust Real-Time Face Detetion, International Journal of Computer Vision,
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Robust Real-time Object Detection by Paul Viola and Michael Jones ICCV 2001 Workshop on Statistical and Computation Theories of Vision Presentation by.
Ensemble Tracking Shai Avidan IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE February 2007.
Face detection and recognition Many slides adapted from K. Grauman and D. Lowe.
A Robust Real Time Face Detection. Outline  AdaBoost – Learning Algorithm  Face Detection in real life  Using AdaBoost for Face Detection  Improvements.
CVPR 2006 New York City Granularity and Elasticity Adaptation in Visual Tracking Ming Yang, Ying Wu NEC Laboratories America Cupertino, CA 95014
CS 223B Assignment 1 Help Session Dan Maynes-Aminzade.
1 Integration of Background Modeling and Object Tracking Yu-Ting Chen, Chu-Song Chen, Yi-Ping Hung IEEE ICME, 2006.
A Robust Real Time Face Detection. Outline  AdaBoost – Learning Algorithm  Face Detection in real life  Using AdaBoost for Face Detection  Improvements.
Robust Real-Time Object Detection Paul Viola & Michael Jones.
Viola and Jones Object Detector Ruxandra Paun EE/CS/CNS Presentation
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Foundations of Computer Vision Rapid object / face detection using a Boosted Cascade of Simple features Presented by Christos Stoilas Rapid object / face.
Face Detection CSE 576. Face detection State-of-the-art face detection demo (Courtesy Boris Babenko)Boris Babenko.
FACE DETECTION AND RECOGNITION By: Paranjith Singh Lohiya Ravi Babu Lavu.
Online Learning Algorithms
Face Detection using the Viola-Jones Method
EADS DS / SDC LTIS Page 1 7 th CNES/DLR Workshop on Information Extraction and Scene Understanding for Meter Resolution Image – 29/03/07 - Oberpfaffenhofen.
Introduction to Computer Vision Olac Fuentes Computer Science Department University of Texas at El Paso El Paso, TX, U.S.A.
A General Framework for Tracking Multiple People from a Moving Camera
Detecting Pedestrians Using Patterns of Motion and Appearance Paul Viola Microsoft Research Irfan Ullah Dept. of Info. and Comm. Engr. Myongji University.
Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.
Lecture 29: Face Detection Revisited CS4670 / 5670: Computer Vision Noah Snavely.
Face detection Slides adapted Grauman & Liebe’s tutorial
Visual Object Recognition
DIEGO AGUIRRE COMPUTER VISION INTRODUCTION 1. QUESTION What is Computer Vision? 2.
Supervised Learning of Edges and Object Boundaries Piotr Dollár Zhuowen Tu Serge Belongie.
ECE738 Advanced Image Processing Face Detection IEEE Trans. PAMI, July 1997.
A Statistical Method for 3D Object Detection Applied to Face and Cars CVPR 2000 Henry Schneiderman and Takeo Kanade Robotics Institute, Carnegie Mellon.
Stable Multi-Target Tracking in Real-Time Surveillance Video
Face Detection Ying Wu Electrical and Computer Engineering Northwestern University, Evanston, IL
HCI/ComS 575X: Computational Perception Instructor: Alexander Stoytchev
Lecture 09 03/01/2012 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
The Viola/Jones Face Detector A “paradigmatic” method for real-time object detection Training is slow, but detection is very fast Key ideas Integral images.
Boosted Particle Filter: Multitarget Detection and Tracking Fayin Li.
Learning to Detect Faces A Large-Scale Application of Machine Learning (This material is not in the text: for further information see the paper by P.
Hand Gesture Recognition Using Haar-Like Features and a Stochastic Context-Free Grammar IEEE 高裕凱 陳思安.
FACE DETECTION : AMIT BHAMARE. WHAT IS FACE DETECTION ? Face detection is computer based technology which detect the face in digital image. Trivial task.
Notes on HW 1 grading I gave full credit as long as you gave a description, confusion matrix, and working code Many people’s descriptions were quite short.
A Brief Introduction on Face Detection Mei-Chen Yeh 04/06/2010 P. Viola and M. J. Jones, Robust Real-Time Face Detection, IJCV 2004.
Cell Segmentation in Microscopy Imagery Using a Bag of Local Bayesian Classifiers Zhaozheng Yin RI/CMU, Fall 2009.
Face detection Many slides adapted from P. Viola.
AdaBoost Algorithm and its Application on Object Detection Fayin Li.
Face Detection 蔡宇軒.
Object detection as supervised classification
Object Tracking Based on Appearance and Depth Information
Learning to Detect Faces Rapidly and Robustly
Cos 429: Face Detection (Part 2) Viola-Jones and AdaBoost Guest Instructor: Andras Ferencz (Your Regular Instructor: Fei-Fei Li) Thanks to Fei-Fei.
ADABOOST(Adaptative Boosting)
Presentation transcript:

Face Detection and Head Tracking Ying Wu Electrical Engineering & Computer Science Northwestern University, Evanston, IL

Face Detection: The Problem The Goal: Identify and locate faces in an image The Challenges: Position Scale Orientation Illumination Facial expression Partial occlusion

Outline The Basics Visual Detection –A framework –Pattern classification –Handling scales Viola & Jones’ method –Feature: Integral image –Classifier: AdaBoosting –Speedup: Cascading classifiers –Putting things together Other methods Open Issues

The Basics: Detection Theory Bayesian decision Likelihood ratio detection

Bayesian Rule posterior likelihood prior

Bayesian Decision Classes {  1,  2,…,  c } Actions {  1,  2,…,  a } Loss: (  k |  i ) Risk: Overall risk: Bayesian decision

Minimum-Error-Rate Decision

Likelihood Ratio Detection x – the data H – hypothesis –H0: the data does not contain the target –H1: the data contains the target Detection: p(x|H1) > p(x|H0) Likelihood ratio

Detection vs. False Positive “+” “-” false positive miss detection threshold “+” “-” false positivemiss detection threshold

Visual Detection A Framework Three key issues – target representation – pattern classification – effective search

Visual Detection Detecting an “object” in an image –output: location and size Challenges – how to describe the “object”? – how likely is an image patch the image of the target? – how to handle rotation? – how to handle the scale? – how to handle illumination?

A Framework Detection window Scan all locations and scales

Three Key Issues Target Representation Pattern Classification –classifier –training Effective Search

Target Representation Rule-based –e.g. “the nose is underneath two eyes”, etc. Shape Template-based – deformable shape Image Appearance-based – vectorize the pixels of an image patch Visual Feature-based – descriptive features

Pattern Classification Linear separable Linear non-separable

Effective Search Location – scan pixel by pixel Scale – solution I keep the size of detection window the same use multiple resolution images – solution II: change the size of detection window Efficiency???

Viola & Jones’ detector Feature  integral image Classifier  AdaBoosting Speedup  Cascading classifiers Putting things together

An Overview Feature-based face representation AdaBoosting as the classifier Cascading classifier to speedup

Harr-like features Q1: how many features can be calculated within a detection window? Q2: how to calculate these features rapidly?

Integral Image

The Smartness

Training and Classification Training – why? – An optimization problem – The most difficult part Classification – basic: two-class (0/1) classification – classifier – online computation

Weak Classifier Weak? – using only one feature for classification – classifier:  thresholding – a weak classifier: (f j,  j,p j ) Why not combining multiple weak classifiers? How???

Training: AdaBoosting Idea 1: combining weak classifiers Idea 2: feature selection

Feature Selection How many features do we have? What is the best strategy?

Training Algorithm

The Final Classifier This is a linear combination of a selected set of weak classifiers

Learning Results

Attentional Cascade Motivation –most detection windows contain non-faces –thus, most computation is wasted Idea? – can we save some computation on non-faces? – can we reject the majority of the non-faces very quickly? – using simple classifiers for screening!

Cascading classifiers

Designing Cascade Design parameters –# of cascade stages –# of features for each stage –parameters of each stage Example: a 32-stage classifier –S1: 2-feature, detect 100% faces and reject 60% non-faces –S2: 5-feature, detect 100% faces and reject 80% non-faces –S3-5: 20-feature –S6-7: 50-feature –S8-12: 100-feature –S13-32: 200-feature

Comparison

Comments It is quite difficult to train the cascading classifiers

Handling scales Scaling the detector itself, rather than using multiple resolution images Why? –const computation Practice –Use a set of scales a factor of 1.25 apart

Integrating multiple detection Why multiple detection? –detector is insensitive to small changes in translation and scale Post-processing –connect component labeling –the center of the component

Putting things together Training: off-line –Data collection positive data negative data –Validation set –Cascade AdaBoosting Detection: on-line –Scanning the image

Training Data

Results

ROC

Summary Advantages –Simple  easy to implement –Rapid  real-time system Disadvantages –Training is quite time-consuming (may take days) –May need enormous engineering efforts for fine tuning

Other Methods Rowley-Baluja-Kanade

Train a set of multilayer perceptrons and arbitrate a decision among all the inputs, and search among different scales, [Rowley, Baluja and Kanade, 1998]

RBK: Some Results Courtesy of Rowley et al., 1998

Open Issues Out-of-plane rotation Occlusion Illumination

Tracking Heads?  The task: Localize faces and track them in image sequences  Challenges: Lighting, occlusion, rotation, etc. Courtesy of Y. Wu, 2001

Outline Motivation What is tracking? One solution (Birchfield_CVPR98) Other methods and open issues

Motivation Why tracking? –The complexity of face detection scan all the pixel positions and several scales –The limitation of face detection hard to handle out-of-plane rotation – Can we maintain the identity of the faces? although face recognition is the ultimate solution for this, we may not need it, if not necessary Objectives – fast (frame-rate) face/head localization – handle 360 o out-of-plane rotation

Visual Tracking

Four Elements Infer target states in video sequences Target states vs. image observations Visual cues and modalities Four elements –Target representation X –Observation representation Z –Hypotheses measurement p(Z t |X t ) –Hypotheses generating p(X t |X t-1 )

Visual Tracking Ground Truth Prediction Hypothesis Estimation

Formulating Visual Tracking P(X t |X t-1 ) Dynm. Mdl P(Z t |X t ) Obsrv. Mdl

Tracking as Density Propagation State space X t State space X t+1 Posterior Prob. Posterior Prob.

One Solution (Birchfield_CVPR98)  Framework  Search strategy  Edge cue  Color cue

Framework s = (x,y,  ) Tracking is treated as a local search based on the prediction hypotheses Edge matching score color matching score

Search Strategy Local exhaustive search Do you have better ideas?  is the search step size

Edge Cue Method I Method II Which is better? The the magnitude of the gradient at perimeter pixel i of the ellipse s. # of pixels on the perimeter of the ellipse unit vector normal to the ellipse at pixel i.

Normalization Why do we need normalization? How good is it?

Color Cue Histogram intersection # of bins Model histogram

Color Cue Color space –B-G –G-R –R+G+B (why do we need that) 8 bins for B-G and G-R, 4 for R+G+B Training the model histogram Normalization

Comments Can the rotation be handled? Can the scaling issue be handled? Is the search strategy good enough? Is the color module good? Is the motion prediction enough? Is the combination of the two cues good? Can it handle occlusion? Can it cope with multiple faces –Coalesce –Switch ID

Other Solutions  Condensation algorithm  3D head tracking

Tracking as Density Propagation State space X t State space X t+1 Posterior Prob. Posterior Prob.

Sequential Monte Carlo P(X t |Z t ) is represented by a set of weighted samples Sample weights are determined by P(Z t (n) |X t (n) ) Hypotheses generating is controlled by P(X t |X t-1 )

Challenge to Condensation Curse of dimensionality –What to track? Positions, orientations Shape deformation Color appearance changing –The dimensionality of X –The number of hypotheses grows exponentially

3D Face Tracking: The Problem The goal: Estimate and track 3D head poses The challenges: Side view Back view Poor illumination Low resolution Different users

3D Face Tracking: A Solution Predictor Motion Model Final Pose Estimator Cropped Input Image Prepro- cessing Feature Extraction Ellipsoid Model Annotated Pose Courtesy of Y. Wu and K. Toyama, 2000

3D Face Tracking: some results