Bayesian Decision Theory Case Studies

Slides:



Advertisements
Similar presentations
ARTIFICIAL PASSENGER.
Advertisements

Pattern Classification & Decision Theory. How are we doing on the pass sequence? Bayesian regression and estimation enables us to track the man in the.
Applications of one-class classification
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Matthias Wimmer, Bernd Radig, Michael Beetz Chair for Image Understanding Computer Science TU München, Germany A Person and Context.
Advanced Image Processing Student Seminar: Lipreading Method using color extraction method and eigenspace technique ( Yasuyuki Nakata and Moritoshi Ando.
Eigenfaces for Recognition Presented by: Santosh Bhusal.
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Change Detection C. Stauffer and W.E.L. Grimson, “Learning patterns of activity using real time tracking,” IEEE Trans. On PAMI, 22(8): , Aug 2000.
Face Recognition. Introduction Why we are interested in face recognition? Why we are interested in face recognition? Passport control at terminals in.
Human Identity Recognition in Aerial Images Omar Oreifej Ramin Mehran Mubarak Shah CVPR 2010, June Computer Vision Lab of UCF.
Learning to estimate human pose with data driven belief propagation Gang Hua, Ming-Hsuan Yang, Ying Wu CVPR 05.
Robust Object Tracking via Sparsity-based Collaborative Model
ICIP 2000, Vancouver, Canada IVML, ECE, NTUA Face Detection: Is it only for Face Recognition?  A few years earlier  Face Detection Face Recognition 
Tracking Objects with Dynamics Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/21/15 some slides from Amin Sadeghi, Lana Lazebnik,
Parameter Estimation: Maximum Likelihood Estimation Chapter 3 (Duda et al.) – Sections CS479/679 Pattern Recognition Dr. George Bebis.
Towards modelling the Semantics of Natural Human Body Movement Hayley Hung A method of feature extraction using.
Automatic Face Recognition Using Color Based Segmentation and Intelligent Energy Detection Michael Padilla and Zihong Fan Group 16 EE368, Spring
CONTENT BASED FACE RECOGNITION Ankur Jain 01D05007 Pranshu Sharma Prashant Baronia 01D05005 Swapnil Zarekar 01D05001 Under the guidance of Prof.
Object Detection and Tracking Mike Knowles 11 th January 2005
Algebraic Functions of Views for 3D Object Recognition CS773C Advanced Machine Intelligence Applications Spring 2008: Object Recognition.
Effective Gaussian mixture learning for video background subtraction Dar-Shyang Lee, Member, IEEE.
Computer Vision I Instructor: Prof. Ko Nishino. Today How do we recognize objects in images?
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
A Vision-Based System that Detects the Act of Smoking a Cigarette Xiaoran Zheng, University of Nevada-Reno, Dept. of Computer Science Dr. Mubarak Shah,
Face Recognition and Retrieval in Video Basic concept of Face Recog. & retrieval And their basic methods. C.S.E. Kwon Min Hyuk.
Noise Estimation from a Single Image Ce Liu William T. FreemanRichard Szeliski Sing Bing Kang.
1 Activity and Motion Detection in Videos Longin Jan Latecki and Roland Miezianko, Temple University Dragoljub Pokrajac, Delaware State University Dover,
Computer vision: models, learning and inference Chapter 6 Learning and Inference in Vision.
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
Human-Computer Interaction Human-Computer Interaction Tracking Hanyang University Jong-Il Park.
1. Introduction Motion Segmentation The Affine Motion Model Contour Extraction & Shape Estimation Recursive Shape Estimation & Motion Estimation Occlusion.
EE 492 ENGINEERING PROJECT LIP TRACKING Yusuf Ziya Işık & Ashat Turlibayev Yusuf Ziya Işık & Ashat Turlibayev Advisor: Prof. Dr. Bülent Sankur Advisor:
An Information Fusion Approach for Multiview Feature Tracking Esra Ataer-Cansizoglu and Margrit Betke ) Image and.
Object Recognition in Images Slides originally created by Bernd Heisele.
Face Recognition: An Introduction
Expectation-Maximization (EM) Case Studies
CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.
Boosted Particle Filter: Multitarget Detection and Tracking Fayin Li.
A Reliable Skin Detection Using Dempster-Shafer Theory of Evidence
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
Computational Intelligence: Methods and Applications Lecture 26 Density estimation, Expectation Maximization. Włodzisław Duch Dept. of Informatics, UMK.
Learning and Removing Cast Shadows through a Multidistribution Approach Nicolas Martel-Brisson, Andre Zaccarin IEEE TRANSACTIONS ON PATTERN ANALYSIS AND.
Face Detection 蔡宇軒.
Gaussian Mixture Model classification of Multi-Color Fluorescence In Situ Hybridization (M-FISH) Images Amin Fazel 2006 Department of Computer Science.
Bayesian Decision Theory Case Studies CS479/679 Pattern Recognition Dr. George Bebis.
CS479/679 Pattern Recognition Dr. George Bebis
Computer vision: models, learning and inference
Traffic Sign Recognition Using Discriminative Local Features Andrzej Ruta, Yongmin Li, Xiaohui Liu School of Information Systems, Computing and Mathematics.
A Forest of Sensors: Using adaptive tracking to classify and monitor activities in a site Eric Grimson AI Lab, Massachusetts Institute of Technology
Tracking Objects with Dynamics
Face Recognition and Feature Subspaces
Dynamical Statistical Shape Priors for Level Set Based Tracking
Lecture 25: Introduction to Recognition
Tremor Detection Using Motion Filtering and SVM Bilge Soran, Jenq-Neng Hwang, Linda Shapiro, ICPR, /16/2018.
Eric Grimson, Chris Stauffer,
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
George Bebis and Wenjing Li Computer Vision Laboratory
Midterm Exam Closed book, notes, computer Similar to test 1 in format:
Announcements Project 2 artifacts Project 3 due Thursday night
Announcements Project 4 out today Project 2 winners help session today
EE 492 ENGINEERING PROJECT
Announcements Artifact due Thursday
Midterm Exam Closed book, notes, computer Similar to test 1 in format:
EM Algorithm and its Applications
Announcements Artifact due Thursday
The “Margaret Thatcher Illusion”, by Peter Thompson
Presentation transcript:

Bayesian Decision Theory Case Studies CS479/679 Pattern Recognition Dr. George Bebis

Case Study I A. Madabhushi and J. Aggarwal, A bayesian approach to human activity recognition, 2nd International Workshop on Visual Surveillance, pp. 25-30, June 1999.

Human activity recognition Recognize human actions using visual information. Useful for monitoring of human activity in department stores, airports, high-security buildings etc. Building systems that can recognize any type of action is a difficult and challenging problem.

Goal Build a system that is capable of recognizing the following 10 (ten) actions, from a frontal or lateral view: sitting down standing up bending down getting up hugging squatting rising from a squatting position bending sideways falling backward walking

Rationale and Approach People sit, stand, walk, bend down, and get up in a more or less similar fashion. Human actions can be recognized by tracking various body parts. Head motion trajectory The head of a person moves in a characteristic fashion during these actions. Recognition is formulated as Bayesian classification using the movement of the head over consecutive frames.

Strengths and Weaknesses The system can recognize actions where the gait of the subject in the test sequence differs considerably from the training sequences. Also, it can recognize actions for people of varying physical structure (i.e., tall, short, fat, thin etc.). Weaknesses Only actions in the frontal or lateral view can be recognized successfully by this system. Certain assumptions might not be valid.

Main Steps input output

Action Representation Estimate the centroid of the head in each frame: Find the absolute differences in successive frames: |

Head Detection and Tracking The centroid of the head is tracked from frame to frame. Accurate head detection and tracking are crucial. Detection was performed manually here.

Bayesian Formulation Given an input sequence, the posterior probabilities are computed for each action using the Bayes rule: Assumption:

Probability Density Estimation Feature vectors X and Y are assumed to be independent (valid?), following a multi-variate Gaussian distribution:

Probability Density Estimation (cont’d) The sample covariance matrices are used to estimate ΣX and ΣY : Two distributions are estimated for each action corresponding to the frontal and lateral views (i.e., 20 densities total). ΣX ΣY

Recognition Given an input sequence, the posterior probabilities are computed for each of the stored actions (i.e., 20 values). The input action is classified based on the most likely action:

Discriminating Similar Actions In some actions, the head moves in a similar fashion, making it difficult to distinguish these actions from one another; for example: (1) The head moves downward without much sideward deviation in the following actions: * squatting * sitting down * bending down

Discriminating Similar Actions (cont’d) (2) The head moves upward without much sideward deviation in the following actions: * standing up * rising * getting up A number of heuristics are used to distinguish among these actions. e.g., when bending down, the head goes much lower than when sitting down.

Training A fixed CCD camera working at 2 frames per second was used to obtain the training sequences. People of diverse physical appearance were used to model the actions. Subjects were asked to perform the actions at a comfortable pace.

Training (cont’d) To train the system, 38 sequences were taken of each person performing all the actions of interest in both the frontal and lateral views. It was found that each action can be completed within 10 frames. Only the first 10 frames from each sequence were used for training/testing (i.e., 5 seconds)

Testing For testing, 39 sequences were used. Of the 39 sequences, 31 were classified correctly. Of the 8 sequences classified incorrectly, 6 were assigned to the correct action but to the wrong view.

Results (cont’d)

Practical Issues How would you find the first and last frames of an action in general (segmentation)? Is the system robust to recognizing an action from incomplete sequences (i.e., assuming that several frames are missing)? Current system is unable to recognize several actions at the same time.

Extension J. Usabiaga, G. Bebis, A. Erol, Mircea Nicolescu, and Monica Nicolescu, "Recognizing Simple Human Actions Using 3D Head Trajectories", Computational Intelligence , vol. 23, no. 4, pp. 484-496, 2007.

Case Study II J. Yang and A. Waibel, A Real-time Face Tracker, Proceedings of WACV'96, 1996.

Goal and Steps Goal Main Steps Build a system that can detect and track a person’s face while the person moves freely in a room. Main Steps (1) Detect arbitrary human faces in various environments using a generic skin-color model. (2) Track the face of interest by controlling the camera position and zoom. (3) Adapt skin-color model parameters based on individual appearance and lighting conditions.

System Components A probabilistic model to characterize skin-color distributions of human faces. A motion model to estimate human motion and to predict search window in the next frame. A camera model to predict camera motion (i.e., camera’s response was much slower than frame rate).

Search Window

Why Using Skin Color for Face Detection? Traditional systems performed face detection using template matching or facial features. Using skin-color leads to a faster and more robust approach compared to template matching or facial feature extraction.

Challenges Using Skin Color Human skin colors differ from person to person. The color representation of a face obtained by a camera is influenced by many factors (e.g., ambient light, motion etc.) Different cameras produce significantly different color values, even for the same person under the same lighting conditions.

Chromatic Color Space RGB is not the best color representation for characterizing skin-color (i.e., it represents not only color but also brightness). Represent skin-color in the chromatic space which is defined from the RGB space as follows: (the normalized blue component is redundant since r + g + b = 1)

Skin-Color Clustering Skin colors do not fall randomly in chromatic color space but form clusters at specific points.

Skin-Color Clustering (cont’d) Distributions of skin-colors of different people are clustered in chromatic color space i.e., they differ much less in color than in brightness (skin-color distribution of 40 people - different races)

Skin-Color Model Experiments (i.e., assuming different lighting conditions and different persons) have shown that the skin-color distribution has a regular shape. Idea: represent skin-color distribution using a Gaussian with mean μ and covariance Σ:

Parameter Estimation Select skin-color regions from a set of face images. Estimate the mean and covariance of skin-color distribution using the sample mean and covariance:

Face detection using the skin-color model Each pixel x in the input image is converted into the chromatic color space and compared with the distribution of the skin-color model.

Example

Dealing with skin-color-like objects It is impossible in general to detect only faces simply from the result of color matching e.g., background may contain skin colors

Dealing with skin-color-like objects (cont’d) Additional information should be used for rejecting false positives(e.g., geometric features, motion etc.)

Skin-color model adaptation If a person is moving, the apparent skin colors change as the person’s position relative to the camera or light changes. Idea: adapt model parameters to handle these changes.

Skin-color model adaptation (cont’d) N determines how long the past parameters will influence the current parameters. The weighting factors ai, bi, ci determine how much the past parameters will influence current parameters. μ 𝑟 = 𝑟 μ 𝑔 = 𝑔

System initialization Automatic mode A general skin-color model is used to identify skin-color regions. Motion and shape information is used to reject non-face regions. The largest face region is selected (face closest to the camera). Skin-color model is adapted to the face being tracked.

System initialization (cont’d) Interactive mode The user selects a point on the face of interest using the mouse. The tracker searches around the point to find the face using a general skin-color model. Skin-color model is adapted to the face being tracked.

Tracking Speed