Mean Shift Theory and Applications

Slides:



Advertisements
Similar presentations
Scale & Affine Invariant Interest Point Detectors Mikolajczyk & Schmid presented by Dustin Lennon.
Advertisements

CSCE643: Computer Vision Mean-Shift Object Tracking Jinxiang Chai Many slides from Yaron Ukrainitz & Bernard Sarel & Robert Collins.
Image Repairing: Robust Image Synthesis by Adaptive ND Tensor Voting IEEE Computer Society Conference on Computer Vision and Pattern Recognition Jiaya.
November 12, 2013Computer Vision Lecture 12: Texture 1Signature Another popular method of representing shape is called the signature. In order to compute.
The SIFT (Scale Invariant Feature Transform) Detector and Descriptor
Introduction To Tracking
Multiple People Detection and Tracking with Occlusion Presenter: Feifei Huo Supervisor: Dr. Emile A. Hendriks Dr. A. H. J. Stijn Oomes Information and.
Segmentation (2): edge detection
1Ellen L. Walker Edges Humans easily understand “line drawings” as pictures.
Instructor: Mircea Nicolescu Lecture 13 CS 485 / 685 Computer Vision.
Visual Recognition Tutorial
Grouping and Segmentation Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/01/10.
Image segmentation. The goals of segmentation Group together similar-looking pixels for efficiency of further processing “Bottom-up” process Unsupervised.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Mean Shift A Robust Approach to Feature Space Analysis Kalyan Sunkavalli 04/29/2008 ES251R.
Motion Detection And Analysis Michael Knowles Tuesday 13 th January 2004.
Mean-Shift Algorithm and Its Application Bohyung Han
Vision Topics Seminar Mean Shift
Segmentation Divide the image into segments. Each segment:
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2005 with a lot of slides stolen from Steve Seitz and.
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2006 with a lot of slides stolen from Steve Seitz and.
Dorin Comaniciu Visvanathan Ramesh (Imaging & Visualization Dept., Siemens Corp. Res. Inc.) Peter Meer (Rutgers University) Real-Time Tracking of Non-Rigid.
Tal Mor  Create an automatic system that given an image of a room and a color, will color the room walls  Maintaining the original texture.
~5,617,000 population in each state
Image Segmentation Image segmentation is the operation of partitioning an image into a collection of connected sets of pixels. 1. into regions, which usually.
Image Segmentation Rob Atlas Nick Bridle Evan Radkoff.
Chapter 5 Segmentation Presenter: 蘇唯誠
Segmentation and Grouping Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/23/10.
Chapter 2. Image Analysis. Image Analysis Domains Frequency Domain Spatial Domain.
Methods in Medical Image Analysis Statistics of Pattern Recognition: Classification and Clustering Some content provided by Milos Hauskrecht, University.
Computer vision.
Computer Vision James Hays, Brown
National Center for Supercomputing Applications University of Illinois at Urbana-Champaign Image Features Kenton McHenry, Ph.D. Research Scientist.
Mean Shift : A Robust Approach Toward Feature Space Analysis - Bayesian Background Modeling
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
Mean-shift and its application for object tracking
Niloy J. Mitra Leonidas J. Guibas Mark Pauly TU Vienna Stanford University ETH Zurich SIGGRAPH 2007.
1 Mean shift and feature selection ECE 738 course project Zhaozheng Yin Spring 2005 Note: Figures and ideas are copyrighted by original authors.
Mean Shift Theory and Applications Reporter: Zhongping Ji.
Lecture 06 06/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
7.1. Mean Shift Segmentation Idea of mean shift:
ALIGNMENT OF 3D ARTICULATE SHAPES. Articulated registration Input: Two or more 3d point clouds (possibly with connectivity information) of an articulated.
CSE 185 Introduction to Computer Vision Pattern Recognition 2.
報告人 : 林福城 指導老師 : 陳定宏 1 From Res. Center of Intell. Transp. Syst., Beijing Univ. of Technol., Beijing, China By Zhe Liu ; Yangzhou Chen ; Zhenlong Li Appears.
EECS 274 Computer Vision Segmentation by Clustering II.
Non-Euclidean Example: The Unit Sphere. Differential Geometry Formal mathematical theory Work with small ‘patches’ –the ‘patches’ look Euclidean Do calculus.
CS654: Digital Image Analysis Lecture 25: Hough Transform Slide credits: Guillermo Sapiro, Mubarak Shah, Derek Hoiem.
Grouping and Segmentation
CS654: Digital Image Analysis Lecture 30: Clustering based Segmentation Slides are adapted from:
Extracting features from spatio-temporal volumes (STVs) for activity recognition Dheeraj Singaraju Reading group: 06/29/06.
Digital Image Processing
Boosted Particle Filter: Multitarget Detection and Tracking Fayin Li.
EECS 274 Computer Vision Tracking. Motivation: Obtain a compact representation from an image/motion sequence/set of tokens Should support application.
Real-Time Tracking with Mean Shift Presented by: Qiuhua Liu May 6, 2005.
Feature extraction: Corners and blobs. Why extract features? Motivation: panorama stitching We have two images – how do we combine them?
Image Segmentation Shengnan Wang
Mean Shift ; Theory and Applications Presented by: Reza Hemati دی 89 December گروه بینایی ماشین و پردازش تصویر Machine Vision and Image Processing.
研 究 生:周暘庭 Q36994477 電腦與通信工程研究所 通訊與網路組 指導教授 :楊家輝 Mean-Shift-Based Color Tracking in Illuminance Change.
Visual Tracking by Cluster Analysis Arthur Pece Department of Computer Science University of Copenhagen
Instructor: Mircea Nicolescu Lecture 7
Lecture 30: Segmentation CS4670 / 5670: Computer Vision Noah Snavely From Sandlot ScienceSandlot Science.
ICCV 2007 National Laboratory of Pattern Recognition Institute of Automation Chinese Academy of Sciences Half Quadratic Analysis for Mean Shift: with Extension.
Blob detection.
Course Introduction to Medical Imaging Segmentation 1 – Mean Shift and Graph-Cuts Guy Gilboa.
Edge Detection using Mean Shift Smoothing
A segmentation and tracking algorithm
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
The SIFT (Scale Invariant Feature Transform) Detector and Descriptor
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Mean Shift Theory and Applications
Presentation transcript:

Mean Shift Theory and Applications Yaron Ukrainitz & Bernard Sarel

Agenda Mean Shift Theory What is Mean Shift ? Density Estimation Methods Deriving the Mean Shift Mean shift properties Applications Clustering Discontinuity Preserving Smoothing Object Contour Detection Segmentation Object Tracking

Mean Shift Theory

Intuitive Description Region of interest Center of mass Mean Shift vector Objective : Find the densest region Distribution of identical billiard balls

Intuitive Description Region of interest Center of mass Mean Shift vector Objective : Find the densest region Distribution of identical billiard balls

Intuitive Description Region of interest Center of mass Mean Shift vector Objective : Find the densest region Distribution of identical billiard balls

Intuitive Description Region of interest Center of mass Mean Shift vector Objective : Find the densest region Distribution of identical billiard balls

Intuitive Description Region of interest Center of mass Mean Shift vector Objective : Find the densest region Distribution of identical billiard balls

Intuitive Description Region of interest Center of mass Mean Shift vector Objective : Find the densest region Distribution of identical billiard balls

Intuitive Description Region of interest Center of mass Objective : Find the densest region Distribution of identical billiard balls

Density GRADIENT Estimation What is Mean Shift ? A tool for: Finding modes in a set of data samples, manifesting an underlying probability density function (PDF) in RN PDF in feature space Color space Scale space Actually any feature space you can conceive … Non-parametric Density Estimation Data Discrete PDF Representation Non-parametric Density GRADIENT Estimation (Mean Shift) PDF Analysis

Non-Parametric Density Estimation Assumption : The data points are sampled from an underlying PDF Data point density implies PDF value ! Assumed Underlying PDF Real Data Samples

Non-Parametric Density Estimation Assumed Underlying PDF Real Data Samples

Non-Parametric Density Estimation ? Assumed Underlying PDF Real Data Samples

Parametric Density Estimation Assumption : The data points are sampled from an underlying PDF Estimate Assumed Underlying PDF Real Data Samples

Kernel Density Estimation Parzen Windows - General Framework A function of some finite number of data points x1…xn Data Kernel Properties: Normalized Symmetric Exponential weight decay ???

Kernel Density Estimation Parzen Windows - Function Forms A function of some finite number of data points x1…xn Data In practice one uses the forms: or Same function on each dimension Function of vector length only

Kernel Density Estimation Various Kernels A function of some finite number of data points x1…xn Data Examples: Epanechnikov Kernel Uniform Kernel Normal Kernel

Kernel Density Estimation Gradient Give up estimating the PDF ! Estimate ONLY the gradient Using the Kernel form: We get : Size of window

Kernel Density Estimation Computing The Mean Shift Gradient

Computing The Mean Shift Yet another Kernel density estimation ! Simple Mean Shift procedure: Compute mean shift vector Translate the Kernel window by m(x)

Mean Shift Mode Detection What happens if we reach a saddle point ? Perturb the mode position and check if we return back Updated Mean Shift Procedure: Find all modes using the Simple Mean Shift Procedure Prune modes by perturbing them (find saddle points and plateaus) Prune nearby – take highest mode in the window

Mean Shift Properties Automatic convergence speed – the mean shift vector size depends on the gradient itself. Near maxima, the steps are small and refined Convergence is guaranteed for infinitesimal steps only  infinitely convergent, (therefore set a lower bound) For Uniform Kernel ( ), convergence is achieved in a finite number of steps Normal Kernel ( ) exhibits a smooth trajectory, but is slower than Uniform Kernel ( ). Adaptive Gradient Ascent

Real Modality Analysis Tessellate the space with windows Run the procedure in parallel

Real Modality Analysis The blue data points were traversed by the windows towards the mode

Real Modality Analysis An example Window tracks signify the steepest ascent directions

Adaptive Mean Shift

Mean Shift Strengths & Weaknesses Application independent tool Suitable for real data analysis Does not assume any prior shape (e.g. elliptical) on data clusters Can handle arbitrary feature spaces Only ONE parameter to choose h (window size) has a physical meaning, unlike K-Means Weaknesses : The window size (bandwidth selection) is not trivial Inappropriate window size can cause modes to be merged, or generate additional “shallow” modes  Use adaptive window size

Mean Shift Applications

Clustering Cluster : All data points in the attraction basin of a mode Attraction basin : the region for which all trajectories lead to the same mode Mean Shift : A robust Approach Toward Feature Space Analysis, by Comaniciu, Meer

Clustering Synthetic Examples Simple Modal Structures Complex Modal Structures

Clustering Real Example Feature space: L*u*v representation Initial window centers Modes found Modes after pruning Final clusters

Clustering Real Example L*u*v space representation

Clustering Real Example 2D (L*u) space representation Final clusters Not all trajectories in the attraction basin reach the same mode

Discontinuity Preserving Smoothing Feature space : Joint domain = spatial coordinates + color space Meaning : treat the image as data points in the spatial and gray level domain Image Data (slice) Mean Shift vectors Smoothing result Mean Shift : A robust Approach Toward Feature Space Analysis, by Comaniciu, Meer

Discontinuity Preserving Smoothing x y z The image gray levels… … can be viewed as data points in the x, y, z space (joined spatial And color space)

Discontinuity Preserving Smoothing Flat regions induce the modes ! y z

Discontinuity Preserving Smoothing The effect of window size in spatial and range spaces

Discontinuity Preserving Smoothing Example

Discontinuity Preserving Smoothing Example

Object Contour Detection Ray Propagation Accurately segment various objects (rounded in nature) in medical images Vessel Detection by Mean Shift Based Ray Propagation, by Tek, Comaniciu, Williams

Object Contour Detection Ray Propagation Use displacement data to guide ray propagation Discontinuity preserving smoothing Displacement vectors Vessel Detection by Mean Shift Based Ray Propagation, by Tek, Comaniciu, Williams

Object Contour Detection Ray Propagation Speed function Normal to the contour Curvature

Object Contour Detection Gray levels along red line Gray levels after smoothing Original image Displacement vectors Displacement vectors’ derivative

Object Contour Detection Example

Object Contour Detection Example Importance of smoothing by curvature

Segmentation Segment = Cluster, or Cluster of Clusters Algorithm: Run Filtering (discontinuity preserving smoothing) Cluster the clusters which are closer than window size Image Data (slice) Mean Shift vectors Smoothing result Segmentation result Mean Shift : A robust Approach Toward Feature Space Analysis, by Comaniciu, Meer http://www.caip.rutgers.edu/~comanici

Segmentation Example …when feature space is only gray levels…

Segmentation Example

Segmentation Example

Segmentation Example

Segmentation Example

Segmentation Example

Segmentation Example

Non-Rigid Object Tracking … …

Non-Rigid Object Tracking Real-Time Object-Based Video Compression Surveillance Driver Assistance

Mean-Shift Object Tracking General Framework: Target Representation Choose a reference model in the current frame Choose a feature space Represent the model in the chosen feature space Current frame …

Mean-Shift Object Tracking General Framework: Target Localization Start from the position of the model in the current frame Search in the model’s neighborhood in next frame Find best candidate by maximizing a similarity func. Repeat the same process in the next pair of frames Current frame … Model Candidate

Mean-Shift Object Tracking Target Representation Choose a reference target model Represent the model by its PDF in the feature space Quantized Color Space Choose a feature space Kernel Based Object Tracking, by Comaniniu, Ramesh, Meer

Mean-Shift Object Tracking PDF Representation Target Model (centered at 0) Target Candidate (centered at y) Similarity Function:

Mean-Shift Object Tracking Smoothness of Similarity Function Problem: Target is represented by color info only Spatial info is lost Large similarity variations for adjacent locations f is not smooth Gradient-based optimizations are not robust Solution: Mask the target with an isotropic kernel in the spatial domain f(y) becomes smooth in y

Mean-Shift Object Tracking Finding the PDF of the target model model y candidate Target pixel locations A differentiable, isotropic, convex, monotonically decreasing kernel Peripheral pixels are affected by occlusion and background interference The color bin index (1..m) of pixel x Probability of feature u in model Probability of feature u in candidate Normalization factor Pixel weight Normalization factor Pixel weight

Mean-Shift Object Tracking Similarity Function Target model: Target candidate: Similarity function: 1 The Bhattacharyya Coefficient

Mean-Shift Object Tracking Target Localization Algorithm Start from the position of the model in the current frame Search in the model’s neighborhood in next frame Find best candidate by maximizing a similarity func.

Mean-Shift Object Tracking Approximating the Similarity Function Model location: Candidate location: Linear approx. (around y0) Independent of y Density estimate! (as a function of y)

Mean-Shift Object Tracking Maximizing the Similarity Function The mode of = sought maximum Important Assumption: One mode in the searched neighborhood The target representation provides sufficient discrimination

Mean-Shift Object Tracking Applying Mean-Shift The mode of = sought maximum Original Mean-Shift: Find mode of using Extended Mean-Shift: Find mode of using

Mean-Shift Object Tracking About Kernels and Profiles A special class of radially symmetric kernels: The profile of kernel K Extended Mean-Shift: Find mode of using

Mean-Shift Object Tracking Choosing the Kernel A special class of radially symmetric kernels: Epanechnikov kernel: Uniform kernel: Extended Mean-Shift:

Mean-Shift Object Tracking Adaptive Scale Problem: The scale of the target changes in time The scale (h) of the kernel must be adapted Solution: Run localization 3 times with different h Choose h that achieves maximum similarity

Mean-Shift Object Tracking Results Feature space: 161616 quantized RGB Target: manually selected on 1st frame Average mean-shift iterations: 4

Mean-Shift Object Tracking Results Partial occlusion Distraction Motion blur

Mean-Shift Object Tracking Results

Mean-Shift Object Tracking Results Feature space: 128128 quantized RG

Mean-Shift Object Tracking The Scale Selection Problem Kernel too big Poor localization h mustn’t get too big or too small! Kernel too small Problem: In uniformly colored regions, similarity is invariant to h Smaller h may achieve better similarity Nothing keeps h from shrinking too small!

Tracking Through Scale Space Motivation Spatial localization for several scales Simultaneous localization in space and scale Previous method This method Mean-shift Blob Tracking through Scale Space, by R. Collins

Lindeberg’s Theory Selecting the best scale for describing image features x y σ Scale-space representation Differential operator applied 50 strongest responses

Lindeberg’s Theory The Laplacian operator for selecting blob-like features 2D LOG filter with scale σ Laplacian of Gaussian (LOG) Scale-space representation x y σ 3D scale-space representation Best features are at (x,σ) that maximize L

Lindeberg’s Theory Multi-Scale Feature Selection Process 250 strongest responses (Large circle = large scale) Maximize 3D scale-space function Convolve Original Image

Tracking Through Scale Space Approximating LOG using DOG 2D LOG filter with scale σ 2D DOG filter with scale σ 2D Gaussian with μ=0 and scale σ 2D Gaussian with μ=0 and scale 1.6σ Why DOG? Gaussian pyramids are created faster Gaussian can be used as a mean-shift kernel DOG filters at multiple scales 3D spatial kernel Scale-space filter bank

Tracking Through Scale Space Using Lindeberg’s Theory Weight image 3D spatial kernel (DOG) Centered at current location and scale 1D scale kernel (Epanechnikov) 3D scale-space representation Modes are blobs in the scale-space neighborhood Model: Candidate: Color bin: at Pixel weight: Recall: Need a mean-shift procedure that finds local modes in E(x,σ) The likelihood that each candidate pixel belongs to the target

Tracking Through Scale Space Example Image of 3 blobs A slice through the 3D scale-space representation

Tracking Through Scale Space Applying Mean-Shift Use interleaved spatial/scale mean-shift Spatial stage: Scale stage: Fix σ and look for the best x Fix x and look for the best σ x σ x0 σ0 xopt σopt Iterate stages until convergence of x and σ

Tracking Through Scale Space Results Fixed-scale ± 10% scale adaptation Tracking through scale space

Thank You