Robust Higher Order Potentials For Enforcing Label Consistency

Slides:

Advertisements

Similar presentations

Using Strong Shape Priors for Multiview Reconstruction Yunda SunPushmeet Kohli Mathieu BrayPhilip HS Torr Department of Computing Oxford Brookes University.

Advertisements

POSE–CUT Simultaneous Segmentation and 3D Pose Estimation of Humans using Dynamic Graph Cuts Mathieu Bray Pushmeet Kohli Philip H.S. Torr Department of.

Mean-Field Theory and Its Applications In Computer Vision1 1.

O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.

Automatic Photo Pop-up Derek Hoiem Alexei A.Efros Martial Hebert Carnegie Mellon University.

HOPS: Efficient Region Labeling using Higher Order Proxy Neighborhoods Albert Y. C. Chen 1, Jason J. Corso 1, and Le Wang 2 1 Dept. of Computer Science.

Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.

Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.

Spectral graph reduction for image and streaming video segmentation Fabio Galasso 1 Margret Keuper 2 Thomas Brox 2 Bernt Schiele 1 1 Max Planck Institute.

Joint Optimisation for Object Class Segmentation and Dense Stereo Reconstruction Ľubor Ladický, Paul Sturgess, Christopher Russell, Sunando Sengupta, Yalin.

Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.

Scene Labeling Using Beam Search Under Mutex Constraints ID: O-2B-6 Anirban Roy and Sinisa Todorovic Oregon State University 1.

Recovering Human Body Configurations: Combining Segmentation and Recognition Greg Mori, Xiaofeng Ren, and Jitentendra Malik (UC Berkeley) Alexei A. Efros.

I Images as graphs Fully-connected graph – node for every pixel – link between every pair of pixels, p,q – similarity w ij for each link j w ij c Source:

The University of Ontario CS 4487/9687 Algorithms for Image Analysis Multi-Label Image Analysis Problems.

ICCV 2007 tutorial on Discrete Optimization Methods in Computer Vision part I Basic overview of graph cuts.

Efficient Inference for Fully-Connected CRFs with Stationarity

GrabCut Interactive Image (and Stereo) Segmentation Joon Jae Lee Keimyung University Welcome. I will present Grabcut – an Interactive tool for foreground.

GraphCut-based Optimisation for Computer Vision Ľubor Ladický.

Optimization & Learning for Registration of Moving Dynamic Textures Junzhou Huang 1, Xiaolei Huang 2, Dimitris Metaxas 1 Rutgers University 1, Lehigh University.

Biased Normalized Cuts 1 Subhransu Maji and Jithndra Malik University of California, Berkeley IEEE Conference on Computer Vision and Pattern Recognition.

Simultaneous Segmentation and 3D Pose Estimation of Humans or Detection + Segmentation = Tracking? Philip H.S. Torr Pawan Kumar, Pushmeet Kohli, Matt Bray.

Contextual Classification with Functional Max-Margin Markov Networks Dan MunozDrew Bagnell Nicolas VandapelMartial Hebert.

Models for Scene Understanding – Global Energy models and a Style-Parameterized boosting algorithm (StyP-Boost) Jonathan Warrell, 1 Simon Prince, 2 Philip.

Schedule Introduction Models: small cliques and special potentials Tea break Inference: Relaxation techniques:

LARGE-SCALE NONPARAMETRIC IMAGE PARSING Joseph Tighe and Svetlana Lazebnik University of North Carolina at Chapel Hill CVPR 2011Workshop on Large-Scale.

P 3 & Beyond Solving Energies with Higher Order Cliques Pushmeet Kohli Pawan Kumar Philip H. S. Torr Oxford Brookes University CVPR 2007.

Region-based Voting Exemplar 1 Query 1 Exemplar 2.

Improved Moves for Truncated Convex Models M. Pawan Kumar Philip Torr.

TextonBoost : Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton*, J. Winn†, C. Rother†, and A.

Oxford Brookes Seminar Thursday 3 rd September, 2009 University College London1 Representing Object-level Knowledge for Segmentation and Image Parsing:

Graph Cut based Inference with Co-occurrence Statistics Ľubor Ladický, Chris Russell, Pushmeet Kohli, Philip Torr.

What Energy Functions Can be Minimized Using Graph Cuts? Shai Bagon Advanced Topics in Computer Vision June 2010.

Relaxations and Moves for MAP Estimation in MRFs M. Pawan Kumar STANFORDSTANFORD Vladimir KolmogorovPhilip TorrDaphne Koller.

Automatic Photo Popup Derek Hoiem Alexei A. Efros Martial Hebert Carnegie Mellon University.

Measuring Uncertainty in Graph Cut Solutions Pushmeet Kohli Philip H.S. Torr Department of Computing Oxford Brookes University.

Perceptual Organization: Segmentation and Optical Flow.

What, Where & How Many? Combining Object Detectors and CRFs

Graph-based Segmentation

Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.

Image Renaissance Using Discrete Optimization Cédric AllèneNikos Paragios ENPC – CERTIS ESIEE – A²SI ECP - MAS France.

MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/31/15.

Surface Stereo with Soft Segmentation Michael Bleyer 1, Carsten Rother 2, Pushmeet Kohli 2 1 Vienna University of Technology, Austria 2 Microsoft Research.

Minimizing Sparse Higher Order Energy Functions of Discrete Variables (CVPR’09) Namju Kwak Applied Algorithm Lab. Computer Science Department KAIST 1Namju.

MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/24/10.

Recognition using Regions (Demo) Sudheendra V. Outline Generating multiple segmentations –Normalized cuts [Ren & Malik (2003)] Uniform regions –Watershed.

Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.

Multiplicative Bounds for Metric Labeling M. Pawan Kumar École Centrale Paris Joint work with Phil Torr, Daphne Koller.

Texture We would like to thank Amnon Drory for this deck הבהרה : החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.

Texture We would like to thank Amnon Drory for this deck הבהרה : החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.

Algorithms for MAP estimation in Markov Random Fields Vladimir Kolmogorov University College London.

Associative Hierarchical CRFs for Object Class Image Segmentation Ľubor Ladický 1 1 Oxford Brookes University 2 Microsoft Research Cambridge Based on the.

Associative Hierarchical CRFs for Object Class Image Segmentation

O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.

Tractable Higher Order Models in Computer Vision (Part II) Slides from Carsten Rother, Sebastian Nowozin, Pusohmeet Khli Microsoft Research Cambridge Presented.

Category Independent Region Proposals Ian Endres and Derek Hoiem University of Illinois at Urbana-Champaign.

Non-Ideal Iris Segmentation Using Graph Cuts

CS654: Digital Image Analysis Lecture 28: Advanced topics in Image Segmentation Image courtesy: IEEE, IJCV.

Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.

Image segmentation.

Holistic Scene Understanding Virginia Tech ECE /02/26 Stanislaw Antol.

MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/27/12.

Learning a Region-based Scene Segmentation Model

HFS: Hierarchical Feature Selection for Efficient Image Segmentation

Markov Random Fields with Efficient Approximations

Nonparametric Semantic Segmentation

Efficient Graph Cut Optimization for Full CRFs with Quantized Edges

Discrete Optimization Methods Basic overview of graph cuts

Graphical Models and Learning

“Traditional” image segmentation

Presentation transcript:

Robust Higher Order Potentials For Enforcing Label Consistency Pushmeet Kohli Microsoft Research Cambridge Lubor Ladicky Philip Torr Oxford Brookes University, Oxford CVPR 2008

Image labelling Problems Assign a label to each image pixel Geometry Estimation Image Denoising Object Segmentation Sky Building Tree Grass

Object Segmentation using CRFs (Shotton et al. ECCV 2006) CRF Energy Unary potentials based on Colour, Location and Texture features Encourages label consistency in adjacent pixels

Limitations of Pairwise CRFs Encourages short boundaries (Shrinkage bias) Can only enforce label consistency in adjacent pixels Inability to incorporate region based features Image Unary Potential MAP-CRF Solution

Label Consistency in Image Regions Pixels constituting some regions belong to Same plane (Orientation) (Hoiem, Efros, & Herbert, ICCV’05) Same object (Russel, Efros, Sivic, Freeman, & Zisserman, CVPR06) Image (MSRC) Segmentation (Mean shift)

Image labelling using segments Unsupervised Segmentation Object Labelling Geometric Context [Hoiem et al, ICCV05] Object Segmentation [He et al. ECCV06, Yang et al. CVPR07, Rabinovich et al. ICCV07, Batra et al. CVPR08] Interactive Video Segmentation [Wang, SIGGRAPH 2005 ] Not robust to Inconsistent Segments!

Our Higher Order CRF Model Encourages label consistency in regions Multiple Segmentations c

Label Consistency in Segments Encourages consistency within super-pixels Takes the form of a PN Potts model [Kohli et al. CVPR 2007] c

Label Consistency in Segments Encourages consistency within super-pixels Takes the form of a PN Potts model [Kohli et al. CVPR 2007] c Cost: 0

Label Consistency in Segments Encourages consistency within super-pixels Takes the form of a PN Potts model [Kohli et al. CVPR 2007] c Cost: f (|c|)

Label Consistency in Segments Encourages consistency within super-pixels Takes the form of a PN Potts model [Kohli et al. CVPR 2007] Does not distinguish between Good/Bad Segments ! c Cost: f (|c|)

Quality based Label Consistency Label inconsistency cost depends on segment quality How to measure quality G(c)? [Ren and Malik ICCV03, Rabinovich et al. ICCV07, many others] Colour and Texture Similarity Contour Energy Measure quality from variance in feature responses Higher order generalization of contrast-sensitive pairwise potential

Quality based Label Consistency Mean shift segmentation Segment Quality (darker is better) MSRC image

Robust Consistency Potentials gmax PN Potts 1 Too Rigid! Inconsistent Pixels gmax 1 T Robust Inconsistent Pixels

Robust Consistency Potentials Maximum Inconsistency Cost Number of Inconsistent Pixels Slope gmax 1 T Robust Inconsistent Pixels

Minimizing Higher order Energy Functions Message passing is computationally expensive High runtime and space complexity - O(LN) L = Number of Labels, N = Size of Clique Efficient BP for Higher Order MRFs [Lan et al. ECCV 06, Potetz CVPR 2007] 2x2 clique potentials for Image Denoising Take minutes per iteration (Hours to converge)

Minimizing Higher order Energy Functions Graph Cut based move making algorithm [Kohli et al. CVPR 2007] Can handle very high order energy functions Extremely efficient: computation time in the order of seconds Only applicable to some classes of functions (PN Potts) Cannot handle robust consistency potential This paper Can minimize a much larger class of higher order energy functions Same time complexity as [Kohli et al. CVPR 2007]

Move making algorithms Expansion and Swap move algorithms [Boykov Veksler and Zabih, PAMI 2001] Makes a series of changes to the solution (moves) Each move results in a solution with smaller energy Current Solution How to minimize move functions? Move to new solution Generate pseudo-boolean move function Minimize move function to get optimal move

Minimizing Move Functions using Graph Cuts Most pairwise CRF models used in Computer Vision lead to submodular move functions Second order Pseudo-boolean Function Minimization (submodular) st-mincut (Positive weights) Optimal moves can be found extremely efficiently using graphs cuts

Minimizing Higher Order Energy Functions: Our results We show that a large class of higher order potentials lead to higher order submodular move functions Can be minimized in polynomial time Submodular Function Minimization Minimizing general submodular functions is computationally expensive Complexity O(n6) Cannot handle large problems! Details in Technical Report

Minimizing Higher Order Energy Functions: Our results Minimizing Higher order functions using Graph cuts Higher order functions can be transformed to second order functions by adding auxillary variables Exponential number of auxillary variables needed in general Result 2 Our higher order functions can be transformed to second-order functions using ≤2 auxillary variables per potential. Can be minimized extremely efficiently Complexity << O(n6) Details in Technical Report

Overview of our Method + + Higher Order Energy Segmentation Solution Unary Potentials [Shotton et al. ECCV 2006] + Energy Minimization Contrast Sensitive Pairwise Potentials + Segmentation Solution Higher Order Potentials (Multiple Segmentations)

Experimental results Datasets: MSRC (21), Sowerby (7) [Shotton et al. ECCV 2006] [He et al. CVPR 04]

Qualitative Results Image (MSRC-21) Pairwise CRF Higher order CRF Ground Truth Grass Sheep

Qualitative Results (Contd..) Image (MSRC-21) Pairwise CRF Higher order CRF Ground Truth Results can be improved using image specific colour models Rother et al. SIGGRAPH 2004 Shotton et al. ECCV 2006

Quantitative Results: Problems Rough ground truth segmentations Fine structures have small influence on overall pixel accuracy

Generating Accurate Segmentations Generated accurate segmentation of 27 images 30 minutes per image Image (MSRC-21) Original Segmentation New Segmentation

Relationship between Qualitative and Quantitative Results Pairwise CRF Higher order CRF Ground Truth Image (MSRC-21) Overall Pixel Accuracy 95.8% 98.7% Small changes in pixel accuracy can lead to large improvements in segmentation results.

Quantitative Accuracy Measure accuracy in labelling boundary pixels. Accuracy evaluated in boundary bands of variable width Hand-labelled Segmentation Trimap (8-pixels) Trimap (16-pixels) Image (MSRC-21)

Quantitative Accuracy Measure accuracy in labelling boundary pixels. Accuracy evaluated in boundary bands of variable width

Conclusions Method to enforce label consistency in image regions Generalization of the commonly used Pairwise CRF model Allows integration of pixel and region level features for image labelling problems

Thanks

Number of Segmentations Running Time Results Time (sec) Number of Segmentations Inconsistency Cost

Qualitative Results (Contd..) Image (MSRC-21) Pairwise CRF Higher order CRF Ground Truth

Transformation to second order functions Auxiliary variables