Associative Hierarchical CRFs for Object Class Image Segmentation Ľubor Ladický 1 1 Oxford Brookes University 2 Microsoft Research Cambridge Based on the.

Slides:

Advertisements

Similar presentations

Using Strong Shape Priors for Multiview Reconstruction Yunda SunPushmeet Kohli Mathieu BrayPhilip HS Torr Department of Computing Oxford Brookes University.

Advertisements

POSE–CUT Simultaneous Segmentation and 3D Pose Estimation of Humans using Dynamic Graph Cuts Mathieu Bray Pushmeet Kohli Philip H.S. Torr Department of.

Mean-Field Theory and Its Applications In Computer Vision1 1.

O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.

Mean-Field Theory and Its Applications In Computer Vision2

Indoor Segmentation and Support Inference from RGBD Images Nathan Silberman, Derek Hoiem, Pushmeet Kohli, Rob Fergus.

Semantic Contours from Inverse Detectors Bharath Hariharan et.al. (ICCV-11)

The Layout Consistent Random Field for detecting and segmenting occluded objects CVPR, June 2006 John Winn Jamie Shotton.

Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.

Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.

EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.

Ľubor Ladický1 Phil Torr2 Andrew Zisserman1

Joint Optimisation for Object Class Segmentation and Dense Stereo Reconstruction Ľubor Ladický, Paul Sturgess, Christopher Russell, Sunando Sengupta, Yalin.

Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.

Learning to Combine Bottom-Up and Top-Down Segmentation Anat Levin and Yair Weiss School of CS&Eng, The Hebrew University of Jerusalem, Israel.

Scene Labeling Using Beam Search Under Mutex Constraints ID: O-2B-6 Anirban Roy and Sinisa Todorovic Oregon State University 1.

Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of.

ICCV 2007 tutorial Part III Message-passing algorithms for energy minimization Vladimir Kolmogorov University College London.

Efficient Inference for Fully-Connected CRFs with Stationarity

Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.

Graph-based image segmentation Václav Hlaváč Czech Technical University in Prague Faculty of Electrical Engineering Department of Cybernetics Prague, Czech.

GraphCut-based Optimisation for Computer Vision Ľubor Ladický.

Biased Normalized Cuts 1 Subhransu Maji and Jithndra Malik University of California, Berkeley IEEE Conference on Computer Vision and Pattern Recognition.

Simultaneous Segmentation and 3D Pose Estimation of Humans or Detection + Segmentation = Tracking? Philip H.S. Torr Pawan Kumar, Pushmeet Kohli, Matt Bray.

Models for Scene Understanding – Global Energy models and a Style-Parameterized boosting algorithm (StyP-Boost) Jonathan Warrell, 1 Simon Prince, 2 Philip.

Beyond Actions: Discriminative Models for Contextual Group Activities Tian Lan School of Computing Science Simon Fraser University August 12, 2010 M.Sc.

1. Introduction Humanising GrabCut: Learning to segment humans using the Kinect Varun Gulshan, Victor Lempitksy and Andrew Zisserman Dept. of Engineering.

Robust Higher Order Potentials For Enforcing Label Consistency

Schedule Introduction Models: small cliques and special potentials Tea break Inference: Relaxation techniques:

LARGE-SCALE NONPARAMETRIC IMAGE PARSING Joseph Tighe and Svetlana Lazebnik University of North Carolina at Chapel Hill CVPR 2011Workshop on Large-Scale.

P 3 & Beyond Solving Energies with Higher Order Cliques Pushmeet Kohli Pawan Kumar Philip H. S. Torr Oxford Brookes University CVPR 2007.

Region-based Voting Exemplar 1 Query 1 Exemplar 2.

Improved Moves for Truncated Convex Models M. Pawan Kumar Philip Torr.

TextonBoost : Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton*, J. Winn†, C. Rother†, and A.

Oxford Brookes Seminar Thursday 3 rd September, 2009 University College London1 Representing Object-level Knowledge for Segmentation and Image Parsing:

Efficiently Solving Convex Relaxations M. Pawan Kumar University of Oxford for MAP Estimation Philip Torr Oxford Brookes University.

The Layout Consistent Random Field for Recognizing and Segmenting Partially Occluded Objects By John Winn & Jamie Shotton CVPR 2006 presented by Tomasz.

Recovering Articulated Object Models from 3D Range Data Dragomir Anguelov Daphne Koller Hoi-Cheung Pang Praveen Srinivasan Sebastian Thrun Computer Science.

Graph Cut based Inference with Co-occurrence Statistics Ľubor Ladický, Chris Russell, Pushmeet Kohli, Philip Torr.

What Energy Functions Can be Minimized Using Graph Cuts? Shai Bagon Advanced Topics in Computer Vision June 2010.

An Iterative Optimization Approach for Unified Image Segmentation and Matting Hello everyone, my name is Jue Wang, I’m glad to be here to present our paper.

Relaxations and Moves for MAP Estimation in MRFs M. Pawan Kumar STANFORDSTANFORD Vladimir KolmogorovPhilip TorrDaphne Koller.

Measuring Uncertainty in Graph Cut Solutions Pushmeet Kohli Philip H.S. Torr Department of Computing Oxford Brookes University.

Extensions of submodularity and their application in computer vision

Hierarchical Subquery Evaluation for Active Learning on a Graph Oisin Mac Aodha, Neill Campbell, Jan Kautz, Gabriel Brostow CVPR 2014 University College.

What, Where & How Many? Combining Object Detectors and CRFs

Graph-based Segmentation

Object Detection Sliding Window Based Approach Context Helps

MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/24/10.

City University of Hong Kong 18 th Intl. Conf. Pattern Recognition Self-Validated and Spatially Coherent Clustering with NS-MRF and Graph Cuts Wei Feng.

“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)

Multiplicative Bounds for Metric Labeling M. Pawan Kumar École Centrale Paris Joint work with Phil Torr, Daphne Koller.

Probabilistic Inference Lecture 5 M. Pawan Kumar Slides available online

Associative Hierarchical CRFs for Object Class Image Segmentation

O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.

Presenter ： Kuang-Jui Hsu Date ： 2011/3/24(Thur.).

CVPR 2013 Diversity Tutorial Beyond MAP: Making Multiple Predictions: Diversity, DPPs and more. Dhruv Batra Virginia Tech Alex Kulesza Univ. of Michigan.

A Dynamic Conditional Random Field Model for Object Segmentation in Image Sequences Duke University Machine Learning Group Presented by Qiuhua Liu March.

1 Scale and Rotation Invariant Matching Using Linearly Augmented Tree Hao Jiang Boston College Tai-peng Tian, Stan Sclaroff Boston University.

CS654: Digital Image Analysis Lecture 28: Advanced topics in Image Segmentation Image courtesy: IEEE, IJCV.

Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.

Image segmentation.

Gaussian Conditional Random Field Network for Semantic Segmentation

Krishna Kumar Singh, Yong Jae Lee University of California, Davis

Semantic Object and Instance Segmentation

Nonparametric Semantic Segmentation

Efficient Graph Cut Optimization for Full CRFs with Quantized Edges

Learning to Combine Bottom-Up and Top-Down Segmentation

Liyuan Li, Jerry Kah Eng Hoe, Xinguo Yu, Li Dong, and Xinqi Chu

“Traditional” image segmentation

Presentation transcript:

Associative Hierarchical CRFs for Object Class Image Segmentation Ľubor Ladický 1 1 Oxford Brookes University 2 Microsoft Research Cambridge Based on the work done with : Chris Russell 1, Pushmeet Kohli 1,2, Paul Sturgess 1, Karteek Alahari 1, Philip H.S. Torr 1

Overview Object-class Segmentation problem Datasets MSRC dataset VOC2009 dataset CamVid dataset Standard approaches Pairwise CRF over Pixels Pairwise CRF over Segments Detection-driven Segmentation Model Robust-P N model Associative Hierarchical CRF Detectors in AHCRF Label co-occurrence in CRFs Potential training Inference Results Discussion General overviewOur approach

Overview Object-class Segmentation problem Datasets MSRC dataset VOC2009 dataset CamVid dataset Standard approaches Pairwise CRF over Pixels Pairwise CRF over Segments Detection-driven Segmentation Model Robust-P N model Associative Hierarchical CRF Detectors in AHCRF Label co-occurrence in CRFs Potential training Inference Results Discussion General overviewOur approach

Object-class Segmentation problem MSRC ImageOur Result Aims to assign a label for each pixel of an image Classifier trained on the training set Performance evaluated on never seen test set

Overview Object-class Segmentation problem Datasets MSRC dataset VOC2009 dataset CamVid dataset Standard approaches Pairwise CRF over Pixels Pairwise CRF over Segments Detection-driven Segmentation Model Robust-P N model Associative Hierarchical CRF Detectors in AHCRF Label co-occurrence in CRFs Potential training Inference Results Discussion General overviewOur approach

Datasets MSRC Dataset 591 images 320 x classes MSRC Image Ground Truth

Datasets VOC2009 Dataset 1499 training test images 500 x foreground classes + 1 background class VOC Image Ground Truth

Datasets CamVid Dataset 367 training test video frames 960 x classes CamVid Image Ground Truth

Overview Object-class Segmentation problem Datasets MSRC dataset VOC2009 dataset CamVid dataset Standard approaches Pairwise CRF over Pixels Pairwise CRF over Segments Detection-driven Segmentation Model Robust-P N model Associative Hierarchical CRF Detectors in AHCRF Label co-occurrence in CRFs Potential training Inference Results Discussion General overviewOur approach

Pairwise CRF over Pixels CRF Shotton ECCV06

Pairwise CRF over Pixels No quantization errors Lacks long range interactions Results oversmoothed Better performance on CamVid & MSRC datasets

Pairwise CRF over Segments Yang et al. CVPR07, Batra et al. CVPR08, Shi, Malik PAMI2000, Comaniciu, Meer PAMI2002, Felzenschwalb, Huttenlocher, IJCV2004

Pairwise CRF over Segments Allows long range interactions Better performance for VOC dataset Can not recover from incorrect segmentation Impossible to obtain perfect unsupervised segmentation

Pairwise CRF over Segments Allows long range interactions Better performance for VOC dataset Can not recover from incorrect segmentation Impossible to obtain perfect unsupervised segmentation

Detection-driven Segmentation Larlus et al. CVPR08, Felzenszwalb et al. CVPR08, Vedaldi et al. ICCV09 GrabCut Detector

Detection-driven Segmentation The best performance for VOC dataset Can not be used for background classes Can not recover from incorrect detection

Overview Object-class Segmentation problem Datasets MSRC dataset VOC2009 dataset CamVid dataset Standard approaches Pairwise CRF over Pixels Pairwise CRF over Segments Detection-driven Segmentation Model Robust-P N model Associative Hierarchical CRF Detectors in AHCRF Label co-occurrence in CRFs Potential training Inference Results Discussion General overviewOur approach

Robust P N approach Kohli, Ladický, Torr CVPR08

Robust P N approach Segment consistency as a weak constraint Robust to misleading segmentations Allows multiple overlapping segmentations General formulation not used in application Unary and pairwise potentials only at the pixel level

Robust P N reformulation Ladický, Russell, Kohli, Torr ICCV09

Old formulation of Robust P N potential is equivalent to pairwise formulation where Robust P N reformulation Ladický, Russell, Kohli, Torr ICCV09

Associative Hierarchical CRF Allows unary potentials for region variables Allows pairwise potentials for region variables Allows multiple layers and multiple hierarchies Zhu et al. NIPS2008, Lim et al. ICCV2009, Hinton 2002

Analysis of the new model Ladický, Russell, Kohli, Torr ICCV09 Let's have one segmentation and potentials only over segment level

Analysis of the new model Let's have one segmentation and potentials only over segment level Energy of two pixels from the same clique is symmetric and semi-metric Minimum will be segment-consistent The cost of every segment consistent labelling is the same as the cost of the pairwise CRF labelling over segments Equivalent to pairwise CRF over segments Ladický, Russell, Kohli, Torr ICCV09

Associative Hierarchical CRF Merges information over multiple scales Allows multiple hierarchies Allows long range interactions Easy to learn weights Interlayer connection limited(?) to associative relationship Ladický, Russell, Kohli, Torr ICCV09

Comparison to other methods Ladický, Russell, Kohli, Torr ICCV09 Robust P N Segment CRFHierarchical CRF

Detectors in Associative Hierarchical CRFs Ladický, Alahari, Sturgess, Russell, Torr CVPR10 (submitted) GrabCutDetector Detector potential takes the form where

Detectors in Associative Hierarchical CRFs Ladický, Alahari, Sturgess, Russell, Torr CVPR10 (submitted) Special 2-label case of Robust P N Allows multiple overlapping detections Can recover from incorrect detection Same inference methods may be applied as for Associative Hierarchical CRFs We can distinguish between different instances of objects in the final result

Label co-occurrence in CRFs Ladický, Russell, Kohli, Torr CVPR10 (submitted) Possible labelling ? Why not ?

Label co-occurrence in CRFs Ladický, Russell, Kohli, Torr CVPR10 (submitted) Our requirements Global Energy function – no hard decisions Invariance to number of pixels taking given label Efficiency – should be tractable Standard approaches Hard decisions in the preprocessing step (Csurka et al. BMVC08) Unary potential (Torralba et al. ICCV03) Pairwise potential (Rabinovich et al. ICCV07, CVPR08) Only solution : E C = C(L(x))

Overview Object-class Segmentation problem Datasets MSRC dataset VOC2009 dataset CamVid dataset Standard approaches Pairwise CRF over Pixels Pairwise CRF over Segments Detection-driven Segmentation Model Robust-P N model Associative Hierarchical CRF Detectors in AHCRF Label co-occurrence in CRFs Potential training Inference Results Discussion General overviewOur approach

Multi-feature Boosted Unary Pixel Potential Ladický, Russell, Kohli, Torr ICCV09 Sturgess, Alahari, Ladický, Torr BMVC09 Unary likelihoods based on spatial configuration (Shotton et al. ECCV06) Classifier trained using boosting

Multi-feature Boosted Unary Segment Potential Ladický, Russell, Kohli, Torr ICCV09 Classifier trained using boosting

Other Potentials Ladický, Russell, Kohli, Torr ICCV09 Ladický, Russell, Kohli, Torr CVPR10 (submitted) Ladický, Alahari, Sturgess, Russell, Torr CVPR10 (submitted) Intensity dependent pairwise pixel potential Colour EMD-distance dependent pairwise segment potential Detector potentials based on off-the-shelf detectors (Felzenszwalb et al. CVPR08, Vedaldi et al. ICCV09) Generatively trained Co-occurence potential

Overview Object-class Segmentation problem Datasets MSRC dataset VOC2009 dataset CamVid dataset Standard approaches Pairwise CRF over Pixels Pairwise CRF over Segments Detection-driven Segmentation Model Robust-P N model Associative Hierarchical CRF Detectors in AHCRF Label co-occurrence in CRFs Potential training Inference Results Discussion General overviewOur approach

Inference over Hierarchical CRF Russell, Ladický, Kohli, Torr AISTATS10 (submitted)

Inference over Hierarchical CRF Problem is NP-hard Any message passing algorithm (TRW-S, BP,..) or ICM can be applied to pairwise model αβ-swap (potentials must be semi-metric) Ishikawa construction over (α-F-β transition) αexpansion (potentials must be metric) Reparametrization of interlayer connection to metric potential Ishikawa construction over (α-F-old transition) For more details read our technical report Russell, Ladický, Kohli, Torr AISTATS10 (submitted)

Inference over Hierarchical CRF Problem is NP-hard Any message passing algorithm (TRW-S, BP,..), ICM or LP relaxation can be applied to pairwise model αβ-swap (potentials must be semi-metric) Ishikawa construction over (α-F-β transition) αexpansion (potentials must be metric) Reparametrization of interlayer connection to metric potential Ishikawa construction over (α-F-old transition) For more details read our technical report Russell, Ladický, Kohli, Torr AISTATS10 (submitted)

Inference over Hierarchical CRF Problem is NP-hard Any message passing algorithm (TRW-S, BP,..), ICM or LP relaxation can be applied to pairwise model αβ-swap (potentials must be semi-metric) Ishikawa construction over (α-F-β transition) αexpansion (potentials must be metric) Reparametrization of interlayer connection to metric potential Ishikawa construction over (α-F-old transition) For more details read our technical report Russell, Ladický, Kohli, Torr AISTATS10 (submitted)

Inference over Hierarchical CRF Problem is NP-hard Any message passing algorithm (TRW-S, BP,..), ICM or LP relaxation can be applied to pairwise model αβ-swap (potentials must be semi-metric) Ishikawa construction over (α-F-β transition) α-expansion (potentials must be metric) Reparametrization of interlayer connection to metric potential Ishikawa construction over (α-F-old transition) Russell, Ladický, Kohli, Torr AISTATS10 (submitted)

Inference over Hierarchical CRF Russell, Ladický, Kohli, Torr AISTATS10 (submitted)

Inference for Co-occurence Integer programming formulation possible using one variable for each subset of the label set LP relaxation applicable by relaxing IP program αβ-swap possible if the cooccurence cost C(L(x)) is monotonically increasing with respect to L(x) α-expansion approximate move possible if the cooccurence cost C(L(x)) is monotonically increasing with respect to L(x) Ladický, Russell, Kohli, Torr CVPR10 (submitted)

Overview Object-class Segmentation problem Datasets MSRC dataset VOC2009 dataset CamVid dataset Standard approaches Pairwise CRF over Pixels Pairwise CRF over Segments Detection-driven Segmentation Model Robust-P N model Associative Hierarchical CRF Detectors in AHCRF Label co-occurrence in CRFs Potential training Inference Results Discussion General overviewOur approach

VOC2009 Results

MSRC Results

Qualitative comparison of results without and with co-occurence Ladický, Russell, Kohli, Torr CVPR10 (submitted) MSRC Image Without COWith CO Without CO

CamVid Results Sturgess, Alahari Ladický, Torr BMVC09 Ladický, Alahari, Sturgess, Russell, Torr CVPR10 (submitted) Our Result CamVid Image Ground Truth Brostow et al.

Qualitative comparisons Quantitative comparison of methods on VOC2009 dataset Quantitative comparison of methods on MSRC dataset Quantitative comparison of methods on CamVid dataset

Take home message

Use our model

Take home message Use our model Use your favourite potentials

Take home message Use our model Use your favourite potentials Use your friend's favourite potentials

Take home message Use our model Use your favourite potentials Use your friend's favourite potentials Use your friend's friend's favourite potentials

Take home message Use our model Use your favourite potentials Use your friend's favourite potentials Use your friend's friend's favourite potentials Vision solved

Take home message Use our model Use your favourite potentials Use your friend's favourite potentials Use your friend's friend's favourite potentials Vision solved (..almost)

Thank you Questions?