Models for Scene Understanding – Global Energy models and a Style-Parameterized boosting algorithm (StyP-Boost) Jonathan Warrell, 1 Simon Prince, 2 Philip.

Slides:



Advertisements
Similar presentations
Using Strong Shape Priors for Multiview Reconstruction Yunda SunPushmeet Kohli Mathieu BrayPhilip HS Torr Department of Computing Oxford Brookes University.
Advertisements

POSE–CUT Simultaneous Segmentation and 3D Pose Estimation of Humans using Dynamic Graph Cuts Mathieu Bray Pushmeet Kohli Philip H.S. Torr Department of.
Mean-Field Theory and Its Applications In Computer Vision1 1.
Bayesian Belief Propagation
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
The Layout Consistent Random Field for detecting and segmenting occluded objects CVPR, June 2006 John Winn Jamie Shotton.
Learning Specific-Class Segmentation from Diverse Data M. Pawan Kumar, Haitherm Turki, Dan Preston and Daphne Koller at ICCV 2011 VGG reading group, 29.
Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.
Ľubor Ladický1 Phil Torr2 Andrew Zisserman1
Joint Optimisation for Object Class Segmentation and Dense Stereo Reconstruction Ľubor Ladický, Paul Sturgess, Christopher Russell, Sunando Sengupta, Yalin.
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Scene Labeling Using Beam Search Under Mutex Constraints ID: O-2B-6 Anirban Roy and Sinisa Todorovic Oregon State University 1.
Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
Semantic Texton Forests for Image Categorization and Segmentation We would like to thank Amnon Drory for this deck הבהרה : החומר המחייב הוא החומר הנלמד.
Shuai Zheng et al. Dense Semantic Image Segmentation with Objects and Attributes. IEEE CVPR /06/20141/6 Dense semantic image segmentation with objects.
Structured Hough Voting for Vision-based Highway Border Detection
INTRODUCTION Heesoo Myeong, Ju Yong Chang, and Kyoung Mu Lee Department of EECS, ASRI, Seoul National University, Seoul, Korea Learning.
Efficient Inference for Fully-Connected CRFs with Stationarity
LOCUS (Learning Object Classes with Unsupervised Segmentation) A variational approach to learning model- based segmentation. John Winn Microsoft Research.
Analyzing Semantic Segmentation Using Hybrid Human-Machine CRFs Roozbeh Mottaghi 1, Sanja Fidler 2, Jian Yao 2, Raquel Urtasun 2, Devi Parikh 3 1 UCLA.
GraphCut-based Optimisation for Computer Vision Ľubor Ladický.
Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.
Simultaneous Segmentation and 3D Pose Estimation of Humans or Detection + Segmentation = Tracking? Philip H.S. Torr Pawan Kumar, Pushmeet Kohli, Matt Bray.
1. Introduction Humanising GrabCut: Learning to segment humans using the Kinect Varun Gulshan, Victor Lempitksy and Andrew Zisserman Dept. of Engineering.
Robust Higher Order Potentials For Enforcing Label Consistency
Schedule Introduction Models: small cliques and special potentials Tea break Inference: Relaxation techniques:
LARGE-SCALE NONPARAMETRIC IMAGE PARSING Joseph Tighe and Svetlana Lazebnik University of North Carolina at Chapel Hill CVPR 2011Workshop on Large-Scale.
P 3 & Beyond Solving Energies with Higher Order Cliques Pushmeet Kohli Pawan Kumar Philip H. S. Torr Oxford Brookes University CVPR 2007.
TextonBoost : Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton*, J. Winn†, C. Rother†, and A.
Oxford Brookes Seminar Thursday 3 rd September, 2009 University College London1 Representing Object-level Knowledge for Segmentation and Image Parsing:
Graph Cut based Inference with Co-occurrence Statistics Ľubor Ladický, Chris Russell, Pushmeet Kohli, Philip Torr.
What Energy Functions Can be Minimized Using Graph Cuts? Shai Bagon Advanced Topics in Computer Vision June 2010.
Relaxations and Moves for MAP Estimation in MRFs M. Pawan Kumar STANFORDSTANFORD Vladimir KolmogorovPhilip TorrDaphne Koller.
Measuring Uncertainty in Graph Cut Solutions Pushmeet Kohli Philip H.S. Torr Department of Computing Oxford Brookes University.
What, Where & How Many? Combining Object Detectors and CRFs
Heather Dunlop : Advanced Perception January 25, 2006
Graph-based Segmentation
Latent Boosting for Action Recognition Zhi Feng Huang et al. BMVC Jeany Son.
Reconstructing Relief Surfaces George Vogiatzis, Philip Torr, Steven Seitz and Roberto Cipolla BMVC 2004.
MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/31/15.
MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/24/10.
Multiplicative Bounds for Metric Labeling M. Pawan Kumar École Centrale Paris Joint work with Phil Torr, Daphne Koller.
Texture We would like to thank Amnon Drory for this deck הבהרה : החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
Texture We would like to thank Amnon Drory for this deck הבהרה : החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
INTRODUCTION Heesoo Myeong and Kyoung Mu Lee Department of ECE, ASRI, Seoul National University, Seoul, Korea Tensor-based High-order.
Associative Hierarchical CRFs for Object Class Image Segmentation Ľubor Ladický 1 1 Oxford Brookes University 2 Microsoft Research Cambridge Based on the.
Topic Models Presented by Iulian Pruteanu Friday, July 28 th, 2006.
Associative Hierarchical CRFs for Object Class Image Segmentation
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Category Independent Region Proposals Ian Endres and Derek Hoiem University of Illinois at Urbana-Champaign.
Motion Estimation using Markov Random Fields Hrvoje Bogunović Image Processing Group Faculty of Electrical Engineering and Computing University of Zagreb.
Markov Random Fields & Conditional Random Fields
Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.
Mean field approximation for CRF inference
Locally Linear Support Vector Machines Ľubor Ladický Philip H.S. Torr.
TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton ; University of Cambridge J. Jinn,
M ACHINE L EARNING : F OUNDATIONS C OURSE TAU – 2012A P ROF. Y ISHAY M ANSOUR o TextonBoost : Joint Appearance, Shape and Context Modeling for Multi-Class.
Joint Object-Material Category Segmentation from Audio-Visual Cues
DISCRIMINATIVELY TRAINED DENSE SURFACE NORMAL ESTIMATION ANDREW SHARP.
Strong Supervision From Weak Annotation Interactive Training of Deformable Part Models ICCV /05/23.
Image segmentation.
Holistic Scene Understanding Virginia Tech ECE /02/26 Stanislaw Antol.
MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/27/12.
Segmentation of Building Facades using Procedural Shape Priors
Nonparametric Semantic Segmentation
Adversarially Tuned Scene Generation
Object-Graphs for Context-Aware Category Discovery
Efficient Graph Cut Optimization for Full CRFs with Quantized Edges
The EM Algorithm With Applications To Image Epitome
“Traditional” image segmentation
Presentation transcript:

Models for Scene Understanding – Global Energy models and a Style-Parameterized boosting algorithm (StyP-Boost) Jonathan Warrell, 1 Simon Prince, 2 Philip Torr, 1 Lubor Ladicky, 1 Chris Russell 1 1 Oxford Brookes University, 2 University College London

Overview CRF-based semantic segmentation Recent models Detectors Stereo Co-occurence Hierarchical Energies Style parameterized boosting (StyP-Boost) Open questions / problems

CRF-based semantic segmentation Semantic segmentation = dense labeling using fixed object set

CRF-based semantic segmentation Conditional Random Field model (pairwise) Observed Variables Hidden Variables UnaryPairwise

Example:  -expansion Sky House Tree Ground Initialize with Tree Status: Expand GroundExpand HouseExpand Sky Courtesy: Pushmeet Kohli

Move Making Algorithms Search Neighbourhood Current Solution Optimal Move Solution Space Energy

Higher order CRF models Higher order models UnaryPairwiseHigher-order

Segment-based Potentials No. of pixels not taking l in c

Detector-based Potentials Strength of detector response Lubor Ladicky, Paul Sturgess, Karteek Alahari, Chris Russell, Philip H.S. Torr, What,Where & How Many? Combining Object Detectors and CRFs, ECCV 2010 What,Where & How Many? Combining Object Detectors and CRFs

Co-occurrence Potentials Lubor Ladicky, Chris Russell, Pushmeet Kohli, Philip H.S. Torr, Graph Cut based Inference with Co-occurrence Statistics, ECCV, 2010Graph Cut based Inference with Co-occurrence Statistics Global image label set

Joint Stereo + Segmentation Joint potentials Lubor Ladicky, Paul Sturgess, Chris Russell, Sunando Sengupta, Philip H.S. Torr, Joint Optimisation for Object Class Segmentation and Dense Stereo Reconstruction, BMVC 2010Joint Optimisation for Object Class Segmentation and Dense Stereo Reconstruction Object only potentials

Hierarchical Energies Lubor Ladicky, Chris Russell, Pushmeet Kohli, Philip H.S. Torr, Associative Hierarchical CRFs for Object Class Image Segmentation, ICCV, 2009.Associative Hierarchical CRFs for Object Class Image Segmentation Energy between levels 1 and 0

Style-based Potentials Jonathan Warrell, Simon Prince, Philip H.S. Torr, StyP-Boost: A Bilinear Boosting Algorithm for Learning Style-Parameterized Classifiers, BMVC, 2010StyP-Boost: A Bilinear Boosting Algorithm for Learning Style-Parameterized Classifiers Style-based unary potential Style 1:Style 2:

TextonBoost (Shotton et al ’09) Image first convolved with 17-d filter bank Vectors are clustered, and assigned to ~150 texton indices

TextonBoost (Shotton et al ’09) Texture-layout features derived from textons Boosted classifier predicts semantic class

DenseBoost (Ladicky et al ’09) DenseBoost extends TextonBoost to include HOG ColourHOG Structure / Motion features State of the art performance on MSRC (Ladicky et al ’09) CamVid (Sturgess et al ’09) Paul Sturgess, Karteek Alahari, Lubor Ladicky, Philip H.S. Torr, Combining Appearance and Structure from Motion Features for Road Scene Understanding, BMVC, 2009Combining Appearance and Structure from Motion Features for Road Scene Understanding Lubor Ladicky, Chris Russell, Pushmeet Kohli, Philip H.S. Torr, Associative Hierarchical CRFs for Object Class Image Segmentation, ICCV, 2009.Associative Hierarchical CRFs for Object Class Image Segmentation

StyP-Boost Framework (Training) Training Set Objective Classifier form Local featuresStyle ParametersTarget vectors

StyP-Boost Framework (Training) Training Set Objective Classifier form Loss for class k Strong learner for class k

StyP-Boost Framework (Training) Training Set Objective Classifier form Weak learner mStyle s

Corel: Styles through clustering Styles found in Corel through clustering 2-styles (98%) 3-styles(96%) 4-styles (89%)

Corel: Styles through clustering Cluster images based on label histograms during training (2-4 clusters) Train classifier to predict cluster from image Use smoothed classifier posteriors as style parameters (training and testing) cluster label

Corel: Qualitative results StyP-Boost reduces noise from classes which don’t co-occur

Corel: Qualitative results StyP-Boost provides better discrimination of co-occuring classes

Corel: Quantitative results Training set Test set

Open questions / Problems Learning from sparsely labeled data Lamp-post Sign

Open questions / Problems Incorporating 3D and Video Image CRF Ground-plan CRF Volumetric CRF

Open questions / Problems Using temporal information Extend detector potentials to include tracking Use global scene variables for times of day, seasons etc.

Further Questions Further Questions?