Scene Labeling Using Beam Search Under Mutex Constraints ID: O-2B-6 Anirban Roy and Sinisa Todorovic Oregon State University 1.

Slides:



Advertisements
Similar presentations
Using Strong Shape Priors for Multiview Reconstruction Yunda SunPushmeet Kohli Mathieu BrayPhilip HS Torr Department of Computing Oxford Brookes University.
Advertisements

Mean-Field Theory and Its Applications In Computer Vision1 1.
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Indoor Segmentation and Support Inference from RGBD Images Nathan Silberman, Derek Hoiem, Pushmeet Kohli, Rob Fergus.
The Layout Consistent Random Field for detecting and segmenting occluded objects CVPR, June 2006 John Winn Jamie Shotton.
3rd Workshop On Semantic Perception, Mapping and Exploration (SPME) Karlsruhe, Germany,2013 Semantic Parsing for Priming Object Detection in RGB-D Scenes.
Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.
Constrained Approximate Maximum Entropy Learning (CAMEL) Varun Ganapathi, David Vickrey, John Duchi, Daphne Koller Stanford University TexPoint fonts used.
Joint Optimisation for Object Class Segmentation and Dense Stereo Reconstruction Ľubor Ladický, Paul Sturgess, Christopher Russell, Sunando Sengupta, Yalin.
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Part 4: Combined segmentation and recognition by Rob Fergus (MIT)
Learning to Combine Bottom-Up and Top-Down Segmentation Anat Levin and Yair Weiss School of CS&Eng, The Hebrew University of Jerusalem, Israel.
Shape Sharing for Object Segmentation
Recovering Human Body Configurations: Combining Segmentation and Recognition Greg Mori, Xiaofeng Ren, and Jitentendra Malik (UC Berkeley) Alexei A. Efros.
Large-Scale Object Recognition using Label Relation Graphs Jia Deng 1,2, Nan Ding 2, Yangqing Jia 2, Andrea Frome 2, Kevin Murphy 2, Samy Bengio 2, Yuan.
Face Alignment by Explicit Shape Regression
Efficient Inference for Fully-Connected CRFs with Stationarity
Pseudo-Bound Optimization for Binary Energies Presenter: Meng Tang Joint work with Ismail Ben AyedYuri Boykov 1 / 27.
Analyzing Semantic Segmentation Using Hybrid Human-Machine CRFs Roozbeh Mottaghi 1, Sanja Fidler 2, Jian Yao 2, Raquel Urtasun 2, Devi Parikh 3 1 UCLA.
Biased Normalized Cuts 1 Subhransu Maji and Jithndra Malik University of California, Berkeley IEEE Conference on Computer Vision and Pattern Recognition.
Models for Scene Understanding – Global Energy models and a Style-Parameterized boosting algorithm (StyP-Boost) Jonathan Warrell, 1 Simon Prince, 2 Philip.
Computer Vision Group, University of BonnVision Laboratory, Stanford University Abstract This paper empirically compares nine image dissimilarity measures.
Robust Higher Order Potentials For Enforcing Label Consistency
Hierarchical Region-Based Segmentation by Ratio-Contour Jun Wang April 28, 2004 Course Project of CSCE 790.
Schedule Introduction Models: small cliques and special potentials Tea break Inference: Relaxation techniques:
LARGE-SCALE NONPARAMETRIC IMAGE PARSING Joseph Tighe and Svetlana Lazebnik University of North Carolina at Chapel Hill CVPR 2011Workshop on Large-Scale.
Oxford Brookes Seminar Thursday 3 rd September, 2009 University College London1 Representing Object-level Knowledge for Segmentation and Image Parsing:
Graph Cut based Inference with Co-occurrence Statistics Ľubor Ladický, Chris Russell, Pushmeet Kohli, Philip Torr.
What Energy Functions Can be Minimized Using Graph Cuts? Shai Bagon Advanced Topics in Computer Vision June 2010.
Object Recognition by Discriminative Methods Sinisa Todorovic 1st Sino-USA Summer School in VLPR July, 2009.
What, Where & How Many? Combining Object Detectors and CRFs
Graph-based Segmentation
Richard Socher Cliff Chiung-Yu Lin Andrew Y. Ng Christopher D. Manning
3D LayoutCRF Derek Hoiem Carsten Rother John Winn.
Conditional Topic Random Fields Jun Zhu and Eric P. Xing ICML 2010 Presentation and Discussion by Eric Wang January 12, 2011.
MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/24/10.
City University of Hong Kong 18 th Intl. Conf. Pattern Recognition Self-Validated and Spatially Coherent Clustering with NS-MRF and Graph Cuts Wei Feng.
Random Walk with Restart (RWR) for Image Segmentation
TTH 1:30-2:48 Winter DL266 CIS 788v04 Zhu Topic 5. Human Faces Human face is extensively studied.
Multi-task Low-rank Affinity Pursuit for Image Segmentation Bin Cheng, Guangcan Liu, Jingdong Wang, Zhongyang Huang, Shuicheng Yan (ICCV’ 2011) Presented.
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
Texture We would like to thank Amnon Drory for this deck הבהרה : החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
Texture We would like to thank Amnon Drory for this deck הבהרה : החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
Classifiers Given a feature representation for images, how do we learn a model for distinguishing features from different classes? Zebra Non-zebra Decision.
INTRODUCTION Heesoo Myeong and Kyoung Mu Lee Department of ECE, ASRI, Seoul National University, Seoul, Korea Tensor-based High-order.
Associative Hierarchical CRFs for Object Class Image Segmentation Ľubor Ladický 1 1 Oxford Brookes University 2 Microsoft Research Cambridge Based on the.
Putting Context into Vision Derek Hoiem September 15, 2004.
Geodesic Saliency Using Background Priors
Tell Me What You See and I will Show You Where It Is Jia Xu 1 Alexander G. Schwing 2 Raquel Urtasun 2,3 1 University of Wisconsin-Madison 2 University.
Topic Models Presented by Iulian Pruteanu Friday, July 28 th, 2006.
Associative Hierarchical CRFs for Object Class Image Segmentation
A New Method for Automatic Clothing Tagging Utilizing Image-Click-Ads Introduction Conclusion Can We Do Better to Reduce Workload?
Category Independent Region Proposals Ian Endres and Derek Hoiem University of Illinois at Urbana-Champaign.
Feedforward semantic segmentation with zoom-out features
Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.
Coherent Scene Understanding with 3D Geometric Reasoning Jiyan Pan 12/3/2012.
Learning Hierarchical Features for Scene Labeling
IEEE 2015 Conference on Computer Vision and Pattern Recognition Active Learning for Structured Probabilistic Models with Histogram Approximation Qing SunAnkit.
Stacked Hierarchical Labeling Dan MunozDrew BagnellMartial Hebert.
Learning Hierarchical Features for Scene Labeling Cle’ment Farabet, Camille Couprie, Laurent Najman, and Yann LeCun by Dong Nie.
Image segmentation.
Gaussian Conditional Random Field Network for Semantic Segmentation
Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.
Holistic Scene Understanding Virginia Tech ECE /02/26 Stanislaw Antol.
Recent developments in object detection
Learning a Region-based Scene Segmentation Model
Nonparametric Semantic Segmentation
Efficient Graph Cut Optimization for Full CRFs with Quantized Edges
Learning to Combine Bottom-Up and Top-Down Segmentation
“Traditional” image segmentation
Presentation transcript:

Scene Labeling Using Beam Search Under Mutex Constraints ID: O-2B-6 Anirban Roy and Sinisa Todorovic Oregon State University 1

Problem: Semantic Segmentation 2

Prior Work: Labeling Individual Superpixels Random forest, Logistic regression [Payet et al. PAMI 13, Shotton et al. CVPR08, Eslami et al. CVPR12] Decision Forest: [ Shotton et al. CVPR08 ] 3

Prior Work: Labeling Individual Superpixels Deep learning (DL ) [Socher et al. ICML11] [DL: Socher et al. ICML 11 ] 4

Prior Work: Labeling Individual Superpixels Segmentation trees [ Arbelaez et al. CVPR 12, Todorovic & Ahuja CVPR08, Lim et al. ICCV09] 5 Original image Hierarchical Segmentation [Arbelaez et al. CVPR 12]

Prior Work: Holistic Approaches CRF, Hierarchical models [ Kohli et al. CVPR08, Gould et al. IJCV08, Zhnag et al. CVPR12, Kumar et al. CVPR 10, Lempitsky et al. NIPS11, Mottaghi et al. CVPR13, Zhu et al. PAMI12] Deep learning (DL) + CRF [Farabet et al. PAMI13, Kae et al. CVPR11] [CRF: Gould et al. IJCV08] 6

Our Approach 7 Input Image Superpixels

Our Approach 8 Input Image Superpixels Smoothness Context Domain Knowledge CRF

Our Approach 9 Input Image Superpixels CRF inference Smoothness Context Mutual exclusion Domain Knowledge CRF

Our Approach 10 Input ImageSemantic segmentation Superpixels CRF inference Smoothness Context Mutual exclusion Domain Knowledge CRF

Motivation: Mutex Constraints Key Idea: Mutual Exclusion constraints should help Input ImageSemantic segmentation without Mutex 11

Motivation: Mutex Constraints Input ImageSemantic segmentation with Mutex Semantic segmentation without Mutex Key Idea: Mutual Exclusion constraints should help Note that Context ≠ Mutex 12

Motivation: Mutex Constraints Key Idea: Mutex = (object, object, relationship) Input ImageSemantic segmentation with Mutex Semantic segmentation without Mutex {Left, Right, Above, Below, Surrounded by, Nested within, etc.} 13

Related Work on Mutex Constraints in Different Problems Event recognition and Activity recognition [Tran & Davis ECCV08, Brendel et al. CVPR11] Video segmentation [Ma & Latecki CVPR12] 14

How to Incorporate Mutex? Appearance Smoothness & Context CRF Energy Mutex violations 15

Consequences of Mutex Violation Input ImageSemantic segmentation without Mutex Input Image Semantic segmentation without Mutex Violation of smoothness  Error Violation of mutex  Serious Error 16

How to Incorporate Mutex? 17 Appearance Smoothness & Context CRF Energy Mutex violations Modeling issue: Violation of kth mutex constraint => M k  ∞ => E = ?

How to Incorporate Mutex? Modeling issue: Violation of kth mutex constraint => M k  ∞ => E = ? 18 Appearance Smoothness & Context CRF Energy Mutex violations

Our Model 19 Appearance Smoothness & Context CRF Energy [ Kohli et al. CVPR08, Gould et al. IJCV08, Zhnag et al. CVPR12, Kumar et al. CVPR 10, Lempitsky et al. NIPS11, Mottaghi et al. CVPR13, Zhu et al. PAMI12]

CRF Inference as QP 20

CRF Inference as QP Superpixel Class label Assignment Vector 21

CRF Inference as QP Matrix of potentials (j, j’) (i,i’) = Superpixel Class label 22 Pairwise Potentials Unary Potentials

Mutex : Label i’ i x ii’ = 1 Label j’ j x jj’ = 0 Formalizing Mutex Constraints is assigned to must not be assigned to 23

Mutex : Label i’ i x ii’ = 1 Label j’ j x jj’ = 0 Formalizing Mutex Constraints Linear option: x ii’ + x jj’ = 1 Quadratic option: x ii’ x jj’ = 0 Which one is better? is assigned to must not be assigned to OR 24

Mutex Constraints Compact representation: Must be Matrix of mutex M 25 1 (i,i’) (j,j’)

Mutex Constraints Compact representation: (i,i’) (j,j’) Must be (k, k’) Can be Matrix of mutex M 26 10

Inference as QP 27

Inference as QP 28 Relaxation?

CRF Inference as a Beam Search Initial labeling Candidate labelings 29

CRF Inference as a Beam Search Initial labeling Candidate labelings 30

CRF Inference as a Beam Search Initial labeling Candidate labelings 31

CRF Inference as a Beam Search Initial labeling Candidate labelings 32

CRF Inference as a Beam Search Initial labeling Candidate labelings 33

CRF Inference as a Beam Search Maximum score Initial labeling Candidate labelings 34

Our Search Framework STATE: Label assignment that satisfies mutex constraints SUCCESSOR: Generates new states from previous ones HEURISTIC: Selects top B states for SUCCESSOR SCORE: Selects the best state in the beam search 35

SUCCESSOR Generates New States STATE: a labeling assignment 36

Probabilistically cuts edges to get Connected components of superpixels of same labels 37 SUCCESSOR Generates New States

Randomly selects a connected components 38 SUCCESSOR Generates New States

Changes labels of the selected connected component 39 Changes in the labeling of superpixels SUCCESSOR Generates New States

SUCCESSOR Accepting New States Accepts the new state if it satisfies all constraints next state previous state 40 Efficient computation:

Heuristic and Score Functions SCORE: Negative CRF energy HEURISTIC: Again efficient computation 41

Results 42

Input Parameter Evaluation The MSRC dataset. Beam Width # Restarts Accuracy Running Time 43

44 Pixelwise Accuracy (%) Accuracy Our Approach91. 5 CRF w/o mutex CRF w/ mutex + QP solver MSRC

Pixelwise Accuracy (%) Stanford Background 45 Accuracy Our Approach 81 CRF: Gould, ICCV ConvNet + CRF: Farabet et al. PAMI

Qualitative Results 46

Summary CRF based segmentation with mutex constraints CRF inference = QP  Solved using beam search Beam search is: – Efficient – Solves QP directly in the discrete domain – Guarantees that all mutex constraints are satisfied – Robust against parameter variations Mutex constraints increase accuracy by 9% on MSRC 47