Interactively Co-segmentating Topically Related Images with Intelligent Scribble Guidance Dhruv Batra, Carnegie Mellon University Adarsh Kowdle, Cornell.

Slides:



Advertisements
Similar presentations
POSE–CUT Simultaneous Segmentation and 3D Pose Estimation of Humans using Dynamic Graph Cuts Mathieu Bray Pushmeet Kohli Philip H.S. Torr Department of.
Advertisements

Applications of one-class classification
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Automatic Photo Pop-up Derek Hoiem Alexei A.Efros Martial Hebert Carnegie Mellon University.
Context-based object-class recognition and retrieval by generalized correlograms by J. Amores, N. Sebe and P. Radeva Discussion led by Qi An Duke University.
Top-Down & Bottom-Up Segmentation
Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.
Graph cut Chien-chi Chen.
Pattern Recognition and Machine Learning
A generic model to compose vision modules for holistic scene understanding Adarsh Kowdle *, Congcong Li *, Ashutosh Saxena, and Tsuhan Chen Cornell University,
Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
Learning Techniques for Video Shot Detection Under the guidance of Prof. Sharat Chandran by M. Nithya.
I Images as graphs Fully-connected graph – node for every pixel – link between every pair of pixels, p,q – similarity w ij for each link j w ij c Source:
GrabCut Interactive Image (and Stereo) Segmentation Carsten Rother Vladimir Kolmogorov Andrew Blake Antonio Criminisi Geoffrey Cross [based on Siggraph.
Foreground/Background Image Segmentation. What is our goal? To label each pixel in an image as belonging to either the foreground of the scene or the.
Texture Segmentation Based on Voting of Blocks, Bayesian Flooding and Region Merging C. Panagiotakis (1), I. Grinias (2) and G. Tziritas (3)
GrabCut Interactive Image (and Stereo) Segmentation Joon Jae Lee Keimyung University Welcome. I will present Grabcut – an Interactive tool for foreground.
Pedestrian Detection in Crowded Scenes Dhruv Batra ECE CMU.
Background Removal of Multiview Images by Learning Shape Priors Yu-Pao Tsai, Cheng-Hung Ko, Yi-Ping Hung, and Zen-Chung Shih.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Region Segmentation. Find sets of pixels, such that All pixels in region i satisfy some constraint of similarity.
Abstract Extracting a matte by previous approaches require the input image to be pre-segmented into three regions (trimap). This pre-segmentation based.
Object Class Recognition Using Discriminative Local Features Gyuri Dorko and Cordelia Schmid.
Presented by Zeehasham Rasheed
Lecture#6: segmentation Anat Levin Introduction to Computer Vision Class Fall 2009 Department of Computer Science and App math, Weizmann Institute of Science.
Optimal Adaptation for Statistical Classifiers Xiao Li.
Presented By : Murad Tukan
Jeff Howbert Introduction to Machine Learning Winter Machine Learning Feature Creation and Selection.
Automatic User Interaction Correction via Multi-label Graph-cuts Antonio Hernández-Vela, Carlos Primo and Sergio Escalera Workshop on Human Interaction.
Graph-based Segmentation
MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/31/15.
Multimodal Interaction Dr. Mike Spann
MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/24/10.
Recognition using Regions (Demo) Sudheendra V. Outline Generating multiple segmentations –Normalized cuts [Ren & Malik (2003)] Uniform regions –Watershed.
NATIONAL TECHNICAL UNIVERSITY OF ATHENS Image, Video And Multimedia Systems Laboratory Background
I 3D: Interactive Planar Reconstruction of Objects and Scenes Adarsh KowdleYao-Jen Chang Tsuhan Chen School of Electrical and Computer Engineering Cornell.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Scene Completion Using Millions of Photographs James Hays, Alexei A. Efros Carnegie Mellon University ACM SIGGRAPH 2007.
Paired Sampling in Density-Sensitive Active Learning Pinar Donmez joint work with Jaime G. Carbonell Language Technologies Institute School of Computer.
CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.
Learning Spectral Clustering, With Application to Speech Separation F. R. Bach and M. I. Jordan, JMLR 2006.
Associative Hierarchical CRFs for Object Class Image Segmentation
A New Method for Automatic Clothing Tagging Utilizing Image-Click-Ads Introduction Conclusion Can We Do Better to Reduce Workload?
Effective Automatic Image Annotation Via A Coherent Language Model and Active Learning Rong Jin, Joyce Y. Chai Michigan State University Luo Si Carnegie.
© Devi Parikh 2008 Devi Parikh and Tsuhan Chen Carnegie Mellon University April 3, ICASSP 2008 Bringing Diverse Classifiers to Common Grounds: dtransform.
A Multiresolution Symbolic Representation of Time Series Vasileios Megalooikonomou Qiang Wang Guo Li Christos Faloutsos Presented by Rui Li.
A New Method for Crater Detection Heather Dunlop November 2, 2006.
CS654: Digital Image Analysis Lecture 28: Advanced topics in Image Segmentation Image courtesy: IEEE, IJCV.
Object-Graphs for Context-Aware Category Discovery Yong Jae Lee and Kristen Grauman University of Texas at Austin 1.
Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.
Extracting Adaptive Contextual Cues From Unlabeled Regions Congcong Li +, Devi Parikh *, Tsuhan Chen + + Cornell University * Toyota Technological Institute.
Cell Segmentation in Microscopy Imagery Using a Bag of Local Bayesian Classifiers Zhaozheng Yin RI/CMU, Fall 2009.
IEEE 2015 Conference on Computer Vision and Pattern Recognition Active Learning for Structured Probabilistic Models with Histogram Approximation Qing SunAnkit.
SUPPORT VECTOR MACHINES Presented by: Naman Fatehpuria Sumana Venkatesh.
Edge Preserving Spatially Varying Mixtures for Image Segmentation Giorgos Sfikas, Christophoros Nikou, Nikolaos Galatsanos (CVPR 2008) Presented by Lihan.
Image segmentation.
Shape2Pose: Human Centric Shape Analysis CMPT888 Vladimir G. Kim Siddhartha Chaudhuri Leonidas Guibas Thomas Funkhouser Stanford University Princeton University.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Recognition of biological cells – development
GrabCut Interactive Foreground Extraction using Iterated Graph Cuts Carsten Rother Vladimir Kolmogorov Andrew Blake Microsoft Research Cambridge-UK.
Using Transductive SVMs for Object Classification in Images
Object-Graphs for Context-Aware Category Discovery
Dingding Liu* Yingen Xiong† Linda Shapiro* Kari Pulli†
Human-in-the-loop ECE6504 Xiao Lin.
Automatic User Interaction Correction via Multi-label Graph-cuts
“grabcut”- Interactive Foreground Extraction using Iterated Graph Cuts
Machine Learning – a Probabilistic Perspective
Physics-guided machine learning for milling stability:
Semi-Supervised Learning
“Traditional” image segmentation
Presentation transcript:

Interactively Co-segmentating Topically Related Images with Intelligent Scribble Guidance Dhruv Batra, Carnegie Mellon University Adarsh Kowdle, Cornell University Devi Parikh, Toyota Technological Institute Jiebo Luo, Eastman Kodak Company Tsuhan Chen, Cornell University

Outline 1 Introduction 2 iCoseg: Energy Minimization 3 iCoseg: Guiding User Scribbles 4 The CMU-Cornell iCoseg Dataset 5 Experiments 6 Interactive Co-segmentation for Object-of- Interest 3D Modeling

Introduction We develop an algorithm that allows users to decide what foreground is, and then guide the output of the co-segmentation algorithm towards it via scribbles

Outline 1 Introduction 2 iCoseg: Energy Minimization 3 iCoseg: Guiding User Scribbles 4 The CMU-Cornell iCoseg Dataset 5 Experiments 6 Interactive Co-segmentation for Object-of- Interest 3D Modeling

Energy Minimization 1. Data (Unary) Term : indicating the cost of assigning a superpixel to foreground and background classes 2. Smoothness (Pairwise) Term : used for penalizing label disagreement between neighbours I (·) : an indicator function that is 1(0) if the input argument is true (false) d ij : the distance between features at superpixels i and j β : a scale parameter

Outline 1 Introduction 2 iCoseg: Energy Minimization 3 iCoseg: Guiding User Scribbles 3.1 Uncertainty-Based Cues 3.2 Scribble-Based Cues 3.3 Image-Level Cues 3.4 Combined Recommendation Map 4 The CMU-Cornell iCoseg Dataset 5 Experiments 6 Interactive Co-segmentation for Object-of- Interest 3D Modeling

Image cues: Segment Size Codeword Distribution Recommendation Map

3.1 Uncertainty-Based Cues 1. Node Uncertainty (NU):Fitting A1 = {GMM f,GMM b } to the labelled superpixel features. Using this learnt A1, for each superpixel we normalize the foreground and background likelihoods to get a 2-class distribution and then compute the entropy of this distribution. 2. Edge Uncertainty (EU):To feed unlabelled data- points to a set of classifiers and request label for the datapoint with maximal disagreement among classifier outcomes.

3.1 Uncertainty-Based Cues 3. Graph-Cut Uncertainty(GC): Capture the confidence in the energy minimizing state returned by graph cuts. 3.2 Scribble-Based Cues 4. Distance Transform over Scribbles (DT): Compute the distance of every pixel to the nearest scribble location.

3.2 Scribble-Based Cues 5. Intervening Contours over Scribbles (IC): The value of this cue at each pixel is the maximum edge magnitude in the straight line to the closest scribble.

3.3 Image-Level Cues 6. Segment Size (SS): When very few scribbles are marked, energy minimization methods typically overs- mooth and results in “whitewash” segmentations (entire image labelled as foreground or background). 7. Codeword Distribution over Images (CD):Motivation being that scribbling on images containing more diversity among features would lead to better foreground /background models. To compute this cue, we cluster the features computed from all superpixels in the group to form a codebook.

3.4 Combined Recommendation Map Learning a mapping F :φ i → ϵ i, φ i is the 7-dimensional feature vector for superpixel i ϵ i is the error indicator vector which is 1 if the predicted segmentation at node ϵ i is incorrect, and 0 otherwise. We chose logistic regression as the form of this mapping.

Outline 1 Introduction 2 iCoseg: Energy Minimization 3 iCoseg: Guiding User Scribbles 4 The CMU-Cornell iCoseg Dataset 5 Experiments 6 Interactive Co-segmentation for Object-of- Interest 3D Modeling

Publicly available CMU-Cornell iCoseg: 38 groups 643 images ~17 im/gp Sport

The CMU-Cornell iCoseg Dataset Dataset Annotation: The ground-truth annotations for the dataset were manually generated by a single annotator using a labelling tool.

Dataset Statistics Size The histogram of the number of images in groups

Dataset Statistics Appearance

Dataset Statistics Scale

Outline 1 Introduction 2 iCoseg: Energy Minimization 3 iCoseg: Guiding User Scribbles 4 The CMU-Cornell iCoseg Dataset 5 Experiments 5.1 Machine Experiments 5.2 User Study 6 Interactive Co-segmentation for Object-of- Interest 3D Modeling

5.1 Machine Experiments

5.2 User Study

Outline 1 Introduction 2 iCoseg: Energy Minimization 3 iCoseg: Guiding User Scribbles 4 The CMU-Cornell iCoseg Dataset 5 Experiments 6 Interactive Co-segmentation for Object-of- Interest 3D Modeling

Interactive 3D modelling

3D Modeling

Conclusions iCoseg that co-segments all images in the group using an energy minimization framework, and an automatic recommendation system that intelligently recommends a region among all images in the group where the user should scribble next. Achieve good quality segmentations with significantly lower time and effort than exhaustively examining all cutouts.