A Search-Classify Approach for Cluttered Indoor Scene Understanding Liangliang Nan 1, Ke Xie 1, Andrei Sharf 2 1 SIAT, China 2 Ben Gurion University, Israel.

Slides:



Advertisements
Similar presentations
Kapitel 14 Recognition Scene understanding / visual object categorization Pose clustering Object recognition by local features Image categorization Bag-of-features.
Advertisements

Discontinuity Preserving Stereo with Small Baseline Multi-Flash Illumination Rogerio Feris 1, Ramesh Raskar 2, Longbin Chen 1, Karhan Tan 3 and Matthew.
Indoor Segmentation and Support Inference from RGBD Images Nathan Silberman, Derek Hoiem, Pushmeet Kohli, Rob Fergus.
SmartBoxes for Interactive Urban Reconstruction Liangliang Nan 1, Andrei Sharf 1, Hao Zhang 2, Daniel Cohen-Or 3, Baoquan Chen 1 1 Shenzhen Institutes.
1/50 Photo-Inspired Model-Driven 3D Object Modeling Kai Xu 1,2 Hanlin Zheng 3 Hao (Richard) Zhang 2 Daniel Cohen-Or 4 Ligang Liu 3 Yueshan Xiong 1 1 National.
Wheres Waldo: Matching People in Images of Crowds Rahul GargDeva RamananSteven M. Seitz Noah Snavely Problem Definition University of Washington University.
The Layout Consistent Random Field for detecting and segmenting occluded objects CVPR, June 2006 John Winn Jamie Shotton.
3rd Workshop On Semantic Perception, Mapping and Exploration (SPME) Karlsruhe, Germany,2013 Semantic Parsing for Priming Object Detection in RGB-D Scenes.
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
Creating Consistent Scene Graphs Using a Probabilistic Grammar Tianqiang LiuSiddhartha ChaudhuriVladimir G. Kim Qi-Xing HuangNiloy J. MitraThomas Funkhouser.
SPONSORED BY SA2014.SIGGRAPH.ORG Annotating RGBD Images of Indoor Scenes Yu-Shiang Wong and Hung-Kuo Chu National Tsing Hua University CGV LAB.
Tracking Learning Detection
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features Kristen Grauman Trevor Darrell MIT.
Patch to the Future: Unsupervised Visual Prediction
Amir Hosein Omidvarnia Spring 2007 Principles of 3D Face Recognition.
1.Introduction 2.Article [1] Real Time Motion Capture Using a Single TOF Camera (2010) 3.Article [2] Real Time Human Pose Recognition In Parts Using a.
Cambridge, Massachusetts Pose Estimation in Heavy Clutter using a Multi-Flash Camera Ming-Yu Liu, Oncel Tuzel, Ashok Veeraraghavan, Rama Chellappa, Amit.
Creating Consistent Scene Graphs Using a Probabilistic Grammar Tianqiang LiuSiddhartha ChaudhuriVladimir G. Kim Qi-Xing HuangNiloy J. MitraThomas Funkhouser.
1 Building a Dictionary of Image Fragments Zicheng Liao Ali Farhadi Yang Wang Ian Endres David Forsyth Department of Computer Science, University of Illinois.
Robust Object Tracking via Sparsity-based Collaborative Model
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
Structure Recovery by Part Assembly Chao-Hui Shen 1 Hongbo Fu 2 Kang Chen 1 Shi-Min Hu 1 1 Tsinghua University 2 City University of Hong Kong.
Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.
High-level Component Filtering for Robust Scene Text Detection
R-CNN By Zhang Liliang.
Spatial Pyramid Pooling in Deep Convolutional
A structured learning framework for content- based image indexing and visual Query (Joo-Hwee, Jesse S. Jin) Presentation By: Salman Ahmad (270279)
Survey of Object Classification in 3D Range Scans
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
Object Detection Sliding Window Based Approach Context Helps
Kourosh MESHGI Shin-ichi MAEDA Shigeyuki OBA Shin ISHII 18 MAR 2014 Integrated System Biology Lab (Ishii Lab) Graduate School of Informatics Kyoto University.
Local invariant features Cordelia Schmid INRIA, Grenoble.
Interactive Discovery and Semantic Labeling of Patterns in Spatial Data Thomas Funkhouser, Adam Finkelstein, David Blei, and Christiane Fellbaum Princeton.
Conjoining Gestalt Rules for Abstraction of Architectural Drawings Liangliang(Leon) Nan 1, Andrei Sharf 2, Ke Xie 1, Tien-Tsin Wong 3 Oliver Deussen 4,
Why Categorize in Computer Vision ?. Why Use Categories? People love categories!
1 Action Classification: An Integration of Randomization and Discrimination in A Dense Feature Representation Computer Science Department, Stanford University.
MESA LAB Two papers in icfda14 Guimei Zhang MESA LAB MESA (Mechatronics, Embedded Systems and Automation) LAB School of Engineering, University of California,
SPONSORED BY Data-driven Segmentation and Labeling of Freehand Sketches Zhe Huang, Hongbo Fu, Rynson W.H. Lau City University of Hong Kong.
Organizing Heterogeneous Scene Collections through Contextual Focal Points Kai Xu, Rui Ma, Hao Zhang, Chenyang Zhu, Ariel Shamir, Daniel Cohen-Or, Hui.
Automated Detection and Classification Models SAR Automatic Target Recognition Proposal J.Bell, Y. Petillot.
Experiments Test different parking lot images captured in different luminance conditions The test samples include 1300 available parking spaces and 1500.
Ground Truth Free Evaluation of Segment Based Maps Rolf Lakaemper Temple University, Philadelphia,PA,USA.
IIIT Hyderabad Learning Semantic Interaction among Graspable Objects Swagatika Panda, A.H. Abdul Hafez, C.V. Jawahar Center for Visual Information Technology,
Face Detection Ying Wu Electrical and Computer Engineering Northwestern University, Evanston, IL
Local invariant features Cordelia Schmid INRIA, Grenoble.
Acquiring 3D Indoor Environments with Variability and Repetition Young Min Kim Stanford University Niloy J. Mitra UCL/ KAUST Dong-Ming Yan KAUST Leonidas.
Efficient Visual Object Tracking with Online Nearest Neighbor Classifier Many slides adapt from Steve Gu.
Removing Moving Objects from Point Cloud Scenes Krystof Litomisky and Bir Bhanu International Workshop on Depth Image Analysis November 11, 2012.
Discussion of Pictorial Structures Pedro Felzenszwalb Daniel Huttenlocher Sicily Workshop September, 2006.
Sponsored by Deformation-Driven Topology-Varying 3D Shape Correspondence Ibraheem Alhashim Kai Xu Yixin Zhuang Junjie Cao Patricio Simari Hao Zhang Presenter:
Image Classification over Visual Tree Jianping Fan Dept of Computer Science UNC-Charlotte, NC
Recognition Using Visual Phrases
SA2014.SIGGRAPH.ORG SPONSORED BY Automatic Semantic Modeling of Indoor Scenes from Low-quality RGB-D Data using Contextual Information Kang Chen 1 Yu-Kun.
Wonjun Kim and Changick Kim, Member, IEEE
Martina Uray Heinz Mayer Joanneum Research Graz Institute of Digital Image Processing Horst Bischof Graz University of Technology Institute for Computer.
3D Perception and Environment Map Generation for Humanoid Robot Navigation A DISCUSSION OF: -BY ANGELA FILLEY.
Recognizing Deformable Shapes
Supervised Time Series Pattern Discovery through Local Importance
Nonparametric Semantic Segmentation
Paper Presentation: Shape and Matching
Real-Time Human Pose Recognition in Parts from Single Depth Image
Object Tracking Based on Appearance and Depth Information
Image Segmentation Techniques
Multiple Instance Learning: applications to computer vision
RGB-D Image for Scene Recognition by Jiaqi Guo
Poster Spotlights Shape Anchors for Data-driven
Brief Review of Recognition + Context
Recognizing Deformable Shapes
Department of Computer Science Ben-Gurion University of the Negev
SmartBoxes for Interactive Urban Reconstruction
Presentation transcript:

A Search-Classify Approach for Cluttered Indoor Scene Understanding Liangliang Nan 1, Ke Xie 1, Andrei Sharf 2 1 SIAT, China 2 Ben Gurion University, Israel

Digitalization of indoor scenes Indoor scenes from Google 3D Warehouse

Acquisition of indoor scenes

Goal Scene understanding

Challenges Clutter –Densely populated –Arbitrary arrangements Partial representation –Occlusions Complex geometry

Classification & Segmentation Two interleaved problems –What are the objects? –Where are the objects? Chicken-egg problem –Classification needs segmentation –Segmentation needs a prior

Our solution Search –Propagate / accumulate patches Classify –Query classifier to detect object

Related Work Indoor scenes ( This Session ) – [Fisher et al. 2012] [Shao et al. 2012] [Kim et al. 2012] Semantic relationship – [Fisher et al. 2010, 2011] Recognition using depth + texture (RGB-D) – [Quigley et al.2009], [Lai and Fox 2010] Outdoor classification – [Golovinskiy et al. 2009] Semantic labeling – [Koppula et al. 2011] Controlled region growing process

Our search-classify idea

Method overview Training Search-Classify

Point cloud features –Height-size ratio of BBox –Aspect ratio of each layer –Bottom-top, mid-top size ratio –Change in COM along horizontal slabs BhBh BdBd BwBw

Classifier Handle missing data –Occlusion Random decision forest –Efficient multi-class classifier Trained with both scanned and synthetic data –Manually segmented and labeled –510 chairs –250 tables –110 cabinets –40 monitors etc. [Shotton et al. 2008, 2011]

Search-Classify Starts from seeds –Random patch triplets –Remove seeds with low confidence Accumulating neighbor patches –Highest classification confidence Stop condition –Steep decrease in classification confidence Seed

Segmented - classified objects problems –Overlap, outliers, ambiguities etc. Refinement –Outliers = patches with large distance Segmentation refinement by template fitting

Template deformation Different styles for each class Predefined scalable parts Templates can deform [Xu et al. 2010]

Template deformation Different styles for each class Predefined scalable parts Templates can deform [Xu et al. 2010]

Fitting via template deformation ConfidenceFitting errorBest fitting Best matching template –One-side Euclidean distance from points to template

Results and discussion

Scalability test with varied object density 0 (25) 1 (45) 5 (60)

Results and discussion Comparison Lai et al Ours

Limitation Upward assumption –Features –Template fitting

Future work Contextual information

Thank you