RGB-D Image for Scene Recognition by Jiaqi Guo

Slides:



Advertisements
Similar presentations
Matthias Wimmer, Bernd Radig, Michael Beetz Chair for Image Understanding Computer Science TU München, Germany A Person and Context.
Advertisements

3D Model Matching with Viewpoint-Invariant Patches(VIP) Reporter :鄒嘉恆 Date : 10/06/2009.
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Evaluating Color Descriptors for Object and Scene Recognition Koen E.A. van de Sande, Student Member, IEEE, Theo Gevers, Member, IEEE, and Cees G.M. Snoek,
Classifying Objects as New or Learned with Convolutional Networks and SGD By Kevin Xiong and Evan Phibbs Mentored by Yufei Wang.
Yuanlu Xu Human Re-identification: A Survey.
Automatic scene inference for 3D object compositing Kevin Karsch (UIUC), Sunkavalli, K. Hadap, S.; Carr, N.; Jin, H.; Fonte, R.; Sittig, M., David Forsyth.
Learning Methods for Generic Object Recognition with Invariance to Pose and Lighting by Yann LeCun, Fu Jie Huang, and Léon Bottou in Proceedings of CVPR'04,
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
Exchanging Faces in Images SIGGRAPH ’04 Blanz V., Scherbaum K., Vetter T., Seidel HP. Speaker: Alvin Date: 21 July 2004.
Image Segmentation. Introduction The purpose of image segmentation is to partition an image into meaningful regions with respect to a particular application.
Robust Object Segmentation Using Adaptive Thresholding Xiaxi Huang and Nikolaos V. Boulgouris International Conference on Image Processing 2007.
Real-time Hand Pose Recognition Using Low- Resolution Depth Images
A Novel 2D To 3D Image Technique Based On Object- Oriented Conversion.
3D Scene Models Object recognition and scene understanding Krista Ehinger.
Matthias Wimmer, Bernd Radig, Michael Beetz Chair for Image Understanding Computer Science Technische Universität München Adaptive.
My Research Experience Cheng Qian. Outline 3D Reconstruction Based on Range Images Color Engineering Thermal Image Restoration.
Convolutional Neural Networks for Image Processing with Applications in Mobile Robotics By, Sruthi Moola.
FEATURE EXTRACTION FOR JAVA CHARACTER RECOGNITION Rudy Adipranata, Liliana, Meiliana Indrawijaya, Gregorius Satia Budhi Informatics Department, Petra Christian.
Image Recognition using Hierarchical Temporal Memory Radoslav Škoviera Ústav merania SAV Fakulta matematiky, fyziky a informatiky UK.
Hurieh Khalajzadeh Mohammad Mansouri Mohammad Teshnehlab
Perception Introduction Pattern Recognition Image Formation
ASPRS Annual Conference 2005, Baltimore, March Utilizing Multi-Resolution Image data vs. Pansharpened Image data for Change Detection V. Vijayaraj,
Building local part models for category-level recognition C. Schmid, INRIA Grenoble Joint work with G. Dorko, S. Lazebnik, J. Ponce.
Automated Target Recognition Using Mathematical Morphology Prof. Robert Haralick Ilknur Icke José Hanchi Computer Science Dept. The Graduate Center of.
1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.
Levels of Image Data Representation 4.2. Traditional Image Data Structures 4.3. Hierarchical Data Structures Chapter 4 – Data structures for.
A NOVEL METHOD FOR COLOR FACE RECOGNITION USING KNN CLASSIFIER
Multi Scale CRF Based RGB-D Image Segmentation Using Inter Frames Potentials Taha Hamedani Robot Perception Lab Ferdowsi University of Mashhad The 2 nd.
DISCRIMINATIVELY TRAINED DENSE SURFACE NORMAL ESTIMATION ANDREW SHARP.
Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,
Lecture 4b Data augmentation for CNN training
Facial Smile Detection Based on Deep Learning Features Authors: Kaihao Zhang, Yongzhen Huang, Hong Wu and Liang Wang Center for Research on Intelligent.
Cancer Metastases Classification in Histological Whole Slide Images
Processing visual information for Computer Vision
CS 4501: Introduction to Computer Vision Object Localization, Detection, Semantic Segmentation Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy.
Learning to Compare Image Patches via Convolutional Neural Networks
Demo.
Introduction to Skin and Face Detection
Data Mining, Neural Network and Genetic Programming
Intrinsic images and shape refinement
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
AUTOMATIC IMAGE ORIENTATION DETECTION
Comparing NARF and SIFT Key Point Extraction Algorithms
Depth estimation and Plane detection
Recognition using Nearest Neighbor (or kNN)
Lecture 5 Smaller Network: CNN
Real-Time Human Pose Recognition in Parts from Single Depth Image
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Common Classification Tasks
Pearson Lanka (Pvt) Ltd.
Wrap-up Computer Vision Spring 2018, Lecture 28
Efficient Image Classification on Vertically Decomposed Data
Paper Presentation Aryeh Zapinsky
Fall 2012 Longin Jan Latecki
Dog/Cat Classifier Christina Stiff.
Pose Estimation for non-cooperative Spacecraft Rendevous using CNN
Outline Background Motivation Proposed Model Experimental Results
Source: Pattern Recognition Vol. 38, May, 2005, pp
Depth Aware Inpainting for Novel View Synthesis Jayant Thatte
ECE734 Project-Scale Invariant Feature Transform Algorithm
Support vector machine-based text detection in digital video
Wrap-up Computer Vision Spring 2019, Lecture 26
Adding Surface Detail 고려대학교 컴퓨터 그래픽스 연구실.
Adding Surface Detail 고려대학교 컴퓨터 그래픽스 연구실.
VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION
Semantic Segmentation
Directional Occlusion with Neural Network
Presentation transcript:

RGB-D Image for Scene Recognition by Jiaqi Guo

Outline Introduction Dataset & Related Works Method & Metrics Future Direction

Outline Introduction Dataset & Related Works Method & Metrics Future Direction

What is RGB-D? Monocular vs. Binocular Missing Information in Monocular Systems -- Depth RGB Image Depth Image

Depth Sensors Distance between Camera and Object Using Structured Light or ToL

Goal: Explore how depth information would help in scene recognition

Outline Introduction Dataset & Related Works Method & Metrics Future Direction

RGB-D Scene Dataset

SUN RGB-D 3D Bounding Boxes, 2D Polygons & Room Layout

Related Works on RGB Scene Feature Selection Manually Scale Invariant Feature Transform (SIFT) Shape, Illumination and Reflectance from Shading (SIRFS) Semantic Classes detected by Object Recognition Automatically Using Places convolutional neural network (Places- CNN) Features are automatically selected in different trained layers

Features Selected by Places-CNN

Outline Introduction Dataset & Related Works Method & Metrics Future Direction

Explore How Depth Information Fits In Better Segmentation Algorithms for Object Detection Complementary Information for Other Scene Features Such as Texture, Illumination and Shading

Explore How Depth Information Fits In Same ROI, Same Texture, Different Distance (Depth)

Method & Metrics Add depth information into labels when training, this would further divide the original classes into sub-groups For each pixel, classifiers are decided according to its depth value Separate dataset into halves and conduct controlled experiment Control Group: Desired feature extracted from original object classes Experimental Group: Desired feature extracted from subdivided objects groups Evaluation would be conducted on test set for each group

Outline Introduction Dataset & Related Works Method & Metrics Future Direction

Future Direction Whether color information is redundant Need to determine the best range when subdividing depth Depth map improvement: Denoise and fill gaps Depth map improvement: Improve resolution and depth limit

Thank you for listening !