Implementing GIST on the GPU. Refrence Original Work  Aude Oliva, Antonio Torralba  Modeling the shape of the scene: a holistic representation of the.

Slides:



Advertisements
Similar presentations
Distinctive Image Features from Scale-Invariant Keypoints
Advertisements

Computer Vision, Part 1. Topics for Vision Lectures 1.Content-Based Image Retrieval (CBIR) 2.Object recognition and scene understanding.
Human Detection Phanindra Varma. Detection -- Overview  Human detection in static images is based on the HOG (Histogram of Oriented Gradients) encoding.
Evaluating Color Descriptors for Object and Scene Recognition Koen E.A. van de Sande, Student Member, IEEE, Theo Gevers, Member, IEEE, and Cees G.M. Snoek,
November 12, 2013Computer Vision Lecture 12: Texture 1Signature Another popular method of representing shape is called the signature. In order to compute.
Computer vision: models, learning and inference Chapter 13 Image preprocessing and feature extraction.
Instructor: Mircea Nicolescu Lecture 15 CS 485 / 685 Computer Vision.
ECE 562 Computer Architecture and Design Project: Improving Feature Extraction Using SIFT on GPU Rodrigo Savage, Wo-Tak Wu.
November 4, 2014Computer Vision Lecture 15: Shape Representation II 1Signature Another popular method of representing shape is called the signature. In.
Small Codes and Large Image Databases for Recognition CVPR 2008 Antonio Torralba, MIT Rob Fergus, NYU Yair Weiss, Hebrew University.
Fast and Compact Retrieval Methods in Computer Vision Part II A. Torralba, R. Fergus and Y. Weiss. Small Codes and Large Image Databases for Recognition.
Statistics of Natural Image Categories Antonio Torralba and Aude Oliva. Network: Computation in Neural Systems, 14(2003) Jonathan Huang
Distinctive Image Feature from Scale-Invariant KeyPoints
Object Recognition Using Distinctive Image Feature From Scale-Invariant Key point D. Lowe, IJCV 2004 Presenting – Anat Kaspi.
Scale Invariant Feature Transform (SIFT)
A Novel 2D To 3D Image Technique Based On Object- Oriented Conversion.
Creating and Exploring a Large Photorealistic Virtual Space INRIA / CSAIL / Adobe First IEEE Workshop on Internet Vision, associated with CVPR 2008.
Overview Introduction to local features
Computer vision.
Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.
Object Bank Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 4 th, 2013.
General Tensor Discriminant Analysis and Gabor Features for Gait Recognition by D. Tao, X. Li, and J. Maybank, TPAMI 2007 Presented by Iulian Pruteanu.
Overview Harris interest points Comparing interest points (SSD, ZNCC, SIFT) Scale & affine invariant interest points Evaluation and comparison of different.
Reporter: Fei-Fei Chen. Wide-baseline matching Object recognition Texture recognition Scene classification Robot wandering Motion tracking.
Texture. Texture is an innate property of all surfaces (clouds, trees, bricks, hair etc…). It refers to visual patterns of homogeneity and does not result.
Intelligent Vision Systems ENT 496 Object Shape Identification and Representation Hema C.R. Lecture 7.
September 17, 2013Computer Vision Lecture 5: Image Filtering 1ColorRGB HSI.
Automated Target Recognition Using Mathematical Morphology Prof. Robert Haralick Ilknur Icke José Hanchi Computer Science Dept. The Graduate Center of.
December 9, 2014Computer Vision Lecture 23: Motion Analysis 1 Now we will talk about… Motion Analysis.
Pulkit Agrawal Y7322 BVV Sri Raj Dutt Y7110 Sushobhan Nayak Y7460.
Modeling the Shape of a Scene: Seeing the trees as a forest Scene Understanding Seminar
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
CSC508 Convolution Operators. CSC508 Convolution Arguably the most fundamental operation of computer vision It’s a neighborhood operator –Similar to the.
Histograms of Oriented Gradients for Human Detection(HOG)
Edge Detection and Geometric Primitive Extraction Jinxiang Chai.
CS654: Digital Image Analysis Lecture 36: Feature Extraction and Analysis.
Review on Graphics Basics. Outline Polygon rendering pipeline Affine transformations Projective transformations Lighting and shading From vertices to.
Content-Based Image Retrieval QBIC Homepage The State Hermitage Museum db2www/qbicSearch.mac/qbic?selLang=English.
Course14 Dynamic Vision. Biological vision can cope with changing world Moving and changing objects Change illumination Change View-point.
Distinctive Image Features from Scale-Invariant Keypoints
SIFT DESCRIPTOR K Wasif Mrityunjay
A Statistical Approach to Texture Classification Nicholas Chan Heather Dunlop Project Dec. 14, 2005.
Line Matching Jonghee Park GIST CV-Lab..  Lines –Fundamental feature in many computer vision fields 3D reconstruction, SLAM, motion estimation –Useful.
SUN Database: Large-scale Scene Recognition from Abbey to Zoo Jianxiong Xiao *James Haysy Krista A. Ehinger Aude Oliva Antonio Torralba Massachusetts Institute.
Digital Image Processing CSC331
Image Features (I) Dr. Chang Shu COMP 4900C Winter 2008.
The SUN Database Slides by Jennifer Baulier. What is the SUN database? Scene Understanding Database Scene = a place humans could act within Full database.
Recognizing specific objects Matching with SIFT Original suggestion Lowe, 1999,2004.
WLD: A Robust Local Image Descriptor Jie Chen, Shiguang Shan, Chu He, Guoying Zhao, Matti Pietikäinen, Xilin Chen, Wen Gao 报告人:蒲薇榄.
Independent Component Analysis features of Color & Stereo images Authors: Patrik O. Hoyer Aapo Hyvarinen CIS 526: Neural Computation Presented by: Ajay.
Correspondence and Stereopsis. Introduction Disparity – Informally: difference between two pictures – Allows us to gain a strong sense of depth Stereopsis.
Blob detection.
SIFT Scale-Invariant Feature Transform David Lowe
Recognition of biological cells – development
Lecture 07 13/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
3D Graphics Rendering PPT By Ricardo Veguilla.
Mean Shift Segmentation
Feature description and matching
Fourier Transform: Real-World Images
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Level Set Tree Feature Detection
Computer Vision Lecture 16: Texture II
Parallel Spectral Renderer
RGB-D Image for Scene Recognition by Jiaqi Guo
Magnetic Resonance Imaging
Feature descriptors and matching
SIFT SIFT is an carefully designed procedure with empirically determined parameters for the invariant and distinctive features.
Finding Line and Curve Segments from Edge Images
Perspective Photos.
Introduction to Artificial Intelligence Lecture 22: Computer Vision II
Presentation transcript:

Implementing GIST on the GPU

Refrence Original Work  Aude Oliva, Antonio Torralba  Modeling the shape of the scene: a holistic representation of the spatial envelope  International Journal of Computer Vision, Vol. 42(3): ,  CPU Implementation  Torralba.pdf Our Work  Parallelize to work on the GPU

Introduction Recognition of real world scenes Spatial Envelope  A very low dimensional representation of the scene is called the Spatial Envelope.  Takes care of Naturalness Openness Roughness Expansion Ruggedness

Intuitive Notion Naturalness  Straight lines indicate man made structures  Crooked, rough lies indicate natural environment. Openness  Presence of horizon reflects highly open environment Roughness  Refers to the size of its major components.  Is correlated with the fractal dimension of the scene

Intuitive Notion contd… Degree of Expansion  Converging parallel lines allows to percept depth gradient of the space.  A flat view of a building would have a low DoE  A street with long vanishing lines would have a high DoE. Degree of Ruggedness  The deviation of the ground wrt horizon  Open environments have a flat horizontal ground level  Mountainous landscapes have rugged ground  Ruggedness produce oblique contours, hide the horizon.  Man-made environments built on flat ground, less rugged.

Basic Algorithm Create Gabor Filter Bank Preprocess image  Local contrast normalization  Local luminance invariance normalization for color images Descriptor  Convolve image with filters  For each filter Divide image into blocks Mean of each block is a number in feature vector

Parameters Involved Number of Scales  Typically = 3 Number of orientations per scale  Typically = 8 Image size  Typically 320x240 to 1024x1024

Ways to Parallelize On Number of scales * orientations per scale  Number of threads = 24  Less than typical image size On Image MxN Pixels  Number of threads = M*N >> 24.  High degree of parallelism.

Parallelization for creation of Gabor filter Pixel Level Threads calculating Gabor filter value

Parallelization for prefiltering image Pixel Level Fast Fourier Transforms Pixel by pixel Multiplications => Inherent Parallelism

RGB image components Filters Descriptor Division into blocks Calculating Descriptor

Graphs

Speed up Image SizeCPU Time ( ms )GPU Time ( ms )‏ 64 x x x x x

Thank You !!