Fast Census Transform-based Stereo Algorithm using SSE2

Slides:



Advertisements
Similar presentations
Improved Census Transforms for Resource-Optimized Stereo Vision
Advertisements

Streaming SIMD Extension (SSE)
Gratuitous Picture US Naval Artillery Rangefinder from World War I (1918)!!
Cuong Cao Pham and Jae Wook Jeon, Member, IEEE
Real-Time Accurate Stereo Matching using Modified Two-Pass Aggregation and Winner- Take-All Guided Dynamic Programming Xuefeng Chang, Zhong Zhou, Yingjie.
Adviser : Ming-Yuan Shieh Student ID : M Student : Chung-Chieh Lien VIDEO OBJECT SEGMENTATION AND ITS SALIENT MOTION DETECTION USING ADAPTIVE BACKGROUND.
Robust video fingerprinting system Daniel Luis
SURF: Speeded-Up Robust Features
Lecture 5 Template matching
CMPT-884 Jan 18, 2010 Video Copy Detection using Hadoop Presented by: Cameron Harvey Naghmeh Khodabakhshi CMPT 820 December 2, 2010.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
HCI Final Project Robust Real Time Face Detection Paul Viola, Michael Jones, Robust Real-Time Face Detetion, International Journal of Computer Vision,
Computing motion between images
Effective Gaussian mixture learning for video background subtraction Dar-Shyang Lee, Member, IEEE.
Augmented Reality: Object Tracking and Active Appearance Model
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
A Performance and Energy Comparison of FPGAs, GPUs, and Multicores for Sliding-Window Applications From J. Fowers, G. Brown, P. Cooke, and G. Stitt, University.
100+ Times Faster Weighted Median Filter [cvpr ‘14]
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 11, NOVEMBER 2011 Qian Zhang, King Ngi Ngan Department of Electronic Engineering, the Chinese university.
Viewpoint Tracking for 3D Display Systems A look at the system proposed by Yusuf Bediz, Gözde Bozdağı Akar.
Visualization- Determining Depth From Stereo Saurav Basu BITS Pilani 2002.
Real-Time Face Detection and Tracking Using Multiple Cameras RIT Computer Engineering Senior Design Project John RuppertJustin HnatowJared Holsopple This.
Jason Li Jeremy Fowers Ground Target Following for Unmanned Aerial Vehicles.
Introduction Belief propagation: known to produce accurate results for stereo processing/ motion estimation High storage requirements limit the use of.
Factor Graphs Young Ki Baik Computer Vision Lab. Seoul National University.
Stereo Matching Information Permeability For Stereo Matching – Cevahir Cigla and A.Aydın Alatan – Signal Processing: Image Communication, 2013 Radiometric.
GmImgProc Alexandra Olteanu SCPD Alexandru Ştefănescu SCPD.
Fast Approximate Energy Minimization via Graph Cuts
3D Fingertip and Palm Tracking in Depth Image Sequences
Mutual Information-based Stereo Matching Combined with SIFT Descriptor in Log-chromaticity Color Space Yong Seok Heo, Kyoung Mu Lee, and Sang Uk Lee.
資訊碩一 蔡勇儀  Introduction  Method  Background generation and updating  Detection of moving object  Shape control points.
A Local Adaptive Approach for Dense Stereo Matching in Architectural Scene Reconstruction C. Stentoumis 1, L. Grammatikopoulos 2, I. Kalisperakis 2, E.
Joint Depth Map and Color Consistency Estimation for Stereo Images with Different Illuminations and Cameras Yong Seok Heo, Kyoung Mu Lee and Sang Uk Lee.
A Non-local Cost Aggregation Method for Stereo Matching
Shape from Stereo  Disparity between two images  Photogrammetry  Finding Corresponding Points Correlation based methods Feature based methods.
Dynamic 3D Scene Analysis from a Moving Vehicle Young Ki Baik (CV Lab.) (Wed)
Stereo Object Detection and Tracking Using Clustering and Bayesian Filtering Texas Tech University 2011 NSF Research Experiences for Undergraduates Site.
DIEGO AGUIRRE COMPUTER VISION INTRODUCTION 1. QUESTION What is Computer Vision? 2.
A Frequency-Domain Approach to Registration Estimation in 3-D Space Phillip Curtis Pierre Payeur Vision, Imaging, Video and Autonomous Systems Research.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Experiments Irena Farberov,Andrey Terushkin. Stereo The word "stereo" comes from the Greek word "stereos" which means firm or solid. With stereo vision.
Visual SLAM Visual SLAM SPL Seminar (Fri) Young Ki Baik Computer Vision Lab.
1 Real-Time Stereo-Matching for Micro Air Vehicles Pascal Dufour Master Thesis Presentation.
Computer Vision Lecture #10 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.
Adaptive Rigid Multi-region Selection for 3D face recognition K. Chang, K. Bowyer, P. Flynn Paper presentation Kin-chung (Ryan) Wong 2006/7/27.
Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.
HONGIK UNIVERSITY School of Radio Science & Communication Engineering Visual Information Processing Lab Hong-Ik University School of Radio Science & Communication.
Implementation, Comparison and Literature Review of Spatio-temporal and Compressed domains Object detection. By Gokul Krishna Srinivasan Submitted to Dr.
Ning Sun, Hassan Mansour, Rabab Ward Proceedings of 2010 IEEE 17th International Conference on Image Processing September 26-29, 2010, Hong Kong HDR Image.
Learning to Detect Faces A Large-Scale Application of Machine Learning (This material is not in the text: for further information see the paper by P.
Solving for Stereo Correspondence Many slides drawn from Lana Lazebnik, UIUC.
Improved Census Transforms for Resource-Optimized Stereo Vision
Visual Odometry David Nister, CVPR 2004
John Morris Stereo Vision (continued) Iolanthe returns to the Waitemata Harbour.
Attila Kiss, Tamás Németh, Szabolcs Sergyán, Zoltán Vámossy, László Csink Budapest Tech Recognition of a Moving Object in a Stereo Environment Using a.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Fast face localization and verification J.Matas, K.Johnson,J.Kittler Presented by: Dong Xie.
Recognizing specific objects Matching with SIFT Original suggestion Lowe, 1999,2004.
數位三維視訊 楊 家 輝 Jar-Ferr Yang 電腦與通信工程研究所 電機工程學系 國立成功大學 Institute of Computer and Communication Engineering Department of Electrical Engineering National Cheng.
Local Stereo Matching Using Motion Cue and Modified Census in Video Disparity Estimation Zucheul Lee, Ramsin Khoshabeh, Jason Juang and Truong Q. Nguyen.
Video object segmentation and its salient motion detection using adaptive background generation Kim, T.K.; Im, J.H.; Paik, J.K.;  Electronics Letters 
An Implementation Method of the Box Filter on FPGA
Design for Embedded Image Processing on FPGAs
Paper – Stephen Se, David Lowe, Jim Little
핵심어 검출을 위한 단일 끝점 DTW 알고리즘 Yong-Sun Choi and Soo-Young Lee
Student: Wanli Ouyang (歐陽萬里) Supervisor: Prof. W.K. Cham
Efficient Image Classification on Vertically Decomposed Data
Efficient Deformable Template Matching for Face Tracking
Binocular Stereo Vision
Liyuan Li, Jerry Kah Eng Hoe, Xinguo Yu, Li Dong, and Xinqi Chu
Presentation transcript:

Fast Census Transform-based Stereo Algorithm using SSE2 Young Ki Baik* Kyoung Mu Lee Computer Vision Lab. School of Electrical Engineering and Computer Science Seoul National University

Contents Stereo Vision Census Transform Stereo Vision Fast approaches Experimental result Conclusion and Future work These are contents what I’ll present you. At first, I’ll introduce stereo vision and census transform stereo vision. After that, I’ll show you fast approaches for stereo vision and experimental result. Finally, I’ll finish my presentation with conclusion and future work.

Introduction What is the stereo vision? The stereo vision is the method to extract 3D information using image from different view points. Topographical survey Obstacle detection Object tracking Face recognition What is the stereo vision? Stereo vision is the method to extract 3D information using image from different view points. And It can be applied to Topographical survey, obstacle detection, object tracking, face recognition and so on.

Introduction Trade off of algorithms Algorithm for accurate results Complex computation and iteration Slow processing time Unable to realize real-time system Algorithm for fast processing time Simple computation and no iteration Fast processing time Unable to realize accurate system Stereo vision has the trade off. For accurate results, algorithm has complex computation and iteration, slow processing time. For these reason, algorithms are unable to realize real-time system. For fast processing time, algorithm has simple computation and no iteration, fast processing time. But it is unable to realize accurate system. Our approach, we will focus on fast processing algorithm for stereo vision.

Introduction Fast stereo vision algorithm Window size invariant method Box filtering method “Box-filtering techniques”, M.J.McDonnell (CGIP-81’) “Real time correlation-based stereo : algorithm, implementations and applications”, Olivier Faugeras , Zhengyou Zhang , … (Tech.Rep.RR-2013, INRIA,1993) Disparity range invariant method Rectangular subregioning method “Rectangular Subregioning and 3-D Maximum-Surface Techniques for Fast Stereo Matching”, Changming Sun (CVPR-2001) Parallel processing technique For the fast stereo vision algorithm, these 3 items can be considered. At first, window size invariant method or box filtering method was introduced from these references. Also, disparity range invariant method was introduced by Changming sun. Finally, parallel processing technique is used for fast stereo vision.

Introduction Problem Census transform Real images from grabbers can not assure of brightness consistency in corresponding region. Intensity correlation method is not proper for real images. Census transform Census transform has been evaluated as the method robust to radiometric distortion. J. Banks and P. Corke, "Quantitative evaluation of matching methods and validity measures for stereo vision," Int. J. Robotics Research, vol. 20, pp. 512-532, July 2001. Heiko Hirschmller, "Improvements in Real-Time Correlation Based Stereo Vision", Proceedings IEEE Workshop on Stereo and Multi-Baseline Vision, pp. 141-148, Kauai, Hawaii, December 2001. In common stereo vision algorithm, it does not consider following problem. The problem is that Real images from grabbers can not assure of brightness consistency in corresponding region. Therefore Intensity correlation method is not proper for real images. Generally, LOG filtering method are used for solving these problem. But, census transform has been evaluated as the method robust to radiometric distortion from these papers. So, we use census transform method in order to implement more accurate and fast stereo vision algorithm.

Census Transform Stereo Vision Census transform converts relative intensity difference to 0 or 1 and deforms 1 dimensional vector as much as window size of census transform. 210 159 99 86 39 198 170 32 67 47 45 78 102 98 30 40 33 115 109 31 26 130 121 1 X Census transform window (CTW) I’ll talk about census transform. Census transform converts relative intensity difference to 0 to 1 and deforms 1 dimensional vector as much as window size of census transform. And vector will be multiple proportion of 8, therefore we can use parallel processing effectively.

Census Transform Stereo Vision Result of census transform Census transform makes data of (image size * vector size). (Square size of CTW)-1 Height Width Height And, census transform makes data of image size by vector size. With this result… Width

Census Transform Stereo Vision Sum of Hamming distance The Hamming distance of two transformed vectors with correlation windows is used to find corresponding region. Sum of Hamming distance Disparity range The Hamming distance of two transformed vectors with correlation windows is used to find corresponding region. And we can obtain 3d information from this 3d space. This is census transform based stereo vision. Right census transformed vector Left census transformed vector 3D disparity space

Census Transform Stereo Vision Complexity of algorithm Census transform-based stereo vision (CTSV) has high complexity. N : Searching window size D : Disparity range C : Census transform window size Method Complexity Conventional stereo vision ND Census transform CND But census transform-based stereo vision has high complexity. As you can see in this table, complexity of census transform-based stereo vision is CND where N is searching window size. D is disparity range C is Census transform window size In order to overcome this slow processing time, we try to apply fast approaches to CTSV.

Fast Census Transform Stereo Vision Hamming distance Parallel processing - SSE2 8bit look-up table Parallel processing - SSE2 Correlation fast census transform-based stereo vision consists of these 3 parts. We use parallel processing for census transform, 8-bit look-up table or parallel processing for computing Hamming distance, Moving window technique and parallel processing for correlation. Moving window technique Parallel processing - SSE2

Fast Census Transform Stereo Vision SSE2 (Streaming SIMD Extension 2) 128-bit SIMD packed integer & floating point arithmetic operation Cache and memory management operation Continuous memory structure is required No advantages in separate data SSE2 offers 128-bit SIMD packed integer and floating point arithmetic operation and It provides cache and memory management operation. In order to use SSE2, continuous memory structure is required and there are no advantages in separate data.

Fast Census Transform Stereo Vision Fast approaches (census transform) Usage of parallel processing (SSE2) 16 pixels are loaded to XMM(SSE2 memory) and computed at once. For the census transform, we use SSE2. In this process, 16 pixels are loaded to XMM and computed at once.

Fast Census Transform Stereo Vision Fast approaches (sum of Hamming distance) Usage of 8bit look-up table (LUT) Parallel processing : SSE2 Parallel processing is faster than 8 bit LUT For computing sum of Hamming distance, we use 8 bit look up table or parallel processing. And, we can verify that parallel processing is faster than 8 bit look up table.

Fast Census Transform Stereo Vision Fast approaches (correlation) Moving window technique For correlation, we use moving window technique which considers near memory access with box filtering.

Fast Census Transform Stereo Vision Fast approaches (correlation) Combination of moving window and SIMD Moving window technique for x, y-axis SSE2 for d-axis In order to make faster approach for correlation, We combine moving window for x, y-axis and SSE2 for d-axis.

Experimental result Environment Condition System : Pentium-IV 2.4GHz Cache memory : 512Kbyte Camera : Stereo Mega-D (Videre design) Condition Image size : 320 x 240 gray stereo images Census transform window size :5x5, 7x7, 9x9 Disparity searching range : 32 Correlation window size : 11x11 I’ll show you experimental result. We use pentinu-4 2.4GHz and 512Kbyte cache memory, and stereo mega-d camera from videre design. We use 320x240 gray stereo images. And we test alternative census transform window size. Disparity searching range is 32. And correlation window size is 11x11.

Experimental result Detail processing time 32 disparity searching range Method Census Transform Window Time (msec) Frame Rate CT Cal HD Corr Total with SSE2 5 5.9 17.5 5.8 29.2 34.2 7 11.6 29.6 5.6 46.8 21.4 9 35.4 52.6 4.3 92.3 10.8 without 34.3 24.1 14.2 72.5 13.8 66.2 35.1 14.1 115.4 8.7 208.3 66.5 289.0 3.5 For the census transform-based stereo vision with SSE2, we can get 34.2 frame rate.

Experimental result Performance of census transform stereo vision And This is the graph of performance for census transform stereo vision

Conclusion and Future work Moving window technique reduces processing time to constant except in transforming stage SSE2 instructions reduces running time by 2.5 to 3 times Possibility for faster result Specialized Instruction 16 bit look-up table Fixed window size of census transform Future work Applying real-time approach to another stereo algorithm Combine stereo system to other applications We can verify that moving window technique reduces processing them to constant except in transforming stage And sse2 instructions reduces running time by 2.5 to 3 times. We can expect faster results with specialized instruction in sse2, 16 bit look-up table and fixed census transform window size. For future works, we will apply real-time approaches to another stereo algorithm and combine stereo system to other applications. This is end of my talk. Thank you for your attention. Is there any question?