SoC and FPGA Oriented High-quality Stereo Vision System

Slides:

Advertisements

Similar presentations

Improved Census Transforms for Resource-Optimized Stereo Vision

Advertisements

Aggregating local image descriptors into compact codes

Wavelets Fast Multiresolution Image Querying Jacobs et.al. SIGGRAPH95.

Spatial-Temporal Consistency in Video Disparity Estimation ICASSP 2011 Ramsin Khoshabeh, Stanley H. Chan, Truong Q. Nguyen.

 Understanding the Sources of Inefficiency in General-Purpose Chips.

M.S. Student, Hee-Jong Hong

Real-Time Accurate Stereo Matching using Modified Two-Pass Aggregation and Winner- Take-All Guided Dynamic Programming Xuefeng Chang, Zhong Zhou, Yingjie.

Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,

Yu-Han Chen, Tung-Chien Chen, Chuan-Yung Tsai, Sung-Fang Tsai, and Liang-Gee Chen, Fellow, IEEE IEEE CSVT

UNIVERSITY OF MASSACHUSETTS Dept

Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.

1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.

1 Single Reference Frame Multiple Current Macroblocks Scheme for Multiple Reference IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY Tung-Chien.

A Performance and Energy Comparison of FPGAs, GPUs, and Multicores for Sliding-Window Applications From J. Fowers, G. Brown, P. Cooke, and G. Stitt, University.

A Low-Power VLSI Architecture for Full-Search Block-Matching Motion Estimation Viet L. Do and Kenneth Y. Yun IEEE Transactions on Circuits and Systems.

Joint Histogram Based Cost Aggregation For Stereo Matching Dongbo Min, Member, IEEE, Jiangbo Lu, Member, IEEE, Minh N. Do, Senior Member, IEEE IEEE TRANSACTION.

Introduction Belief propagation: known to produce accurate results for stereo processing/ motion estimation High storage requirements limit the use of.

Stereo Matching Low-Textured Survey 1.Stereo Matching-Based Low-Textured Scene Reconstruction for Autonomous Land Vehicles (ALV) Navigation 2.A Robust.

Stereo Matching Information Permeability For Stereo Matching – Cevahir Cigla and A.Aydın Alatan – Signal Processing: Image Communication, 2013 Radiometric.

Wavelet-Based Multiresolution Matching for Content-Based Image Retrieval Presented by Tienwei Tsai Department of Computer Science and Engineering Tatung.

A Local Adaptive Approach for Dense Stereo Matching in Architectural Scene Reconstruction C. Stentoumis 1, L. Grammatikopoulos 2, I. Kalisperakis 2, E.

Takuya Matsuo, Norishige Fukushima and Yutaka Ishibashi

March 20, 2007 ISPD An Effective Clustering Algorithm for Mixed-size Placement Jianhua Li, Laleh Behjat, and Jie Huang Jianhua Li, Laleh Behjat,

Digital Image Processing CCS331 Relationships of Pixel 1.

Low-Power H.264 Video Compression Architecture for Mobile Communication Student: Tai-Jung Huang Advisor: Jar-Ferr Yang Teacher: Jenn-Jier Lien.

CS332 Visual Processing Department of Computer Science Wellesley College Binocular Stereo Vision Region-based stereo matching algorithms Properties of.

A Region Based Stereo Matching Algorithm Using Cooperative Optimization Zeng-Fu Wang, Zhi-Gang Zheng University of Science and Technology of China Computer.

1 Real-Time Stereo-Matching for Micro Air Vehicles Pascal Dufour Master Thesis Presentation.

Content-Based Image Retrieval Using Fuzzy Cognition Concepts Presented by Tienwei Tsai Department of Computer Science and Engineering Tatung University.

Computer Vision Lecture #10 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.

Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp

Finish Hardware Accelerated Voxel Coloring Anselmo A. Montenegro †, Luiz Velho †, Paulo Carvalho † and Marcelo Gattass ‡ †

Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.

2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )

Image-Based Segmentation of Indoor Corridor Floors for a Mobile Robot Yinxiao Li and Stanley T. Birchfield The Holcombe Department of Electrical and Computer.

A NOVEL METHOD FOR COLOR FACE RECOGNITION USING KNN CLASSIFIER

Spatiotemporal Saliency Map of a Video Sequence in FPGA hardware David Boland Acknowledgements: Professor Peter Cheung Mr Yang Liu.

A Multiresolution Symbolic Representation of Time Series Vasileios Megalooikonomou Qiang Wang Guo Li Christos Faloutsos Presented by Rui Li.

Improved Census Transforms for Resource-Optimized Stereo Vision

An Image Retrieval Approach Based on Dominant Wavelet Features Presented by Te-Wei Chiang 2006/4/1.

Recursive Architectures for 2DLNS Multiplication RESEARCH CENTRE FOR INTEGRATED MICROSYSTEMS - UNIVERSITY OF WINDSOR 11 Recursive Architectures for 2DLNS.

ELEC692 VLSI Signal Processing Architecture Lecture 12 Numerical Strength Reduction.

Hierarchical Systolic Array Design for Full-Search Block Matching Motion Estimation Noam Gur Arie,August 2005.

1 Munther Abualkibash University of Bridgeport, CT.

April 21, 2016Introduction to Artificial Intelligence Lecture 22: Computer Vision II 1 Canny Edge Detector The Canny edge detector is a good approximation.

A Plane-Based Approach to Mondrian Stereo Matching

Vision-Guided Humanoid Footstep Planning for Dynamic Environments

Hiba Tariq School of Engineering

An Implementation Method of the Box Filter on FPGA

Summary of “Efficient Deep Learning for Stereo Matching”

Semi-Global Matching with self-adjusting penalties

Paper – Stephen Se, David Lowe, Jim Little

Jure Zbontar, Yann LeCun

Efficient Image Classification on Vertically Decomposed Data

UNIVERSITY OF MASSACHUSETTS Dept

Models and Architectures

Models and Architectures

Enhanced-alignment Measure for Binary Foreground Map Evaluation

Efficient Image Classification on Vertically Decomposed Data

Anisotropic Double Cross Search Algorithm using Multiresolution-Spatio-Temporal Context for Fast Lossy In-Band Motion Estimation Yu Liu and King Ngi Ngan.

Sum of Absolute Differences Hardware Accelerator

Project P06441: See Through Fog Imaging

1CECA, Peking University, China

Fourier Transform of Boundaries

HALO-FREE DESIGN FOR RETINEX BASED REAL-TIME VIDEO ENHANCEMENT SYSTEM

SUBJECT : COMPUTER GRAPHICS

Presented by Mohammad Rashidujjaman Rifat Ph.D Student,

Scalable light field coding using weighted binary images

VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION

Computing the Stereo Matching Cost with a Convolutional Neural Network

Presentation transcript:

SoC and FPGA Oriented High-quality Stereo Vision System Yanzhe Li 1,2 , Kai Huang 1 , Luc Claesen 2 1 Institute of VLSI Design, Zhejiang University, China 2 Engineering Technology − Electronics−ICT Dept., Hasselt University, Belgium

Stereo Vision Introduction Algorithm Implementation Stereo vision is one of the most active research topics in computer vision and widely used in many applications. Intelligent surveillance 3D video conferencing autonomous vehicles mobile robots Intelligent surveillance 3D video conferencing mobile robots Autonomous vehicles 2018/9/192018/9/19

Stereo Matching Introduction Algorithm Implementation Stereo matching is a complicated and time-consuming procedure. Considering that many applications often require high-quality performance and real-time processing speed, hardware acceleration of stereo matching algorithms is inevitable. DSPs: limited by the computational ability and fail to support real-time processing. GPUs: always result in excessive power consumption. FPGAs and ASICs: provide a balance between computational power and energy efficiency. Stereo matching, which is treated as the key role in a stereo vision system, takes a pair of rectified images, estimates the movement of each pixel between two images and displays the associated movement in a disparity map. 2018/9/192018/9/19

Stereo Matching Algorithm Introduction Algorithm Implementation Stereo Matching Algorithm Cost Calculation Cost Aggregation Disparity Selection Disparity Refinement In this paper, the mini-census and segmentation-based ADSW algorithms are combined to achieve a high matching accuracy. Different from many hardware designs without refinement, we present a disparity refinement step with segmentation information and find that it could improve the quality of initial disparity maps significantly. Cost calculation, cost aggregation, disparity selection and disparity refinement are four well-defined steps in stereo matching algorithms. Here we utilize the mini-census transform in the cost calculation step and the segmentation based ADSW algorithm in the cost aggregation step. The refinement step consists of three stages: consistency check, disparity voting and invalid disparity inpainting. Finally, disparity maps of both images are finished simultaneously. 2018/9/192018/9/19

Hardware-oriented Optimization Introduction Algorithm Implementation Some optimizations are introduced to make the algorithm more hardware- compatible. Converting color representation A scale-and-truncate approximation of the weight function A method called AWDE is introduced A vertical-horizontal approach in disparity voting AWDE method To reduce computation complexity and make the algorithm more hardware-compatible, we propose some optimizations shown in the shaded blocks. Luminance (Y) color representation is adopted instead of CIELAB color representation. Manhattan distance is used rather than Euclidean distance to avoid square root computations. A scale-and-truncate approximation of the weight function is proposed. A method called AWDE is introduced. It uses different window sizes for different textures on the image and determines the window size by the mean absolute deviation (MAD) of the pixel in the center of 7x7 block. To determine the most frequent disparity value in the window efficiently, a vertical-horizontal approach is applied. It firstly searches for the majority disparity in each column vertically, then picks out the majority one horizontally as the final disparity. 2018/9/192018/9/19 Weight function

Hardware Architecture Algorithm Implementation Experiments A hardware architecture is designed for the proposed algorithm. The fully pipelined architecture can be decomposed into three stages: pre-processing stereo-matching post-processing The key to improve throughput is to be a fully pipelined architecture, which can be decomposed into three stages: pre-processing, stereo-matching and post-processing. The pre-processing stage does not deal with any disparity information and performs pixel-based operations on each pixel for both images independently. The stereo-matching stage operates on the transformed data and computes disparity maps. Here the computing structure combines the disparity-level parallelism with the pixel-level parallelism for cost aggregation. The post-processing stage is implemented to refine the initial disparity maps when they are ready. 2018/9/192018/9/19

Hardware Architecture Algorithm Implementation Experiments Pre-processing: a register matrix employed to provide pipelined window Stereo-matching: Pdis x Prow in parallel Post-processing: consistency check, disparity voting and invalid disparity inpainting In the pre-processing stage, as the operations are all window-based, register matrix is employed to provide pipelined window content. The whole register matrix of 7 × 7 is used for window size determination. Meanwhile the six green ones are used for the mini-census transform and the center column of red ones are used for segmentation. Finally, a result of 12 (2+6+4) bits is written into the line buffer. In the stereo-matching stage, the weight generator receives segmentation information from both images depending on the window size of the center pixel and computes weight coefficients. The circuit that generates the weight coefficient of a single pixel is shown. The cost aggregator calculates the Hamming distances as matching costs and shifts them with the according weights. The final aggregated cost is computed by summing the weighted scores using a tree adder, then normalizing it. In the post-processing stage, the consistency check module compares the initial disparity maps and segmentation information in order to generate one more bit which labels each disparity as valid or not. The disparity voting module updates the center disparity in the 25 × 25 support window using the vertical-first method. Here, a bitwise fast voting technique is applied to handle each column. It drives each bit of the most frequent disparity independently from the other bits so that the hardware cost depends on the number of disparity bits in binary. While counting bit votes, the valid information must be taken into account. 2018/9/192018/9/19

Experiments Algorithm Implementation Experiments To discuss the quality of our design, the disparity maps are evaluated based on the Middlebury benchmarks. Results of high-definition images The table lists the accuracy comparison with some state-of-art implementations. The error rate is the overall error rate with the tolerance of one disparity level. The comparison shows that the accuracy of our design is not only among the best in hardware accelerated stereo systems, but also competitive with current software implementations. The refinement step contributes significantly to the final disparity maps and many visual improvements are obvious, including the elimination of speckle noise, few errors at the image borders and sharply delineated edges. To further evaluate the proposed algorithm, some high definition images are used. Here the image pair of Art is at a 1110 × 1390 resolution for a 128-pixel disparity range. Accuracy comparison on Middlebury benchmarks 2018/9/192018/9/19 2018/9/192018/9/19 True disparity maps and experimental results

Conclusion and Future Work We proposed a stereo matching algorithm based on the mini-census transform and segmentation-based ADSW. The disparity refinement step with segmentation information has been presented. A fully pipelined and scalable hardware architecture is designed with hardware-oriented optimizations. Future Work A part of global matching algorithms can be introduced into the algorithm. 2018/9/192018/9/19

Thank You Yanzhe Li Luc Claesen liyz<at>vlsi.zju.edu.cn luc.claesen<at>uhasselt.be