EE591f Digital Video Processing

Slides:



Advertisements
Similar presentations
Active Shape Models Suppose we have a statistical shape model –Trained from sets of examples How do we use it to interpret new images? Use an “Active Shape.
Advertisements

Texture Synthesis on [Arbitrary Manifold] Surfaces Presented by: Sam Z. Glassenberg* * Several slides borrowed from Wei/Levoy presentation.
MPEG4 Natural Video Coding Functionalities: –Coding of arbitrary shaped objects –Efficient compression of video and images over wide range of bit rates.
LI Hong-ye, LIU Ming-jun, ZHANG Zhi-qiang MINES
A Performance Analysis of the ITU-T Draft H.26L Video Coding Standard Anthony Joch, Faouzi Kossentini, Panos Nasiopoulos Packetvideo Workshop 2002 Department.
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
Face Alignment with Part-Based Modeling
MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing for Interactivity High compression Scalability of audio.
On Constrained Optimization Approach To Object Segmentation Chia Han, Xun Wang, Feng Gao, Zhigang Peng, Xiaokun Li, Lei He, William Wee Artificial Intelligence.
Ai-Mei Huang And Truong Nguyen Image processing, 2006 IEEE international conference on Motion vector processing based on residual energy information for.
Computer Vision Optical Flow
1 Adaptive slice-level parallelism for H.264/AVC encoding using pre macroblock mode selection Bongsoo Jung, Byeungwoo Jeon Journal of Visual Communication.
K.-S. Choi and S.-J. Ko Sch. of Electr. Eng., Korea Univ., Seoul, South Korea IEEE, Electronics Letters Issue Date : June Hierarchical Motion Estimation.
2009/04/07 Yun-Yang Ma.  Overview  What is CUDA ◦ Architecture ◦ Programming Model ◦ Memory Model  H.264 Motion Estimation on CUDA ◦ Method ◦ Experimental.
Yen-Lin Lee and Truong Nguyen ECE Dept., UCSD, La Jolla, CA Method and Architecture Design for Motion Compensated Frame Interpolation in High-Definition.
Novel Point-Oriented Inner Searches for Fast Block Motion Lai-Man Po, Chi-Wang Ting, Ka-Man Wong, and Ka-Ho Ng IEEE TRANSACTIONS ON MULTIMEDIA, VOL.9,
Mobile Motion Tracking using Onboard Camera Lam Man Kit CEG Wong Yuk Man CEG.
Department of Computer Engineering University of California at Santa Cruz Video Compression Hai Tao.
MultiHypothesis Pictures For H.26L Markus Flierl Telecommunications Laboratory University of Erlangen-Nuremberg Erlangen, Germany
An Efficient Low Bit-Rate Video-coding Algorithm Focusing on Moving Regions Kwok-Wai Wong, Kin-Man Lam, Wan-Chi Siu IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS.
Computing motion between images
MOTION ESTIMATION An Overview BY: ABHISHEK GIROTRA Trainee Design Engineer.
Stereo and Multiview Sequence Processing. Outline Stereopsis Stereo Imaging Principle Disparity Estimation Intermediate View Synthesis Stereo Sequence.
A New Diamond Search Algorithm for Fast Block- Matching Motion Estimation Shan Zhu and Kai-Kuang Ma IEEE TRANSACTIONS ON IMAGE PROCESSION, VOL. 9, NO.
Image (and Video) Coding and Processing Lecture: Motion Compensation Wade Trappe Most of these slides are borrowed from Min Wu and KJR Liu of UMD.
Investigation of Motion-Compensated Lifted Wavelet Transforms Information Systems Laboratory Department of Electrical Engineering Stanford University Markus.
EE569 Digital Video Processing
1 Numerical geometry of non-rigid shapes Non-Euclidean Embedding Non-Euclidean Embedding Lecture 6 © Alexander & Michael Bronstein tosca.cs.technion.ac.il/book.
Jump to first page The research report Block matching algorithm Motion compensation Spatial transformation Xiaomei Yu.
Video Compression Concepts Nimrod Peleg Update: Dec
EE392J Final Project, March 20, Multiple Camera Object Tracking Helmy Eltoukhy and Khaled Salama.
 Coding efficiency/Compression ratio:  The loss of information or distortion measure:
Interframe Coding Heejune AHN Embedded Communications Laboratory Seoul National Univ. of Technology Fall 2008 Last updated
國立屏東商業技術學院 資訊工程系 ( 所 ) 多媒體技術發展實驗室 Laboratory of Multimedia Technology Development Department of Computer Science and Information Engineering Nation Pingtung.
1 Efficient Reference Frame Selector for H.264 Tien-Ying Kuo, Hsin-Ju Lu IEEE CSVT 2008.
Adaptive Multi-path Prediction for Error Resilient H.264 Coding Xiaosong Zhou, C.-C. Jay Kuo University of Southern California Multimedia Signal Processing.
EE591f Digital Video Processing 1 Roadmap Introduction Intra-frame coding –Review of JPEG Inter-frame coding –Conditional Replenishment (CR) –Motion Compensated.
H.264/AVC 基於影像複雜度與提早結束之快速 階層運動估計方法 Content-Based Hierarchical Fast Motion Estimation with Early Termination in H.264/AVC 研究生:何銘哲 指導教授:蔣依吾博士 中山大學資訊工程學系.
Sub pixel motion estimation for Wyner-Ziv side information generation Subrahmanya M V (Under the guidance of Dr. Rao and Dr.Jin-soo Kim)
An Efficient Search Strategy for Block Motion Estimation Using Image Features Digital Video Processing 1 Term Project Feng Li Michael Su Xiaofeng Fan.
1 University of Texas at Austin Machine Learning Group 图像与视频处理 计算机学院 Motion Detection and Estimation.
MOTION ESTIMATION IMPLEMENTATION IN RECONFIGURABLE PLATFORMS
Compression of Real-Time Cardiac MRI Video Sequences EE 368B Final Project December 8, 2000 Neal K. Bangerter and Julie C. Sabataitis.
Pyramidal Implementation of Lucas Kanade Feature Tracker Jia Huang Xiaoyan Liu Han Xin Yizhen Tan.
Ai-Mei Huang, Student Member, IEEE, and Truong Nguyen, Fellow, IEEE.
Raquel A. Romano 1 Scientific Computing Seminar May 12, 2004 Projective Geometry for Computer Vision Projective Geometry for Computer Vision Raquel A.
Page 11/28/2016 CSE 40373/60373: Multimedia Systems Quantization  F(u, v) represents a DCT coefficient, Q(u, v) is a “quantization matrix” entry, and.
Video Compression and Standards
Motion Estimation Presented By: Dr. S. K. Singh Department of Computer Engineering, Indian Institute of Technology (B.H.U.) Varanasi
Motion Estimation Multimedia Systems and Standards S2 IF Telkom University.
BLOCK BASED MOTION ESTIMATION. Road Map Block Based Motion Estimation Algorithms. Procedure Of 3-Step Search Algorithm. 4-Step Search Algorithm. N-Step.
Hierarchical Systolic Array Design for Full-Search Block Matching Motion Estimation Noam Gur Arie,August 2005.
Principles of Video Compression Dr. S. M. N. Arosha Senanayake, Senior Member/IEEE Associate Professor in Artificial Intelligence Room No: M2.06
MOTION Model. Road Map Motion Model Non Parametric Motion Field : Algorithms 1.Optical flow field estimation. 2.Block based motion estimation. 3.Pel –recursive.
1שידור ווידיאו ואודיו ברשת האינטרנט Dr. Ofer Hadar Communication Systems Engineering Department Ben-Gurion University of the Negev URL:
Ai-Mei Huang And Truong Nguyen Image processing, 2006 IEEE international conference on Motion vector processing based on residual energy information for.
Video Coding: Block Motion Estimation Trac D. Tran ECE Department The Johns Hopkins University Baltimore MD
Multi-Frame Motion Estimation and Mode Decision in H.264 Codec Shauli Rozen Amit Yedidia Supervised by Dr. Shlomo Greenberg Communication Systems Engineering.
Video Compression Video : Sequence of frames Each Frame : 2-D Array of Pixels Video: 3-D data – 2-D Spatial, 1-D Temporal Video has both : – Spatial Redundancy.
CMPT365 Multimedia Systems 1 Media Compression - Video Spring 2015 CMPT 365 Multimedia Systems.
Dr. Ofer Hadar Communication Systems Engineering Department
Anisotropic Double Cross Search Algorithm using Multiresolution-Spatio-Temporal Context for Fast Lossy In-Band Motion Estimation Yu Liu and King Ngi Ngan.
POSTPROCESSING Review analysis results and evaluate the performance
Mesh Parameterization: Theory and Practice
MPEG4 Natural Video Coding
Filtering Things to take away from this lecture An image as a function
Foundation of Video Coding Part II: Scalar and Vector Quantization
Image and Video Processing
Bongsoo Jung, Byeungwoo Jeon
LSH-based Motion Estimation
Presentation transcript:

EE591f Digital Video Processing Roadmap Introduction to Block Matching Algorithm Fast BMA Three classes of speed-up strategies Generalized BMA From integer-pel to fractional-pel From fixed block size to variable block size Deformable BMA (DBMA) or mesh-based BMA Experimental results How do block size and motion accuracy affect the MCP efficiency? EE591f Digital Video Processing

An Intuitive Way of Understanding Block Matching Algorithm (BMA) ? b e template f database EE591f Digital Video Processing

Block Matching in Motion Estimation d reference frame inquiry block in current frame a: (-3,-2) b: (-3,-1) c: (0,0) d: (1,2) EE591f Digital Video Processing

Motion Estimation and Compensation Motion Compensation With the estimated motion vector, the block in the reference frame is displaced to generate a prediction of the inquiry block in the current frame. Such procedure is called “motion compensation”. Motion Compensated Prediction (MCP) residues current frame displaced reference frame (d1,d2) : estimated motion vector EE591f Digital Video Processing

EE591f Digital Video Processing Two Key Elements in BMA Matching criterion: How do I measure the similarity between two blocks? Mean Square Error (MSE): L2 norm Mean Absolute Difference (MAD): L1 norm Search strategy: How do I find the best match of the given block? Exhaustive search: global minimum Non-exhaustive search: close to global minimum EE591f Digital Video Processing

Goal: Find the Best Tradeoff variance of MCP residues computational cost EE591f Digital Video Processing

EE591f Digital Video Processing Roadmap Introduction to Block Matching Algorithm Fast BMA Three classes of speed-up strategies Generalized BMA From fixed block size to variable block size From integer-pel to fractional-pel Experimental results How do block size and motion accuracy affect the MCP efficiency? EE591f Digital Video Processing

Benchmark: Exhaustive Search An example of window size T=7 It searches (2T+1)2=225 points in total EE591f Digital Video Processing

Fast Block Matching Algorithms Class-A (I-IV): ad-hoc speed-up strategies Class-B (V-VII): advanced speed-up strategies (wise use of computational resource to account for probabilities) Class-C (VIII): hierarchical strategy General Principle – trade complexity with performance EE591f Digital Video Processing

Fast BMA (I): 3-Step-Search 25 points EE591f Digital Video Processing

Fast BMA (II): Logarithmic Search search at most 5+4+2+3+2+8= 24 points EE591f Digital Video Processing

Fast BMA (III): Orthogonal Search search at most 2(3+2+2+2+2+2)= 26 points EE591f Digital Video Processing

Fast BMA (IV): Cross Search search at most 5+4+4+4= 17 points EE591f Digital Video Processing

Why does probabilistic modeling of MV help? Empirical pdf of motion vectors EE591f Digital Video Processing

Fast BMA (V): New 3-Step Search EE591f Digital Video Processing

New 3-Step Search: Examples EE591f Digital Video Processing

Fast BMA (VI): 4-Step Search Search the 9 checking points located at a 5-by-5 window to see if the point reaching the minimum distortion is found at the center? N Y N Is it at the corner or not? Search 5 additional Checking points Search 3 additional Checking points Y Repeat the procedure in the dashed box Final 3-by-3 search EE591f Digital Video Processing

4-Step Search: Examples EE591f Digital Video Processing

The Idea of Successive Refinement Note that in all previous approaches to fast BMA, we only consider the possibility of reducing the number of search points For each search point, we still need to calculate the matching criterion for a B-times-B block To further reduce the complexity, we might consider reducing the cost of each matching as well EE591f Digital Video Processing

Multi-resolution Representation of Images Multi-resolution representation by pyramid EE591f Digital Video Processing

EE591f Digital Video Processing Why does Hierarchical Strategy Help? Level-2 ME result Level-1 ME result Level-0 EE591f Digital Video Processing

Hierarchical Block Matching Algorithm (HBMA) EE591f Digital Video Processing

EE591f Digital Video Processing

EE591f Digital Video Processing Example: Three-level HBMA EE591f Digital Video Processing

Fast BMA (VIII): Hierarchical Search EE591f Digital Video Processing

EE591f Digital Video Processing Summary Why do we care fast BMA? Driven by the application demands of video coding Can we go beyond BMA? The block-based constraint is simple but not appropriate for accounting for arbitrary shape of moving objects The integer-pel accuracy is not sufficient to account for continuous nature of motion EE591f Digital Video Processing

EE591f Digital Video Processing Roadmap Introduction to Block Matching Algorithm Fast BMA Three classes of speed-up strategies Generalized BMA From integer-pel to fractional-pel From fixed block size to variable block size Deformable BMA (DBMA) or mesh-based BMA* Experimental results How do block size and motion accuracy affect the MCP efficiency? EE591f Digital Video Processing

Why Do We Need Fraction-pel? EE591f Digital Video Processing

EE591f Digital Video Processing Fractional-pel BMA 2N N linear interpolation 2M M original reference frame interpolated reference frame EE591f Digital Video Processing

EE591f Digital Video Processing Half-pel BMA 1 1 1 1 current frame digits indicate physical distances reference frame EE591f Digital Video Processing

Bilinear Interpolation (x,y) (x+1,y) (2x,2y) (2x+1,2y) (2x,2y+1) (2x+1,2y+1) (x,y+!) (x+1,y+1) O[2x,2y]=I[x,y] O[2x+1,2y]=(I[x,y]+I[x+1,y])/2 O[2x,2y+1]=(I[x,y]+I[x,y+1])/2 O[2x+1,2y+1]=(I[x,y]+I[x+1,y]+I[x,y+1]+I[x+1,y+1])/4 Generalize to 1/K pixel where K >2 EE591f Digital Video Processing

Hierarchical Strategy for Half-pel BMA Integer-pel Half-pel EE591f Digital Video Processing

Beyond Half-pel Accuracy There exist results supporting the further prediction efficiency gain from half-pel to quarter-pel; sometimes it is even worthwhile to reach 1/8-pel accuracy The improved prediction efficiency is comprised by modestly increased computational complexity and overhead Question: for what kind of video, finer-accuracy improves the MCP efficiency most? EE591f Digital Video Processing

Generalizations of BMA Variable block-size matching algorithms Widely used by various video coding standards H.264 includes three variable block sizes: 4-by-4, 8-by-8 and 16-by-16 Fractional-pel accuracy BMA Half-pel : MPEG-1/2/4, H.263/H.263+ Quarter-pel: H.264 (even 1/8-pel) Tradeoff between overhead on motion and MCP efficiency EE591f Digital Video Processing

Variable Block-size BMA 16-by-16 8-by-8 4-by-4 EE591f Digital Video Processing

BMA Strategy Adopted by H.263 Macroblock level Block level EE591f Digital Video Processing

BMA Strategy Adopted by H.264 Note: require overhead to signal which partition is adopted by the encoder EE591f Digital Video Processing

Deformable Block Matching Algorithm EE591f Digital Video Processing

EE591f Digital Video Processing Overview of DBMA Three steps: Partition the anchor frame into regular blocks Model the motion in each block by a more complex motion The 2-D motion caused by a flat surface patch undergoing rigid 3-D motion can be approximated well by projective mapping Projective Mapping can be approximated by affine mapping and bilinear mapping Estimate the motion parameters block by block independently Discontinuity problem cross block boundaries still remain EE591f Digital Video Processing

Affine and Bilinear Model Affine (6 parameters): Good for mapping triangles to triangles Bilinear (8 parameters): Good for mapping blocks to quadrangles EE591f Digital Video Processing

Mesh-Based Motion Estimation A control grid is used to partition a frame into non-overlapping polygon elements. The nodal motion is constrained so that a feasible mesh is still formed with the motion. (a) Using a triangular mesh (b) Using a quadrilateral mesh EE591f Digital Video Processing

Mesh-based vs Block-based (a) block-based ME (b) mesh-based ME (c) mesh-based motion tracking EE591f Digital Video Processing

EE591f Digital Video Processing Example: BMA vs Mesh-based Target Anchor EBMA (half-pel) (29.86dB) Predicted EE591f Digital Video Processing Mesh-based method (29.72dB)

EE591f Digital Video Processing Roadmap Introduction to Block Matching Algorithm Fast BMA Three classes of speed-up strategies Generalized BMA From fixed block size to variable block size From integer-pel to fractional-pel Experimental results How do block size and motion accuracy affect the MCP efficiency? EE591f Digital Video Processing

EE591f Digital Video Processing Experiment Results Frame #1 Frame #2 EE591f Digital Video Processing

Motion-Compensated Prediction Residues 16-by-16 block, integer-pel, var(e)=271.8 EE591f Digital Video Processing

EE591f Digital Video Processing Motion-Compensated Prediction Residues 8-by-8 block, integer-pel, var(e)=220.8 EE591f Digital Video Processing

EE591f Digital Video Processing Motion-Compensated Prediction Residues 16-by-16 block, half-pel, var(e)=164.2 EE591f Digital Video Processing

EE591f Digital Video Processing Motion-Compensated Prediction Residues 8-by-8 block, half-pel, var(e)=123.8 EE591f Digital Video Processing