Video Coding Using Spatially Varying Transform. Cixun Zhang, Kemal Ugur, Jani Lainema, Antti Hallapuro and Moncef. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM.

Video Coding Using Spatially Varying Transform
Cixun Zhang, Kemal Ugur, Jani Lainema, Antti Hallapuro and Moncef
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 21, NO. 2, FEBRUARY 2011

Outline
Introduction
SVT (spatially varying transform)
– Selection of SVT block size
– Selection and coding of candidate LPs
– Filtering of SVT block boundaries
Implementing SVT in H.264/AVC
FSVT
Experimental results
Conclusion

Introduction
Why SVT? Some drawbacks of H.264/AVC and other standards:
– Most standards do not align the underlying transform with the possible edge locations.
– Directional DCTs [4] have been proposed to improve efficiency for directional edges, but they are not efficient for vertical, horizontal, and nondirectional edges.
– Coding the entire prediction error signal may not give the best RD tradeoff, e.g., SKIP mode.
[4] B. Zeng and J. Fu, “Directional discrete cosine transforms: A new framework for image coding,” IEEE Trans. Circuits, Syst. Video Technol., vol. 18, no. 3, pp. 305–313, Mar

Rate-Distortion
– The classical method of making encoding decisions is for the video encoder to choose the result that yields the highest-quality output image. The disadvantage is that this choice might require more bits while giving comparatively little quality benefit.
– A common example is motion estimation, in particular quarter-pixel-precision motion estimation. Adding the extra precision to the motion of a block might increase quality, but in some cases that extra quality is not worth the extra bits needed to encode the motion vector at a higher precision.

Introduction (cont.)
Basic idea of SVT:
– Do not restrict transform coding to regular block boundaries.
– That is, select and code the best portion of the prediction error to improve coding efficiency in terms of the RD tradeoff.
– SVT can be regarded as a special SKIP mode in which part of the macroblock is skipped (not coded into the bitstream).

Introduction (cont.)
Shifting the transform has been used in denoising [6]–[9].
– It is often used in post-processing (e.g., in-loop filtering).
– It has a bad effect when applied at boundaries and to small areas (e.g., a macroblock).
[6] A. Nosratinia, “Denoising JPEG images by re-application of JPEG,” in Proc. IEEE Workshop MMSP, Dec. 1998, pp. 611–615.
[7] R. Samadani, A. Sundararajan, and A. Said, “Deringing and deblocking DCT compression artifacts with efficient shifted transforms,” in Proc. IEEE ICIP, Oct. 2004, pp. 1799–1802.
[8] J. Katto, J. Suzuki, S. Itagaki, S. Sakaida, and K. Iguchi, “Denoising intra-coded moving pictures using motion estimation and pixel shift,” in Proc. IEEE ICASSP, Mar. 2008, pp. 1393–1396.
[9] O. G. Guleryuz, “Weighted averaging for denoising with overcomplete dictionaries,” IEEE Trans. Image Process., vol. 16, no. 12, pp – 3034, Dec

Introduction (cont.)
The proposed method does not have the drawbacks mentioned above. The location parameter (LP) is coded in the bitstream so that the decoder can reconstruct the macroblock (MB).
Drawback:
– High encoding complexity, due to the brute-force search used to select the best LP.
– Solution: FSVT

SVT
Transform coding is widely used to decorrelate the prediction error and achieve high compression rates.
Drawbacks of traditional transform coding:
– If the prediction error at fixed locations has a structure that does not suit the underlying transform, many high-frequency transform coefficients are generated, which take more bits to code.
– Notorious visual artifacts (e.g., ringing) may appear when these coefficients are quantized.

SVT (cont.)
What is new in SVT:
– Transform coding is not restricted to regular block boundaries; it can be applied to any portion of the prediction error.
– The selection is limited in order to reduce complexity.
This means that the position and shape of the transform block are variable, and this information (shape and position) is signaled to the decoder.

SVT (cont.)
Three issues of SVT:
– Selection of SVT block size
– Selection and coding of candidate LPs
– Filtering of SVT block boundaries

Selection of SVT Block-Size
M*N SVT is applied to a selected M*N block inside a macroblock (size 16*16), and ONLY THIS BLOCK IS TRANSFORM CODED. There are (17-M)*(17-N) possible LPs.
Factors in choosing M and N:
– Larger M and N result in fewer possible LPs.
– Larger M and N result in lower distortion but require more bits to code the transform coefficients.
– A larger block-size transform is more suitable for flat areas, and a smaller one is more suitable for sharp edges.
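The LP count above can be checked with a short sketch. The function names, and the convention that M is the block width and N the block height, are assumptions made for illustration and are not taken from the slides.

```python
def count_location_parameters(m, n, mb_size=16):
    """Number of positions at which an m-by-n block fits inside the macroblock."""
    if not (1 <= m <= mb_size and 1 <= n <= mb_size):
        raise ValueError("block must fit inside the macroblock")
    # Equals (17 - M) * (17 - N) for a 16*16 macroblock.
    return (mb_size + 1 - m) * (mb_size + 1 - n)


def enumerate_location_parameters(m, n, mb_size=16):
    """List every top-left offset (dx, dy) of an m-wide, n-high block (assumed convention)."""
    return [(dx, dy)
            for dy in range(mb_size - n + 1)
            for dx in range(mb_size - m + 1)]


for m, n in [(8, 8), (4, 16), (16, 4)]:
    print(f"{m}*{n}: {count_location_parameters(m, n)} possible LPs")
```

For the three nonzero block sizes used in these slides this gives 81, 13, and 13 positions, which illustrates the first factor: larger blocks leave fewer positions to search and to signal.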

Selection of SVT Block-Size (cont.)
To facilitate the transform design, M = 2^m and N = 2^n.
Four SVT block sizes are used in this chapter: 8*8, 4*16, 16*4, and 0*0 (which means SKIP mode).
The block sizes can be changed for different sequences to obtain better performance.

Selection of SVT Block Size (cont.)
Given the well-established variable block-size transform (VBT), variable block-size SVT is better than fixed block-size SVT.
Drawback of using different block sizes:
– As the number of SVT block sizes grows, more bits are needed to code the LPs.

Selection of SVT Block Size (cont.)
As mentioned before, VBT can be used for SVT.
– For 8*8 SVT, the transform kernel in H.264 can be used.
– For 4*16 and 16*4 SVT, the 4*4 transform kernel in H.264 and the 16*16 transform kernel in [14] can be used, with the butterfly structure of 8*8.
[14] S. Ma and C.-C. Kuo, “High-definition video coding with supermacroblocks,” in Proc. SPIE Vis. Commun. Image Process., vol. 6508, Jan. 2007, pp. 1–12.

Selection and Coding of Candidate LPs
When the SVT has nonzero transform coefficients, its location needs to be coded and transmitted.
The best LP is selected according to RDO (rate-distortion optimization) [15].
[15] T. Wiegand, H. Schwarz, A. Joch, F. Kossentini, and G. J. Sullivan, “Rate-constrained coder control and comparison of video coding standards,” IEEE Trans. Circuits, Syst. Video Technol., vol. 13, no. 7, pp.688–703, Jul
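As a rough illustration of RDO-based LP selection, the sketch below assumes the usual Lagrangian cost J = D + λR. The function names, the candidate tuples, and all distortion and rate numbers are made up for the example; a real encoder would obtain D and R by actually transforming, quantizing, and entropy coding each candidate.

```python
def rd_cost(distortion, rate_bits, lam):
    """Lagrangian rate-distortion cost J = D + lambda * R (assumed cost model)."""
    return distortion + lam * rate_bits


def select_best_lp(candidates, lam):
    """Return (lp, cost) of the candidate with the lowest RD cost.

    candidates: iterable of (lp, distortion, rate_bits) tuples (hypothetical layout).
    """
    best = min(candidates, key=lambda c: rd_cost(c[1], c[2], lam))
    return best[0], rd_cost(best[1], best[2], lam)


# Made-up measurements for three candidate positions (dx, dy).
candidates = [
    ((0, 0), 120.0, 40),
    ((4, 4), 100.0, 52),
    ((8, 8), 150.0, 30),
]

print(select_best_lp(candidates, lam=2.0))
print(select_best_lp(candidates, lam=0.5))
```

With these numbers the larger multiplier picks (0, 0) and the smaller one picks (4, 4): as λ decreases, bits become cheaper relative to distortion, so the lower-distortion candidate wins.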

Selection and Coding of Candidate LPs (cont.)

As mentioned before, a 6-bit fixed-length code is needed to represent the LP index. In the chroma case:

Filtering of SVT Block Boundaries
When SVT is used, the deblocking process needs to be adjusted because the selected SVT block may not align with the regular block boundaries.
Both the edges of the selected SVT block and the edges of the macroblock may be filtered.

Filtering of SVT Block Boundaries (cont.)

Implementing SVT in H.264/AVC

Implementing SVT in H.264/AVC (cont.)
Several key parts of H.264/AVC need to be adjusted:
– Macroblock types
– Coded block pattern
– Entropy coding
– Deblocking

Macroblock Type

Coded Block Pattern
In experiments, the luma CBP is often equal to 1 in high-fidelity video coding.
Based on this observation, the new macroblock modes are defined to have luma CBP equal to 1.

Entropy Coding
In H.264, CAVLC uses a different coding table depending on the number of nonzero coefficients.
For SVT, a fixed coding table is used.
To derive some information about the number of nonzero coefficients in each 4*4 luma block, the following two steps are used:

Entropy Coding (cont.)
Step 1: If a luma block overlaps with a coded block that has nonzero coefficients in the selected SVT block, mark it as having nonzero coefficients. (This marking is also used for deblocking.)
Step 2: The number of nonzero transform coefficients for each 4*4 block is set by an empirical formula. Finally, the total number of nonzero transform coefficients is distributed among the blocks marked as having nonzero coefficients.

Deblocking
As described above in the SVT chapter (Filtering of SVT Block Boundaries).

FSVT (Fast Algorithms for SVT)
The encoding complexity of SVT is higher because of the brute-force search process in RDO. Typically, RDO requires transform, quantization, entropy coding, etc., to be carried out for each candidate.
The basic idea for reducing the encoding complexity is to reduce the number of LPs that are tested.

FSVT (cont.)
There are two cases:
– 1. Skip testing SVT for macroblocks for which SVT is unlikely to be useful (decided by examining the RD cost).
– 2. The proposed fast algorithm selects LPs based on the motion difference and uses a hierarchical search algorithm to select the best LP.

Macroblock Level Fast Algorithm
SVT is applied to a macroblock mode only if a threshold condition relating J_mode to J holds, where J is the minimum RD cost without SVT coding and J_mode is the RD cost of the current macroblock mode to be tested with SVT.

Macroblock Level Fast Algorithm (cont.)
The threshold represents an empirical upper limit on the bitrate reduction.

Block-Level Fast Algorithm
1. Selection of available candidate LPs based on motion difference
2. Hierarchical search algorithm

Selection of Available Candidate LPs
Skip testing a candidate LP if one of the following conditions is true:
– 1. The SVT block at that position overlaps with at least two neighboring motion compensation blocks, and the difference between the motion vectors of these blocks is larger than or equal to a predefined threshold.
– 2. The reference frames of these neighboring blocks are different.
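The availability test can be sketched as a small predicate. This is only an illustration under stated assumptions: the dictionary layout, the function name, and the choice of measuring motion difference as the largest absolute component difference between any two motion vectors are hypothetical and are not specified in the slides.

```python
from itertools import combinations


def should_skip_candidate(overlapped_blocks, mv_threshold):
    """Decide whether a candidate LP can be skipped.

    overlapped_blocks: motion compensation blocks covered by the SVT block at
    this position, each a dict {"mv": (mvx, mvy), "ref": reference_frame_index}.
    """
    # Both conditions only apply when at least two blocks are overlapped.
    if len(overlapped_blocks) < 2:
        return False
    # Condition 2: the overlapped blocks use different reference frames.
    if len({block["ref"] for block in overlapped_blocks}) > 1:
        return True
    # Condition 1: motion difference reaches the threshold
    # (max absolute component difference is an assumed metric).
    for a, b in combinations(overlapped_blocks, 2):
        diff = max(abs(a["mv"][0] - b["mv"][0]), abs(a["mv"][1] - b["mv"][1]))
        if diff >= mv_threshold:
            return True
    return False


similar = [{"mv": (4, 0), "ref": 0}, {"mv": (5, 0), "ref": 0}]
different = [{"mv": (4, 0), "ref": 0}, {"mv": (20, -8), "ref": 0}]
print(should_skip_candidate(similar, mv_threshold=8))
print(should_skip_candidate(different, mv_threshold=8))
```

The intuition is that an SVT block straddling partitions with inconsistent motion or different references is unlikely to contain a prediction error that one transform block codes well, so its RD cost is not worth evaluating.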

Hierarchical Search Algorithm
Idea: find the best LP at a relatively coarse resolution, then refine the result at a finer resolution.
Step 1: Take the LP with the lowest RD cost as set1 and its two neighbors as set2.
Step 2: Find the best zone. A zone is available if and only if all three of its candidate LPs are available.
Step 3: Select the best LP from set1, set2, and the best zone.
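The coarse-then-fine idea can be illustrated with a generic two-stage search over (dx, dy) offsets. This sketch does not reproduce the set1/set2/zone bookkeeping of the steps above; the function name, the step size, the refinement radius, and the toy cost function are all assumptions chosen for the example.

```python
def coarse_to_fine_search(cost, max_dx, max_dy, coarse_step=4):
    """Generic two-stage search: sample a coarse grid, then refine near its best point."""
    # Stage 1: evaluate the cost only on a sparse grid of offsets.
    coarse = [(dx, dy)
              for dy in range(0, max_dy + 1, coarse_step)
              for dx in range(0, max_dx + 1, coarse_step)]
    best = min(coarse, key=cost)
    # Stage 2: evaluate every offset in a small window around the coarse winner.
    radius = coarse_step // 2
    fine = [(dx, dy)
            for dy in range(max(0, best[1] - radius), min(max_dy, best[1] + radius) + 1)
            for dx in range(max(0, best[0] - radius), min(max_dx, best[0] + radius) + 1)]
    return min(fine, key=cost)


calls = []


def toy_cost(lp):
    """Stand-in for an RD cost, with its minimum at offset (5, 2)."""
    calls.append(lp)
    return (lp[0] - 5) ** 2 + (lp[1] - 2) ** 2


best_lp = coarse_to_fine_search(toy_cost, max_dx=8, max_dy=8)
print(best_lp, "found with", len(calls), "cost evaluations out of 81 positions")
```

On a smooth cost surface like this toy one, the search reaches the true minimum with far fewer evaluations than the exhaustive 81. On an irregular RD cost surface it can miss the global optimum, which is the usual price of a fast search.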

Experimental Environment
VBSVT and FVBSVT are evaluated in both HD and lower-resolution video coding.
Some of the coding parameters used:
– High Profile
– QP_I = 22, 27, 32, 37; QP_P = QP_I + 1
– CAVLC/CABAC
– Frame structure IPPP
– MV search range 64/32 pixels for 720p/CIF
– RDO in the high complexity mode

Experimental Environment (cont.)
Intel® Core™2 Quad CPU 2G
The average bitrate reduction compared to H.264/AVC is measured using the Bjontegaard tool [20].
Two configurations are tested:
– Low complexity configuration: the 4*4 transform is not used.
– High complexity configuration: the codec makes full use of the tools provided in H.264.

Experimental Result

Experimental Result (cont.)

Conclusion
By varying the position and size of the transform block, the prediction error is better localized and coding efficiency is improved.
The encoding complexity of SVT is relatively high because of the brute-force search in RDO.
To address this, FSVT is proposed, which skips SVT testing for most macroblocks that are not suited to SVT.