Fast Mode Decision for H.264/AVC Based on Rate-Distortion Clustering IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 14, NO. 3, JUNE 2012 Yu-Huan Sung Jia-Ching.

Slides:



Advertisements
Similar presentations
Packet Video Error Concealment With Auto Regressive Model Yongbing Zhang, Xinguang Xiang, Debin Zhao, Siwe Ma, Student Member, IEEE, and Wen Gao, Fellow,
Advertisements

Adaptive Fast Block-Matching Algorithm by Switching Search Patterns for Sequences With Wide-Range Motion Content 韋弘
-1/20- MPEG 4, H.264 Compression Standards Presented by Dukhyun Chang
Concealment of Whole-Picture Loss in Hierarchical B-Picture Scalable Video Coding IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 11, NO. 1, JANUARY 2009 Xiangyang.
A Highly Parallel Framework for HEVC Coding Unit Partitioning Tree Decision on Many-core Processors Chenggang Yan, Yongdong Zhang, Jizheng Xu, Feng Dai,
An Early Block Type Decision Method for Intra Prediction in H.264/AVC Jungho Do, Sangkwon Na and Chong-Min Kyung VLSI Systems Lab. Korea Advanced Institute.
Efficient Bit Allocation and CTU level Rate Control for HEVC Picture Coding Symposium, 2013, IEEE Junjun Si, Siwei Ma, Wen Gao Insitute of Digital Media,
H.264/AVC Baseline Profile Decoder Complexity Analysis Michael Horowitz, Anthony Joch, Faouzi Kossentini, and Antti Hallapuro IEEE TRANSACTIONS ON CIRCUITS.
1 Adaptive slice-level parallelism for H.264/AVC encoding using pre macroblock mode selection Bongsoo Jung, Byeungwoo Jeon Journal of Visual Communication.
K.-S. Choi and S.-J. Ko Sch. of Electr. Eng., Korea Univ., Seoul, South Korea IEEE, Electronics Letters Issue Date : June Hierarchical Motion Estimation.
{ Fast Disparity Estimation Using Spatio- temporal Correlation of Disparity Field for Multiview Video Coding Wei Zhu, Xiang Tian, Fan Zhou and Yaowu Chen.
An Improved 3DRS Algorithm for Video De-interlacing Songnan Li, Jianguo Du, Debin Zhao, Qian Huang, Wen Gao in IEEE Proc. Picture Coding Symposium (PCS),
Limin Liu, Member, IEEE Zhen Li, Member, IEEE Edward J. Delp, Fellow, IEEE CSVT 2009.
Fast Mode Decision for Multiview Video Coding Liquan Shen, Tao Yan, Zhi Liu, Zhaoyang Zhang, Ping An, Lei Yang ICIP
CABAC Based Bit Estimation for Fast H.264 RD Optimization Decision
Wei Zhu, Xiang Tian, Fan Zhou and Yaowu Chen IEEE TCE, 2010.
Ai-Mei Huang and Truong Nguyen Video Processing LabECE Dept, UCSD, La Jolla, CA This paper appears in: Image Processing, ICIP IEEE International.
Highly Parallel Rate-Distortion Optimized Intra-Mode Decision on Multicore Graphics Processors Ngai-Man Cheung, Oscar C. Au, Senior Member, IEEE, Man-Cheung.
FAST MACROBLOCK MODE SELECTION BASED ON MOTION CONTENT CLASSIFICATION IN H.264/AVC Ming Yang, Wensheng Wang ICIP 2004.
Outline Introduction Introduction Fast Inter Prediction Mode Decision for H.264 – –Pre-encoding An Efficient Inter Mode Decision Approach for H.264 Video.
Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.
Low-complexity mode decision for MVC Liquan Shen, Zhi Liu, Ping An, Ran Ma and Zhaoyang Zhang CSVT
1 Single Reference Frame Multiple Current Macroblocks Scheme for Multiple Reference IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY Tung-Chien.
Fast Mode Decision And Motion Estimation For JVT/H.264 Pen Yin, Hye – Yeon Cheong Tourapis, Alexis Michael Tourapis and Jill Boyce IEEE ICIP 2003 Sep.
Decision Trees for Error Concealment in Video Decoding Song Cen and Pamela C. Cosman, Senior Member, IEEE IEEE TRANSACTION ON MULTIMEDIA, VOL. 5, NO. 1,
Scalable Wavelet Video Coding Using Aliasing- Reduced Hierarchical Motion Compensation Xuguang Yang, Member, IEEE, and Kannan Ramchandran, Member, IEEE.
Multi-Frame Reference in H.264/AVC 卓傳育. Outline Introduction to Multi-Frame Reference in H.264/AVC Multi-Frame Reference Problem Two papers propose to.
1 An Efficient Mode Decision Algorithm for H.264/AVC Encoding Optimization IEEE TRANSACTION ON MULTIMEDIA Hanli Wang, Student Member, IEEE, Sam Kwong,
Feature-Based Intra-/InterCoding Mode Selection for H.264/AVC C. Kim and C.-C. Jay Kuo CSVT, April 2007.
Source-Channel Prediction in Error Resilient Video Coding Hua Yang and Kenneth Rose Signal Compression Laboratory ECE Department University of California,
Scalable Rate Control for MPEG-4 Video Hung-Ju Lee, Member, IEEE, Tihao Chiang, Senior Member, IEEE, and Ya-Qin Zhang, Fellow, IEEE IEEE TRANSACTIONS ON.
A New Rate-Complexity-QP Algorithm for HEVC Intra-Picture Rate Control LING TIAN, YIMIN ZHOU, AND XIAOJUN CAO 2014 INTERNATIONAL CONFERENCE ON COMPUTING,
BIN LI, HOUQIAN LI, LI LI, AND JINLEI ZHANG IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL.23, NO.9, SEPTEMBER
Block Partitioning Structure in the HEVC Standard
BY AMRUTA KULKARNI STUDENT ID : UNDER SUPERVISION OF DR. K.R. RAO Complexity Reduction Algorithm for Intra Mode Selection in H.264/AVC Video.
An Introduction to H.264/AVC and 3D Video Coding.
HARDEEPSINH JADEJA UTA ID: What is Transcoding The operation of converting video in one format to another format. It is the ability to take.
Liquan Shen Zhi Liu Xinpeng Zhang Wenqiang Zhao Zhaoyang Zhang An Effective CU Size Decision Method for HEVC Encoders IEEE TRANSACTIONS ON MULTIMEDIA,
Low complexity H.264 Encoder using machine learning.
1 Efficient Reference Frame Selector for H.264 Tien-Ying Kuo, Hsin-Ju Lu IEEE CSVT 2008.
Adaptive Multi-path Prediction for Error Resilient H.264 Coding Xiaosong Zhou, C.-C. Jay Kuo University of Southern California Multimedia Signal Processing.
Sadaf Ahamed G/4G Cellular Telephony Figure 1.Typical situation on 3G/4G cellular telephony [8]
Video Compression Standards for High Definition Video : A Comparative Study Of H.264, Dirac pro And AVS P2 By Sudeep Gangavati EE5359 Spring 2012, UT Arlington.
Low-Power H.264 Video Compression Architecture for Mobile Communication Student: Tai-Jung Huang Advisor: Jar-Ferr Yang Teacher: Jenn-Jier Lien.
H.264/AVC 基於影像複雜度與提早結束之快速 階層運動估計方法 Content-Based Hierarchical Fast Motion Estimation with Early Termination in H.264/AVC 研究生:何銘哲 指導教授:蔣依吾博士 中山大學資訊工程學系.
2 3 Be introduced in H.264 FRExt profile, but most H.264 profiles do not support it. Do not need motion estimation operation.
Rate-GOP Based Rate Control for HEVC SHANSHE WANG, SIWEI MA, SHIQI WANG, DEBIN ZHAO, AND WEN GAO IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING,
- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison between H.264.
Figure 1.a AVS China encoder [3] Video Bit stream.
-BY KUSHAL KUNIGAL UNDER GUIDANCE OF DR. K.R.RAO. SPRING 2011, ELECTRICAL ENGINEERING DEPARTMENT, UNIVERSITY OF TEXAS AT ARLINGTON FPGA Implementation.
Guillaume Laroche, Joel Jung, Beatrice Pesquet-Popescu CSVT
Computational Complexity Management of a Real-Time H.264/AVC Encoder C S Kannangara, I E Richardson, and A J Miller CSVT
Fast motion estimation and mode decision for H.264 video coding in packet loss environment Li Liu, Xinhua Zhuang Computer Science Department, University.
An efficient Video Coding using Phase-matched Error from Phase Correlation Information Manoranjan Paul 1 and Golam Sorwar IEEE.
Video Coding Using Spatially Varying Transform Cixun Zhang, Kermal Ugur, Jani Lainema, Antti Hallapuro and Moncef IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM.
Vamsi Krishna Vegunta University of Texas, Arlington
IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者:劉冠宇.
A hybrid error concealment scheme for MPEG-2 video transmission based on best neighborhood matching algorithm Li-Wei Kang and Jin-Jang Leou Journal of.
Outline  Introduction  Observations and analysis  Proposed algorithm  Experimental results 2.
Shen-Chuan Tai, Chien-Shiang Hong, Cheng-An Fu National Cheng Kung University, Tainan City,Taiwan (R.O.C.),DCMC Lab Pacific-Rim Symposium on Image and.
Implementation and comparison study of H.264 and AVS china EE 5359 Multimedia Processing Spring 2012 Guidance : Prof K R Rao Pavan Kumar Reddy Gajjala.
Fine-granular Motion Matching for Inter-view Motion Skip Mode in Multi-view Video Coding Haitao Yanh, Yilin Chang, Junyan Huo CSVT.
Fast disparity motion estimation in MVC based on range prediction Xiao Zhong Xu, Yun He ICIP 2008.
E ARLY TERMINATION FOR TZ SEARCH IN HEVC MOTION ESTIMATION PRESENTED BY: Rajath Shivananda ( ) 1 EE 5359 Multimedia Processing Individual Project.
Computational Controlled Mode Selection for H.264/AVC June Computational Controlled Mode Selection for H.264/AVC Ariel Kit & Amir Nusboim Supervised.
Adaptive Block Coding Order for Intra Prediction in HEVC
Automatic Video Shot Detection from MPEG Bit Stream
Early termination for tz search in hevc motion estimation
Fast Decision of Block size, Prediction Mode and Intra Block for H
Bongsoo Jung, Byeungwoo Jeon
Presentation transcript:

Fast Mode Decision for H.264/AVC Based on Rate-Distortion Clustering IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 14, NO. 3, JUNE 2012 Yu-Huan Sung Jia-Ching Wang, Senior Member, IEEE

Outline Introduction Related Works Feature Selection Proposed Fast Mode Decision Experiment Results Conclusion

Introduction The up-to-date video coding standard H.264/AVC – twice the compression ratio of other video coding standards. – maintaining nearly the same visual equality. However, an extremely high computational complexity is a tradeoff of the performance gains. – Video conferencing – Live TV broadcasting – Mobile computing

Introduction H.264/AVC adopts many features that can enhance coding performance. – Variable block-size MC – Sub-pixel ME – Multiple reference pictures selection – Directional intra prediction – In-the-loop de-blocking filtering, etc. The features incur a heavy burden during the encoding process.

Introduction Reducing the computational time has received considerable attention recently. Reducing the encoding time involves two main parts : 1.Inter-mode decision 2.Intra-mode decision according to a RD cost optimization scheme.

Introduction The proposed method presents a Multi-Phase Classification (MPC) scheme – use a nearest mean criterion. – determine inter-modes and intra-modes. MPC is a hierarchical classification scheme that allows an MB to be classified into a category phase by phase.

Introduction The MPC presents a three-phase classification scheme. – a phase consists of several categories. – partition from current phase into next phase. – categories are the sub-sets of the upper phase. Each category within a phase is represented as a feature point in the feature space. – assign an MB to a category with the minimum distance.

Outline Introduction Related Works Feature Selection Proposed Method Experiment Results Conclusion

Related Works Four ways to develop the fast mode decision algorithm in previous works. The first approach is SIKP-mode detection – early identified if an MB can be skipped. – Kannangara et al. [3] and Zhao et al. [4]. [3] C. Kannangara et al., “Low-complexity skip prediction for H.264 through Lagrangian cost estimation,” IEEE Trans. Circuits Syst. Video Technol., vol. 16, no. 2, pp. 202–208, Feb [4] Y. Zhao, M. Bystrom, and I. E. G. Richardson, “A MAP frame work for efficient skip/code mode decision in H.264,” in Proc. ICIP2006, Atlanta, GA, Oct. 8–11, 2006.

Related Works The second approach is mode prediction – directly or indirectly predict the best mode for the current MB. – Wu et al. [5], Ri et al. [6] and Paul et al. [17]. The third approach is mode classification – classifies the current MB into a specific category. – the corresponding candidate modes will be checked to find the best. – Kim et al. [7], Yu et al. [8], Liu et al. [9], Zeng et al. [10] and Zhao et al. [11]. [5] D.Wu, F. Pan, K. P. Lim, and S.Wu et al., “Fast intermode decision in H.264/AVC video coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 15, no. 7, pp. 953–958, Jul [6] S. H. Ri, Y. Vatis, and J. Ostermann, “Fast inter-mode decision in an H.264/AVC encoder using mode and Lagrangian cost correlation,” IEEE Trans. Circuits Syst. Video Technol., vol. 19, no. 2, pp. 302–306, Feb [17] M. Paul,W. Lin, C. T. Lau, and B. S. Lee, “Direct inter-mode selection for H.264 video coding using phase correlation,” IEEE Trans. Image Process., vol. 20, no. 2, pp. 461–473, Feb

Related Works The last approach redefines the optimization cost function – number of operations needed for mode selection can be reduced. [7] C. Kim and C. C. Jay Kuo, “Feature-based intra-/inter coding mode selection for H.264/AVC,” IEEE Trans. Circuits and Syst. Video Technol., vol. 17, no. 4, pp. 441–453, Apr [8] A. C. W. Yu, G. R. Martin, and H. Park, “Fast inter-mode selection in the H.264/AVC standard using a hierarchical decision process,” IEEE Trans. Circuits Syst. Video Technol., vol. 18, no. 2, pp. 186–195, Feb [9] Z. Liu, L. Shen, and Z. Zhang, “An efficient intermode decision algorithm based on motion homogeneity for H.264/AVC,” IEEE Trans. Circuits Syst. Video Technol., vol. 19, no. 1, pp. 128–132, Jan [10] H. Zeng, C. Cai, and K.-K. Ma, “Fast mode decision for H.264/AVC based on macro block motion activity,” IEEE Trans. Circuits Syst. Video Technol., vol. 19, no. 4, pp. 491–499, Apr [11] T. Zhao, H.Wang, S. Kwong, and C.-C. Jay Kuo, “Fast mode decision based on mode adaptation,” IEEE Trans. Circuits Syst. Video Technol., vol. 20, no. 5, pp. 697–705, May 2010.

Outline Introduction Related Works Feature Selection – Feature Vector – Feature Space and Classifier Proposed Fast Mode Decision Experiment Results Conclusion

Feature Vector There is a strong correlation of RD cost between the best mode and the temporal-spatial modes. A three-dimensional feature vector that comprises RD costs of neighboring MBs is used to discriminate between the different modes for mode decision.

Feature Vector RD costs range to various extents under different coding modes and motion contents and should not be directly used as a universal criterion. Using a three-dimensional feature vector – ensure that an MB can be assigned to the most probable category accurately. – adapt to the variable motion contents of various video sequences properly.

Feature Vector The three components of a feature vector, f skip, f spat, and f temp, are expressed as :

Flowchart of Initialization

Feature Vector RD cost is expressed as :

Feature Space and Classifier The 3D feature space

Feature Space and Classifier Feature Space and Voronoi Diagram f temp f skip

Outline Introduction Related Works Feature Selection Proposed Fast Mode Decision Experiment Results Conclusion

Fast Mode Decision Nearest Mean Criterion – assign MBs into a specific category. – classify MBs by using Euclidean distance. – predict the best mode of an MB by finding a mean M i (cluster center).

Fast Mode Decision Category Organization – directly assigning the mode with minimum distance to the given MB. unsatisfactory prediction accuracy. – grouping modes with similar characteristics into a category. reducing the probability of a false prediction.

Fast Mode Decision

Flowchart of Phases

Fast Mode Decision Mode decision process can be further accelerated by Early Termination. – activated => if the f skip is below a specific threshold. – SKIP mode is the best mode. Initial threshold is set to be the average RD costs of SKIP- MBs in the training sequences, and will be dynamically updated according to :

Flowchart of Early Termination......

Error Propagation and Performance Degradation Control A performance control process is incorporated into the proposed method. Avoid serious performance degradation caused by repeated use of wrongly predicted results or accidental false predictions. The idea is providing an inspection for the coding result of each MB produced from the fast mode decision algorithm.

Error Propagation and Performance Degradation Control An adaptive RD cost inspection is proposed and all it needs have been gained already. – temporal RD costs – spatial RD costs A fast mode decision is made and the corresponding RD cost is obtained, an inspection is performed by :

Flowchart of Inspection

Outline Introduction Related Works Feature Selection Proposed Fast Mode Decision Experiment Results Conclusion

Training and Test Conditions The means of each category and the related statistics are generated by JM17.0 [15]. Ten video sequences are Silent, Ice, Hall, Highway, Miss- America, Carphone, Tempete, Soccer, Bus, and Table Tennis. Video format is QCIF-format. QP values are 20, 24, 28, 32, and 36. Two GOP structures (IPPP and IBBP) are used for the training purpose.

Training and Test Conditions The number of frames to be encoded is set to 100. The search range of motion estimation is 16, and the search strategy is full search. The number of reference frames is 1, and the intra-period is set to 4.

Performance of Used Mode

Performance Comparisons (1/2)

Performance Comparisons (2/2)

Performance Comparisons with GOP size

RD Curves

Outline Introduction Related Works Feature Selection Proposed Fast Mode Decision Experiment Results Conclusion

Experimental results indicate that the quality loss and bitrate increasing are only 0.02 dB and 1.65%, respectively. Reducing 67.5% encoding time on average among the 12 video sequences of different GOP structures. Encompass a wide variety of motion contents and different resolutions.