Overview of H.264 Video Coding

Slides:



Advertisements
Similar presentations
March 24, 2004 Will H.264 Live Up to the Promise of MPEG-4 ? Vide / SURA March Marshall Eubanks Chief Technology Officer.
Advertisements

with RGB Reversibility
AVC Compression Update: FRExt and future Matthew Goldman Vice President of Technology Compression Systems.
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
MPEG-2 to H.264/AVC Transcoding Techniques Jun Xin Xilient Inc. Cupertino, CA.
H.264 Intra Frame Coder System Design Özgür Taşdizen Microelectronics Program at Sabanci University 4/8/2005.
MPEG4 Natural Video Coding Functionalities: –Coding of arbitrary shaped objects –Efficient compression of video and images over wide range of bit rates.
A Performance Analysis of the ITU-T Draft H.26L Video Coding Standard Anthony Joch, Faouzi Kossentini, Panos Nasiopoulos Packetvideo Workshop 2002 Department.
Basics of MPEG Picture sizes: up to 4095 x 4095 Most algorithms are for the CCIR 601 format for video frames Y-Cb-Cr color space NTSC: 525 lines per frame.
Overview of the H. 264/AVC video coding standard.
-1/20- MPEG 4, H.264 Compression Standards Presented by Dukhyun Chang
Chapter 11.3 MPEG-2 MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications: Simple,
Technion - IIT Dept. of Electrical Engineering Signal and Image Processing lab Transrating and Transcoding of Coded Video Signals David Malah Ran Bar-Sella.
1 Video Coding Concept Kai-Chao Yang. 2 Video Sequence and Picture Video sequence Large amount of temporal redundancy Intra Picture/VOP/Slice (I-Picture)
Implementation and Study of Unified Loop Filter in H.264 EE 5359 Multimedia Processing Spring 2012 Guidance : Prof K R Rao Pavan Kumar Reddy Gajjala
Source Coding for Video Application
SWE 423: Multimedia Systems
Ch. 6- H.264/AVC Part I (pp.160~199) Sheng-kai Lin
Overview of the Scalable Video Coding Extension of the H
Overview of the H.264/AVC Video Coding Standard
H.264/Advanced Video Coding – A New Standard Song Jiqiang Oct 21, 2003.
Overview of AVS Video Standard Kai-Chao Yang. Outline Audio Video Coding Standard (AVS) Audio Video Coding Standard (AVS) AVS Schedule AVS Schedule AVS.
H.264 / MPEG-4 Part 10 Nimrod Peleg March 2003.
Overview and Introduction to H.264/AVC Fidelity Range Extensions
BY AMRUTA KULKARNI STUDENT ID : UNDER SUPERVISION OF DR. K.R. RAO Complexity Reduction Algorithm for Intra Mode Selection in H.264/AVC Video.
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 13 – H.264 (Part 8) Klara Nahrstedt Spring 2012.
Concepts of Multimedia Processing and Transmission
H.264/AVC.
1 Image and Video Compression: An Overview Jayanta Mukhopadhyay Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur,
An Introduction to H.264/AVC and 3D Video Coding.
Topic: Advanced Video Coding Standard (Comparison of HEVC with H.264 and H.264 with MPEG-2) A PROJECT UNDER THE GUIDANCE OF DR. K. R. RAO COURSE: EE5359.
MPEG-2 Digital Video Coding Standard
EE 5359 H.264 to VC 1 Transcoding Vidhya Vijayakumar Multimedia Processing Lab MSEE, University of Arlington Guided.
Moving PicturestMyn1 Moving Pictures MPEG, Motion Picture Experts Group MPEG is a set of standards designed to support ”Coding of Moving Pictures and Associated.
Image Processing Architecture, © Oleh TretiakPage 1Lecture 9 ECEC-453 Image Processing Architecture Lecture 9, 2/12/ 2004 MPEG 1 Oleh Tretiak.
Page 19/15/2015 CSE 40373/60373: Multimedia Systems 11.1 MPEG 1 and 2  MPEG: Moving Pictures Experts Group for the development of digital video  It is.
CS 414 – Multimedia Systems Design Lecture 14 – H.264, H.265
Audio Video coding Standard of (AVS) China Submitted by, Swaminathan Sridhar EE 5359 Multimedia Processing Project.
MPEG-1 and MPEG-2 Digital Video Coding Standards Author: Thomas Sikora Presenter: Chaojun Liang.
MPEG Motion Picture Expert Group Moving Picture Encoded Group Prateek raj gautam(725/09)
Profiles and levelstMyn1 Profiles and levels MPEG-2 is intended to be generic, supporting a diverse range of applications Different algorithmic elements.
Windows Media Video 9 Tarun Bhatia Multimedia Processing Lab University Of Texas at Arlington 11/05/04.
CMPT365 Multimedia Systems 1 Media Compression - Video Coding Standards Spring 2015 CMPT 365 Multimedia Systems.
Vineeth Shetty Kolkeri University of Texas, Arlington
Outline JVT/H.26L: History, Goals, Applications, Structure
By, ( ) Low Complexity Rate Control for VC-1 to H.264 Transcoding.
- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison of H.264/MPEG4.
Video Compression Standards for High Definition Video : A Comparative Study Of H.264, Dirac pro And AVS P2 By Sudeep Gangavati EE5359 Spring 2012, UT Arlington.
EE 5359 TOPICS IN SIGNAL PROCESSING PROJECT ANALYSIS OF AVS-M FOR LOW PICTURE RESOLUTION MOBILE APPLICATIONS Under Guidance of: Dr. K. R. Rao Dept. of.
Compression video overview 演講者:林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.
- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison between H.264.
Image Processing Architecture, © 2001, 2002 Oleh TretiakPage 1Lecture 15 ECEC-453 Image Processing Architecture 3/11/2004 Exam Review Oleh Tretiak Drexel.
Fundamentals of Multimedia Chapter 12 MPEG Video Coding II MPEG-4, 7 Ze-Nian Li & Mark S. Drew.
Figure 1.a AVS China encoder [3] Video Bit stream.
Vineeth Shetty Kolkeri University of Texas, Arlington
High-efficiency video coding: tools and complexity Oct
Vamsi Krishna Vegunta University of Texas, Arlington
UNDER THE GUIDANCE DR. K. R. RAO SUBMITTED BY SHAHEER AHMED ID : Encoding H.264 by Thread Level Parallelism.
Video Compression—From Concepts to the H.264/AVC Standard
Block-based coding Multimedia Systems and Standards S2 IF Telkom University.
V ENUS INTERNATIONAL COLLEGE OF TECHNOLOGY Guided by : Rinkal mam.
Implementation and comparison study of H.264 and AVS china EE 5359 Multimedia Processing Spring 2012 Guidance : Prof K R Rao Pavan Kumar Reddy Gajjala.
MPEG Video Coding I: MPEG-1 1. Overview  MPEG: Moving Pictures Experts Group, established in 1988 for the development of digital video.  It is appropriately.
H. 261 Video Compression Techniques 1. H.261  H.261: An earlier digital video compression standard, its principle of MC-based compression is retained.
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
CSI-447: Multimedia Systems
Research Topic Error Concealment Techniques in H.264/AVC for Wireless Video Transmission Vineeth Shetty Kolkeri EE Graduate,UTA.
Standards Presentation ECE 8873 – Data Compression and Modeling
MPEG4 Natural Video Coding
Overview and Introduction to H.264/AVC Fidelity Range Extensions
Presentation transcript:

Overview of H.264 Video Coding Trần Duy Trác ECE Department The Johns Hopkins University Baltimore, MD 21218

Outline Video coding standards H.264/ MPEG-4 AVC Challenging problems History Generic framework H.264/ MPEG-4 AVC Main features Key technical innovations Coding performance Profiles: basic, main and high profiles Challenging problems Applications and markets

History of Video Standards

ITU H.26x History ITU H.26L: “long-term” solution for low bit-rate video coding for communication apps Predecessors include H.261 (1990): “px64”, video conf. solution H.263 (1995): next conf. solution, used in H.323 H.263+, H.263++, follow-on solutions H.26L project dates back to early ’90s Call for formal proposals in January 1998 First draft in August 1999 Joining forces with MPEG: Dec. 2001 H.264 (H.26L) completed in May 2003

MPEG History MPEG-1 (1993) MPEG-2 (1994) MPEG-4 (1999 - ) AVC (2003) Video on CD (VCD) MPEG-2 (1994) DTV Broadcast, DVD, HD MPEG-4 (1999 - ) Cell phone, interactive, high rate communication Object-oriented Over-ambitious? AVC (2003) Conventional to HD Emphasis on compression performance and loss resilience

Generic Framework* Bitstream Prediction loop Video in Previous frame DCT, Q Entropy Coding + _ Bitstream Q-1, IDCT Prediction loop MC Next Frame ME Buffer Video in Previous frame * H.261, 263, 263+, MPEG-1/2/4

H.264 Video Coding Development history Main features Key compression techniques Tools Framework Performance Profiles Basic and main profiles High profile Other new profiles

Development History Dec 2001 – Start Dec 2002 – Tech freeze Joint Video Team (JVT) formed between ITU/MPEG Dec 2002 – Tech freeze May 2003 – ITU-T Rec. H.264 June 2003 – ISO/IEC final draft (FDIS) July 2003 – Launch of FRExt (Fidelity Range) extension project Oct 2003 – ISO/IEC (14496-10) AVC Dec 2003 – Verification tests by MPEG Jun 2004 – FRExt project is finalized Jan 2005 – Scalable Video Coding (SVC) project starts Jul 2006 – Multi-View Video Coding (MVC) project starts

Main Features High compression performance Exact match decoding Advanced compression tools Average 50% bit rate reduction given fixed fidelity compared to other standards Exact match decoding Integer transform Improved perceptual quality In-loop deblocking filter Network friendliness NAL (network abstraction layer) Enhanced error resilience

H.264 Technical Tools Structure Picture type: I, P, B, SI, SP Sequence ->GOP->Picture->Slice->MB->Block Picture type: I, P, B, SI, SP Frame structure: interlaced, progressive Adaptive frame/field: per picture, per MB Deblocking filter – in loop MV resolution – ¼ pixel Tree-like motion segmentation – 16x16 to 4x4 Entropy coding – CAVLC/CABAC Data partition – NAL unit, priority ASO (arbitrary slice order) – independently decodable FMO (flexible macroblock order) – map ABP (adaptive bi-prediction) – adaptive weighting

Block Diagram: H.264 Encoder Next Frame ME Buffer Intra Prediction Switch MC Loop Filter Q-1, IDCT DCT, Q Entropy Coding + _ Bitstream Motion Compensation Loop Prediction loop Video in

Innovation 1: Transform Quantization step size control is nonlinear: step size increases gradually by about 12% (double after 6 steps)

16 bit 4x4 DCT EXACT MATCH simplified transform 4x4 transform Non-orthonormality of the integer transform, i.e., position dependent scaling Requires only 16 bit arithmetic (including intermediate values) Expanded to 8x8 for Chroma by 2x2 transform of the DC values

Quantization Quantization of transform coefficients Logarithmic step size control Extended range of step sizes Smaller step size for chroma 16-bit multiply, add and shift Table-driven: 2 times in Qstep for every 6th increment in Qp

Innovation 2: Intra Prediction Directional spatial prediction (9 types for luma, 1 for 4x4 chroma) e.g., Mode 3: diagonal down/right prediction a, f, k, p are predicted by (A + 2Q + I + 2) >> 2 1 2 3 4 5 6 7 8 Q A B C D E F G H I a b c d J e f g h K i j k l L m n o p M N O P

4x4 Intra Block Prediction Modes Nine 4x4 block prediction modes

16x16 Luma (8x8 Chroma) Intra Prediction Four 16x16 Luma (8x8 chrominance) intra predication modes

Innovation 3: Flexible Block MC Motion vector accuracy 1/4 (6-tap filter) (1/8 sample bilinear for Chroma) 8x8 4x8 1 2 3 4x4 8x4 Types 16x16 8x16 MB 16x8

Example: H.264 MC

Innovation 4: Multiple Reference Frames New frame

Multiple Reference Frames Reference blocks Weighted bi-prediction

Innovation 5: In-Loop Deblocking

In-Loop Deblocking Filter Improves subjective visual quality Much better than out-of-loop post-filtering Highly context adaptive Without loop filter With H.264/AVC loop filter

Innovation 6: Two Entropy Coding Methods - CAVLC (Context-Adaptive Variable- Length Coding) - CABAC (Context-Adaptive Binary Arithmetic Coding)

H.264 Entropy Coding Exp-Golomb Code CAVLC (Context adaptive VLC) For all symbols except transform coefficients Variable length codes with a regular construction, e.g., 0 -> 1; 1-> 010; 2 -> 011; 3 -> 00100; 4 -> 00101; 5 -> 00110 6 -> 00111; 7 -> 0001000; 8 -> 0001001 … CAVLC (Context adaptive VLC) For transform coefficients No end-of-block, but the number of coefficients is encoded Coefficients are scanned backwards Contexts are built dependent on transform coefficients CABAC (Context-based binary arithmetic coding) Uses adaptive probability models for most symbols Exploiting symbol correlations by using contexts Average bi-rate saving over CAVLC 10-15%

Innovation 7: Network Abstraction Layer

H.264 vs. MPEG-2: Low bit-rate (1)

H.264 vs. MPEG-2: Low bit-rate (2) MPEG-2 203kbps H.264 39 kbps

Comparison to Other Standards

Basic H.264 Profiles Baseline (Video-conferencing & Wireless) I and P frames (no B frame) Interlace Adaptive frame/field In-loop deblocking filter ¼ -sample motion compensation Variable block motion estimation CAVLC Some error resilience features, e.g., ASO, FMO Main profile (Broadcast) All baseline features except enhanced error resilience features B frame CABAC MB-level frame/field switching Adaptive weighting for B and P picture prediction

Enhanced H.264 Profiles Extended Profiles (Streaming) High profile Main profiles + Error resilience - CABAC More error resilience: data partition SP/SI switching pictures High profile Old name: Fidelity-Range Extensions (FRExt) Main profile Switchable 8x8 transform Scaling matrix for subjective quality optimization Implementation beyond Main Profile affects Intra prediction, transform, deblocking filter control, CABAC decoding

High Profile H.264/AVC standard finished 2003 High profile ITU-T/H.264 finalized May, 2003 MPEG-4 AVC finalized July, 2003 High profile Initiated in July 2003 and finished in July 2004 Motivation: higher quality and higher rates Consider more than 8 bits sequences, and various color spaces Improved coding efficiency (bit-rate reduction): e.g., 12% for HD films and progressive HD video Complexity issues: No increase in computational requirements Slight increase in memory requirements (CABAC, transform) No reason not to move to High profile !

New Features in High Profile Larger transforms 8x8 transform Drop 4x8, 8x4, and larger transforms Quantization matrix 4x4, 8x8, intra, inter trans. coefficients weighted differently Full capabilities not yet explored (visual weighting) Coding in various space 4:4:4, 4:2:2, 4:2:0, and monochrome New integer color transform Efficient lossless interframe coding Film grain characterization for analysis/synthesis representation Stereo-view video support De-blocking filter display preference

8x8 16-bit Transform Computational complexity One 8x8 block has the same number of adds (64) and 4 extra shifts (20 vs. 16) compared with four 4x4 transform

8x8 Transform Coefficients Scan Two Scans Different scan for frame/field coding Frame scan Field scan

8x8 Intra Block Prediction Nine intra-prediction modes similar to the nine modes for 4x4 block prediction &

Quantization Matrix Similar concept to MPEG-2 design Vary step size based on frequency Adapted to modified transform structure More efficient representation of weights Separate matrix for inter and intra Matrix can be included in picture/slice head information Eight downloadable matrices (at least for 4:2:0) Intra 4x4 Y, Cb, Cr Intra 8x8 Y Inter 4x4 Y, Cb, Cr Inter 8x8 Y

Reversible Integer Color Transform Color transform for YUV Integer color transform (YCoCg)

Other High Profile Details Deblocking filters: Only control of filter is adjusted: do no filter for 4x4 blocks Filter operation itself does not change CABAC 61 contexts and their corresponding initial values No change to CABAC engine Information signaling 8x8 transform on/off flag at the picture head information 8x8 transform on/off flag at per macroblock allows adaptive use

H.264 High Profile vs. MPEG-2 BigShip HD sequence (1280x720, 720p)

Subjective Performance * Subjective tests by Blu-Ray Disk Founders of FRExt HP 4:2:0/8 (HP) 1920x1080x24p (1080p), 3 clips. Notional 3:1 advantage to MPEG-2 8 Mbps HP scored better than 24 Mbps MPEG-2! Apparent transparency at 16 Mbps! 5: Perfect 4: Good 3: Fair (OK for DVD) 2: Poor 1: Very Poor *JVT-L033, M1116, Draft JVT Redmond report

High Profile I-Frame Coding vs. JPEG2000 High profile I frame coding with RD-optimization model selection RD-optimized JPEG2000 coder used

Challenging Problems Major problem: reduce the computational complexity without sacrificing the performance Motion estimation Fast motion search Reference frames selection Macroblock mode decisions Seven inter modes, intra mode with prediction Try all and select the best? Mode decision criterion needed Etc. Implementation issues Read time H.264 encoding and decoding Hardware implementations

Applications and Markets Storage Video CD, DVD, Hard Disk, Web publishing Broadcast Satellite, Cable, Terrestrial Conversational Video-conferencing, Cell phones, PDAs Streaming Video-on-demand, music video, streaming ads Future Applications! – unknown

Instant Video Messaging H.264 Opportunities Map Hardware-Based Codec Implementation Software-Based MPEG-2, Open Standards Dominant WMT, Real Dominant Portable Gaming HD STB IP STB PC Streaming Video Conferencing Mobile Videophony PVR/ HomeNet HD DVD Players Instant Video Messaging MCCD’s Mobile Streaming Still Cameras Security/Defense HD DVD Media Digital Cinema Annual Shipments

Example: HD DVD Multimedia With H.264, put 2 hours of HD on DVD-9 Note: a 100-min HD movie fits in 8.25 GB @ 11 Mb/s Keep MPEG-2 skin Systems, audio… minor change to DVD player Small cost, big quality jump Even better with blue-ray when ready Tech is “laser-agnostic” Studios can recycle catalog in HD Double the money!! Source: DVD-FAQ (Jim Taylor)

H.264/AVC Organization Adoptions ITU-T systems adoption completed as early as May 2003 MPEG-2 and MPEG-4 systems & file format adoption HD DVD in DVD Forum: Mandatory player support Blue-Ray Disc Founders (BDF) High Profile (HP) is their first choice beyond MPEG-2 Digital Multimedia Broadcast in Rep. of Korea Mobile broadcast announcement in Japan France Terrestrial Broadcast announcement H.264/AVC HD instead of MPEG-2

Companies Publicly Known to Implement H.264 Standard Ahead Software / ATEME Amphion Apple Computer British Telecom Broadcom / Sand Video (chips) Conexant (chipset for STB) Cradle Deutsche Telekom DG2L Dicas DSP Research / W&W Communications Emblaze Group Envivio Equator FastVDO France Telecom Hantro Harmonic (filtering and motion estimation) HHI (PC & DSP encode & decode; demos) i3 Micro Technology iVast Intel KDDI R&D Labs Ligos LSI Logic / Videolocus Mainconcept Mcubeworks Media Excel Mobile Video Imaging Mobilygen Modulus Video (main profile levels 3 & 4 b’cast encoders & professional-use decoders) Moonlight Cordless Motorola Neomagic Nokia Oki Electric Optibase Packetvideo PixelTools PixSil Technology Polycom (videoconferencing & MCUs) Prodys Radvision (videoconferencing) Richcore Samsung (Terrestrial DMB receiver) Scientific Atlanta Setabox SkyStream Networks Sony (encode & decode, software & hardware, including PlayStation Portable 2004 & videoconferencing systems) ST Micro (decoder chip in ‘03) Tandberg (shipping with all videoconferencing endpoints since July ’03, GW and MCU since Oct.) TandbergTV Tektronix Techno Mathematical Telesuite thin multimedia Thomson TI (DSP partner with UBV for one of two UBV real-time implementations) Toshiba Tuxia UB Video (demoed real-time encode and decode, software and DSP implementations) Videosoft / Vanguard Software Solutions (s/w, enc/dec) VideoTele.com (a division of Tut Systems) VCON Vqual W&W Communications / DSP Research CAUTION: This information should be considered preliminary and should not be considered to be product announcements – only preliminary implementation work. It may be a while before robust interoperable implementations are well-established.

References IEEE Transactions on Circuits and Systems for Video Technology, July 2003. http://www.vcodex.com/h264.html ftp site: http://bs.hhi.de/~suehring/tml/ P. Topiwala, H.264/AVC: Overview and Introduction to Fidelity-Range Extensions, http://www.fastvdo.com T. Wiegand, S. Gordon, A. Luthra: H.264/AVC High Profile, Presented to DVB, Sept 2004 H.264 Overview, AddPac Tech. Co. Ltd. JVT-L033, M1116, Draft JVT Redmond report G. Sullivan, P. Topiwala, and A. Luthra, The H.264/AVC Advanced Video Coding Standard: Overview and Introduction to the Fidelity Range Extensions, SPIE Conference on Applications of Digital Image Processing XXVII, Special Session on Advances in the New Emerging Standard: H.264/AVC, August, 2004 L. Liu, P. Topiwala, P. Rault and T. D. Tran, Comparison of JPEG2000 with H.264/AVC FRExt I - Frame Coding on 720p Video Sequences, JVT-N010, Jan. 2005 Google H.264

What’s Next? H.264+ or H.265! NGVC: Next-Generation Video Coding Goal: 50% bit-rate reduction, same complexity, same perceptual video quality Some new tools under investigation Adaptive interpolation filter (AIF) for sub-pixel MEMC "Super-macroblock" structure up to 64x64 with additional transforms Adaptive prediction error coding (APEC) in spatial and frequency domain Adaptive quantization matrix selection (AQMS) Competition-based scheme for motion vector selection and coding Mode-dependent adaptive transform for intra coding