Advanced Video Compression Standards Gary Sullivan Microsoft Corp. Software Design Engineer ITU-T Rapporteur of Advanced Video.

Slides:



Advertisements
Similar presentations
Low-Complexity Transform and Quantization in H.264/AVC
Advertisements

March 24, 2004 Will H.264 Live Up to the Promise of MPEG-4 ? Vide / SURA March Marshall Eubanks Chief Technology Officer.
International Telecommunication Union Workshop on Standardization in E-health Geneva, May 2003 Standard encoding protocols for image and video coding.
with RGB Reversibility
Internet for multimedia content Yogendra Pal Chief Engineer, All India Radio.
AVC Compression Update: FRExt and future Matthew Goldman Vice President of Technology Compression Systems.
Multimedia System Video
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
KIANOOSH MOKHTARIAN SCHOOL OF COMPUTING SCIENCE SIMON FRASER UNIVERSITY 6/24/2007 Overview of the Scalable Video Coding Extension of the H.264/AVC Standard.
2005/01/191/14 Overview of Fine Granularity Scalability in MPEG-4 Video Standard Weiping Li Fellow, IEEE IEEE Transactions on Circuits and Systems for.
A Graduate Course on Multimedia Technology 3. Multimedia Communication © Wolfgang Effelsberg Media Scaling and Media Filtering Definition of.
MPEG-2 to H.264/AVC Transcoding Techniques Jun Xin Xilient Inc. Cupertino, CA.
H.261: A Standard for VideoConferencing Applications Nimrod Peleg Update: Nov
H.264 Intra Frame Coder System Design Özgür Taşdizen Microelectronics Program at Sabanci University 4/8/2005.
A Performance Analysis of the ITU-T Draft H.26L Video Coding Standard Anthony Joch, Faouzi Kossentini, Panos Nasiopoulos Packetvideo Workshop 2002 Department.
Basics of MPEG Picture sizes: up to 4095 x 4095 Most algorithms are for the CCIR 601 format for video frames Y-Cb-Cr color space NTSC: 525 lines per frame.
-1/20- MPEG 4, H.264 Compression Standards Presented by Dukhyun Chang
Technion - IIT Dept. of Electrical Engineering Signal and Image Processing lab Transrating and Transcoding of Coded Video Signals David Malah Ran Bar-Sella.
1 Video Coding Concept Kai-Chao Yang. 2 Video Sequence and Picture Video sequence Large amount of temporal redundancy Intra Picture/VOP/Slice (I-Picture)
Source Coding for Video Application
SWE 423: Multimedia Systems
Chapter 7 End-to-End Data
Overview of the H.264/AVC Video Coding Standard
H.264/Advanced Video Coding – A New Standard Song Jiqiang Oct 21, 2003.
Overview of Fine Granularity Scalability in MPEG-4 Video Standard Weiping Li, Fellow, IEEE.
H.264 / MPEG-4 Part 10 Nimrod Peleg March 2003.
1 Image and Video Compression: An Overview Jayanta Mukhopadhyay Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur,
An Introduction to H.264/AVC and 3D Video Coding.
EE 5359 H.264 to VC 1 Transcoding Vidhya Vijayakumar Multimedia Processing Lab MSEE, University of Arlington Guided.
MPEG-4 & Windows Media Dr. Jordi Ribas-Corbera Lead Program Manager, Codecs Digital Media Division Microsoft Corp
Moving PicturestMyn1 Moving Pictures MPEG, Motion Picture Experts Group MPEG is a set of standards designed to support ”Coding of Moving Pictures and Associated.
Image Processing Architecture, © Oleh TretiakPage 1Lecture 9 ECEC-453 Image Processing Architecture Lecture 9, 2/12/ 2004 MPEG 1 Oleh Tretiak.
 Coding efficiency/Compression ratio:  The loss of information or distortion measure:
Page 19/15/2015 CSE 40373/60373: Multimedia Systems 11.1 MPEG 1 and 2  MPEG: Moving Pictures Experts Group for the development of digital video  It is.
M P E G A Presentation by Sebastian Graf and Thomas Winterscheid.
MPEG-1 and MPEG-2 Digital Video Coding Standards Author: Thomas Sikora Presenter: Chaojun Liang.
Profiles and levelstMyn1 Profiles and levels MPEG-2 is intended to be generic, supporting a diverse range of applications Different algorithmic elements.
Windows Media Video 9 Tarun Bhatia Multimedia Processing Lab University Of Texas at Arlington 11/05/04.
CMPT365 Multimedia Systems 1 Media Compression - Video Coding Standards Spring 2015 CMPT 365 Multimedia Systems.
Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience Faouzi Kossentini Signal Processing and Multimedia Group Department of.
Outline JVT/H.26L: History, Goals, Applications, Structure
Data Compression. Compression? Compression refers to the ways in which the amount of data needed to store an image or other file can be reduced. This.
Image Compression Supervised By: Mr.Nael Alian Student: Anwaar Ahmed Abu-AlQomboz ID: IT College “Multimedia”
Image Processing Architecture, © 2001, 2002 Oleh TretiakPage 1Lecture 15 ECEC-453 Image Processing Architecture 3/11/2004 Exam Review Oleh Tretiak Drexel.
Rate-distortion Optimized Mode Selection Based on Multi-channel Realizations Markus Gärtner Davide Bertozzi Classroom Presentation 13 th March 2001.
Figure 1.a AVS China encoder [3] Video Bit stream.
Image Processing Architecture, © Oleh TretiakPage 1Lecture 10 ECEC 453 Image Processing Architecture Lecture 10, 2/17/2004 MPEG-2, Industrial.
Compression of Real-Time Cardiac MRI Video Sequences EE 368B Final Project December 8, 2000 Neal K. Bangerter and Julie C. Sabataitis.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
Image/Video Coding Techniques for IPTV Applications Wen-Jyi Hwang ( 黃文吉 ) Department of Computer Science and Information Engineering, National Taiwan Normal.
Fine Granularity Scalability in MPEG-4 Video by Weiping Li Presentation by Warren Cheung.
Rate-distortion Optimized Mode Selection Based on Multi-path Channel Simulation Markus Gärtner Davide Bertozzi Project Proposal Classroom Presentation.
Video Compression—From Concepts to the H.264/AVC Standard
Block-based coding Multimedia Systems and Standards S2 IF Telkom University.
Video Compression and Standards
Overview of Digital Video Compression Multimedia Systems and Standards S2 IF Telkom University.
Flow Control in Compressed Video Communications #2 Multimedia Systems and Standards S2 IF ITTelkom.
COMPARATIVE STUDY OF HEVC and H.264 INTRA FRAME CODING AND JPEG2000 BY Under the Guidance of Harshdeep Brahmasury Jain Dr. K. R. RAO ID MS Electrical.
UNIT V. Linear Predictive coding With the advent of inexpensive digital signal processing circuits, the source simply analyzing the audio waveform to.
V ENUS INTERNATIONAL COLLEGE OF TECHNOLOGY Guided by : Rinkal mam.
Multi-Frame Motion Estimation and Mode Decision in H.264 Codec Shauli Rozen Amit Yedidia Supervised by Dr. Shlomo Greenberg Communication Systems Engineering.
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Present by 楊信弘 Advisor: 鄭芳炫
JPEG Compression What is JPEG? Motivation
CSI-447: Multimedia Systems
Overview of the Scalable Video Coding
Research Topic Error Concealment Techniques in H.264/AVC for Wireless Video Transmission Vineeth Shetty Kolkeri EE Graduate,UTA.
Standards Presentation ECE 8873 – Data Compression and Modeling
JPEG Still Image Data Compression Standard
Presentation transcript:

Advanced Video Compression Standards Gary Sullivan Microsoft Corp. Software Design Engineer ITU-T Rapporteur of Advanced Video Coding ITU-T Recommendation H.263 Editor Stanford, February 15, 2001

Overview Video Standardization Concepts History Recent events –Standardization projects –H.263 v1 & H.263+ & H –MPEG-4 v1, v2, v3 –H.26L Progress –Microsoft Windows Media Video Future Stuff –H.26L Finalization –MPEG Video Plans –Trends

Formal Standards Specification available to all at little or no cost Anyone allowed to implement Agreement officially by consensus, not decided by a single organization’s interests Relatively open committee with variety of participants (including hostile competitors, with no contract to support a common agenda, often meeting with formal government approval) In practice, each standards organization tends to have its own “personality”

Video Coding Standardization Organizations Two organizations dominate video compression standardization: –ITU-T Video Coding Experts Group (VCEG) International Telecommunications Union – Telecommunications Standardization Sector (ITU-T, a United Nations Organization, formerly CCITT), Study Group 16, Question 6 –ISO/IEC Moving Picture Experts Group (MPEG) International Standardization Organization and International Electrotechnical Commission, Joint Technical Committee Number 1, Subcommittee 29, Working Group 11

Dynamics of the Video Standardization Process VCEG is older and more focused on conventional (esp. low-delay) video coding goals (e.g. good compression and packet-loss/error resilience) MPEG is larger and takes on more ambitious goals (e.g. “object oriented video”, “synthetic-natural hybrid coding”, and digital cinema) Sometimes the major organizations team up (e.g. ISO, IEC and ITU teamed up for both MPEG-2 and JPEG) Relatively little industry consortium activity (DV and organizations that tweak the video coding standards in minor ways, such as DVD, 3GPP, 3GPP2, SMPTE, IETF, etc.) Growing activity for internet streaming media outside of formal standardization (e.g., Microsoft, Real Networks, Quicktime)

The Scope of Picture and Video Coding Standardization Only the Syntax and Decoder are standardized: –Permits optimization beyond the obvious –Permits complexity reduction for implementability –Provides no quality guarantees – only interoperability Pre-ProcessingEncoding Source Destination Post-ProcessingDecoding Scope of Standard

H.261: The Basis of Modern Video Compression ITU-T (ex-CCITT) Rec. H.261: The first widespread practical success –First design (late ‘90) embodying typical structure that dominates today: 16x16 macroblock motion compensation, 8x8 DCT, scalar quantization, and variable-length coding –Key aspects later dropped by other standards: loop filter, integer motion comp., 2-D VLC, header overhead –v2 (early ‘93) added a backward-compatible high-resolution graphics trick mode –Operated at kbps –Still in use, although mostly as a backward-compatibility feature – overtaken by H.263

Entropy Decode, Quant. Recon., Inverse DCT Typical MC+DCT Video Coder Motion Comp. Predictor DCT, Quantize, Entropy Encode Motion Estimation Frame Buffer (Delay) Motion Compensated Prediction Input Frame Encoded Residual (To Channel) Approximated Input Frame (To Display) Motion Vector and Block Mode Data (To Channel) Prior Coded Frame Approx (Dotted Box Shows Decoder)

Video Coding Efficiency Foreman 10 Hz, QCIF 100 frames encoded Integer-pel motion compensation (H ) Half-pel motion compensation (MPEG ) TMN-10 Variable block size motion compensation (H ) Frame difference coding (H ) Intraframe DCT coding (DCT 1974, JPEG 1992) PSNR [dB] Bit-Rate [kbps] ? 67 %

MPEG-1: Practicality at Higher Bit Rates Formally ISO/IEC (‘93), developed by ISO/IEC JTC1 SC29 WG11 (MPEG) – use is fairly widespread, but mostly overtaken by MPEG-2 –Superior quality to H.261 when operated a higher bit rates (  1 Mbps for CIF 352x288 resolution) –Can provide approximately VHS quality between 1-2 Mbps using SIF 352x240/288 resolution –Technical features: Adds bi-directional motion prediction and half-pixel motion to H.261 design

MPEG-2/H.262: Even Higher Bit Rates and Interlace Formally ISO/IEC & ITU-T H.262, developed (‘94) jointly by ITU-T and ISO/IEC SC29 WG11 (MPEG) – Now in wide use for DVD and standard and high-definition DTV (the most commonly used video coding standard) –Primary new technical features: support for interlaced- scan pictures and scalability –Essentially the same as MPEG-1 for progressive-scan pictures, and MPEG-1 forward compatibility required –Not especially useful below 4 Mbps (range of use normally 5-30 Mbps)

H.263: The Next Generation ITU-T Rec. H.263 (v1: 1995): The next generation of video coding performance, developed by ITU-T – the current best standard for practical video telecommunication (has overtaken H.261 as dominant videoconferencing codec) –Superior to H.261 at all bit rates –Wins by a factor of two at very low rates –Versions 2 (late 1997/early 1998) & v3 (2000) later developed

MPEG-4: Baseline H.263 and Many Creative Extras MPEG-4 (v1: early 1999), formally ISO/IEC : Contains the H.263 design and adds all prior features and various creative new extras –Includes segmented coding of shapes, zero-tree wavelet coding of still textures, coding of synthetic and semi-synthetic content, etc. –v2 (early 2000) & v3 (early 2001) later added

MPEG-4 and H.263 Standardization Dynamics MPEG-4 project launched soon after H.263 completed MPEG-4 project was very ambitious and was planned to be significantly different from H.263 Compatibility with H.263 was not initially planned in MPEG-4 (although it eventually turned out to be significantly compatible!) ITU-T decided to extend its H.263 quickly and compatibly rather than join up with longer, more ambitious, potentially-incompatible MPEG-4 effort for the features the ITU wanted Much cross-fertilization of ideas and people in projects

Detailed Recent History In Video Coding Standardization ITU-T Events –H.263v1 completed late ‘95 –H.263+ project (H.263 v2) technically final Sept ‘97 –H project (H.263 v3) technically final July ‘00 –H.26L project underway (test version available) ISO/IEC Events –MPEG-4 v1 completed early ’99 –MPEG-4 v2 completed early ’00 –MPEG-4 v3 completed early ‘01 –Potential for new work under evaluation

H New Version 3 Features Part 1 of 2 Annex U: Fidelity enhancement by macroblock and block-level reference picture selection – a significant improvement in compression quality Annex V: Packet Loss & Error Resilience using data partitioning with reversible VLCs (roughly similar to MPEG-4 data partitioning, but improved by using reversible coding of motion vectors rather than coefficients)

H New Version 3 Features Part 2 of 2 Annex W:Additional Supplemental Enhancement Information –IDCT Mismatch Elimination (specific fixed-point fast IDCT) –Arbitrary binary user data –Text messages (arbitrary, copyright, caption, video description, and URI) –Error Resilience: Picture header repetition (current, previous, next+TR, next-TR) Spare reference pictures for error concealment –Interlaced field indications (top & bottom)

H Annex U Rate Distortion Performance Bit-Rate [kbps] PSNR [dB] TMN reference pictures H.263 combined with Long-Term Memory Prediction 17 % Foreman 10 Hz, QCIF 100 frames encoded

Number of Reference Frames Average Bit-Rate Savings in 34 dB Foreman Container Mobile & Calendar Stefan Mother & Daughter Silent Tempete Average 13.5 % Average Bit Rate Savings

MPEG-4 Version 3 Just Completed (part 1 of 2) “Studio Profile” –Various additions oriented toward professional use of video within specialized studio environments –Adds 4:2:2 and 4:4:4 sampling structures –Adds more MPEG-2 elements to MPEG-4 “Fine Granularity Scalability Streaming Video Profile”, a new form of scalable video coding –Uses a scalable enhancement layer –Temporal prediction in enhancement layer is stopped to prevent temporal error propagation –Enhancement layer coded by bit-planes to form a “progressive- transmission” bitstream

MPEG-4 Version 3 Just Completed (part 2 of 2) “Advanced Simple Profile”, a combination of v1 features, containing: –“Simple Profile” features –B pictures –MPEG-2-style quantization –Interlace features (at higher levels only) –¼-pel motion –Global motion comp –Single stream support in new “level 0”

ITU-T VCEG H.26L Project Goals (Completion 2002) Compression beyond capability of H.263vN Real-time low-cost complexity Delay reduction Enhanced error and packet loss resilience Bit-rate adaptivity (e.g. scalability & BR reduction) Spatio-temporal resolution adaptivity Robustness to source material behavior

H.26L Status Test Model Long-Term Number 6: Designed January ’01 (Eibsee), description and software soon available on the ‘net TML-5 software and spec availalbe (Geneva, November ’00) Gain goal over 1999 standards: 50% savings in bits for same fidelity! (at all bit rates)

The H.26L TML-6 Design Part 1 of 4 Still using a hybrid of DPCM and transform coding as in prior standards. Common elements include: –16x16 macroblocks –Conventional sampling of chrominance and association of luminance and chrominance data –Block motion displacement –Block transforms (not wavelets or fractals) –Scalar quantization –Variable-length coding

The H.26L TML-6 Design Part 2 of 4 Motion Compensation: –Multiple reference pictures (per H Annex U) –B picture support (per several prior standards) –Multihypothesis concept being evaluated –1/4 sample accuracy motion (sort of per MPEG-4, could possibly go to 1/8 pel) –6x6 tap filtering to 1/2 sample accuracy, bilinear filtering to 1/4 sample accuracy –Various block sizes and shapes for motion compensation (7 segmentations of the macroblock) –“Funny position” with heavier filtering –Affine motion under consideration

The H.26L TML-6 Design Part 3 of 4 Intra Coding Structure: –Directional spatial prediction (6 types for luma, one for chroma) –Alterations under consideration Transform –Variable block size for intra (16x16, 8x8, 4x4) –Technically not exactly a DCT, but an integer transform closely approximating a DCT –Based primarily on 4x4 transform size (all prior standards used 8x8) –Expanded to 8x8 for chroma by 2x2 DC transform –Adaptive block size under consideration

The H.26L TML-6 Design Part 4 of 4 Two inverse scan patterns Logarithmic step size control Smaller step size for chroma (per H.263 Annex T) Universal variable-length coding (configurability under consideration) Adaptive arithmetic coding under strong consideration In-loop deblocking filter Distinct Network Adaptation Layer (NAL) design for network transport Inter-sequence transitional pictures under consideration

Future Work in MPEG MPEG to assess new video technology and address digital cinema needs –Calls for proposals issued –Tests to be conducted in next few months –ITU-T VCEG bringing H.26L as reference –Exploring potential future cooperative work between VCEG and MPEG

 Many devices  Wired or wireless  Access from anywhere  Software Integration  Personalized delivery Digital Media World Rich Services

Live Feed Windows Media Encoder Windows Media Services Server Windows Media Player PC, Hand-held, STB UNICAST, MULTICAST Stored Content Live Content On-demand Content Content Authoring Distribution Playback LicenseServer Streaming from a WM Server (or Web Server) Windows Media Download & Play Streaming

WM Video Coding 2 main video codecs: –Standard MPEG-4 Video –WM Video v8  ~ 40% bit rate savings over MS MPEG-4 Examples of WM Video : “Near-VHS to VHS quality”  320 x , Fps (250 – 500 Kbps)  PII 300 encoder, P5 200 decoder “Near-DVD to DVD quality”  640 x 480, fps (500 Kbps – 1.5 Mbps)  Dual PIII 700 encoder, PII 400 decoder WM Player also supports other video codecs such as MPEG-1

Windows Media Technologies Video-Related Features WMV 8 Codec: A big step forward in compression performance Screen Codec: Outstanding compression –(near) Lossless ! –640 x 480, 10 Fps < 20 Kbps (modem) –800 x 600, 15 Fps < 45 Kbps (ISDN/LAN) Advanced Streaming Format (ASF) file format Digital Rights Management (DRM): Critical for Content Providers

Future Trends Prediction is difficult - especially of the future. – Bohr ( ) If we do not succeed, then we run the risk of failure. – Quayle (Phoenix Rep. Forum, 1990)

Principles of Rate-Distortion Theory Errors using inadequate data are much less than those using no data at all. – Charles Babbage ( ) A little inaccuracy sometimes saves tons of explanation. – Saki (H.H. Munro, , The Comments of Moung Ka)

On Rate-Distortion Optimization Rate-distortion optimization and searching techniques will increase in importance –Most enhancements take the form of an expanded range of choices –More choices implies more need for searching and optimization –Lagrange multiplier optimization provides an understandable, straightforward framework –Recent understanding of coupling of step size and Lagrange multiplier makes it straightforward

Some Future Projections Coding Efficiency will continue to improve (Proof by existence): –4x4 coding –Long-term memory –Enhanced motion accuracy –Enhanced motion models –Enhanced intra coding People continue to come up with good ideas (and relatively predictable ones!)

What Area will Yield the Most Improvement? Although “prediction is difficult”, it is the area that will yield the most performance improvement Today’s coded motion model is primitive Several motion model improvement areas have yet to be fully exploited Waveform difference coding gain is limited

Won’t This be Unnecessary when Megabits become free? The need for better compression will not be reduced –Got more bits? Give me higher resolution. –Got more bits? Give me more channels. –Improving worth effort? 20% of a lot is a lot. –Bit rates have a slower doubling time than computing power.

Increasing “Layers” of Standardization In olden days: Design a system for a network with a video coder as part of that system design. Now: –Standardize a “language” of syntax with maximum flexibility and a rich feature set –Standardize how to configure the standard –Standardize how to encapsulate the standard data on a network –Standardize digital rights management for the data –Standardize the system to carry the data

Other Kinds of Layers Continuing interest in Layered coding: –Scalability in MPEG-2, H.263+, and MPEG-4 –Layered coding ongoing work (Microsoft, MPEG Enhanced FGS) –Mixed success toward products Motivation 1: The bit-rate scalability dream Motivation 2: The limitations of resolution Motivation 3: The error resilience need

Conclusions There will be plenty of need for further work. There will be plenty of need for more processing power. There will be plenty of need for more bits. There will be plenty of need for good ideas. And those good ideas will come. Dream no small dreams, for they have no power to move the hearts of men. - Goethe ( )