-1/20- Scalable Video Coding Scalable Extension of H.264 / AVC.

Slides:



Advertisements
Similar presentations
AVC Compression Update: FRExt and future Matthew Goldman Vice President of Technology Compression Systems.
Advertisements

Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
KIANOOSH MOKHTARIAN SCHOOL OF COMPUTING SCIENCE SIMON FRASER UNIVERSITY 6/24/2007 Overview of the Scalable Video Coding Extension of the H.264/AVC Standard.
2005/01/191/14 Overview of Fine Granularity Scalability in MPEG-4 Video Standard Weiping Li Fellow, IEEE IEEE Transactions on Circuits and Systems for.
MPEG4 Natural Video Coding Functionalities: –Coding of arbitrary shaped objects –Efficient compression of video and images over wide range of bit rates.
A Performance Analysis of the ITU-T Draft H.26L Video Coding Standard Anthony Joch, Faouzi Kossentini, Panos Nasiopoulos Packetvideo Workshop 2002 Department.
Basics of MPEG Picture sizes: up to 4095 x 4095 Most algorithms are for the CCIR 601 format for video frames Y-Cb-Cr color space NTSC: 525 lines per frame.
-1/20- MPEG 4, H.264 Compression Standards Presented by Dukhyun Chang
1 Video Coding Concept Kai-Chao Yang. 2 Video Sequence and Picture Video sequence Large amount of temporal redundancy Intra Picture/VOP/Slice (I-Picture)
Optimal Scalable Video Multiplexing in Mobile Broadcast Networks
Limin Liu, Member, IEEE Zhen Li, Member, IEEE Edward J. Delp, Fellow, IEEE CSVT 2009.
Li Liu, Robert Cohen, Huifang Sun, Anthony Vetro, Xinhua Zhuang BMSB
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada End-to-End Secure Delivery of Scalable Video Streams Mohamed Hefeeda (Joint.
Overview of the Scalable Video Coding Extension of the H
DWT based Scalable video coding with scalable motion coding Syed Jawwad Bukhari.
Fine Grained Scalable Video Coding For Streaming Multimedia Communications Zahid Ali 2 April 2006.
Video Transmission Adopting Scalable Video Coding over Time- varying Networks Chun-Su Park, Nam-Hyeong Kim, Sang-Hee Park, Goo-Rak Kwon, and Sung-Jea Ko,
Overview of Fine Granularity Scalability in MPEG-4 Video Standard Weiping Li, Fellow, IEEE.
Motion-compensation Fine-Granular-Scalability (MC-FGS) for wireless multimedia M. van der Schaar, H. Radha Proceedings of IEEE Symposium on Multimedia.
Introduction to Video Transcoding Of MCLAB Seminar Series By Felix.
Communication & Multimedia C. -Y. Tsai 2005/8/17 1 MCTF in Current Scalable Video Coding Schemes Student: Chia-Yang Tsai Advisor: Prof. Hsueh-Ming Hang.
Efficient Fine Granularity Scalability Using Adaptive Leaky Factor Yunlong Gao and Lap-Pui Chau, Senior Member, IEEE IEEE TRANSACTIONS ON BROADCASTING,
H.264/AVC for Wireless Applications Thomas Stockhammer, and Thomas Wiegand Institute for Communications Engineering, Munich University of Technology, Germany.
Institute of Electronics, National Chiao Tung University Scalable Extension of H.264/AVC Student: Hung-Chih Lin Advisor: Prof. Hsueh-Ming Hang.
4/24/2002SCL UCSB1 Optimal End-to-end Distortion Estimation for Drift Management in Scalable Video Coding H. Yang, R. Zhang and K. Rose Signal Compression.
09/24/02ICIP20021 Drift Management and Adaptive Bit Rate Allocation in Scalable Video Coding H. Yang, R. Zhang and K. Rose Signal Compression Lab ECE Department.
Prof. V. M. Gadre Department of Electrical Engineering, IIT Bombay.
An Introduction to H.264/AVC and 3D Video Coding.
1. 1. Problem Statement 2. Overview of H.264/AVC Scalable Extension I. Temporal Scalability II. Spatial Scalability III. Complexity Reduction 3. Previous.
MPEG-2 Digital Video Coding Standard
Overview of the Scalable Video Coding Extension of the H.264/AVC Standard Kai-Chao Yang 12007/8Kai-Chao Yang, NTHU, Taiwan.
Heiko Schwarz, Detlev Marpe, and Thomas Wiegand CSVT, Sept. 2007
EE 5359 H.264 to VC 1 Transcoding Vidhya Vijayakumar Multimedia Processing Lab MSEE, University of Arlington Guided.
3D EXTENSION of HEVC: Multi-View plus Depth Parashar Nayana Karunakar Student Id: Department of Electrical Engineering.
GODIAN MABINDAH RUTHERFORD UNUSI RICHARD MWANGI.  Differential coding operates by making numbers small. This is a major goal in compression technology:
Kai-Chao Yang Hierarchical Prediction Structures in H.264/AVC.
MPEG-2 Standard By Rigoberto Fernandez. MPEG Standards MPEG (Moving Pictures Experts Group) is a group of people that meet under ISO (International Standards.
Overview of the Stereo and Multiview Video Coding Extensions of the H
Farid Molazem Network Systems Lab Simon Fraser University Scalable Video Transmission for MobileTV.
Scalable Video Conferencing Using Subband Transform Coding and Layered Multicast Transmission Mathias Johanson Swedish Research Institute for Information.
Video in future 不屈号的航海长 July, 2009
Picture typestMyn1 Picture types There are three types of coded pictures. I (intra) pictures are fields or frames coded as a stand-alone still image. These.
Profiles and levelstMyn1 Profiles and levels MPEG-2 is intended to be generic, supporting a diverse range of applications Different algorithmic elements.
PERSONAL TELEPRESENCE USING SCALABLE VIDEO CODING Alex Eleftheriadis, Chief Scientist
FEC and RDO in SVC Thomas Wiegand 1. Outline Introduction SVC Bit-Stream Raptor Codes Layer-Aware FEC Simulation Results Linear Signal Model Description.
Layered Coding Basic Overview. Outline Pyramidal Coding Scalability in the Standard Codecs Layered Coding with Wavelets Conclusion.
Adaptive Multi-path Prediction for Error Resilient H.264 Coding Xiaosong Zhou, C.-C. Jay Kuo University of Southern California Multimedia Signal Processing.
- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison of H.264/MPEG4.
Scalable Video Coding Prof. V. M. Gadre Department of Electrical Engineering, IIT Bombay.
EE 5359 TOPICS IN SIGNAL PROCESSING PROJECT ANALYSIS OF AVS-M FOR LOW PICTURE RESOLUTION MOBILE APPLICATIONS Under Guidance of: Dr. K. R. Rao Dept. of.
Power saving control for the mobile DVB-H receivers based on H.264/SVC standard Eugeny Belyaev, Vitaly Grinko, Ann Ukhanova Saint-Petersburg State University.
- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison between H.264.
報告人:林祐沁 學生 指導教授:童曉儒 老師 March 2, Wireless Video Surveillance Server Based on CDMA1x and H.264.
Figure 1.a AVS China encoder [3] Video Bit stream.
Guillaume Laroche, Joel Jung, Beatrice Pesquet-Popescu CSVT
Scalable Video Coding and Transport Over Broad-band wireless networks Authors: D. Wu, Y. Hou, and Y.-Q. Zhang Source: Proceedings of the IEEE, Volume:
Advance in Scalable Video Coding Proc. IEEE 2005, Invited paper Jens-Rainer Ohm, Member, IEEE.
Fine Granularity Scalability in MPEG-4 Video by Weiping Li Presentation by Warren Cheung.
Overview of Fine Granularity Scalability in MPEG-4 Video Standard Weiping Li Presented by : Brian Eriksson.
IntroductiontMyn1 Introduction MPEG, Moving Picture Experts Group was started in 1988 as a working group within ISO/IEC with the aim of defining standards.
MPEG4 Fine Grained Scalable Multi-Resolution Layered Video Encoding Authors from: University of Georgia Speaker: Chang-Kuan Lin.
Video Compression—From Concepts to the H.264/AVC Standard
Introduction to MPEG Video Coding Dr. S. M. N. Arosha Senanayake, Senior Member/IEEE Associate Professor in Artificial Intelligence Room No: M2.06
Implementation and comparison study of H.264 and AVS china EE 5359 Multimedia Processing Spring 2012 Guidance : Prof K R Rao Pavan Kumar Reddy Gajjala.
MPEG Video Coding — MPEG-2. Administrative Issues (04/19/2016)  Draft of Final Report is due on Thursday, April 21, 2016 (extended to April 26)  Submit.
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Quality Evaluation and Comparison of SVC Encoders
Overview of the Scalable Video Coding
Research Topic Error Concealment Techniques in H.264/AVC for Wireless Video Transmission Vineeth Shetty Kolkeri EE Graduate,UTA.
MPEG4 Natural Video Coding
Presentation transcript:

-1/20- Scalable Video Coding Scalable Extension of H.264 / AVC

Scalable Video Coding  Video streaming over internet is gaining more and more popularity due to video conferencing and video telephony applications.  The heterogeneous, dynamic and best effort structure of the internet, motivates to introduce a scalability feature as adapting video streams to fluctuations in the available bandwidths.  Optimize the video quality for a large range of bit-rates.  A video bit stream is called scalable if part of the stream can be removed in such a way that the resulting bit stream is still decodable.  Scalability here implies:  Single encode  Multiple possibilities to transmit and decode bitstream

Scalable Video Coding A video bit stream is called scalable if part of the stream can be removed in such a way that the resulting bit stream is still decodable, to adapt to the various needs of end users and to varying terminal capabilities or network conditions.

SVC - Standardization 4

SVC Principle : one encoding 5

SVC Principle : multiple decoding 6

H.264/AVC Simulcast vs. SVC  Typical gains in quality by doing SVC spatial scalability (as opposed to Simulcast) may be in the range  of 0.5dB to 1.5dB PSNR gain  Or equivalently 10 to 30% bit rate reduction  This gap will be more if there are more than one SNR layer per spatial layer H.264 simulcast SD HD SVC HD+SD

Functionalities and Applications  SVC has capability of reconstructing lower resolution or lower quality signals from partial bit streams.  Partial decoding of the bit stream allows-  Graceful degradation in case part of bit stream is lost.  Bit-rate adaptation  Format adaptation  Power adaptation  Beneficial for transmission services with uncertainties regarding  Resolution required at the terminal.  Channel conditions or device types.

SVC Basics  Straight forward extension to H.264 with very limited added complexity  Layered approach  One base layer  One or more enhancement layers.  Base layer is H.264/AVC compliant.  An SVC stream can be decoded by an H.264 decoder.  Enhancement layers enable Temporal, Spatial or Quality (SNR) scalability.

SVC Profiles  SVC Standard defines 3 profiles  Scalable Baseline profile  Targeted for conversational and surveillance applications.  Support for Spatial Scalable coding is restricted to ratios 1.5 and 2, between successive spatial layers.  Interlaced video not supported.  Scalable High profile  Designed for broadcast, storage and streaming applications.  Spatial scalable coding with arbitrary resolution ratios supported.  Interlaced video supported  Scalable High Intra profile  Designed for professional applications.  Contains only IDR pictures for all layers.  All other coding tools are same as Scalable High Profile.

Temporal Scalability (Dyadic prediction structure)  Group of Pictures (GOP)  Key Picture: Typically Intra-coded  Hierarchically predicted B Pictures: Motion-Compensated Prediction Frame Rate = 3.75 fps Frame Rate = 7.5 fpsFrame Rate = 15 fpsFrame Rate = 30 fps Prediction GOP border Key Picture T0T0 T0T0 T1T1 T2T2 T2T2 T3T3 T3T3 T3T3 T3T3 T x : Temporal Layer Identifier Structural Delay = 7 frames

Hierarchical B-pictures Above is a non-dyadic prediction structure, which provides 2 independently decodable subsequences with 1/9 th and 1/3 rd of full frame rate. Structural delay = 8 frames Figure courtesy “Overview of Scalable Video Coding extension of H.264 / AVC” SCHWARZ et al., IEEE Transactions on circuits and Systems for Video Technology, Sept Above is a non-dyadic prediction structure, which provides 0 structural delay, but low coding efficiency, compared to above examples. Any chosen prediction structure need not be constant over time. It can be arbitrarily modified, e.g., to improve coding efficiency.

 IPP : GOP Size 1  No Temporal scalability  Only Temporal Level 0  IBP : GOP Size 2  Temporal Levels 0, 1  GOP Size 4  Temporal Levels 0, 1, 2  GOP Size 8  Temporal Levels 0, 1, 2, 3 Group Of Pictures (GOP)

Coding efficiency of Hierarchical Prediction Structures  Significant improvement in coding efficiency for high delay app.  Depends on how QP is chosen for different temporal layers.  larger GOP size gives larger PSNR improvement  Smaller QP for lower layer Figure courtesy “Overview of Scalable Video Coding extension of H.264 / AVC” SCHWARZ et al., IEEE Transactions on circuits and Systems for Video Technology, Sept. 2007

Spatial Scalability Sub-sample and Encode to form Base Layer Decode and Up-sample to original Resolution Subtract Predicted from Original Encode residue to form Enhancement Layer The base layer contains a reduced-resolution version of each coded frame. Decoding the base layer alone produces a low-resolution output sequence and decoding the base layer with enhancement layer(s) produces a higher-resolution output.

Spatial Scalability  The prediction signals are formed by  MCP inside the enhancement layer (Temporal) (small motion and high spatial detail)  Up-sampling from the lower layer (Spatial)  Average of the above two predictions (Temporal + Spatial)  Inter-layer prediction  Three kinds of inter-layer prediction  Inter-layer motion prediction  Inter-layer residual prediction  Inter-layer intra prediction ( when the co-located lower layer MB is intra coded )  Base mode MB  Only residuals are transmitted, but no additional side info.

Extended Spatial Scalability (ESS)  This is required in many applications where different display sizes from broadcasting, communications and IT environments are commonly mixed, having different aspect ratios (like 4:3 or 16:9 etc).

Quality / Fidelity / SNR Scalability  Types  Coarse Grain Scalability (CGS)  Medium Grain Scalability (MGS)  Fine Grain Scalability (FGS)  Not supported by SVC standard because of very poor enhancement layer coding efficiency.  Bit rate adaptation at same spatial/temporal resolution  SVC supports up to 16 SNR layers for each spatial layer

Coarse-grain quality scalability (CGS)  A special case of spatial scalability  Identical sizes (resolution) for base and enhancement layers  Smaller quantization step sizes for higher enhancement residual layers  Designed for only several selected bit-rate points  Supported bit-rate points = Number of layers  Switch can only occur at IDR access units

Medium-grain quality scalability (MGS)  More enhancement layers are supported  Refinement quality layers of residual  Key pictures  Drift control  Switch can occur at any access units  CGS + key pictures + refinement quality layers  Drift control  Drift: The effect caused by unsynchronized MCP at the encoder and decoder side  Trade-off of MCP in quality SVC  Coding efficiency  drift

SVC Encoder Dependency layer The same motion/prediction information Temporal Decomposition

SVC: Combined Scalability Spatio-Temporal-Quality Cube

Combined Scalability  Dependency and Quality refinement layers D = 2 Q = 2 Q = 1 Q = 0 D = 1 Q = 2 Q = 1 Q = 0 D = 0 Q = 2 Q = 1 Q = 0 Scalable bit- stream

Combined Scalability T0T0 D1D1 Q1Q1 Q0Q0 D0D0 Q1Q1 Q0Q0 T2T2 T1T1 T2T2 T0T0

 Bit-stream format  Bit-stream switching  Inside a dependency layer  Switching everywhere  Outside a dependency layer  Switching up only at IDR access units  Switching down everywhere if using multiple-loop decoding NAL unit header NAL unit header extension NAL unit payload PTDQ P (priority_id): indicates the importance of a NAL unit T (temporal_id): indicates temporal level D (dependency_id): indicates spatial/CGS layer Q (quality_id): indicates MGS/FGS layer

Profiles of SVC  Scalable Baseline  For conversational and surveillance applications requiring low decoding complexity  Spatial scalability: fixed ratio (1, 1.5, or 2) and MB-aligned cropping  Temporal and quality scalability: arbitrary  No interlaced coding tools  B-slices, weighted prediction, CABAC, and 8x8 luma transform  The base layer conforms Baseline profile of H.264/AVC  Scalable High  For broadcast, streaming, and storage  Spatial, temporal, and quality scalability: arbitrary  The base layer conforms High profile of H.264/AVC  Scalable High Intra  Scalable High + all IDR pictures

Conclusions  Temporal scalability  Hierarchical prediction structure  Spatial and quality scalability  Inter-layer prediction of Intra, motion, and residual information  Single-loop MC decoding  Identical size for each spatial layer – CGS  CGS + key pictures + quality refinement layer – MGS  applications  Power adaption – decoding needed part of the video stream  Graceful degradation – when “right” parts are lost  Format adaption – backwards compatible extension in mobile TV  What’s next in SVC?  Bit-depth scalability (8-bit 4:2:0  10-bit 4:2:0)  Color format scalability (4:2:0  4:4:4) 2007/8 MC2008, VCLAB 27

References  H. Schwarz, D. Marpe, and T. Wiegand, “Overview of the Scalable Video Coding Extension of the H.264/AVC Standard,” CSVT  T. Wiegand, “Scalable Video Coding,” Joint Video Team, doc. JVT-W132, San Jose, USA, April  T. Wiegand, “Scalable Video Coding,” Digital Image Communication, Course at Technical University of Berlin, (Available on  H. Schwarz, D. Marpe, and T. Wiegand, “Constrained Inter- Layer Prediction for Single-Loop Decoding in Spatial Scalability,” Proc. of ICIP’ /8 MC2008, VCLAB 28