IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 7, NO. 5, OCTOBER 2005

Slides:



Advertisements
Similar presentations
Packet Video Error Concealment With Auto Regressive Model Yongbing Zhang, Xinguang Xiang, Debin Zhao, Siwe Ma, Student Member, IEEE, and Wen Gao, Fellow,
Advertisements

Wen-Hsiao Peng Chun-Chi Chen
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
MPEG4 Natural Video Coding Functionalities: –Coding of arbitrary shaped objects –Efficient compression of video and images over wide range of bit rates.
INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS, ICT '09. TAREK OUNI WALID AYEDI MOHAMED ABID NATIONAL ENGINEERING SCHOOL OF SFAX New Low Complexity.
Basics of MPEG Picture sizes: up to 4095 x 4095 Most algorithms are for the CCIR 601 format for video frames Y-Cb-Cr color space NTSC: 525 lines per frame.
-1/20- MPEG 4, H.264 Compression Standards Presented by Dukhyun Chang
Technion - IIT Dept. of Electrical Engineering Signal and Image Processing lab Transrating and Transcoding of Coded Video Signals David Malah Ran Bar-Sella.
SWE 423: Multimedia Systems
H.264/AVC Baseline Profile Decoder Complexity Analysis Michael Horowitz, Anthony Joch, Faouzi Kossentini, and Antti Hallapuro IEEE TRANSACTIONS ON CIRCUITS.
{ Fast Disparity Estimation Using Spatio- temporal Correlation of Disparity Field for Multiview Video Coding Wei Zhu, Xiang Tian, Fan Zhou and Yaowu Chen.
SCHOOL OF COMPUTING SCIENCE SIMON FRASER UNIVERSITY CMPT 820 : Error Mitigation Schaar and Chou, Multimedia over IP and Wireless Networks: Compression,
Yu-Han Chen, Tung-Chien Chen, Chuan-Yung Tsai, Sung-Fang Tsai, and Liang-Gee Chen, Fellow, IEEE IEEE CSVT
Efficient multi-frame motion estimation algorithms for MPEG-4 AVC/JVTH.264 Mei-Juan Chen, Yi-Yen Chiang, Hung- Ju Li and Ming-Chieh Chi ISCAS 2004.
DWT based Scalable video coding with scalable motion coding Syed Jawwad Bukhari.
Video Transmission Adopting Scalable Video Coding over Time- varying Networks Chun-Su Park, Nam-Hyeong Kim, Sang-Hee Park, Goo-Rak Kwon, and Sung-Jea Ko,
Rate-Distortion Optimized Layered Coding with Unequal Error Protection for Robust Internet Video Michael Gallant, Member, IEEE, and Faouzi Kossentini,
Overview of Fine Granularity Scalability in MPEG-4 Video Standard Weiping Li, Fellow, IEEE.
Decision Trees for Error Concealment in Video Decoding Song Cen and Pamela C. Cosman, Senior Member, IEEE IEEE TRANSACTION ON MULTIMEDIA, VOL. 5, NO. 1,
Scalable Wavelet Video Coding Using Aliasing- Reduced Hierarchical Motion Compensation Xuguang Yang, Member, IEEE, and Kannan Ramchandran, Member, IEEE.
A New Content-Based Hybrid Video Transcoding Method YongQing Liang YapPeng Tan Presented by Robert Hung.
Introduction to Video Transcoding Of MCLAB Seminar Series By Felix.
Motion Vector Refinement for High-Performance Transcoding Jeongnam Youn, Ming-Ting Sun, Fellow,IEEE, Chia-Wen Lin IEEE TRANSACTIONS ON MULTIMEDIA, MARCH.
Efficient Fine Granularity Scalability Using Adaptive Leaky Factor Yunlong Gao and Lap-Pui Chau, Senior Member, IEEE IEEE TRANSACTIONS ON BROADCASTING,
Error Resilience in a Generic Compressed Video Stream Transmitted over a Wireless Channel Muhammad Bilal
1 An Efficient Mode Decision Algorithm for H.264/AVC Encoding Optimization IEEE TRANSACTION ON MULTIMEDIA Hanli Wang, Student Member, IEEE, Sam Kwong,
Scalable Rate Control for MPEG-4 Video Hung-Ju Lee, Member, IEEE, Tihao Chiang, Senior Member, IEEE, and Ya-Qin Zhang, Fellow, IEEE IEEE TRANSACTIONS ON.
Fundamentals of Multimedia Chapter 11 MPEG Video Coding I MPEG-1 and 2
H.264/AVC for Wireless Applications Thomas Stockhammer, and Thomas Wiegand Institute for Communications Engineering, Munich University of Technology, Germany.
An Introduction to H.264/AVC and 3D Video Coding.
Video Compression Concepts Nimrod Peleg Update: Dec
1. 1. Problem Statement 2. Overview of H.264/AVC Scalable Extension I. Temporal Scalability II. Spatial Scalability III. Complexity Reduction 3. Previous.
January 26, Nick Feamster Development of a Transcoding Algorithm from MPEG to H.263.
MPEG-2 Digital Video Coding Standard
Video Streaming via Transcoding Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC 28223
1 Motivation Video Communication over Heterogeneous Networks –Diverse client devices –Various network connection bandwidths Limitations of Scalable Video.
1 Thread-Parallel MPEG-2, MPEG4 and H.264 Video Encoders for SoC Multi- Processor Architecture Tom R. Jacobs, Vassilios A. Chouliars, and David J. Mulvaney.
Liquan Shen Zhi Liu Xinpeng Zhang Wenqiang Zhao Zhaoyang Zhang An Effective CU Size Decision Method for HEVC Encoders IEEE TRANSACTIONS ON MULTIMEDIA,
Kai-Chao Yang Hierarchical Prediction Structures in H.264/AVC.
Electrical Engineering National Central University Video-Audio Processing Laboratory Data Error in (Networked) Video M.K.Tsai 04 / 08 / 2003.
 Coding efficiency/Compression ratio:  The loss of information or distortion measure:
Page 19/15/2015 CSE 40373/60373: Multimedia Systems 11.1 MPEG 1 and 2  MPEG: Moving Pictures Experts Group for the development of digital video  It is.
MPEG-1 and MPEG-2 Digital Video Coding Standards Author: Thomas Sikora Presenter: Chaojun Liang.
Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.
Windows Media Video 9 Tarun Bhatia Multimedia Processing Lab University Of Texas at Arlington 11/05/04.
Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience Faouzi Kossentini Signal Processing and Multimedia Group Department of.
CS :: Fall 2003 Media Scaling / Content Adaptation Ketan Mayer-Patel.
Adaptive Multi-path Prediction for Error Resilient H.264 Coding Xiaosong Zhou, C.-C. Jay Kuo University of Southern California Multimedia Signal Processing.
1 Adaptable applications Towards Balancing Network and Terminal Resources to Improve Video Quality D. Jarnikov.
Adaptive Rate Control for HEVC Visual Communications and Image Processing (VCIP), 2012 IEEE Junjun Si, Siwei Ma, Xinfeng Zhang, Wen Gao 1.
Compression video overview 演講者:林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.
Rate-distortion Optimized Mode Selection Based on Multi-channel Realizations Markus Gärtner Davide Bertozzi Classroom Presentation 13 th March 2001.
Guillaume Laroche, Joel Jung, Beatrice Pesquet-Popescu CSVT
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
Scalable Video Coding and Transport Over Broad-band wireless networks Authors: D. Wu, Y. Hou, and Y.-Q. Zhang Source: Proceedings of the IEEE, Volume:
Advance in Scalable Video Coding Proc. IEEE 2005, Invited paper Jens-Rainer Ohm, Member, IEEE.
Overview of Fine Granularity Scalability in MPEG-4 Video Standard Weiping Li Presented by : Brian Eriksson.
Block-based coding Multimedia Systems and Standards S2 IF Telkom University.
Video Compression and Standards
Flow Control in Compressed Video Communications #2 Multimedia Systems and Standards S2 IF ITTelkom.
Fundamentals of Multimedia Chapter 17 Wireless Networks 건국대학교 인터넷미디어공학부 임 창 훈.
MPEG CODING PROCESS. Contents  What is MPEG Encoding?  Why MPEG Encoding?  Types of frames in MPEG 1  Layer of MPEG1 Video  MPEG 1 Intra frame Encoding.
Principles of Video Compression Dr. S. M. N. Arosha Senanayake, Senior Member/IEEE Associate Professor in Artificial Intelligence Room No: M2.06
MPEG Video Coding I: MPEG-1 1. Overview  MPEG: Moving Pictures Experts Group, established in 1988 for the development of digital video.  It is appropriately.
H. 261 Video Compression Techniques 1. H.261  H.261: An earlier digital video compression standard, its principle of MC-based compression is retained.
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
CSI-447: Multimedia Systems
Overview of the Scalable Video Coding
ENEE 631 Project Video Codec and Shot Segmentation
Standards Presentation ECE 8873 – Data Compression and Modeling
Presentation transcript:

IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 7, NO. 5, OCTOBER 2005 Video Transcoding: An Overview of Various Techniques and Research Issues IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 7, NO. 5, OCTOBER 2005 Ishfaq Ahmad, Senior Member, IEEE, Xiaohui Wei, Student Member, IEEE, Yu Sun, Student Member, IEEE, and Ya-Qin Zhang, Fellow, IEEE 2006/09/19 S.K.Chang

Outline Video Transcoding Introduction Video Transcoding architecture Homogeneous video transcoding Heterogeneous video transcoding Related research issues Conclusion

Video Transcoding Introduction One of the fundamental challenges in deploying multimedia systems is to deliver smooth and uninterruptible flow of audio-visual information, anytime and anywhere. It may consist of various devices interconnected via heterogeneous wireline and wireless networks. multimedia content originally authored and compressed with a certain format may need bit rate adjustment / format conversion to allow access by receiving devices with diverse capabilities. A transcoding mechanism is required to make the content adaptive to the capabilities of diverse networks and client devices.

Video Transcoding Introduction Scalable coding is another approach to enable bit-rate adjustment. Traditional scalability in video compression can be of three types: SNR scalability, spatial scalability, and temporal scalability. To achieve different levels of video quality. scalable coding is inflex1ible since the number of different predefined layers is limited the bit-rate of the target video can not be reduced lower than the bit-rate of the base layer.

Video Transcoding Introduction

Video Transcoding Introduction

Video Transcoding architecture Transcoder Open-Loop Transcoder V.S. Closed-Loop Transcoder Feedback loop

Video Transcoding architecture

Video Transcoding architecture Drift error

Video Transcoding architecture

Video Transcoding architecture Spatial-Domain Video Transcoding performs dynamic bit-rate adaptation via the rate-control at the encoder side. is flexible since the decoder-loop and the encoder-loop can be totally independent of each other is drift-free, computational complexity is high for real-time applications.

Video Transcoding architecture

Video Transcoding architecture Spatial-Domain Video Transcoding When transcoding without spatial/temporal resolution reduction, the SDTA architecture can be further simplified in which only one feedback loop is employed.

Video Transcoding architecture

Video Transcoding architecture MV reuse Motion estimation in video encoding accounts for 60% ~ 70% of the encoder computation. Two optional functional blocks placed between the decoder and encoder: spatial/temporal resolution reduction (STR) Module STR allows the source video to be transcoded to target video with different spatial/temporal resolution accordingly. MV composition and refinement(MVCR) Module MVRC is needed to adjust the MVs when STR is applied.

Video Transcoding architecture Frequency-Domain Video Transcoding Exploiting the structural redundancy of the architecture in SDTA and the linearity of the DCT/IDCT, a structurally simpler but functionally equivalent frequency-domain transcoding architecture is possible In this architecture, only VLD and inverse quantization are performed to get DCT value of each block in the decoder end. At the encoder end, the motion compensated residue errors are encoded through re-quantization, and VLC. The reference frame memory in the encoder end stores the DCT values after inverse quantization, that are then fed to the frequency-domain MC module to reduce drift error. Motion compensation is performed in the frequency domain using a MV reusing algorithm.

Video Transcoding architecture Frequency-Domain Video Transcoding An FDTA may need less computation but suffer from the drift problem due to nonlinearity operations, which includes sub-pixel motion compensation, and DCT coefficients clipping during MC. FDTA is also lack flexibility and are mostly fitted for bi-rate transcoding.

Video Transcoding architecture

Video Transcoding architecture Hybrid-Domain Transcoding Architecture

Homogeneous video transcoding Reducing Bits With Fixed Resolution Re-Quantization: A simple technique to transcoding a video to lower bit rate is to increase the quantization step at the encoder part Requantizing is a good compromise between the complexity and reconstructed image quality,and can control the bit-rate reduction. Selective Transmission: Since most of the energy is concentrated at the lower frequency band of an image, discarding (truncating) some of the higher ac frequency coefficients can preserve the picture quality, but may introduce a blocking effect in the reconstructed target video.

Homogeneous video transcoding Spatial Resolution Reduction Filtering and Subsampling Pixel Averaging every m m pixels are represented by a single pixel of their average value. Discarding High Order DCT Coefficients DCT decimation delivers better quality for image down-sampling over filtering or pixel-averaging, but for large bit rate reduction greater than 25%,this method produces poor-quality blocky pictures.

Homogeneous video transcoding Spatial Resolution Reduction MV Composition and Refinement: Random Mean This technique may yield poor results if the magnitude of one of the input MVs is significantly larger than the rest. Weighted Average each MV is weighted by the spatial activity of the perspective prediction error. This method is prone to noise in candidate MVs and may bias the MV when original MVs are aimed in various directions. Weighted Median Computing the Euclidean distances between each MV. This method yields good performance, but requires substantial computation in determining the median MV. DCmax This method takes a little more computation than the Mean, but yields better performance than the Mean and the WA.

Homogeneous video transcoding Spatial Resolution Reduction MV refinements techniques At the encoder end are proposed. Since the passed MVs will almost be the same as the recalculated ones, we can refine them to get more appropriate values. The refinement can be done in a small search window around the passed MV.

Homogeneous video transcoding Spatial Resolution Reduction MB Coding Mode Decision: The high-quality original bitstream are not optimum fore-encoding at the reduced rate in rate reduction by requantizing. Solution re-evalate the macro block type at the encoder of the transcoder Bjork’s method a) If it was coded as INTRA (at the transmitter) again code it in INTRA. b) If it was coded as SKIPPED again code it as SKIPPED. c) If it was coded in INTER, check to see if all coefficients are zero and if they are coded as SKIPPED, else check again whether the macro block has to be coded in INTRA or INTER mode. Merge method 1) If there exists at least one INTRA type among the four MBs then pass it as INTRA; pass as INTER type if there is no INTRA MB and at least one INTER MB; pass as SKIP if all MBs are of the SKIP type. 2) Re-evaluate the MB types in the encoder.

Homogeneous video transcoding Temporal Resolution Reduction Reduction in frame rate may save bits that can be used in the remaining frames to maintain acceptable overall picture quality With dropped frames, the incoming MVs are not valid because they point to the frames that do not exist in the transcoded bit-stream.

Homogeneous video transcoding Temporal Resolution Reduction Bilinear Interpolation: Forward Dominant Vector Selection Telescopic Vector Composition accumulates all the MVs of the corresponding macro blocks of the dropped frames and add each resultant composed MV to its correspondence in the current frame. Activity-Dominant Vector Selection Utilizes the activity of the macro block to decide the choice of the MV. The activity is represented by counting the number of nonzero quantized DCT coefficients or other statistics. These quantities are proportional to the spatial-activity measurement. The higher the activity of the macro block, the more significant will be the motion of the macro block. The computation for counting the nonzero coefficients is very little.

Homogeneous video transcoding Transcoding Between Multiple and Single Layers

Heterogeneous video transcoding Main Issues syntax conversion module, and may change the picture type, picture resolution, directionality of MVs, and picture rate.

Heterogeneous video transcoding Generic Heterogeneous Transcoder Syntax conversion (SC) is needed to convert the syntax of source video to that of the target video.

Related research issues Rate Control in Transcoding Rate control for transcoding a pre-compressed bit stream may exploit certain information extracted from compressed bit streams to assist in bit-rate regulation. This information can be motion estimation, input bit rate/output bit rate, INTRA/INTER mode decision, and picture complexity A Lagrangia-based rate-distortion optimization technique has been exploited for bit allocation during transcoding, but it is suitable for nonreal-time application due to high computational complexity.

Related research issues Error-Resilient Transcoding for Video Over Wireless Channel An error-resilient transcoder can improve video quality in the presence of errors while maintaining the input bit rate over wireless channels. Reyes describe a method to maintain quality for video transcoding for wireless channels which is based on analytical models that characterize how corruption propagates in a video and subjected to bit errors. Dogan used adaptive intra refresh and feedback control signaling methods to improve the error resilience of compressed video in the transcoding operation.

Related research issues Logo Insertion Scheme in Video Transcoding Logo insertion to provide pret Issue: lower the effect of logo Object-Based Transcoding Object importance can be taken into account Transcoding to H.264 H.264 is different from the previous video compression standards. The syntax and the algorithms used in H.264 are so different that transcoding from traditional DCT-based standards to H.264 will face many difficulties, especially to perform transcoding in the frequency domain. 4X4 Integer transformation V.S. 8X8 DCT Prediction block structures and MV prediction

Conclusion Video transcoding is a core technology for providing universal multimedia access by the users with different access links and devices. This paper reviewed several existing video transcoding techniques which provide trade off between the computational complexity and reconstructed video quality. SDTAs provide the best video quality but with more complexity FDTAs provide a bit lower quality but with lower complexity. HDTAs take advantages of both architectures to provide a trade off Object-based transcoding architectures and techniques offer important research directions. H.264 are very different from that of in the previous traditional video compression standards. H.264 related transcoding would become a more challenge issue in the future research of video transcoding.