Philipp Merkle, Aljoscha Smolic Karsten Müller, Thomas Wiegand CSVT 2007.

Slides:



Advertisements
Similar presentations
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Advertisements

KIANOOSH MOKHTARIAN SCHOOL OF COMPUTING SCIENCE SIMON FRASER UNIVERSITY 6/24/2007 Overview of the Scalable Video Coding Extension of the H.264/AVC Standard.
H.264 Intra Frame Coder System Design Özgür Taşdizen Microelectronics Program at Sabanci University 4/8/2005.
Basics of MPEG Picture sizes: up to 4095 x 4095 Most algorithms are for the CCIR 601 format for video frames Y-Cb-Cr color space NTSC: 525 lines per frame.
-1/20- MPEG 4, H.264 Compression Standards Presented by Dukhyun Chang
1 Video Coding Concept Kai-Chao Yang. 2 Video Sequence and Picture Video sequence Large amount of temporal redundancy Intra Picture/VOP/Slice (I-Picture)
Communication & Multimedia C. -Y. Tsai 2006/4/20 1 Multiview Video Compression Student: Chia-Yang Tsai Advisor: Prof. Hsueh-Ming Hang Institute of Electronics,
1 Adaptive slice-level parallelism for H.264/AVC encoding using pre macroblock mode selection Bongsoo Jung, Byeungwoo Jeon Journal of Visual Communication.
{ Fast Disparity Estimation Using Spatio- temporal Correlation of Disparity Field for Multiview Video Coding Wei Zhu, Xiang Tian, Fan Zhou and Yaowu Chen.
Limin Liu, Member, IEEE Zhen Li, Member, IEEE Edward J. Delp, Fellow, IEEE CSVT 2009.
Light Field Compression Using 2-D Warping and Block Matching Shinjini Kundu Anand Kamat Tarcar EE398A Final Project 1 EE398A - Compression of Light Fields.
Fast Mode Decision for Multiview Video Coding Liquan Shen, Tao Yan, Zhi Liu, Zhaoyang Zhang, Ping An, Lei Yang ICIP
A Fast and Efficient Multi-View Depth Image Coding Method Based on Temporal and Inter- View Correlations of Texture Images Jin Yong Lee Ho Chen Wey Du.
CMPT-884 Jan 18, 2010 Error Concealment Presented by: Cameron Harvey CMPT 820 October
SCHOOL OF COMPUTING SCIENCE SIMON FRASER UNIVERSITY CMPT 820 : Error Mitigation Schaar and Chou, Multimedia over IP and Wireless Networks: Compression,
Evaluation of Data-Parallel Splitting Approaches for H.264 Decoding
Reji Mathew and David S. Taubman CSVT  Introduction  Quad-tree representation  Quad-tree motion modeling  Motion vector prediction strategies.
Wei Zhu, Xiang Tian, Fan Zhou and Yaowu Chen IEEE TCE, 2010.
Overview of the Scalable Video Coding Extension of the H
Overview of Error Resiliency Schemes in H.264/AVC Standard Sunil Kumar, Liyang Xu, Mrinal K. Mandal, and Sethuraman Panchanathan Elsevier Journal of Visual.
An Error-Resilient GOP Structure for Robust Video Transmission Tao Fang, Lap-Pui Chau Electrical and Electronic Engineering, Nanyan Techonological University.
Efficient Motion Vector Recovery Algorithm for H.264 Based on a Polynomial Model Jinghong Zheng and Lap-Pui Chau IEEE TRANSACTIONS ON MULTIMEDIA, June.
1 Single Reference Frame Multiple Current Macroblocks Scheme for Multiple Reference IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY Tung-Chien.
Scalable Wavelet Video Coding Using Aliasing- Reduced Hierarchical Motion Compensation Xuguang Yang, Member, IEEE, and Kannan Ramchandran, Member, IEEE.
Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech.
Motion Vector Refinement for High-Performance Transcoding Jeongnam Youn, Ming-Ting Sun, Fellow,IEEE, Chia-Wen Lin IEEE TRANSACTIONS ON MULTIMEDIA, MARCH.
Efficient Fine Granularity Scalability Using Adaptive Leaky Factor Yunlong Gao and Lap-Pui Chau, Senior Member, IEEE IEEE TRANSACTIONS ON BROADCASTING,
Error Resilience in a Generic Compressed Video Stream Transmitted over a Wireless Channel Muhammad Bilal
Block Partitioning Structure in the HEVC Standard
H.264/AVC for Wireless Applications Thomas Stockhammer, and Thomas Wiegand Institute for Communications Engineering, Munich University of Technology, Germany.
Xinqiao LiuRate constrained conditional replenishment1 Rate-Constrained Conditional Replenishment with Adaptive Change Detection Xinqiao Liu December 8,
Error Resilience of Video Transmission By Rate-Distortion Optimization and Adaptive Packetization Yuxin Liu, Paul Salama and Edwad Delp ICME 2002.
HARDEEPSINH JADEJA UTA ID: What is Transcoding The operation of converting video in one format to another format. It is the ability to take.
1. 1. Problem Statement 2. Overview of H.264/AVC Scalable Extension I. Temporal Scalability II. Spatial Scalability III. Complexity Reduction 3. Previous.
MPEG-2 Digital Video Coding Standard
3D EXTENSION of HEVC: Multi-View plus Depth Parashar Nayana Karunakar Student Id: Department of Electrical Engineering.
3D EXTENSION of HEVC: Multi-View plus Depth Parashar Nayana Karunakar Student Id: Department of Electrical Engineering.
Kai-Chao Yang Hierarchical Prediction Structures in H.264/AVC.
MPEG-2 Standard By Rigoberto Fernandez. MPEG Standards MPEG (Moving Pictures Experts Group) is a group of people that meet under ISO (International Standards.
Overview of the Stereo and Multiview Video Coding Extensions of the H
Video in future 不屈号的航海长 July, 2009
Frame by Frame Bit Allocation for Motion-Compensated Video Michael Ringenburg May 9, 2003.
Video Coding. Introduction Video Coding The objective of video coding is to compress moving images. The MPEG (Moving Picture Experts Group) and H.26X.
MPEG Motion Picture Expert Group Moving Picture Encoded Group Prateek raj gautam(725/09)
1 Efficient Reference Frame Selector for H.264 Tien-Ying Kuo, Hsin-Ju Lu IEEE CSVT 2008.
Adaptive Multi-path Prediction for Error Resilient H.264 Coding Xiaosong Zhou, C.-C. Jay Kuo University of Southern California Multimedia Signal Processing.
Compression video overview 演講者:林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.
- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison between H.264.
Compression of Real-Time Cardiac MRI Video Sequences EE 368B Final Project December 8, 2000 Neal K. Bangerter and Julie C. Sabataitis.
Guillaume Laroche, Joel Jung, Beatrice Pesquet-Popescu CSVT
Fast motion estimation and mode decision for H.264 video coding in packet loss environment Li Liu, Xinhua Zhuang Computer Science Department, University.
Motion Estimation Multimedia Systems and Standards S2 IF Telkom University.
Hierarchical Systolic Array Design for Full-Search Block Matching Motion Estimation Noam Gur Arie,August 2005.
Outline  Introduction  Observations and analysis  Proposed algorithm  Experimental results 2.
Fine-granular Motion Matching for Inter-view Motion Skip Mode in Multi-view Video Coding Haitao Yanh, Yilin Chang, Junyan Huo CSVT.
Fast disparity motion estimation in MVC based on range prediction Xiao Zhong Xu, Yun He ICIP 2008.
Presenting: Shlomo Ben-Shoshan, Nir Straze Supervisors: Dr. Ofer Hadar, Dr. Evgeny Kaminsky.
Video Compression Video : Sequence of frames Each Frame : 2-D Array of Pixels Video: 3-D data – 2-D Spatial, 1-D Temporal Video has both : – Spatial Redundancy.
CMPT365 Multimedia Systems 1 Media Compression - Video Spring 2015 CMPT 365 Multimedia Systems.
MPEG Video Coding I: MPEG-1 1. Overview  MPEG: Moving Pictures Experts Group, established in 1988 for the development of digital video.  It is appropriately.
Computational Controlled Mode Selection for H.264/AVC June Computational Controlled Mode Selection for H.264/AVC Ariel Kit & Amir Nusboim Supervised.
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Yimin Zhou, Hongyu Wang, Ling Tian and Ce Zhu
Overview of the Scalable Video Coding
Error Concealment In The Pixel Domain And MATLAB commands
Quad-Tree Motion Modeling with Leaf Merging
Fully Scalable Multiview Wavelet Video Coding
Standards Presentation ECE 8873 – Data Compression and Modeling
Bongsoo Jung, Byeungwoo Jeon
Scalable light field coding using weighted binary images
Presentation transcript:

Philipp Merkle, Aljoscha Smolic Karsten Müller, Thomas Wiegand CSVT 2007

Outline Multi-view video coding (MVC) introduction Requirements and test conditions for MVC Prediction structures Experimental results Conclusion 2

MVC Introduction MVC: Multi-view Video Coding Multi-view video (MVV): A system that uses multiple camera views of the same scene is called. Usage: 3DTV, free viewpoint video(FVV), etc. 3

Requirements for MVC Temporal random access View random access Scalability Backward compatibility Quality consistency Parallel processing 4

Temporal and inter-view correlation 5 T T T temporal/inter-view mixed mode Inter-view temporal/inter-view mixed mode Temporal

Temporal and inter-view correlation analysis 6 H.264/AVC encoder was used with the following settings: Motion compensation block size of 16*16 Search range of ±32 pixels Lagrange parameter (λ) of 29.5 denotes the decrease of the average in comparison to temporal prediction only.

Simply including temporal and inter-view prediction modes 7 Temporal and inter-view correlation analysis (cont’d)

Lagrangian cost function Lagrangian cost function: D denotes distortion. R denotes number of bits to transmit all components of the motion vector. For each block in a picture, algorithm chooses MV within a search rage that minimizes. The distortion in the subject macroblock B is calculated by: 8 (1) (2) (3)

1D camera: Ballroom, Exit, Rena, Race1, Uli, (line) Breakdancers (arched) 2D camera: Flamenco2 (cross), AkkoKayo (array) Use 5 to 16 camera views Target high quality TV-type video (640*480 or 1024*768) then limited channel communication- type video. 9 Test data and test conditions

Knowledge – hierarchical B picture, QP cascading Hierarchical B picture, key picture, non-key picture: QP cascading : [1] 10 key picture [1] “Analysis of hierarchical B pictures and MCTF”, ICME 2006, IEEE International Conference on Multimedia and Expo, Toronto, Ontario, Canada, July 2006

Knowledge – DPB size Decoded Picture Buffer (DPB) size is increased to: [2] 11 [2] “Efficient Compression of Multi-view Video Exploiting Inter-view Dependencies Based on H.264/AVC”, ICME 2006, IEEE International Conference on Multimedia and Expo, Toronto, Ontario, Canada, July 2006 Memory-efficient reordering of multi-view input for compression

Two tasks 1. To adapt the multi-view prediction schemes to the specific camera arrangements of the test data sets. 2. To adapt the prediction structures to the random access specification. 12

Prediction structure Simulcast coding structure To allow synchronization and random access, all key pictures are coded in intra mode. 13

Prediction structure (cont’d) The first view is called base view (remains the I frame). 14

Prediction structure (cont’d) Alternative structures of inter-view for key pictures 15 KS_IPPKS_PIPKS_IBP KS_IPP KS_PIP KS_IBP Linear camera arrangement2D Camera array

Prediction structure (cont’d) Inter-view prediction for key and non-key pictures 16 AS_IPP mode

Experimental results – objective evaluation 17 Ballroom test result Average coding gains compared with anchor coding

Experimental results – subjective evaluation Different bit-rates were selected for the different data sets. 18 Ballroom test result Race1 test result

Experimental results – subjective evaluation AS_IBP outperforms the anchors significantly. The gain decreases slightly with higher bit-rates. 19 Average results over all test sequences

Influence of camera density Using Rena sequence, and consisting of 16 linear arranged cameras with a 5 cm distance between two adjacent cameras Repeated for each shifted set of 9 adjacent cameras The structure are applied to every time instance of the MVV sequence without temporal prediction. 20

Results of experiments on camera density Coding gain increases with decreasing camera distance and decreasing reconstruction quality. 21

Results of experiments on camera density (cont’d) Results of average per camera rate relative to the one camera case(→) A larger QP value leads to a larger coding gain 22

Conclusion Resulting multi-view prediction: achieving significant coding gains and being highly flexible. Parallel processing is supported by the presented sequential processing approach. Problems: Large disparities between the different views of multi- view video sequences Illumination and color inconsistencies across views 23