Bernd Girod: Image Compression and Graphics 1 Image Compression and Graphics: More Than a Sum of Parts? Bernd Girod Collaborators: Peter Eisert, Marcus.

Slides:



Advertisements
Similar presentations
INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS, ICT '09. TAREK OUNI WALID AYEDI MOHAMED ABID NATIONAL ENGINEERING SCHOOL OF SFAX New Low Complexity.
Advertisements

Basics of MPEG Picture sizes: up to 4095 x 4095 Most algorithms are for the CCIR 601 format for video frames Y-Cb-Cr color space NTSC: 525 lines per frame.
MASKS © 2004 Invitation to 3D vision Lecture 7 Step-by-Step Model Buidling.
MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing for Interactivity High compression Scalability of audio.
Light Field Compression Using 2-D Warping and Block Matching Shinjini Kundu Anand Kamat Tarcar EE398A Final Project 1 EE398A - Compression of Light Fields.
A new approach for modeling and rendering existing architectural scenes from a sparse set of still photographs Combines both geometry-based and image.
Reinventing Compression: The New Paradigm of Distributed Video Coding Bernd Girod Information Systems Laboratory Stanford University.
1 Static Sprite Generation Prof ︰ David, Lin Student ︰ Jang-Ta, Jiang
MultiHypothesis Pictures For H.26L Markus Flierl Telecommunications Laboratory University of Erlangen-Nuremberg Erlangen, Germany
Real-time Combined 2D+3D Active Appearance Models Jing Xiao, Simon Baker,Iain Matthew, and Takeo Kanade CVPR 2004 Presented by Pat Chan 23/11/2004.
Digital Days 29/6/2001 ISTORAMA: A Content-Based Image Search Engine and Hierarchical Triangulation of 3D Surfaces. Dr. Ioannis Kompatsiaris Centre for.
Wyner-Ziv Coding of Motion Video
Scalable Wavelet Video Coding Using Aliasing- Reduced Hierarchical Motion Compensation Xuguang Yang, Member, IEEE, and Kannan Ramchandran, Member, IEEE.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL An Incremental Weighted Least Squares Approach To Surface Light Fields Greg Coombe Anselmo Lastra.
Stereo and Multiview Sequence Processing. Outline Stereopsis Stereo Imaging Principle Disparity Estimation Intermediate View Synthesis Stereo Sequence.
Wyner-Ziv Residual Coding of Video Anne Aaron, David Varodayan and Bernd Girod Information Systems Laboratory Stanford University.
Investigation of Motion-Compensated Lifted Wavelet Transforms Information Systems Laboratory Department of Electrical Engineering Stanford University Markus.
Linearizing (assuming small (u,v)): Brightness Constancy Equation: The Brightness Constraint Where:),(),(yxJyxII t  Each pixel provides 1 equation in.
Source-Channel Prediction in Error Resilient Video Coding Hua Yang and Kenneth Rose Signal Compression Laboratory ECE Department University of California,
1 Department of Electrical Engineering Stanford University Anne Aaron, Shantanu Rane and Bernd Girod Wyner-Ziv Video Coding with Hash-Based Motion Compensation.
CSCE 641: Computer Graphics Image-based Rendering Jinxiang Chai.
` 1 Department of Electrical Engineering, Stanford University Anne Aaron, Prashant Ramanathan and Bernd Girod Wyner-Ziv Coding of Light Fields for Random.
Optical Flow Digital Photography CSE558, Spring 2003 Richard Szeliski (notes cribbed from P. Anandan)
1 Department of Electrical Engineering, Stanford University Anne Aaron, Shantanu Rane, Eric Setton and Bernd Girod Transform-domain Wyner-Ziv Codec for.
Distributed Video Coding VLBV, Sardinia, September 16, 2005 Bernd Girod Information Systems Laboratory Stanford University.
Jump to first page The research report Block matching algorithm Motion compensation Spatial transformation Xiaomei Yu.
The Story So Far The algorithms presented so far exploit: –Sparse sets of images (some data may not be available) –User help with correspondences (time.
Geometry Videos Symposium on Computer Animation 2003 Hector M. Briceño Collaborators: Pedro V. Sander, Leonard McMillan, Steven Gortler, and Hugues Hoppe.
Face Alignment Using Cascaded Boosted Regression Active Shape Models
CSC 589 Lecture 22 Image Alignment and least square methods Bei Xiao American University April 13.
Philipp Merkle, Aljoscha Smolic Karsten Müller, Thomas Wiegand CSVT 2007.
 Coding efficiency/Compression ratio:  The loss of information or distortion measure:
Frame by Frame Bit Allocation for Motion-Compensated Video Michael Ringenburg May 9, 2003.
Video Coding. Introduction Video Coding The objective of video coding is to compress moving images. The MPEG (Moving Picture Experts Group) and H.26X.
Linearizing (assuming small (u,v)): Brightness Constancy Equation: The Brightness Constraint Where:),(),(yxJyxII t  Each pixel provides 1 equation in.
The Brightness Constraint
Exploitation of 3D Video Technologies Takashi Matsuyama Graduate School of Informatics, Kyoto University 12 th International Conference on Informatics.
Introduction Compression Performance Conclusions Large Camera Arrays Capture multi-viewpoint images of a scene/object. Potential applications abound: surveillance,
High-Resolution Interactive Panoramas with MPEG-4 발표자 : 김영백 임베디드시스템연구실.
Videos Mei-Chen Yeh. Outline Video representation Basic video compression concepts – Motion estimation and compensation Some slides are modified from.
Image Processing and Computer Vision: 91. Image and Video Coding Compressing data to a smaller volume without losing (too much) information.
Codec structuretMyn1 Codec structure In an MPEG system, the DCT and motion- compensated interframe prediction are combined. The coder subtracts the motion-compensated.
Image Compression Supervised By: Mr.Nael Alian Student: Anwaar Ahmed Abu-AlQomboz ID: IT College “Multimedia”
3D Geometry Coding using Mixture Models and the Estimation Quantization Algorithm Sridhar Lavu Masters Defense Electrical & Computer Engineering DSP GroupRice.
Rate-distortion Optimized Mode Selection Based on Multi-channel Realizations Markus Gärtner Davide Bertozzi Classroom Presentation 13 th March 2001.
Compression of Real-Time Cardiac MRI Video Sequences EE 368B Final Project December 8, 2000 Neal K. Bangerter and Julie C. Sabataitis.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
Wyner-Ziv Coding of Motion Video Presented by fakewen.
Lecture 6 Rasterisation, Antialiasing, Texture Mapping,
JPEG Image Compression Standard Introduction Lossless and Lossy Coding Schemes JPEG Standard Details Summary.
Page 11/28/2016 CSE 40373/60373: Multimedia Systems Quantization  F(u, v) represents a DCT coefficient, Q(u, v) is a “quantization matrix” entry, and.
Block-based coding Multimedia Systems and Standards S2 IF Telkom University.
Video Compression and Standards
Linearizing (assuming small (u,v)): Brightness Constancy Equation: The Brightness Constraint Where:),(),(yxJyxII t  Each pixel provides 1 equation in.
1 Yu Liu 1, Feng Wu 2 and King Ngi Ngan 1 1 Department of Electronic Engineering, The Chinese University of Hong Kong 2 Microsoft Research Asia, Beijing,
Dual Representations for Light Field Compression EE368C Project January 30, 2001 Peter Chou Prashant Ramanathan.
CASA 2006 CASA 2006 A Skinning Approach for Dynamic Mesh Compression Khaled Mamou Titus Zaharia Françoise Prêteux.
EE591f Digital Video Processing
Motion Estimation Multimedia Systems and Standards S2 IF Telkom University.
1 Department of Electrical Engineering, Stanford University Anne Aaron, Shantanu Rane, Rui Zhang and Bernd Girod Wyner-Ziv Coding for Video: Applications.
Eigen Texture Method : Appearance compression based method Surface Light Fields for 3D photography Presented by Youngihn Kho.
1 Department of Electrical Engineering, Stanford University EE 392J Final Project Presentation Shantanu Rane Hash-Aided Motion Estimation & Rate Control.
MOTION Model. Road Map Motion Model Non Parametric Motion Field : Algorithms 1.Optical flow field estimation. 2.Block based motion estimation. 3.Pel –recursive.
Compression of 3D images using disparity based coder
The Brightness Constraint
The Brightness Constraint
The Brightness Constraint
Standards Presentation ECE 8873 – Data Compression and Modeling
Multi-Hypothesis Disparity-Compensated Light Field Compression
Presentation transcript:

Bernd Girod: Image Compression and Graphics 1 Image Compression and Graphics: More Than a Sum of Parts? Bernd Girod Collaborators: Peter Eisert, Marcus Magnor, Prashant Ramanathan, Eckehard Steinbach (all Stanford), Thomas Wiegand (HHI) Image, Video, and Multimedia Systems Group Information Systems Laboratory Stanford University

Bernd Girod: Image Compression and Graphics triangles Can 3-D Geometry Help to Compress Images? Conjecture: 3-d geometry models help compression, if a single 3-D model captures the dependencies between many views (or frames of a sequence). Conjecture: 3-d geometry models help compression, if a single 3-D model captures the dependencies between many views (or frames of a sequence).

Bernd Girod: Image Compression and Graphics 3 Outline of this Talk Compression of many simultaneous views (e.g. light-fields) –Encoding view-dependent texture maps with 4-d wavelets –Hierarchical image-domain light-field coder –Why image-domain encoding is (usually) superior to texture-map encoding Model-based compression of talking head sequences –Modeling and estimation of facial expressions –Avatars –Incorporate synthetic video into motion-compensated hybrid coding

Bernd Girod: Image Compression and Graphics 4 Multi-View Image Capture Coding schemes suitable for 2-plane parametrization Hemispherical image arrangement (arbitrary recording positions)

Bernd Girod: Image Compression and Graphics 5 Align Views by Mapping onto Object Surface Camera views: No correlation between corresponding pixels View- dependent texture map: Strong correlation between corresponding texels

Bernd Girod: Image Compression and Graphics 6 3-D Reconstruction from Many Views  processes all views simultaneously  exploits texture and silhouette information  yields solid 3-D voxel model Volumetric Reconstruction Subdivide object’s bounding box into voxels Generation of multiple hypotheses for each voxel Hypothesis elimination by projecting visible voxels into light-field images Iterate over all voxels until remaining hypotheses are “photo-consistent”

Bernd Girod: Image Compression and Graphics 7 Surface Representation Initial octahedral geometry Geometry refinement –determine vertex normals –move vertices to model surface –subdivide triangles 128 triangles512 triangles2048 triangles8192 triangles voxel model

Bernd Girod: Image Compression and Graphics 8 Warp of each image into a view-dependent texture map Texture map correlated in 4-d Interpolate missing texels Texture Map Encoding with 4-d Wavelets 4D Haar Wavelet Transform Embedded encoding of wavelet coefficients (4D-SPIHT) Arrange images into 2-d array

Bernd Girod: Image Compression and Graphics 9 Results: Wavelet Texture Map Encoder Reconstruction quality in luminance PSNR (dB)

Bernd Girod: Image Compression and Graphics 10 Results: Wavelet Texture Map Encoder

Bernd Girod: Image Compression and Graphics dB bpp 26.3 Kbytes Progressive Decoding 36.6 dB bpp 736 Kbytes

Bernd Girod: Image Compression and Graphics 12 Align Views by Model-aided Prediction ? Given: geometry model, reference images Render geometry for reference images and prediction image For each pixel: determine triangle, coordinates Find corresponding pixels in reference images Copy & average visible pixels

Bernd Girod: Image Compression and Graphics 13 Hierarchical Image Coding Order project camera positions on hemisphere subdivide into 4 quadrants INTRA-encode corner images encode center image - image prediction - residual error coding encode mid-side images subdivide into sub-quadrants encode center and mid-side images subdivide repeatedly

Bernd Girod: Image Compression and Graphics 14 Model-aided Image-Domain Light-Field Coder DCT Coefficients Light-Field Image I[u,v] Disparity Map Generation Multiframe Disparity Compensation Residual-Error DCT Coder Residual-Error Decoder Image Buffer Compressed Geometry Model 3-D Geometry Reconstruction Geometry Coder Geometry Decoder -

Bernd Girod: Image Compression and Graphics 15 Picture Quality original Mouse light field 257 RGB images, 384x288 pixels 81.3 Mbytes compressed 300: bpp (267 KBytes) 37.9 dB PSNR

Bernd Girod: Image Compression and Graphics 16 Model-aided vs. Texture Coding Model-aided Texture

Bernd Girod: Image Compression and Graphics 17 Natural vs. Synthetic Image Set 40 % 70 %

Bernd Girod: Image Compression and Graphics 18 2 dB 7 dB 9 dB Inaccurate Geometry

Bernd Girod: Image Compression and Graphics 19 Model-based videophone

Bernd Girod: Image Compression and Graphics 20 Modeling of Facial Expressions Head geometry composed of 101 triangular B-spline patches Facial expressions by superposition of 66 FAPs (Facial Animation Parameters) according to MPEG-4 standard FAPs act on control points of triangular B-spline patches

Bernd Girod: Image Compression and Graphics 21 Estimation of Facial Expressions Displacement field constrained by FAPs Linearize for small FAPs Optical flow constraint equation Solve overdetermined system by linear regression Apply iteratively in analysis-synthesis loop Incorporate spatial resolution pyramid

Bernd Girod: Image Compression and Graphics 22 Results: Peter Sequence: Peter, 230 frames, CIF resolution, 25 fps OriginalSynthesized Compressed 25,000:1 1.2 kbps dB PSNR

Bernd Girod: Image Compression and Graphics 23 Results: Eckehard Sequence: Eckehard CIF resolution, 25 fps OriginalSynthesized 1.1 kbps, 32.6 dB PSNR

Bernd Girod: Image Compression and Graphics 24 Results: Peter as Eckehard Sequence: Peter, 230 frames, CIF resolution, 25 fps OriginalSynthesized

Bernd Girod: Image Compression and Graphics 25 Results: Eckehard as Peter Sequence: Eckehard CIF resolution, 25 fps OriginalSynthesized

Bernd Girod: Image Compression and Graphics 26 Results: Peter as Akiyo Sequence: Peter, 230 frames, CIF resolution, 25 fps OriginalSynthesized

Bernd Girod: Image Compression and Graphics But, What About Unknown Objects? Sequence: Clap OriginalSynthesized 1.2 kbps

Bernd Girod: Image Compression and Graphics 28 Model-Aided Coding: Incorporating Synthetic Video into MC Hybrid Coding Intraframe DCT Coder Intraframe Decoder Multiframe Motion Compensation - Decoder e Coder Control Control data (incl. motion vectors) DCT coefficients Input Video Model-based Decoder FAPs Model- based Coder

Bernd Girod: Image Compression and Graphics 29 R-D-Optimal Mode Decision Synthesized frame Previous decoded frame Selection Mask minimizing D+ R Predicted frame

Bernd Girod: Image Compression and Graphics 30 Results: Peter Sequence: Clap, 8.33 fps, CIF resolution H kbpsModel-Aided 12 kbps

Bernd Girod: Image Compression and Graphics 31 Results: Akiyo Sequence: Akiyo, 10 fps, CIF resolution H kbpsModel-Aided 10 kbps

Bernd Girod: Image Compression and Graphics 32 R-D Performance of Model-Aided Coder Sequence: PeterSequence: Akiyo ~ 40% ~ 35%

Bernd Girod: Image Compression and Graphics 33 Can 3-d geometry help to compress images? YES IF many views of the same 3-D object/scene shall be compressed. Applications in –Multiview image coding (light-field compression) –Compression of video sequences –Very high compression ratios (100: ,000:1) Require accurate vision algorithms for 3-d reconstruction Image-domain compression more resilient against inaccurate geometry and hence more practical than texture-map encoding Conclusion

Bernd Girod: Image Compression and Graphics THE END