Presented by Yehuda Dar. Advanced Topics in Computer Vision (048921), Winter 2011-2012.

Presentation transcript:

Video Compression Basics. Fundamental tradeoff among: bit-rate, distortion, computational complexity.

Video Compression Basics. Utilized redundancies: spatial, temporal, psycho-visual, statistical.

H.264 Overview

H.264 Redundancy Utilization
Redundancy | Utilization | Means
Spatial | High | Transform coding; intra coding (spatial prediction)
Temporal | High | Motion estimation & compensation
Psycho-visual | Medium | YCbCr color space; 4:2:0 sampling; DC / AC coefficients quantization
Statistical | High | Entropy coding
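The psycho-visual row can be illustrated with a minimal sketch of the YCbCr conversion followed by 4:2:0 chroma subsampling. The full-range BT.601 coefficients (the JFIF variant) and the 2x2 averaging filter are assumptions made for illustration; an actual H.264 pipeline may use a different range, matrix, or downsampling filter. The function names are hypothetical.

```python
def rgb_to_ycbcr(r, g, b):
    """Full-range BT.601 conversion (JFIF variant, an assumption); inputs are 0..255."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = 128 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128 + 0.5 * r - 0.418688 * g - 0.081312 * b
    return y, cb, cr


def subsample_420(plane):
    """Average each 2x2 block of a chroma plane (even dimensions assumed)."""
    return [
        [
            (plane[r][c] + plane[r][c + 1] + plane[r + 1][c] + plane[r + 1][c + 1]) / 4
            for c in range(0, len(plane[0]), 2)
        ]
        for r in range(0, len(plane), 2)
    ]


def to_ycbcr_420(rgb_image):
    """rgb_image: rows of (r, g, b) tuples. Returns full-size Y and quarter-size Cb, Cr."""
    converted = [[rgb_to_ycbcr(*px) for px in row] for row in rgb_image]
    y = [[px[0] for px in row] for row in converted]
    cb = subsample_420([[px[1] for px in row] for row in converted])
    cr = subsample_420([[px[2] for px in row] for row in converted])
    return y, cb, cr


# A 4x4 mid-gray image: 48 samples in RGB, fewer after 4:2:0 subsampling.
image = [[(128, 128, 128)] * 4 for _ in range(4)]
y, cb, cr = to_ycbcr_420(image)
samples_444 = 3 * 4 * 4
samples_420 = len(y) * len(y[0]) + 2 * len(cb) * len(cb[0])
print(samples_420, "of", samples_444, "samples kept")
```

Keeping luma at full resolution while quartering each chroma plane halves the sample count before any transform or entropy coding is applied.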

Compression using Computer Vision. Motivation: better utilization of the psycho-visual redundancy; application-specific compression methods; exploring new approaches.

A Review of: A Scheme for Attentional Video Compression. R. Gupta and S. Chaudhury, PAMI 2011.

Method Outline: salient region detection; foveated video coding; integration into H.264. Foveated image coding demonstration: figure from Guo & Zhang, Trans. Image Process., 2010.

Saliency Map Step 1: Creating a 3D Feature Map
Feature type | Calculation method | Based on
Global | Color spatial variance | Liu et al, CVPR 2007
Local | Center-surround multi-scale ratio of dissimilarity | Huang et al, ICPR 2010
Rarity | Pulse-DCT | Yu et al, ICDL 2009

Relevance Vector Machine (RVM). Used here as a binary classifier. Advantages over the support vector machine (SVM): provides posterior probabilities; better generalization ability; faster decisions.

Saliency Map Step 2: Unify Features using RVM. Training procedure for MBs: the global, local, and rarity feature values are averaged over each MB to form the RVM sample; the ground-truth pixels of the MB are counted to give the sample label, 'salient' or 'non salient'.
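One possible reading of this training procedure, as a hedged sketch: each 16x16 MB contributes one three-element sample (the mean of each feature map over the MB) and one binary label derived from the count of ground-truth salient pixels. The majority threshold `min_fraction` and all function names are assumptions, not taken from the paper, and the RVM training itself is left out.

```python
MB = 16  # macroblock size in pixels


def mb_mean(plane, mb_row, mb_col):
    """Mean of a 2D plane over one macroblock."""
    rows = plane[mb_row * MB:(mb_row + 1) * MB]
    values = [v for row in rows for v in row[mb_col * MB:(mb_col + 1) * MB]]
    return sum(values) / len(values)


def build_training_set(global_map, local_map, rarity_map, ground_truth, min_fraction=0.5):
    """Return (samples, labels) with one entry per macroblock.

    ground_truth holds 0/1 per pixel; min_fraction is a hypothetical threshold
    on the share of salient pixels needed to label the MB as salient.
    """
    samples, labels = [], []
    for mb_row in range(len(ground_truth) // MB):
        for mb_col in range(len(ground_truth[0]) // MB):
            samples.append(
                [mb_mean(m, mb_row, mb_col) for m in (global_map, local_map, rarity_map)]
            )
            salient_fraction = mb_mean(ground_truth, mb_row, mb_col)
            labels.append(1 if salient_fraction >= min_fraction else 0)
    return samples, labels


def flat(value, height=16, width=32):
    """Constant-valued plane, used only for the example below."""
    return [[value] * width for _ in range(height)]


# Two macroblocks side by side: the left one is fully salient in the ground truth.
gt = [[1] * 16 + [0] * 16 for _ in range(16)]
samples, labels = build_training_set(flat(0.2), flat(0.5), flat(0.9), gt)
print(labels)
```

The resulting samples and labels would then be passed to whichever RVM implementation is available.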

Saliency Map Step 2: Unify Features using RVM. Trained RVM usage: a new input is fed to the RVM, which outputs a binary label ('salient' or 'non salient') and a probability that serves as the relative saliency.

Saliency Map: Result Comparison. Panels: input; global; local [Huang et al, ICPR 2010]; rarity [Yu et al, ICDL 2009]; proposed; [Harel et al, NIPS 2006]; [Bruce & Tsotsos, NIPS 2006]. Figures from Gupta & Chaudhury, PAMI 2011.

Saliency Map: ROC Curve. Curves: proposed; [Harel et al, NIPS 2006]. Figure from Gupta & Chaudhury, PAMI 2011.
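For reference, an ROC curve of this kind can be produced by thresholding a saliency map against a binary ground truth. The sketch below is a generic illustration, not the evaluation code behind the figure; the function names and the threshold list are hypothetical, and both classes are assumed to be present in the ground truth.

```python
def roc_points(saliency, ground_truth, thresholds):
    """Return (false positive rate, true positive rate) pairs, one per threshold.

    saliency: flat list of scores; ground_truth: flat list of 0/1 labels.
    """
    positives = sum(ground_truth)
    negatives = len(ground_truth) - positives
    points = []
    for t in thresholds:
        tp = sum(1 for s, g in zip(saliency, ground_truth) if s >= t and g == 1)
        fp = sum(1 for s, g in zip(saliency, ground_truth) if s >= t and g == 0)
        points.append((fp / negatives, tp / positives))
    return points


def area_under_curve(points):
    """Trapezoidal area under the ROC points."""
    pts = sorted(points)
    return sum((x1 - x0) * (y0 + y1) / 2 for (x0, y0), (x1, y1) in zip(pts, pts[1:]))


# A saliency map that ranks both salient pixels above both background pixels.
scores = [0.9, 0.8, 0.3, 0.1]
truth = [1, 1, 0, 0]
points = roc_points(scores, truth, thresholds=[0.0, 0.5, 1.1])
print(points, area_under_curve(points))
```

A larger area under the curve means the saliency scores separate salient from non-salient pixels better across all thresholds.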

Integration Into H.264: Calculation of Saliency Values. The saliency map is recalculated only when it changes significantly. Mutual information between successive frames indicates changes in saliency. Figures from Gupta & Chaudhury, PAMI 2011.
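A minimal sketch of the mutual-information test, assuming 8-bit luma frames given as 2D lists. The number of histogram bins, the decision threshold, and the rule "recalculate when the mutual information drops below the threshold" are illustrative assumptions; the exact criterion used in the paper is not reproduced in this transcript.

```python
from collections import Counter
from math import log2


def mutual_information(frame_a, frame_b, levels=8):
    """Mutual information in bits between two equally sized frames of 0..255 values.

    Pixel values are quantized into `levels` bins before the histograms are built.
    """
    a = [v * levels // 256 for row in frame_a for v in row]
    b = [v * levels // 256 for row in frame_b for v in row]
    n = len(a)
    joint = Counter(zip(a, b))
    count_a, count_b = Counter(a), Counter(b)
    # p(x, y) * log2(p(x, y) / (p(x) * p(y))), written with raw counts
    return sum(
        (count / n) * log2(count * n / (count_a[x] * count_b[y]))
        for (x, y), count in joint.items()
    )


def saliency_needs_update(prev_frame, frame, threshold):
    """Hypothetical rule: low mutual information suggests the scene has changed."""
    return mutual_information(prev_frame, frame) < threshold


striped = [[0, 255], [0, 255]]
constant = [[0, 0], [0, 0]]
print(mutual_information(striped, striped))
print(mutual_information(striped, constant))
print(saliency_needs_update(striped, constant, threshold=0.5))
```

Identical frames give the highest value (the frame entropy), while a frame that carries no information about its predecessor gives zero.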

Integration Into H.264: Propagation of Saliency Values. For inter-coded MBs, the saliency value is a weighted average of the saliency values of the MBs pointed to by the motion vector. Figures from Gupta & Chaudhury, PAMI 2011.
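The weighted average can be sketched as follows, assuming the weights are the overlap areas between the motion-compensated 16x16 block and the MBs it covers in the reference frame. The area weighting, the integer-pixel motion vectors, and the function name are assumptions for illustration; H.264 motion vectors have quarter-pixel precision.

```python
MB = 16  # macroblock size in pixels


def propagate_saliency(ref_saliency, mb_row, mb_col, mv_x, mv_y):
    """Saliency of an inter-coded MB from the reference frame's per-MB saliency grid.

    ref_saliency: 2D list with one saliency value per MB of the reference frame.
    (mv_x, mv_y): motion vector in whole pixels (an assumption).
    """
    rows, cols = len(ref_saliency), len(ref_saliency[0])
    x0 = mb_col * MB + mv_x  # top-left corner of the referenced block
    y0 = mb_row * MB + mv_y
    weighted, total_area = 0.0, 0
    # The referenced block overlaps at most a 2x2 group of reference MBs.
    for r in {y0 // MB, (y0 + MB - 1) // MB}:
        for c in {x0 // MB, (x0 + MB - 1) // MB}:
            if 0 <= r < rows and 0 <= c < cols:
                overlap_w = min(x0 + MB, (c + 1) * MB) - max(x0, c * MB)
                overlap_h = min(y0 + MB, (r + 1) * MB) - max(y0, r * MB)
                area = overlap_w * overlap_h
                weighted += area * ref_saliency[r][c]
                total_area += area
    return weighted / total_area if total_area else 0.0


ref = [[0.2, 0.8], [0.4, 0.6]]
print(propagate_saliency(ref, 0, 0, 0, 0))  # block lies exactly on MB (0, 0)
print(propagate_saliency(ref, 0, 0, 8, 0))  # block straddles MBs (0, 0) and (0, 1)
```

Parts of the referenced block that fall outside the frame are ignored, so the weights are renormalized over the area that remains.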

Integration Into H.264: Salient-Adaptive Quantization. Non-uniform bit allocation: smaller saliency value => coarser quantization.
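One simple way to realize this rule is a linear mapping from a saliency value in [0, 1] to the H.264 quantization parameter (QP), which ranges from 0 to 51, with a larger QP meaning coarser quantization. The linear form and the end points `qp_salient` and `qp_background` are hypothetical choices, not the mapping used in the paper.

```python
def saliency_to_qp(saliency, qp_salient=22, qp_background=40):
    """Map a saliency value in [0, 1] to an H.264 QP (0..51).

    Higher saliency gives a lower QP (finer quantization); the two end points
    are illustrative defaults.
    """
    s = min(max(saliency, 0.0), 1.0)
    qp = round(qp_background - s * (qp_background - qp_salient))
    return min(max(qp, 0), 51)


for s in (0.0, 0.5, 1.0):
    print(s, saliency_to_qp(s))
```

Because the H.264 quantizer step size roughly doubles for every increase of 6 in QP, even a modest QP gap between salient and non-salient MBs shifts a substantial share of the bits toward the salient regions.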

Integration Into H.264. Figure from Gupta & Chaudhury, PAMI 2011.

Paper Evaluation. Novelty: methods for the saliency map and for saliency value propagation. Assumption: all the MBs in P-frames are inter-coded (problematic). Writing level: good; partially self-contained.

Paper Evaluation. Feasibility: higher complexity than H.264 encoders; not for real-time encoders; useful at low bit-rates; objects entering the scene may be considered unimportant. Experimental evaluation: saliency: visual comparison is good, ROC curve comparison is partial; compression: none (authors' future direction).

Future Directions. Improving encoding complexity: a less complex saliency method. Better treatment of objects entering the scene: using mutual information of frame areas. Treating intra-coded MBs in P-frames.

A Review of: 3D Models Coding and Morphing for Efficient Video Compression. F. Galpin, R. Balter, L. Morin, K. Deguchi, CVPR 2004.

Method Outline: 3D model extraction; 3D model-based video coding; reconstruction using adaptive geometric morphing.

3D Models Stream Generation. Figure from Galpin et al, CVPR 2004.

Stream Compression. Three data types to compress: 3D model, texture images, camera parameters.

Texture Image Compression. Reconstruction process: figure from Galpin et al, CVPR 2004.

3D Model Compression. The 3D model originates in a decimated depth map. Compressed by: wavelet transform; depth-adaptive quantization. Figures from Galpin et al, CVPR 2004.

Video Reconstruction: Texture Fading. Figure from Galpin et al, CVPR 2004.

Video Reconstruction: Texture Fading. Panels: without texture fading; with texture fading. Figures from Galpin et al, CVPR 2004.

Video Reconstruction: Geometric Morphing. Improving 3D model interpolation. Figure from Galpin et al, CVPR 2004.

Video Reconstruction: Geometric Morphing. Panels: regular interpolation; interpolation with geometric morphing. Figures from Galpin et al, CVPR 2004.

Result Comparison with H.264

Paper Evaluation. Novelty: compression using an unknown 3D model. Assumptions: static scene; moving monocular camera; camera rotation is neglected; intrinsic parameters are fixed within a GOP. Writing level: good; not self-contained.

Paper Evaluation. Feasibility: only for static-scene video; high encoder / decoder complexity; unsuitable for real-time use; useful at very low bit-rates. Experimental evaluation: sufficient visual comparison with H.264; no run-time information.

Future Directions. Treat moving objects. Improve complexity, at least for real-time decoding.

Approach Comparison
Criterion | Attention | 3D model
Video type | Any | Static scene
Bit-rates useful at | Low | Very low
Encoder complexity | High | High
Decoder complexity | Regular | High
Integration in H.264 | Possible | Unsuitable
Overall evaluation | Promising | Inferior