Object Based Video Coding - A Multimedia Communication Perspective Muhammad Hassan Khan 2004-03-0020.

Slides:



Advertisements
Similar presentations
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Advertisements

Segmentation from Examples By: A’laa Kryeem Lecturer: Hagit Hel-Or.
INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS, ICT '09. TAREK OUNI WALID AYEDI MOHAMED ABID NATIONAL ENGINEERING SCHOOL OF SFAX New Low Complexity.
A Performance Analysis of the ITU-T Draft H.26L Video Coding Standard Anthony Joch, Faouzi Kossentini, Panos Nasiopoulos Packetvideo Workshop 2002 Department.
-1/20- MPEG 4, H.264 Compression Standards Presented by Dukhyun Chang
Practical and Scalable Transmission of Segmented Video Sequences to Multiple Players using H.264 Fabian Di Fiore, Panagiotis Issaris Expertise Centre for.
MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing for Interactivity High compression Scalability of audio.
1 Video Coding Concept Kai-Chao Yang. 2 Video Sequence and Picture Video sequence Large amount of temporal redundancy Intra Picture/VOP/Slice (I-Picture)
Image Segmentation some examples Zhiqiang wang
Instructor: Mircea Nicolescu Lecture 13 CS 485 / 685 Computer Vision.
A presentation by Modupe Omueti For CMPT 820:Multimedia Systems
Direction-Adaptive KLT for Image Compression Vinay Raj Hampapur Wendy Ni Stanford University March 8, 2011.
Rendering with Concentric Mosaics Heung – Yeung Shum and Li – Wei He Presentation By: Jonathan A. Bockelman.
Multimedia for the Web: Creating Digital Excitement Multimedia Element -- Graphics.
CS 376b Introduction to Computer Vision 04 / 08 / 2008 Instructor: Michael Eckmann.
Overview of the H.264/AVC Video Coding Standard
MPEG: A Video Compression Standard for Multimedia Applications Didier Le Gall Communications of the ACM Volume 34, Number 4 Pages 46-58, 1991.
Direct Methods for Visual Scene Reconstruction Paper by Richard Szeliski & Sing Bing Kang Presented by Kristin Branson November 7, 2002.
Problem Sets Problem Set 3 –Distributed Tuesday, 3/18. –Due Thursday, 4/3 Problem Set 4 –Distributed Tuesday, 4/1 –Due Tuesday, 4/15. Probably a total.
Image Segmentation. Introduction The purpose of image segmentation is to partition an image into meaningful regions with respect to a particular application.
MPEG: A Video Compression Standard for Multimedia Applications Didier Le Gall Communications of the ACM Volume 34, Number 4 Pages 46-58, 1991.
Stereo & Iterative Graph-Cuts Alex Rav-Acha Vision Course Hebrew University.
Automatic Key Video Object Plane Selection Using the Shape Information in the MPEG-4 Compressed Domain Berna Erol and Faouzi Kossentini, Senior Member,
Stereo Computation using Iterative Graph-Cuts
1 Error Concealment for Shape in MPEG-4 Object-Based Coding Student: Jieh-Yuean Hsu Advisor: Prof. David W. Lin.
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 11, NOVEMBER 2011 Qian Zhang, King Ngi Ngan Department of Electronic Engineering, the Chinese university.
Real Time Abnormal Motion Detection in Surveillance Video Nahum Kiryati Tammy Riklin Raviv Yan Ivanchenko Shay Rochel Vision and Image Analysis Laboratory.
Manhattan-world Stereo Y. Furukawa, B. Curless, S. M. Seitz, and R. Szeliski 2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.
Image Renaissance Using Discrete Optimization Cédric AllèneNikos Paragios ENPC – CERTIS ESIEE – A²SI ECP - MAS France.
 Coding efficiency/Compression ratio:  The loss of information or distortion measure:
Multimedia Databases (MMDB)
Mutual Information-based Stereo Matching Combined with SIFT Descriptor in Log-chromaticity Color Space Yong Seok Heo, Kyoung Mu Lee, and Sang Uk Lee.
Graph Cut & Energy Minimization
Comparative study of various still image coding techniques. Harish Bhandiwad EE5359 Multimedia Processing.
Object Based Video Coding - A Multimedia Communication Perspective Muhammad Hassan Khan
Graph Cut 韋弘 2010/2/22. Outline Background Graph cut Ford–Fulkerson algorithm Application Extended reading.
Methods of Video Object Segmentation in Compressed Domain Cheng Quan Jia.
Multimedia Elements: Sound, Animation, and Video.
Generalized Hough Transform
Sadaf Ahamed G/4G Cellular Telephony Figure 1.Typical situation on 3G/4G cellular telephony [8]
Video Compression Standards for High Definition Video : A Comparative Study Of H.264, Dirac pro And AVS P2 By Sudeep Gangavati EE5359 Spring 2012, UT Arlington.
Sub pixel motion estimation for Wyner-Ziv side information generation Subrahmanya M V (Under the guidance of Dr. Rao and Dr.Jin-soo Kim)
Course 9 Texture. Definition: Texture is repeating patterns of local variations in image intensity, which is too fine to be distinguished. Texture evokes.
- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison between H.264.
Figure 1.a AVS China encoder [3] Video Bit stream.
A survey of different shape analysis techniques 1 A Survey of Different Shape Analysis Techniques -- Huang Nan.
-BY KUSHAL KUNIGAL UNDER GUIDANCE OF DR. K.R.RAO. SPRING 2011, ELECTRICAL ENGINEERING DEPARTMENT, UNIVERSITY OF TEXAS AT ARLINGTON FPGA Implementation.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
MPEG-4 Systems Introduction & Elementary Stream Management
Gaussian Mixture Models and Expectation-Maximization Algorithm.
EE 5359 Multimedia Project -Shreyanka Subbarayappa
-BY KUSHAL KUNIGAL UNDER GUIDANCE OF DR. K.R.RAO. SPRING 2011, ELECTRICAL ENGINEERING DEPARTMENT, UNIVERSITY OF TEXAS AT ARLINGTON FPGA Implementation.
Porting of Fast Intra Prediction in HM7.0 to HM9.2
Page 11/28/2016 CSE 40373/60373: Multimedia Systems Quantization  F(u, v) represents a DCT coefficient, Q(u, v) is a “quantization matrix” entry, and.
Video Compression and Standards
1 Yu Liu 1, Feng Wu 2 and King Ngi Ngan 1 1 Department of Electronic Engineering, The Chinese University of Hong Kong 2 Microsoft Research Asia, Beijing,
UNDER THE GUIDANCE DR. K. R. RAO SUBMITTED BY SHAHEER AHMED ID : Encoding H.264 by Thread Level Parallelism.
Instructor : Dr. K. R. Rao Presented by : Vigneshwaran Sivaravindiran
Motion Estimation Multimedia Systems and Standards S2 IF Telkom University.
Photoconsistency constraint C2 q C1 p l = 2 l = 3 Depth labels If this 3D point is visible in both cameras, pixels p and q should have similar intensities.
By: Santosh Kumar Muniyappa ( ) Guided by: Dr. K. R. Rao Final Report Multimedia Processing (EE 5359)
1 Review and Summary We have covered a LOT of material, spending more time and more detail on 2D image segmentation and analysis, but hopefully giving.
E ARLY TERMINATION FOR TZ SEARCH IN HEVC MOTION ESTIMATION PRESENTED BY: Rajath Shivananda ( ) 1 EE 5359 Multimedia Processing Individual Project.
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
IT472: Digital Image Processing
Early termination for tz search in hevc motion estimation
Object tracking in video scenes Object tracking in video scenes
Presented by: Yang Yu Spatiotemporal GMM for Background Subtraction with Superpixel Hierarchy Mingliang Chen, Xing Wei, Qingxiong.
Image Segmentation.
A Block Based MAP Segmentation for Image Compression
Presentation transcript:

Object Based Video Coding - A Multimedia Communication Perspective Muhammad Hassan Khan

Recap … Motivation for Video Coding Today’s Video Coding Problems with today’s video coding Desirable Features Solution to get desirable features  Object Based Video Coding  MPEG-4 Support  Model Based Coding Major Problem: Segmentation Segmentation by Graph Cuts

Overview of Today’s Presentation Details of the Segmentation Process  Segmentation using Graph Cuts  Results What can we do once we have the segmented regions Block-based Vs Parametric Motion Representation Compatibility with MPEG-4

Segmentation using Graph Cuts Lets quickly see what is a graph cut!

Segmentation using Graph Cuts Lets now see what is Max Flow – Min Cut

Segmentation using Graph Cuts How does it relate to segmentation of images?  It is primarily a pixel labeling problem  Consider we want to label a pixel D = Distance Function (Depends on the current pixel) S = Smoothness Function (Depends on the neighborhood)  To be minimized = α D + (1- α) S  α serves as a prior!  Hence graph based segmentation answers the question: What is the best segmentation, given this function?  We still haven’t answered how the two relate…

Segmentation using Graph Cuts Let us construct a simple graph to see how the two (graph cuts and segmentation of images) relate α β D(α) D(β) S

Segmentation using Graph Cuts Start with an initial labeling Find the Min-Cut Adjust the labels Iterate until a good minimization of the function is reached

Results

What can we do once we have the segmented regions? Shape Description  Generalized Hough Transform  R-Table based representation  We need to know a few things Centroid of a shape Texture Model  Not explored in detail yet!

Example for Centroid

Shape Description-Finding Centroid For each boundary point  Find r = (x’, y’) x c = x + x’ y c = y + y’ Φ is the angle which the tangent at (x, y) makes with the x-axis φ x’ y’ (x c, y c ) r x y

Shape Description-Creating R-Table R-Table φ x’ y’ (x c, y c ) r x y

x y Φ=0 Φ=45 Φ=90 Φ=135 Φ=180 Φ=225 Φ=270 Φ=315 Shape Description-Creating R-Table

Encoding The R-Table This can heavily exploit the redundancy between the magnitudes and directions of R- Table entries We might as well go for DPCM  Heaven Knows Benefits  Objects encoded independently and can hence be manipulated independently in the transform domain

Block-based Vs Parametric Motion Representation Block based  Use variable block sizes within the segmented object based on texture model  Use smaller blocks around the boundary pixels Parametric Motion  We know that given that the world is planer, two images taken from a perspective camera of the same scene are related by a projective transformation  We can assume each object to lie in a plane, similar to the concept of VOP, and compute the projective transformation parameters to estimate motion

Compatibility with MPEG-4 Hierarchical Description The scene divided into objects Our Shape/Texture Representation Goes Here

References Gary J. Sullivan, Pankaj Topiwala, Ajay Luthra SPIE Conference on Applications of Digital Image Processing XXVII, Special Session on Advances in the New Emerging Standard: H.264/AVC, August, 2004 Gabriel Antunes, Abrantes, Fernando Pereira, MPEG-4 Facial Animation Technology : Survey, Implementation and Results, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 9, No. 2, March 1999 Roger H Clarke, Image and Video Compression: A Survey Department of Computing and Electrical Engineering, Heriot-Watt University, Riccarton, Edinburgh EH14 4 AS, Scotland. Noel Brady, MPEG-4 Standardized Methods for the Compression of Arbitrarily Shaped Video Objects, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 9, No. 8, December 1999 Boykov, Y.; Veksler, O.; Zabih, R.; Fast approximate energy minimization via graph cuts, Pattern Analysis and Machine Intelligence, IEEE Transactions on Volume 23, Issue 11, Nov Page(s):

References P. Gerken, “Object-based analysis-synthesis coding of image sequences at very low bit rates,” IEEE Circuits System. Video Technology., vol. 4, pp. 228–235, June T. Kaneko and M. Okudaira, “Encoding of arbitrary curves based on the chain code representation,” IEEE Trans. Communications., vol. 33, July P. Nunes, F. Marques, F. Pereira, and A. Gasull, “A contour-based approach to binary shape coding using a multiple grid chain code,” Signal Process. Image Communications., to be published. Moving Picture Experts Group. [Online]. Available www: G. Abrantes and F. Pereira, “Interactive analysis for MPEG-4 facial models configuration,” in EUROGRAPHICS’98–Short Presentations, Lisbon, Portugal, Sept. 1998, pp –1.6.4.