Object Based Video Coding - A Multimedia Communication Perspective Muhammad Hassan Khan 2004-03-0020.

Slides:



Advertisements
Similar presentations
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Advertisements

INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS, ICT '09. TAREK OUNI WALID AYEDI MOHAMED ABID NATIONAL ENGINEERING SCHOOL OF SFAX New Low Complexity.
Automatic Video Shot Detection from MPEG Bit Stream Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC.
MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing for Interactivity High compression Scalability of audio.
1 Video Coding Concept Kai-Chao Yang. 2 Video Sequence and Picture Video sequence Large amount of temporal redundancy Intra Picture/VOP/Slice (I-Picture)
VIPER DSPS 1998 Slide 1 A DSP Solution to Error Concealment in Digital Video Eduardo Asbun and Edward J. Delp Video and Image Processing Laboratory (VIPER)
Error detection and concealment for Multimedia Communications Senior Design Fall 06 and Spring 07.
Chapter 7 End-to-End Data
Spring 2003CS 4611 Multimedia Outline Compression RTP Scheduling.
Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.
T.Sharon-A.Frank 1 Multimedia Size of Data Frame.
Decision Trees for Error Concealment in Video Decoding Song Cen and Pamela C. Cosman, Senior Member, IEEE IEEE TRANSACTION ON MULTIMEDIA, VOL. 5, NO. 1,
A Picture is Worth a Thousand Words Milton Chen. What’s a Picture Worth? A thousand words - Descartes ( ) A thousand bytes - modern translation.
Object Based Video Coding - A Multimedia Communication Perspective Muhammad Hassan Khan
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 11, NOVEMBER 2011 Qian Zhang, King Ngi Ngan Department of Electronic Engineering, the Chinese university.
WATERLOO ELECTRICAL AND COMPUTER ENGINEERING 10s: Communications and Information Systems 1 WATERLOO ELECTRICAL AND COMPUTER ENGINEERING 10s Communications.
HARDEEPSINH JADEJA UTA ID: What is Transcoding The operation of converting video in one format to another format. It is the ability to take.
On Error Preserving Encryption Algorithms for Wireless Video Transmission Ali Saman Tosun and Wu-Chi Feng The Ohio State University Department of Computer.
Image Compression - JPEG. Video Compression MPEG –Audio compression Lossy / perceptually lossless / lossless 3 layers Models based on speech generation.
Kai-Chao Yang Hierarchical Prediction Structures in H.264/AVC.
MPEG MPEG-VideoThis deals with the compression of video signals to about 1.5 Mbits/s; MPEG-AudioThis deals with the compression of digital audio signals.
MPEG: (Moving Pictures Expert Group) A Video Compression Standard for Multimedia Applications Seo Yeong Geon Dept. of Computer Science in GNU.
Multimedia Databases (MMDB)
Mean-shift and its application for object tracking
Graph Cut & Energy Minimization
Graph Cut 韋弘 2010/2/22. Outline Background Graph cut Ford–Fulkerson algorithm Application Extended reading.
Interactive Graph Cuts for Optimal Boundary & Region Segmentation of Objects in N-D Images (Fri) Young Ki Baik, Computer Vision Lab.
Methods of Video Object Segmentation in Compressed Domain Cheng Quan Jia.
Multimedia Elements: Sound, Animation, and Video.
1 Mpeg-4 Overview Gerhard Roth. 2 Overview Much more general than all previous mpegs –standard finished in the last two years standardized ways to support:
Data Compression. Compression? Compression refers to the ways in which the amount of data needed to store an image or other file can be reduced. This.
Diploma Project Real Time Motion Estimation on HDTV Video Streams (using the Xilinx FPGA) Supervisor :Averena L.I. Student:Das Samarjit.
Sub pixel motion estimation for Wyner-Ziv side information generation Subrahmanya M V (Under the guidance of Dr. Rao and Dr.Jin-soo Kim)
Compression video overview 演講者:林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.
Rate-distortion Optimized Mode Selection Based on Multi-channel Realizations Markus Gärtner Davide Bertozzi Classroom Presentation 13 th March 2001.
Figure 1.a AVS China encoder [3] Video Bit stream.
Spring 2000CS 4611 Multimedia Outline Compression RTP Scheduling.
-BY KUSHAL KUNIGAL UNDER GUIDANCE OF DR. K.R.RAO. SPRING 2011, ELECTRICAL ENGINEERING DEPARTMENT, UNIVERSITY OF TEXAS AT ARLINGTON FPGA Implementation.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
MPEG-4 Systems Introduction & Elementary Stream Management
Lecture 19: Solving the Correspondence Problem with Graph Cuts CAP 5415 Fall 2006.
Marwan Al-Namari 1 Digital Representations. Bits and Bytes Devices can only be in one of two states 0 or 1, yes or no, on or off, … Bit: a unit of data.
Implementation, Comparison and Literature Review of Spatio-temporal and Compressed domains Object detection. By Gokul Krishna Srinivasan Submitted to Dr.
Gaussian Mixture Models and Expectation-Maximization Algorithm.
-BY KUSHAL KUNIGAL UNDER GUIDANCE OF DR. K.R.RAO. SPRING 2011, ELECTRICAL ENGINEERING DEPARTMENT, UNIVERSITY OF TEXAS AT ARLINGTON FPGA Implementation.
The task of compression consists of two components, an encoding algorithm that takes a file and generates a “compressed” representation (hopefully with.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Page 11/28/2016 CSE 40373/60373: Multimedia Systems Quantization  F(u, v) represents a DCT coefficient, Q(u, v) is a “quantization matrix” entry, and.
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 11 COMPRESSION.
Video Compression and Standards
Overview of Digital Video Compression Multimedia Systems and Standards S2 IF Telkom University.
COMPARATIVE STUDY OF HEVC and H.264 INTRA FRAME CODING AND JPEG2000 BY Under the Guidance of Harshdeep Brahmasury Jain Dr. K. R. RAO ID MS Electrical.
1 Yu Liu 1, Feng Wu 2 and King Ngi Ngan 1 1 Department of Electronic Engineering, The Chinese University of Hong Kong 2 Microsoft Research Asia, Beijing,
Query by Image and Video Content: The QBIC System M. Flickner et al. IEEE Computer Special Issue on Content-Based Retrieval Vol. 28, No. 9, September 1995.
Automatic Caption Localization in Compressed Video By Yu Zhong, Hongjiang Zhang, and Anil K. Jain, Fellow, IEEE IEEE Transactions on Pattern Analysis and.
Introduction to MPEG  Moving Pictures Experts Group,  Geneva based working group under the ISO/IEC standards.  In charge of developing standards for.
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Visual Information Retrieval
Automatic Video Shot Detection from MPEG Bit Stream
Multimedia Outline Compression RTP Scheduling Spring 2000 CS 461.
DCT IMAGE COMPRESSION.
Object tracking in video scenes Object tracking in video scenes
Standards Presentation ECE 8873 – Data Compression and Modeling
15 Data Compression Foundations of Computer Science ã Cengage Learning.
A Block Based MAP Segmentation for Image Compression
Research Institute for Future Media Computing
15 Data Compression Foundations of Computer Science ã Cengage Learning.
Presentation transcript:

Object Based Video Coding - A Multimedia Communication Perspective Muhammad Hassan Khan

Overview Motivation for Video Coding Today’s Video Coding Problems with today’s video coding Desirable Features Solution to get desirable features  Object Based Video Coding  MPEG-4 Support  Model Based Coding Major Problem: Segmentation Segmentation by Graph Cuts Architecture to Incorporate this segmentation mechanism with MPEG-4 bit stream

Why Video Coding? Consider a 1 minute video with 60fps No of frames = 60 x 60 = 3600 Given that each color frame in the video was a 640 x 480 pixels The size of the raw video comes out to be?  3600 x 640 x 480 x 3 = 3,317,760,000 bits  Of the order of Gbs Now a days one might say that memory is no big deal… BUT What if we want to transfer this file from one node to the other node over a network!  Things would collapse very soon  Just imagine if the video was 1 hour duration rather than 1 minute!!!  I hope the need for video coding is now obvious

Today’s Video Coding YUV (lossy) MotionDCT Quantize (lossy) EntropyOrder Designed for natural scenes Higher frequency DCT coefficients are quantized Sharp edges are not well preserved

Problems with Today’s Video Coding Poor performance in case of  Anything with sharp edges  Highly textured regions  Texts (Channel Logos) The bit stream produced by today’s coders is also debatable in that weather it is the MOST optimal bit stream  In fact what is a most optimal bit stream is still a question

Desired Features Better compression Improved quality Interactivity and Manipulation of Content Error Resilience Processing of content in the compressed domain Identification and selective coding/decoding of the object of interest Facilitate Search / Indexing (MPEG-7)

Solution to Get Desirable Features MPEG-4  Support for Object Based Coding Rather than conventional block based coding for natural images The scene should be divided hierarchically into objects The scene will now be described by the objects placed in a hierarchical manner A sample is presented in the next slide

Hierarchical Description The scene divided into objects

The decoding process

Meshed Video 2D mesh tessellates the video into patches Motion vector for each vertex Motivation  Modeling (Motion and Shape)

Problems with Mesh Based Coding Works fine with previously known models and caters for a small class of objects The reliable tracking of features or control points along the video  E.g. FAPs A ready-made model is assumed, 2D or 3D model of the object has to be known A more general approach was required  Object Based Video Coding Shape, Color, and Motion

Requires a Major Step! Segmentation  Dividing the scene into objects  In simplest form these objects can be foreground and background  In more complex situations there can be multiple objects in the scene  Segmentation is required to extract the objects Computing Motion  Object Based Motion  Parameterized Motion Information

Segmentation by Graph Cuts Uses Max-Flow Min-Cut Algorithm from Graph Theory Divides the data into regions based on an energy function, usually employed to the intensities of the image A smoothness function is also used to make sure that the segmentation achieved is consistent Details will be provided in the final presentation

Architecture We will also propose mechanism to assign motion information to the segmented objects Our approach will be as consistent as possible with the support provided by MPEG

References Gary J. Sullivan, Pankaj Topiwala, Ajay Luthra SPIE Conference on Applications of Digital Image Processing XXVII, Special Session on Advances in the New Emerging Standard: H.264/AVC, August, 2004 Gabriel Antunes, Abrantes, Fernando Pereira, MPEG-4 Facial Animation Technology : Survey, Implementation and Results, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 9, No. 2, March 1999 Roger H Clarke, Image and Video Compression: A Survey Department of Computing and Electrical Engineering, Heriot-Watt University, Riccarton, Edinburgh EH14 4 AS, Scotland. Noel Brady, MPEG-4 Standardized Methods for the Compression of Arbitrarily Shaped Video Objects, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 9, No. 8, December 1999 Boykov, Y.; Veksler, O.; Zabih, R.; Fast approximate energy minimization via graph cuts, Pattern Analysis and Machine Intelligence, IEEE Transactions on Volume 23, Issue 11, Nov Page(s):