Discussion on Video Analysis and Extraction, MPEG-4 and MPEG-7 Encoding and Decoding in Java, Java 3D, or OpenGL Presented by: Emmanuel Velasco City College.

Slides:



Advertisements
Similar presentations
4.2 Multimedia Elements Audio 1. Learning Outcomes: At the end of the lesson, students should be: a) describe the purpose of using audio in multimedia.
Advertisements

Automatic Video Shot Detection from MPEG Bit Stream Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC.
Multi-Model Digital Video Library Professor: Michael Lyu Member: Jacky Ma Joan Chung Multi-Model Digital Video Library LYU9904 Multi-Model Digital Video.
DL:Lesson 11 Multimedia Search Luca Dini
1 Content-Based Retrieval (CBR) -in multimedia systems Presented by: Chao Cai Date: March 28, 2006 C SC 561.
A presentation by Modupe Omueti For CMPT 820:Multimedia Systems
3. Technical and administrative metadata standards Metadata Standards and Applications.
From Virtual Classroom to the Virtual Streets Emmanuel Velasco City College of New York Capstone 1 Fall 2004.
Una DooneyMultimediaSlide 1 What is Multimedia? A combination of different media types such as text, graphics, audio, video and animation etc in a single.
WMES3103 : INFORMATION RETRIEVAL
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
ISP 433/633 Week 5 Multimedia IR. Goals –Increase access to media content –Decrease effort in media handling and reuse –Improve usefulness of media content.
MUSCLE movie data base is a multimodal movie corpus collected to develop content- based multimedia processing like: - speaker clustering - speaker turn.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.
Outline of Presentation Introduction of digital video libraries Introduction of the CMU Informedia Project Informedia: user perspective Informedia:
MPEG-7 Multimedia Content Description Standard January 8, 2003 John R. Smith Pervasive Media Management Group IBM T. J. Watson Research Center 19 Skyline.
Multimedia Enabling Software. The Human Perceptual System Since the multimedia systems are intended to be used by human, it is a pragmatic approach to.
                      Digital Video 1.
By Breanna Myers Ms. Williams-Grant 5 th Period Business Computer Applications
Chapter II The Multimedia Sysyem. What is multimedia? Multimedia means that computer information can be represented through audio, video, and animation.
E0262 – MIS – Multimedia Storage Techniques SMIL – Synchronized Multimedia Integration Language.
MULTIMEDIA M U A T H H U M A I D R a s h A t a l l a h.
GIS technologies and Web Mapping Services
CHAPTER 7 Current Trends in Database.  Difficulties with RDBMS storage and usage  Demand for data in forms other than just text  Adoption of e-Business.
Naresuan University Multimedia Paisarn Muneesawang
1 Seminar Presentation Multimedia Audio / Video Communication Standards Instructor: Dr. Imran Ahmad By: Ju Wang November 7, 2003.
Multimedia Databases (MMDB)
© 2011 The McGraw-Hill Companies, Inc. All rights reserved Chapter 6: Video.
An Overview of MPEG-21 Cory McKay. Introduction Built on top of MPEG-4 and MPEG-7 standards Much more than just an audiovisual standard Meant to be a.
The MPEG Standard MPEG-1 (1992) actually a video player
CHAPTER FOUR COMPUTER SOFTWARE.
A Proposal for a Video Modeling for Composing Multimedia Document Cécile ROISIN - Tien TRAN_THUONG - Lionel VILLARD Presented by: Tien TRAN THUONG Project.
CMPD273 Multimedia System Prepared by Nazrita Ibrahim © UNITEN2002 Multimedia System Characteristic Reference: F. Fluckiger: “Understanding networked multimedia,
Multimedia Elements: Sound, Animation, and Video.
CHAPTER TEN AUTHORING.
By NIST/ITL/IAD, Mike Rubinfeld, January 16, 2002 Page 1 L3 Overview L3 Standards Overview By Mike Rubinfeld Chairman, INCITS/L3 (MPEG & JPEG) NIST, Gaithersburg,
COMPUTER PARTS AND COMPONENTS INPUT DEVICES
ECE8873 MPEG-7 Deryck Yeung. Overview Summary of MPEG-1,MPEG-2 and MPEG-4 Why another standard? MPEG-7 What’s next? Conclusion.
1 Mpeg-4 Overview Gerhard Roth. 2 Overview Much more general than all previous mpegs –standard finished in the last two years standardized ways to support:
MULTIMEDIA TECHNOLOGY SMM 3001 MEDIA - VIDEO. In this chapter How digital video differs from conventional analog video How digital video differs from.
Begin a new story or edit a Photo Story project that has already been created. Make your selection and click next. 1.
Introduction to Interactive Media Interactive Media Tools: Authoring Applications.
Reading Flash. Training target: Read the following reading materials and use the reading skills mentioned in the passages above. You may also choose some.
MMDB-9 J. Teuhola Standardization: MPEG-7 “Multimedia Content Description Interface” Standard for describing multimedia content (metadata).
MPEG-4: Multimedia Coding Standard Supporting Mobile Multimedia System Lian Mo, Alan Jiang, Junhua Ding April, 2001.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Soon Joo Hyun Database Systems Research and Development Lab. US-KOREA Joint Workshop on Digital Library t Introduction ICU Information and Communication.
MPEG-7 Audio Overview Ichiro Fujinaga MUMT 611 McGill University.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
MULTIMEDIA DATA MODELS AND AUTHORING
Introduction to MPEG  Moving Pictures Experts Group,  Geneva based working group under the ISO/IEC standards.  In charge of developing standards for.
MPEG 7 &MPEG 21.
MPEG-7 What is MPEG-7 ? MPEG-7 is a multimedia content description standard. These descriptions are based on catalogue (e.g., title, creator, rights),
MPEG-4 Binary Information for Scenes (BIFS)
Visual Information Retrieval
Automatic Video Shot Detection from MPEG Bit Stream
CS644 Advanced Topics in Networking
Introduction Multimedia initial focus
VIDEO.
SWE 423: Multimedia Systems
Inserting Graphics, Media, and Objects
Chapter 6: Video.
An Overview of MPEG-21 Cory McKay.
Overview What is Multimedia? Characteristics of multimedia
Multimedia Content Description Interface
Using Animation and Multimedia
Presentation transcript:

Discussion on Video Analysis and Extraction, MPEG-4 and MPEG-7 Encoding and Decoding in Java, Java 3D, or OpenGL Presented by: Emmanuel Velasco City College of New York

Video Analysis and Extraction As more videos are being created, digitized and archived, the need for content-based search and retrieval is necessary. This involves analyzing a video and extracting its contents. As more videos are being created, digitized and archived, the need for content-based search and retrieval is necessary. This involves analyzing a video and extracting its contents. The videos are cut into frames. The frames are analyzed and the objects can be extracted using image processing techniques. The videos are cut into frames. The frames are analyzed and the objects can be extracted using image processing techniques.

Video Analysis and Extraction Temporal Video Segmentation Cut detection: The changes in the contents are visible and occur instantaneously between consecutive frames. Cut detection: The changes in the contents are visible and occur instantaneously between consecutive frames. Gradual transition detection: The image transition makes gradual changes. This requires multiple frames to be analyzed. Gradual transitions include fade in, fade out, wipe and dissolve. Gradual transition detection: The image transition makes gradual changes. This requires multiple frames to be analyzed. Gradual transitions include fade in, fade out, wipe and dissolve.

Video Analysis and Extraction Examples: Cut transition Cut transition Gradual transition Gradual transition

Video Analysis and Extraction The cut transition is easier to detect. We check the frame differences between two consecutive frames and see if the difference is greater than a certain threshold. If it is, then a cut is determined. The cut transition is easier to detect. We check the frame differences between two consecutive frames and see if the difference is greater than a certain threshold. If it is, then a cut is determined. Gradual transitions are harder to detect. There are several methods, which include the twin- comparison algorithm. This works by noticing that the first and last transition frames are different, and any consecutive frames between them are similar. Gradual transitions are harder to detect. There are several methods, which include the twin- comparison algorithm. This works by noticing that the first and last transition frames are different, and any consecutive frames between them are similar.

Video Analysis and Extraction Twin-Algorithm Results

Video Analysis and Extraction Scene and Object Detection We want to identify objects in a video. One method of finding this is the opposite of transition detection. Instead of finding the differences between frames above a threshold, we want to find image regions below a certain threshold. We want to identify objects in a video. One method of finding this is the opposite of transition detection. Instead of finding the differences between frames above a threshold, we want to find image regions below a certain threshold. Another method is to take an image and try all possible transformations between the edges of the two images. Another method is to take an image and try all possible transformations between the edges of the two images.

Video Analysis and Extraction Text Extraction We want to retrieve the captions in an video. While most text segmentation is done on high resolution media, video is low resolution. We want to retrieve the captions in an video. While most text segmentation is done on high resolution media, video is low resolution. One method is to assume that the gray levels of the text is lighter or darker than the background. Using a minimum difference with the background, the text can be extracted. One method is to assume that the gray levels of the text is lighter or darker than the background. Using a minimum difference with the background, the text can be extracted.

Video Analysis and Extraction Example of Text Extraction

Video Analysis and Extraction So we see that video analysis and extraction is useful in our projects. The Classroom Project: Object detection is used for finding the location of the professor. Text extraction is useful for capturing text in the PowerPoint slides shown in a video.

Video Analysis and Extraction The NYC Traffic Project: Object detection is used for detecting how heavy or light the traffic is. Transition detection is used to see if we are looking at the same view, or if the view has changed.

MPEG-4 Is an ISO/IEC compression standard created by the Moving Pictures Expert Group (MPEG). Is an ISO/IEC compression standard created by the Moving Pictures Expert Group (MPEG). Has been successfully used in: Has been successfully used in: digital televisiondigital television interactive graphics applicationsinteractive graphics applications interactive multimediainteractive multimedia

MPEG-4 Can bring multimedia to new networks such as mobile networks. Can bring multimedia to new networks such as mobile networks. Media objects are audio, video, or audiovisual contents and can be natural (recorded using a camera and/or microphone) or synthetic (generated using a computer). Media objects are audio, video, or audiovisual contents and can be natural (recorded using a camera and/or microphone) or synthetic (generated using a computer).

MPEG-4 An example of an MPEG-4 scene. An example of an MPEG-4 scene.

MPEG-4 The media objects are independent from their background. This allows easy extraction of the object and easier editing of an object. The media objects are independent from their background. This allows easy extraction of the object and easier editing of an object. The objects are synchronized by time and space. The objects are synchronized by time and space.

MPEG-4 With a set of media objects, MPEG-4 allows us to: With a set of media objects, MPEG-4 allows us to: place objects anywhere in a given coordinate system.place objects anywhere in a given coordinate system. apply transforms to change an visual object geometrically or change an audio object acoustically.apply transforms to change an visual object geometrically or change an audio object acoustically. group objects together (such as the visual image of the person, and their voice).group objects together (such as the visual image of the person, and their voice). apply streamed data to media objects to modify their attributes.apply streamed data to media objects to modify their attributes. change the user’s viewpoint or listening point anywhere in the scene.change the user’s viewpoint or listening point anywhere in the scene.

Encoder / Decoder Definitions Encoder: To format (electronic data) according to a standard format. Encoder: To format (electronic data) according to a standard format. Decoder: to recognize and interpret (an electronic signal) Decoder: to recognize and interpret (an electronic signal)

MPEG-4 Encoder / Decoder While many MPEG-4 encoders and decoders exists as standalone applications, we want to be able to encode and decode using Java, Java 3D, or OpenGL.

MPEG-4 Encoder / Decoder IBM Toolkit for MPEG-4 is a set of Java classes and API with five applications. IBM Toolkit for MPEG-4 is a set of Java classes and API with five applications. AVgen: a simple, easy-to-use GUI tool for creating audio/video-only content for ISMA- or 3GPP-compliant devicesAVgen: a simple, easy-to-use GUI tool for creating audio/video-only content for ISMA- or 3GPP-compliant devices XMTBatch: a tool for creating rich MPEG-4 content beyond simple audio and videoXMTBatch: a tool for creating rich MPEG-4 content beyond simple audio and video M4Play: an MPEG-4 client playback applicationM4Play: an MPEG-4 client playback application M4Applet for ISMA: a Java player applet for ISMA- compliant contentM4Applet for ISMA: a Java player applet for ISMA- compliant content M4Applet for HTTP: a Java applet for MPEG-4 content played back over HTTP.M4Applet for HTTP: a Java applet for MPEG-4 content played back over HTTP.

MPEG-4 Encoder / Decoder IBM MPEG-4 XMT Editor Tool Add media object Time Frame Object Attributes

MPEG-4 Encoder / Decoder IBM MPEG-4 Demos: IBM MPEG-4 Demos: SKLMP4 Encoder / Decoder SKLMP4 Encoder / Decoder is a C++ library that is capable of encoding and decoding MPEG-4

MPEG-4 MPEG-4 can make it easier for us to extract the objects since each object is independent of each other. The Classroom Project: The professor is an image object, separated from the PowerPoint background.

MPEG-4 The NYC Traffic Project: The background (roads) are separate from the objects (cars). The interactivity that MPEG-4 allows can make the user interface easier to interact with. They can point and click on the map and view the cameras in that location.

MPEG-7 Since audiovisual data is increasing and coming from many different sources, searching for a certain type of media content will be more difficult. Therefore we need a way to search the data quickly and efficiently. The solution is MPEG-7. Since audiovisual data is increasing and coming from many different sources, searching for a certain type of media content will be more difficult. Therefore we need a way to search the data quickly and efficiently. The solution is MPEG-7. MPEG-7 is a standard for describing media content. Unlike MPEG-1, MPEG-2, and MPEG-4, MPEG-7 is not a standard for the actual coding of moving pictures and audio. MPEG-7 is a standard for describing media content. Unlike MPEG-1, MPEG-2, and MPEG-4, MPEG-7 is not a standard for the actual coding of moving pictures and audio.

MPEG-7 MPEG-7 uses XML Schema as the language of choice for content description. MPEG-7 uses XML Schema as the language of choice for content description. These descriptions may include information describing the creation of the content (title, author). It may include the storage features of the content (storage format, encoding). It can contain low level features in the content (color, texture, shape, motion, audio). These descriptions may include information describing the creation of the content (title, author). It may include the storage features of the content (storage format, encoding). It can contain low level features in the content (color, texture, shape, motion, audio).

So what will MPEG-7 standardize? A set of descriptors (D): Descriptors define the syntax and the semantics of each feature (metadata element). A set of descriptors (D): Descriptors define the syntax and the semantics of each feature (metadata element). A set of description schemes (DS): A description scheme specifies the structure and semantics of the relationships between its components. A set of description schemes (DS): A description scheme specifies the structure and semantics of the relationships between its components.

So what will MPEG-7 standardize? Description Definition Language (DDL): to define the syntax of the descriptors and description schemes. Description Definition Language (DDL): to define the syntax of the descriptors and description schemes.

Some possible MPEG-7 Applications Audio: play a few notes on the keyboard, and it will return musical pieces with similar tunes. Audio: play a few notes on the keyboard, and it will return musical pieces with similar tunes. Graphics: sketch a few lines on a screen and get a set of images containing similar graphics or logos. Graphics: sketch a few lines on a screen and get a set of images containing similar graphics or logos. Images: define objects, color patterns or textures and retrieve images that look like the image described. Images: define objects, color patterns or textures and retrieve images that look like the image described.

MPEG-7 Encoder / Decoder MPEG-7 Library is a set of C++ classes, implementing the MPEG-7 standard. MPEG-7 Library is a set of C++ classes, implementing the MPEG-7 standard. Java MPEG-7 Audio Encoder is a java library that provides a MPEG-7 audio encoder to describe an audio content with some descriptors of the MPEG-7 standard. Java MPEG-7 Audio Encoder is a java library that provides a MPEG-7 audio encoder to describe an audio content with some descriptors of the MPEG-7 standard. aachen.de/team/crysandt/software/mpeg7audioenc/ aachen.de/team/crysandt/software/mpeg7audioenc/

MPEG-7 Once we have a lot of media contents, MPEG-7 allows us to search through them easier. The Classroom Project: If we have a lot of videos, sound, or both. We can find the content we need quickly. The NYC Traffic Project: If there are many cameras at several locations, finding a specific location can be easier.

Discussion on Video Analysis and Extraction, MPEG-4 and MPEG-7 Encoding and Decoding in Java, Java 3D, or OpenGL Presented by: Emmanuel Velasco City College of New York