Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,


LYU0102: XML for Interoperable Digital Video Library
Supervised by Prof. LYU, Rung Tsong Michael
Department of Computer Science & Engineering, The Chinese University of Hong Kong
Prepared by: Chan Pik Wah, Pat and Ngai Cheuk Han, Table

Outline
- Project Overview
- Extraction Techniques
  - Video Optical Character Recognition (VOCR)
  - Scene Change Detection
- Storage
  - XML
  - Knowledge Enrichment
- Implementation
- Tasks in next semester

Project Outline

Motivations
- Rapid increase in the usage of multimedia information
- New approach: digital video library

Motivations
- Little attention has been paid to video information extraction and storage
- Scalability of the system when adding new extraction components
- Lack of a generic framework for presenting and visualizing video information

Targets
- Provide an open architecture that can integrate different digital video library functions
- Increase the reusability of the information extracted from videos
- Deliver and present the video on multiple computing platforms

Ways to achieve
- Modular concept of the digital video library functions
- Collaboration among the video information processing modules
- Using XML for storage
  - Universal format
  - Flexible and scalable
  - Can be presented in different ways
  - Easy to search based on particular tags
- Generic framework for presentation and visualization of video information

Overview of our project

Achievements
- Implemented two video information extraction techniques
  - Video optical character detection
  - Scene change detection
- Stored the extracted information as XML
- Built an XML editor into the tool for editing
- Performed knowledge enrichment based on the extracted information

Extraction Techniques
- Text detection
- Camera motion
- Face detection
- Scene changes
- Word relevance
- Audio level

Video OCR for Digital News
- Helps locate topics by extracting the words in the captions
- News captions provide vital search information about the video
- Video OCR extracts the keywords shown on the frames
- The results can be used together with the words extracted from the transcript for indexing
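The last point, indexing OCR keywords together with transcript words, can be illustrated with a small inverted index. This is only a sketch: the segment identifiers, the `ocr` and `transcript` keys, and the function name `build_index` are assumptions made for the example and do not come from the project.

```python
from collections import defaultdict

def build_index(segments):
    """Map each lower-cased keyword to the set of segments that mention it.

    `segments` maps a segment id to a dict of word lists, one list per
    source (for example OCR captions and the spoken transcript).
    """
    index = defaultdict(set)
    for segment_id, sources in segments.items():
        for words in sources.values():
            for word in words:
                index[word.lower()].add(segment_id)
    return index

# Hypothetical news segments with words from both sources.
segments = {
    "story-1": {"ocr": ["Hong Kong", "Budget"], "transcript": ["budget", "speech"]},
    "story-2": {"ocr": ["Weather"], "transcript": ["typhoon", "hong kong"]},
}
index = build_index(segments)
```

Looking up `index["hong kong"]` returns both stories, because one mentions the city in a caption and the other in the transcript.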

Video OCR for Digital News

Scene change
- Detection technique
- Effective method for segmenting a video sequence into significant components

Existing Method
- Image difference method
- Histogram difference method
- Histogram difference method using DC coefficient image
- Our Algorithm & Implementation
  - Histogram difference method with dynamic threshold

Our Algorithm & Implementation
- Build the histogram and compare it with that of the previous scene
- Calculate the histogram difference
- If (total difference) > threshold => scene change
- Use the first frame as the key frame
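The steps above can be sketched in a few lines. The slides do not say how the dynamic threshold is computed, so this example assumes one possibility: the mean plus `k` standard deviations of the differences seen so far in the current scene, with a fixed lower bound. It also compares each frame with the previous frame. Frames are plain lists of grey-level values, and all function and parameter names are illustrative.

```python
from statistics import mean, pstdev

def histogram(frame, bins=16, max_value=256):
    """Count how many pixel values fall into each of `bins` equal-width buckets."""
    counts = [0] * bins
    width = max_value // bins
    for pixel in frame:
        counts[min(pixel // width, bins - 1)] += 1
    return counts

def histogram_difference(h1, h2):
    """Total difference: sum of absolute bin-by-bin differences."""
    return sum(abs(a - b) for a, b in zip(h1, h2))

def detect_scene_changes(frames, k=3.0, min_fraction=0.25):
    """Return the indices of the first frame of each scene (the key frames)."""
    if not frames:
        return []
    key_frames = [0]              # the first frame opens the first scene
    previous = histogram(frames[0])
    recent = []                   # differences seen inside the current scene
    for i in range(1, len(frames)):
        current = histogram(frames[i])
        diff = histogram_difference(previous, current)
        # Assumed dynamic threshold: adapts to the activity of the current
        # scene, but never drops below a fraction of the maximum difference.
        floor = min_fraction * 2 * len(frames[i])
        adaptive = mean(recent) + k * pstdev(recent) if recent else 0.0
        if diff > max(floor, adaptive):
            key_frames.append(i)  # scene change: this frame is a key frame
            recent = []
        else:
            recent.append(diff)
        previous = current
    return key_frames

# Three dark frames followed by three bright frames.
frames = [[10] * 64, [12] * 64, [11] * 64, [200] * 64, [205] * 64, [198] * 64]
key_frames = detect_scene_changes(frames)
```

With this toy input the detector reports a new scene at the first bright frame, so `key_frames` holds the indices 0 and 3.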

Storage

XML
- Extensible Markup Language
- W3C
- Lets you create your own markup language for describing the contents

Advantages of using XML
- Platform and system independent
- Create your own tags
- Adopts Unicode
- Universal format
- Scalable

XML schema

XML Parser
- A parser is an interface between an XML document and the application program
- Document Object Model (DOM)
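As an illustration of a DOM parser acting as the interface between a document and a program, the sketch below uses Python's standard `xml.dom.minidom`. The element and attribute names (`video`, `scene`, `text`, `start`) are hypothetical, since the project's actual schema is not reproduced in the transcript.

```python
from xml.dom.minidom import parseString

# Hypothetical document describing two scenes of one video.
SAMPLE = """<video title="News">
  <scene id="1" start="0.0" end="12.5">
    <text>Hong Kong</text>
  </scene>
  <scene id="2" start="12.5" end="30.0">
    <text>Weather report</text>
  </scene>
</video>"""

def read_scenes(xml_text):
    """Return (id, start time, caption) for every <scene> in the document."""
    document = parseString(xml_text)   # builds the whole DOM tree in memory
    scenes = []
    for node in document.getElementsByTagName("scene"):
        captions = node.getElementsByTagName("text")
        has_text = bool(captions) and captions[0].firstChild is not None
        caption = captions[0].firstChild.data if has_text else ""
        scenes.append((node.getAttribute("id"),
                       float(node.getAttribute("start")),
                       caption))
    return scenes

scenes = read_scenes(SAMPLE)
```

The application never touches the raw text: it asks the DOM for elements by name and reads attributes and text nodes from them.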

How to present XML
- The tree model closely mirrors the XML schema
- Nodes show element/attribute names or text content and their relative positions within the XML
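A tree presentation of this kind can be sketched as an indented outline in which each element, attribute, and text value becomes one line. The sample document and the `@name = value` notation are assumptions chosen for the example, not the tool's actual display format.

```python
import xml.etree.ElementTree as ET

def tree_lines(element, depth=0):
    """Return one outline line per element, attribute, and text value."""
    indent = "  " * depth
    lines = [f"{indent}{element.tag}"]
    for name, value in element.attrib.items():
        lines.append(f"{indent}  @{name} = {value}")
    text = (element.text or "").strip()
    if text:
        lines.append(f'{indent}  "{text}"')
    for child in element:              # children keep their document order
        lines.extend(tree_lines(child, depth + 1))
    return lines

root = ET.fromstring('<video><scene id="1"><text>Hong Kong</text></scene></video>')
outline = tree_lines(root)
```

Joining `outline` with newlines gives a view in which nesting depth reflects each node's position in the XML.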

Content creation in digital video library
- Collaboration among different video information extraction techniques, mainly
  - Knowledge cross-referencing
  - Knowledge enrichment
- Access to video by content
- Communicate information trends across time and space
- Provide fast and effective searching

Knowledge Enrichment
- Geographic information
- Extract geographic names of countries and cities from text recognized by video OCR or speech recognition
- Knowledge from a geographic naming database enriches the information
- Allow querying or browsing for events at a particular location or within some "distance" of that location
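A query for events within some distance of a location needs a distance measure between two coordinates. The sketch below uses the haversine great-circle formula, which is one common choice and not necessarily what the project used. The event records and their coordinates are approximate, made-up examples.

```python
from math import asin, cos, radians, sin, sqrt

EARTH_RADIUS_KM = 6371.0

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two latitude/longitude points."""
    phi1, phi2 = radians(lat1), radians(lat2)
    d_phi = phi2 - phi1
    d_lambda = radians(lon2 - lon1)
    a = sin(d_phi / 2) ** 2 + cos(phi1) * cos(phi2) * sin(d_lambda / 2) ** 2
    return 2 * EARTH_RADIUS_KM * asin(sqrt(a))

def events_near(events, lat, lon, max_km):
    """Return the titles of events located within `max_km` of the given point."""
    return [event["title"] for event in events
            if haversine_km(lat, lon, event["lat"], event["lon"]) <= max_km]

# Hypothetical events with approximate city coordinates.
events = [
    {"title": "Harbour fireworks", "lat": 22.30, "lon": 114.20},  # Hong Kong
    {"title": "Grand Prix", "lat": 22.20, "lon": 113.55},         # Macau
    {"title": "Stock market report", "lat": 35.68, "lon": 139.69},  # Tokyo
]
nearby = events_near(events, 22.30, 114.20, max_km=100)
```

Here `nearby` contains the Hong Kong and Macau events but not the Tokyo one.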

Our Implementation
- Use a known set of places along with their spatial coordinates and some additional information for knowledge enrichment
- Use the XML file as the source material to be processed
- Try to extract names of major cities by processing the text in the source

Geographic naming database
- An XML file with the following format is used
- For each city:
  - City ID
  - Name of city
  - Name of country
  - Longitude
  - Latitude
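The actual file is not included in the transcript, so the following is only a guess at how such a database and the city lookup might look. The element names (`cities`, `city`, `name`, `country`, `longitude`, `latitude`) and the two sample records are assumptions built from the field list above.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical geographic naming database with the five fields per city.
GAZETTEER_XML = """<cities>
  <city id="1">
    <name>Hong Kong</name>
    <country>China</country>
    <longitude>114.17</longitude>
    <latitude>22.32</latitude>
  </city>
  <city id="2">
    <name>Tokyo</name>
    <country>Japan</country>
    <longitude>139.69</longitude>
    <latitude>35.68</latitude>
  </city>
</cities>"""

def load_gazetteer(xml_text):
    """Return a dict from lower-cased city name to that city's record."""
    cities = {}
    for city in ET.fromstring(xml_text).findall("city"):
        name = city.findtext("name")
        cities[name.lower()] = {
            "id": city.get("id"),
            "name": name,
            "country": city.findtext("country"),
            "longitude": float(city.findtext("longitude")),
            "latitude": float(city.findtext("latitude")),
        }
    return cities

def find_cities(text, gazetteer):
    """Return the records of all known cities mentioned in `text`."""
    lowered = text.lower()
    return [record for key, record in gazetteer.items()
            if re.search(r"\b" + re.escape(key) + r"\b", lowered)]

gazetteer = load_gazetteer(GAZETTEER_XML)
hits = find_cities("Typhoon signal raised in HONG KONG tonight", gazetteer)
```

The match is case-insensitive and word-bounded, so a caption in capitals still finds the Hong Kong record along with its country and coordinates.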

The updated XML file

Knowledge enrichment component in our tool
- Extracts and lists all the cities mentioned in the video
- Allows the user to select any of them to look up further information about that city

Implementation

Program Platform
- Microsoft Visual C++ ®
  - Object-oriented
  - Faster MFC applications
  - Composite controls -> ActiveX
- Microsoft ® DirectShow ®
  - Component Object Model (COM)
  - High-quality capture and playback of multimedia streams

Video Player

Control
- Filter graph manager
- Dialog box created with the class CFormView

Scene Change & VOCD
- CScrollView
- CMenu
- Add the extracted information to XML
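Adding extracted information to XML could look roughly like the sketch below. The tool itself was built with Visual C++, so this Python version only illustrates the idea. The element names, the `src` attribute, and the `add_scene` helper are hypothetical.

```python
import xml.etree.ElementTree as ET

def add_scene(video, start, end, ocr_words):
    """Append one <scene> element holding its time range and detected words."""
    scene = ET.SubElement(video, "scene",
                          {"start": f"{start:.2f}", "end": f"{end:.2f}"})
    for word in ocr_words:
        ET.SubElement(scene, "text").text = word
    return scene

# Hypothetical root element for one video file.
video = ET.Element("video", {"src": "news.mpg"})
add_scene(video, 0.0, 12.5, ["Hong Kong", "Weather"])
xml_text = ET.tostring(video, encoding="unicode")
```

Each detected scene becomes one element, and the words found by text detection become its children, so later components can read them back with any XML parser.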

XML Editor
- TreeView
- XML read by parser
  - Tag in XML
  - Node in tree

Knowledge Enrichment
- Dialog box created as class CFormView
- Read the database
- Compare with the generated XML

Problems & Solutions

Problems & Solutions
- Multi-modal tool: integrate all the components
- Docking window is used
- Flexible and efficient for adding new modules
- CSizingControlBar

Problems & Solutions

Demo

Future

Tasks in Next Semester
- Focus on using XML for multimedia presentation
- Style sheets (XSLT) are not suitable for multimedia document generation
- New format for multimedia presentation
  - SMIL

Tasks in Next Semester
- Time-based multimedia content
- Able to synchronize the playback of all multimedia elements
- Transform the generated XML into SMIL format for presentation
- Design a style sheet suitable for multimedia document generation
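As a rough idea of what transforming scene XML into SMIL could involve, the sketch below emits a `seq` of `par` groups, each pairing a video clip with a caption. The input schema and the caption file names (`caption1.txt`, ...) are hypothetical, and the output is a minimal fragment in the style of SMIL 2.0 rather than a validated presentation.

```python
import xml.etree.ElementTree as ET

# Hypothetical scene description, as an earlier extraction step might produce.
VIDEO_XML = """<video src="news.mpg">
  <scene start="0.0" end="12.5"><text>Hong Kong</text></scene>
  <scene start="12.5" end="30.0"><text>Weather</text></scene>
</video>"""

def to_smil(video_xml):
    """Build a SMIL document that plays the scenes one after another."""
    video = ET.fromstring(video_xml)
    smil = ET.Element("smil")
    seq = ET.SubElement(ET.SubElement(smil, "body"), "seq")  # sequential playback
    for number, scene in enumerate(video.findall("scene"), start=1):
        par = ET.SubElement(seq, "par")    # children of <par> play together
        ET.SubElement(par, "video", {
            "src": video.get("src"),
            "clipBegin": f"npt={scene.get('start')}s",
            "clipEnd": f"npt={scene.get('end')}s",
        })
        if scene.findtext("text"):
            # Assumed: captions are stored in separate text files.
            ET.SubElement(par, "text", {"src": f"caption{number}.txt"})
    return ET.tostring(smil, encoding="unicode")

smil_text = to_smil(VIDEO_XML)
```

The `par` grouping is what synchronizes a caption with its clip, while the enclosing `seq` keeps the scenes in time order.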

Q & A