Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,

Slides:



Advertisements
Similar presentations
XSL XSLT and XPath 11-Apr-17.
Advertisements

A New Scheme For Robust Blind Digital Video Watermarking Supervised by Prof. LYU, Rung Tsong Michael Presented by Chan Pik Wah, Pat Mar 5, 2002 Department.
Multi-Model Digital Video Library Professor: Michael Lyu Member: Jacky Ma Joan Chung Multi-Model Digital Video Library LYU9904 Multi-Model Digital Video.
LYU0101 Wireless Digital Library on PDA Lam Yee Gordon Yeung Kam Wah Supervisor Prof. Michael Lyu First semester FYP Presentation 2001~2002.
XISL language XISL= eXtensible Interaction Sheet Language or XISL=eXtensible Interaction Scenario Language.
An Automatic Video Information Processing and Conversion System for Multimedia Messaging Service and Mobile Internet Miss Chan Pik Wah, Pat Miss Ngai Cheuk.
Video Table-of-Contents: Construction and Matching Master of Philosophy 3 rd Term Presentation - Presented by Ng Chung Wing.
LYU0101 Wireless Digital Information System Lam Yee Gordon Yeung Kam Wah Supervisor Prof. Michael Lyu Second semester FYP Presentation 2001~2002.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Timing in XML XML and XSL Timing framework in XML Approaches Inline syntax (SMIL) Styled Timing Timesheets Timesheets and SMIL comparison.
LYU0101 Wireless Digital Information System Lam Yee Gordon Yeung Kam Wah Supervisor Prof. Michael Lyu Second semester FYP Presentation 2001~2002.
ADVISE: Advanced Digital Video Information Segmentation Engine
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
Timing in XML Timing framework in XML Approaches Inline syntax (SMIL) Styled Timing Timesheets Timesheets and SMIL comparison.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
LYU0203 Smart Traveller with Visual Translator for OCR and Face Recognition Supervised by Prof. LYU, Rung Tsong Michael Prepared by: Wong Chi Hang Tsang.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
ART: Augmented Reality Table for Interactive Trading Card Game Albert H.T. Lam, Kevin C. H. Chow, Edward H. H. Yau and Michael R. Lyu Department of Computer.
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
Multimedia Security Digital Video Watermarking Supervised by Prof. LYU, Rung Tsong Michael Presented by Chan Pik Wah, Pat Nov 20, 2002 Department of Computer.
Outline of Presentation Introduction of digital video libraries Introduction of the CMU Informedia Project Informedia: user perspective Informedia:
1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System Supervisor: Prof Michael Lyu Presented by: Lewis Ng,
E GOV Universal Access Ahmed Gomaa CIMIC Rutgers University.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Development of Web-based Collaborative Environment For Distant Learning Supervised by Prof. Michael Lyu Presented by Ma Ka Po.
Overview of Search Engines
Multimedia Enabling Software. The Human Perceptual System Since the multimedia systems are intended to be used by human, it is a pragmatic approach to.
Digital Object: A Virtual Online Storage Solution 598C Course Project Huajing Li.
E0262 – MIS – Multimedia Storage Techniques SMIL – Synchronized Multimedia Integration Language.
Chapter 11-Multimedia Authoring Tools. Overview Introduction to multimedia authoring tools. Types of authoring tools. Cross-platform authoring notes.
Copyright © 2012 Accenture All Rights Reserved.Copyright © 2012 Accenture All Rights Reserved. Accenture, its logo, and High Performance Delivered are.
IS432 Semi-Structured Data Lecture 5: XSLT Dr. Gamal Al-Shorbagy.
WORKING WITH XSLT AND XPATH
Integrating Timing into XML Documents Patrick Schmitz MS Research BARC Telepresence.
CHAPTER FOUR COMPUTER SOFTWARE.
Introduction to Interactive Media Interactive Media Tools: Software.
DSpace UI Alexey Maslov. DSpace in general A digital library tool useful for storage, maintenance, and retrieval of digital documents Two types of interaction:
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
CMPD273 Multimedia System Prepared by Nazrita Ibrahim © UNITEN2002 Multimedia System Characteristic Reference: F. Fluckiger: “Understanding networked multimedia,
Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.
Presented by Nassib Awad
CITA 330 Section 6 XSLT. Transforming XML Documents to XHTML Documents XSLT is an XML dialect which is declared under namespace "
CHAPTER TEN AUTHORING.
© 2008 The McGraw-Hill Companies, Inc. All rights reserved. ACCESS 2007 M I C R O S O F T ® THE PROFESSIONAL APPROACH S E R I E S Lesson 13 – Advanced.
Web Services for Satellite Emulation Development Kathy J. LiszkaAllen P. Holtz The University of AkronNASA Glenn Research Center.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal VideoConference Archives Indexing System.
FYP: LYU0001 Wireless-based Mobile E-Commerce on the Web Supervisor: Prof. Michael R. Lyu By: Tony, Wat Hong Fai Harris, Yan Wai Keung.
Session: 1. © Aptech Ltd. 2Introduction to the Web / Session 1  Explain the evolution of HTML  Explain the page structure used by HTML  List the drawbacks.
1 Overview of XSL. 2 Outline We will use Roger Costello’s tutorial The purpose of this presentation is  To give a quick overview of XSL  To describe.
Of 50 E GOV Universal Access Ahmed Gomaa CIMIC Rutgers University.
March 31, 1998NSF IDM 98, Group F1 Group F Multi-modal Issues, Systems and Applications.
The Synchronized Multimedia Integration Language (SMIL) Kuo-Hao Li.
Introduction to Interactive Media Interactive Media Tools: Authoring Applications.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.
XSLT. XSLT stands for Extensible Stylesheet Language Transformations XSLT is used to transform XML documents into other kinds of documents. XSLT can produce.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Soon Joo Hyun Database Systems Research and Development Lab. US-KOREA Joint Workshop on Digital Library t Introduction ICU Information and Communication.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Chapter – 8 Software Tools.
Introduction to MPEG  Moving Pictures Experts Group,  Geneva based working group under the ISO/IEC standards.  In charge of developing standards for.
XML Notes taken from w3schools. What is XML? XML stands for EXtensible Markup Language. XML was designed to store and transport data. XML was designed.
Topic Map & SMIL Prototypes KUL-ESAT-DOCARCH
Digital Video Library - Jacky Ma.
Supervisor: Prof Michael Lyu Presented by: Lewis Ng, Philip Chan
CHAPTER 8 Multimedia Authoring Tools
Prepared for Md. Zakir Hossain Lecturer, CSE, DUET Prepared by Miton Chandra Datta
Presentation transcript:

Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah, Pat Ngai Cheuk Han, Table LYU0102 XML for Interoperable Digital Video Library

Outline Introduction to XVIP Overview of Project Extraction Techniques Face Detection Speech Recognition Multimedia Transformation & Presentation XSL SMIL Transformation Problems & Solutions Conclusion

Motivations Rapid increase in the usage of multimedia information New approach: DIGITAL VIDEO LIBRARY Project Outline

Motivations Little attention paying on video information extraction and storage Scalability of the system in terms of adding new extraction components Lack of a generic framework for presentation and visualization of video information Project Outline

Overview of XVIP Project Outline

Achievements in last Semester 2 Extraction Techniques Scene Change VOCR Integrate data into XML XML Editor Knowledge Enrichment Project Outline

Achievements in this Semester 2 more extraction techniques Face Detection Speech Recognition New data integrated to XML XML to SMIL Transformer Project Outline

Extraction Techniques Video Scene Change VOCD Face Detection Speech Recognition XML

Face Detection Object-presence detections are also an important technique. Identify and index features to support image similarity matching. Face detection is a good example Extraction Techniques

Face Detection Name of people appearing in the video How they are interacting with the environment More searchable Extraction Techniques

Face Detection Neural Network-Based Algorithm The basic algorithm used for face detection Extraction Techniques

Face Detection Face Recognition Facial Expression Analysis Enrich the XML Easier for user to search the content of video Extraction Techniques

Speech Recognition Speech recognition technology can make any spoken data useful for library indexing and retrieval Extraction Techniques

Speech Recognition Engine Extraction Techniques

Speech Recognition ViaVoice Error rate > 50% Extraction Techniques

Usage of XML XML Indexing & Searching Combine with other XML for Knowledge Enrichment Presentation Exchange data with different application

Presentation of the video data XML is not presentable without processing HTML with images, but is static SMIL is good for multimedia presentation No existing tools for integrating different XML data into a SMIL presentation Current transformation language has a lot of limitations in transforming XML to SMIL SMIL

SMIL stands for Synchronized Multimedia Integration Language is currently a W3C Recommendation. It is a markup language that can synchronize and integrate multimedia. It enables authors to specify when and what should be presented. RealPlayer, QuickTime, IE support SMIL

Advantages SMIL is text-based Easy to develop with a text editor Generate customized presentations Generate customized SMIL file based on preferences recorded in the visitor's browser SMIL effort is led by the W3C W3C tries to shape a specification that is beneficial to all parties involved. Avoid using container formats. SMIL can stream many media formats, no need to merge clips into a single streaming file. SMIL

Timing and Synchronization Sequence element: …… Parallel element: … SMIL

XSL Stands for “Extensible Stylesheet Language” XSL is the language defined by the W3C to add formatting information to XML data. XSLT -- most commonly used XSL standard Transforms one XML document into another. Used in our FYP. XSL

Working Principle Source Tree XSL Stylesheet Output

Transformation Process Transformation Input files XML file generated by XVIP XML files of additional information Output files A SMIL file Some RealText files

Design 1 Build with VC++ solely Read all the input files, get the information Create the output the files for the SMIL presentation. Transformation Disadvantages Layout of the SMIL presentation need to be hard-coded in the VC++ program. The layout becomes hard to change and the transformer becomes hard to extend.

Design 1 with modification Modification Provide an additional file or interface as a template for user to define the layout of SMIL presentation. Disadvantage The flexibility provided is still limited. Not a standard way to define a template. Transformation

Design 2 Use XSLT assisting the transformation. User can define his own template with XSL. Advantages Program-independent Extensible Standard templates Transformation Limitations of XSLT It can only read one input data file and one XSL file, then generate one output. It cannot do combin- ation among files.

Design 2 Solutions: Knowledge Enrichment Combine additional information with the XML file from XVIP before converting to SMIL Creating output files Use separate XSL files to generate RealText files Use separate XSL files to generate layout of the presentation and displaying order of objects in different regions, then combine them to a SMIL file Transformation

Knowledge Enrichment Transformation Combined XML file Information of major cities XML file from XVIP

Combined XML file XML file contains information of major cities that are related to the video. 香港 中國南部一個沿海城市 China 紐約 隸屬美國紐約州的城市 America Transformation

Create RealText files Geographical Information Biographical Information Video Transcript Transformation

Create SMIL file Transformation Layout Displaying order

Create SMIL file Transformation SMIL Presentation Combining the temporary files

Problems & Solutions Problem 1 The result from XSLT processor is in UTF-8 encoding format, but SMIL needs the format ANSI. Solution: Write a function “UTF8toANSI” for conversion. Problems & Solutions

Problem 2 XSLT has limitation. It can only read one XML, one XSL file and generate one output file. Our transformation process has more than one input files Solution: Do knowledge enrichment and produce a combined XML result file before creating the output files. Problems & Solutions

Conclusion XVIP contains: Four video information modalities Scene change detection VOCD Speech recognition Face detection Information integration module with XML For storing the extracted video data in XML format Conclusion

XML editor For editing the XML file generated Knowledge enrichment component For adding additional information to the XML- based video data XML to SMIL transformer For converting the XML-based video data into SMIL presentation Conclusion

XVIP : provides multiple functions for extracting video information stores video information in a flexible and scalable way Comprises a transformer to generate presentation on the information Paper “XVIP: An XML-Based Video Information Processing System”, Michael Lyu, Edward Yau, C.H.Ngai, P.W.Chan, was accepted by COMPSAC Conclusion

Q & A