The Automatic Generation of Formal Annotations in a MultiMedia Indexing and Searching Environment Thierry Declerck DFKI GmbH Annotation Workshop, DI, 15.

Presentation transcript:

The Automatic Generation of Formal Annotations in a MultiMedia Indexing and Searching Environment
Thierry Declerck, DFKI GmbH
Annotation Workshop, DI, 15 February 2002

The MUMIS Consortium
- CTIT, University of Twente, Enschede, NL: NLP/IE
- TSI, University of Nijmegen, Nijmegen, NL: ASR
- DFKI, Saarbrücken, D: NLP/IE
- MPI, Nijmegen, NL: MM Archives
- DCS, University of Sheffield, UK: NLP/IE
- ESTEAM, Gothenburg, SE (location Athens, GR): Translation Software
- VDA, Hilversum, NL: Video

Objectives of MUMIS
- Technology development to automatically index (with formal annotations) lengthy multimedia recordings (off-line process): find and annotate relevant events, together with the involved entities and relations. Also detect metadata information.
- Technology development to exploit indexed multimedia archives (on-line process): search for interesting scenes and play them via the Internet.
- Test domain: soccer games / UEFA Tournament 2000

Off-line Task: Indexing
- Automatic Speech Recognition (radio/TV broadcasts): automatically transforms the speech signals into text (for three languages: Dutch, English and German).
- Natural Language Processing (Information Extraction): analyse all available textual documents (newspapers, speech transcripts, tickers, formal texts...), identify and extract interesting entities, relations and events. Also detect metadata information.
- Merging all the annotations produced so far.
- Create a database with formal annotations.
- Use video processing to adjust time marks.

Gain
- What gets lost? Is it necessary?
- Potential: direct Internet service, fewer dependencies
[Diagram comparing Current Procedure and MUMIS Procedure; labels in the figure: Manual Video Annotation; Automatic Video Annotation and DB Integration; Integration; Central DB; Query via PC; Results on PC and Select & Play; Contact Video Archive; Get Video Tapes; Search on Tape on VCR; Segment & Play]

The Generation of Formal Annotations
- Metadata (type of game, teams, date, final score, players etc.), as they can be used, among other things, for classifying and filtering videos in the MM digital archive
- Events (particular actions with time codes, involved entities and related events), as they can be extracted from the video sequences
- All formal annotations available in the XML standard

The Event Table
- Related to domain ontology and multilingual terminology
- Guiding the generation of formal annotations

Event | ID | Time | Subcat/Modification | Metadata
Goal | # | 0 < t < pen. | Subj=pl, I.obj=team, score=s, … | Order of goal
Red Card | # | 0 < t < 120 | Subj=ref, I.obj=pl, cause=c, … | Team (red at t)
Substitution | # | 0 < t < 120 | Subj=pl, I.obj=pl, cause=c, … | Team (adding pl)
Dribbling | # | 0 < t < 120 | Subj=pl, loc=loc, … |
Shot on Goal | # | 0 < t < 120 | Subj=pl, loc=loc, cons=cons, … |
Final whistle | # | 90 < t < 120 | Subj=referee, score etc… | Final score
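One way to picture how such a table can guide annotation is as a lookup of event types, each with a time window and a set of admissible slots. The sketch below is only an illustration under assumptions: the class name `EventType`, the method `accepts`, the lower-case slot names, and the choice of inclusive minute bounds are hypothetical and not taken from the MUMIS system.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class EventType:
    """One row of the event table (hypothetical representation)."""
    name: str
    min_minute: int
    max_minute: Optional[int]  # None = no fixed upper bound (e.g. penalties)
    slots: Tuple[str, ...]     # admissible subcategorisation slots

    def accepts(self, minute, filled_slots):
        # Assumption: bounds are treated as inclusive minute values.
        in_window = self.min_minute <= minute and (
            self.max_minute is None or minute <= self.max_minute
        )
        # Every filled slot must be one the event type declares.
        return in_window and set(filled_slots) <= set(self.slots)


EVENT_TABLE = {
    "Goal": EventType("Goal", 0, None, ("subj", "iobj", "score")),
    "Red Card": EventType("Red Card", 0, 120, ("subj", "iobj", "cause")),
    "Final whistle": EventType("Final whistle", 90, 120, ("subj", "score")),
}

# A final whistle reported for minute 45 would be rejected as implausible.
plausible = EVENT_TABLE["Final whistle"].accepts(93, {"subj": "referee"})
implausible = EVENT_TABLE["Final whistle"].accepts(45, {"subj": "referee"})
```

A check of this kind could be applied before an extracted event is written to the annotation database.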

Off-line Task
[Figure: events indexed in the video recording; labels in the figure: Freekick, Goal, Pass, Defense, Foul, Dribbling; time marks 17 min, 18 min, 24 min, 28 min; players Scholl, Basler, Campbell, Matthäus, Neville; distances 25 m, 60 m; score 1:0]
Sources:
- Audio commenting (TV, radio), 3 languages
- Newspaper texts, 3 languages
- Tickers etc., 3 languages
Multilingual IE => event tables:
- Event = goal; Player = Basler; Dist. = 25 m; Time = 18; Score = 1:0
- Event = goal; Type = Freekick; Player = Basler; Dist. = 25 m; Time = 17; Score: leading
- Event = goal; Player = Basler; Team = Germany; Time = 18; Score = 1:0; Final score = 1:0
Merging of annotations, result:
- Event = goal; Type = Freekick; Player = Basler; Team = Germany; Time = 18; Score = 1:0; Final score = 1:0; Distance = 25 m
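The merging step on this slide can be imitated with a small sketch. The slide does not say how conflicting values (Time = 17 vs. 18, Score = leading vs. 1:0) are resolved; the majority vote used here is an assumption for illustration, as are the names `ALIASES`, `normalize` and `merge`.

```python
from collections import Counter

# Hypothetical aliases so that differently spelled slots share one key.
ALIASES = {"Dist.": "Distance", "Finalscore": "Final score"}


def normalize(annotation):
    """Rename aliased slot names to their canonical form."""
    return {ALIASES.get(key, key): value for key, value in annotation.items()}


def merge(annotations):
    """Merge event annotations slot by slot, keeping the majority value."""
    votes = {}
    for annotation in map(normalize, annotations):
        for key, value in annotation.items():
            votes.setdefault(key, Counter())[value] += 1
    # most_common(1) returns the most frequent value; ties go to the
    # value that was seen first.
    return {key: counter.most_common(1)[0][0] for key, counter in votes.items()}


sources = [
    {"Event": "goal", "Player": "Basler", "Dist.": "25 m", "Time": 18, "Score": "1:0"},
    {"Event": "goal", "Type": "Freekick", "Player": "Basler", "Dist.": "25 m",
     "Time": 17, "Score": "leading"},
    {"Event": "goal", "Player": "Basler", "Team": "Germany", "Time": 18,
     "Score": "1:0", "Finalscore": "1:0"},
]
merged_event = merge(sources)
```

With the three source annotations from the slide, this yields the same slot values as the merged annotation shown there; a real merger would also consult the domain ontology rather than rely on voting alone.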

The Role of IE in MUMIS
- Information Extraction (IE) is the task of identifying, collecting and normalizing relevant information for a specific application or user.
- The relevant information is typically represented in the form of predefined templates, which are filled by means of Natural Language (NL) analysis (Template = Event Table in MUMIS).
- IE combines pattern matching mechanisms, (shallow) NLP and domain knowledge (terminology and ontology).

Extension of our IE system in MUMIS
- Multilingual and multisource IE
- Incremental information building
- Cross-document co-reference resolution
- Combine metadata and event extraction => better organisation and dynamic updating of information (KM)
- Multiple presentation of results: template, event table, integration in MPEG-7 XML, and hyperlinks (Named Entities, related to Knowledge Management)

The DFKI Implementation
- Based on the XML output of SPPC (developed at DFKI)
- Mapping the XML into a feature structure (the CorpA/schug program)
- Cascaded grammar descriptions for enriching (or correcting) the SPPC output, including agreement processing and detection of grammatical functions
- Adapting the Paradime triangle for template generation and filling

The Paradime Scenario

Information Extraction
IE is generally subdivided into the following tasks:
- Named Entity task (NE)
- Template Element task (TE)
- Template Relation task (TR)
- Scenario Template task (ST)
- Co-reference task (CO)

Subtasks of IE
- Named Entity task (NE): Mark in the text each string that represents a person, organization, or location name, a date or time, or a currency or percentage figure.
- Template Element task (TE): Extract basic information related to organization, person, and artifact entities, drawing evidence from everywhere in the text.

Subtasks of IE (2)
- Template Relation task (TR): Extract relational information on employee_of, manufacture_of, location_of relations etc. (TR expresses domain-independent relationships).
- Scenario Template task (ST): Extract pre-specified event information and relate the event information to particular organization, person, or artifact entities (ST identifies domain- and task-specific entities and relations).
- Co-reference task (CO): Capture information on co-referring expressions, i.e. all mentions of a given entity, including those marked in NE and TE.

IE applied to soccer
Terms as descriptors for the NE task:
- Team: Titelverteidiger Brasilien, den respektlosen Außenseiter Schottland
- Player: Superstar Ronaldo, von Bewacher Calderwood noch von Abwehrchef Hendry, von Jackson als drittem Stürmer, Torschütze Cesar, von Roberto Carlos (16.)
- Referee: vom spanischen Schiedsrichter Garcia Aranda
- Trainer: Schottlands Trainer Brown, Kapitän Hendry seinen Keeper Leighton
- Location: im Stade de France von St. Denis (a more fine-grained location detection would be Stadion: im Stade de France and City: von St. Denis)
- Attendance: Vor Zuschauern

IE applied to soccer (2)
Terms for the NE task:
- Time: in der 73. Minute, nach gerade einmal 3:50 Minuten, von Roberto Carlos (16.), nach einer knappen halben Stunde, scheiterte Rivaldo (49./52.) jeweils nur knapp, das vor der Pause Versäumte versuchten die Brasilianer nach Wiederbeginn, ...
- Date: am Mittwoch, der Turnierstart (?), im WM-Eröffnungsspiel (?)
- Score/Result: Brasilien besiegt Schottland 2:1, einen 2:1 (1:1)-Sieg, der zwischenzeitliche Ausgleich, in der 4. Minute in Führung gebracht, köpfte zum 1:0 ein
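Some of the time expressions listed here follow surface patterns that a simple pattern matcher can pick up. The sketch below covers only two of them ("in der N. Minute" and bracketed minute marks such as "(16.)" or "(49./52.)"); the patterns and the function name `extract_minutes` are illustrative assumptions, not the grammar used in the DFKI system, and vaguer expressions such as "nach einer knappen halben Stunde" are out of its reach.

```python
import re

# Two hypothetical surface patterns for minute expressions in German reports.
MINUTE_PATTERNS = [
    re.compile(r"in der (\d{1,3})\. Minute"),           # "in der 73. Minute"
    re.compile(r"\((\d{1,3}\.(?:/\d{1,3}\.)*)\)"),      # "(16.)", "(49./52.)"
]


def extract_minutes(text):
    """Return all game minutes matched by the patterns, in ascending order."""
    minutes = []
    for pattern in MINUTE_PATTERNS:
        for match in pattern.finditer(text):
            # A bracketed group may hold several minutes separated by "/".
            minutes.extend(int(n) for n in re.findall(r"\d+", match.group(1)))
    return sorted(minutes)


sample = "scheiterte Rivaldo (49./52.) jeweils nur knapp, in der 73. Minute"
found = extract_minutes(sample)
```

Note that a half-time score such as "(1:1)" is deliberately not matched, because the bracket pattern requires each number to be followed by a full stop.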

IE applied to soccer (3)
Relations for the TR task:
- Opponents: Brasilien besiegt Schottland, feierte der Top-Favorit... einen glücklichen 2:1 (1:1)-Sieg über den respektlosen Außenseiter Schottland
- Player_of: hatte Cesar Sampaio den vierfachen Weltmeister... in Führung gebracht, Collins gelang... der zwischenzeitliche Ausgleich für die Schotten, der Keeper des FC Aberdeen, Brasiliens Keeper Taffarel
- Trainer_of: Schottlands Trainer Brown...

IE applied to soccer (4)
Events for the ST task:
- Goal: in der 4. Minute in Führung gebracht, das schnellste Tor... markiert, Cesar Sampaio köpfte zum 1:0 ein, Collins (38.) verwandelte den Strafstoß, hätte Kapitän Hendry seinen Keeper Leighton um ein Haar zum zweiten Mal bezwungen, von dem der Ball ins Tor prallte
- Foul: als er den durchlaufenden Gallacher im Strafraum allzu energisch am Trikot zog
- Substitution: und mußte in der 59. Minute für Crespo Platz machen...

IE applied to soccer (5)
Description of the templates:
- Team: team-template: TACTIC [ ], SCORE [ ], NAME [ ], PLAYER [ ], TRAINER [ ]
- goal-template: TIME [ ], SCORE [S], PLAYER [P], TEAM [team-templ], TYPE [ ], SUCCESS [ ]
- team-template (linked from the goal-template): TACTIC [ ], SCORE [S], NAME [ ], PLAYER [P], TRAINER [ ]
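The shared markers S and P in these templates suggest that the score and player of a goal are also values of the linked team template. A minimal sketch of that idea with plain data classes follows; the class names, field names and the helper `fill_goal` are assumptions for illustration and do not reproduce the feature-structure formalism used in the actual system.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class TeamTemplate:
    name: Optional[str] = None
    tactic: Optional[str] = None
    score: Optional[str] = None
    players: List[str] = field(default_factory=list)
    trainer: Optional[str] = None


@dataclass
class GoalTemplate:
    time: Optional[int] = None
    score: Optional[str] = None
    player: Optional[str] = None
    team: Optional[TeamTemplate] = None
    type: Optional[str] = None
    success: Optional[bool] = None


def fill_goal(goal, team):
    """Link a goal to its team and propagate the shared values S and P."""
    goal.team = team
    if goal.score is not None:
        team.score = goal.score              # shared value S
    if goal.player is not None and goal.player not in team.players:
        team.players.append(goal.player)     # shared value P
    return goal


# Values taken from the example phrases: goal in minute 4, header to 1:0.
brazil = TeamTemplate(name="Brasilien")
goal = fill_goal(GoalTemplate(time=4, score="1:0", player="Cesar Sampaio"), brazil)
```

Filling the goal template thus updates the team template as a side effect, which is one simple way to read the co-indexed slots.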

Merging Component
- Acting on the generated formal annotations (metadata and events), but also interleaving with the generation process of those
- Checking consistency, eliminating redundancy (template merging), in accordance with the domain ontology
- Completing the information with domain knowledge, inference machine

Use of Standards
- XML as the annotation language and data interchange format
- MPEG-7: standard for the description of features of multimedia content, XML compliant (for content description), with a slot for textual annotations

More about MPEG (Moving Picture Experts Group)
- MPEG-1: for the storage and retrieval of moving pictures and audio on storage media
- MPEG-2: for digital television
- MPEG-4: codes content as objects and enables those objects to be manipulated
- MPEG-7: where MPEG-1, -2 and -4 make content available, MPEG-7 allows one to find the content one needs

On-line Tasks: Searching and Displaying
- Search for interesting events with formal queries. Example: "Give me all goals from Overmars shot with his head in the first half." Formal query: Event=Goal; Player=Overmars; Time<=45; Previous-Event=Headball
- Indicate hits by thumbnails & let the user select a scene
- Play the scene via the Internet & allow scrolling
- Of course: slow motion, fast play, start/stop, etc.
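A formal query of the kind shown here can be read as a conjunction of slot conditions. The sketch below parses that notation and filters a list of event annotations; the function names `parse_query` and `matches`, the supported operators, and the sample events are assumptions for illustration and say nothing about the actual MUMIS query engine.

```python
import operator
import re

# Supported comparison operators (assumed subset).
OPS = {"<=": operator.le, ">=": operator.ge, "=": operator.eq}
CONDITION = re.compile(r"^\s*([\w-]+)\s*(<=|>=|=)\s*(.+?)\s*$")


def parse_query(query):
    """Turn 'Slot=Value; Slot<=N' into a list of (slot, operator, value)."""
    conditions = []
    for part in query.split(";"):
        if not part.strip():
            continue
        match = CONDITION.match(part)
        if match is None:
            raise ValueError(f"cannot parse condition: {part!r}")
        key, op, raw = match.groups()
        value = int(raw) if raw.isdigit() else raw  # numbers compare numerically
        conditions.append((key, OPS[op], value))
    return conditions


def matches(event, conditions):
    """True if the event has every queried slot and satisfies each condition."""
    return all(key in event and op(event[key], value)
               for key, op, value in conditions)


# Hypothetical annotations, invented only to exercise the filter.
events = [
    {"Event": "Goal", "Player": "Overmars", "Time": 30, "Previous-Event": "Headball"},
    {"Event": "Goal", "Player": "Overmars", "Time": 70, "Previous-Event": "Headball"},
    {"Event": "Foul", "Player": "Overmars", "Time": 12},
]
query = parse_query("Event=Goal; Player=Overmars; Time<=45; Previous-Event=Headball")
hits = [event for event in events if matches(event, query)]
```

Only the first sample event satisfies all four conditions; in a deployed system the same conditions would more likely be translated into a database query.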

On-line Tasks: Knowledge Guided User Interface & Search Engine
[Screenshot of the user interface; labels in the figure: game selection (München - Ajax 1998, München - Porto 1996, Deutschland - Brasilien 1998); "Play Movie Fragment of that Game"; events Freekick, Goal, Pass, Defense, Foul, Dribbling; time marks 17 min, 18 min, 24 min, 28 min; players Kohler, Basler, Matthäus, Wörns, Bierhoff; distances 25 m, 60 m]

On-line SW Architecture
[Architecture diagram; labels in the figure: Client (Applet, JMF); WWW Server; Java Server; Media Servers (MPEG1); DB Server (rDBMS); File Server; protocols HTTP, RMI, (RTP, RTSP), JDBC; Client Objects, Hit Rendering Objects, Media Server Objects, Query Engine Objects; Metadata, Annotations, Keyframes, MPEG Movies, Lexica, Ontology]
- Query interface: pre-selection guided by domain knowledge; interactive, visual feedback
- Server structure: fully distributed; JMF media presentation; RMI-based interaction

On-line HW Architecture
- Efficient & reliable storage management (near-line capacity, media change, second location)
- High storage capacity (n TB; 1 h MPEG1 = 1 GB)
- Powerful media servers / powerful network
[Hardware diagram; labels in the figure: RAID, Tape Library, FC Switch, Media Server, GB Switch, Internet, 1 Gbps, Gb-Switch, Router]

Acknowledgements
- UEFA
- DFB, FA, KNVB
- EBU, WDR, NOS, SWR