Mining the web to improve semantic-based multimedia search and digital libraries

Slides:



Advertisements
Similar presentations
Modelling web resources Ketil Albertsen, Paradigma project National Library of Norway.
Advertisements

26/10/2008 SWESE'08 1 Enhanced Semantic Access to Software Artefacts Danica Damljanović and Kalina Bontcheva.
Distributed search for complex heterogeneous media Werner Bailer, José-Manuel López-Cobo, Guillermo Álvaro, Georg Thallinger Search Computing Workshop.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Third-generation information architecture November 4, 2008.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Advanced Distributed Learning. Conditions Before SCORM  Couldn’t move courses from one Learning Management System to another  Couldn’t reuse content.
AceMedia Personal content management in a mobile environment Jonathan Teh Motorola Labs.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
Annotating Documents for the Semantic Web Using Data-Extraction Ontologies Dissertation Proposal Yihong Ding.
1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System Supervisor: Prof Michael Lyu Presented by: Lewis Ng,
Metadata Presentation by Rick Pitchford Chief Engineer, School of Communication COM 633, Content Analysis Methods Fall 2009.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
MUSCLE WP9 E-Team Integration of structural and semantic models for multimedia metadata management Aims: (Semi-)automatic MM metadata specification process.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
ACCESS TO QUALITY RESOURCES ON RUSSIA Tanja Pursiainen, University of Helsinki, Aleksanteri institute. EVA 2004 Moscow, 29 November 2004.
ASIDIC Spring Conference ‘Smart Content’ Uncovering the Value and Benefits of Semantic Technology Richard C. Fusco Director, Content Strategy – McGraw-Hill.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
A Scalable Framework for the Collaborative Annotation of Live Data Streams Thesis Proposal Tao Huang
1 Samson Cheung EE 639, Fall 2004 Lecture 1: Applications & Trends Multimedia Information Systems advent: open communicator browser, screen cam, hari’s.
DAISY AND DEVELOPING COUNTRIES PERSPECTIVE BY DIPENDRA MANOCHA.
1 Seminar Presentation Multimedia Audio / Video Communication Standards Instructor: Dr. Imran Ahmad By: Ju Wang November 7, 2003.
Semantic Publishing Update Second TUC meeting Munich 22/23 April 2013 Barry Bishop, Ontotext.
Mining the Semantic Web: Requirements for Machine Learning Fabio Ciravegna, Sam Chapman Presented by Steve Hookway 10/20/05.
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
PLATFORM INDEPENDENT SOFTWARE DEVELOPMENT MONITORING Mária Bieliková, Karol Rástočný, Eduard Kuric, et. al.
Funded by: European Commission – 6th Framework Project Reference: IST WP 2: Learning Web-service Domain Ontologies Miha Grčar Jožef Stefan.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
The PrestoSpace Project Valentin Tablan. 2 Sheffield NLP Group, January 24 th 2006 Project Mission The 20th Century was the first with an audiovisual.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
LESLIE ESPINOZA OCTOBER 10, 2013 PERIOD 1 History Of Multimedia.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal VideoConference Archives Indexing System.
21/05/'07 upd 06/05/08CmpE 588 Spring 2008 EMU1 Semantic Technology Application Show Cases Atilla ELÇİ Dept. of Computer Engineering Eastern Mediterranean.
Topic Rathachai Chawuthai Information Management CSIM / AIT Review Draft/Issued document 0.1.
E-learning: an overview Michael Rowe Department of Physiotherapy.
Web-Assisted Annotation, Semantic Indexing and Search of Television and Radio News (proceedings page 255) Mike Dowman Valentin Tablan Hamish Cunningham.
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
1 CS 430: Information Discovery Lecture 19 User Interfaces.
Majid Sazvar Knowledge Engineering Research Group Ferdowsi University of Mashhad Semantic Web Reasoning.
OWL Representing Information Using the Web Ontology Language.
1 Applications of video-content analysis and retrieval IEEE Multimedia Magazine 2002 JUL-SEP Reporter: 林浩棟.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
1 Language Technologies (2) Valentin Tablan University of Sheffield, UK ACAI 05 ADVANCED COURSE ON KNOWLEDGE DISCOVERY.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Reviews Crawler (Detection, Extraction & Analysis) FOSS Practicum By: Syed Ahmed & Rakhi Gupta April 28, 2010.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Semantic Web COMS 6135 Class Presentation Jian Pan Department of Computer Science Columbia University Web Enhanced Information Management.
 digital methodologies for global media research Randy Kluver Dept of Communication Texas A&M University.
Slide no 1 Cognitive Systems in FP6 scope and focus Colette Maloney DG Information Society.
© 1990—2006 Visual Knowledge Software® | Private and Confidential | 2 Semantic Agent Wikis For Engineering.
TextOre Energy Analytics Applying Text Mining Solutions Toward Extraction of Energy Related Data from Local Records.
©2003 Paula Matuszek CSC 9010: AeroText, Ontologies, AeroDAML Dr. Paula Matuszek (610)
LREC – Workshop on Crossing media for Improved Information Access, Genova, Italy, 23 May Cross-Media Indexing in the Reveal-This System Murat Yakici,
Using Human Language Technology for Automatic Annotation and Indexing of Digital Library Content Kalina Bontcheva, Diana Maynard, Hamish Cunningham, Horacio.
Towards an Infrastructure for the Synchronisation of Library Metadata Christoph Böhme | 14 | Metadata Synchronisation| 28 November
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
ESWC 2005, Crete, Greece Semantically Enhanced Television News through Web and Video Integration Multimedia and the Semantic Web workshop Borislav PopovMike.
Multimedia Semantic Analysis in the PrestoSpace Project Valentin Tablan, Hamish Cunningham, Cristian Ursu NLP Research Group University of Sheffield Regent.
Multi-Source Information Extraction Valentin Tablan University of Sheffield.
Data mining in web applications
Digital Video Library - Jacky Ma.
Supervisor: Prof Michael Lyu Presented by: Lewis Ng, Philip Chan
SEmantic Knowledge Technology
Peggy van der Kreeft Deutsche Welle
Content Augmentation for Mixed-Mode News Broadcasts Mike Dowman
Presentation transcript:

Mining the web to improve semantic-based multimedia search and digital libraries Horacio Saggion Kalina Bontcheva University of Sheffield 21 November 2006 IST Event 2006 Web Mining and Semantic Web: Networking with industry and academia [This work has been partially supported by SEKT ( PrestoSpace ( andhttp://sekt.semanticweb.org/ TAO ( projects]

2(9) Web mining and semantic annotation: why? Semantic annotation produces explicit representation of knowledge, given content –Knowledge is often implicit in the data sources –…or hard to extract automatically to a sufficient accuracy Frequently knowledge can be mined from the web and merged with the original content to improve semantic search and reasoning capabilities

3(9) Web mining and semantic annotation: how? GATE is a widely used open-source infrastructure for text mining ( –Ten years old, with 1000s of users at 100s of sites –Supports major document formats and languages –Helps build semantic annotation components –Integrate these with content and knowledge mined from the web –Create, test, and deploy these into an end-to-end application (some examples next)

4(9) RichNews: Multimedia Annotation The problem: –Access to archive material in the BBC is provided by some form of semantic annotation and indexing –Manual annotation is time consuming (up to 10x real time) and expensive Rich News (developed within the Prestospace project) aims to (partially) automate the annotation of news programs –Developed on BBC TV and radio news –Involving human in the loop is possible if desired Recordings of broadcasts go in one end Index of semantic metadata describing each news story comes out the other

5(9) Web mining in RichNews Why web mining: –Speech recognition produces poor quality transcripts with many mistakes –Closed captions/subtitles not always available –These news stories can also be found on the BBC and other web sites The solution: –Obtain key terms from the ASR transcripts –Search the web for related stories from same date –Find best matching stories –Obtain semantic annotations from this richer text –Merge with semantic annotations on transcript to obtain more precise knowledge, grounded in the video stream

6(9) RichNews Example

7(9) TAO – Augmenting Software Artefacts with Semantics TAO project – Transitioning Applications to Ontologies Case study on augmenting software artefacts with semantics Learning ontologies from multiple software artefacts Knowledge about a software project often spread across different sources on the web: –Source code, discussion messages, bug descriptions, documentation

8(9) New Challenges Moving towards mining and semantically annotating Web 2.0 –Opinion mining from blogs and discussion forums –Mining wikis –Social network analysis Mining multimedia content Initial experiments in ongoing projects, but we need further work on these emerging social-oriented web

9(9) Thank you! These slides: Further details: –RichNews: assisted-annotation.pdf assisted-annotation.pdf –SEKT: –TAO: