Download presentation
Presentation is loading. Please wait.
Published byGavin Bridgwater Modified over 10 years ago
1
Multimedia Semantics – From MPEG-7 to Web 3.0 Jane Hunter jane@itee.uq.edu.au SAMT 2008
2
Agenda MPEG-7 MPEG-7 Ontologies Semantic Gap Approaches Web 2.0 Web 3.0 –New forms of multimedia –Hybrid classification –Weighted classifications Conclusions SAMT 2008
3
Multimedia Semantics How to find user-relevant multimedia Using search terms meaningful to the user Content-based search –Events – tries, coral bleaching –People - “Barak Obama” –Objects – astrocytoma, cancerous cells –Scenes – church scenes in “In Bruges” (Focus is not on metadata – format, creator, date Focus is not on query-by-example)
4
The Semantic Gap SAMT 2008
5
MPEG-7 Standard Multimedia Content Description Interface ISO/IEC standard by MPEG Objectives are to: –Standardize content-based descriptions for audiovisual information –Address a wide range of applications –Describe images, audio (speech, music), video, graphics, 3D models, compositions –Be independent of storage, coding, display, transmission, medium, or technology
6
MPEG-7 ComponentsDescriptors: (Syntax & semantics of feature representation) D7 D2 D5 D6 D4 D1 D9 D8 D10 extension Description Definition extension Language Definition 10101 1 0 Encoding & Delivery Tags <scene id=1> <time>.... <camera>.. <annotation Instantiation D3 Description Schemes (Relationships between Ds and DSs) D1 D3D2 D5D4D6 DS2 DS3 DS1 DS4 Structuring Video-on-Demand TV-Anytime
7
Multimedia DSs Datatype & structures Link & media localization Basic DSs Basic elements Navigation & Access Summary Variation Analytic Model Collection & Classification Content organization Content description Content management Creation & production MediaContent Usage User User preferences Conceptual aspects Structural aspects
8
Visual and Audio Descriptors
9
MPEG-7 Ontology Necessary in order to: Bridge the semantic gap Enhance discovery Add multimedia to the semantic web Enable machine reasoning both within and across multimedia content Enhance semantic interoperability Facilitate integration of multimedia content through common understandings
10
MPEG-7 Class Hierarchy
11
Colour Descriptors
12
Semantic Inferencing Architecture SAMT 2008 Ontologies – MPEG-7 + Domain-specific Define RuleML, SWRL rules
13
Rules-By-Example
14
SAMT 2008
16
Richer User-Centred Queries How does the mean catalyst size in electrodes of width < 20 microns effect electrode conductivity? Manufacturing Performance Microscopy Image analysis Annotations Inferencing
17
Upper Ontologies SAMT 2008 ABC Ontology MPEG-7 Ontology CIDOC CRM – Museum content FUSION – Fuel Cell images OBOE – Environmental images/video PhysicalEntity AbstractEntity InformationObject Event Place Time Agent
18
Related Approaches Low level feature extraction – still room for improvement Statistical classification methods –Cluster multimedia objects based on similarity Machine Learning/Black box approaches –Bayesian, Probabilistic models –Neural networks –Genetic algorithms –Decision tree learning SAMT 2008
19
Web 2.0, eScience -> New Content 3D – cultural, biomedical, nano-structural 4D – Scientific Visualizations Compound Objects (Provenance, Learning Objects) Access Grid, EVO Sessions Multi-sensor networks – webcams, sensors, remote sensing satellite imagery YouTube, PodCasts Video/Audio Blogs FaceBook, MySpace SAMT 2008
21
Indexing and Search for 3D Cultural Artefacts SAMT 2008
22
AccessGrid Session Recording & Playback
25
Web 2.0 Social networks/community participation - define information environments - drive the technologies - rank information resources - define own tags (folksonomies) Distributed Multimedia Collections
26
Metadata vs Tags Tags –Topical, relevant, adaptive, light-weight –Cheap – generated by masses –Inconsistent, inaccurate, won’t scale, flat structure –Systems don’t interoperate (TagCommons) Authoritative metadata –Complex, fixed, hierarchical structures –Don’t evolve, irrelevant, anachronistic terms –Very expensive –High quality
27
Annotations vs Tags Tags –Light-weight, unstructured, organic, folksonomies Annotations –Structured, schemas, controlled vocabs, ontologies –creator, date, type, content/description, context Ontology-based tags in RDF enable: – machine-processing of annotations –resources to become part of semantic web – enhanced discovery and reasoning
28
Collaborative 3D Object Annotation Semi-Automatic Indexing/Description (DC/MPEG-7) & Access/Rights/Traditional Care Content Viewer (e.g. MPEG-2, JPEG2000, 3D objects) Shared Annotation/ Discussion (Annotea)
29
System Architecture OAI Repository OAI Repository OAI Repository Web Search Interface Periodically Harvested Metadata OAI-PMH Community generating annotations Annotation Server OAI-PMH Harvested Metadata Store Periodically Harvested Annotations Augmented Metadata Institutional Repositories/Metadata Authenticated Annotation Service Shibboleth Web Search interface
30
Tag Clouds
32
Issues Quality Control on community-generated data Ontology-directed folksonomies How to identify tags to add to ontology –Tag convergence over time When, where to add them? SAMT 2008
33
Annotating Relationships Tag/Annotate a set of objects (or segments) Annotate one multimedia object with another multimedia object –Audio description of a photo Tag/Annotate relationships between objects or segments – label the relationship Kickstart greater semantic inferencing SAMT 2008
34
Web Video Server Secure Annotation Server Co-Annotea User Interface - attach, share, view, edit associations Web Browser + Plug-ins HTTP See this association between scenes in the Seven Samurai and Magnificent Seven Lecturer Students of Film and Media 101 Annotating Relationships
35
SAMT 2008
38
OAI-ORE - Complex Compound Objects PDF PS HTML MP3 OAI/ORE Named Graphs/Resource Maps: Define set of components Typed Relationships between components Different views of the compound object Metadata attached to compound object Publish as RDF Named Graph cites is_derived_from hasRepresentation View1.html hasRepresentation View2.smil Identifier URI DOI PURL http://arXiv.org/astro-ph/061175/ http://openarchives.org/ore
39
OAI-ORE Compound Objects SAMT 2008
40
Future -> Hybrid Approaches Traditional authoritative metadata – high quality, expensive, hierarchical, rigid Combine with: –automatic low-level feature extraction – colour, texture, shape, pitch –social tagging – topical, cheap, flat, non-conformist –machine learning/statistical – need corpus Assign weightings based on: –reliability of source/method –social network information - FOAF SAMT 2008
41
Web 3.0 – Social Web meets Semantic Web Photo in LOC VRA Metadata Library of Congress Curator Community Tags/Tag Cloud FlickrCommons MPEG-7 Ds - Colour histogram - Region shape -> label Image processing + rules Machine- learning, statistical methods Automatic Tags Aggregated description – that includes sources and weightings
42
Hybrid Architectures SAMT 2008 Collection of images assign tags Collective Intelligence Tagged Collection Corpus Machine Learning Method New photo Preprocessing Structured, weighted semantically rich machine-processable metadata Place (Geotags), Date Creator Format, Size, Regions, MPEG-7 Semantic tags e.g., Flickr SciVee Workflow Social tags
43
Weighted Multimedia Metadata Bibliographic: Title, Creator, Subject, Genre, Date_created Formatting: Format, encoding, storage, dimensions, hardware/software Structural: Regions, Segments Content: Description, Transcript Colours, Textures Shapes, Motion Pitch, Tempo, Volume Life History/Provenance: Events, Rights mgt. Semantics: People, place, objects, events, concepts Machine Learning Bayesian Neural Network Statistical Image Processing Audio Processing Weightings High Low-Medium – depends on Trust rating Medium Source Manual Automatic Manual Semi-Automatic Automatic
44
Hybrid Search and Browse Interfaces SAMT 2008 Tag Clouds + Ontology-based querying Combined keyword plus descriptor (e.g., shape) searches SPARQL + spatio-temporal search interfaces Give me sub-tropical rainforests in S-E Qld above 3000m with reduced rainfall and containing endangered species
45
Future Research Challenges I YouTube Flickr PodCasts Blogosphere YouTube PodCasts Challenges – 1. Identify relationships between resources on the Web (is_about) 2. Extract structured information/metadata/tags from unstructured sources (RDFa?) 3. Assign trust or reliability weighting based on the source - how to crawl blogs and identify discussions around particular videos - how to extract structured information about the video/image from a blog Increasing volumes of community-generated content + community generated metadata/annotations in multiple formats about
46
Future Research Challenges II Multimedia search engines that support: –Ontology-based searches Ontology is dynamic – incorporates folksonomies over time Combined with content-based, QBE, spatio-temporal searches –Dynamic inferencing – rules change over time –Identification of your social network and trust metrics –Ranking of results based on: tags assigned by users from same social networks weightings applied to metadata that reflect reliability, precision SAMT 2008
47
Acknowledgements Australia Research Council Dept of Innovation Science and Research NCRIS 5.16 Suzanne Little Laura Hollink Ronald Schroeter Imran Khan Ron Chernich Anna Gerber SAMT 2008
48
Questions? http://www.itee.uq.edu.au/~eresearch j.hunter@uq.edu.au SAMT 2008
Similar presentations
© 2025 SlidePlayer.com Inc.
All rights reserved.