Microdata and schema.org. Basics Microdata is a simple semantic markup scheme that’s an alternative to RDFa Microdata Developed by WHATWG and supported.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

DCMI Workshop on Metadata and Search Vendor Panel Presentation Bradley P. Allen
1 ICS-FORTH EU-NSF Semantic Web Workshop 3-5 Oct Christophides Vassilis Database Technology for the Semantic Web Vassilis Christophides Dimitris Plexousakis.
1 ICS-FORTH & Univ. of Crete SeLene November 15, 2002 A View Definition Language for the Semantic Web Maganaraki Aimilia.
Metadata vocabularies and ontologies Dr. Manjula Patel Technical Research and Development
February Harvesting RDF metadata Building digital library portals with harvested metadata workshop EU-DL All Projects concertation meeting DELOS.
Semantic Descriptions for RESTful Services SA-REST by Knoesis Service Research Lab Tomas Vitvar WSMO Phone Conference January 09,
OASIS OData Technical Committee. AGENDA Introduction OASIS OData Technical Committee OData Overview Work of the Technical Committee Q&A.
Steffen Staab 1WeST Web Science & Technologies University of Koblenz ▪ Landau, Germany Structured Data on the Web Introduction to.
HTML5, OVERVIEW AND NEW FEATURES PowerPoint by Mason O’Mara.
HTML5 ETDs Edward A. Fox, Sung Hee Park, Nicholas Lynberg, Jesse Racer, Phil McElmurray Digital Library Research Laboratory Virginia Tech ETD 2010, June.
A guide to HTML. Slide 1 HTML: Hypertext Markup Language Pull down View, then Source, to see the HTML code. Slide 1.
RDF Schemata (with apologies to the W3C, the plural is not ‘schemas’) CSCI 7818 – Web Technologies 14 November 2001 Van Lepthien.
The Semantic Web – WEEK 4: RDF
Natural Language and Speech Processing Professor: Dr. Christel Kemke Winter 2004 Knowledge Representation on the Semantic Web by Femi G. Olumofin.
XML Technology in E-Commerce
CS570 Artificial Intelligence Semantic Web & Ontology 2
RDFa: Embedding RDF Knowledge in HTML Some content from a presentation by Ivan Herman of the W3c, Introduction to RDFa, given at the 2011 Semantic Technologies.
4.01 How Web Pages Work.
RDF Tutorial.
HTML5, part I – markup Štěpán Developer Evangelist Microsoft, Czech Republic.
SEMANACCO - a web-based application for accommodation providers to easily create semantically annotated descriptions of their accommodations, based on.
An Introduction to XML Based on the W3C XML Recommendations.
MICRODATA IN HTML 5.0 Technologies for Web Application Development Martin Nečaský Department of Software Engineering, Faculty of Mathematics and Physics,
Embedding Knowledge in HTML Some content from a presentations by Ivan Herman of the W3c.
RDFa Lite. RDFa 1.1 Lite is a subset of RDFa 1.1 Five simple attributes: vocab, typeof, property, resource, and prefix Completely upwards compatible RDFa.
Sematic Web Microdata, Microformat and RDF Advanced Web-based Systems | Misbhauddin.
1 An Introduction To The Semantic Web. 2 Information Access on the Web Find an mp3 of a song that was on the Billboard Top Ten that features a cowbell.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
1 COS 425: Database and Information Management Systems XML and information exchange.
From SHIQ and RDF to OWL: The Making of a Web Ontology Language
CS 415 N-Tier Application Development By Umair Ashraf July 16,2013 National University of Computer and Emerging Sciences Lecture # 12 HTML/ XHTML/ HTML5.
Microdata for Dallas County Historical & Genealogical Cemetery Data Tony Hanson Webmaster 1.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
RDF: Concepts and Abstract Syntax W3C Recommendation 10 February Michael Felderer Digital Enterprise.
HTML 5 New Standardization of HTML. I NTRODUCTION HTML5 is The New HTML Standard, New Elements New Attributes Full CSS3 Support Video and Audio 2D/3D.
Microdata and schema.org. Basics Microdata is a simple semantic markup scheme that’s an alternative to RDFa Microdata Developed by WHATWG and supported.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
Aidministrator nederland b.v. Adding formal semantics to the Web Jeen Broekstra, Michel Klein, Stefan Decker, Dieter Fensel,
Chapter 6 Understanding Each Other CSE 431 – Intelligent Agents.
Practical RDF Chapter 1. RDF: An Introduction
Network of Excellence in Internet Science Network of Excellence in Internet Science (EINS) 2 nd REVIEW Brussels, 4-5 February 2014 FP7-ICT
1 Representing Data with XML September 27, 2005 Shawn Henry with slides from Neal Arthorne.
The Semantic Web Web Science Systems Development Spring 2015.
 “Micro Data (like RDFa and Microformats) is a form of semantic mark-up designed to describe elements on a web page e.g. review, person, event etc. This.
The LOM RDF binding – update Mikael Nilsson The Knowledge Management.
Michael Eckert1CS590SW: Web Ontology Language (OWL) Web Ontology Language (OWL) CS590SW: Semantic Web (Winter Quarter 2003) Presentation: Michael Eckert.
© Copyright 2008 STI INNSBRUCK HTML Data Guide W3C Interest Group Note 08 March 2012 OC Working Group –
Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 12 This presentation © 2004, MacAvon Media Productions Hypertext and Hypermedia.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Embedding Knowledge in HTML Some content from a presentations by Ivan Herman of the W3c.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
 Structured Data An Introduction to Semantic Web “It is very hard for search engines to understand the structure and semantics of data embedded in an.
PRACTICAL KNOWLEDGE REPRESENTATION FOR THE WEB Frank van Harmelen Dieter Fensel AIFB Kim Kangil Structural Complexity Laboratory.
Representing Data with XML February 26, 2004 Neal Arthorne.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
RDFa Primer Bridging the Human and Data webs Presented by: Didit ( )
Microdata in HTML 5.0 Web Applications Martin Nečaský Department of Software Engineering, Faculty of Mathematics and Physics, Charles University in Prague,
1 ITEC 4830 Graphics Design Chapter 1 Dreamweaver MX 2004 Basics.
HTML5 Structure. Slide 2 Introduction Introduce the purpose of the new HTML5 semantic tags Introduce the HTML5 outlining mode Using semantic metadata.
Agenda * What is HTML5 -- Its history and motivation * HTML/XHTML as Human / Machine Readable Format * HTML and its related technologies * Brief summary.
Embedding Knowledge in HTML
RDFa How and Why Ralph R. Swick World Wide Web Consortium
Microdata and schema.org
Embedding Knowledge in HTML
Making data discoverable through mainstream search engines
Microdata and schema.org
RDA Community and linked data
Embedding Knowledge in HTML
Presentation transcript:

Microdata and schema.org

Basics Microdata is a simple semantic markup scheme that’s an alternative to RDFa Microdata Developed by WHATWG and supported by major search companies (Google, Microsoft, Yahoo, Yandex) Like RDFa, it uses HTML tag attributes to host metadata Vocabularies are controlled and hosted at schema.org schema.org

What is WHATWG? Web Hypertext Application Technology Working Group Web Hypertext Application Technology Working Group – Community interested in evolving the Web with focus on HTML and Web API development – Ian Hickson is a key person, now at Google Ian Hickson Founded in 2004 by individuals from Apple, Mozilla and Opera after a W3C workshop – Concern about W3C's embrace of XHTML Current work on HTML5HTML5 Developed Microdata specMicrodata

HTML5 Started by WHATWG as an alternative to XHTML, joined by W3C – A W3C candidate recommendation in 2012 (draft)draft – WHATWG will evolve it as a “living standard” HTML5 ≈ HTML + CSS + js Native support for graphics, video, audio, speech, semantic markup, … Current partial support in major browsers & extensions

HTML taxonomy and status

Microdata The microdata effort has two parts: – A markup scheme – A set of vocabularies/ontologies The markup is similar to RDFa in providing ways to identify subjects, types, properties & objects There’s also a standard way to encode microdata as RDFa The sanctioned vocabularies are found at schema.org and include a small number of very useful ones: people, movies, etc. schema.org

An example Avatar Director: James Cameron (born 1954) Science fiction Trailer

An example: itemscope An itemscope attribute identifies a content subtree that is the subject about which we want to say something Avatar Director: James Cameron (born 1954) Science fiction Trailer

An example: itemtype An itemscope attribute identifies a content subtree that is the subject about which we want to say something The itemtype attribute specifies the subject’s type Avatar Director: James Cameron (born 1954) Science fiction Trailer

Microdata RDF

Microdata RDF

An example: itemtype An itemscope attribute identifies a content subtree that is the subject about which we want to say something The itemtype attribute specifies the subject’s type Avatar Director: James Cameron (born 1954) Science fiction Trailer [ ] a schema:Movie.

An example: itemprop An itemscope attribute identifies a content subtree that is the subject about which we want to say something The itemtype attribute specifies the subject’s type An itemprop attribute gives a property of that type Avatar Director: James Cameron (born 1954) Science fiction Trailer

An example: itemprop An itemscope attribute identifies a content subtree that is the subject about which we want to say something The itemtype attribute specifies the subject’s type An itemprop attribute gives a property of that type Avatar Director: James Cameron (born 1954) Science fiction Trailer [ ] a schema:Movie ; schema:genre "Science fiction" ; schema:name "Avatar" ; schema:trailer. [ ] a schema:Movie ; schema:genre "Science fiction" ; schema:name "Avatar" ; schema:trailer.

An example: embedded items An itemprop immediately followed by another itemcope makes the value an object Avatar Director: James Cameron (born 1954 ) Science fiction Trailer

An example: embedded items An itemprop immediately followed by another itemcope makes the value an object Avatar Director: James Cameron (born 1954 ) Science fiction Trailer [ ] a schema:Movie ; schema:director [ a schema:Person ; schema:birthDate "1954" ; schema:name "James Cameron" ] ; schema:genre "Science fiction" ; schema:name "Avatar" ; schema:trailer. [ ] a schema:Movie ; schema:director [ a schema:Person ; schema:birthDate "1954" ; schema:name "James Cameron" ] ; schema:genre "Science fiction" ; schema:name "Avatar" ; schema:trailer.

schema.org vocabulary Full type hierarchy in one fileone file 548 classes, 711 properties (5/4/14) Data types: Boolean, Date, DateTime, Number (Float, Integer) Text (URL), Time Objects: Rooted at Thing with two ‘metaclasses’ (Class and Property) and eight subclasses

Microdata as a KR language More than RDF, less than RDFS Properties have an expected type (range) – Might be a string – A list of types, any of which are OK Properties attached ≥ 1 types (domain) Classes can have multiple parents and inherit (properties) from all of them No axioms (e.g., disjointness, cardinality, etc.) No subPropertyOf like relation

Mixing vocabularies Microdata is intended to work with just one vocabulary – the one at schema.org Advantages – Simple, organized, well designed – Controlled by the schema.org people Disadvantages: too simple, controlled – Too simple, narrow, mono-lingual – Controlled by the schema.org people Schema.rdfs.org defines mappings between schema.org and popular RDF ontologies

Schema RDF

Extending the schema.org ontology You can subclass existing classes – Person/Engineer – Person/Engineer/ElectricalEngineer Subclass exisiting properties – musicGroupMember/leadVocalist – musicGroupMember/leadGuitar1 – musicGroupMember/leadGuitar2

Extension Problems Do agreed upon meaning – Through axioms supported by the language (e.g., equivalence, disjointness, etc.) – No place for documentation (annotations, labels, comments) Without a namespace mechanism, your Person/Engineer and mine can be confused and might mean different things

Serialization Schema.org has a data model and serializationsdata model – Microdata is the original, native sterilization – RDFa is more expressive and works with the RDF stack – Everyone agrees that RDFa Lite is a good encoding: as simple as Microdata but more expressive – JSON-LD is also an accepted encoding Search engines look for Microdata and RDFa encodings and are beginning to look for JSON- LD Schema.org considers RDFa to be the “canonical machine representation of schema.org”

Microdata is a good effort by the search companies to use a simple semantic language The semantics is pragmatic – e.g., expected types: A string is accepted where a thing is expected – “some data is better than none” The real value is in – the supported vocabularies and – their use by Search companies => Immediate motivation for using semantic markup Conclusions