SKOS Simple Knowledge Organization System Antoine Isaac Dublin Core tutorial, Sept. 21, 2011.

Slides:



Advertisements
Similar presentations
Using SKOS in practice, with examples from the classification domain
Advertisements

1/ 26 AGROVOC and the OWL Web Ontology Language: the Agriculture Ontology Service - Concept Server OWL model NKOS workshop Alicante,
Controlled Vocabularies in TELPlus Antoine ISAAC Vrije Universiteit Amsterdam EDLProject Workshop November 2007.
Interoperability Aspects in Europeana Antoine Isaac Workshop on Research Metadata in Context 7./8. September 2010, Nijmegen.
A Semantic Web View on Concepts and their Alignments Antoine Isaac Vrije Universiteit Amsterdam Europeana Concepts in Context, Köln, July 19 th 2010.
6. Applying metadata standards: Controlled vocabularies and quality issues Metadata Standards and Applications Workshop.
Semantic Web and Linked Data for cultural heritage materials Approaches in Europeana Antoine Isaac Vrije Universiteit Amsterdam Europeana DANS Linked Data.
Thesauri, Terminologies and the Semantic Web
Margherita Sini, FAO 1/ FAO projects in the area of the Semantic Technologies 23rd APAN Meeting Manila, Philippines
Standards for networked knowledge organisation systems Ron Davies European Library Automation Group Bucharest, April 2006.
SKOS and Linked Data Antoine Isaac ISKO, London, Sept. 14th 2010.
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
A web-based repository service for vocabularies and alignments in the Cultural Heritage domain Lourens van der Meij Antoine Isaac Claus Zinn.
Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac
On practical aspects of enhancing semantic interoperability using SKOS and KOS alignment Antoine ISAAC Vrije Universiteit Amsterdam National Library of.
Aligning Thesauri for an integrated Access to Cultural Heritage Collections Antoine ISAAC (including slides by Frank van Harmelen) STITCH Project UDC Conference.
Vocabulary Matching for Book Indexing Suggestion in Linked Libraries – A Prototype Implementation & Evaluation Antoine Isaac, Dirk Kramer, Lourens van.
A Registry for controlled vocabularies at the Library of Congress
1 Languages for aboutness n Indexing languages: –Terminological tools Thesauri (CV – controlled vocabulary) Subject headings lists (CV) Authority files.
SemanTic Interoperability To access Cultural Heritage Frank van Harmelen Henk Matthezing Peter Wittenburg Marjolein van Gendt Antoine Isaac Lourens van.
Accessing Cultural Heritage using Semantic Web Techniques Antoine ISAAC VU Amsterdam - KB Digital Access to Cultural Heritage Master March 20 th, 2008.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Thesaurusmanagement Quickstart Introduction. What are controlled vocabularies? organized arrangement of words and phrases used to index content and/or.
1/ 27 The Agriculture Ontology Service Initiative APAN Conference 20 July 2006 Singapore.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
9/10/20151 SKOS. 9/10/20152 SKOS Describes thesauruses and taxonomies Properties: broader, narrower, subject, related Classes: Concept, Collection
Rutherford Appleton Laboratory SKOS Ecoterm 2006 Alistair Miles CCLRC Rutherford Appleton Laboratory Semantic Web Best Practices and Deployment.
The OAI-ORE based data model of Europeana and the Digital Public Library of America: implications for educational publishing Dov Winer MAKASH – Advancing.
A J Miles Rutherford Appleton Laboratory SKOS Standards and Best Practises for USING Knowledge Organisation Systems ON THE Semantic Web NKOS workshop ECDL.
Inside the DDC Dewey goes Europe: On the use and development of the Dewey Decimal Classification (DDC) in European libraries Austrian National Library.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
AthenaPlus: WP4 Eva Coudyzer Koninklijke Musea voor Kunst en Geschiedenis Europeana Overlegplatform, 7 juni 2013.
Europeana and semantic alignment of vocabularies Antoine Isaac Jacco van Ossenbruggen, Victor de Boer, Jan Wielemaker, Guus Schreiber Europeana & Vrije.
D4: SKOS and HIVE—Enhancing the Creation, Design and Flow of Information Speakers: Hollie White Jane Greenberg Coordinator: Alan Keely.
SKOS Tutorial Catch Mark van Assem, Antoine Isaac Vrije Universiteit Amsterdam Based on slides by Alistair Miles CCLRC Rutherford Appleton Laboratory
Incorporating ARGOVOC in DSpace-based Agricultural Repositories Dr. Devika P. Madalli & Nabonita Guha Documentation Research & Training Centre Indian Statistical.
Coastal Atlas Interoperability - Ontologies (Advanced topics that we did not get to in detail) Luis Bermudez Stephanie Watson Marine Metadata Interoperability.
, 1/21, © Library and Documentation Systems Division 21 st APAN Meeting Tokyo, January 2006 AGROVOC and AOS, Margherita Sini, FAO From.
The KOS interoperability in aquatic science field through mapping processes Carmen Reverté Reverté Aquatic Ecosystems Documentation Center. IRTA. (Sant.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
ISO 25964: a standard in support of interoperability Stella G Dextre Clarke Project Leader, ISO NP
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
It’s all semantics! The premises and promises of the semantic web. Tony Ross Centre for Digital Library Research, University of Strathclyde
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
APAN AG-WG Bangkok Food and Agriculture Organization of the UN Library and Documentation Systems Division Margherita Sini Slide Sustainable.
Oreste Signore- Quality/1 Amman, December 2006 Standards for quality of cultural websites Ministerial NEtwoRk for Valorising Activities in digitisation.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
EConnect WP1 & semantic issues VU members –Guus Schreiber, Antoine Isaac, Jacco van Ossenbruggen, Jan Wielemaker.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
The Agricultural Ontology Service: A proposal to create a Knowledge Organisation Framework in the Area of Food and Agriculture Johannes Keizer, Food and.
FSR, Feb 2014, Rome Which knowledge organization systems for conceptual interoperability? Claudio Gnoli ISKO Italy.
Trait ontology approach Marie-Angélique LAPORTE NCEAS June 7 th 2010.
Margherita Sini, FAO 1 / 19 Using RSS to Share KOS Metadata Margherita Sini, Gauri Salokhe IV Ecoterm Vienna, Austria April.
UNEP Terminology Workshop - Geneva, April 15, Environmental Terminology & Thesaurus Workshop UN Environment Programme Regional Office of Europe.
“New Dimensions in KOS” CENDI/NKOS Workshop September 11, 2008 Washington, DC, USA An international conference to share and advance knowledge and experience.
Transitioning from FGDC CSDGM Metadata to ISO 191** Metadata National Coastal Data Development Center A division of the National Oceanographic Data Center.
Diane Vizine-Goetz Senior Research Scientist, OCLC Research Joan S. Mitchell Editor in Chief, DDC Michael Panzer Assistant Editor, DDC Publisher and Librarian.
SKOS : A language to describe simple knowledge structures for the web
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
Semantic Web Overview Diane Vizine-Goetz OCLC Research.
Linked Library (+AM) Data Presented LITA Next-Generation Catalog IG Corey A Harper Publish, Enrich, Relate and Un-Silo.
D3.4 Report on Cross-Language Subject Access Options Subject access seminar, Prague Patrice Landry Swiss National Library.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Information Organization
Applications of IFLA Namespaces
PREMIS Tools and Services
RDA in a non-MARC environment
Antoine Isaac SEMIC conference
Presentation transcript:

SKOS Simple Knowledge Organization System Antoine Isaac Dublin Core tutorial, Sept. 21, 2011

This presenter Europeana Web & Media Lab, Vrije Universiteit Amsterdam Web & Media Lab W3C Library Linked Data group ( ) W3C Semantic Web Deployment groupW3C Semantic Web Deployment group SKOS

This tutorial Demo: SKOS data on the web SKOS Background Simple SKOS features More advanced SKOS Applications, tools & data

Knowledge Organization Systems? Domain-specific KOSs – Libraries: LCSH, DDC, UDC – Art history: AAT, ULAN – Medicine: UMLS, MESH – Geography: TGN – Food: AGROVOC Generic KOSs – Lexical vocabularies: WordNet – Country codes, languages …

SKOS Demo Following one’s nose to “concepts” as linked data American LCSH French RAMEAU German SWD Agrovoc STW Further on to DBPedia

Linked data

Knowledge Organization Systems for Linked Data? (hundreds of) thousands of concepts Loose semantics – but still, semantics! Car wheel BroaderTerm Car Proven to be useful for applications Search, description It is useful to enable publishing and re-use of legacy KOSs, in an area which is always craving for semantics

LCSH is to Thesaurus as Doorbell is to Mammal: Visualizing Structural Problems in the Library of Congress Subject Headings Simon Spero, DC 2008, wArticle/937

This tutorial Demo: SKOS data on the web SKOS Background Simple SKOS features More advanced SKOS Applications, tools & data

W3C Semantic Web Deployment Working Group Tom Baker, Guus Schreiber, Alistair Miles, Sean Bechhofer, Antoine Isaac, Ralph Swick, Ed Summers, Jon Phipps, Margherita Sini, Diego Berrueta, Clay Redding, and many others…

SKOS Simple Knowledge Organization System an official W3C recommendation! Scope: knowledge organization systems (KOS) such as thesauri, classification systems, subject heading lists… SKOS is for representings KOSs in RDF in a simple way

SKOS There are many KOS models and formats But also common features and application requirements Lexical information, semantic links SKOS is a model to port KOSs to RDF in a simple way – Not aimed at fitting everything! – Not aimed at replacing existing (non-web) formats!

Representing semantics The formal way: OWL Semantic Web ontology languageOWL Semantic Web ontology language Used for ontologies that enable machine reasoning Mother is a class It is the intersection of the classes Woman and Parent Parent is the class of entities of type Person that are related to at least one other resource of type Person using the child property …

SKOS is not for formal ontologies Turning KOSs into ontologies is possible, but KOSs – are large – have often a focus on terminological information Child UsedFor Offspring Softer semantics can be useful as such for many applications! Semantic search, annotation…

SKOS is not for formal ontologies Rob Styles (Talis): SKOS as a “stepping stone” into Semantic Web and Linked Data Allows straightforward conversion and re-use of existing knowledge Without some of the benefits granted by – Formal axioms (reasoning) – Cleaning data (high precision)

W3C standardization process Input: draft specification (SKOS 2005) Collect use cases & derive requirements Create issue list: requirements not handled by the draft spec Propose resolutions for issues Get consensus on new spec Find two independent implementations for each feature in the spec Continuously: asking for public feedback/comments Lot of feedback coming from the SKOS community list public-esw- Guus Schreiber

Use Cases and Requirements Gathering use cases for SKOS – Existing or anticipated applications – E.g., "Semantic search service across mapped multilingual thesauri in the agriculture domain" From use cases, requirements were elicited – E.g., using generalization links between concepts (can be used for hierarchical browsing)

This tutorial Demo: SKOS data on the web SKOS Background Simple SKOS features More advanced SKOS Applications, tools & data

Basic SKOS A set of features common to various KOS types and useful for many applications Concepts Lexical properties Semantic relations Notes

Thesaurus example Animals cats UF (used for) domestic cats RT (related term) wildcats BT (broader term) animals SN (scope note) used only for domestic cats domestic cats USE cats wildcats ISO 2788 model

Concepts and labels cats UF (used for) domestic cats skos: = rdf: = ex: =

Note: multilingual labels

SKOS is concept-oriented cats UF (used for) domestic cats USE/UF functions, as in ISO2788 But: Concepts are first-order (RDF) resources Labels are RDF literals (simple string values) Labels are linked via the concept resource

Semantic relations cats RT (related term) wildcats BT (broader term) animals

Documenting concepts Alistair Miles

A SKOS graph animals cats UF domestic cats RT wildcats BT animals SN used only for domestic cats domestic cats USE cats wildcats

Example: RDF XML serialization animals cats domestic cats used only for domestic cats wildcats animals cats UF domestic cats RT wildcats BT animals SN used only for domestic cats domestic cats USE cats wildcats

Converting data to SKOS LCSH, SKOS and Linked Data Ed Summers, Antoine Isaac, Clay Redding, Dan Krech DC

Getting that data It can be tedious: Complex data (MARC) Data archaeology: mining models from data Creating URIs: mostly from local IDs Assigning language tags for labels Mapping tables don’t save you from using your favorite data conversion software XSLT, Marc-perl… But it’s never really impossible

Methological references at Pete Johnston’s posts on conversion to SKOS: based-thesauri-and-skos-part-1.html based-thesauri-and-skos-part-2-linked-data.html

Concept Schemes Explicit representation of vocabularies

Concept Schemes Linking concepts to concept schemes

SKOS mappings SKOS allows bridging across KOSs from different contexts KOS 1: animals cats wildcats KOS 2: animal human object

Networking controlled vocabularies in SKOS closeMatch and exactMatch for equivalence – exactMatch is stronger and context-independent (transitive) broadMatch and narrowMatch for hierarchical links relatedMatch for other cases of interest KOS 1: animals cats wildcats KOS 2: animal human object

SKOS mappings A common way to represent important info for KOS use cases Focusing on types of mapping relationships Semantics – broadMatch is a sub-property of broader – Allows to seamlessly use mappings as basic KOS relationships – Still keeps the difference at the statement level

This tutorial Demo: SKOS data on the web SKOS Background Simple SKOS features More advanced SKOS – semantics Applications, tools & data

Semantics for SKOS? SKOS model enforces basic constraints on SKOS data SKOS must cope with existing information, and not infer new knowledge, beyond what KOS publishers intend Minimal semantic commitment Over-commitment harms interoperability SKOS is not a guideline to create KOS E.g., SKOS does not say how to create good labels

(Hard) A concept has only one prefLabel per language (Soft) No two concepts from a same concept scheme should have the same prefLabel in a given language Semantics for SKOS - labels

Semantics for SKOS There are rules to infer new facts E.g., broader and narrower are inverse of each other

Semantics of skos:broader Is skos:broader "transitive"? Infering a new link can be wrong, sometimes! Some KOSs are not always hierarchically clean skos:broader is not transitive in general

skos:broader has a super-property skos:broaderTransitive with semantics of “has ancestor” 1: every broader implies a broaderTransitive 2: broaderTransitive is transitive! Semantics of skos:broader

SKOS semantics SKOS is represented as an OWL ontology In total 46 axioms Axioms may be less rich than expected for OWL fans See

SKOS and OWL -- again “OWL is a Harley-Davison, SKOS is a mountain bike” — Tom Baker SKOS and OWL are meant for quite different things SKOS = Model to represent KOSs in a simple way Ontology for concepts – the elements in (CH) vocabularies

Raising difficult issues: what counts as a "concept"? A concept is an artifact – used in descriptions, e.g., as subjects – used as a cluster for different labels with a similar meaning – in semantic relationships with other concepts Should a person name authority be represented using a class (foaf:Person) or a skos:Concept? Or both? E.g., discussion at conceptualisations-skos-foaffocus-modelling-choices.html conceptualisations-skos-foaffocus-modelling-choices.html

This tutorial Demo: SKOS data on the web SKOS Background Simple SKOS features More advanced SKOS – complex constructs Applications, tools & data

From SKOS Use Cases: Use Case #3 — Semantic search service across mapped multilingual thesauri in the agriculture domain “The AIMS project includes String-to-String relationships” “Requires: R-RelationshipsBetweenLabels”R-RelationshipsBetweenLabels In basic SKOS, labels are RDF literals and cannot be subjects of RDF statements Relationships between lexical labels

Done as an extension: SKOS-XL – skos-xl:Label – skos-xl:labelRelation skos-xl:labelRelation ex:translation

Other features Concept grouping skos:Collection, skos:member… Notations skos:notations

Killed darling example Synthesis of new subjects Using subdivisions: Brass bands—Sponsorship “Coordination” seems too application- and/or KOS- specific At least it did for the SWD Group, compared to other KOS features It is also quite complex, not for Simple-KOS

Handled by MADS/RDF implemented at id.loc.govid.loc.gov

Extending SKOS Vocabularies dedicated to specific KOS aspects can be defined as extensions to SKOS madsrdf:authoritativeLabel rdfs:subPropertyOf skos:prefLabel Ensures compatibility with tools that consume simple SKOS

This tutorial Demo: SKOS data on the web SKOS Background Simple SKOS features More advanced SKOS Applications, tools & data

Benefits of SKOS? Easily fitting KOSs into the Semantic Web & Linked Data vision Web-oriented representation Re-use & sharing of concepts and their descriptions Linking between concepts from different contexts Extensibility

Johan Stapel, Koninklijke Bibliotheek (now bibliotheek.nl) A vision for the Dutch National Library

Unifying access to collections Experiment from the STITCH project KB Illuminated Manuscripts BnF Mandragore Manuscripts

Semantic reconciliation of collections Blue triangles: (collection-)specific vocabularies

Reconciliation through vocabulary alignment

Demo: SKOS, browsing and alignment Subject vocabulary, collection 1 Subjects

Demo: SKOS, browsing and alignment Hierarchical path from root to selected subject Possible specialization for selected subject

Document from Collection 2 Semantic alignment of subjects activated Demo: SKOS, browsing and alignment

Subject from voc2 aligned to voc1:amphibians”

Building a search engine on top of metadata is difficult Intrinsic quality problems: correctness, coverage Especially when data is so heterogeneous Language issue

Prototype: Europeana Thought Lab

Noticeable facts KOS-independent systems A vocabulary can easily replace another in the system Use standard SKOS constructs skos:broader, skos:prefLabel, skos:exactMatch Computing links is helped by SKOS' straightforward representation of (multilingual) labels It is actually a case of monolingual (e.g., French-to-French or Russian- to-Russian) linking!

Semantic Annotation Michiel Hildebrand

Benefiting from the availability of different vocabularies Michiel Hildebrand

Direct access to the context of annotations

with technology from Mondeca.com Or in a quite different domain…

This tutorial Demo: SKOS data on the web SKOS Background Simple SKOS features More advanced SKOS Applications, tools & data

SKOS “Implementations”? Report by W3C Semantic Web deployment group – Tools to exploit or create SKOS data – Vocabularies: KOSs converted to SKOS Miles, Bechhofer, SKOS Implementation Report, May 19th

SKOS “Implementations”?

Tools SKOSEd, Poolparty, ThManager, iQvoc, ITM, TemaTres, FAO workbench, the Metadata Registry, HIVE, ONKI… Editors, browsers, validators, registries APIs/Web services Annotation tools Search engines But any general semantic web / linked data tool could be relevant

Available data General SKOS data W3C wiki pagehttp:// Datasets on the Data Hub: Inventory of Library Linked Data resources W3C LLD Incubator Deliverable on available value vocabularies coming very soon! Datasets on the Data Hub: (you can contribute!)

Available data Specific registry pages The Metadata Registry ONKI HIVE …

Government data

Astronomy research

Some landmark KOS LD implementations Many Libraries – not a surprise! Swedish National Library’s Libris catalogue and thesaurus Library of Congress’ vocabularies, including LCSH DNB’s Gemeinsame Normdatei (incl. SWD subject headings) Documentation at BnF’s RAMEAU subject headings OCLC’s DDC classification and VIAF STW economy thesaurus National Library of Hungary’s catalogue and thesauri (example) Other fields Wikipedia categories through Dbpedia New York Times subject headings IVOA astronomy vocabularies GEMET environmental thesaurus Agrovoc Linked Life Data Taxonconcept UK Public sector vocabularies (e.g., )

Challenge: Linking! Manual mapping of large vocabularies is labour-intensive MACS project: LCSH, RAMEAU and SWD CRISS-CROSS project: SWD and DDC Automatic linking is not perfect but can help

KOS Alignments? Quite many of them are linked to some other resource LCSH, SWD and RAMEAU interlinked through MACS mappings GND -> DBpedia, VIAF Libris -> LCSH Agrovoc -> CAT, NAL, SWD, GEMET NYT -> freebase, DBpedia, GeoNames dbPedia links are overwhelming Hungary, STW, TaxonConcept, GND…

Issue: inter-linking KOS data KOSs become valuable when they bring a “semantic layer” over other resources E.g. books and the topics they are about Links between concept schemes are still scarce Links between objects and KOS are often only implicit in the data More efforts on semantic annotation with KOS and KOS alignment are needed

Take-home messages: status quo Publication and linking of linked KOS data is still work in progress, But we can start building applications that make use of the wealth of data already available

Take-home messages: technical benefits of SKOS Not just a more sophisticated way to represent data! Ease of getting data from external sources Ease of publishing data Ease of linking across datasets

If we stop here, thanks for your attention! Any (more) questions?

Acknowledgements Material on a couple of slides borrowed from Alistair Miles, Michiel Hildebrand, Johan Stapel and Guus Schreiber Participants of the Semantic Web Deployment working group

References SKOS Reference SKOS Primer SKOS homepage SKOS wiki SKOS mailing