Authority versus authenticity: the shift from labels to identifiers


Authority versus authenticity: the shift from labels to identifiers
Mirna Willer and Gordon Dunsire
Presented at APAE 2016 Conference and School, Zadar, Croatia, 25 October 2016

RDF: Resource Description Framework
A method for storing and linking data at global level; the basic syntax of the Semantic Web, the web of (meta)data. RDF is implemented as an extension of the Internet and World-Wide Web (the web of documents). Data is stored as single atomic statements using the syntax subject – predicate – object:
This book – has author – J.K. Rowling
J.K. Rowling – is the author of – this book

RDF graphs
RDF syntax can be represented as a mathematical graph of nodes and connectors.
[Diagram: nodes "This book" and "J.K. Rowling", joined by the connectors "has author" and "is author of"]
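A minimal sketch of the two statements above as a graph, in Python. Plain strings stand in for nodes and connectors, and the function name build_graph is illustrative only; it is not part of any RDF library.

```python
# Each statement is a (subject, predicate, object) tuple.
triples = [
    ("This book", "has author", "J.K. Rowling"),
    ("J.K. Rowling", "is author of", "This book"),
]


def build_graph(triples):
    """Map each node to the list of (connector, target node) pairs leaving it."""
    graph = {}
    for subject, predicate, obj in triples:
        graph.setdefault(subject, []).append((predicate, obj))
        graph.setdefault(obj, [])  # make sure the object exists as a node too
    return graph


graph = build_graph(triples)
for node, edges in graph.items():
    for predicate, target in edges:
        print(f"{node} --{predicate}--> {target}")
```

Every subject and every object becomes a node; every predicate becomes a directed connector between two nodes.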

URI: Uniform Resource Identifier
RDF is intended for machine processing, but machines are too dumb to use ambiguous (human) labels for the statement or graph, e.g. "This book", "author", etc. Machine-readable identifiers are used instead. Each identifier must be unique at global level (the Semantic Web). The URI builds on the established protocols and services of the World-Wide Web: HTTP, URL, content negotiation for browsers, etc.

Identifying the components of a triple
RDF requires the subject and predicate of a statement (triple) to be identified with URIs; the object may be the data value to be stored, or identified by a URI.
[Diagram: Subject URI – Predicate URI – Object data value (human-readable!)]
[Diagram: Subject URI – Predicate URI – Object URI]
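The rule can be sketched as a small type check, assuming Python dataclasses rather than a real RDF toolkit. The example.org URIs and the class names are hypothetical; the ISNI URI is the one quoted later in this presentation. The check follows the rule as stated here, so blank nodes, which RDF also allows, are left out.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class URI:
    value: str


@dataclass(frozen=True)
class Literal:
    value: str  # a human-readable data value


@dataclass(frozen=True)
class Triple:
    subject: URI
    predicate: URI
    obj: Union[URI, Literal]

    def __post_init__(self):
        # Subject and predicate must be identifiers, never labels.
        if not isinstance(self.subject, URI) or not isinstance(self.predicate, URI):
            raise TypeError("subject and predicate must be URIs")
        if not isinstance(self.obj, (URI, Literal)):
            raise TypeError("object must be a URI or a data value")


# Hypothetical identifiers for the book and the two properties.
BOOK = URI("http://example.org/book/1")
HAS_TITLE = URI("http://example.org/prop/hasTitle")
HAS_AUTHOR = URI("http://example.org/prop/hasAuthor")
ROWLING = URI("http://isni.org/isni/000000012148628X")

title_triple = Triple(BOOK, HAS_TITLE, Literal("Fantastic beasts & where to find them"))
author_triple = Triple(BOOK, HAS_AUTHOR, ROWLING)
```

The first triple ends in a data value that a human can read; the second ends in a URI that can itself be the subject of further statements.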

Linking triples; weaving the Semantic Web
URIs can be matched by machine to form clusters and chains of triples.
[Diagram: a cluster of triples sharing a common subject URI, with object URIs and data values; a chain formed where the object URI of one triple = the subject URI of another]
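A rough illustration of the matching step, assuming triples held as plain Python tuples. All URIs below are hypothetical example.org identifiers, and the function names are made up for this sketch.

```python
EX = "http://example.org/"  # hypothetical namespace

triples = [
    (EX + "book/1", EX + "prop/hasAuthor", EX + "person/1"),
    (EX + "book/1", EX + "prop/hasTitle", "Fantastic beasts & where to find them"),
    (EX + "person/1", EX + "prop/hasName", "J. K. Rowling"),
    (EX + "book/2", EX + "prop/hasAuthor", EX + "person/1"),
]


def clusters(triples):
    """Group triples that share a common subject URI."""
    by_subject = {}
    for triple in triples:
        by_subject.setdefault(triple[0], []).append(triple)
    return by_subject


def chains(triples):
    """Pair triples where the object of the first equals the subject of the second."""
    by_subject = clusters(triples)
    return [(first, second)
            for first in triples
            for second in by_subject.get(first[2], [])]


book_cluster = clusters(triples)[EX + "book/1"]
links = chains(triples)
```

Here the two statements about book/1 form a cluster, and each "hasAuthor" statement chains to the statement about person/1. A real triple store would also distinguish data values from URIs before matching; this sketch relies on the values not looking like URIs.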

AAA: Anybody can say Anything about Any thing
There is no intrinsic test of "truth". Semantic logic can detect contradictions in a set of two or more statements:
(1) This thing – is a – cat
(2) This thing – is a – dog
(3) Cat – is disjoint with – dog [a thing cannot be dog AND cat]
One or more statements is "false" – but which one(s)? Provenance provides a measure of reliability.
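The cat/dog example can be checked mechanically. The following is a sketch only, with labels standing in for URIs and a hypothetical function name; it reports that a contradiction exists but, as noted above, cannot say which statement is the false one.

```python
from itertools import combinations

# Statements (1) and (2), plus an unrelated, consistent one.
type_statements = [
    ("this thing", "cat"),
    ("this thing", "dog"),
    ("that thing", "cat"),
]

# Statement (3): cat is disjoint with dog.
disjoint_pairs = {frozenset({"cat", "dog"})}


def find_contradictions(type_statements, disjoint_pairs):
    """Return (thing, class_a, class_b) for every thing typed with two disjoint classes."""
    classes_of = {}
    for thing, cls in type_statements:
        classes_of.setdefault(thing, set()).add(cls)
    conflicts = []
    for thing, classes in sorted(classes_of.items()):
        for a, b in combinations(sorted(classes), 2):
            if frozenset({a, b}) in disjoint_pairs:
                conflicts.append((thing, a, b))
    return conflicts


conflicts = find_contradictions(type_statements, disjoint_pairs)
```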

OWA: Open World Assumption
Absence of data is not data of absence. The "record" is never complete: there is always something more to say about any thing. Non-identical statements are separate statements, even if they record the "same" data. In a "closed world", absence of data (blanks) can indicate that the aspect/element is not applicable.
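The difference between the two assumptions shows up when a statement is missing. A minimal sketch, assuming a set of label-based triples and a hypothetical function named holds:

```python
from typing import Optional

known = {
    ("This book", "has author", "J.K. Rowling"),
}


def holds(statement, known, open_world=True) -> Optional[bool]:
    """Return True if stated; otherwise None (unknown) or False (closed world)."""
    if statement in known:
        return True
    # Open world: silence means "not known". Closed world: silence means "no".
    return None if open_world else False


question = ("This book", "has illustrator", "J.K. Rowling")
open_answer = holds(question, known)                       # unknown
closed_answer = holds(question, known, open_world=False)   # treated as false
```

Under the open world assumption the missing statement is simply not yet said; only a closed-world system may read the blank as an answer.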

Provenance and cataloguing content rules
Provenance: Who said that? When was it said? Why was it said? The values used for bibliographic content as the data of a triple's object are determined by the application of library cataloguing codes. Codes have converged to a common basis (a result of the attempted imposition of top-down global standards as part of "Universal Bibliographic Control"), but still diverge in interpretation, context, and culture, leading to different values from different codes and cataloguers.
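One way to picture provenance is as extra fields carried by each statement. The sketch below is an assumption about how that might look, not a description of any standard: the two values are the statements of responsibility from the case study in this presentation, while the dates, the "basis" text and every name in the code are invented for illustration.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class AttributedStatement:
    subject: str
    predicate: str
    value: str
    asserted_by: str    # who said that?
    asserted_on: date   # when was it said? (hypothetical dates below)
    basis: str          # why was it said? (hypothetical rule labels below)


statements = [
    AttributedStatement(
        "Fantastic beasts & where to find them", "statement of responsibility",
        "Newt Scamander [i.e.] by J. K. Rowling",
        "British catalogue", date(2016, 1, 1), "local cataloguing code"),
    AttributedStatement(
        "Fantastic beasts & where to find them", "statement of responsibility",
        "by J. K. Rowling ; [introduction by] Newt Scamander",
        "Italian catalogue", date(2016, 1, 2), "local cataloguing code"),
]


def values_by_source(statements, subject, predicate):
    """Show which source asserted which value for the same subject and predicate."""
    return {s.asserted_by: s.value
            for s in statements
            if s.subject == subject and s.predicate == predicate}


by_source = values_by_source(
    statements, "Fantastic beasts & where to find them", "statement of responsibility")
```

Both values are kept; the provenance fields let a consumer decide which source to trust for a given purpose.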

IFLA Library Reference Model
The LRM is the most recent library bibliographic standard, and provides a high-level model on which cataloguing codes and finer metadata structures can be built. The model is optimized for Semantic Web technologies. In particular, the LRM introduces two controversial ideas that affect identity, authority, and provenance.

"Authority" in the LRM
Only human beings can "author", or be responsible for, a bibliographic resource. Fictitious or legendary entities that are claimed to be authors are assumed to be pseudonyms of a person or group of persons.
Text-based resources (especially print) describe themselves in manifestation statements. Descriptive data values may be transcribed from the physical instantiation of a resource (a manifestation).

Case study

Title statement: "Fantastic beasts & where to find them"
Statement of responsibility (British catalogue): "Newt Scamander [i.e.] by J. K. Rowling"
Statement of responsibility (Italian catalogue): "by J. K. Rowling ; [introduction by] Newt Scamander"

Catalogue | Manifestation statement | Primary Access Point | Additional Access Point
UK | [i.e.] by J. K. Rowling | Rowling, J. K.
Italy | [introduction by] Newt Scamander | Scamander, Newt
Translations:
Germany | Newt Scamander
Italy (2010) | [di] Newt Scamander ; J. K. Rowling
Italy (2015) | di J. K. Rowling ; Newt Scamander
Spain | Newt Scamander ; por J. K. Rowling | Scamander, Newt (1897-)
France | Newt Scamander [i.e. J. K. Rowling] | Rowling, Joanne Kathleen (1965-)
Croatia | Rowling, Joanne Kathleen

ISNI: International Standard Name Identifier

Authority control
Creation and maintenance of a unique "authorized access point" (name, label, heading) for an entity.
Heading: Rowling, J. K. Joanne Kathleen 1965 July 31- novelist
Normalized name: Rowling, J. K.
Expanded initials: Joanne Kathleen
Date of birth: 1965 July 31-
Profession: novelist
[Diagram: http://isni.org/isni/000000012148628X – has profession – novelist]
The heading (d)evolves into a metadata record, or a set of triples.
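The move from heading to set of triples might look like the sketch below. The ISNI URI and the four data values come from the slide; the property URIs under example.org and the function names are assumptions made for the example.

```python
ISNI_URI = "http://isni.org/isni/000000012148628X"
EX = "http://example.org/prop/"  # hypothetical property namespace

# The heading broken into one statement per component.
authority_triples = [
    (ISNI_URI, EX + "normalizedName", "Rowling, J. K."),
    (ISNI_URI, EX + "expandedInitials", "Joanne Kathleen"),
    (ISNI_URI, EX + "dateOfBirth", "1965 July 31"),
    (ISNI_URI, EX + "hasProfession", "novelist"),
]


def describe(entity_uri, triples):
    """Collect the statements about one entity into a simple record."""
    return {p[len(EX):]: o for s, p, o in triples if s == entity_uri}


def heading(record):
    """Rebuild a display label from the record; the identifier stays unchanged."""
    return " ".join([
        record["normalizedName"],
        record["expandedInitials"],
        record["dateOfBirth"] + "-",
        record["hasProfession"],
    ])


record = describe(ISNI_URI, authority_triples)
label = heading(record)
```

The label is now derived from the data, so a catalogue can build a different heading from the same triples without touching the identifier.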

Manifestation statement
Manifestation statements are social constructs: "published" manifestations are products of industrial and commercial processes. They are influenced by branding and other commercial issues, which depend on cultural context. Translations, derivations, etc. may have a different social context. Transcription is rarely exact: What I see is what you get. What about non-print materials, or text content online?

Current and future knowledge
What is recorded now may change in the future. What is recorded now should not be regarded as absolute or fixed (OWA). Future knowledge may lead to adjustment of current data. But the future cannot be anticipated, including future requirements of users. The needs of the present should take precedence over needs of the future. Present records are fixed – RDF good practice never deletes data (deprecation) – AAA! All adjustments are additions (not replacements).
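"All adjustments are additions" can be sketched as an append-only log in which deprecation is itself a new statement. This is an illustrative design, not a description of how any particular triple store works; the class, the status property and the example values are all hypothetical.

```python
STATUS = "http://example.org/prop/status"  # hypothetical property


class AppendOnlyStore:
    """Statements are only ever added; nothing is deleted or overwritten."""

    def __init__(self):
        self.log = []

    def add(self, subject, predicate, obj):
        self.log.append((subject, predicate, obj))
        return len(self.log) - 1  # position in the log identifies the statement

    def deprecate(self, statement_id):
        # Deprecation is recorded as one more statement about the old statement.
        return self.add(f"statement:{statement_id}", STATUS, "deprecated")

    def current(self):
        deprecated = {s for s, p, o in self.log if p == STATUS and o == "deprecated"}
        return [t for i, t in enumerate(self.log)
                if t[1] != STATUS and f"statement:{i}" not in deprecated]


store = AppendOnlyStore()
old_id = store.add("http://example.org/book/1", "has author", "Newt Scamander")
store.deprecate(old_id)
store.add("http://example.org/book/1", "has author", "J. K. Rowling")
```

The earlier statement remains in the log with its history intact; only the view of what is current changes.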

Conclusion
The shift from labels to global identifiers:
Provides more effective (less ambiguous) identifiers
Provides stable identifiers
Provides identifiers required for the Semantic Web
Allows "labels" or "access points" to be treated as entities: data recorded for labels are data records of entities
Improves interoperability of data recorded for different contexts
Focuses authority on the authenticity and provenance of metadata statements

Thank you!
gordon@gordondunsire.com
mwiller@unizd.hr
isni.org