Repositories thru the looking glass Andy Powell Eduserv Foundation

"There are many methods for predicting the future. For example, you can read horoscopes, tea leaves, tarot cards, or crystal balls. Collectively, these methods are known as nutty methods. Or you can put well-researched facts into sophisticated computer models, more commonly referred to as a complete waste of time." (Dilbert)

"Either that wallpaper goes or I do." (Oscar Wilde's last words)

some background…

The DCMI Abstract Model
a set of rules defining how DC metadata descriptions are constructed
– A description is made up of one or more statements …
– Each statement instantiates a property/value pair and is made up of …
– …
– Each value string is a simple, human-readable string …
– …
a set of human-readable statements (as per above)
also formalised using UML

The DCMI Abstract Model
independent of particular syntaxes
but descriptions that comply with the model can be encoded using any of the recognised DCMI encodings
– i.e. XHTML, XML and RDF
simple
– largely based on resource, property, value triple
– formally mapped to the RDF model
highly extensible

The DCMI Abstract Model
record (encoded as HTML, XML or RDF/XML)
  description set
    description (about a resource (URI))
      statement
        property (URI)
        value (URI)
        vocabulary encoding scheme (URI)
        value string
          language (e.g. en-GB)
          syntax encoding scheme (URI)
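The nesting on this slide can be sketched as a handful of Python dataclasses. This is only an illustrative reading of the diagram: the class and field names, the example.org URIs and the `triples` helper are assumptions made for the sketch, and the flattening it performs is a rough one, not the formal DCAM-to-RDF mapping.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class ValueString:
    """A simple, human-readable string with optional qualifiers."""
    text: str
    language: Optional[str] = None                # e.g. "en-GB"
    syntax_encoding_scheme: Optional[str] = None  # URI


@dataclass
class Statement:
    """One property/value pair."""
    property_uri: str
    value_uri: Optional[str] = None
    vocabulary_encoding_scheme: Optional[str] = None  # URI
    value_strings: List[ValueString] = field(default_factory=list)


@dataclass
class Description:
    """One or more statements about a single resource."""
    resource_uri: str
    statements: List[Statement] = field(default_factory=list)


@dataclass
class DescriptionSet:
    """A set of descriptions; a record is one encoding of such a set."""
    descriptions: List[Description] = field(default_factory=list)


def triples(description_set: DescriptionSet) -> List[Tuple[str, str, str]]:
    """Roughly flatten a description set into (resource, property, value) triples."""
    out = []
    for description in description_set.descriptions:
        for statement in description.statements:
            if statement.value_uri is not None:
                out.append((description.resource_uri,
                            statement.property_uri,
                            statement.value_uri))
            for value_string in statement.value_strings:
                out.append((description.resource_uri,
                            statement.property_uri,
                            value_string.text))
    return out


# Hypothetical example data; the example.org URIs are placeholders.
example = DescriptionSet([
    Description(
        resource_uri="http://example.org/eprint/1",
        statements=[
            Statement(
                property_uri="http://purl.org/dc/terms/title",
                value_strings=[ValueString("Repositories thru the looking glass",
                                           language="en-GB")],
            ),
            Statement(
                property_uri="http://purl.org/dc/terms/creator",
                value_uri="http://example.org/agent/42",
            ),
        ],
    )
])

for triple in triples(example):
    print(triple)
```

Keeping the description tied to a resource URI is what makes the relationship between descriptions and described resources explicit.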

The DCMI Abstract Model
relationships between the descriptions in a description set and the resources being described made explicit
oddly, most metadata standards do not do this
DC application profiles now start by defining which set of resources are being described…
…then assigning the set of properties and so on that will be used to describe them

E.g. an application profile for CDs
start with the set of entities that we want to describe and the key relationships between those entities, e.g. a CD collection entity/relationship model…
then define a set of properties for each
entities: collection, CD, artist, owner, record label
relationships: owned by, contained in, created by, released by

JISC Information Environment

are we heading in the right direction?

open access: not if but when

3 issues…

issue #1: have we got our terminology right?

a university-based institutional repository is a set of services that a university offers to the members of its community for the management and dissemination of digital materials created by the institution and its community members. It is most essentially an organizational commitment to the stewardship of these digital materials, including long-term preservation where appropriate, as well as organization and access or distribution. … An institutional repository is not simply a fixed set of software and hardware (Cliff Lynch, 2003)

a focus on making content available on the Web would be more intuitive to researchers

a focus on content management would change our emphasis
OAI-PMH out…
search engine optimisation, usability, accessibility, tagging, information architecture, cool URIs in…

issue #2: service oriented vs. resource oriented

REST = Representational State Transfer
an architectural style with a focus on resources, their identifiers (e.g. URIs), and a simple uniform set of operations that each resource supports (e.g. GET, PUT, POST, DELETE)
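As a rough illustration of "resources, identifiers and a uniform set of operations", here is a minimal in-memory sketch. `ResourceStore`, its method names and the `/eprints` paths are hypothetical and stand in for a real HTTP server; the status codes follow common HTTP usage.

```python
class ResourceStore:
    """In-memory stand-in for a server: every resource is addressed by a
    URI and supports the same small set of operations."""

    def __init__(self):
        self._resources = {}
        self._next_id = 1

    def get(self, uri):
        """Retrieve the current representation of a resource."""
        if uri in self._resources:
            return 200, self._resources[uri]
        return 404, None

    def put(self, uri, representation):
        """Create or replace the resource at a URI chosen by the client."""
        created = uri not in self._resources
        self._resources[uri] = representation
        return (201 if created else 200), representation

    def post(self, collection_uri, representation):
        """Add a new resource to a collection; the store mints its URI."""
        uri = f"{collection_uri.rstrip('/')}/{self._next_id}"
        self._next_id += 1
        self._resources[uri] = representation
        return 201, uri

    def delete(self, uri):
        """Remove a resource."""
        if uri in self._resources:
            del self._resources[uri]
            return 204, None
        return 404, None


store = ResourceStore()
status, uri = store.post("/eprints", {"title": "Repositories thru the looking glass"})
print(status, uri)
print(store.get(uri))
print(store.put(uri, {"title": "Repositories through the looking glass"}))
print(store.delete(uri))
print(store.get(uri))
```

The point of the sketch is that nothing is specific to eprints: a client that knows a URI and the four operations needs no service-specific interface.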

issue #3: national vs. global

The impact of Web 2.0: prosumer, remote apps, social, API, diffusion, concentration

thinking about the future…

1. what would a Web 2.0 repository look like?
2. potential impact of the Semantic Web on repositories

high-quality browser-based document viewer (not Acrobat!)
tagging, commentary, more-like-this, favorites, …
persistent (cool) URIs to content
ability to form simple social groups
ability to embed documents in other Web sites
high visibility to Google
offer RSS as primary API
use of Amazon S3 to cope with scalability
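To make "offer RSS as primary API" a little more concrete, the sketch below builds a minimal RSS 2.0 feed of recent deposits with Python's standard library. The `build_rss` helper, the feed title and the example.org links are assumptions for illustration; each item links to the persistent URI of the content.

```python
import xml.etree.ElementTree as ET


def build_rss(title, link, description, items):
    """Return an RSS 2.0 document (as a string) listing repository items.

    `items` is a list of dicts with "title" and "link" keys; the link is
    assumed to be the persistent URI of the item.
    """
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = title
    ET.SubElement(channel, "link").text = link
    ET.SubElement(channel, "description").text = description
    for item in items:
        node = ET.SubElement(channel, "item")
        ET.SubElement(node, "title").text = item["title"]
        ET.SubElement(node, "link").text = item["link"]
        # Reuse the persistent URI as the globally unique identifier.
        ET.SubElement(node, "guid", isPermaLink="true").text = item["link"]
    return ET.tostring(rss, encoding="unicode")


# Hypothetical feed contents.
feed = build_rss(
    title="Recent deposits",
    link="http://example.org/repository",
    description="Latest items added to the repository",
    items=[
        {"title": "An example eprint", "link": "http://example.org/eprint/1"},
        {"title": "Another example eprint", "link": "http://example.org/eprint/2"},
    ],
)
print(feed)
```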

a Web 2.0 repository would be a global service
global concentration is an enabler of social interaction

But…
they don't do preservation
they don't handle complex workflows
they don't expose rich metadata
– yes, scholarly communication has some particular functional requirements which are not met by Google…
– author searching, citation counting, object complexity
– not handled well by the current Web
– how are these requirements best met? thru richer metadata?

1. what would a Web 2.0 repository look like?
2. potential impact of the Semantic Web on repositories

SWAP: The Scholarly Works Application Profile

A model based on FRBR
Functional Requirements for Bibliographic Records
an application model for the entities that bibliographic records are intended to describe
FRBR models the world using 4 key entities
– Work, Expression, Manifestation and Item

FRBR and scholarly works
FRBR is a useful model in the context of scholarly works (eprints) because it allows us to answer questions like
– what is the URL of the most appropriate copy (an item) of the PDF format (a manifestation) of the pre-print version (an expression) for this eprint (the work)?
– are these two copies related? if so, how?

FRBR for scholarly works
scholarly work (work): the eprint as a scholarly work
version (expression): Author's Original 1.0, Author's Original 1.1, Version of Record (French), …, Version of Record (English)
format (manifestation): html, pdf
copy (item): publisher's copy, institutional repository copy

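SWAP application profile model
entities: ScholarlyWork, Expression, Manifestation, Copy, Agent
ScholarlyWork isExpressedAs Expression
Expression isManifestedAs Manifestation
Manifestation isAvailableAs Copy
other relationships shown: isCreatedBy, isPublishedBy, isEditedBy, isFundedBy, isSupervisedBy, AffiliatedInstitution
(multiplicities appear as 0.. in the diagram)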

SWAP and FRBR
ScholarlyWork = FRBR Work
Expression = FRBR Expression
Manifestation = FRBR Manifestation
Copy = FRBR Item

SWAP and FRBR
ScholarlyWork: the eprint (an abstract concept)
Expression: the version of record or the French version or version 2.1
Manifestation: the PDF format of the version of record
Copy: the publisher's copy of the PDF
…
Agent: the author or the publisher

Attributes
the application model defines the entities and relationships
each entity needs to be described using an agreed set of attributes

Example attributes
ScholarlyWork: title, subject, abstract, affiliated institution, identifier
Agent: name, type of agent, date of birth, mailbox, homepage, identifier
Expression: title, date available, status, version number, language, genre / type, copyright holder, bibliographic citation, identifier
Manifestation: format, date modified
Copy: date available, access rights, licence, identifier
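The entity chain and a few of these attributes can be sketched in Python to show how the model answers the "most appropriate copy" question. Everything concrete here is an assumption for illustration: the class layout, the use of `identifier` as a URL, the example.org addresses, and the rule that "most appropriate" means open copies first.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Copy:
    identifier: str               # assumed here to be the URL of the copy
    access_rights: str = "open"


@dataclass
class Manifestation:
    format: str
    copies: List[Copy] = field(default_factory=list)                   # isAvailableAs


@dataclass
class Expression:
    version: str
    manifestations: List[Manifestation] = field(default_factory=list)  # isManifestedAs


@dataclass
class ScholarlyWork:
    title: str
    expressions: List[Expression] = field(default_factory=list)        # isExpressedAs


def copy_urls(work: ScholarlyWork, version: str, fmt: str) -> List[str]:
    """URLs of copies of the given version and format, open copies first."""
    matches = [
        copy
        for expression in work.expressions if expression.version == version
        for manifestation in expression.manifestations if manifestation.format == fmt
        for copy in manifestation.copies
    ]
    # Stable sort: open copies (key False) come before restricted ones.
    matches.sort(key=lambda copy: copy.access_rights != "open")
    return [copy.identifier for copy in matches]


# Hypothetical eprint with two versions.
eprint = ScholarlyWork(
    title="An example eprint",
    expressions=[
        Expression("pre-print", [
            Manifestation("pdf", [
                Copy("http://publisher.example.org/preprint.pdf", "restricted"),
                Copy("http://repository.example.org/preprint.pdf", "open"),
            ]),
            Manifestation("html", [Copy("http://repository.example.org/preprint.html")]),
        ]),
        Expression("version of record", [
            Manifestation("pdf", [Copy("http://publisher.example.org/vor.pdf", "restricted")]),
        ]),
    ],
)

print(copy_urls(eprint, "pre-print", "pdf"))
```

Two copies returned by the same call are related by construction: they share a manifestation, an expression and a work.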

Final thoughts on the model
this model makes it easier to rationalise traditional and modern citations
– traditional citations tend to be made between eprint expressions
– hypertext links tend to be made between eprint copies (or items in FRBR terms)
adopting a simple underlying model now may be expedient in the short term but costly to interoperability in the long term
– the underlying model needs to be as complex as it needs to be, but not more so!
a complex underlying model may be manifest in relatively simple metadata and/or end-user interfaces
existing eprint systems may well capture this level of detail currently
– but use of simple DC stops them exposing it to others!

time to reflect?

Repositories
what can we learn from Web 2.0?
– user interface design matters
– global concentration is an enabler of social interaction
simple DC is both too simple and too complex
richer DC application profiles such as SWAP may be a way forward
but need to ensure that their use does not over-complicate user interfaces and workflows

Open Access
in policy terms, talking about the aim (making content available on the Web) would be much better than talking about the objective (putting content in repositories)

more generally…
resource orientation, REST, the Semantic Web and Web architecture…
…are important
digital libraries ignore them at their peril

Thank you
images by eNil, Poppyseed Bandits, m o d e, striatic, estherase, Gen Kanai, //bwr - Hieronymus Karl Frederick, dullhunk, Today is a good day and yours truly