Digital preservation: identifiers and rights

Presentation transcript:

Digital preservation: identifiers and rights 5 Dec 2002 Preparing for Digital Preservation What is being preserved: Identification and Rights Management issues Norman Paskin International DOI Foundation doi> (c) IDF 2002

Recommended background material
Preservation Management of Digital Materials: The Handbook. N. Beagrie, M. Jones, DPC. www.dpconline.org/graphics/handbook/ (see 3.4 Rights management; 4.4 Metadata and Documentation; 4.5 Access)
Digital preservation: an introduction to the standards issues surrounding the deposit of non-print publications. M. Bide, E. J. Potter, A. Watkinson, Sept 1999. www.bic.org.uk/digpres.doc

Outline of presentation
1. Identifiers
1.1 Identifiers and metadata
1.2 Interoperability
1.3 Different meanings of “identifier”
1.4 Persistence
2. “Keep a copy” - ?
3. Rights
3.1 Accessing “definitive copy”
3.2 Rights framework

1.1 Identifiers and metadata
An identifier = an unambiguous string denoting an entity, e.g. 0550 10234 5
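
One practical reading of “unambiguous string” is that a label should at least be well formed. The sketch below assumes the example 0550 10234 5 is an ISBN-10 and checks it with the standard ISBN-10 weighted sum (weights 10 down to 1, total divisible by 11). The function name is illustrative and does not come from the slides.

```python
def is_valid_isbn10(label: str) -> bool:
    """Return True if label is a 10-character ISBN with a correct check character."""
    chars = [c for c in label if c not in " -"]  # ignore spacing and hyphens
    if len(chars) != 10:
        return False
    total = 0
    for position, char in enumerate(chars):
        if char in "Xx" and position == 9:
            value = 10  # 'X' stands for 10 and is allowed only in the last position
        elif char in "0123456789":
            value = int(char)
        else:
            return False
        total += (10 - position) * value  # weights run 10, 9, ..., 1
    return total % 11 == 0


print(is_valid_isbn10("0550 10234 5"))  # True
print(is_valid_isbn10("0550 10234 6"))  # False
```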

1.1 Identifiers and metadata
An item of metadata = “a relationship that someone claims to exist between two entities” (indecs), each of which may have an identifier.
Example: [BookData says] the cover of this book (0550 10234 5) is red (Pantone 4567)

1.1 Identifiers and metadata
To be useful, an identifier requires some metadata.
Example: [Books in Print says] the title of this identified book (0550 10234 5) is Chambers Dictionary
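
The two examples above can be written down as data. This is only a sketch of the indecs idea that metadata is a claim made by someone about a relationship between two entities. The `Claim` class and its field names are hypothetical, not part of indecs or ONIX.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Claim:
    asserter: str   # who claims the relationship exists
    subject: str    # identifier of the first entity
    relation: str   # the relationship being claimed
    value: str      # identifier (or label) of the second entity


claims = [
    Claim("BookData", "0550 10234 5", "cover colour", "Pantone 4567"),
    Claim("Books in Print", "0550 10234 5", "title", "Chambers Dictionary"),
]


def describe(identifier, all_claims):
    """Collect what has been claimed about one identifier, keeping the source of each claim."""
    return {c.relation: (c.value, c.asserter) for c in all_claims if c.subject == identifier}


print(describe("0550 10234 5", claims))
```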

1.1 Identifiers and metadata
Entity: something that is identified. “Nothing exists until it is identified.” Entities may include abstractions (red), technical means (MP3 player), labels (title), things (book), etc.
Ontology: structured relationships between entities; “an explicit formal specification of how to represent the entities that are assumed to exist in some area of interest and the relationships that hold among them” (such as: “page” is a component of “book”).
Examples: indecs framework; ONIX; FRBR

1.2 Interoperability
In a distributed environment, there is no one central physical archive.
A distributed virtual archive requires that all the players and components interoperate.

1.2 Interoperability
Across media: books, serials, audio, audiovisual, software, abstract works, visual material, etc.
Across functions: cataloguing, discovery, workflow, rights management, archiving
Across levels of metadata: simple, complex
Across linguistic and semantic barriers
Across territorial barriers
Across technology platforms

1.2 Interoperability
Preservation: “How do we interoperate with the future?”
Preservation issues (identifiers, metadata, rights) are the same as any other interoperability problem.

1.3 Meanings of “identifier” [1]
Labels: the output of “numbering schemes”
ISBN: ISO 2108:1992 International Standard Book Numbering
ISSN: ISO 3297:1998 International Standard Serial Number
ISRC: ISO 3901:2001 International Standard Recording Code
ISRN: ISO 10444:1997 International Standard Technical Report Number
ISMN: ISO 10957:1993 International Standard Music Number
ISWC: ISO 15707:2001 International Standard Musical Work Code
ISAN: Draft ISO 15706 International Standard Audiovisual Number
V-ISAN: Draft ISO 20925 Version Identifier for audiovisual works
ISTC: Draft ISO 21047 International Standard Text Code
PII: Publisher Item Identifier
etc.

1.3 Meanings of “identifier” [2]
“Infrastructure specifications”: specifying how to make labels actionable. They do not generate a label, but if you have one, they specify how to use it in some particular context.
URN: Uniform Resource Name
URI: Uniform [Universal] Resource Identifier
PURL: Persistent Uniform Resource Locator
e.g. ISBN as URN
Note that the same concept also applies in non-digital contexts, e.g. ISBN as EAN (978….) bar code or RFID
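
A small sketch of the two examples named on this slide: taking an existing ISBN label and expressing it in another context. It assumes the 10-digit ISBN used earlier in the talk, the `urn:isbn:` namespace mentioned later in the slides, and the usual EAN-13 check digit rule (alternating weights 1 and 3). The function names are illustrative.

```python
def _digits(isbn10: str) -> str:
    """Strip spaces and hyphens from a printed ISBN-10."""
    return isbn10.replace(" ", "").replace("-", "")


def isbn10_to_urn(isbn10: str) -> str:
    """Express the same label as a name in the urn:isbn: namespace."""
    return "urn:isbn:" + _digits(isbn10)


def isbn10_to_ean13(isbn10: str) -> str:
    """Prefix 978, drop the old check character, and compute the EAN-13 check digit."""
    body = "978" + _digits(isbn10)[:9]
    total = sum(int(d) * (1 if i % 2 == 0 else 3) for i, d in enumerate(body))
    check = (10 - total % 10) % 10
    return body + str(check)


print(isbn10_to_urn("0550 10234 5"))    # urn:isbn:0550102345
print(isbn10_to_ean13("0550 10234 5"))  # 9780550102348
```

In both cases the label itself is unchanged; only the specification that makes it usable in a given context differs.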

1.3 Meanings of “identifier” [3]
“Implemented systems”: implement labels, through an actionable specification, in a managed way
EAN/UPC: physical product codes; implement ISO bar codes and RFIDs in the supply chain
DOI: digital object identifiers; implement URN/URIs in intellectual property (+ metadata, policy)

1.3 Meanings of “identifier”
“For use on the Internet, an ISBN label can become a URN specification; an ISBN label can be incorporated into a DOI, which is an implemented identifier system following the URI specification.”
is clearer than
“an ISBN identifier can become a URN identifier; an ISBN identifier can be incorporated into a Digital Object identifier, which is an implemented URI identifier” (?)
A particular use of the word may be a mix of meanings [1], [2] and [3].

1.4 Persistence

[Diagram: printed identifiers, bookmarks, etc. point to content by URL]

[Diagram: the URLs pointing to the content now return “404 File not found”]
“Linkrot”: recent estimates 16% in 6 months

Redirection (resolution), e.g. DOI
[Diagram: Assigner, DOI directory, URL, Content]
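
The redirection idea can be sketched as a lookup table that the assigner keeps up to date. This is a toy model, not the Handle System or the DOI directory itself; the class, the identifier `10.5555/example` and the URLs are all hypothetical.

```python
class Directory:
    """Toy resolution directory: persistent identifier -> current location."""

    def __init__(self):
        self._locations = {}

    def register(self, identifier, url):
        # The assigner calls this again whenever the content moves.
        self._locations[identifier] = url

    def resolve(self, identifier):
        return self._locations[identifier]


directory = Directory()
directory.register("10.5555/example", "http://publisher.example/old/path")

cited = "10.5555/example"  # what a reader printed or bookmarked

# The content moves; only the directory entry changes, not the cited identifier.
directory.register("10.5555/example", "http://publisher.example/new/path")

print(directory.resolve(cited))  # http://publisher.example/new/path
```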

More than just “locate”
[Diagram: Assigner, DOI directory, Content]
Response page: purchase content; view free excerpt; get related items; get archive copy; request permissions

[Diagram: Assigner, Archive, DOI directory]
Response page: purchase content; view free excerpt; get related items; get archive copy; request permissions
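
A hedged sketch of “more than just locate”: one identifier resolving to a menu of services, some run by the assigner and one by an archive. The option names come from the response page on the slides; the table layout, identifier and URLs are assumptions made for illustration.

```python
SERVICES = {
    "10.5555/example": {
        "purchase content": "http://publisher.example/buy",
        "view free excerpt": "http://publisher.example/excerpt",
        "get related items": "http://publisher.example/related",
        "get archive copy": "http://archive.example/copy",
        "request permissions": "http://publisher.example/permissions",
    }
}


def response_page(identifier, table=SERVICES):
    """List the options offered for an identifier, in alphabetical order."""
    return sorted(table.get(identifier, {}))


def follow(identifier, option, table=SERVICES):
    """Return the location behind one option on the response page."""
    return table[identifier][option]


print(response_page("10.5555/example"))
print(follow("10.5555/example", "get archive copy"))  # http://archive.example/copy
```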

1.4 Persistence
Persistent identifier
Resolution (redirection)
Persistence of the associated metadata
Persistence of the resolution system
Persistence of the identified copy (digital preservation: migration, emulation, encapsulation)
Persistence is a matter of social infrastructure: technology can help but cannot guarantee it.

Internet: DOI, URN, URL, PURL
Distinguish two issues:
1. The technical specification of “what is” a URN, a URI, etc.: identifiers in sense [2]
2. What this means for practical implementation: identifiers in sense [3]

Internet persistent id specs
See DOI Handbook: 4.9 DOI as a URI; 4.10 DOI as a URN; 6.10 DOI and PURL
Aim: persistent across time and unique across network space; useful and implemented
PURLs are tied to http and are single redirect, etc.
URIs/URNs are intended to be abstract names independent of protocols (approx.)
DOIs are URIs (formal specification)
DOIs are URNs (in effect)
URN and URI proponents disagree (and there are other proposed specs, e.g. ARK)

Internet persistent id specs
http://www.w3.org/addressing (but largely from IETF; W3C did not see the need for URN)
[Diagram: URI, URN, URL; resolution (N2L); schemes urn:, ftp:, gopher:, http:]

DOI as URI
IETF formal spec, “URI scheme for Digital Object Identifier”: Paskin, Norman; Neylon, Eamonn; Hammond, Tony; Sun, Sam; Uniform Resource Identifier (URI) scheme for Digital Object Identifiers (DOIs)
An abstract specification (uri:doi:)
Would be doi: (like tel:) [uri: is not part of the URI spec, unlike urn:]
May be a pure name or de-referenced by any service
The namespace provides its own mechanism (“bootstrapping”)
On its own, it’s just a specification! Requires code distribution for any implementation
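
To illustrate “a pure name or de-referenced by any service”, the sketch below splits a `doi:` name into prefix and suffix and, separately, hands it to one possible resolution service. It assumes the usual DOI shape (a prefix beginning `10.`, a slash, then a suffix) and uses `https://doi.org/` as the proxy address, which is an assumption and is not given in the slides. The function names are illustrative.

```python
from urllib.parse import quote


def parse_doi_uri(uri: str):
    """Split 'doi:10.1000/182' into ('10.1000', '182'); raise ValueError if malformed."""
    scheme, colon, name = uri.partition(":")
    if scheme.lower() != "doi" or not colon:
        raise ValueError("not a doi: name")
    prefix, slash, suffix = name.partition("/")
    if not prefix.startswith("10.") or not slash or not suffix:
        raise ValueError("expected prefix/suffix with a prefix starting '10.'")
    return prefix, suffix


def to_proxy_url(uri: str, proxy: str = "https://doi.org/") -> str:
    """De-reference the name through one HTTP proxy; the name itself stays protocol-free."""
    prefix, suffix = parse_doi_uri(uri)
    return proxy + prefix + "/" + quote(suffix, safe="/")


print(parse_doi_uri("doi:10.1000/182"))  # ('10.1000', '182')
print(to_proxy_url("doi:10.1000/182"))   # https://doi.org/10.1000/182
```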

DOI as URN
URN is less clear: the higher-level situation is muddy
A set of IETF drafts that define URN
A set of registered namespaces (e.g. isbn); DOI could be one but isn’t: no advantage
Unlike URI, provides a specific DNS-based middle layer (RDS) to find the appropriate resolution service
Scalability and security questioned; and little or no resolution implementation
urn:isbn:123456789 can be defined; but what does it do over and above isbn:123456789? Neither has a readily available, well-known, global resolution
A DOI is more than a URN or URI: it adds policy, business rules and a business model; it adds metadata specifications (cf. ISBN, EAN, Visa)

Outline of presentation
1. Identifiers
1.1 Identifiers and metadata
1.2 Interoperability
1.3 Different meanings of “identifier”
1.4 Persistence
2. “Keep a copy” - ?
3. Rights
3.1 Accessing “definitive copy”
3.2 Rights framework

2. “Keep a copy”
Digital preservation is “keeping a copy”
What is it you are archiving (or managing, or counting)?
What’s a copy? Something that is “the same as” something else
Is A the same as B? Consider a photocopy: text; author; work; paper; spatial location; etc.

2. “Keep a copy”
“Is A the same as B?” is meaningless
One can only say “Is A the same as B for the purpose of…?”
“The same” for some is “two different things” for others
Purpose is defined by attributes
“Nothing exists until it is identified” …and its relevant attributes identified
Structured metadata is needed (e.g. ONIX for digital preservation?)
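
A minimal sketch of “the same for the purpose of…”, using the photocopy example from the previous slide. The records, attribute names and the two purposes are invented for illustration. The point is only that sameness is decided over the attributes a purpose cares about.

```python
original = {
    "text": "full text of the article",
    "author": "A. Writer",
    "work": "An Article",
    "paper": "publisher's stock",
    "location": "library shelf",
}
photocopy = {
    "text": "full text of the article",
    "author": "A. Writer",
    "work": "An Article",
    "paper": "copier paper",
    "location": "reader's desk",
}


def same_for(purpose_attributes, a, b):
    """A and B are 'the same' for a purpose if they agree on every attribute it names."""
    return all(a.get(name) == b.get(name) for name in purpose_attributes)


READING = ("text", "author", "work")          # hypothetical purpose: reading the work
CONSERVATION = ("text", "paper", "location")  # hypothetical purpose: conserving the artefact

print(same_for(READING, original, photocopy))       # True
print(same_for(CONSERVATION, original, photocopy))  # False
```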

3.1 Accessing the definitive copy
“How can an identifier be used to locate a specific local copy, which may have different access rights?” [see www.doi.org, FAQ 26]
Resolution of identifiers to global services; contextualization of requests to those services to local requirements.
Split this into separate global and subsequent delegated local resolution steps, e.g. OpenURL: a globally-maintained database is clearly the wrong place to hold information on every local collection. (“Linking to the Appropriate Copy: Report of a DOI-Based Prototype”, O. Beit-Arie et al., D-Lib Magazine, www.dlib.org, September 2001)
A definitive archive copy could be separately identified (with its own DOI): a matter of policy
Functional granularity
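
The split into a global step and a delegated local step might look like the sketch below. It is not the prototype described in the D-Lib report and does not follow the exact OpenURL syntax; the resolver addresses, the identifier and the `id=doi:` query parameter are illustrative assumptions.

```python
from urllib.parse import urlencode

# Global step: one directory that knows only the default location.
GLOBAL_DIRECTORY = {"10.5555/example": "http://publisher.example/article"}

# Local step: each institution runs its own resolver and knows its own holdings.
LOCAL_RESOLVERS = {"library-a": "http://resolver.library-a.example/openurl"}


def resolve(doi, user_context=None):
    """Send the request to the user's local resolver if one is known, else resolve globally."""
    local_base = LOCAL_RESOLVERS.get(user_context)
    if local_base is not None:
        # The global service holds no holdings data; it only delegates the question.
        return local_base + "?" + urlencode({"id": "doi:" + doi})
    return GLOBAL_DIRECTORY[doi]


print(resolve("10.5555/example"))
print(resolve("10.5555/example", user_context="library-a"))
```

Keeping holdings data in the local resolver is what lets the global directory stay small.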

3.2 Rights framework
ISO/IEC MPEG-21 as exemplar
Digital item: a structured digital object with a standard representation, identification and metadata
The fundamental unit of distribution and transaction in the MPEG-21 framework
Maps to “Digital Object” (DOI, Digital Object Architecture) or “Resource” (IETF)
“Digital objects provide a means of organizing and identifying content for purposes of storage, access or distribution… …metadata may include restrictions on access to digital objects, notices of ownership, and licensing agreements…” (www.xiwt.org/documents/ManagAccess.html)

3.2 Rights framework
[Diagram: layers of a DRM technology platform]
Use: enforcement of rights and permissions
Application layer: rendering, environment, etc.
Expression layer: Rights Expression Language; machine-capable interpretation of rights (XrML etc.)
Vocabulary layer: rights metadata; Data Dictionary (Metadata set 1, Metadata set 2)

3.2 Rights framework
Standards infrastructure must accommodate many different components (the MPEG-21 standard has many parts)
But a structured digital object with a standard representation, identification and metadata is “the fundamental unit”
Must be interoperable with existing metadata standards, e.g. ONIX, SMPTE, so dictionaries are needed
MPEG-21 Rights Data Dictionary and Rights Expression Language
Purpose: “To achieve the goal of expressing rights for all Users of MPEG-21’s Digital Items”

Describing rights using (meta)data
Primary rights events (claims, deals) are described using pieces of data:
Rights Statement (“claim”): [party] owns [right] in [creation] in [time] and [place]
Rights Agreement (“deal”): [party] agreed with [party] in [time] and [place] that [event]
Pieces of “rights metadata” are used in each semantic structure.
Creations typically have standard identifiers, which may have associated structured data, or which may act as keys to get this data.
Other pieces of data also need standard identifiers (time, party, etc.).
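
The two templates can be held as structured records in which each slot is filled by an identifier or a value. The class and field names below simply mirror the bracketed slots on the slide; the sample values are invented, and this is not the MPEG-21 or indecs data model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RightsStatement:
    """A 'claim': [party] owns [right] in [creation] in [time] and [place]."""
    party: str
    right: str
    creation: str  # ideally a standard identifier such as a DOI or ISBN
    time: str
    place: str

    def render(self) -> str:
        return f"{self.party} owns {self.right} in {self.creation} in {self.time} and {self.place}"


@dataclass(frozen=True)
class RightsAgreement:
    """A 'deal': [party] agreed with [party] in [time] and [place] that [event]."""
    party: str
    other_party: str
    time: str
    place: str
    event: str

    def render(self) -> str:
        return (f"{self.party} agreed with {self.other_party} "
                f"in {self.time} and {self.place} that {self.event}")


claim = RightsStatement("Publisher X", "reproduction right", "0550 10234 5", "2002", "UK")
deal = RightsAgreement("Publisher X", "Library Y", "2002", "UK", "Library Y may keep an archive copy")
print(claim.render())
print(deal.render())
```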

What is “rights metadata”?
A mix of data from many sources:
Rights “events”: statements, agreements, transfers, permissions, prohibitions, requirements, assertions, approvals
Descriptive metadata: creations, creation types, contributor roles, user roles, tools, classifications, measures
Legal metadata: rights, persons, intellectual property
Financial metadata: terms, conventions
These sets of “rights metadata” are standardized and maintained in different places.

Distributed rights management
This mix of data from many sources is used in many different places by different people in chains of rights events: agreement, transfer, statement, permission, prohibition, assertion, requirement, etc.
[party] can [verb] [amount] to [creation] at [time] in [place]
Each entity can be expanded to reveal more data.
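
As an illustration of the permission template, the sketch below stores one permission and checks a requested use against every slot. All field names and sample values are hypothetical; [time] is simplified to a range of years and [amount] to a maximum count.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Permission:
    """[party] can [verb] [amount] to [creation] at [time] in [place]."""
    party: str
    verb: str
    amount: int        # maximum number of uses allowed
    creation: str      # identifier of the creation
    first_year: int    # [time], simplified to an inclusive range of years
    last_year: int
    places: frozenset  # [place], as a set of territory codes


def permits(p, party, verb, amount, creation, year, place):
    """True only if the requested use matches every slot of the permission."""
    return (party == p.party
            and verb == p.verb
            and amount <= p.amount
            and creation == p.creation
            and p.first_year <= year <= p.last_year
            and place in p.places)


grant = Permission("Library Y", "copy", 10, "10.5555/example", 2002, 2005, frozenset({"GB", "FI"}))

print(permits(grant, "Library Y", "copy", 3, "10.5555/example", 2003, "GB"))   # True
print(permits(grant, "Library Y", "copy", 11, "10.5555/example", 2003, "GB"))  # False
```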

Distributed rights management
Agreement, transfer, statement, permission, prohibition, assertion, requirement, etc.: each of these is an information object (an entity) which may need to link to or use information objects in other databases.
The information used by each must therefore be standardised/interoperable.

3.2 Rights framework
Is there a way of getting to this “interoperation of data from many sources”?
Yes: work already done shows how.

indecs (www.indecs.org)
Interoperability of Data in E-Commerce Systems
Produced principles for structured metadata and the basis for a data dictionary for interoperability
Principles used by DOI, ONIX, etc.
Applicable to other structured approaches, e.g. SMPTE (and creates a means of interoperability with them)
Now extended to rights transactions: <indecs>2rdd consortium (includes IDF)
Accepted as the basis of the MPEG-21 Rights Data Dictionary

The MPEG-21 RDD
A data dictionary is a place where the process of semantic definition meets technology
MPEG standards have traditionally been about engineering solutions
MPEG-21 is a multimedia and a lifecycle framework: its rights terminology does not exist in a vacuum
It interacts with a large number of existing and developing schemes and systems
The number of terms involved is likely to grow steadily and significantly
MPEG-21 is taking the lead in establishing an RDD; it is likely to be widely supported if it is flexible and interoperable

Rights & description are interdependent (1)
“Rights” metadata describes what people can (or can’t) do with assets, and when, where, how and with what they can do it.
“Descriptive” metadata describes what people did with assets: the same thing, but in the past. The majority of terms are common.
Any descriptive term may be relevant to the conditions of an agreement.
When new works are created through derivation, aggregation or copying, new descriptions are needed which rely on both descriptive and rights metadata.

Rights & description are interdependent (2)
Ownership changes and changes of law or jurisdiction often require querying of descriptive metadata for implementation in systems.
“Requirements” can be dependent on description in complex (and unfamiliar) ways.
Terms from descriptive schemes such as ONIX, Mi3P, DOI-NS, PRISM, MPEG7 Descriptor Schemes, DC and SCORM (and many others) will need to be integrated with any effective RDD.

Relationship with other metadata schemes
Many content metadata schemes are in use and development, and there will be many more. These all impact on rights descriptions.
Users will be reluctant (or unable) to adopt separate terms for “rights” descriptions: automated interoperability into and out of RDD terms is needed.
Users need to describe “non-digital” rights in tandem with digital.
The meaning of terms in external schemes must be fully mapped to RDD terms so that they form a part of the available data dictionary and enable users to automate their participation.
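
The mapping idea can be sketched as a hub: each external term is mapped once to a dictionary term, and translation between any two schemes goes through the hub. The scheme names, terms and `rdd:` labels below are invented placeholders, not entries from the MPEG-21 RDD, ONIX or any other real scheme.

```python
# Each (scheme, term) pair is mapped once to a hub term in the dictionary.
TO_RDD = {
    ("scheme-a", "Author"): "rdd:Creator",
    ("scheme-b", "creator"): "rdd:Creator",
    ("scheme-a", "PubDate"): "rdd:DateOfPublication",
    ("scheme-b", "issued"): "rdd:DateOfPublication",
}


def to_rdd(scheme, term):
    """Map an external term into the dictionary; KeyError means it is not yet mapped."""
    return TO_RDD[(scheme, term)]


def translate(from_scheme, term, to_scheme):
    """Translate between two external schemes by going through the hub term."""
    hub_term = to_rdd(from_scheme, term)
    for (scheme, external_term), mapped in TO_RDD.items():
        if scheme == to_scheme and mapped == hub_term:
            return external_term
    raise KeyError(f"{to_scheme} has no term mapped to {hub_term}")


print(translate("scheme-a", "Author", "scheme-b"))  # creator
```

With a hub, adding a new scheme needs one set of mappings to the dictionary rather than a separate mapping to every other scheme.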

<indecs> Data Dictionary
To provide a method for generating a set of clear, consistent, structured and integrated terms and definitions, to the required level of granularity, for an MPEG Rights Data Dictionary
To provide a comprehensive methodology for the interoperability of terms from different schemes and systems used in the management of rights and permissions through mapping
Will be used by DOI Application Profiles: DOIs can deliver this required interoperability
To describe but in no way prescribe how rights and permissions operate
To provide a framework for future governance

Outline of presentation
1. Identifiers
1.1 Identifiers and metadata
1.2 Interoperability
1.3 Different meanings of “identifier”
1.4 Persistence
2. “Keep a copy” - ?
3. Rights
3.1 Accessing “definitive copy”
3.2 Rights framework

Additional material
“DRM Technology: Identification and Metadata”, Norman Paskin. In: Digital Rights Management: Technical, Economic, Juridical and Political Aspects (ed. Becker et al.), Springer Lecture Notes in Computer Science series. In press.
“Towards a Rights Data Dictionary - Identifiers and Semantics at work on the net”. imi insights, June 2002. http://www.epsltd.com/IMI/IMI.htm (subscription access)
Copies available from author on request (n.paskin@doi.org)

Norman Paskin, International DOI Foundation
n.paskin@doi.org