Digital Object Architecture Tutorial

Slides:



Advertisements
Similar presentations
The Corporation for National Research Initiatives The Handle System Persistent, Secure, Reliable Identifier Resolution.
Advertisements

Handle System Workshop Introduction Laurence Lannom 21 June 2007 Corporation for National Research Initiatives Reston, VA
ADL Registry (Plus a Little Technological Context) Larry Lannom Corporation for National Research Initiatives
Digital Object Architecture and the Handle System Larry Lannom 20 June 2006 Corporation for National Research Initiatives
Handle System: DOI Technical Infrastructure Corporation for National Research Initiatives Larry Lannom December 10, 1997.
doi> Digital Object Identifier: overview
Contextual Linking Architecture Christophe Blanchi June Corporation for National Research Initiatives Approved for.
Corporation For National Research Initiatives DOIs and the Handle System 5 August 1998 Larry Lannom CNRI.
Resolution issues and DOI doi>. POLICIES Any form of identifier NUMBERING DESCRIPTION framework: DOI can describe any form of intellectual property, at.
Corporation For National Research Initiatives DOIs and the Handle System 7 May 1998 Larry Lannom CNRI.
Retrieval of Information from Distributed Databases By Ananth Anandhakrishnan.
A Unified Approach to Combat Counterfeiting: Use of the Digital Object Architecture and ITU-T Recommendation X.1255 Robert E. Kahn President & CEO CNRI,
Naming Computer Engineering Department Distributed Systems Course Asst. Prof. Dr. Ahmet Sayar Kocaeli University - Fall 2014.
8.
Technical Architectures
Handle System Overview February 2011 Larry Lannom Corporation for National Research Initiatives
Grid Computing, B. Wilkinson, 20043a.1 WEB SERVICES Introduction.
Handle System Overview Larry Lannom 18 May 2004 Corporation for National Research Initiatives Copyright©
Web Servers How do our requests for resources on the Internet get handled? Can they be located anywhere? Global?
OCT1 Principles From Chapter One of “Distributed Systems Concepts and Design”
2 Systems Architecture, Fifth Edition Chapter Goals Describe client/server and multi-tier application architecture and discuss their advantages compared.
1 CS 502: Computing Methods for Digital Libraries Lecture 4 Identifiers and Reference Links.
E-Business: Intra-Business E-Commerce
CORDRA Philip V.W. Dodds March The “Problem Space” The SCORM framework specifies how to develop and deploy content objects that can be shared and.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
A centralized system.  Active Directory is Microsoft's trademarked directory service, an integral part of the Windows architecture. Like other directory.
Domain Name System | DNSSEC. 2  Internet Protocol address uniquely identifies laptops or phones or other devices  The Domain Name System matches IP.
CNRI Handle System and its Applications
Resolving Unique and Persistent Identifiers for Digital Objects Why Worry About Identifiers? Individuals and organizations, including governments and businesses,
INTRODUCTION TO WEB DATABASE PROGRAMMING
The Handle System: and its role in a Digital Object Architecture Robert E. Kahn CNRI Workshop on Frontiers in Distributed Information Systems Presidio.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Internet Basics Dr. Norm Friesen June 22, Questions What is the Internet? What is the Web? How are they different? How do they work? How do they.
WSIS Forum 2011 May 19, 2011 Presentation by Robert E. Kahn
XHTML Introductory1 Linking and Publishing Basic Web Pages Chapter 3.
1 Multi Cloud Navid Pustchi April 25, 2014 World-Leading Research with Real-World Impact!
Reflections on the Digital Object Architecture by Robert E. Kahn, CNRI A presentation at a Symposium on Trusted Repositories in Rome, Italy on November.
Distributed Computing COEN 317 DC2: Naming, part 1.
1 Web Development & Design Foundations with XHTML Chapter 1 Key Concepts.
Ten Minute Handle System Overview July 2012 Larry Lannom Corporation for National Research Initiatives
 2004 Prentice Hall, Inc. All rights reserved. 1 Segment – 6 Web Server & database.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Attaching Rights to Content Larry Lannom Corporation for National Research Initiatives Copyright ©
Managing Digital Objects on the Net by Robert E. Kahn Corporation for National Research Initiatives Reston, Virginia National Online 2001 New York City.
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
CONTENT DISCOVERY, SERVICES, AND SUSTAINED ACCESS Timothy Cole, William Mischo, Beth Sandore, Sarah Shreeves ~ University of Illinois Library
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Alternative Architecture for Information in Digital Libraries Onno W. Purbo
Corporation For National Research Initiatives Technical Issues in Electronic Publishing Corporation for National Research Initiatives William Y. Arms.
1 G52IWS: Web Services Chris Greenhalgh. 2 Contents The World Wide Web Web Services example scenario Motivations Basic Operational Model Supporting standards.
CSI 3125, Preliminaries, page 1 Networking. CSI 3125, Preliminaries, page 2 Networking A network represents interconnection of computers that is capable.
Persistent Identifiers (PIDs) & Digital Objects (DOs) Christine Staiger & Robert Verkerk SURFsara.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Active Directory. Computers in organizations Computers are linked together for communication and sharing of resources There is always a need to administer.
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
1 CS 502: Computing Methods for Digital Libraries Guest Lecture William Y. Arms Identifiers: URNs, Handles, PURLs, DOIs and more.
COMPUTER NETWORKS Hwajung Lee. Image Source:
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Active Directory Domain Services (AD DS). Identity and Access (IDA) – An IDA infrastructure should: Store information about users, groups, computers and.
Digital Object Architecture (DOA) Information session
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Com-ITU/CEPT - January 2016, Copenhagen, Denmark
Internet and Distributed Application Services
Issues need harmonization
Digital Object Architecture (DOA) in ITU
The ITU-T X.500 series and X.509 in a changing world
Corporation for National Research Initiatives
CS 501: Software Engineering Fall 1999
Digital Object Interface Protocol (DOIP)
敦群數位科技有限公司(vanGene Digital Inc.) 游家德(Jade Yu.)
Presentation transcript:

Digital Object Architecture Tutorial Christophe Blanchi DONA Foundation ITU March 18 2016

Why the Digital Object Architecture? The Digital Object Architecture’s goal is to provide a solution to the following digital information management issues: Provide standard access to heterogeneous information. Identification Search and retrieval Security and trust Typing Interoperability across heterogeneous information systems. Independent of the specific underlying technologies that host and serve the information. Interoperability over long periods of time. 10 – 100 – 10000 years? Active management of the systems that the information resides on. Very large level of scalability. Distributed architecture Open architecture framework Standard protocols and procedures

A Trip Through the Internet Memory Lane The development of the Internet was about how to get digital packets reliably routed from computer A to B. Early routing information in the DARPANET was based on wire numbers. Later routing information was done using IP addresses and targeted individual machines. IP addresses where given DNS names for ease of human use. The web simplified the manner in which someone could access information by rendering it and making a “click” automate command line instructions. The web is an application of the Internet but not an Internet extension.

What is the “Internet”? Is it a network? Is it a global Information System? Is it a generic logical abstraction? None of the above? All of the above? Bob Kahn is of the opinion that the Internet is really a set of Protocols and Procedures that describes how networks can interact with each other. The underlying networks are replaceable. Its is independent of the hardware it operates on. It has scaled in a massive way. There has been little change to the networks that required changes to the the Internet protocol and procedures.

The Root of the Digital Object Architecture Information is more than packets. Information needs to be a “First Class Citizen” on the Internet. Information is complex, it has context, uses, monetary value, etc… Information needs to be locatable. Information needs to be understandable and reusable. Information needs to be protected, secure, and trusted. Information needs to persist over time. The Web has made information widely available but there are many issues that remain when dealing with information management. Big Data IoT

Digital Object Architecture’s Origins The Digital Object Architecture had its roots in a digital library project CS-TR project (Computer Science – Technical Report) funded by DARPA. The Digital Object Architecture was first described in a paper: A Framework for Distributed Digital Object Services by Kahn/Wilensky. The work was developed to be publicly available. The underlying philosophy: build the minimum that is needed to achieve the desired functionality. Define a series of protocols and procedures that are technology independent.

DOA - Information Management on Networks Client Repositories Resource Discovery Search Engines, Metadata Databases, Catalogues, Registries, etc. Identifier Resolution Service

What is a Digital Object? Handle Identifier The minimum set of attributes needed to exist on a network. A persistent, globally unique, secure identifier. State metadata about the object. Zero of more Data Elements. A Data Element has a unique identifier. A Data Element has state metadata about itself. Date Created 12-10-2014 Date Last Modified 12-11-2015 Type 0.TYPE/DS1 Author 20.200/00 Date Created 12-10-2014 Date Last Modified Data Element Identifier Size 134124 Type 0.TYPE/T1 Checksum e044b32a… Payload Date Created 12-10-2014 Date Last Modified Data Element Identifier Size 134124 Type 0.TYPE/T2 Checksum 0e9d7928b… Payload

Digital Object Interactions Digital Object Access Protocol All Digital Objects are accessible using the Digital Object Protocol. Independent of the underlying technical systems. Each Digital Object describes itself and its contents. Metadata – to find them Types – to process its data Source and integrity Each Digital Object specifies and contains its own access control rules.

The Digital Object Registry Provides a infrastructure based search system Find Digital Objects by their attributes and targeted contents. Find Handles by their values. Open source software. Supports access control based searches. Light weight interface for registration. Light weight interface for searches. Registries can be federated. Registries can be Digital Objects.

What is a Handle? 2304568.40/12345678 Prefix Suffix Naming Authority Item ID (any format) A Handle is a globally unique and resolvable identifier Prefix is resolvable by the Global Handle Registry to a Local Handle System (LHS). The identifier is resolvable by Local Handle System into set of typed values. Character Set: Unicode 2.0 Encoding: UTF-8 Prefix: Currently allocating only numeric values. Suffix: No restrictions.

Handles Resolve to Typed Data Data Type Handle Data 10.1525/b.2009.59.5.9 HS_ADMIN handle=0.na/10.1525; index=200; [delete hdl,add val,read val,modify val,del admin,add admin,list] http://caliber.ucpress.net/doi/abs/10.1525/bio.2009.59.5.9 URL 10320/loc <locations chooseby="locatt, country, weighted"> <location id="1" cr_type="MR-LIST" href="http://mr.crossref.org/ iPage?doi=10.1525%2Fbio.2009.59.5.9" weight="1" /> <location id="2" cr_src="unca" label="SECONDARY_BIOONE" cr_type="MR-LIST" href="http://www.bioone.org/doi/full/10.1525/ bio.2009.59.5.9" weight="0" /> </locations> 0000000B4453415F5055425F4B4559000000000015009760508F15230B…. HS_PUBKEY eyJhbGciOiJSUzI1NiJ9.eyJkaWdlc3RzIjp7ImFsZyI6IlNIQS0yNTYiLCJkaWdlc…. HS_SIGNATURE

Handle System Provides basic identifier resolution system for the Internet. Resolve an object identifier to its current state data. Identifier persists even if location and other attributes of the object changes. Logically a single system, but physically and organizationally distributed. Highly scalable. Associates one or more typed values, e.g., IP address, public key, URL, to each identifier. Optimized for speed and reliability. Secure resolution with its own PKI as an option. Open, well-defined protocol and data model. Provides infrastructure for application domains, e.g., digital libraries & publishing, e-research, id mgmt.

Handle Resolution …... The Handle System is a collection of Handle MPA 1 MPA 2 MPA 3 MPA 4 MPA 6 DONA Resolution request For 10.1525/59.5.9 Global Handle Registry Local Handle Service Site 1 Site 2 Site 3 …... Site n The Handle System is a collection of Handle Services, each of which consists of one or more replicated sites, each of which may have one or more servers. #1 #2 #n #4 #3 ... #1 #2 http://www.acme.com/ URL 10.1525/59.5.9 4 20

Global Handle Registry Handle Clients hdl:12.23/56 1. Client sends request any MPA GHR Service in the GHR to resolve 0.NA/12.34 (prefix handle for 12.34/56) Global Handle Registry Client gets request to resolve hdl:12.34/56

Handle Clients Global Handle Registry hdl:12.34/56 Global Handle Registry 2. An MPA GHR service Responds with Service Information for 12.34 IP xc xc xc ... xcccxv xccx xc xc xc .. xc xc Client gets request to resolve hdl:12.34/56 xcccxv xccx xc xc xc .. xc xc xcccxv xccx xc xc xc .. Service Information Acme Local Handle Service

Handle Clients ... ... ... ... ... ... ... IP Address Port # xcccxv xc .. ... xccx IP Address Port # Public Key ... Primary Site Server 1 123.45.67.8 2641 K03RLQ... ... ... Server 2 123.52.67.9 2641 5&M#FG... Secondary Site A Server 1 321.54.678.12 2641 F^*JLS... ... Server 2 321.54.678.14 2641 3E$T%... ... Server 3 762.34.1.1 2641 A2S4D... ... Secondary Site B Server 1 123.45.67.4 2641 N0L8H7... ... Service Information - Acme Local Handle Service

Handle Clients ... ... ... ... ... ... ... IP Address Port # xcccxv xc .. ... xccx IP Address Port # Public Key ... Primary Site Server 1 123.45.67.8 2641 K03RLQ... ... ... Server 2 123.52.67.9 2641 5&M#FG... Secondary Site A Server 1 321.54.678.12 2641 F^*JLS... ... Server 2 321.54.678.14 2641 3E$T%... ... Server 3 762.34.1.1 2641 A2S4D... ... Secondary Site B Server 1 123.45.67.4 2641 N0L8H7... ... Service Information - Acme Local Handle Service

Handle Clients ... ... ... ... ... ... ... IP Address Port # xcccxv xc .. ... xccx IP Address Port # Public Key ... Primary Site Server 1 123.45.67.8 2641 K03RLQ... ... ... Server 2 123.52.67.9 2641 5&M#FG... Secondary Site A Server 1 321.54.678.12 2641 F^*JLS... ... Server 2 321.54.678.14 2641 3E$T%... ... Server 3 762.34.1.1 2641 A2S4D... ... Secondary Site B Server 1 123.45.67.4 2641 N0L8H7... ... Service Information - Acme Local Handle Service

Global Handle Registry Handle Clients hdl:123/456 Global Handle Registry 3. Client queries Server 3 in Secondary Site A for 12.34/56 Client gets request to resolve hdl:12.34/56 #1 #1 #2 Secondary Site B #2 #1 #3 Primary Site Secondary Site A Acme Local Handle Service

Global Handle Registry Handle Clients hdl:123/456 Global Handle Registry 4. Server responds with handle data Client gets request to resolve hdl:12.34/56 #1 #1 #2 Secondary Site B #2 #1 #3 Primary Site Secondary Site A Acme Local Handle Service

Handle Clients Resolution With a Web Browser HTTP Get Proxy/Web Server http://hdl.handle.net/12.34/56 Resolution With a Web Browser Proxy/Web Server HTTP Get LHS GHR Handle Resolution Handle System

Handle Clients Resolution With a Web Browser HTTP Redirect http://acme.com/index.html Resolution With a Web Browser HTTP Redirect Proxy/Web Server LHS GHR Handle Data Handle System

Handle Clients Resolution with a Handle Client Plug-in Handle Data hdl:12.34/56 Resolution with a Handle Client Plug-in Handle Data Handle Resolution LHS GHR Handle System

Web Server and/or Admin Handle Clients Handle Admin via Web Form Web Server and/or Admin Servlets LHS GHR Handle System

Web Server and/or Admin Handle Clients Handle Admin via Web Form Web Server and/or Admin Servlets LHS GHR Handle System

Handle Clients Custom Admin Client LHS GHR Handle System

Handle Administration Handle Clients Handle Resolution Embedded in Another Process Handle Administration Embedded in Another Process LHS GHR Handle System

Who is responsible for operating the GHR? The original GHR was operated by CNRI in Reston VA in the US since the mid to late 1990s. Until recently, CNRI had the sole credential and authorization to create all new prefixes. The user community requested that the prefix creation be decentralized across multiple organizations. Development of a new Multi-Primary GHR architecture. Multi-Primary Administrators (MPA): organizations / entities that are credentialed and authorized by DONA to create derived prefixes. Each MPA is allotted a single prefix (e.g. 0.NA/21) Every MPA can create an unlimited number of derived prefixes from its allotted prefix and allot them to whomever they see fit. All MPA verify and replicate any and all valid prefix creations from all other MPAs. DONA and the MPAs coordinate the operations of the GHR. The new GHR maintain backwards compatibility with legacy GHR.

Legacy – CNRI as sole GHR administrator. CNRI GHR CNRI Primary CNRI Mirror Organization A Prefix 123 Could create handles 123/suffix on their LHS Organization B Prefix 234 Could create handles 234/suffix on their LHS Organization N Prefix xyz Could create handles xyz/suffix on their LHS

New GHR – Coordinated Across MPAs Multi Organizational Coordinated GHR DONA GHR SVC MPA Org A GHR SVC 20 MPA Org N GHR SVC 99 LHS 20.1 LHS 20.10 LHS 99.1 LHS 20.1.1

The Role of the DONA Foundation Based in Geneva Switzerland. Provide coordination, software, software development, and other strategic services for the technical development, evolution, application, and other uses in the public interest around the world of the Digital Object Architecture (DOA) with a mission to promote interoperability across heterogeneous information systems. DONA will promote the X.1255 standard and the use of the DOA across many different countries, domains, and industries. Make the developed DOA standards and/or software accessible to the community to further their development and adoption.

DONA Foundation’s GHR Operations DONA coordinates with all Multi-Primary Administrators (MPA) to maintain the stable and secure operation of the the Global Handle Registry (GHR). DONA credentials and authorizes new MPAs. The DONA Foundation will work in collaboration with the MPAs to improve the architectural, technical, and performance of the GHR. The Multi-Primary GHR Operations started on the 9th of December 2015.

Questions?