Producer-Archive Workflow Network (PAWN) Goals Consistent with the Open Archival Information System (OAIS) model Use of web/grid technologies and platform.

Slides:



Advertisements
Similar presentations
Introduction to TransXChange
Advertisements

MIT Lincoln Laboratory A Service-Oriented Approach to Application Development Robert Darneille & Gary Schorer WPI MQP Presentations ICS Group 10 October.
Fedora Users’ Conference Rutgers University May 14, 2005 Researching Fedora's Ability to Serve as a Preservation System for Electronic University Records.
PKI Activities at Virginia January 2004 CSG Meeting Jim Jokl.
DESIGNING A PUBLIC KEY INFRASTRUCTURE
National Center for Supercomputing Applications Integrating MyProxy with Site Authentication Jim Basney Senior Research Scientist National Center for Supercomputing.
Introduction to PKI Seminar What is PKI? Robert Brentrup July 13, 2004.
ADAPT An Approach to Digital Archiving and Preservation Technology Principal Investigator: Joseph JaJa Lead Programmers: Mike Smorul and Mike McGann Graduate.
PAWN: Producer-Archive Workflow Network University of Maryland Institute for Advanced Computer Studies Joseph Ja’Ja, Mike Smorul, Mike McGann.
May Archiving PAWN: A Policy-Driven Software Environment for Implementing Producer- Archive Interactions in Support of Long Term Digital.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Supporting Customized Archival Practices Using the Producer-Archive Workflow Network (PAWN) Mike Smorul, Mike McGann, Joseph JaJa.
Brief Overview of Major Enhancements to PAWN. Producer – Archive Workflow Network (PAWN) Distributed and secure ingestion of digital objects into the.
The KnowledgeBank: Powered by DSpace Laura Tull Systems Librarian Ohio State University Libraries WiLSWorld July 27, 2004.
July NAGARA 1 Producer-Archive Workflow Network Mike Smorul, Mike McGann, Joseph JaJa Institute for Advanced Computer Science Studies University.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
PAWN Progress July 06, Overview of changes New flexible environment for setting up and managing interactions between producers and the archive Domains.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
CN1276 Server Kemtis Kunanuraksapong MSIS with Distinction MCTS, MCDST, MCP, A+
Robust Technologies for Automated Ingestion and Long-Term Preservation of Digital Information Principal Investigator: Joseph JaJa Lead Programmers: Mike.
PAWN: Producer-Archive Workflow Network University of Maryland Institute for Advanced Computer Studies Joseph JaJa, Mike Smorul, Mike McGann.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
PAWN: Producer-Archive Workflow Network University of Maryland Institute for Advanced Computer Studies Joseph Ja’Ja, Mike Smorul, Mike McGann.
UMIACS PAWN, LPE, and GRASP data grids Mike Smorul.
Copyright, 1996 © Dale Carnegie & Associates, Inc. Digital Certificates Presented by Sunit Chauhan.
Robust Technologies for Automated Ingestion and Long-Term Preservation of Digital Information PI: Joseph JaJa Co-PIs: Allison Druin and Doug Oard Major.
Chapter 2 Database Environment Pearson Education © 2014.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
Archival Prototypes and Lessons Learned Mike Smorul UMIACS.
Chapter 4 Database Management Systems. Chapter 4Slide 2 What is a Database Management System (DBMS)?  Database An organized collection of related data.
Christopher Chapman | MCT Content PM, Microsoft Learning, PDG Planning, Microsoft.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
An Overview of Selected ISO Standards Applicable to Digital Archives Science Archives in the 21st Century 25 April 2007 Donald Sawyer - NASA/GSFC/NSSDC.
OPeNDAP Hyrax Back-End Server (BES) Authentication and Authorization Patrick West
How to build your own Dark Archive (in your spare time) Priscilla Caplan FCLA.
Rule-Based Data Management Systems Reagan W. Moore Wayne Schroeder Mike Wan Arcot Rajasekar {moore, schroede, mwan, {moore, schroede, mwan,
Computer Security: Principles and Practice First Edition by William Stallings and Lawrie Brown Lecture slides by Lawrie Brown Chapter 22 – Internet Authentication.
Certificate-Based Operations. Module Objectives By the end of this module participants will be able to: Define how cryptography is used to secure information.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
1 Vigil : Enforcing Security in Ubiquitous Environments Authors : Lalana Kagal, Jeffrey Undercoffer, Anupam Joshi, Tim Finin Presented by : Amit Choudhri.
Implementor’s Panel: BL’s eJournal Archiving solution using METS, MODS and PREMIS Markus Enders, British Library DC2008, Berlin.
The FCLA Digital Archive Joint Meeting of CSUL Committees, 2005.
Lifecycle Metadata for Digital Objects October 18, 2004 Transfer / Authenticity Metadata.
Elmasri and Navathe, Fundamentals of Database Systems, Fourth Edition Copyright © 2004 Pearson Education, Inc. Slide 2-1 Data Models Data Model: A set.
Bayu Adhi Tama, M.T.I 1 © Pearson Education Limited 1995, 2005.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
M-1 INGEST OVERVIEW Don Sawyer National Space Science Data Center NASA/GSFC October 13, 1999.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
Partnerships in Innovation: Serving a Networked Nation Grid Technologies: Foundations for Preservation Environments Portals for managing user interactions.
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
NARA Report: NARA Persistent Archives Prototype Bill Underwood GTRI, Atlanta CCSDS, MOIMS DAI / IPR WGs Toulouse, 2 Nov-5 Nov 2004.
Active Directory Domain Services (AD DS). Identity and Access (IDA) – An IDA infrastructure should: Store information about users, groups, computers and.
Preservation Data Services Persistent Archive Research Group Reagan W. Moore October 1, 2003.
PAWN: Producer-Archive Workflow Network
Joint Meeting of CSUL Committees,
Joseph JaJa, Mike Smorul, and Sangchul Song
A step-by-step guide to DOI registration
Chapter 2 Database Environment Pearson Education © 2009.
Goals Introduce the Windows Server 2003 family of operating systems
Implementing an Institutional Repository: Part II
Database Environment Transparencies
敦群數位科技有限公司(vanGene Digital Inc.) 游家德(Jade Yu.)
Implementing an Institutional Repository: Part II
How to Implement an Institutional Repository: Part II
Presentation transcript:

Producer-Archive Workflow Network (PAWN) Goals Consistent with the Open Archival Information System (OAIS) model Use of web/grid technologies and platform independent Ease of integration with current pilot system based on data grids XML representation of metadata and bitstreams Accountability of transfer and guarantee of data integrity Project Members Joseph JaJa Mike Smorul Yang Wang Mike McGann Fritz McCall Chris Wambler Gary Jackson Tim Norris Producer ComponentsArchive Components Database to track registered objects Certificate Authority management Management server supplies web service interfaces to ingestion clients and management operations. Clients are designed to be standalone, with security certificates issued by producer Receiving servers validate connecting clients and validate SIPs Validation Services are simple webservice calls. Abstract I/O layer into digital archive. All components are scalable using standard load balancing techniques. Secure Distributed Ingestion Distributed security management through multiple Certificate Authorities (CA) Compatible with existing producer CA’s SSL encrypted and authenticated connections Automatic Certificate Revocation List (CRL) checking Scalable using standard load balancing technology Ingestion Workflow 1.Negotiate Submission Agreement. Create XML document regarding expected file formats, metadata, and layout of submission 2.Workflow Initialization and Submission Information Packet (SIP) creation. Trust relationship between Archive and Producer is established Clients are issued and register data 3.Transfer of SIPs to archive. A Submission Information Packet is created on a client. Client contacts archive and transfers SIP 4.Validation of SIP transfer Metadata and bitstreams are checked for integrity against checksums All items are also checked against requirements document Bitstreams are validated against test specified in requirements document. 5.Organization of data and transfer into persistent archive. Metadata may be transformed into an optimal object format depending on digital archive requirements Defining an Information Packet PAWN uses the Metdata Encoding and Transmission Standard (METS) schema to describe the contents and metadata of a Submission Information Packet (SIP). Each client generates a SIP containing a METS XML document and bitstreams to transfer to an archive. PAWN uses a template document based on METS combined with a set of rules that allow PAWN to enforce restrictions on how a SIP should arrive at an archive. These restrictions allow for the following types of control: Structural Limitations on the hierarchical ordering of document can be enforces Format Formats can be defined in a few ways, including required validation tests as defined by an archive, or simpler mime-types Metadata Metadata can be restricted by schema to certain structural areas PAWN Client Multiple PAWN clients run at each producer, each client can independently register and transfer holdings to an archive. Clients perform two functions, registering its holdings with a producer management server, and later transferring its holdings to an archive. During registration clients will notify a management server about holdings that it wants to transfer to an archive, along with metadata that is locally harvested. After registration a client will later create a SIP and transfer it to an archive. The two step transfer process allows oversight at the producer. Between registration and submission of data, context at a producer wide level may be attached to holdings.