Preservation Data Services Persistent Archive Research Group Reagan W. Moore October 1, 2003.

Slides:



Advertisements
Similar presentations
Building Shared Collections Using the Storage Resource Broker Storage Resource Broker Reagan W. Moore
Advertisements

3 September 2004NVO Coordination Meeting1 Grid-Technologies NVO and the Grid Reagan W. Moore George Kremenek Leesa Brieger Ewa Deelman Roy Williams John.
San Diego Supercomputer Center & National Partnership for Advance Computational Infrastructure Storage Resource Broker Reagan W. Moore San Diego Supercomputer.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids for Collection Federation Reagan W. Moore University.
OGF-23 iRODS Metadata Grid File System Reagan Moore San Diego Supercomputer Center.
© 2007 Open Grid Forum Data Management Challenge - The View from OGF OGF22 – February 28, 2008 Cambridge, MA, USA Erwin Laure David E. Martin Data Area.
The Storage Resource Broker and.
Overview of the SDSC Storage Resource Broker Wayne Schroeder (and other SRB team members) May, 2004 San Diego Supercomputer Center, University of California.
Peter Berrisford RAL – Data Management Group SRB Services.
Digital Preservation Lifecycle Management Building a demonstration prototype for the preservation of large-scale multi-media collections Arcot Rajasekar.
Data Management Expert Panel - WP2. WP2 Overview.
Data Grid: Storage Resource Broker Mike Smorul. SRB Overview Developed at San Diego Supercomputing Center. Provides the abstraction mechanisms needed.
San Diego Supercomputer Center National Partnership for Advanced Computational Infrastructure Data Grids, Digital Libraries, and Persistent Archives ESIP.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids Reagan W. Moore San Diego Supercomputer Center.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids, Digital Libraries and Persistent Archives Reagan.
NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE SAN DIEGO SUPERCOMPUTER CENTER Particle Physics Data Grid PPDG Data Handling System Reagan.
San Diego Supercomputer Center National Partnership for Advanced Computational Infrastructure Integration of Data Grids, Digital Libraries, and Persistent.
San Diego Supercomputer Center NARA Research Prototype Persistent Archive Building Preservation Environments with Data Grid Technology (NARA Research Prototype.
San Diego Supercomputer CenterNational Partnership for Advanced Computational Infrastructure1 Grid Based Solutions for Distributed Data Management Reagan.
Federating Archives in the DELAMAN Network Reagan W. Moore San Diego Supercomputer Center Storage Resource.
Security Requirements for Shared Collections Storage Resource Broker Reagan W. Moore
“Enabling Success: IT Infrastructure & Repositories” Andrew Bennett, University of Qld Library APSR : The Successful Repository University of Queensland.
VL-e PoC Introduction Maurice Bouwhuis VL-e work shop, April 7 th, 2006.
Applying Data Grids to Support Distributed Data Management Storage Resource Broker Reagan W. Moore Ian Fisk Bing Zhu University of California, San Diego.
Please Describe Data ingestion. This includes support for real-time sensor data (object ring buffers) as well as simulation output (grid portals) –We have.
Modern Data Management Overview Storage Resource Broker Reagan W. Moore
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
UMIACS PAWN, LPE, and GRASP data grids Mike Smorul.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Data Grid Web Services Chip Watson Jie Chen, Ying Chen, Bryan Hess, Walt Akers.
Data Grid Interactions with Firewalls Michael Wan Reagan Moore SDSC/UCSD/NPACI.
SDSC Projects Part 1: BUILDING PRESERVATION ENVIRONMENTS (Reagan Moore, Storage Resource Broker (SRB) and collection migration technologies:
San Diego Supercomputer Center National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center National Partnership for Advanced.
Data Grids and Data Management Storage Resource Broker Reagan W. Moore
National Partnership for Advanced Computational Infrastructure Digital Library Architecture Reagan Moore Chaitan Baru Amarnath Gupta George Kremenek Bertram.
San Diego Supercomputer CenterUniversity of California, San Diego Preservation Research Roadmap Reagan W. Moore San Diego Supercomputer Center
Information Management and Distributed Data Reagan W. Moore Wayne Schroeder Mike Wan Arcot Rajasekar Richard Marciano {moore, schroede, mwan, sekar,
Jan Storage Resource Broker Managing Distributed Data in a Grid A discussion of a paper published by a group of researchers at the San Diego Supercomputer.
Data Grids and Data Management Storage Resource Broker Reagan W. Moore
Managing Simulation Output Storage Resource Broker Reagan W. Moore
Rule-Based Data Management Systems Reagan W. Moore Wayne Schroeder Mike Wan Arcot Rajasekar {moore, schroede, mwan, {moore, schroede, mwan,
San Diego Supercomputer Center National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center National Partnership for Advanced.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Data Grid Services/SRB/SRM & Practical Hai-Ning Wu Academia Sinica Grid Computing.
Production Data Grids SRB - iRODS Storage Resource Broker Reagan W. Moore
Data Grid Management Systems (DGMS) Arun Jagatheesan San Diego Supercomputer Center
Rule-Based Preservation Systems Reagan W. Moore Wayne Schroeder Mike Wan Arcot Rajasekar Richard Marciano {moore, schroede, mwan, sekar,
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Archive for the NSDL Reagan W. Moore Charlie Cowart.
San Diego Supercomputer CenterNational Partnership for Advanced Computational Infrastructure1 Data Grids, Digital Libraries, and Persistent Archives Reagan.
Michael Doherty RAL UK e-Science AHM 2-4 September 2003 SRB in Action.
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
Introduction to The Storage Resource.
M-1 INGEST OVERVIEW Don Sawyer National Space Science Data Center NASA/GSFC October 13, 1999.
SDSC Storage Resource Broker & Meta-data Catalog SRB Archives HPSS, ADSM, UniTree, DMF Databases DB2, Oracle, Sybase File Systems Unix, NT, Mac OSX Application.
Partnerships in Innovation: Serving a Networked Nation Grid Technologies: Foundations for Preservation Environments Portals for managing user interactions.
National Archives and Records Administration1 Integrated Rules Ordered Data System (“IRODS”) Technology Research: Digital Preservation Technology in a.
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
The Storage Resource Broker and.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
Building Preservation Environments Reagan W. Moore San Diego Supercomputer Center Storage Resource Broker.
Building Preservation Environments from Federated Data Grids Reagan W. Moore San Diego Supercomputer Center Storage.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil
Collection Based Persistent Archives
Policy-Based Data Management integrated Rule Oriented Data System
Arcot Rajasekar Michael Wan Reagan Moore (sekar, mwan,
VORB Virtual Object Ring Buffers
Technical Issues in Sustainability
Presentation transcript:

Preservation Data Services Persistent Archive Research Group Reagan W. Moore October 1, 2003

OGSA cross WG discussion template2 Outline Requirements Key concepts/functionality Architecture/Model (if any) Services/portTypes (if any) Relation with invited groups

OGSA cross WG discussion template3 Requirements Need variety of interfaces –Support access through archivist selected API: C library, C++ library, Shell command, Perl, Python, Windows DLL, Mac DLL, Windows browser, Web browsers, OAI, WSDL, Linux I/O redirection, Java, GridFTP Manage consistency between context (state information resulting from service) and content (digital entities) Support transformative migrations between data types –DFDL based description of component structure –METS based description of compound document Manage authenticity –Digital signatures, audit trails, collection-owned data, procedures for validating digital entity/collection Support persistent archive –Logical name space for infrastructure independent name –Manage technology evolution (standards, encoding format, software)

OGSA cross WG discussion template4 Key concepts Automation of all archival processes –Logical name spaces for data, resources, users, applications Four identifiers: Unique handle, Descriptive metadata, Logical name, Physical file name –Build a persistent service that survives across technology evolution –Support collection-owned data, specify roles for each user and access controls on each digital entity –Manage logical name space as a collection hierarchy, and manage consistency of state information mapped onto the logical name space … –Provide bulk operations (registration, load, unload, metadata update, …) Archival processes to generate archival context –Build upon a standard set of operations that can be performed on the metadata, collection, data, and storage systems –Collection tokens that define restricted semantics –Operations for interacting with catalogs in a database, digital entities in a storage repository –Bulk operations to improve performance

OGSA cross WG discussion template5 SRB server SRB agent SRB server Federated Client Server MCAT Read Application SRB agent Logical Name Or Attribute Condition 1.Logical-to-Physical mapping 2.Identification of Replicas 3.Access & Audit Control Peer-to-peer Brokering Server(s) Spawning Data Access Parallel Data Access R1 R2 5/6

OGSA cross WG discussion template6 Shell / Perl / Python Java, NT Browsers OAI WSDL GridFTP http Modular Architecture (Add new APIs, new Storage Repositories, new Information Repositories) Archives HPSS, ADSM, UniTree, DMF Databases DB2, Oracle, SQLserver, Postgres, mySQL File Systems Unix, NT, Mac OSX Application HRM, ORB Access APIs Servers Storage Abstraction Catalog Abstraction Databases DB2, Oracle, Sybase, Postgres, mySQL C, C++, Libraries Logical Name Space Latency Management Data Transport Metadata Transport Consistency Management / Authorization-Authentication MCAT Enabled Server Linux I/O Mac DLL / Windows DLL

OGSA cross WG discussion template7 Basic Interaction Mechanisms Access mechanisms that require remote operations –Byte level access –Latency management mechanisms –Object oriented access –Heterogeneous system access (database operations, ORB operations, HRM operations) –See “Recommendation for Standard Operations at Remote Sites” An access mechanism is any operation that may require interaction between manipulation and transport –Example - streaming of partial results as process is executed

OGSA cross WG discussion template8 Consistency between Context and Content Consistent management of state information generated by services. Examples: –Management of replicas –Synchronization of replicas –Aggregation of data in containers –Write locks on containers –Replication of containers –Authenticity metadata - audit trails Consistency on bulk operations –Roll-back on partial completion –Synchronization across storage repository outages –Load leveling vs. fault tolerance vs. replication

OGSA cross WG discussion template9 GGF / Standards Interactions Data Format Description Language - digital ontology for describing structure Data Transport - remote operations vs transport Grid File System - remote operations vs consistency Grid Protocol Architecture - Consistency between context and content Semantic Web OWL - Ontology Web Language Digital Library Federation METS - Metadata Encoding and Transmission Standard NSF Digital Library Initiative OAI- Open Archive Initiative NASA/NARA OAIS - Open Archival Information System