DIGITAL PRESERVATION IN HYDRA/FEDORA, March 24, 2015: GET A HEAD ON YOUR REPOSITORY


1 DIGITAL PRESERVATION IN HYDRA/FEDORA, March 24, 2015: GET A HEAD ON YOUR REPOSITORY

2 About Hydra/Fedora: Flexible Extensible Digital Object Repository Architecture. Open-source project. Provides a platform for digital preservation and presentation. Used by hundreds of organizations, with over 52 Fedora Members contributing financially; Yale is one of these. Originally developed at Cornell, now led by the Fedora Project Steering Group under the stewardship of DuraSpace.org (http://www.fedora-commons.org). Yale is also a Fedora development partner, and Mike Friscia serves on the Fedora Leadership Committee. Currently actively engaged in development of Fedora 4.

3 Hydra: began in 2008 as a collaboration between Stanford, UVA, Univ. of Hull, and Fedora Commons. YUL joined in 2013 as the 18th member. Membership is now up to around 27; recent additions include Princeton, Cornell, and Case Western. Another 25 or more institutions are working in the Hydra framework without yet being formal members, including Brown, Johns Hopkins, Trinity College Dublin, Oxford, UC Berkeley, and others.

4 Hydra Partners (OR = Open Repositories Conference)

5 DuraSpace (f); Stanford University (f); University of Hull (f); University of Virginia (f); MediaShelf; University of Notre Dame; Northwestern University; Columbia University; Penn State University; Indiana University; London School of Economics and Political Science; Rock and Roll Hall of Fame and Museum; Royal Library of Denmark; Data Curation Experts; WGBH; Boston Public Library; Duke University; Yale University; Virginia Tech; University of Cincinnati; Princeton University Library; Cornell University; Oregon Digital (University of Oregon and Oregon State University); Case Western Reserve University; Tufts University; Duoc UC; University of Alberta

6 A Worldwide Presence

7 GET A HEAD ON YOUR REPOSITORY: Hydra at Yale

8 What Is Hydra? 1. A framework for repository-powered applications, with multiple, tailored UIs and a robust repository back end: one body, many heads. 2. A set of solution bundles. 3. A community: if you want to go fast, go alone; if you want to go far, go together.

9 Architecture diagram labels: Fedora (Preservation); Solr (Index); Active Fedora and Solrizer; Search/Facet Logic; Hydra-Head (creating and managing objects, CRUD); Blacklight (discovering and viewing objects, R); Ladybird (Yale’s cataloging tool); Managed Storage; Bookreader Complex Object Display; Single Image Zoom; Media Server; Metadata; Images; Image Request; Image Retrieval; Downloadable PDF; Link to Images; Hydra Interface (IT use only); Data Import; RSS; SQL Server; Hydra Access Controls

10 Content model

11 Access Conditions: defined for each file in a content model; wide range of authorization definitions; customizable. Example:

12 Ingest Workflow

13 Research Data into Hydra: Colectica software exports contents in BagIt format; the bag enters a watched folder in Ladybird; Ladybird validates the bag contents (checksum validation, file characterization); Ladybird maintains the original file hierarchy as a collection of complex objects; each Ladybird object is mapped to an Unstructured Content Model; each content model is then ingested into Hydra.
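
As a rough illustration of the checksum-validation step, the sketch below checks a bag's payload against its SHA-256 manifest. It is not Ladybird's code. It assumes a bag laid out per the BagIt convention, with a manifest-sha256.txt file holding one "checksum path" pair per line and payload files under data/. The function names (sha256_of, validate_bag) are hypothetical.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large payload files need not fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def validate_bag(bag_dir: Path) -> list[str]:
    """Return a list of problems; an empty list means payload and manifest agree."""
    manifest = bag_dir / "manifest-sha256.txt"
    if not manifest.is_file():
        return ["missing manifest-sha256.txt"]

    problems = []
    listed = set()
    for line in manifest.read_text(encoding="utf-8").splitlines():
        line = line.strip()
        if not line:
            continue
        # Each manifest line is "<checksum> <relative path>".
        expected, rel_path = line.split(maxsplit=1)
        listed.add(rel_path)
        target = bag_dir / rel_path
        if not target.is_file():
            problems.append(f"missing file: {rel_path}")
        elif sha256_of(target) != expected.lower():
            problems.append(f"checksum mismatch: {rel_path}")

    # Payload files that the manifest does not mention are also reported.
    data_dir = bag_dir / "data"
    if data_dir.is_dir():
        for item in sorted(data_dir.rglob("*")):
            if item.is_file():
                rel = item.relative_to(bag_dir).as_posix()
                if rel not in listed:
                    problems.append(f"unlisted file: {rel}")
    return problems
```

A watched-folder process could call validate_bag on each new bag and pass it on to ingest only when the returned list is empty.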

14 Unstructured Content model

16 DPN (Digital Preservation Network)

17 GET A HEAD ON YOUR REPOSITORY: Digital Preservation in Hydra

18 Hydra Solution Bundles: Sufia, CurateND, ScholarSphere, HydraDAM, Argo, Chronos

22 Preservation Profiles (table; the column positions of the X marks did not survive extraction). Column groups: Encrypted, Integrity check, Storage pillars. Columns: I, II, III, IV, V-VIII, IX, X, XI. Preservation profiles, each followed by its marks as extracted: 1: Storage without bit preservation [X X]; 2: Digital born collection of material that has access restrictions [XXXXX X]; 3: Legally deposited born digital material that is not in the Webarchive [XXXXX X]; 4: Born digital collection material, without access restrictions [XXXXX X]; 5: Retro digitized (expensive) materials with analog copies [XXXX]; 6: Secret digital materials [X XXXXX]; 7: Top secret digital materials [X X]

23 GET A HEAD ON YOUR REPOSITORY: Future Development

24 Fedora 4 Roadmap: Audit Service; Portland Common Data Model; Migration Tools; Asynchronous Storage; Linked Data Platform; Managed External Data Streams

25 Fedora 4 Auditing: track events (agent, date, activity, entity); allow import/export of events; high performance; stored separately from repository entities; export in RDF format; provide a SPARQL query search endpoint.
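
To make the event fields concrete, here is a minimal sketch of one audit event (agent, date, activity, entity) exported as RDF in N-Triples syntax. The slide does not say which vocabulary Fedora 4 uses, so the W3C PROV-O terms chosen here are an assumption, and the AuditEvent class and the example.org URIs are hypothetical.

```python
from dataclasses import dataclass

# Vocabulary IRIs. Using PROV-O for audit events is an assumption of this sketch.
PROV = "http://www.w3.org/ns/prov#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
XSD_DATETIME = "http://www.w3.org/2001/XMLSchema#dateTime"


@dataclass(frozen=True)
class AuditEvent:
    """Hypothetical record holding the four fields named on the slide."""
    event_uri: str   # identifier for the event itself
    agent_uri: str   # who or what performed the action
    date: str        # ISO 8601 timestamp
    activity: str    # short label such as "ingest"
    entity_uri: str  # repository object the event concerns

    def to_ntriples(self) -> str:
        """Serialize the event as five N-Triples statements."""
        # Escape backslashes and quotes so the label is a valid literal.
        label = self.activity.replace("\\", "\\\\").replace('"', '\\"')
        lines = [
            f"<{self.event_uri}> <{RDF_TYPE}> <{PROV}Activity> .",
            f'<{self.event_uri}> <{RDFS_LABEL}> "{label}" .',
            f"<{self.event_uri}> <{PROV}wasAssociatedWith> <{self.agent_uri}> .",
            f'<{self.event_uri}> <{PROV}startedAtTime> "{self.date}"^^<{XSD_DATETIME}> .',
            f"<{self.event_uri}> <{PROV}used> <{self.entity_uri}> .",
        ]
        return "\n".join(lines) + "\n"


event = AuditEvent(
    event_uri="http://example.org/audit/1",
    agent_uri="http://example.org/agents/ladybird",
    date="2015-03-24T12:00:00Z",
    activity="ingest",
    entity_uri="http://example.org/objects/42",
)
print(event.to_ntriples())
```

Because the events are stored separately from repository entities, triples like these could be loaded into a separate triplestore and queried over SPARQL.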

26 Portland Common Data Model

27 HydraDAM2

28 -grants-digital-preservation.shtml

29 GET A HEAD ON YOUR REPOSITORY: Hydra Infrastructure

30 Hydra Architecture: open source, community-developed software (Fedora Commons, Apache Solr, Blacklight, MySQL); Hydra Project open source, community-developed software; locally developed software (Ladybird, Media Delivery Service).

31 Repository Storage – Current State (diagram labels): Yale ITS Disk-based Enterprise Storage; Yale Library Tape-based Archival Storage; Repository; New Haven/West Haven, CT; Rocky Hill, CT; Iron Mtn., Offline Replicated Set - Tape

32 Repository Storage – Current State. Risks of current state: data resides in a single region, the Northeast; tape media handling and refresh constraints at petabyte scale; one-month window in which primary and backup are in the same location.

33 Repository Storage – Future State (diagram labels): Yale ITS Disk-based Enterprise Storage; Repository; New Haven/West Haven, CT; Out-of-Region: Digital Preservation Network, or Yale ITS out-of-region storage, or cloud storage provider (e.g., Amazon Glacier)

34 GET A HEAD ON YOUR REPOSITORY: Yale Hydra Roadmap

35 Migrations in Progress

36 Hydra Growth at Yale (TB)

37 Hydra Roadmap: complete Kissinger collection (1.7 million pages, 10 million files); complete migration of legacy digital collections; discovery and display for curated research data; self-archiving (Sufia) project with ITS to support Yale faculty, student, and research content (first Fedora 4 collections); move all collections to Fedora 4 (IIIF, RDF, auditing, other advanced features); unified search; integration with ArchivesSpace (ArcLight Hydra project); ORCID support; online exhibitions in Spotlight; video streaming support, HydraDAM for video preservation; DPN or other offsite copy support.

38 Digital Preservation Services: multiple copies; bit preservation; secure storage with managed access; provenance and authenticity assurance; standards compliance; obsolescence monitoring; format migration and emulation services.

39 QUESTIONS? “Not all digital objects are digital assets. Only those which store value and will realise future benefit can be described as assets. Those which won’t are liabilities.” (4C Roadmap, “Investing in Curation: A Shared Path to Sustainability”)

40 Resources: DPN_OR_2014.pdf , https://curate.nd.edu/ , https://scholarsphere.psu.edu/

