Hypatia Hydra Platform for Access to Information in Archives DLF Forum * Baltimore * October 31, 2011 Stanford University Bradley Daigle Julie Meloni Tom.

Slides:



Advertisements
Similar presentations
 Permanent Staff Analyst / Programmers (2.5) Digital Projects Librarian (1) Special Collections Analyst (1) Web Designer / Developer (.5) Director Grant.
Advertisements

October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
Leveraging a Rich Discovery Interface in Open Repository Architectures.
Blacklight at Stanford: A Highly Leveraged, Reusable, Discovery Engine
Digital Repository Development at Yale University Library Michael Dula CTO, Yale University Library December 8, 2014.
Hydra Partners Meeting March 2012 Bill Branan DuraCloud Technical Lead.
Fedora 3.0 and METS: A Partnership for the Organization, Presentation and Preservation of Digital Objects Open Repositories Georgia Tech, Atlanta,
One Body, Many More Heads, One Year Later Open Repositories 2012, Edinburgh.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
One Body, Many Heads for Repository-Powered Library Applications Chris Awre Head of Information Management Library and Learning Innovation University of.
EMu and Archives NA EMu Users Conference – Oct Slide 1 EMu and Archives Experiences from the Canada Science and Technology Museum Corporation.
Digital preservation Hydra Europe, LSE 24 April 2015 Anders Conrad.
Shared IR Project Overview Rick Johnson Lead Project Director Shared IR University of Notre Dame Hydra Connect 2014 January 22, 2014Hydra Connect
Hydra in Hull | Hydra European Symposium | Dublin | 7/8 April 2014 | 1 in Hull Richard Green Hydra Europe Symposium, Dublin, 7-8 April 2014.
US Hydra use overview Hydra Europe Symposium, Trinity College, Dublin, 7 th April 2014 Chris Awre Head of Information Management Library and Learning Innovation.
Issue: Unknown / Unrecognized Filesystems Initial Analysis Extract Metadata Identify Restricted Info Identify Duplicates Generate Reports.
Greg Harris President & CEO We Can Work It Out Establishing the World’s First Rock and Roll Library.
1Hydra Connect 2: Working Group Framework Empowering the Community through a Framework for Interest Groups and Working Groups Robin Ruggaber University.
Hydra from 35,000ft Chris Awre Hydra Europe Symposium London School of Economics, 23 rd April 2015.
Adopting Hydra Making the case and getting going Chris Awre Hydra Europe Symposium London School of Economics, 23 rd April 2015.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
The DSpace Course Module – An introduction to DSpace.
G ET A HEAD ON Y OUR R EPOSITORY Tom Cramer Chief Technology Strategist Stanford University Libraries.
Digital Library Collections (DLC) Website A platform for integrated access to CUL/IS specialized, digital collections September 2014 Status Report.
Copyright 2013 © President & Fellows of Harvard College Digital Forensics at Harvard Business School NE NDSA Lightning Talk, 10 May 2013 Rachel Wise, Baker.
PROJECT HYDRA SNEAK PEAK – ADVANCE SHOWING Brought to you by the Digital Repository Task Force Steve Marine (chair), Ted Baldwin, Dan Gottlieb, Kevin Grace,
Hydra Europe Symposium | April 2015 | 1 Hydra and open access Chris Awre Hydra Europe Symposium London School of Economics, 24 th April 2015.
One Body, Many Heads for Repository-Powered Digital Content Applications Hydra Europe Symposium, Trinity College, Dublin, 7 th April 2014 Chris Awre Head.
Challenges of Digital Media Preservation Karen Cariani, Director Media Library and Archives Dave MacCarn, Chief Technologist.
One Body, Many Heads for Repository-Powered Library Applications Tom Cramer Chief Technology Strategist Stanford University Libraries CNI * 13 December.
Overview of IU Digital Collections Search Hui Zhang Jon Dunn Indiana University Digital Library Program IU Digital Library Brown Bag October 19, 2011.
Archivists' Toolkit - CRADLE Presentation, 10 Feb The Archivists’ Toolkit CRADLE Presentation 10 Feb
Archive Engine West Contextualizing Digital Objects with EAD Metadata Jodi Allison-Bunnell, Orbis Cascade Alliance Worthy Martin, Institute for Advanced.
Archivists' Toolkit - CDL Presentation, October 17, 2005 The Archivists’ Toolkit Lee Mandell Brad Westbrook.
Northwestern University Transportation Library Menu Collection.
BUILDING ON COMMON GROUND: EXPLORING THE INTERSECTION OF ARCHIVES AND DATA CURATION Lizzy Rolando & Wendy Hagenmaier 6/3/2015IASSIST 2015.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
The Seaside Research Portal: A Best of Breed Approach to Digital Exhibits and Collection Management Rick Johnson, Head of Digital Library Services University.
Introduction to metadata
May 2, 2013 An introduction to DSpace. Module 1 – An Introduction By the end of this module, you will … Understand what DSpace is, and what it can be.
VIVO and Scholarly Repositories: Synergistic Opportunities.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
G ET A HEAD ON Y OUR R EPOSITORY Tom Cramer Chief Technology Strategist Stanford University Libraries.
Contributions to Archives Infrastructure Bradley Westbrook AT Project Manager.
But I Don't Have Access to Your Server, and My Grad Student Left Last Month! Meeting the Challenges of Research Data Curation via Metadata Juliane Schneider.
Archivists' Toolkit - All Hands Meeting Use Case Method Vernacular technique for modeling user requirements. Tells the story of how a user accomplishes.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
DSpace An Open Source Dynamic Digital Repository Xizi (Cecilia) Cai IS565 Spring 2013 DL Topic Presentation.
Hydra at The Royal Libary Hydra Europe Symposium, Dublin 7. April, 2014 Anders Conrad,
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
Santi Thompson - Metadata Coordinator Annie Wu - Head, Metadata and Bibliographic Services 2013 TCDL Conference Austin, TX.
Data Wrangling: Developing Local Best Practice for Born Digital Metadata Tracy Popp, Digital Preservation Coordinator Ayla Stein, Metadata Librarian University.
Managing live digital content with DuraSpace services Bill Branan PASIG Spring 2015.
Michael Friscia Yale University Library Manager, Digital Library Programming Services.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
Breeda Herlihy, IR Manager, UCC Library. UCC selected DSpace in 2008 Software selection group Staff from Library IT, Computer Centre, Special Collections,
Archivists' Toolkit - All Hands Meeting Scope Both multilevel and single-level description Accommodates description of collections, series, sub-series,
13 July 2005 Archives Hub day conference The Paradigm Project: The University of Oxford & The University of Manchester
Toward a Digital Asset Management Ecosystem at Texas A&M University Libraries An update on developments in document workflows, data modeling, media service,
Creighton Barrett Dalhousie University Archives
Digital Libraries: Planning, Creating, Collaborating, & Reality
Tools for identifying duplicate files and known software files
ASEE 2011 Adriana Popescu Princeton University
Avalon's Role in the Digital Collections Ecosystem
Introduction, Features & Technology
VI-SEEM Data Repository
Library Technology Conference: Building Exhibits
Hydra: a case study Chris Awre
The Bentley Digital Media Library
Archiving and preservation services in the cloud
Presentation transcript:

Hypatia Hydra Platform for Access to Information in Archives DLF Forum * Baltimore * October 31, 2011 Stanford University Bradley Daigle Julie Meloni Tom Cramer Michael Olson Naomi Dushay University of Virginia

Introduction Tom Cramer AIMS Bradley Daigle Born Digital Materials (& Forensics) Michael Olson Hypatia Functional Requirements Michael Olson Data Models & Loading Naomi Dushay Demonstration Julie Meloni & Michael Olson Q&A Discussion In Sum & Looking Forward Tom Cramer

What is Hypatia? Hydra Platform for Access to Information in Archives Repository-powered solution for digital archival materials management, preservation and access One component in a larger (eco)system for archivists Open source software based on Hydra & Fedora The potential nucleus of a larger, sustained, collaborative effort

Origins Outgrowth of AIMS project Leveraging the Hydra project Functional requirements and content from the AIMS partners (Virginia, Hull, Stanford, Yale) Technical Development by Stanford, Virginia & MediaShelf (contract)

What is Hydra? Partners DuraSpace Northwestern University Notre Dame Rock & Roll Hall of Fame Stanford University University of Hull University of Virginia + half dozen more in ramp-up mode “Solution Bundles” IR ETD’s Research Data Video Images Archives  Hypatia Open Access Articles Digitization Workflow Digital Monograph Acquisitions Exhibits Digital Preservation Technology OSS stack featuring Fedora, solr, Ruby on Rails, Blacklight

The Workflow Born Digital Materials Forensic Extraction & Processing Hypatia Repository Object Management & Preservation Arrangement & Description EAD Physical Materials Discovery & Access

Iterations & Enrichment Born Digital Materials Forensic Extraction & Processing Hypatia Repository Object Management & Preservation Arrangement & Description EAD Physical Materials Discovery & Access Two Phase Data Processing: Reprocess for object-level access EAD Enrichment: IDs and URLs for files /containers

Functional Requirements Gathering Created by AIMS Digital Archivists’ January – March 2011 Initial Focus on Arrangement and Description – a tool for Archivists’ Second focus on Discovery and Access

How do we get to Hypatia 1 Archival Description – Encoded Archival Description (EAD) Repository data Collection data (title, extent, …..) Physical and Intellectual arrangement Challenges with EAD Encoding Standards institutional specific Doesn’t scale for born digital archives (100,000 of files)

How do we to Hypatia 2 Archival payload – disk images / files Typically stored on obsolete media Minimal descriptive metadata

The POWER of Digital Forensics Specialized software to help archivists’ preserve provenance by: Migrating data off of legacy at risk media Captures create, modify date, last accessed date Preserves original media file paths, OS and low level formatting? Original applications including fonts that created the data

Digital Forensic Processing Archivists use commercial or open source software to tag large quantities of born digital archival materials Keyword, pattern search to find files that have sensitive information (Health records, Credit Card data, etc.) Bulk edit tagging for restricted files, subject, source media (what disk did the file come from?)

Active Fedora Solrizer =

Content Digital Content Content Digital Content

Atomistic Content Model File(Asset) Exposed Object is_part_of Is_member_of_collection is_member_of

Descriptive Metadata (MODS) DC (Fedora) RELS-EXT (model, parent, …) (Fedora) Content Metadata (Stanford) Rights Metadata (hydra)

EAD Series

EAD Digital Content

Disk Image File

Disk Image File EAD jpg Set Collection Set Disk Image jpg File

EAD Set Collection Disk Image File FTK processing File Set

FTK Output not designed for this: //fo:page-sequence[2][fo:flow/fo:block[text()='Case Information']]/fo:flow/fo:table[1]/ fo:table-body/fo:table-row[3]/fo:table-cell[2]/fo:block/text()

FTK practices vary

Whole (digital) archival management A Case Study: Feigenbaum Papers at Stanford, prior to SALT Previously: -Files on a separate file store -Permissions management & preservation challenges -A distinct index with its own faceted browser (Flamenco) -A separate Drupal site for collection landing page -A separate Mysql db for tags -A separate Finding Aid, stored in Archivists Toolkit = a Nightmare to synchronize, update and migrate

Hypatia Benefits Integrated solution for archival digital objects management & access Granular permissions management for discover, read, edit, and administrate Support for multiple arrangements: physical, logical, archival Enables ongoing processing as resources become available Integrated approach for digital preservation

WIIFM? Open source code base for digital archives management Nucleus for further, community development Forensic toolkit patterns, best practices, Fedora loading scripts Data models for EAD and digital archival objects in Fedora Functional requirements for arrangement, description, discovery and access for digital archives

Next Steps Pilot usage in a archives processing digital materials Another round of development Development of bulk permissions management, arrangement & description Experiment with UI and tools for archivists and for end users

80/20 – 8 Weeks of Development

Connect Demo: Wiki: List: Code: Hydra: AIMS: