Developing PANDORA Mark Corbould Director, IT Business Systems.

Slides:



Advertisements
Similar presentations
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
Advertisements

Digital Repositories: interoperability & common services Closing Remarks Dr Liz Lyon, UKOLN, University of Bath, UK
Pulling it all together… with thanks to Sheila Anderson.
Bibliothèque nationale de France Tallinn, BnF update: production and development priorities in 2015.
The Documentum Team Lance Callaway, Brooke Durbin, Perry Koob, Lorie McMillin, Jennifer Song Missouri University of Science and Technology Rolla, Missouri.
Digital Storage Options: A Transitional Perspective For Small Archives How Do We Provide a Safety Net for Small Archives? John Spencer BMS/Chace JTS 2007.
1. The Digital Library Challenge The Hybrid Library Today’s information resources collections are “hybrid” Combinations of - paper and digital format.
Special collections and digital libraries: a new role for consortia? Dale Flecker Harvard University Library.
WWW Challenges : Supporting Users in Search and Navigation Natasa Milic-Frayling Microsoft Research, Cambridge UK SOFSEM 2004 January 28, 2004.
Moving libraries to Web scale Matt Goldner Product & Technology Advocate 14 June 2011.
PANDORA and Beyond: Managing Web Archiving at the National Library of Australia Digital Preservation Seminar National Library of Australia, 21 November.
PANDORA Australia’s Web Archive Library Science Talks SNL/CERN, September 2004 Paul Koerbin Digital Archiving Branch National Library of Australia
1 Archiving Workflow between a Local Repository and the National Library Archive Experiences from the DiVA Project Eva Müller, Peter Hansson, Uwe Klosa,
Teula Morgan The Adaptable Repository: Swinburne Online Journals.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
Introducing Symposia : “ The digital repository that thinks like a librarian”
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
LIBRARY The library as a digitorium New modes of creation, distribution and access.
Archiving the Web: the PANDORA archive at the National Library of Australia Preserving the Present for the Future Copenhagen, June 2001 Warwick Cathro,
M.A.Doman Model for enabling the delivery of computing as a SERVICE.
Designing Storage Architectures for Preservation Collections Library of Congress, September 17-18, 2007 Preservation and Access Repository Storage Architecture.
Digital Repository Service (DRS) Harvard University Library OIS presented by: Wendy Gogel & Andrea Goethals.
BACKUP/MASTER: Immediate Relief with Disk Backup Presented by W. Curtis Preston VP, Service Development GlassHouse Technologies, Inc.
Digital Asset Management for All? Visualising a Flexible DAMS Solution for Small and Medium Scale Institutions Paul Bevan Llyfrgell Genedlaethol Cymru.
TEAM FOUNDATION SERVER (TFS) By Sunny Niranjana Devi. M.
Svein Arne Brygfjeld National Library of Norway Nordic Web Archive.
Statewide Digitization and the FCLA Digital Archive Priscilla Caplan, Florida Center for Library Automation Statewide Digitization Planners Meeting OCLC,
PeDALS Persistent Digital Archives & Library System Richard Pearce-Moses Deputy Director for Technology & Information Resources Arizona State Library,
Finding a New Way Richard Pearce-Moses Deputy Director for Technology & Information Resources Arizona State Library, Archives and Public Records Using.
Using the SAS® Information Delivery Portal
Johannes Spitzbart Phonogrammarchiv, Austrian Academy of Sciences Österreichische Tage der Digitalen Geisteswissenschaften save the data - workshop on.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
Managing Research Data – The Organisational Challenge at Oxford James A J Wilson Friday 6 th December,
The ECHO DEPository Project A project of the University of Illinois at Urbana-Champaign and OCLC in partnership with the Library of Congress ALA Annual.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
Access Across Time: How the NAA Preserves Digital Records Andrew Wilson Assistant Director, Preservation.
Ms. Irene Onyancha ISTD/Library & Information Management Services United Nations Economic Commission for Africa The Second Session of the Committee on.
FCLA Services Jim Corey Director, FCLA Task Force on the Future of Academic Libraries in Florida 7/19/2010.
The DiVA System: Current Status and Ongoing Development Uwe Klosa Electronic Publishing Centre, Uppsala University, Sweden Eva Müller.
Preserving Digital Culture: Tools & Strategies for Building Web Archives : Tools and Strategies for Building Web Archives Internet Librarian 2009 Tracy.
The Global Video Grid: DigitalWell Update & Plan For SRB Integration Myke Smith, Manager Streaming Media Technologies University of Washington / ResearchChannel.
The Portal to Texas History: Harnessing Technology to Enable Collaboration with Small Museums and Libraries CNI, December 6, 2005 Cathy Nelson Hartman.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
Integrating a Statewide Web Gateway With Digital Collections ______________________ Eric Weig and Beth Kraemer University of Kentucky and KCVL.
Web Archiving at the National Library of Australia Russell Latham Senior Web Archivist, National Library of Australia.
Digital Preservation: Current Thinking Anne Gilliland-Swetland Department of Information Studies.
New Approaches to Content Management Video Archive Appliances: Which tool is right for you? Moderator: Dan McGraw, Seven Dials Media.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
CONTENT DISCOVERY, SERVICES, AND SUSTAINED ACCESS Timothy Cole, William Mischo, Beth Sandore, Sarah Shreeves ~ University of Illinois Library
Planning for Life after OCLC Passport for Cataloging An overview of the new OCLC cataloging service Revised April 2002.
1 A Very Large Digital Library Technology Demonstration William Y. Arms Cornell University.
CLASS Information Management Presented at NOAATECH Conference 2006 Presented by Pat Schafer (CLASS-WV Development Lead)
ALA Institutional Repository Update ALA Archives at the University of Illinois Urbana-Champaign Chris Prom Cara Bertram Denise Rayman.
Digital library infrastructure -- systems Repositories for storing digital resources protect, manage, deliver, and preserve digital resources over time.
Professional Content Management & Production Introduction & Content Related Workflows.
Digital Preservation across the technologies, strategies, open standards & interoperability aspects including the legal issues Pratik Shrivastava Scientist.
Enterprise Solutions Chapter 10 – Enterprise Content Management.
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
 The End to the Means › (According to IBM ) › 03.ibm.com/innovation/us/thesmartercity/in dex_flash.html?cmp=blank&cm=v&csr=chap ter_edu&cr=youtube&ct=usbrv111&cn=agus.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Digital Library Storage Strategies Robert Cartolano, Director Library Information Technology Office November 14, 2008.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
Chang, Wen-Hsi Division Director National Archives Administration, 2011/3/18/16:15-17: TELDAP International Conference.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Working with personal digital archives Susan Thomas Project Manager & Digital Archivist project Manuscripts Matter, Electronica panel London, October.
Joseph JaJa, Mike Smorul, and Sangchul Song
Statewide Digitization and the FCLA Digital Archive
DIGITAL LIBRARY.
Presentation transcript:

Developing PANDORA Mark Corbould Director, IT Business Systems

Context Perceived Wisdom Accessing information from the Internet is like trying to drink from a fire hose –“It can’t be done” –“It will not scale” –“It is too expensive” The goal posts keep moving as authors use the browser feature du jour And the technical challenges are … –Systems/tools for capture, creation, storage, display and access –Metadata support –Access control and rights management –Preservation and ongoing access

Development Strategy Expect the archive to grow exponentially (at least a factor of two each year) Develop PANDORA as a national and potentially distributed infrastructure Develop PANDORA in the context of other collecting strategies, eg electronic deposit and whole of domain web capture Buy not build

What is PANDORA Today? PANDAS –The ILMS of PANDORA –Systems/tools for capture, creation, storage –Metadata support –Access control PANDORA’s Box –The Stacks of PANDORA –Large scale storage supporting ongoing access and long term preservation PANDORA’s Lid –The Reading Room for PANDORA –Controlled public access to the archive using contemporary browsers –Appropriate resource discovery tools

PANDAS Improve workflow efficiency Provide more effective quality assurance tools Develop ability to allow publisher’s to push material into the archive Keep pace with web publishing technology Database-driven services Streaming delivery

PANDORA’s Box The archive is currently approximated 1.5 million objects requiring 150GB of storage … and growing fast The Digital Object Storage System (DOSS) –Large scale storage system for Digital Collections –Initial system configuration provides 5 TB of storage –System can be scaled to 25 TB –PANDORA will migrate to DOSS for the end of July

DOSS Architecture SCSI 80 MBs Ethernet 100 Mbs Fibre Channel 100 MBs Disk Arrays Tape Library DB Server Web Server DOMS Server SAN Switch

PANDORA’s Lid Initial release will go into production by the end of July, and will support Automatically generated title entry pages Access Controls Improved resource discovery –Browse by title –Browse by subject –Full text search –Metadata search And it will look better too!

PANDORA’s Lid futures Better integration with the Library Catalogue Full metadata support Facilitate the research use of the archive though the development of appropriate navigation tools Support more sophisticated rights management Better browser support

Towards a Distributed National Archive PANDORA currently supports distributed collection management and access through a central system The Library in partnership with other agencies will explore “more” distributed models Currently the model being discussed is that of agencies having the choice to maintain local archives and access with a central metadata repository and access portal Two possible architectures have been proposed

Distributed Storage Enhance existing system to allow agencies to have local copies of PANDORA’s Box and their own public access system Can be done in the short term Management is central Gathering may be local or central Archiving is to a local system PANDORA’s Lid provides normal functionality

Distributed PANDORA Each agency would provide local management, gathering, storage and access National metadata repository and access portal may be real or virtual Difficulties –Technology –Cost A packaged hardware and software solution providing “PANDORA Appliance”