http://lockss.stanford.edu
The LOCKSS Program Asia, September 2004 Libraries in the digital age Collection & preservation for generational access part one The LOCKSS Program Asia, September 2004
What is a Library?
What Is A Library? “Libraries have a central role in society as cultural and educational institutions, and as agents for promoting literacy…[they are] essential for better international understanding, and as such they are a fundamental precondition for peace, human rights, literacy, intellectual freedom, and a better environment for all peoples”. IFLA-- 1991
A library is a MEMORY organization Serves communities, including nations by Collecting documents - cultural artifacts (all genres, formats, medium) Providing intellectual access through interpretation and organization Preserving artifacts for future community members. A library is a MEMORY organization
Web’s Impact on Libraries Libraries now Lease subscription materials Access free materials Libraries unable to Own collections Fill memory role
Do libraries have a future? A set of links is not a library Without local collections, libraries will become "digital concierges” Linking to current digital objects Mostly common homogenous resources
Locally owned digital collections Librarians must Assert societal memory role by Selecting Building Preserving Providing access to Locally owned digital collections
Digital Publications Increasingly are the version of record Often, sole version of record Change rapidly or disappear, with no warning Have adjuncts – e.g., hyperlinks, virtual models Failure to collect digital artifacts will create a growing “dark age” of our times
MOTIVATION http://lockss.stanford.edu
LOCKSS Open source Peer to peer Persistent access preservation system Web delivered information
Paper Library System Libraries act for their institution to Acquire copies of important “stuff” Keep copies on shelves Give access to local readers Libraries cooperate to Supply copies to other libraries a reader can easily to find a copy a “bad guy” has trouble finding and destroying all copies
Paper Library System Libraries ensure content persists simply by supporting their local communities A cooperative, affordable, decentralized, preservation system with LOTS OF COPIES
LOCKSS “Library System” Libraries act for their institution to Acquire copies of important “stuff” Keep copies in transparent web caches Give access to local readers Libraries cooperate to Detect and repair damage a reader can easily find a copy a “bad guy” has trouble finding and destroying all copies
LOCKSS “Library System” Libraries ensure content persists simply by supporting their local communities A cooperative, affordable, decentralized, preservation system with LOTS OF COPIES
LOCKSS Support: Mellon, NSF Software: www.sourceforge.net Teams: Research and Production
Teams Research Production Stanford, Harvard Computer Science, Hewlett Packard, Intel Award winning research 2004: ACM Grand Finals 2nd place Best Paper SOSP Investigating LOCKSS communication Scaling, attack resistance Production Deploys research findings, builds and maintains system, provides user support
Partners 70+ Publishers 100+ Libraries
LIBRARIES and PUBLISHERS
LOCKSS software turns a PC into a persistent web cache into a preservation tool 1 PC holds ~2,500 e-j years 600MHz-128MB RAM-Bootable CD drive-Floppy disk drive
LOCKSS Caches Crawls and collects HTTP content All formats (PDF, HTML, JPEG, TIF, Audio, Video) Preserves content integrity Independent collection Cooperate to audit and repair damage Provides access Via web browser Content is never “dark”
LOCKSS Caches
Approximate Data Flows LOCKSS machines
Approximate Data Flows LOCKSS machines (proxy servers) Prevent the publisher from revoking access rights to back content
Publisher Gives Permission
Bringing Up A LOCKSS Cache Get a static IP address Get a PC (floppy, CD, and lots of storage) Download CD image from LOCKSS web site Burn CD image on CD-R Boot PC from CD and insert blank floppy Follow on-screen instructions and answer configuration questions Wait for network and preservation systems to start up Configure titles for collection & preservation
LOCKSS Allows Publishers to easily and affordably With minimal risk to business model With minimal risk to publishing platform Fulfill requirement to guarantee “perpetual access” to subscribed materials Preserve and archive intellectual property Continue to build partnerships with librarians
LOCKSS allows libraries to easily and affordably Build e- collections Preserve important materials Deliver content to current and future readers when publisher is not available Own rather than lease or access Retain traditional custodial role
http://lockss.stanford.edu End Part One