Besser--UCLA/Getty Summer Instit-intro 8/6/01 1 UCLA/Getty Summer Institute for Knowledge Sharing (opening) Howard Besser UCLA School of Education & Information.

Slides:



Advertisements
Similar presentations
Current State of Play in Digital Preservation Peter B. Hirtle Cornell University Library Society of American Archivists.
Advertisements

Creating Institutional Repositories Stephen Pinfield.
The Future of Scholarship in the Digital Age: The Role of Institutional Repositories Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
Strategic issues for digital projects... …or, what are we doing here?
Persistent identifiers – an Overview Juha Hakala The National Library of Finland
1. The Digital Library Challenge The Hybrid Library Today’s information resources collections are “hybrid” Combinations of - paper and digital format.
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
OPEN ACCESS PUBLICATION ISSUES FOR NSF OPP Advisory Committee May 30, /24/111 |
1 Building a “Virtual Library Collection” through freely-accessible web sites: ‘Select Web Sites database’ at University of Vermont Wichada SuKantarat.
1 CS 502: Computing Methods for Digital Libraries Lecture 9 Conversion to Digital Formats Anne Kenney, Cornell University Library.
Besser--Planning (Brazil) 31/5/01 1 Planning to Maximize Longevity of Digital Information Howard Besser UCLA School of Education & Information
Perspectives from The Alberta Library Learn, think, CHANGE 2004 Online Learning Symposium November 3, 2004 Zahina Iqbal.
Besser--Dublin Core Metadata 2/14/02 1 Dublin Core Metadata Howard Besser UCLA School of Education & Information
Using Metadata in CONTENTdm Diana Brooking and Allen Maberry Metadata Implementation Group, Univ. of Washington Crossing Organizational Boundaries Oct.
Besser--CNI/JISC 6/16/00 1 Projected Changes: Prospect of digitized movies already has some mourning loss of film (SF Chronicle, 3/5/00)
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
Besser--Digital Longevity 9/2/00 (12/12/99) 1 Planning to Maximize Longevity of Digital Information Howard Besser UCLA School of Education & Information.
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
Besser--ELO 4/6/02 1 Problems of Preserving Electronic Literature Electronic Literature Organization Howard Besser UCLA School of Education & Information.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
Besser, frm OAC, 4/17/00 1 OAC as an example of Special Collections Digitization: the Collection, the Institution, Scholarship, Interoperability, Longevity.
Image Metadata Summary of 4/18/99 NISO/DLF Image Metadata Meeting ( Howard Besser UCLA School of Education & Information.
Archiving the Web: the PANDORA archive at the National Library of Australia Preserving the Present for the Future Copenhagen, June 2001 Warwick Cathro,
WORKFLOWS AND OTHER CONSIDERATIONS FOR DIGITIZATION  Steve Bingo  Processing Archivist Washington State University Libraries  Alex Merrill  Assistant.
Digitization of the Federal Depository Library Program Judith C. Russell Superintendent of Documents & Managing Director, Information Dissemination “Electronic.
 To explain the importance of software configuration management (CM)  To describe key CM activities namely CM planning, change management, version management.
Chapter © 2012 Pearson Education, Inc. Publishing as Prentice Hall.
Mass digitisation? Astrid Verheusen Projectmanager Research & Development Division National library of the Netherlands LIBER-EBLIDA Workshop on Digitisation.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
Configuration Management (CM)
Producción de Sistemas de Información Agosto-Diciembre 2007 Sesión # 8.
A Chicken or An Egg? Planning Your Digital Project Presentation to the Saskatchewan Libraries Conference Digitization 101 Pre-Conference Workshop May 3,
Besser--TextOneZero 5/22/01 1 The New Information Environments: Helping content persist over time Howard Besser UCLA School of Education & Information.
Besser--Colorado Longevity & Policy 6/29/01 1 Digital Longevity: Problems & Policy Colorado Cultural Heritage Collaboration in the Digital Age Howard Besser.
Besser--VALA 2/8/02 1 Moving from Isolated Digital Collections to Interoperable Digital Libraries VALA 2002 Conference Howard Besser UCLA School of Education.
Besser--LITA Dig Imaging Preconference 7/7/00 1 Creating Working Digital Libraries Howard Besser UCLA School of Education & Information
The Canadian Information Network for Research in the Social Sciences and Humanities Tim Au Yeung and Mary Westell Libraries.
Digital Preservation: Current Thinking Anne Gilliland-Swetland Department of Information Studies.
CONTENT DISCOVERY, SERVICES, AND SUSTAINED ACCESS Timothy Cole, William Mischo, Beth Sandore, Sarah Shreeves ~ University of Illinois Library
Besser--ICHIM Milan 9/5/01 1 Preserving Electronic Art: What’s the problem & What can we do about it? Howard Besser UCLA School of Education & Information.
Small steps and lasting impact: making a start with preservation or It’s not all NASA Patricia Sleeman Digital Archives and Repositories University of.
Tsinghua University Library Yang Zhao & Airong Jiang Tsinghua University Library, Beijing China 4 June, 2004 Electronic Thesis and Dissertation System.
Best Practices for Digital Imaging and Metadata Roy Tennant The Library, University of California, Berkeley
INTELLECTUAL RIGHTS AND HISTORIC CORPORA Mark Sandler University of Michigan ICOLC, March, 2003.
Besser--Seybold--Digital Asset Mgmt 8/29/00 1 Digital Asset Management: An Academic View Howard Besser UCLA School of Education & Information
Selene Dalecky March 20, 2007 FDsys: GPO’s Digital Content System.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Digital Preservation across the technologies, strategies, open standards & interoperability aspects including the legal issues Pratik Shrivastava Scientist.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
Institutional Repositories: the DSpace Experience Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
1/16/2016I. Revels Digital Imaging Workshop 1 Selection Considerations For Digital Imaging Projects.
Besser--Moving Image Longevity 3/16/01 1 Moving Image Longevity Howard Besser UCLA School of Education & Information
Institutional Repositories July 2007 Intellectual property management : the DISA experience Dr D Peters DISA: Digital Innovation South Africa.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
The National Digital Information Infrastructure and Preservation Program (NDIIPP) Challenges and Solutions Laura E. Campbell Associate Librarian for Strategic.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
Practical Aspects of Preservation Peter Simpson Development Officer Arts and Humanities Data Service.
Electronic Resources Collection Development Policy : Need and Challenges.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Describe the use of technology in the financial-information management function.
Implementing an Institutional Repository: Part II
UCLA School of Education & Information
Planning Digital Projects Smithsonian Archivists Oct 29, 2001
An Open Archival Repository System for UT Austin
Implementing an Institutional Repository: Part II
How to Implement an Institutional Repository: Part II
Image Metadata Summary of 4/18/99 NISO/DLF Image Metadata Meeting
Presentation transcript:

Besser--UCLA/Getty Summer Instit-intro 8/6/01 1 UCLA/Getty Summer Institute for Knowledge Sharing (opening) Howard Besser UCLA School of Education & Information

Besser--UCLA/Getty Summer Instit-intro 8/6/01 2 UCLA/Getty Summer Institute for Knowledge Sharing- _ Interoperability _ Importance of Standards _ Best Practices for Managing Digital Projects _ Implications of Digital Projects _ Longevity _ From Digital Collections to Digital Libraries & Museums

Besser--UCLA/Getty Summer Instit-intro 8/6/01 3 Key problems we’re facing _ Discovery _ Interoperability- _ Longevity-

Besser--UCLA/Getty Summer Instit-intro 8/6/01 4 Traditional Digital Collection Model DL user search & presentation

Besser--UCLA/Getty Summer Instit-intro 8/6/01 5 Ideal Digital Collection Model DL user search & presentation

Besser--UCLA/Getty Summer Instit-intro 8/6/01 6 For Interoperability Digital Collections Need Standards _ Descriptive Metadata for consistent description _ Discovery Metadata for finding _ Administrative Metadata for viewing and maintaining _ Structural Metadata for navigation _... Terms & Conditions Metadata for controlling access...

Besser--UCLA/Getty Summer Instit-intro 8/6/01 7 Metadata is not just indexing terms _ CBIR attributes used for retrieval on color, shape, texture, etc. _ Structural attributes used for page-turning _ Administrative attributes used for managing a digital work over time _ IPR attributes to limit unauthorized use _ Identification attributes to determine what application software is needed to view a particular digital work _ Can be located anywhere

Besser--UCLA/Getty Summer Instit-intro 8/6/01 8 Why are Standards and Metadata consensus important?  Managing digital files over time  Longevity  Interoperability  Veracity  Recording in a consistent manner  Will give vendors incentive to create applications that support this

Besser--UCLA/Getty Summer Instit-intro 8/6/01 9 Why Standards?  Why do we need standards? – To make information universally available to users – facilitate sharing and interchange of information – To preserve information (make it safe from changes in hardware and software)  Standards only work if communities widely accept them, but they’re necessary for communities to work together

Besser--UCLA/Getty Summer Instit-intro 8/6/01 10 Important Planning Considerations  File Formats  Choosing Interoperable Systems  Adhere to standards  Vendors with large installed base  Refreshing and/or Migration

Besser--UCLA/Getty Summer Instit-intro 8/6/01 11 Key Considerations for Imaging Projects  Image Quality – Archival – Current online delivery  Intellectual Property  Standards – Modular and Layered Architecture – Terminology – Technical imaging information

Besser--UCLA/Getty Summer Instit-intro 8/6/01 12 Best Practices for Managing Digital Projects- _ Who will your users be? _ Best Practices Guidelines _ Workflow and Management Issues

Besser--UCLA/Getty Summer Instit-intro 8/6/01 13 Why are you Managing this Information?  Organizational mission & type  Users  Uses

Besser--UCLA/Getty Summer Instit-intro 8/6/01 14 Scanning Best Practices _ Think about users (and potential users), uses, and type of material/collection _ Scan at the highest quality that does not exceed the likely potential users/uses/material _ Do not let today’s delivery limitations influence your scanning file sizes; understand the difference between digital masters and derivative files used for delivery _ Many documents which appear to be bitonal actually are better represented with greyscale scans _ Include color bar and ruler in the scan _ Use objective measurements to determine scanner settings (do NOT attempt to make the image good on your particular monitor or use image processing to color correct) _ Don’t use lossy compression _ Store in a common (standardized) file format _ Capture as much metadata as is reasonably possible (including metadata about the scanning process itself)

Besser--UCLA/Getty Summer Instit-intro 8/6/01 15 Why Scale is important

Besser--UCLA/Getty Summer Instit-intro 8/6/01 16 Digital Object Behaviors _ Book example

Besser--UCLA/Getty Summer Instit-intro 8/6/01 17 Metadata Standards (from MOA2, now METS) _ Administrative Metadata – for enhancing resource management _ Structural Metadata – for reflecting internal hierarchies and relationships btwn parts _ Raw/Seared/Cooked

Besser--UCLA/Getty Summer Instit-intro 8/6/01 18 More general issues of Digital Projects- _ Workflow and Management Issues _ Implications for the Collection _ Implications for the Institution _ Implications for Scholarship & Interoperability –Digital libraries –Metadata _ Longevity Issues

Besser--UCLA/Getty Summer Instit-intro 8/6/01 19 Workflow and Management Issues- _ Managing multiple image files _ Persistent Identification _ Making your works accessible throughout the Net

Besser--UCLA/Getty Summer Instit-intro 8/6/01 20 Workflow and Tracking Procedures _ Need careful planning _ Procedures for managing many different files at many different stages _ Linking of file versions

Besser--UCLA/Getty Summer Instit-intro 8/6/01 21 The number of variant forms of a work can be enormous  different views of the same object  different scans of the same photo  different resolutions  different compression schemes  different compression ratios  different file storage formats  different details of the same image ...

Image Families

Besser--UCLA/Getty Summer Instit-intro 8/6/01 23 Identification/Provenance  how to deal with different versions (browse, hi-res, medium res) derived from the same scan or different encoding schemes (TIFF, PICT, JFIF)  Vocabulary Standards to express this – VRA Surrogate Categories – CIMI's "Image Elements”

Besser--UCLA/Getty Summer Instit-intro 8/6/01 24 Persistent IDs--the Problem _ Need to separate work ID from work location _ URNs probably won’t be ready until 2003 _ Becomes a business process issue when one organization maintains the resource and another organization references it (ie. licensed from vendors or managed by separate administrative structures)

Besser--UCLA/Getty Summer Instit-intro 8/6/01 25 Making your works accessible throughout the Net _ Open Archives and Metadata Harvesting (DLF/Mellon) _ An administrative and political issue as much as a a technical one

Besser--UCLA/Getty Summer Instit-intro 8/6/01 26 More general issues of Digital Projects- _ Workflow and Management Issues _ Implications for the Collection _ Implications for the Institution _ Implications for Scholarship & Interoperability –Digital libraries –Metadata _ Longevity Issues

Besser--UCLA/Getty Summer Instit-intro 8/6/01 27 Implications for the Collection _ We’re already familiar with Reformatting _ Advantages & Disadvantages of Digitization- _ Protection- _ Unauthorized Use-

Besser--UCLA/Getty Summer Instit-intro 8/6/01 28 Broad Advantages & Disadvantages of Digitization _ Advantages _ good PR _ show off collection _ let people see items without having to needlessly pull them _ Disadvantages _ Can look like Edutainment _ Can commodify the works and make the repository look like it sold prestige to the highest bidder _ Authenticity called into question _ Decontextualization _ Representational problems-

Besser--UCLA/Getty Summer Instit-intro 8/6/01 29 Problems with How Works are Represented _ once a digital work is on the WWW, anyone can physically copy it and use it as they see fit _ often items are seen outside their context _ for images: using the normal method of mounting images on the WWW, the credit line often becomes separated from the image

Besser--UCLA/Getty Summer Instit-intro 8/6/01 30 Don’t advocate strong copyright or protection for the wrong reasons _ the people who find your content valuable are mostly your traditional audiences _ barriers before use will inhibit positive uses of your material _ threat of pursuit after misuse can effectively deter commercial misuse _ you can prevent commercial misuse without strong protection or copyright

Besser--UCLA/Getty Summer Instit-intro 8/6/01 31 Instead of fearing lost income, worry about Unauthorized Use _ Elimination of credit or attribution line (particularly for images) _ Someone else implying ownership _ Maintaining the Integrity of the Work

Besser--UCLA/Getty Summer Instit-intro 8/6/01 32 Protection Methods ^ _ Encrypting or encapsulating the digital file _ Marking the digital image with ownership (visible or not) – _ Image quality –Onscreen quality is far lower than printed quality – /Papers-projects/Projects/Trowbridge/resolution.html

Besser--UCLA/Getty Summer Instit-intro 8/6/01 33 Effect on the Institution _ Creating/Maintaining a WWW site _ Wear and tear on the original _ Handling external requests for Special Collection material _ Increase or decrease in requests to see originals? _ New Audiences _ Implications on the Institution’s Public Image

Besser--UCLA/Getty Summer Instit-intro 8/6/01 34 Scholarship and Interoperability _ Why Digital Libraries need standards for interoperability _ Metadata concerns _ Digitization means New Audiences-

Besser--UCLA/Getty Summer Instit-intro 8/6/01 35 Digitization means New Audiences _ more access for more people _ outreach to new groups _ but new groups have different usability requirements –different user interfaces –different vocabulary –new methods of navigation _ we already have enough differences btwn different institution types (& even within the same type) –MESL results –Organization & indexing reflects the biases of the original intent when records were formed

Besser--UCLA/Getty Summer Instit-intro 8/6/01 36 Serious Longevity Problems _ What we know from prior widespread digital file formats _ Images separating from their metadata _ Inaccessibility of software needed to view an image _ Inability to even decode the file format of an image

Besser--UCLA/Getty Summer Instit-intro 8/6/01 37 The Short Life of Digital Info: Digital Longevity Problems- _ Disappearing Information _ The Viewing Problem _ The Scrambling Problem _ The Inter-relation Problem _ The Custodial Problem _ The Translation Problem

Besser--UCLA/Getty Summer Instit-intro 8/6/01 38 The Viewing Problem  Digital Info requires a whole infrastructure to view it  Each piece of that infrastructure is changing at an incredibly rapid rate  How can we ever hope to deal with all the permutations and combinations

Besser--UCLA/Getty Summer Instit-intro 8/6/01 39 The Scrambling Problem Dangers from:  Compression to ease storage & delivery  Container Architecture to enhance digital commerce

Besser--UCLA/Getty Summer Instit-intro 8/6/01 40 The Inter-relation Problem  -Info is increasingly inter-related to other info  -How do we make our own Info persist when it points to and integrates with Info owned by others?  -What is the boundary of a set of information (or even of a digital object)?

Besser--UCLA/Getty Summer Instit-intro 8/6/01 41 The Custodial Problem  How do we decide what to save?  Who should save it?  How should they save it? – -methods for later access: emulation, migration, etc. – -issues of authenticity and evidence

Besser--UCLA/Getty Summer Instit-intro 8/6/01 42 The Translation Problem  Content translated into new delivery devices changes meaning – -A photo vs. a painting – -If Info is produced originally in digital form in one encoded format, will it be the same when translated into another format? – Behaviors

Besser--UCLA/Getty Summer Instit-intro 8/6/01 43 Pieces of the Solution (1/2)  -We need to insist upon clearly readable standardized ways for digital objects to self- identify their formats  -We should discourage scrambling  -We need to better understand information inter-relates to other Info, and what constitutes “boundaries” of Info objects

Besser--UCLA/Getty Summer Instit-intro 8/6/01 44 Pieces of the Solution (2/2)  -People and organizations wishing to make information persist need guidelines of how to go about doing it  -We need to better understand how translating from one storage or display format to another affects the meaning of a work  -We need to save the “behaviors” of a digital object, not just it’s “contents”

Besser--UCLA/Getty Summer Instit-intro 8/6/01 45 Conceptual Approaches to Digital Preservation _ Refreshing always necessary due to volatility of physical strata –Impact on evidential value _ Migration -- advantages & disadvantages _ Emulation -- advantages & disadvantages

Besser--UCLA/Getty Summer Instit-intro 8/6/01 46 Migration/Refreshing _ Impact on evidential value

Besser--UCLA/Getty Summer Instit-intro 8/6/01 47 Pragmatic Issues _ Save Metadata!!! (Descriptive, Administrative, Structural, …) _ Separate master from delivery _ Consistent file-naming conventions _ Good work-flow _ Develop cooperative long-range plans

Besser--UCLA/Getty Summer Instit-intro 8/6/01 48 Metadata can be the first line of defense  Can tell you – where the file is (if you can’t find the file) – where more info about the file is (if you have the file but most other metadata has become separated) – what the file format is – what the compression scheme is – what application program and version is needed for the file

Besser--UCLA/Getty Summer Instit-intro 8/6/01 49 Groups Working on the Big Longevity Problem  CPA Task Force  Getty “Time & Bits” Conference & Follow-ups  Emulation experiments in US and Europe  NEDLIB, CURL, Michigan  LC-  Mellon-funded E-Journal Archive experiments-  Internet Archive  Long Now

Besser--UCLA/Getty Summer Instit-intro 8/6/01 50 Library to Lead National Effort to Develop Digital Information Infrastructure and Preservation Program (1 of 2) U.S. Congress Provides $100 Million Special Appropriation in Support of Project _ Response to NAS report LC 21: A Digital Strategy for the Library of Congress _ The Library of Congress has been empowered by the U. S. Congress to develop a national program to preserve the burgeoning amounts of digital information, especially materials that are created only in digital formats, to ensure their accessibility for current and future generations. _ "This collaborative strategy will permit the long-term acquisition, storage and preservation of digital materials, that will assure access to the growing electronic historical and cultural record of our nation," said Dr. Billington. "Just as the Congress enabled the Library of Congress to begin the last century by making its printed catalog cards widely available, the Congress has enabled its Library to begin this century by building a digital record and making it available in the information age.”

Besser--UCLA/Getty Summer Instit-intro 8/6/01 51 Library to Lead National Effort to Develop Digital Information Infrastructure and Preservation Program (2 of 2) U.S. Congress Provides $100 Million Special Appropriation in Support of Project _ In December 2000, the 106th Congress appropriated $100 million for this effort, which instructs the Library to spend an initial $25 million to develop and execute a congressionally approved strategic plan for a National Digital Information Infrastructure and Preservation Program. Congress specified that, of this amount, $5 million may be spent during the initial phase for planning as well as the acquisition and preservation of digital information that may otherwise vanish. _ The legislation authorizes as much as $75 million of federal funding to be made available as this amount is matched by nonfederal donations, including in-kind contributions, through March 31, The effect of a government-wide recission of.22 percent in late December was to reduce this pecial appropriation to $99.8 million. _ The Library will consult with federal partners to assess joint planning considerations for shared responsibilities. The Library will also seek participation from the nonfederal sector and will execute its overall strategy in cooperation with the library, creative, publishing, technology and copyright communities in this country and abroad.

Besser--UCLA/Getty Summer Instit-intro 8/6/01 52 Moving from Digital Collections to Digital Libraries/Museums _ What’s the difference? –not experiments –real users –service –longevity _ Recent history of Library Automation-

Besser--UCLA/Getty Summer Instit-intro 8/6/01 53 Ideal Digital Collection Model DL user search & presentation

Besser--UCLA/Getty Summer Instit-intro 8/6/01 54 Developmental Stages _ Experiment with methods _ Build real operational systems _ Build interoperable operational systems _ Make the system useful for users –For DL Initiatives –For OPACs –For I & A Services –For Image Retrieval

Besser--UCLA/Getty Summer Instit-intro 8/6/01 55 One Final Question: Who will collect the digital works of today that should become the Special Collections of tomorrow? _ web sites _ zines _ electronic journals _ listserve and discussions _ drafts of works that later become famous

Besser--UCLA/Getty Summer Instit-intro 8/6/01 56 UCLA/Getty Summer Institute for Knowledge Sharing