Presentation is loading. Please wait.

Presentation is loading. Please wait.

Technology Bootcamp January 18, 2014 Large-Scale Digital Libraries Digitization Process Krystyna K. Matusiak, Ph.D. Assistant Professor Library & Information.

Similar presentations


Presentation on theme: "Technology Bootcamp January 18, 2014 Large-Scale Digital Libraries Digitization Process Krystyna K. Matusiak, Ph.D. Assistant Professor Library & Information."— Presentation transcript:

1 Technology Bootcamp January 18, 2014 Large-Scale Digital Libraries Digitization Process Krystyna K. Matusiak, Ph.D. Assistant Professor Library & Information Science Program

2 Overview Large-scale digital libraries (DLs)  The National Science Digital Library (NSDL)  HathiTrust  Europeana  The Digital Public Library of America (DPLA) Digitization as a conversion process Fundamental questions  What?  Why?  How? Digitization as a multi-step process Digitization standards and guidelines  The notion of archival master files and derivatives  Image capture: technical factors Digitization technology Overview of Digitization2

3 LARGE-SCALE DIGITAL LIBRARIES Overview of Digitization3

4 Large-Scale Digital Libraries Massive aggregations of scientific and cultural heritage content with millions of digital objects  Offer a new centralized approach to providing access to scientific and cultural materials  Aggregate content (or metadata) from smaller individual DLs and provide portals for global searching and retrieval  Address the limitations of the resource discovery in the DL environment  Build upon over two decades of extensive digitization efforts Types of content  Born-digital  Digitized Overview of Digitization4

5 Large-Scale Digital Libraries Sources of content  Local digitization: Individual DLs created by academic and public libraries, archives, historical societies, and other cultural heritage and research organizations  Mass digitization: Google Book Project; Open Content Alliance Information ecosystem – multilayered trusted networks Models  Distributed (DPLA, Europeana, NSDL)  Centralized (HathiTrust) Coverage Goals  Expanding access  Supporting digital preservation Overview of Digitization5

6 The National Science Digital Library (NSDL) http://nsdl.org/ http://nsdl.org/ 6

7 HathiTrust http://www.hathitrust.org/ http://www.hathitrust.org/ 7

8 Europeana http://www.europeana.eu/portal/ http://www.europeana.eu/portal/ 8

9 The Digital Public Library of America (DPLA) http://dp.la/ http://dp.la/ 9

10 DIGITIZATION PROCESS How have we created this critical mass of digitized content? Overview of Digitization10

11 Overview of Digitization11 Digitization is a process of conversion of analog information into a digital format through scanning or digital photography. It is a multi-step process that involves selection, image capture, creation of descriptive and technical metadata, and digital preservation of the objects created as a result of the conversion process.

12 Basic Digitization Workflow Digitization is More than Scanning Digitization Overview12 Selection Image capture Digital processing Indexing and metadata Ingesting Preservation and maintenance

13 What? Manuscripts * Books *Journals *Maps Overview of Digitization13

14 What? Archival Materials Overview of Digitization14

15 What? Cultural Heritage Materials on Tape and Film Overview of Digitization15

16 Why? Expand access – 24/7 Provide access to unique primary sources held in local archives Extend search capabilities of digital text Improve resource discovery Provide access to high-resolution images Integrate resources in multiple modes of representation Bring together dispersed collections Assist preservation and conversation efforts Overview of Digitization16

17 How? General Guidelines Digitize at the highest resolution appropriate to the nature of the source material  Avoid rescanning and handling of the originals in the future Create digital objects that are accessible and interoperable across platforms and devices  High-quality  Consistent  Authentic Produce digital objects that support the intended current and future use  Build a repository of digital master files to facilitate reprocessing and maintaining digital collections over time  Provide derivative access files for current use Create backup copies of all files on servers and have an off- site backup strategy Overview of Digitization17

18 Digital Master Files Created as a direct result of the image capture process either through scanning or photographing with a digital camera Should represent the visual information of the original material Serve as a long term archival file and a source for derivative images  Digital masters are not used for online delivery or print output General recommendations for digital master file creation include :  Scanning at the highest quality affordable  No compression or lossless compression  Non-proprietary archival formats TIFF – text or still images WAV – audio AVI or Motion JPEG 2000 or MXF – moving images * * Unlike text, still image, or audio, there is no archival file format that has been definitively established for moving images Overview of Digitization18

19 Examples  Photographic print 5x7 in. scanned in RGB mode at 600 ppi → 35 MB TIFF file, e.g. kw000010.tif  Large map 63 x 56 cm. (24. 8 x 22 in.) scanned in RGB mode at 300 ppi → 185 MB TIFF file, e.g. am001385.tif  Monograph page 23 cm (approx. 9 in.)scanned in RGB mode at 400 ppi → 25 MB TIFF file, e.g. 001_Front cover.tif Digital Masters

20 Derivative Files Created from digital master files for specific use including  Access images for digital collections or other types of Web delivery  User requests  High resolution prints General recommendations for derivative files :  Reduce the resolution depending on the intended use  72 dpi or 96 dpi for Web access  300 dpi for print output or for high-resolution viewers  Compress files to reduce their size  Select appropriate access formats PDF – text JPEG or JPEG 2000 - still images MP3 – audio MPEG-4 (MP4) or QuickTime or Real Video – moving images Overview of Digitization20

21 Image Capture Technical Factors Mode of capture  Bitonal — one bit per pixel representing black and white  Grayscale — multiple bits per pixel representing shades of gray  RGB (red-green-blue) — multiple bits per pixel representing color File formats  Tiff  JPEG  JPEG2000  RAW and DNG No compression Compression  Lossless  Lossy 21

22 Image Capture Technical Factors Resolution (ppi – pixels per inch; dpi – dots per inch)  An image 1500 x 2100 pixels displayed at 100 ppi = ? in.  The same image 1500 x 2100 pixels displayed at 300 ppi = ? in Bit depth  The number of bits used to represent each pixel determines how many colors can appear in a digital image 22 Source: BCR’s CDP Digital Imaging Best Practices.

23 Digital Masters – Photographs and Text Scanning Specifications Source: Wisconsin Heritage Online Digital Imaging Guidelines (2009). Version 2.0. Original materialScanning resolution Bit depthApproximate scanned dimensions Approx. size of preview image Photographs 16” x 20” +200 ppi24-bit color6400 x 8000 pixels146 MB 8 ½” x 11”–16” x 20”300 ppi24-bit color3200 x 4000 pixels36 MB 8” x 10”400 ppi24-bit color3200 x 4000 pixels36 MB 5” x 7”625 ppi24-bit color3200 x 4000 pixels36 MB 4” x 5”800 ppi24-bit color3200 x 4000 pixels36 MB 4” x 2 ½”1200 ppi24-bit color3200 x 4000 pixels36 MB Text Print—no images600 ppi1-bit bitonalVaries Print—with images300 ppi8-bit grayscale or 24-bit color Varies Manuscript400 ppi8-bit grayscale or 24-bit color Varies

24 Scanners Source materials in a variety of formats require versatile scanning equipment  Photographs (reflective and transparent materials)  Photographic prints → flatbed scanners  Film negatives and slides → film scanners, flatbed scanners with transparency adapters  Text (reflective materials)  Single leaf documents → flatbed scanners, sheet-fed scanners  Bound materials → overhead scanners or digital cameras  Oversize materials (reflective materials)  Maps, charts, etc. → large format scanners or digital cameras  Microfilm (transparent)  Newspapers → microfilm scanners Overview of Digitization24

25 Book scanner for Book, Oversized Prints and Maps DSLR for Oversized Prints, Maps, Scrolls, and 3D objects Film and Slide scanner Film and Slide Scanner with auto-feeder Flatbed scanner for Prints, Glass, and Transparent objects Video conversion Audio conversion Large format scanner for maps and oversized materials 25Overview of Digitization

26 Resources General Digitization Guides, Standards, and Best Practices Association for Library Collections & Technical Services (ALCTS). (2013). Minimum Digitization Capture Recommendations. http://www.ala.org/alcts/resources/preserv/minimum-digitization- capture-recommendationshttp://www.ala.org/alcts/resources/preserv/minimum-digitization- capture-recommendations BCR’s CDP Digital Imaging Best Practices (2008). [updated version of Western States Digital Imaging Best Practices] BCR CDP Digital Imaging Best Practices_2008.pdf Besser, Howard. Introduction to Imaging, Revised Edition (2003). The J. Paul Getty Trust. This book is free as a downloadable PDF. http://www.getty.edu/research/conducting_research/standards/introimages/ A Framework of Guidance for Building Good Digital Collections. 3rd Edition (2007). NISO Framework Advisory Group. http://www.niso.org/publications/rp/framework3.pdf Handbook for Digital Projects: A Management Tool for Preservation and Access. (2000). Northeast Document Conservation Center. http://www.nedcc.org/oldnedccsite/digital/dman.pdf Moving Theory into Practice: Digital Imaging Tutorial. (2000). Cornell University Library/Research Department. http://www.library.cornell.edu/preservation/tutorial/contents.html The NINCH Guide to Good Practice in the Digital Representation and Management of Cultural Heritage Materials. (2002). The National Initiative for a Networked Cultural Heritage (NINCH). http://www.nyu.edu/its/humanities/ninchguide/BCR CDP Digital Imaging Best Practices_2008.pdf http://www.getty.edu/research/conducting_research/standards/introimages/http://www.niso.org/publications/rp/framework3.pdfhttp://www.nedcc.org/oldnedccsite/digital/dman.pdfhttp://www.library.cornell.edu/preservation/tutorial/contents.htmlhttp://www.nyu.edu/its/humanities/ninchguide/ 26Overview of Digitization


Download ppt "Technology Bootcamp January 18, 2014 Large-Scale Digital Libraries Digitization Process Krystyna K. Matusiak, Ph.D. Assistant Professor Library & Information."

Similar presentations


Ads by Google