RESEARCH TOPICS Web-Interface Performance DTD Extensibility Imaging

Slides:



Advertisements
Similar presentations
E-Content Service Group Virtual Meeting Digital Preservation: How to Get Started.
Advertisements

Issues and approaches to preservation metadata Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
Creating textual resources Printed documents. Content of this session Types of printed documents Methods of capture Some examples.
Delivering textual resources. Overview Getting the text ready – decisions & costs Structures for delivery Full text Marked-up Image and text Indexed How.
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
Digital Collections: Storage and Access Jon Dunn Assistant Director for Technology IU Digital Library Program
Sharpdesk Overview Desktop Composer Search Imaging      
Services Digitisation & Content Management. 600 People – India.
OCLC Online Computer Library Center Microfilmed Newspapers: Selection for Digitization Success ALA June 25, 2006 OCLC Preservation Service Centers.
Client Lunch & Learn (12:15). Association for Information & Image Management Nov Research Scanner Utilization.
These ain’t “Old News”! Creating access to historic newspapers Christine Guenther OCLC Product Manager, Digital Services Preservation Service Centers Bethlehem,
NATIONAL LIBRARY OF MEDICINE PubMed Central Martha Fishel National Library of Medicine CENDI Meeting September 15, 2004.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
Strategic Thinking and Significant Characteristics Hamish James.
JSTOR & OCR - A Case Study Kiffany Francis. What is JSTOR? “JSTOR is a not-for- profit organization with a dual mission to create and maintain a trusted.
Angelika Menne-Haritz The MEX editor - METS and the presentation of digitised archives The MEX editor: METS and the Internet presentation of.
Developing a strategy for quality Ira Revels Digital Project Manager Cornell University Library.
UNIVERSITY OF MACEDONIA ECONOMIC AND SOCIAL SCIENCES Support and Inclusion of students with disabilities at higher education institutions in Montenegroz.
Accessibility of online instructional tools and documents Terrill Thompson ATUS Technology Accessibility Consultant x 2136
Evaluating the use of OCR on a Mobile Device Presented by : Hamed Alharbi Supervisor by :Dr Brett Wilkinson.
The Voice of A Community Chinese Times Digitization Project Ian Song Prepared for the Multicultural Canada Conference
The National Digital Newspaper Program (NDNP) An NEH/LC Collaborative Program Enhancing access to historical newspapers Release: September 2006.
Pemrograman Berbasis WEB XML part 2 -Aurelio Rahmadian- Sumber: w3cschools.com.
European Metadata Initiatives: The METAe Metadata Engine Simon Tanner Higher Education Digitisation Service
2 pt 3 pt 4 pt 5pt 1 pt 2 pt 3 pt 4 pt 5 pt 1 pt 2pt 3 pt 4pt 5 pt 1pt 2pt 3 pt 4 pt 5 pt 1 pt 2 pt 3 pt 4pt 5 pt 1pt Terms 2 Terms 3 Terms 4 Terms 5 Terms.
Statewide Digitization and the FCLA Digital Archive Priscilla Caplan, Florida Center for Library Automation Statewide Digitization Planners Meeting OCLC,
Digital Reformatting of Text Aaron Choate Digital Library Production Services The University of Texas Libraries.
1 CS 502: Computing Methods for Digital Libraries Lecture 4 Text.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
MSS Technologies and the AIIM Grand Canyon Chapter present: Electronic Document Management System Needs Analysis.
Erin Kinney, Wyoming State Library. Motivation #1 priority that came out of 2004 statewide digitization meeting WSL received many reference questions,
Digitization of the Federal Depository Library Program Judith C. Russell Superintendent of Documents & Managing Director, Information Dissemination “Electronic.
ALI Digital Library Workshop Creating Digital Content: Digitization Jenn Riley Digital Media Specialist IU Digital Library Program
Planning a digital library How to Build a Digital Library Ian H. Witten and David Bainbridge.
CHAPTER FIVE TEXT.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
2002 September -- ejk/UF RESEARCH TOPICS Web-Interface Performance DTD Extensibility Imaging Distillation Other topics?
Mark Sullivan Digital Library of the Caribbean. Imaging  Imaging Theory & Specifications  Recommended Equipment and Software 2 dLOC Training (7/29/2013)
1 Helping communities access and explore their newspaper heritage. Rose Holley – Manager Newspaper Digitisation Program
TECHNOLOGY SUPPORT FOR ESSSS Progress, Issues, and Challenges Marshall Breeding Director for Innovative Technology and Research Vanderbilt University Library.
1 Using Digital Technologies to unlock history for researchers. Rose Holley – Manager Newspaper Digitisation Program Australian Academy of the Humanities.
Integrating a Statewide Web Gateway With Digital Collections ______________________ Eric Weig and Beth Kraemer University of Kentucky and KCVL.
Planning a digital library How to Build a Digital Library Ian H. Witten and David Bainbridge.
UVa's Digital Library CSG - September 2005 Slides courtesy of: Leslie Johnston Director, Digital Access Services, UVA Library Tim Sigmon University of.
1 Bridging the gap between the paper past and digital future.
Group 3: Art Gallery Monica Almendarez Content/Project Manager Willliam Egle Technology Manager Christina Pié Usability/ADA Compliance Manager Mirjana.
University of Florida Digital Collections.
An exercise in preservation and applied technology Making an Electronic Text.
Nikola Tesla Museum Clipping Library Saša Malkov Nenad Mitić Žarko Mijajlović 3 rd SEEDI Int.Conf. Cetinje, Montenegro 14. September 2007.
Document Computing Technologies for Managing Electronic Document Collections Ross Wilkinson... [et al.] Circulation Counter [RES3H] ZA4080.D
The Century Archive Project “CAP” Technology-Independent Information Storage Steven H. McCown & Michael Leonhardt Storage Technology Corporation 4 April.
Scanners. Using a Scanner Scanners are used to digitize any flat object. Several types of scanners- flatbed, sheet fed, handheld, film. Most common is.
The Future of Scholarly Communication & the Role of Libraries Roy Tennant eScholarship, The California Digital Library.
Laurie N. Taylor Lourdes Santamaría-Wheeler The Basics of Digitizing Collections.
Delivering textual and visual resources. Overview Case studies Methods for providing access Structures for delivery Full text Marked-up Image and text.
1 THE AUSTRALIAN NEWSPAPERS DIGITISATION PROGRAM (NDP) Rose Holley – Manager Newspaper Digitisation Program Presentation for Spydus 31 October 2007, NLA,
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Section 4.1 Section 4.2 Format HTML tags Identify HTML guidelines
MSU Libraries’ Course Materials Program:
Content-level intellectual control for digital archives
Digitisation in academic libraries: Experience from Makerere University Library, Kampala Uganda By Patrick Sekikome Presented at the CERN-UNESCO School.
Statewide Digitization and the FCLA Digital Archive
Digital Archival Management Solution (DAMS)
Accessible Documents: The journey so far
University of Florida Digital Collections
Terms 1 Terms 2 Terms 3 Terms 4 Terms 5 1pt 1 pt 1 pt 1pt 1 pt 2 pt
My Program Session Title
CROWLEY & NEXUS IMAGING SOLUTIONS
Current Challenges in Digitization
Presentation transcript:

RESEARCH TOPICS Web-Interface Performance DTD Extensibility Imaging Distillation Other topics? 2002 September -- ejk/UF

CONTEXT Image Only Pilots Australian Periodical Publications, 1840-1845 National Library of New Zealand. Papers Past Image & Indexing/Tagging Pilot University of Florida. Caribbean Newspaper Imaging Project University of Florida. Florida Newspaper Project Image & OCR Pilots Lambrakis Press Archives ProQuest. Historical Newspapers™ TIDEN Project : a Nordic Digital Newspaper Library Olive Software Pilot The British Library 2002 September -- ejk/UF

WEB-INTERFACE PERFORMANACE Primary Purpose: Characterize the bias of individuals conducting study Products: How to use ActivePaperTM to Your Advantage Integration with CONTENTdm, XPAT 5.0, other Alternate deliverable images Centralized service – Distributed content – Variable platforms 2002 September -- ejk/UF

DTD EXTENSIBILITY Primary Purpose: Assess the XML against established newspaper uses Products: How to use ActivePaperTM to Your Advantage Document the XML as a public DTD Establish a maintenance authority Provide for extension of the DTD Automation for extended tagging How to construct a style sheet Integration with CONTENTdm, XPAT 5.0, other Define issues per the Economic Model 2002 September -- ejk/UF

IMAGING Directory Structure and File Naming Archival Formats Optimized Imaging 2002 September -- ejk/UF

IMAGING: Directory Structure and File Naming Primary Purpose: Recommended practices Products: Methods for dealing with anomalies Automated name capture during imaging 2002 September -- ejk/UF

IMAGING: Archival Formats Primary Purpose: Description of file formats & their characteristics for archive, distillation, and distribution Products: Preservation metadata Anticipate migration Schedule & fee structure for inspection & migration Strategy for format migrations & emulation 2002 September -- ejk/UF

IMAGING: Optimized Imaging Primary Purpose: Best practices for microfilming and digitizing (quantitative assessments) Film reduction ratio Evenness of illumination on film Film background density Quality Index & DPI/PPI Skew Color-space & Bit-depth Image density/black & white points Despeckling and Sharpening Image restoration methods 2002 September -- ejk/UF

IMAGING: Optimized Imaging Environments: Operating System Scanning Hardware Lighting and Light Filtration Post-processing Other? Other Products: Control target for OCR assessment Revision: RLG Preservation Microfilming Guidelines 2002 September -- ejk/UF

DISTILLATION Document Zoning Optical Character Recognition 2002 September -- ejk/UF

DISTILLATION: Document Zoning Primary Purpose: Confirm assumptions re: document zoning OCR has difficulty processing large letters Smaller zone yield more accurate text Products: Establish reference to the . . . PDF (fully scaled) TIFF Other derivative file formats (fully scaled) 2002 September -- ejk/UF

DISTILLATION: OCR Primary Purpose: Provide quantitative OCR accuracy information Areas of Investigation: Distillation Source Images Language and Fonts Column & Line Density Relative Density/Contrast Text Curvature and Other Defects 2002 September -- ejk/UF

DISTILLATION: OCR Distillation Source Images Primary Purpose: Predict accuracy contingent upon source document (printing technologies & filming standards) Test-Set Characterization: Source type (newspaper or microfilm) Production date (technologies & standards used) Additional Products: Best practices Accuracy : Cost – Matrix 2002 September -- ejk/UF

DISTILLATION: OCR Language and Fonts Primary Purpose: Demonstrate ability to distill languages, character sets & fonts Test-Set Characterization: Language & character set groups Font face & font size groups Regional variant spellings Additional Products: Olive Software Speaks Your Language How Olive Software Learns Your Lingo Stylized text recognition & distillation guide 2002 September -- ejk/UF

DISTILLATION: OCR Column & Line Density Primary Purpose: Demonstrate ability to distill compact text Test-Set Characterization: Pre-1900 newspapers Advertisement pages Pages predominantly 8 pt. type or less Pages with less than 1 mm space between lines Pages with characters spaced at or below ⅓ mm 2002 September -- ejk/UF

DISTILLATION: OCR Relative Density/Contrast Primary Purpose: Investigate low and uneven contrast materials Test-Set Characterization: Low contrast pages Pages with low contrast zones Printing, Filming, & Age/Storage Defects Additional Products: Best practices Accuracy : Cost – Matrix Don’t forget to buy the Life Insurance 2002 September -- ejk/UF

DISTILLATION: OCR Text Curvature and Other Defects Primary Purpose: Benchmark current capability to distill curved text & other defects of printing or filming Test-Set Characterization: Curved text zones Broken character zones Broken line zones Garbage elements (stains, etc.) Additional Products: (Additional automatic image correction processes) 2002 September -- ejk/UF