Digital Archives at the National Library of Medicine A presentation at the MLA Session Lighting the Path: Digital Repositories in the Real World May 24,

Slides:



Advertisements
Similar presentations
NATIONAL LIBRARY OF MEDICINE PubMed Central Edwin Sequeira National Library of Medicine May 26, 2004.
Advertisements

PubMed/How to Search, Display, Download & (module 4.1)
PubMed/History; Accessing Full-Text Articles (module 4.4)
EndNote Web Reference Management Software (module 5.1)
EndNote Web Reference Management Software (module 5)
The results for this search are displayed in the Summary format with a total of 3808 citations.
History Study Center Primary and secondary sources documenting global history 2010.
Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
Catherine Worrall Slide Library Co-ordinator, University College Falmouth.
NIH Public Access Compliance Cleveland Health Sciences Library Case Western Reserve University Kathleen C. Blazar.
Bruce Johnson Library of Congress, Cataloging Distribution Service, 2008 After a presentation by: Anna Martin, Union Catalogue Project, Cambridge University,
Metadata Descriptions statements descriptions records.
PubMed Central Mahyar Ahmadpour-B. Kowsar Publicatin Corp. Kowsar Editorial Meeting 1 September 19th, 2013 Tehran, Iran.
What are your publication options? Laura Happe & Meg Franklin.
Single Search By Rakphao Theppan, librarian Searching Online Resources.
PubMed Central Update Mark R. Desierto MLA Conference May 2007.
NATIONAL LIBRARY OF MEDICINE PubMed Central Martha Fishel National Library of Medicine CENDI Meeting September 15, 2004.
Resource Discovery Module DigiTool Version 3.0. Resource Discovery 2 Deposit Approval Search & Index Dispatcher & Viewers Single & Bulk Web Services DigiTool.
Kristin Eberle Monica Hampton Carmen Velasquez Kristin Eberle Monica Hampton Carmen Velasquez Knowledge Management.
Information & Library Services SwetsWise User Guide Emma Crowley Senior Academic Services Librarian
1 CS 502: Computing Methods for Digital Libraries Lecture 17 Descriptive Metadata: Dublin Core.
Integrating Resources: the Cataloging of Chameleons Judith A. Kuhagen Cataloging Policy & Support Office Library of Congress Washington, D.C. U.S.A. Hong.
© University of Reading October 2009 CentAUR Central Archive at the University of Reading Introduction for ‘early adopters’ Alison.
LSTA Digital Imaging Grants Presentation Projects Workshop September 13, 2002 Wendy Sistrunk Music Catalog Librarian University of Missouri—Kansas City.
Release 4 of the COUNTER Code of Practice for e- Resources and new usage- based measures of impact Peter Shepherd COUNTER May 2014.
Progress in Access Technologies: NLM Video Search Jennifer Marill Chief, Technical Services Division Edward Luczak Systems Architect, Office of Computer.
1 NIH Public Access Policy Policy on Enhancing Public Access to Archived Publications Resulting From NIH-Funded Research (Public Access Policy)
PubMed/How to Search, Display, Download & (module 4.1)
NLM-Semantic Medline Data Science Data Publication Commons Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data.
DUBLIN CORE: BEYOND THE LIBRARY David Hirsch LIS Knowledge Organization Dr. Selenay Aytac Spring 2013.
Publishing Digital Content to a LOR Publishing Digital Content to a LOR 1.
EBSCOadmin. Select Change Password Select EBSCOadmin Security.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
PubMed/How to Search, Display, Download & (module 4.1)
OCLC Online Computer Library Center Kathy Kie December 2007 OCLC Cataloging & Metadata Services an introduction.
PubMed Overview From the HINARI Content page, we can access PubMed by clicking on Search inside HINARI full-text using PubMed. Note: If you do not properly.
PubMed/How to Search, Display, Download & (module 4.1)
EndNote X4 (14) Tutorial Medical Center Library Frank Davis, MSLS Research & Education Division Updated
Library needs and workflows Diane Boehr Head of Cataloging National Library of Medicine, NIH, DHHS
National Sea Grant Library The New Library System and Publication Submittals Communications Staff Tutorial October 2014 National Sea Grant Library The.
Digitization An Introduction to Digitization Projects and to Using the Montana Memory Project.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Planning for Life after OCLC Passport for Cataloging An overview of the new OCLC cataloging service Revised April 2002.
Introduction to metadata
Connexion Comparison Client or Browser? Fran Juergensmeyer Waukegan Public Library 2 nd Annual WILIUG Conference June 16, 2006 Cataloging from A (Authority)
EndNote. What is EndNote? EndNote is referencing software that enables you to create a database of references from your readings.
MARCIt records for e-journals project to implement MARCIt service McGill University Library Feb
PAN-European Exploitation of the Results of the Libraries Programme - EXPLOIT German Libraries Institute Berlin EXPLOIT 1 Electronic library materials.
Journal Searching Nancy B. Clark, M.Ed. Director of Medical Informatics Education FSU College of Medicine 1 All recourses are available online in Medical.
Welcome to de Gruyter Reference Global. De Gruyter Reference Global provides you with comprehensive access to high quality academic content Run a quick.
Tutorial support.ebsco.com Core Collections Complete.
UoS Libraries 2011 EndNote X5 - basic graduate session.
Collecting History: Profiles in Science Alexa T. McCray National Library of Medicine Bethesda, MD Stanford University August 21, 1999.
IN THE NAME OF GOD. Reference Citing Software.
The NLM Catalog 2005 MLA Annual Conference Diane Boehr National Library of Medicine National Institutes of Health U.S. Dept. of Health and Human Services.
PubMed …featuring more than 20 million citations for biomedical literature from MEDLINE, life science journals, and online books.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Session 2 Tools and Decisions. 2-2 Session 2 1. What tools are available to help you catalog IR’s? 2. What decisions need to be made?
An Application Profile and Prototype Metadata Management System for Licensed Electronic Resources Adam Chandler Information Technology Librarian Central.
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
MEDLINE®/PubMed® PubMed for Trainers, Fall 2015 U.S. National Library of Medicine (NLM) and NLM Training Center An introduction.
A Bibliographic Management Software NORSHUHADA SAIDIN REFERENCE & RESEARCH DIVISION PERPUSTAKAAN KEJURUTERAAN UNIVERSITI SAINS MALAYSIA.
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Metadata Editor Introduction
Cataloging the Internet
IDEALS at the University Of Illinois: A Case Study of Integration Between an IR and Library Discovery Systems Sarah L. Shreeves University of Illinois.
ISI Web of Knowledge update: April 2009
Lívia Vasas, PhD 2018 The Nation Library of Medicine and its databases Mozilla Firefox or Google Chrome Lívia Vasas, PhD.
The National Library of Medicine and its databases
Presentation transcript:

Digital Archives at the National Library of Medicine A presentation at the MLA Session Lighting the Path: Digital Repositories in the Real World May 24, 2004 by Diane Boehr Cataloging Unit Head, National Library of Medicine, National Institutes of Health, Health & Human Services

Scope Historical medical works Historical medical works The NLM Archive The NLM Archive PubMed Central PubMed Central

Considerations as you begin a project It will take much longer than you anticipate It will take much longer than you anticipate You will learn a great deal about topics outside your normal work duties You will learn a great deal about topics outside your normal work duties Be willing to take baby steps and make a start Be willing to take baby steps and make a start It is very rewarding to see the fruits of your labor It is very rewarding to see the fruits of your labor

HMD Projects Historical Anatomies Historical Anatomies Medicine in the Americas Medicine in the Americas

Historical Anatomies alanatomies/home.html alanatomies/home.html Provides high-resolution downloadable scans of selected important images from illustrated anatomical atlases dating from the 15th to the 20th century Provides high-resolution downloadable scans of selected important images from illustrated anatomical atlases dating from the 15th to the 20th century Titles and images selected by Michael North, Head of Rare Books and Early Manuscripts Titles and images selected by Michael North, Head of Rare Books and Early Manuscripts

Historical Anatomies Consists of large JPEGs and zoomable digitized images from the books and a brief bibliographical and historical introduction to each title Consists of large JPEGs and zoomable digitized images from the books and a brief bibliographical and historical introduction to each title

Technical details The imaging for this project is contracted out The imaging for this project is contracted out The contractor makes archival quality TIFF files (800 ppi resolution) and from that, thumbnail and JPEG images are made for the site, using Adobe Photoshop The contractor makes archival quality TIFF files (800 ppi resolution) and from that, thumbnail and JPEG images are made for the site, using Adobe Photoshop Zoomifyer Pro is used to create the pan and zoom images Zoomifyer Pro is used to create the pan and zoom images The TIFF files are backed up on CD-ROMs The TIFF files are backed up on CD-ROMs

Search and retrieval Individual images do not have any metadata associated with them at this time Individual images do not have any metadata associated with them at this time Bibliographic citations on the site match the LocatorPlus records Bibliographic citations on the site match the LocatorPlus records As the focus of the site is selected individual images from the books, rather than the entire text, there are currently no links from the LocatorPlus records for the individual titles to images on the Web site As the focus of the site is selected individual images from the books, rather than the entire text, there are currently no links from the LocatorPlus records for the individual titles to images on the Web site

Sample screen

Medicine in the Americas Monographic original source materials on the development of medicine in New World published prior to 1914 are being digitized in their entirety Monographic original source materials on the development of medicine in New World published prior to 1914 are being digitized in their entirety ( y.fcgi?db=Books) ( y.fcgi?db=Books)

Technical details Digitizing is being done in-house Digitizing is being done in-house Books are scanned, and from the initial scan a photocopy and a TIFF file are created Books are scanned, and from the initial scan a photocopy and a TIFF file are created Photocopies are scanned to create OCR Word text files, which are then manually reviewed and cleaned up to create a searchable, downloadable PDF text in modern font Photocopies are scanned to create OCR Word text files, which are then manually reviewed and cleaned up to create a searchable, downloadable PDF text in modern font TIFF file is used to create the typeface and layout of the original published work TIFF file is used to create the typeface and layout of the original published work

Technical details Mounting of these texts on the Web and the XML coding of the Word files done using the NLM Bookshelf platform Mounting of these texts on the Web and the XML coding of the Word files done using the NLM Bookshelf platform Bookshelf developed by NCBI for medical texts supplied by publishers in SGML, or other desktop publishing formats Bookshelf developed by NCBI for medical texts supplied by publishers in SGML, or other desktop publishing formats Platform has an existing template that allows the record creators to easily input metadata without needing to know XML Platform has an existing template that allows the record creators to easily input metadata without needing to know XML

Search and Retrieval Bookshelf site only supports keyword searching Bookshelf site only supports keyword searching Standard bibliographic data from LocatorPlus and brief historical data is included with the text Standard bibliographic data from LocatorPlus and brief historical data is included with the text Catalog records have hot links to the Bookshelf site Catalog records have hot links to the Bookshelf site

Timeframes Both projects went from planning to implementation in about one year, although both projects will be adding more material to their sites Both projects went from planning to implementation in about one year, although both projects will be adding more material to their sites Use of standard, off the shelf products or existing technologies made implementation easier Use of standard, off the shelf products or existing technologies made implementation easier

NLM Archives A site to store material of permanent value that has been published on the NLM Web site, but is now outdated or superseded A site to store material of permanent value that has been published on the NLM Web site, but is now outdated or superseded Searchable, yet clearly distinguished from current material Searchable, yet clearly distinguished from current material

What do we mean by permanent? Three aspects to permanence were identified: Three aspects to permanence were identified: 1) Identifier validity: The extent to which the given name or identifier will always provide access to the same resource 1) Identifier validity: The extent to which the given name or identifier will always provide access to the same resource 2) Resource availability: The extent to which a given resource is guaranteed to remain available in electronic form 2) Resource availability: The extent to which a given resource is guaranteed to remain available in electronic form 3) Content invariability: The extent to which the content of the resource could change 3) Content invariability: The extent to which the content of the resource could change

NLM Permanence Ratings Four categories of permanence have been defined: Four categories of permanence have been defined: 1) Permanent, unchanging content: NLM has made a commitment to keep this resource permanently available. Its identifier will always provide access to the resource. Its content will not change. 1) Permanent, unchanging content: NLM has made a commitment to keep this resource permanently available. Its identifier will always provide access to the resource. Its content will not change.

NLM Permanence Ratings 2) Permanent, stable content: NLM has made a commitment to keep this resource permanently available. Its identifier will always provide access to the resource. Its content is subject only to minor corrections or additions. 2) Permanent, stable content: NLM has made a commitment to keep this resource permanently available. Its identifier will always provide access to the resource. Its content is subject only to minor corrections or additions.

NLM Permanence Ratings 3) Permanent, dynamic content: NLM has made a commitment to keep this resource permanently available. Its identifier will always provide access to the resource. Its content could be revised, replaced. 3) Permanent, dynamic content: NLM has made a commitment to keep this resource permanently available. Its identifier will always provide access to the resource. Its content could be revised, replaced.

NLM Permanence Ratings 4) Permanence not guaranteed: NLM has made no commitment to retain this resource. It could become unavailable at any time. Its identifier could be changed. 4) Permanence not guaranteed: NLM has made no commitment to retain this resource. It could become unavailable at any time. Its identifier could be changed.

Workflows Permanence ratings are assigned when a resource is promoted to the NLM Web site Permanence ratings are assigned when a resource is promoted to the NLM Web site Default permanence ratings are generated based on the category to which the resource belongs Default permanence ratings are generated based on the category to which the resource belongs Resource creators use a template which adds basic metadata, in addition to the category and permanence rating Resource creators use a template which adds basic metadata, in addition to the category and permanence rating

Templates Metadata input template is a feature of TeamSite, our Web content management software Metadata input template is a feature of TeamSite, our Web content management software No knowledge of HTML is needed to use these templates No knowledge of HTML is needed to use these templates Minimal set of required fields, with default values or drop-down menus supplied wherever possible Minimal set of required fields, with default values or drop-down menus supplied wherever possible

Required metadata 1) Title 7) Rights 2) Heading 8) Contact 3) Date first published 9) Language 4) Date last modified10) Document category 5) Next scheduled review date 11) Permanence level 6) Publisher12) URL

The NLM metadata set is based on Dublin Core, with some local adaptations The NLM metadata set is based on Dublin Core, with some local adaptations The full scheme may be seen at The full scheme may be seen at lenew.html lenew.html

Workflows Every resource has the minimal metadata assigned by the resource creator Every resource has the minimal metadata assigned by the resource creator Permanent resources are routed to the Cataloging Section Permanent resources are routed to the Cataloging Section Complete MARC bibliographic records are created Complete MARC bibliographic records are created Includes standardized access points, including MeSH and an NLM classification number Includes standardized access points, including MeSH and an NLM classification number Accessible in LocatorPlus Accessible in LocatorPlus Distributed to the utilities and other NLM licensees. Distributed to the utilities and other NLM licensees.

Workflows The enhanced metadata created in Cataloging is then added back to the header information of the online resource The enhanced metadata created in Cataloging is then added back to the header information of the online resource Preliminary metadata and the enhanced versions can be seen by clicking on "View source" Preliminary metadata and the enhanced versions can be seen by clicking on "View source"

Basic metadata

Enhanced metadata

Archive Design Separate, distinct, but integral part of the NLM Web site Separate, distinct, but integral part of the NLM Web site Searchable with standard NLM search software: Mindserver from Recommind Searchable with standard NLM search software: Mindserver from Recommind

Archive contents Out-of-date resources--older material that was once up on the site, but is no longer of current interest Out-of-date resources--older material that was once up on the site, but is no longer of current interest Earlier versions of current documents that have undergone major revisions Earlier versions of current documents that have undergone major revisions

Still to come Archiving non-HTML files, such as PDF, video and audio clips, etc. Archiving non-HTML files, such as PDF, video and audio clips, etc. Archiving resources from areas in the library which do not get promoted through TeamSite Archiving resources from areas in the library which do not get promoted through TeamSite

Impact on Cataloging PubMed Central (PMC) PubMed Central (PMC) A bibliographic record must exist in the NLM catalog before a journal is added to PMC A bibliographic record must exist in the NLM catalog before a journal is added to PMC Records must be created if the title is not already in the catalog Records must be created if the title is not already in the catalog Downloaded from OCLC Downloaded from OCLC Skeletal record created from local template Skeletal record created from local template High-priority, 24 hr. turnaround time High-priority, 24 hr. turnaround time Records are then fully cataloged Records are then fully cataloged

Impact on Cataloging PMC PMC If the title is already in the catalog, holdings must be updated If the title is already in the catalog, holdings must be updated Indicate the title is available in PMC Indicate the title is available in PMC Range of issues Range of issues Any embargo periods Any embargo periods

Impact on Cataloging NLM Archive NLM Archive Cataloger creates core level MARC records for any new resource on the NLM Web site rated Permanent Cataloger creates core level MARC records for any new resource on the NLM Web site rated Permanent View the site, as well as utilize metadata supplied by record creator for descriptive data View the site, as well as utilize metadata supplied by record creator for descriptive data Supply MeSH and NLM classification Supply MeSH and NLM classification Establish authorized name headings in the national authority file Establish authorized name headings in the national authority file Transfer this enhanced metadata back to the resource Transfer this enhanced metadata back to the resource

Impact on Cataloging HMD projects HMD projects Minimal impact on Cataloging Minimal impact on Cataloging Books being digitized already have records in the catalog Books being digitized already have records in the catalog HMD has its own cataloging staff who can make links between existing catalog records and digitized material HMD has its own cataloging staff who can make links between existing catalog records and digitized material

Impact on Cataloging Despite the increased workload, we think archiving projects are enhanced when catalogers are involved in the projects Despite the increased workload, we think archiving projects are enhanced when catalogers are involved in the projects Catalogers increase their knowledge by becoming involved in these projects Catalogers increase their knowledge by becoming involved in these projects