The Cornell Veterinarian A Metadata Perspective.

Slides:



Advertisements
Similar presentations
How to Get Published European Journal of Human Genetics www. nature
Advertisements

HATHI TRUST A Shared Digital Repository Building A Future By Preserving Our Past The Preservation Infrastructure of HathiTrust Digital Library Jeremy York.
SIG Proceedings Preparation There are two methods used to produce SIG proceedings: –Preferred Vendor preparation service –Conference Leader preparation.
PubMed/Using Limits (module 4.2). Instructions - This part of the: course is a PowerPoint demonstration intended to introduce you to PubMed/Limits. module.
PubMed/How to Search, Display, Download & (module 4.1)
Drinking from a Fire Hose: Keeping up with the Professional Literature Angela Murrell Kresge Library, TSRI
PubMed Central ANCHASL Spring Meeting April 1, 2005 Robert James Associate Director of Public Services Duke University.
PubMed Central Mahyar Ahmadpour-B. Kowsar Publicatin Corp. Kowsar Editorial Meeting 1 September 19th, 2013 Tehran, Iran.
THE SCIENCETHE SEARCHTHE SOLUTION DOIs and the Secondary Publisher; a match made in heaven? Andrea Powell Product Development Director.
Joachim Bauer Senior System Engineer, CCS
HATHITRUST A Shared Digital Repository Big Collections in an Era of Big Copyright: Practical Strategies for Making the Most of Digitized Heritage Jeremy.
NATIONAL LIBRARY OF MEDICINE PubMed Central Martha Fishel National Library of Medicine CENDI Meeting September 15, 2004.
6/15/20151 Opportunities for Collaboration: The HEARTH Project Joy Paulson and Nathan Rupp Cornell University Digital Library Federation Spring Forum New.
Swets Information Services SwetsWise Title Bank 13 th Panhellenic Libraries Conference th October Corfu.
Three Years Later: Lessons Learned from Establishing a Metadata Service Marty Kurth PCC Policy Committee Meeting November 5, 2004.
1 CS 502: Computing Methods for Digital Libraries Lecture 4 Identifiers and Reference Links.
Janet Weber Manager, Publisher Relations OCLC MLAIB Discussion Group MLA & OCLC Update ALA Annual 28 June 2008.
Engaging metadata in HathiTrust to enhance access and discovery: The Cornell Veterinarian Metadata Working Group Forum Project Team: John Cline, Steven.
PubMed/How to Search, Display, Download & (module 4.1)
VALIDATION AND RISK MANAGEMENT FOR SHARED PRINT JOURNAL ARCHIVES: UC JSTOR AND WEST CASES CRL Print Archives Network Meeting ALA Midwinter, Chicago January.
1 CS 430: Information Discovery Lecture 15 Library Catalogs 3.
1 Session 3 Aggregations and Packages What kinds of e-serial aggregations and packages are available? How can libraries provide access to the titles or.
The Big Six Approach to Locating, Evaluating and Sharing the Information You Seek at Bristol Elementary School.
OpenURL: Linking LC’s E-Resources Ardie Bausenbach Automated Planning and Liaison Office Library of Congress November 24, 2003.
N EXT G EN C ATALOG F ORUM Where do we go from here?
Link Resolvers: An Introduction for Reference Librarians Doris Munson Systems/Reference Librarian Eastern Washington University Innovative.
Looking back, moving forward: Examining the impact of digitizing the ACS archive 232nd ACS National Meeting September 13, 2006 David Martinsen, Adam Chesler.
NLM Digital Collections Update for DCFedoraUsersGroup January 22, 2013 John Doyle National Library of Medicine.
Cataloguing Electronic resources Prepared by the Cataloguing Team at Charles Sturt University.
Metadata Considerations Implementing Administrative and Descriptive Metadata for your digital images 1.
ERIC and the WorldCat Registry Lawrence Henry ERIC Program Manager Joanna White WorldCat Registry Product Manager.
OpenURL Link Resolvers 101
Preserving Digital Culture: Tools & Strategies for Building Web Archives : Tools and Strategies for Building Web Archives Internet Librarian 2009 Tracy.
A rticle L icensing I nformation A vailability S ervice IDS Project Information Delivery Services Mark Sullivan Library Systems Administrator SUNY Geneseo.
Library needs and workflows Diane Boehr Head of Cataloging National Library of Medicine, NIH, DHHS
07/11/2002Thomas Baron - JACoW Workshop1 CERN Library Requirements T. Baron CERN ETT-DH-CDS.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
HathiTrust’s Past, Present and Future. Short- and Long-term Functional Objectives Short-term Page turner mechanism (and Mobile!) Branding (overall initiative;
Lifecycle Metadata for Digital Objects September 11, 2002 Major archival and digital library metadata schemes.
Librarians Creating Solutions for Librarians
Research Seminar Series Laura Abate Electronic Resources & Instructional Librarian
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
RDA Toolkit Demonstration. Overview Accessing the Toolkit Navigating the Toolkit Understanding the functionality of the Toolkit Searching the Toolkit.
MARCIt records for e-journals project to implement MARCIt service McGill University Library Feb
IAFS 1000 Case Studies: The Israeli-Palestinian Conflict and the Rwandan Genocide.
CENDI/FLICC Workshop, June 21, 2000 Slide 1 of 24 The Impact of Reference Linking on the Creation and Use of References/Citations CENDI/FLICC Workshop.
Partner Publishers’ Websites From the Partner publisher services dropdown menu, click on the Elsevier Science - Science Direct website. Note that this.
PubMed/How to Search, Display, Download & (module 4.1)
LinkOut Update Medical Library Association Annual Meeting 2005 San Antonio, TX.
IAFS 1000 Imperialism and Decolonization in International Affairs.
TDNet Implementation of ONIX SOH v. 1.1 Enumeration & Chronology Data for e-Journal Coverage ALA Annual Conference – July 2009, Chicago Moshe Efron V.P.
PubMed/How to Search, Display, Download & (module 4.1)
PubMed …featuring more than 20 million citations for biomedical literature from MEDLINE, life science journals, and online books.
HathiTrust: Possibilities Metadata Working Group Cornell University Library March 21, 2014.
E-Journal Usage Data From SFX Enhancing Our Understanding of Full-Text Usage Maribeth Manoff University of Tennessee Libraries ELUNA 2 nd Annual Meeting.
AMERICAN INSTITUTE OF PHYSICS URL:
Jason W. Karl, Ph.D. Jeffrey K. Gillan Jason W. Karl, Ph.D. Jeffrey K. Gillan 23 October 2013 Ty Montgomery Richard Bliss Ty Montgomery Richard Bliss
Remote Data Sources in Primo Ebsco API WorldCat API Local Content.
HathiTrust--a GovDocs Repository? Brian Vetruba, Catalog Librarian/Germanic Studies Librarian Washington University in St. Louis Leveraging.
PubMed Database Interface (Basic Course Module 4).
Bielefeld Academic Search Engine
Development of infographic Service for journals and articles
Summon discovers contents from one search box!
PubMed Database Interface (Basic Course Module 4 Part A)
Link Resolver and Knowledge Base in Discovery Services
HOW TO WRITE A SYSTEMATIC/NARRATIVE REVIEW
Put the names of the people in the group here
Put the names of the people in the group here
PubMed Database Interface (Basic Course: Module 4)
OpenURL: Pointing a Loaded Resolver
Presentation transcript:

The Cornell Veterinarian A Metadata Perspective

The Challenge (Reprise)

Hathi Volume Interface

Hathi Data API

Hathi METS File

Hathi METS File (Continued)

Hathifile Record Elements Hathi Volume ID: mdp Access: allow [Notes on mapping for rights attributes where contextual user data would affect access] Rights: pd [public domain] HathiTrust record number: Enumeration/Chronology: v.33 no Source: MIU Source institution record number: OCLC number: Title: The Chicago medical times.

What I [naively] thought was the solution… 1.Use the Hathi Data API to find Table of Contents for each Volume 2.Gather the related OCR 3.Parse out the article citation values from the OCR (hopefully in a mostly automated way) 4.Use the pagination data from the TOC to build links 5.What could be automated could be done manually Goal: a citation index with Hathi URLs that could be used to build an interface or given to an index like PubMED

HathiTrust OCR for TOC

PubMed Indexing and API

Path for automation (For citations in PubMed for which the HathiTrust has a single volume) Query: PubMed Volume AND Hathi Catalog ID against Hathi File to get all corresponding object id’s from the METS. Query: METS object id’s AND the PubMed start page for each citation to find the Orderlabel to get the Order number from METS files. Create each URL: The Hathi METS object id and Order number are used to create the URL, e.g

The Metadata that Got Away…  Articles not indexed by PubMed ( )  Supplemental volumes What we hope to do about it:  Still working to see if we can programmatically create URL’s for Supplemental Volumes  Manually capture citation data and URL’s for pre-1945 articles using OCR.

PubMed Data Requirements  Linking Format (when we’re only contributing URL’s)  PubMed Id’s and corresponding URL’s  Administrative metadata, e.g. access restrictions, contributing source.  Required data elements for contributing citations  Journal ISSN  Journal ID or Journal title abbreviation  Journal Publisher  Copyright statement, where applicable  Volume/Issue/Article sequence or pagination  Issue publication date  Article electronic publication date?  AND URL’s

What does it all mean? For the project:  The Cornell Veterinarian should be available via PubMed for the years already indexed soon.  We’re still scoping out what it would take to capture the remaining citations manually. If funded this will be sent to PubMed to complete the backfile. Larger picture:  Potential for improved access to other titles currently lacking full-text linking in PubMed [if in HathiTrust]  Consider suggesting improvements to the Hathi workflows.