Using Publishing Profiles to dump data out of Alma needed for resource sharing systems such as HathiTrust Margaret Briand Wolfe Systems Librarian Boston.

Slides:



Advertisements
Similar presentations
Alma Analytics : a whole new world of data extraction
Advertisements

BETH BRENNAN CHRISTINE MOULEN ELUNA 5/2/2014 Automating MARCit! for a single-record approach.
Cataloging: Millennium Silver and Beyond Claudia Conrad Product Manager, Cataloging ALA Annual 2004.
Millennium Cataloging in Release 2005 Georgia Fujikawa Manager, Training Programs.
M AKING E - RESOURCE ACCESSIBLE FROM ONLINE CATALOG *e-books *serials Yan Wang Senior Librarian Head of Cataloging & Database Maintenance Central Piedmont.
October 23, Expanding the Serials Family Continuing resources in the library catalogue.
Sage Library Consortium Cataloging-in-Publication MARC record conversion.
Classroom User Training June 29, 2005 Presented by:
How to handle the Multitude Successfully handling thousands of E-Book records using MARCEdit and BIBLOAD reports Kelly Swickard Decker Library Maryland.
Alma 1 year after STP: implementing batch services IGeLU Budapest Sep 2, 2015 Bart Peeters Head Operations LIBIS.
Running a Report.  List Bibliography Report  Found under: All Titles Purpose : Creates customized bibliographies by catalog, call number, or item characteristics.
Cataloging 12.3 to 14.2 Seminar. Cataloging 2 -New check routines -Cataloging authorizations -Other innovations -Fix and expand routines -Floating keyboard.
MARC Record Cleanup. Getting Started Delete all Titles without copies Delete missing items for 2-3 or more years Delete Lost copies for 2-3 or more years.
Web Z: A Non-Programmers Perspective Sandy Card State University of New York at Binghamton March 23, 1999.
The last book in Australia? Importing last Australian copy holdings into Millennium Christian West University of Canberra
How to sort the “Order Information report” from the service “Print Acquisitions Records acq-03” Yoel Kortick.
Ex Libris, LOD and BIBFRAME
Administration Fundamentals Normalization Rules, Merge Methods, & Match Methods.
WHY SHOULD I CARE ABOUT (PRIMO) NORM RULES?. WHAT NORMALIZATION RULES DO Content display in Primo Primo functionality Troubleshooting.
Item Records – Everything One Needs to Know – well almost.
SILO File Upload & Feedback System By Marie Harms State Library of Iowa August 18 & 19, 2010.
1 Alma Developers Your Library, Extended. November 2012  Ex Libris Ltd., Internal and Confidential.
© 2015 Ex Libris | Confidential & Proprietary Yoel Kortick Senior Librarian Cataloging introductory flow.
1 Yoel Kortick Senior Librarian Adding a local Electronic Collection.
© 2015 Ex Libris | Confidential & Proprietary Bound together titles Harvard University April Yoel Kortick Senior Librarian
© 2015 Ex Libris | Confidential & Proprietary An Introductory Explanation of Alma Analytics Yoel Kortick | Senior Librarian.
1 Designing and using normalization rules Yoel Kortick Senior Librarian, Ex Libris.
1 Yoel Kortick Senior Librarian Working with the Alma Community Zone and Electronic Resources.
Aleph Publishing services with a special focus on PRIMO-FULL and PRIMO-AVAIL version 18 Presenter: Yoel Kortick.
1 Yoel Kortick | Senior Librarian Serials automation tasks for summary holdings.
1 Yoel Kortick Senior Librarian Alma Product Management Mapping the bibliographic call number to the holding record call number.
Using Publishing Profiles to dump data out of Alma needed for resource sharing systems such as HathiTrust Margaret Briand Wolfe Systems Librarian Boston.
Creative Create Lists Elizabeth B. Thomsen Member Services Manager
Merge Rules and Routines
An Introduction to the Bibliographic Metadata Profile in Alma
Automating Cataloging Workflows with OCLC and Alma APIs
Introduction to Import Profiles July 2016
Defining and using an external search profile with multiple targets for copy cataloging Yoel Kortick Senior Librarian Alma Product Management.
Yoel Kortick Senior Librarian
Dumping data out of Alma using PERL and the Alma Analytics API
Resource Management / Acquisitions
Patron Driven Acquisition (PDA) Demand Driven Acquisition (DDA)
Managing Copyrights in Invenio
Roles for Alma Catalogers
Yoel Kortick Senior Librarian
Yoel Kortick Senior Librarian
SFX V4 – Admin Changes Lieve Rottiers.
Publishing to OCLC Yoel Kortick Senior Librarian.
Metadata Editor Introduction
Standing Orders in Alma
Cataloging introductory flow
Resource Sharing Locate
Gary R. Cocozzoli Lawrence Technological University
ALEPH Version 22 Beginning Cataloging
Importing and exporting records in Alma
Journal separation anxiety
Library Content Comparison System
DESIGNING AND USING NORMALIZATION RULES
Yoel Kortick Senior Librarian
An article in an anthology and derive new record
CSU Millennium to Alma migration
Yoel Kortick Senior Librarian
Yoel Kortick Senior Librarian
Designing and Using Normalization Rules
Binding Serial Issues with a Work Order
Alternate graphic representation 880 field
Prediction Patterns and Summary Holdings
Yoel Kortick Senior Librarian
Presentation transcript:

Using Publishing Profiles to dump data out of Alma needed for resource sharing systems such as HathiTrust Margaret Briand Wolfe Systems Librarian Boston College ELUNA May 8, 2015

When the call for data comes HathiTrust Rapid ILL Browzine Your data extraction headache here ELUNA

Frustrations dumping data out of Alma and Analytics 5,000 row Excel export limit in Alma 65,000 row Excel export limit in Analytics Alma Bibliographic Export Processes MARC21 Binary MARC XML Entire MARC is too much data to sift through Alma APIs Too slow for millions of records Daily limit to the number of API calls ELUNA

Solution: Alma Publishing Profiles Is set based Can only be published in full once, subsequent publishing contains the delta Ex Libris says full re-publish is coming in a future release Need a place for the published files to land, such as S/FTP server ELUNA

HathiTrust Files Requirement Print Holdings in 3 separate files: Single Print Monographs Multi-Part Monographs Serials ELUNA

BC’s Managed Sets for HathiTrust Sets built for 9 separate libraries for both books and serials using the Advanced Repository Search Physical titles where library = O’Neill and material type = Books Physical titles where library = O’Neill and material type = Issue or Bound Issue Can combine sets but once combined sets become itemized sets instead of logical sets I combined all serials sets into one itemized set and all sub-library book sets into one itemized set O’Neill books stayed in its own logical set ELUNA

Normalization Rules Publishing Profiles can use normalization rules to determine what data is output See Alma Help, browse normalization rules if unsure how to add or edit a rule Briefly: Resource Management -> Cataloging -> Metadata Editor -> File -> New -> Normalization Rule OR Resource Management -> Cataloging -> Metadata Editor -> Rules -> Normalization Rules ELUNA

Normalization Rules We use a rule that removes all of the MARC fields except: 001 – contains system number (MMS ID) * 035 – contains OCLC number * 022 – contains ISSN. Used when set is for serials 074 – contains government document number 901 – publishing profile puts item description in 901 subfield a (more on this soon) * Required by HathiTrust ELUNA

Publishing Profiles – Profile Details Resource Configuration -> Configuration Menu -> Publishing Profiles -> Add Profile -> General Profile BC ended up with 3 publishing profiles: 1. O’Neill Books – uses logical set 2. All other sub-libraries’ books – uses combined itemized set 3. All serials – uses combined itemized set Under Content -> Publish On: Bibliographic Level Under Publishing Protocol can choose: FTP or OAI. BC uses FTP MARC Output format = MARC21 XML or MARC 21 Binary BC uses MARC21 XML, 10,000 records per file Added filename prefix to distinguish files for each of the 3 sets ELUNA

Publishing Profiles – Profile Details ELUNA

Publishing Profiles – Data Enrichment Under Bibliographic Normalization – select normalization rule you created to only export the MARC data you want Under Physical Inventory Enrichment – Check Add Items Information if profile is for books. Set repeatable field = 901, set description subfield = a. This puts the item description/enumeration in 901 tag, subfield a. This is used to find multi-part monographs. ELUNA

Publishing Profiles – Data Enrichment ELUNA

Publishing Profiles - Actions ELUNA

What to do with all those files Unzip them – I wrote a PERL script to unzip all of the files FTP’d by Alma onto one of our servers Process them – I wrote a PERL script to read each XML file and process each record in the file. To go to Hathi Trust each record needed an MMS ID and OCLC number. For Serials files I added the ISSN(s) if present Multi-part monographs could only be identified by the presence of a description field If 074 then set Gov Doc indicator = 1 ELUNA

HathiTrust elements I ignored Holding Status CH – Current Holding WD - Withdrawn LM – Lost or missing Condition BRT – Brittle, damaged and/or deteriorating ELUNA

Why I ignored them Alma does not distinguish between items that are deleted versus items that have been withdrawn. Lost and Missing statuses are stored in the item processing type. Could add to data enrichment from items. We store brittle or deteriorating condition in the item internal note. Ditto. ELUNA

Your Turn What have you done? How can we do this better? What should we ask Ex Libris for to make this process easier? ELUNA

Contact Me Margaret Briand Wolfe ELUNA