Extensible Library Catalog Name Access Control Module Matthew Horoszowski Rob Busack Anthony Lyo Ben Greenwood Dean Rzonca Sponsored by University of Rochester.

1 Extensible Library Catalog Name Access Control Module Matthew Horoszowski Rob Busack Anthony Lyo Ben Greenwood Dean Rzonca Sponsored by University of Rochester River Campus Library

2 Overview Project overview Features Future development Demo Questions

3 Project Name matching: the same name is entered in different forms, and one person may publish under multiple pen names. Finding matching records: easy when an authority record for the author already exists; a new authority record is created when one does not. Importing different record formats.

4 Technologies Used Java XML MySQL Hibernate Ant Marc4J

5 Supported Record Types MARC Authority records MARC Bibliographic records Dublin Core records

6 Features Persistent data storage Import records Match records A functional API A prototype GUI

7 Importing Identifies the format of each incoming record. Imports MARC XML and Dublin Core XML files. Uses Marc4J to convert raw MARC data to MARC XML. Detects duplicate records. Updates existing records with new information.
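The format-identification step could look roughly like the following. This is a minimal sketch, not the module's actual importer: the class `FormatDetector`, its `Format` enum and the `detect` method are hypothetical names, and the detection rule (inspect XML namespaces) is an assumption. Only the two namespace URIs are standard: the MARC21 slim schema and the Dublin Core element set.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper: guesses the record format of an XML string by namespace.
final class FormatDetector {
    enum Format { MARC_XML, DUBLIN_CORE, UNKNOWN }

    static final String MARC_NS = "http://www.loc.gov/MARC21/slim";
    static final String DC_NS = "http://purl.org/dc/elements/1.1/";

    static Format detect(String xml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true); // needed for namespace lookups
            DocumentBuilder builder = factory.newDocumentBuilder();
            // DefaultHandler rethrows fatal errors without printing to stderr.
            builder.setErrorHandler(new DefaultHandler());
            Document doc = builder.parse(
                new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));

            // MARC XML: the root element lives in the MARC21 slim namespace.
            if (MARC_NS.equals(doc.getDocumentElement().getNamespaceURI())) {
                return Format.MARC_XML;
            }
            // Dublin Core: at least one element from the DC element set.
            if (doc.getElementsByTagNameNS(DC_NS, "*").getLength() > 0) {
                return Format.DUBLIN_CORE;
            }
            return Format.UNKNOWN;
        } catch (Exception e) {
            // Not well-formed XML (for example raw MARC that still needs converting).
            return Format.UNKNOWN;
        }
    }
}
```

Input that fails to parse as XML comes back as `UNKNOWN`, which is where a raw-MARC conversion step would plausibly hook in before the record is retried.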

8 Matching Phases

9 Matching Loops through all unmatched records. Tries various strategies and string transformations in decreasing order of confidence. If a match is found, a link is created and the evidence for it is recorded. If no match is found, a new Authority record is created from the information in the Bibliographic record.
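The loop described on this slide can be sketched as below. Everything here is illustrative: the class names, the three strategies and their confidence values are assumptions, and records are reduced to plain name strings rather than the module's persisted records.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.function.UnaryOperator;

// Hypothetical sketch of "try strategies in order of confidence".
final class MatchingSketch {
    static final class Link {
        final String bibName, authorityName, evidence;
        final double confidence;
        Link(String bibName, String authorityName, double confidence, String evidence) {
            this.bibName = bibName;
            this.authorityName = authorityName;
            this.confidence = confidence;
            this.evidence = evidence;
        }
    }

    static final class Strategy {
        final String label;
        final double confidence;
        final UnaryOperator<String> transform;
        Strategy(String label, double confidence, UnaryOperator<String> transform) {
            this.label = label;
            this.confidence = confidence;
            this.transform = transform;
        }
    }

    // Ordered from most to least trustworthy; the values are made up.
    static final List<Strategy> STRATEGIES = Arrays.asList(
        new Strategy("exact", 1.0, s -> s),
        new Strategy("case-insensitive", 0.9, s -> s.toLowerCase(Locale.ROOT)),
        new Strategy("no-punctuation", 0.7, s -> s.toLowerCase(Locale.ROOT)
            .replaceAll("[^\\p{L}\\p{Nd} ]", "").replaceAll("\\s+", " ").trim()));

    // Returns the first (highest-confidence) match, or null if none applies.
    static Link firstMatch(String bibName, List<String> authorities) {
        for (Strategy s : STRATEGIES) {
            String key = s.transform.apply(bibName);
            for (String authority : authorities) {
                if (key.equals(s.transform.apply(authority))) {
                    return new Link(bibName, authority, s.confidence, s.label);
                }
            }
        }
        return null;
    }

    // Links every unmatched name; creates a new authority entry when nothing matches.
    static List<Link> matchAll(List<String> unmatched, List<String> authorities) {
        List<Link> links = new ArrayList<>();
        for (String bibName : unmatched) {
            Link link = firstMatch(bibName, authorities);
            if (link == null) {
                authorities.add(bibName); // the list passed in must be mutable
                link = new Link(bibName, bibName, 1.0, "new authority record created");
            }
            links.add(link);
        }
        return links;
    }
}
```

Because the strategies are tried in list order, a record that matches exactly never falls through to a weaker transformation, and the strategy label stored on the link serves as the evidence for the match.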

10 Name Transformations Names are transformed to get better matches. For example: Homer Simpson → Simpson, Homer. Smith, Elizabeth ($q Ann Elizabeth) → Smith, Ann Elizabeth. De la Mare, Walter → Mare, Walter De la. Vaughan Williams, Ralph → Williams, Ralph Vaughan.
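The transformations on this slide can be approximated with a few string operations. The sketch below is illustrative only: the class `NameTransforms` and its method names are hypothetical, and the rules are inferred from the slide's examples rather than taken from the module's code.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helpers reproducing the slide's example transformations.
final class NameTransforms {
    // "Homer Simpson" becomes "Simpson, Homer"; already-inverted names are left alone.
    static String invert(String name) {
        String trimmed = name.trim();
        int lastSpace = trimmed.lastIndexOf(' ');
        if (trimmed.contains(",") || lastSpace < 0) {
            return trimmed;
        }
        return trimmed.substring(lastSpace + 1) + ", " + trimmed.substring(0, lastSpace);
    }

    // Matches "Surname, Forename ($q Fuller Form)".
    private static final Pattern FULLER_FORM =
        Pattern.compile("^([^,]+),\\s*[^(]*\\(\\$q\\s+([^)]+)\\)\\s*$");

    // "Smith, Elizabeth ($q Ann Elizabeth)" becomes "Smith, Ann Elizabeth".
    static String expandFullerForm(String name) {
        Matcher m = FULLER_FORM.matcher(name.trim());
        return m.matches() ? m.group(1).trim() + ", " + m.group(2).trim() : name.trim();
    }

    // "De la Mare, Walter" becomes "Mare, Walter De la": only the last word of
    // the surname is kept as the entry element.
    static String moveLeadingSurnameWords(String name) {
        String trimmed = name.trim();
        int comma = trimmed.indexOf(',');
        if (comma < 0) {
            return trimmed;
        }
        String surname = trimmed.substring(0, comma).trim();
        String forename = trimmed.substring(comma + 1).trim();
        int lastSpace = surname.lastIndexOf(' ');
        if (lastSpace < 0 || forename.isEmpty()) {
            return trimmed;
        }
        return surname.substring(lastSpace + 1) + ", " + forename + " "
            + surname.substring(0, lastSpace);
    }

    // All distinct forms of a name, original first, for use as match keys.
    static List<String> candidates(String name) {
        Set<String> forms = new LinkedHashSet<>();
        forms.add(name.trim());
        forms.add(invert(name));
        forms.add(expandFullerForm(name));
        forms.add(moveLeadingSurnameWords(name));
        return new ArrayList<>(forms);
    }
}
```

A matcher could compare the `candidates` of a bibliographic name against the `candidates` of each authority name, treating any overlap as a possible match.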

11 Discriminators A discriminator adjusts the confidence in a match based on a discrimination criterion, for example common names or publication dates.
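One way to picture a discriminator is as a function that takes a confidence value and returns an adjusted one. The sketch below is an assumption throughout: the class name, the list of common surnames, the penalty factors (0.8 and 0.1) and the use of a birth year are all invented for illustration and do not come from the slides.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical discriminators; every threshold and factor here is made up.
final class DiscriminatorSketch {
    static final Set<String> COMMON_SURNAMES =
        new HashSet<>(Arrays.asList("smith", "jones", "williams"));

    // Common names: a match on a very common surname is less convincing.
    static double commonNamePenalty(double confidence, String invertedName) {
        String surname = invertedName.split(",", 2)[0].trim().toLowerCase(Locale.ROOT);
        return COMMON_SURNAMES.contains(surname) ? confidence * 0.8 : confidence;
    }

    // Publication dates: a work published before the author was born is
    // almost certainly by someone else with the same name.
    static double publicationDatePenalty(double confidence, int publicationYear, int birthYear) {
        return publicationYear < birthYear ? confidence * 0.1 : confidence;
    }

    // Applies both discriminators in sequence to a starting confidence.
    static double adjust(double confidence, String invertedName,
                         int publicationYear, int birthYear) {
        double adjusted = commonNamePenalty(confidence, invertedName);
        return publicationDatePenalty(adjusted, publicationYear, birthYear);
    }
}
```

Multiplicative factors keep the result between 0 and the starting confidence, so discriminators can be chained in any order without additional clamping.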

12 Graphical User Interface Schedules jobs Filters and sorts results Views records and matches Supports manual matching of records

13 Future Possibilities Support for new metadata formats A web-based interface Searching (as the backend to an OPAC) GUI improvements

14 Demo

15 Questions and Comments?

