FRBR for Movies and Finding FRBR in MARC OLAC meeting Kelley McGrath University of Oregon January 27, 2013.

Slides:



Advertisements
Similar presentations
John Espley and Robert Pillow ALA New Orleans 26 June 2011 The RDA Sandbox and RDA Implementation Scenario One.
Advertisements

A worldwide library cooperative OCLC Online Computer Library Center OCLC CJK Users Group 2007 Annual Meeting March 24, 2007, Boston David Whitehair, OCLC.
OCLC Research OCLC Online Computer Library Center 2006 WebWise Los Angeles, CA 17 February 2006 FictionFinder: Don Quixote to Graphic Novels Diane Vizine-Goetz.
Teaching RDA Train-the-trainer course for RDA: Resource Description and Access Presented by the National Library of Australia September – November 2012.
Serials/Integrating Resources Charlene Chou March 18 th,
LIS512 lecture 2 relational databases Thomas Krichel
PART 1 NEW MODELS OF METADATA Karen Coyle Using RDA: Moving into the Metadata Future ALA TechSource Webinar Series.
Bibliographic Description of Some Non-book Formats LIS 532. Session 3.
MARC 101 for Non-Catalogers Colorado Horizon Users Group Meeting Philip S. Miller Library Castle Rock, CO May 29, 2007.
RDA & Serials. RDA Toolkit CONSER RDA Cataloging Checklist for Textual Serials (DRAFT) CONSER RDA Core Elements Where’s that Tool? CONSER RDA Cataloging.
RDA and libraries Gordon Dunsire Presented at a College Development Network webinar, 13 June 2013.
RDA Test at LC Module 1: Overview What RDA Is; Structure.
Cataloging: Millennium Silver and Beyond Claudia Conrad Product Manager, Cataloging ALA Annual 2004.
1 CS 502: Computing Methods for Digital Libraries Lecture 17 Descriptive Metadata: Dublin Core.
RDA: Resource Description and Access A New Cataloging Standard for a Digital Future Jennifer Bowen OLAC 2006 Conference October 27, 2006
RDA: Resource Description and Access A New Cataloging Standard for a Digital Future Jennifer Bowen Cornell University May 16, 2006
Descriptive Cataloging of Monographs 1. Introduction DRAFT.
Kelley McGrath University of Oregon MARC Formats Interest Group ALA Midwinter 2011 Flickr: Ashimjara.
RDA AND AUTHORITY CONTROL Name: Hester Marais Job Title: Authority Describer Tel: Your institution's logo.
IPUMS to IHSN: Leveraging structured metadata for discovering multi-national census and survey data Wendy L. Thomas 4 th Conference of the European Survey.
Technical Services & Cataloging and Classification Jennifer Anielski and Christina Tracy IS 554 Public Library Management.
Is Cataloging Dead: Advocacy for Bibliographic Control Randy Roeder and Rebecca Routh ILA/ACRL Spring Conference Davenport, Iowa March 3, 2008.
Structure AACR2 Part I - Description Part II - Headings, Uniform titles, References RDA Attributes (of entities) Relationships (between entities)
Session 4B – User Experience (The Catalogue and You) New display models of bibliographic data and resources: cataloguing/resource description and search.
Jennifer Bowen, University of Rochester ALA Midwinter Conference January 22, 2012, Dallas, TX The eXtensible Catalog (XC): Transitioning to a Post-MARC.
Processing of large document collections Part 10 (Information extraction: multilingual IE, IE from web, IE from semi-structured data) Helena Ahonen-Myka.
1 On the Record Report of the Library of Congress Working Group on the Future of Bibliographic Control Diane Boehr Head of Cataloging, NLM
Outcome Based Evaluation for Digital Library Projects and Services
OCLC Online Computer Library Center Kathy Kie December 2007 OCLC Cataloging & Metadata Services an introduction.
Jenn Riley Metadata Librarian IU Digital Library Program New Developments in Cataloging.
Natural Language Processing Guangyan Song. What is NLP  Natural Language processing (NLP) is a field of computer science and linguistics concerned with.
NCSU Libraries Kristin Antelman NCSU Libraries June 24, 2006.
RDA Coming soon to a catalogue near YOU Chris Todd National Library of New Zealand 2010, revised 2012.
Data Management Console Synonym Editor
Functional Requirements for Bibliographic Records: FRBR and Millennium
Robert Pillow, VTLS Inc. How Will RDA Impact Your System? A Forum of Vendors Discussing Implementation Plans Association for Library Collections & Technical.
From AACR2 to RDA: An Evolution Kathy Glennan University of Maryland.
RDA: Resource Description and Access A New Cataloging Standard for a Digital Future Jennifer Bowen RDA Forum ALA Annual Meeting, New Orleans, June 24,
Implementation scenarios, encoding structures and display Rob Walls Director Database Services Libraries Australia.
RDA Toolkit is an integrated, browser-based, online product that allow user to interact with a collection of cataloging-related documents and resources.
The Future of Cataloging Codes and Systems: IME ICC, FRBR, and RDA by Dr. Barbara B. Tillett Chief, Cataloging Policy & Support Office Library of Congress.
Jennifer Bowen, University of Rochester ALA Annual Conference, 2009, Chicago, Illinois 1 Defining Linked Data for the eXtensible Catalog (XC): Metadata.
Amy Dai Machine learning techniques for detecting topics in research papers.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
What users want & how FRBR can help Diane Vizine-Goetz Research Scientist OCLC Research.
Endeca: a faceted search solution for the library catalog Kristin Antelman & Emily Lynema UNC University Library Advisory Council June 15, 2006.
9/26/2007OCLC Orientation & Services1 What is OCLC?
Resource Description and Access Deirdre Kiorgaard ACOC Seminar, September 2007.
APPLYING FRBR TO LIBRARY CATALOGUES A REVIEW OF EXISTING FRBRIZATION PROJECTS Martha M. Yee September 9, 2006 draft.
Linked Data by Dr. Barbara B. Tillett Chief, Policy and Standards Division Library of Congress For Texas Library Association Conference April 12, 2011.
RDA and Special Libraries Chris Todd, Janess Stewart & Jenny McDonald.
RDA. That was then… This is now… Who do we catalogue for? Patrons in the library Patrons in the library Staff of the library Staff of the library The.
MARC What You Really Need to Know About This Stuff! Audrey Church Coordinator, School Library Media Program Longwood University.
The physical parts of a computer are called hardware.
FRBR: Cataloging’s New Frontier Emily Dust Nimsakont Nebraska Library Commission NCompass Live December 15, 2010 Photo credit:
NATURAL LANGUAGE PROCESSING Zachary McNellis. Overview  Background  Areas of NLP  How it works?  Future of NLP  References.
Mining MARC for Moving Image Data Mashcat January 13, 2016 Kelley McGrath University of Oregon.
Sally McCallum Library of Congress
 Cataloging DVD-videos with AACR2 Joseph Andrews, Andrea Payant and Lewis Sievers.
Implementing (parts of) FRAD in a FRBR-based discovery system Jenn Riley Metadata Librarian Indiana University Digital Library Program.
Renee Register Senior Product Manager OCLC Cataloging and Metadata Services Sandy Piver OCLC Publisher Services Consultant OCLC Services for the Publisher.
Jeanne Piascik Principal Cataloger University of Central Florida Technical Services Member Group FLA 2014 Annual Conference.
23 rd Annual Innovative Users Group Conference April 13 th – 16 th 2015.
The ___ is a global network of computer networks Internet.
FRBR for Movies and Finding FRBR in MARC
Cindy Cogdill LIB 5050 January 30, 2004
Finding Movies with FRBR & Facets
Cataloging Tips and Tricks
MARC: Beyond the Basics 11/24/2018 (C) 2006, Tom Kaun.
Presentation transcript:

FRBR for Movies and Finding FRBR in MARC OLAC meeting Kelley McGrath University of Oregon January 27, 2013

FRBR for Movies 2

Users Are Looking for Movies 3

Libraries Describe Publications 4

Dracula [videorecording] / Columbia Pictures ; directed by Francis Ford Coppola … 2-disc special ed. Culver City, Calif. : Columbia TriStar Home Entertainment, c videodiscs (ca. 130 min.) : sd., col. ; 4 3/4 in. + 1 booklet. Horror film series 5

Libraries Describe Publications 6

Users Care About Versions 7 Rg1024: open clip art library 日本語

Prototype: Movies & Versions Funded: OLAC (Online Audiovisual Catalogers) Developed by Chris Fitzpatrick Small scale (limited data, few fields and records, simplified data model) 8

Movie (Mostly Work) Facets 9

Results List 10 Results focused on movie (work) Fulfillment options below (expression, manifestation, item)

Version (Expression/ Manifestation/ Item) Facets 11

Prototype Prototype 24.heroku.comhttp://blazing-sunset- 24.heroku.com Sample searches and use cases 24.heroku.com/page/samples 24.heroku.com/page/samples Code 12

Why the FRBR Model? to focus displays on original movies while supporting users in selecting and obtaining appropriate versions to enable shared maintenance of discrete movie-level records and reduce data redundancy, thereby supporting efficient production of more complete and accurate metadata 13

Finding FRBR in MARC 14

Machine-Actionable Data Structured data OriginalReleaseYear = 2011 NOT Originally released as a motion picture in Mapped to FRBR entities and attributes OriginalReleaseYear = Date of the Work 15

Machine-Actionable Data Supports faceted access and the creation of more readable, grid-like displays Enables targeted search and flexible display Dracula (1992 : Francis Ford Coppola) Format: DVD, NTSC Languages: English, German Subtitles: English, French, Spanish Accessibility: Closed-captioned Region: Region 1 (U.S. and Canada only) 16

ALA Annual 2012 presentation on extracting work data from MARC records: 17

Names and Functions Want to link authorized names with controlled vocabulary for functions Director = Clint Eastwood directed by Clint Eastwood 700 $a Eastwood, Clint, $d $4 drt 18

Identify Individual Statements Based on Punctuation 245 $c Metro-Goldwyn-Mayer picture ; screenplay by George S. Kaufman and Morrie Ryskind ; directed by Sam Wood  1.Metro-Goldwyn-Mayer picture 2.screenplay by George S. Kaufman and Morrie Ryskind 3.directed by Sam Wood 19

Make a List of Terms for Each Function (Synonyms) screenplay screenwriter script scriptwriter writer written by 20 aus =

Make a List of Terms for Each Function (Translations; Unwanted Variations) directed by direction director 監督 Regie режиссер- постановщик director of photography animation director 21 drt =

Map Transcribed Names 1.screenplay by George S. Kaufman and Morrie Ryskind 2.directed by Sam Wood $a Kaufman, George S. … $a Ryskind, Morrie, … $a Wood, Sam, … 22

Map Transcribed Roles 1. screenplay by George S. Kaufman and Morrie Ryskind 2. directed by Sam Wood $a Kaufman, George S. … $4 aus $a Ryskind, Morrie, … $4 aus $a Wood, Sam, … $4 drt 23

Many Possibilities… 700 $a Wood, Sam, $4 drt     tml =     Director: Wood, Sam, Regie: Sam Wood 24

Natural Language Processing (NLP) “deals with analyzing, understanding and generating the languages that humans use naturally”—Webopedia –Artificial intelligence –Automatic summarization –Machine translation –Named entity recognition (NER ) 25

Natural language processing toolkits Named entity recognition (NER ) –“April Stevens” –“Twentieth Century Fox” –“an Austrian-French co-production, Wega Film, MK2 Productions and Les Films Alain Sarde, Arte France Cinéma” 26

Named entity recognition Approaches to matching Start with authorized names and match to statements Start with statements and match to authorized names 1. screenplay by George S. Kaufman and Morrie Ryskind 2. directed by Sam Wood $a Kaufman, George S. … $a Ryskind, Morrie, … $a Wood, Sam, … 27

Hard-Coded Rules vs. Machine Learning Rules: –Manually-compiled lists and decision trees Machine learning: –Usually based on statistical models –Supervised vs. semi-supervised vs. unsupervised learning 28

Supervised Learning Training data –Set of hand-annotated inputs and desired outputs directed by Sam Wood  drt = $a Wood, Sam, $d Computer then generalizes from training data when working on novel data 29

What You Can Do Soon Help us create a hand-annotated set of correct answers for –Training data –Evaluation Online web form coming soon… 30

What You Can Do Soon directed by Sam Wood English Sam Wood Wood, Sam, $d Person Directed by Director 31

What You Can Do Now Use 130 uniform titles 257 country of producing entity 046 $k for original date 041 $h for original language 1xx/7xx $4/$e relator codes or terms 32

What You Can Do Now $a Lawrence of Arabia (Motion picture) $a Great Britain $a United States $2 naf $k $a eng $h eng $a Lean, David, $d $4 drt $a O'Toole, Peter, $d $4 act 33

What You Can Do Now $a My neighbor Totoro (Motion picture) $a Japan $2 naf $k $a jpn $a eng $j eng $h jpn $a Miyazaki, Hayao, $d $4 drt $4 aus 34

Overview of Project 1.Develop end-user interface to take advantage of FRBR and facets 2.Extract and transform existing data MARC  normalized, FRBR-based data Cluster records for FRBR entities Create provisional work (movie) records Assess and correct errors where possible 3.Create backend interface for ongoing input and management of metadata 4.Develop guidelines and documentation for catalogers 35

Interested in Participating? Contact me at Kelley McGrath Metadata Management Librarian University of Oregon Libraries (541)