Metadata Standards and Applications 6. Vocabularies: Attributes and Values.

Slides:



Advertisements
Similar presentations
Numbers Treasure Hunt Following each question, click on the answer. If correct, the next page will load with a graphic first – these can be used to check.
Advertisements

Repaso: Unidad 2 Lección 2
1 A B C
Simplifications of Context-Free Grammars
Variations of the Turing Machine
3rd Annual Plex/2E Worldwide Users Conference 13A Batch Processing in 2E Jeffrey A. Welsh, STAR BASE Consulting, Inc. September 20, 2007.
AP STUDY SESSION 2.
1
Select from the most commonly used minutes below.
Chapter 7 System Models.
Copyright © 2003 Pearson Education, Inc. Slide 7-1 Created by Cheryl M. Hughes The Web Wizards Guide to XML by Cheryl M. Hughes.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Chapter 4 Computing Platforms.
Processes and Operating Systems
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Slide 1 FastFacts Feature Presentation October 16 th, 2008 We are using audio during this session, so please dial in to our conference line… Phone number:
David Burdett May 11, 2004 Package Binding for WS CDL.
1. 2 Begin with the end in mind! 3 Understand Audience Needs Stakeholder Analysis WIIFM Typical Presentations Expert Peer Junior.
1. 2 Begin with the end in mind! 3 Understand Audience Needs Stakeholder Analysis WIIFM Typical Presentations Expert Peer Junior.
Prepared by: Workforce Enterprise Services For: The Illinois Department of Commerce and Economic Opportunity Bureau of Workforce Development ENTRY OF EMPLOYER.
Local Customization Chapter 2. Local Customization 2-2 Objectives Customization Considerations Types of Data Elements Location for Locally Defined Data.
Create an Application Title 1Y - Youth Chapter 5.
Process a Customer Chapter 2. Process a Customer 2-2 Objectives Understand what defines a Customer Learn how to check for an existing Customer Learn how.
CALENDAR.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt BlendsDigraphsShort.
1 Chapter 12 File Management Patricia Roy Manatee Community College, Venice, FL ©2008, Prentice Hall Operating Systems: Internals and Design Principles,
UKOLN, University of Bath
Dr. Alexandra I. Cristea CS 253: Topics in Database Systems: C3.
1 Making Changes to Existing Name and Work/Expression Authority Records Module 7. Making Changes to Existing Name and Work/Expression Authority Records.
1 Click here to End Presentation Software: Installation and Updates Internet Download CD release NACIS Updates.
Welcome. © 2008 ADP, Inc. 2 Overview A Look at the Web Site Question and Answer Session Agenda.
Break Time Remaining 10:00.
Turing Machines.
PP Test Review Sections 6-1 to 6-6
1 IMDS Tutorial Integrated Microarray Database System.
EIS Bridge Tool and Staging Tables September 1, 2009 Instructor: Way Poteat Slide: 1.
Office 2003 Introductory Concepts and Techniques M i c r o s o f t Office 2003 Integration Integrating Office 2003 Applications and the World Wide Web.
Operating Systems Operating Systems - Winter 2010 Chapter 3 – Input/Output Vrije Universiteit Amsterdam.
Sample Service Screenshots Enterprise Cloud Service 11.3.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
GIS Lecture 8 Spatial Data Processing.
Adding Up In Chunks.
FAFSA on the Web Preview Presentation December 2013.
SLP – Endless Possibilities What can SLP do for your school? Everything you need to know about SLP – past, present and future.
MaK_Full ahead loaded 1 Alarm Page Directory (F11)
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt Synthetic.
Center on Knowledge Translation for Disability and Rehabilitation Research Information Retrieval for International Disability and Rehabilitation Research.
7/16/08 1 New Mexico’s Indicator-based Information System for Public Health Data (NM-IBIS) Community Health Assessment Training July 16, 2008.
2004 EBSCO Publishing Presentation on EBSCOadmin.
1 Lab 17-1 ONLINE LESSON. 2 If viewing this lesson in Powerpoint Use down or up arrows to navigate.
: 3 00.
5 minutes.
1 hi at no doifpi me be go we of at be do go hi if me no of pi we Inorder Traversal Inorder traversal. n Visit the left subtree. n Visit the node. n Visit.
Prof.ir. Klaas H.J. Robers, 14 July Graduation: a process organised by YOU.
Speak Up for Safety Dr. Susan Strauss Harassment & Bullying Consultant November 9, 2012.
1 Titre de la diapositive SDMO Industries – Training Département MICS KERYS 09- MICS KERYS – WEBSITE.
Converting a Fraction to %
Numerical Analysis 1 EE, NCKU Tien-Hao Chang (Darby Chang)
CSE20 Lecture 15 Karnaugh Maps Professor CK Cheng CSE Dept. UC San Diego 1.
Clock will move after 1 minute
Chapter 13 Web Page Design Studio
Physics for Scientists & Engineers, 3rd Edition
Select a time to count down from the clock above
Import Tracking and Landed Cost Processing An Enhancement For AS/400 DMAS from  Copyright I/O International, 2001, 2005, 2008, 2012 Skip Intro Version.
Copyright Tim Morris/St Stephen's School
1.step PMIT start + initial project data input Concept Concept.
South Dakota Library Network MetaLib User Interface South Dakota Library Network 1200 University, Unit 9672 Spearfish, SD © South Dakota.
6. Applying metadata standards: Controlled vocabularies and quality issues Metadata Standards and Applications Workshop.
A Registry for controlled vocabularies at the Library of Congress
Presentation transcript:

Metadata Standards and Applications 6. Vocabularies: Attributes and Values

Goals of Session Understand how different vocabularies are used in metadata Understand how different vocabularies are used in metadata Learn about relationships in vocabularies Learn about relationships in vocabularies Understand methods of encoding vocabularies for various purposes Understand methods of encoding vocabularies for various purposes Learn about how registries are used to document vocabularies Learn about how registries are used to document vocabularies Metadata Standards & Applications 2

3 Vocabulary Issues Where vocabularies occur in metadata Where vocabularies occur in metadata Establishment of formal relationships among terms (where appropriate) Establishment of formal relationships among terms (where appropriate) Testing and validation of terms Testing and validation of terms The role of Metadata Registries The role of Metadata Registries

Metadata Standards & Applications 4 Why bother? To improve retrieval, i.e., to get an optimum balance of precision and recall To improve retrieval, i.e., to get an optimum balance of precision and recall –Precision – How many of the retrieved records are relevant? –Recall – How many of the relevant records did you retrieve?

Metadata Standards & Applications 5 Improving recall and precision Controlled Vocabularies improve recall by addressing synonyms [attire vs. dress vs. clothing] Controlled Vocabularies improve recall by addressing synonyms [attire vs. dress vs. clothing] Controlled Vocabularies improve precision by addressing homographs [bridge (game) vs. bridge (structure) vs. bridge (dental device)] Controlled Vocabularies improve precision by addressing homographs [bridge (game) vs. bridge (structure) vs. bridge (dental device)]

Metadata Standards & Applications 6 Types of Controlled Vocabularies Lists Lists Synonym Rings Synonym Rings Taxonomy Taxonomy Thesaurus Thesaurus [Classification Schemes] [Classification Schemes] Ontology Ontology

Metadata Standards & Applications 7 Thesauri & Classification Some knowledge management researchers feel that these are essentially the same, with the primary difference being whether the preferred term is a notation Some knowledge management researchers feel that these are essentially the same, with the primary difference being whether the preferred term is a notation As the need to do machine readable encoding progresses, some additional differences are emerging As the need to do machine readable encoding progresses, some additional differences are emerging

Metadata Standards & Applications 8 Lists A list is a simple group of terms Example: Example:AlabamaAlaskaArkansasCaliforniaColorado.... Frequently used in Web site pick lists and pull down menus Frequently used in Web site pick lists and pull down menus

Metadata Standards & Applications 9 Synonym Rings Synonym rings are used to expand queries for content objects Synonym rings are used to expand queries for content objects –If a user enters any one of these terms as a query to the system, all items are retrieved that contain any of the terms in the cluster Synonym rings are often used in systems where the underlying content objects are left in their unstructured natural language format Synonym rings are often used in systems where the underlying content objects are left in their unstructured natural language format – the control is achieved through the interface by drawing together similar terms into these clusters Synonym rings are used in conjunction with search engines and provide a minimal amount of control of the diversity of the language found in the texts of the underlying documents Synonym rings are used in conjunction with search engines and provide a minimal amount of control of the diversity of the language found in the texts of the underlying documents

Metadata Standards & Applications 10 Taxonomies A taxonomy is a set of preferred terms, all connected by a hierarchy or polyhierarchy Example: Example:Chemistry Organic chemistry Polymer chemistry Nylon Frequently used in web navigation systems

Metadata Standards & Applications 11 Thesauri A thesaurus is a controlled vocabulary with multiple types of relationships Example: Example:Rice UF paddy UF paddy BT Cereals BT Plant products NT Brown rice RT Rice straw

Metadata Standards & Applications 12 Ontology A useful definition: An arrangement of concepts and relations based on an underlying model of reality. A useful definition: An arrangement of concepts and relations based on an underlying model of reality. –Ex.: Organs, symptoms, and diseases in medicine No real agreement on definition every community uses the term in a slightly different way No real agreement on definition every community uses the term in a slightly different way

Metadata Standards & Applications 13 Thesaural Relationships Relationship types: Use/Used For – indicates preferred term Use/Used For – indicates preferred term Hierarchy – indicates broader and narrower terms Hierarchy – indicates broader and narrower terms Associative – almost unlimited types of relationships may be used Associative – almost unlimited types of relationships may be used It is the most complex format for controlled vocabularies and widely used. It is the most complex format for controlled vocabularies and widely used.

Metadata Standards & Applications 14

Metadata Standards & Applications 15 Z39.19 Types of Concepts Things and their physical parts Things and their physical parts Materials Materials Activities or processes Activities or processes Events or occurrences Events or occurrences Properties or states of persons, things, materials or actions Properties or states of persons, things, materials or actions Disciplines or subject fields Disciplines or subject fields Units of measurement Units of measurement Unique entities Unique entities

Metadata Standards & Applications 16 Examples Birds (things) Birds (things) Ornithology (discipline) Ornithology (discipline) Feathers (materials) Feathers (materials) Flying (activity or process) Flying (activity or process) Bird counts (event) Bird counts (event) Barn Owl (unique entity) Barn Owl (unique entity)

Metadata Standards & Applications 17 Relationships Equivalence Equivalence Hierarchical Hierarchical Associative Associative

Metadata Standards & Applications 18 Equivalence Relationships Term A and Term B overlap completely A = B

Metadata Standards & Applications 19 Hierarchical Relationships Term A is included in Term B Term A is included in Term B B A

Metadata Standards & Applications 20 Associative Relationships Semantics of terms A and B overlap Semantics of terms A and B overlap AB

Metadata Standards & Applications 21 Expressing Relationship

Metadata Standards & Applications 22 Hierarchy rules Relationships must be independent of context Relationships must be independent of context Examples: Examples: –Mice (BT Rodents); Rodents (NT Mice) –NOT Mice (BT Pests); Pests (NT Mice)

Metadata Standards & Applications 23 Hierarchy rules Terms must represent the same type of entity Terms must represent the same type of entity Examples: Examples: –Shoes (BT Footwear); Footwear (NT Shoes) –NOT Shoes (BT Shoemaking); Shoemaking (NT Shoes)

Metadata Standards & Applications 24 Vocabulary Management The degree of control over a vocabulary is (mostly) independent of its type The degree of control over a vocabulary is (mostly) independent of its type –Uncontrolled – Anybody can add anything at any time and no effort is made to keep things consistent –Managed – Software makes sure there is a list that is consistent (no duplicates, no orphan nodes) at any one time. Almost anybody can add anything, subject to consistency rules –Controlled – A documented process is followed for the update of the vocabulary. Few people have authority to change the list. Software may help, but emphasis is on human processes and custodianship

Metadata Standards & Applications 25 Informal Vocabularies New movement towards bottom up classification goes by many names: New movement towards bottom up classification goes by many names: –Tagging –Social bookmarking –Folksonomies Many in this movement, seeing problems of scale, are moving towards more formalization Many in this movement, seeing problems of scale, are moving towards more formalization

Libraries/Museums and Tagging Penn Tags Penn Tags –Still experimental, primarily internal to Penn – Library of Congress Flickr project Library of Congress Flickr project –Open public tagging, still unclear how results will be used – The Art Museum Social Tagging Project The Art Museum Social Tagging Project –Research/software project focused on museum application – Metadata Standards & Applications 26

Metadata Standards & Applications 27 Current Encoding Standards: Authorities MARC 21 MARC 21 –Authority Format used for names, subjects, series; –Classification Format used for subject classification MADS (a derivative of MARC authorities) MADS (a derivative of MARC authorities) –Used primarily for names

Metadata Standards & Applications 28 MARC 21 Authority Name

Metadata Standards & Applications 29 MARC 21 Authority Subject

Metadata Standards & Applications 30 MARC 21 Classification LCC

Metadata Standards & Applications 31 MARC 21 Classification DDC

What is MADS? Metadata Authority Description Schema Metadata Authority Description Schema –A companion to MODS for authority data using XML –Defines a subset of MARC authority elements using language-based tags –Elements have same definitions as equivalent MODS MADS can be used for metadata about people, organizations, events, subjects, time periods, genres, geographics and occupations MADS can be used for metadata about people, organizations, events, subjects, time periods, genres, geographics and occupations Metadata Standards & Applications 32

MADS Elements Authority Authority –name –titleInfo –topic –temporal –genre –geographic –hierarchicalGeographic –occupation Related Related –same subelements Variant Variant –same subelements Note Note Affiliation Affiliation url url Identifier Identifier fieldOfActivity fieldOfActivity Extension Extension recordInfo recordInfo Metadata Standards & Applications 33

Metadata Standards & Applications 34

New/Upcoming Standards:Authorities Functional Requirements for Authority Data (FRAD) Functional Requirements for Authority Data (FRAD) –A new model for authority information –Developed by the IFLA Working Group on Functional Requirements and Numbering of Authority Records (FRANAR) –VIAF (Virtual International Authority File) Prototype at: Prototype at: A Review of the Feasibility of an International Authority Data Number (ISADN) A Review of the Feasibility of an International Authority Data Number (ISADN) Simple Knowledge Organization System (SKOS) a W3C standard Simple Knowledge Organization System (SKOS) a W3C standard Metadata Standards & Applications 35

Metadata Standards & Applications 36

Functions of the Authority File Document decisions Document decisions Serve as reference tool Serve as reference tool Control forms of access points Control forms of access points Support access to bibliographic files Support access to bibliographic files Link bibliographic and authority files Link bibliographic and authority files (Slide from Glenn Patton)

Metadata Standards & Applications 38 FRANAR Concept Model, top

Metadata Standards & Applications 39 FRANAR Concept Model, bottom

FRAD person attributes From FRBR (AACR2 additions to names): Dates associated with the person Title of person Other designation associated with the person New: Gender Place of birth Place of death Country Place of residence Affiliation Address Language of person Field of activity Profession/occupation Biography/history (Slide from Ed Jones)

Metadata Standards & Applications 41 VIAF Search Result

Metadata Standards & Applications 42 VIAF DNB Display

SKOS Simple Knowledge Organisation System (SKOS) Simple Knowledge Organisation System (SKOS) –A World Wide Web Consortium (W3C) standard –Based on RDF and OWL –Currently resolving last call comments, will be finalized in early 2009 – Metadata Standards & Applications 43

Metadata Standards & Applications 44 The skos:Concept class allows you to assert that a resource is a conceptual resource. That is, the resource is itself a concept.skos:Concept

Metadata Standards & Applications 45 The RDF/XML Encoded Version

Metadata Standards & Applications 46 Preferred and Alternative Lexical Labels

Metadata Standards & Applications 47 The RDF/XML Encoded Version

Metadata Standards & Applications 48 Registries: the Big Picture (Adapted from Wagner & Weibel, The Dublin Core Metadata Registry: Requirements, Implementation, and Experience JoDI, 2005)

Metadata Standards & Applications 49 Why Registries? Support the interoperability cycle: Support the interoperability cycle: –Discovery of available schemes and schemas for description of resources –Promote reuse of extant schemes and schemas –Access to machine-readable and human- readable services –Support for crosswalking and translation Coping with a state of perpetual metadata heterogeneity (Bianchi and Petrone) Coping with a state of perpetual metadata heterogeneity (Bianchi and Petrone)

Metadata Standards & Applications 50 What Do Registries Register? Metadata Schemas (element sets, formats) Metadata Schemas (element sets, formats) –Crosswalks between metadata schemas Controlled Vocabularies Controlled Vocabularies –Mappings between vocabularies Application Profiles Application Profiles –Schema and vocabulary information in combination with specific usage instruction

Metadata Standards & Applications 51 Dublin Core RegistryTerm Level

Metadata Standards & Applications 52 NSDL RegistryProperty Vocabulary List

Metadata Standards & Applications 53 NSDL RegistryProperty Vocabulary Detail

Metadata Standards & Applications 54 Element Detail RDF

Metadata Standards & Applications 55 Concept Vocabulary Detail

Metadata Standards & Applications 56 Concept Vocabulary XML Schema

Please Play! The NSDL Registry has a sandbox where anyone can try out the registry software: The NSDL Registry has a sandbox where anyone can try out the registry software: – Please feel free to play in the Registry Sandbox! Please feel free to play in the Registry Sandbox! Note: The production registry is open as well, but not for play … Note: The production registry is open as well, but not for play … Metadata Standards & Applications 57

Metadata Standards & Applications 58 Acknowledgements Some slides used here are from presentations by Marcia Zeng and Alistair Miles Some slides used here are from presentations by Marcia Zeng and Alistair Miles