Federal Controlled Vocabularies Data Architecture Sub-Committee (DAS) April 8, 2010 Brand K. Niemann.

Presentation transcript:

Federal Controlled Vocabularies Data Architecture Sub-Committee (DAS) April 8, 2010 Brand K. Niemann

Federal Controlled Vocabularies What Are They Examples Discussion

Why a Controlled Vocabulary?
Improve effectiveness of information storage and retrieval systems
Knowledge workers spend 25-35% of their time searching for information with 50% success 1
The need for vocabulary control arises from two basic features of natural language, namely:
– Two or more words or terms can be used to represent a single concept
  Example: salinity/saltiness, VHF/Very High Frequency
– Two or more words that have the same spelling can represent different concepts
  Example: Mercury (planet), Mercury (metal), Mercury (automobile), Mercury (mythical being)
Tutorial Working Council of CIOs, Business Wire, Feb
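The two features of natural language named above can be sketched in a few lines of Python. This is only an illustration and not part of the presentation: the names SYNONYMS, HOMOGRAPHS, preferred_term and senses are hypothetical, and the choice of "salinity" and "Very High Frequency" as the preferred terms is an assumption.

```python
# Synonym control: every entry term maps to exactly one preferred term.
# Which term counts as "preferred" is an assumption made for illustration.
SYNONYMS = {
    "salinity": "salinity",
    "saltiness": "salinity",
    "vhf": "Very High Frequency",
    "very high frequency": "Very High Frequency",
}

# Homograph control: one spelling, several concepts, told apart by a qualifier.
HOMOGRAPHS = {
    "mercury": [
        "Mercury (planet)",
        "Mercury (metal)",
        "Mercury (automobile)",
        "Mercury (mythical being)",
    ],
}


def preferred_term(word):
    """Return the preferred term for a word, or None if it is not controlled."""
    return SYNONYMS.get(word.strip().lower())


def senses(word):
    """Return the qualified terms sharing this spelling (empty list if none)."""
    return HOMOGRAPHS.get(word.strip().lower(), [])
```

A search front end could call preferred_term on each query word before looking it up in an index, and use senses to ask the user which "Mercury" is meant.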

Controlled Vocabulary
List: Set of terms arranged in logical way
Synonym Ring: + Words with same meaning in a given context
Authority File: + Preferred Terms (USE)
Taxonomy: + Broader (BT) and Narrower Terms (NT) {BT, NT, USE}
Thesaurus: + Related Terms (RT)
Increasing structural and semantic complexity
Why and when to use: Dimension and Context

Controlled Vocabulary: Dimension and Context
List: Set of terms arranged in logical way
Synonym Ring: + Words with same meaning in a given context
Authority File: + Preferred Terms (USE)
Taxonomy: + Broader (BT) and Narrower Terms (NT) {BT, NT, USE}
Thesaurus: + Related Terms (RT)
Increasing structural and semantic complexity

Dimension and Context (not a definitive list)
Organization: human resources, marketing, accounting, etc.
Function Type: employment, staffing, training, etc.
Subject: water pollution, soil pollution, air pollution, etc.
Identify a document or database for a data catalog (data.gov, data.gov.uk, etc.): Consistent vocabulary for describing database or document; dcat and related, Dublin Core, SKOS, FOAF 1
Identify a data Item: Vehicle Identification Number (VIN), Uniform Resource Identifier (URI)
Identify a data Element: Patient Person First Name; ISO/IEC /UDEF
Relate a Resource
Relate a Vocabulary 1

Controlled Vocabulary Examples

Agency -- Context -- Dimension
DOD - Center for Army Lessons Learned. Intended Purpose: Organization of equipment supporting the business -- Function (Also by Type)
NASA - NASA Thesaurus. Intended Purpose: Organization of equipment supporting the business -- Type
EPA - Data Classes and Areas. Intended Purpose: Organization of subject areas supporting the business -- Subject
IRS - IRS Tax Map. Intended Purpose: Organization of topics for answering questions -- Subject

Synonyms and Word Equivalent
Radio - Radio Detection and Ranging
Telescope - scope
Manned Lunar Space Vehicle - Apollo 11 Mission
Waste - Run-off
Amended Tax Return X

Authority File
Radio Detection Finding - (USE) Radio
Scope (USE) Telescope
Run-off (USE) Waste
Employment Income (USE) Wages and Salary

Taxonomy + Broader (BT) and Narrower Terms (NT) {BT, NT, USE, UF}
(BT) Radar (by function)
  (NT) aircraft radars
  (NT) airport radar systems
  (NT) Ground Based Radar
  (NT) imaging radar
  (NT) meteorological radar
  (NT) missile site radar
  (NT) search radar
  (NT) terrain analysis radar
(BT) Instruments
  (NT) Accelerometers
  (NT) Acoustic Sensors
  (NT) etc.
  (NT) Telescopes
  (NT) Optical telescopes
  (NT) Radio telescopes
(BT) Substances
  (NT) Chemicals
  (NT) Biological
  (NT) Contaminants
  (NT) Wastes
  (NT) Radiation
  (NT) Commercial Products
(BT) Tax Topics
  (NT) IRS Help
  (NT) IRS Procedures
  (NT) Collection
  (NT) Alternative Filing Methods
  (NT) General Information
  (NT) Which Forms to Use

Thesaurus + Related Terms (RT)
(BT) Radar
  (RT) AN/MPQ-65
  (RT) AN/MPQ-65 Radar set
  (RT) navigation
  (RT) instruments
  (RT) noise (radar)
  (RT) radar scattering
(BT) Radio Telescopes
  (RT) Microwaves (High Energy Radio Telescope)
(BT) Wastes
  (RT) Garbage
  (RT) Refuse
  (RT) Biosolids
  (RT) Pollution Control Facilities
(BT) Itemized Deductions
  (NT) Should I Itemize?
  (ET) Publication 501
  (RT) Tax Topic 551 Publication
  (ET) Exemptions, Standard Deduction, and Filing Information
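The USE, BT/NT and RT relationships in these examples can be held in one small data structure. The sketch below is a hedged illustration, not the tooling any of the agencies use: the Term and Vocabulary classes and their method names are hypothetical, and only a handful of the slide's terms are loaded.

```python
from dataclasses import dataclass, field


@dataclass
class Term:
    label: str
    broader: list = field(default_factory=list)   # BT labels
    narrower: list = field(default_factory=list)  # NT labels
    related: list = field(default_factory=list)   # RT labels


class Vocabulary:
    """Hypothetical container for preferred terms and their relationships."""

    def __init__(self):
        self.terms = {}  # preferred label -> Term
        self.use = {}    # entry term -> preferred label (USE)

    def add_term(self, label):
        return self.terms.setdefault(label, Term(label))

    def add_use(self, entry, preferred):
        # Authority-file level: an entry term points at its preferred term.
        self.add_term(preferred)
        self.use[entry] = preferred

    def add_narrower(self, broader, narrower):
        # Taxonomy level: BT and NT are recorded as a reciprocal pair.
        parent, child = self.add_term(broader), self.add_term(narrower)
        if narrower not in parent.narrower:
            parent.narrower.append(narrower)
        if broader not in child.broader:
            child.broader.append(broader)

    def add_related(self, a, b):
        # Thesaurus level: RT is symmetric, so both sides are recorded.
        first, second = self.add_term(a), self.add_term(b)
        if b not in first.related:
            first.related.append(b)
        if a not in second.related:
            second.related.append(a)

    def resolve(self, label):
        """Follow a USE reference; labels without one are returned unchanged."""
        return self.use.get(label, label)


# A few relationships copied from the examples above.
vocab = Vocabulary()
vocab.add_use("Run-off", "Waste")
vocab.add_use("Employment Income", "Wages and Salary")
vocab.add_narrower("Radar", "imaging radar")
vocab.add_narrower("Radar", "search radar")
vocab.add_related("Radar", "radar scattering")
vocab.add_related("Wastes", "Garbage")
```

Each method corresponds to one step up the complexity ladder: add_use alone gives an authority file, adding add_narrower gives a taxonomy, and adding add_related gives a thesaurus.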

Discussion Topics and General Considerations
1. Sources for Federal Controlled Vocabularies considerations
2. Relate vocabularies across domain considerations
   – Move from levels of concreteness to abstractness
   – Understand similarity between domains and differences between domains
   – Require consistency
3. Your input
Language Universals and Linguistic Typology, Comrie, 1989 (Survey of World languages for comparison and classification)

Resources
Guidelines for the Construction, Format, and Management of Monolingual Controlled Vocabularies
Related Efforts
Federal CV Efforts
Display Types
Automated Example
Ontology Spectrum
Sample Tools

Guidelines ANSI/NISO Z Guidelines for the Construction, Format, and Management of Monolingual Controlled Vocabularies – =7cc9b583cb5a62e8c15d3099e0bb46bbae9cf38a

Related Efforts
Universal Data Element Framework (UDEF): Controlled vocabulary for naming data elements based on ISO/IEC
Digital Enterprise Research Institute (DERI) Data catalog (dcat) vocabulary: RDF Vocabulary for exchange of data catalogs, such as data.gov and data.gov.uk (early draft)
Universal Core (UCORE): Agreed-upon representations for most commonly shared and understood elements.
NIEM IEPD: Agreed-upon exchange for area of shared interest.
etc.

Federal CV Efforts
USAF Vocabulary OneSource
CENDI September 11, 2008 Workshop: New Dimensions in Knowledge Organization Systems
SKOS for the DoD Metadata Taxonomy
Tuesday VoCampDCMay
etc.

Display Types More Types:

Automated Example

Controlled Vocabulary Courtesy of Leo Obrst, Mitre Corporation

Sample Tools