Charlyn P. Salcedo Instructor Types of Indexing Languages.

Slides:



Advertisements
Similar presentations
Taxonomy as Content Outline, Site Map and Search Aid SLA NWR Vancouver October 6, 2006 Marjorie M.K. Hlava President
Advertisements

1 Federica Paradisi Italian National Bibliography Classification and Indexing Division National Central Library of Florence (Italy) Linking DDC numbers.
Toward an International Sharing and Use of Subject Authority Data
THE STEPS OF SEARCH You have opened a new veterinary clinic in a small town, and want people in the vicinity to know about it. You need some new ideas.
Subject Analysis: An Introduction Based on BASIC SUBJECT CATALOGING USING LCSH edited by Lori Robare.
Advanced Information Systems Laboratory Department of Computer Science and Systems Engineering GI-DAYS MÜNSTER A software tool.
Controlling values The equivalence relationship. The vocabulary problem What is this?
IAEA International Atomic Energy Agency International Nuclear Information System (INIS) INIS/ETDE THESAURUS MAINTENANCE & USE OF COMPUTER-ASSISTED INDEXING.
6. Applying metadata standards: Controlled vocabularies and quality issues Metadata Standards and Applications Workshop.
Leveraging Your Taxonomy to Increase User Productivity MAIQuery and TM Navtree.
Text Operations: Preprocessing. Introduction Document preprocessing –to improve the precision of documents retrieved –lexical analysis, stopwords elimination,
Search Strategies Online Search Techniques. Universal Search Techniques Precision- getting results that are relevant, “on topic.” Recall- getting all.
Subject Access in the Digital Age Presented by Carol Bradsher.
WMES3103 : INFORMATION RETRIEVAL
Module 6a: Intro to Controlled Vocabularies, Taxonomies and Classification IMT530: Organization of Information Resources Winter 2007 Michael Crandall.
WISER: History Advanced OLIS searches Isabel Holowaty, History Librarian Kate Petherbridge, Upper Camera Superintendent.
Thesaurus Design and Development
Module 7b: Extracting/Controlling Terms and Semantic Relationships IMT530: Organization of Information Resources Winter 2007 Michael Crandall.
1 Vocabulary & languages in indexing & searching Connection: indexing searching
A Registry for controlled vocabularies at the Library of Congress
1 Languages for aboutness n Indexing languages: –Terminological tools Thesauri (CV – controlled vocabulary) Subject headings lists (CV) Authority files.
International Atomic Energy Agency INIS : International Nuclear Information System Yves Turgeon Head, INIS Unit International Atomic Energy Agency.
Sunday May 4 – 5 PM Bradford, Hlava, McNaughton
Vocabulary & languages in searching
EuroVoc, Eurlex, EU Bookshop Danica Maleková, Publications Office STS Bratislava, 22 October 2010.
Taxonomies: Hidden but Critical Tools Marjorie M.K. Hlava President Access Innovations, Inc.
Languages are bridges … not barriers Chiara Carlucci – CEDEFOP Library ReferNet Technical Meeting September 2009.
ODINCINDIO Marine Information Management Training Course February 2006 Organizing the collection Murari P Tapaswi National Institute of Oceanography,
Indexing Knowledge Daniel Vasicek 2014 March 27 Introduction Basic topic is : All Human Knowledge Who Cares? Simple Examples.
LIS510 lecture 9 Thomas Krichel Organization of information Libraries organize information. Otherwise nothing that is an library could ever.
1 Catalog Displays, Retrieval, and FAST May 31, 2005.
Vocabularies in the VO Alasdair J G Gray Norman Gray Iadh Ounis.
D4: SKOS and HIVE—Enhancing the Creation, Design and Flow of Information Speakers: Hollie White Jane Greenberg Coordinator: Alan Keely.
ICS-FORTH January 11, Thesaurus Mapping Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Bath, UK, January.
Are LCSH still effective? Why not use keyword searching instead? Presented by Carol Bradsher October 29, 2004.
 Publications that appear regularly within certain intervals of time.  Publications that are published continuously within a regular time frame (daily,
Keyword vs. Controlled Vocabulary Searching 12 Basic Skills for IQ.
1 Discussion Class 9 Thesaurus Construction. 2 Discussion Classes Format: Question Ask a member of the class to answer Provide opportunity for others.
Conceptual Maps and Thesauri : A Comparison of Two Models of Representation Arising from Different Disciplinary Traditions Lalthoum Saàdani and Suzanne.
Current Events and Issues Using Index Databases for Finding Answers.
Librarians vs. Automation Carolyn Weber Lucio Campanelli Will Hohyon Ryu.
The UNESCO Thesaurus Meeting for Managers of UNESCO Documentation Networks Meron Ewketu UNESCO Library June
PREMIS Controlled vocabularies Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San.
MeSH The Medical Subject Headings from the National Library of Medicine.
1 Controlled Vocabularies Paul Miller Interoperability Focus UKOLN U KOLN is funded by Resource: the Council.
INFO Week 8 Subject Indexing & Knowledge Representation Dr. Xia Lin Assistant Professor College of Information Science and Technology Drexel University.
Thesauri usage in information retrieval systems: example of LISTA and ERIC database thesaurus Kristina Feldvari Departmant of Information Sciences, Faculty.
Subject Analysis and Vocabulary Control Spring 2006, 6 March Bharat Mehra IS 520 (Organization and Representation of Information) School of Information.
Subject Headings for Reference Everything You Need to Know About Subject Headings in One Easy Lesson By Dr. Nancy J. Becker Presented by Dr. Kevin Rioux.
June 2003INIS Training Seminar1 INIS Training Seminar 2-6 June 2003 Subject Analysis Thesaurus and Indexing Alexander Nevyjel Subject Control Unit INIS.
IMT530- Organization of Information Resources1 Feedback Lectures –More practical examples –Like guest lecturers –Generally helpful in understanding concepts.
Controlled Vocabulary & Thesaurus Design Associative Relationships & Thesauri.
Controlled Vocabulary & Thesaurus Design Types of Controlled Vocabularies.
ORGANIZATION OF ELEMENTS OF INFORMATION The Thesaurus.
Subject Access to Your Information Sandy Tucker Texas A&M University Libraries August 1, 2006 Second International Symposium on Transportation Technology.
ENVIRONMENTAL MULTILINGUAL THESAURUS Environmental Thesaurus/Terminology Workshop UN Environment Programme Regional Office of Europe International Environment.
분류론 대학원 CLASSIFICATION AS A SEARCH TOOL Do you have anything on ‘stegosaurus’? stegosaurus dinosaurs prehistoric animals, prehistoric.
Slide 6 HMD1SPI376 - Slide 6. What is the Relationship Between BT and NT?  Normally, BT and NT are "inverse" links. In other words, if X is a broader.
Ontologies COMP6028 Semantic Web Technologies Dr Nicholas Gibbins
Controlling values for information organization 384C – Organizing Information Spring 2016 Karen Wickett School of Information University of Texas at Austin.
1 How do we describe something? n What something is about? –What the content of an object is “about”? n Different methods (Wilson, 1968) –counting terms.
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
Pre-coordinate vs Post-coordinate Subject Access: Pros and Cons and a Real Life Experience… Peter Fletcher, University of California, Los Angeles Diane.
Subject Analysis: An Introduction
Subject Headings for Reference
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Information Organization
COMP6215 Semantic Web Technologies
Indexing CHARLYN P. SALCEDO, RL.
THESAURUS CONSTRUCTION: GROUND WATER
Presentation transcript:

Charlyn P. Salcedo Instructor Types of Indexing Languages

1. Natural language (derived-term system) ‏ Characteristics are: Improves recall because it provides more access points but reduces precision Redundancy is greater Uses more current terms Tends to be favored by subject-specialists or the end-users May also be called indexing by extraction (or extractive indexing method).

2. Controlled vocabulary (assigned-term system) ‏ Functions: To control synonyms by choosing one form as the standard term To make distinctions among homographs To bring or link together terms that are closely related Establishes the size of scope of a term Usually records hierarchical and affinitive/associative relations Controls variant spellings

Syndetic devices used by a controlled vocabulary: USE and UF (use for) for synonyms BT (broader term), NT (narrower term) and RT (related term) for differing levels of specificity and certain near synonyms and antonyms

Advantages of Controlled Vocabulary Language Increases the probability that both indexer and searcher will express a particular concept in the same way. Increases the probability that the same term will be used by different indexers or by the same indexer at different times. Helps searchers to focus their thoughts when they approach the information system without a full and precise realization of what information they need.

Disadvantages of Controlled Vocabulary Language : Incompatibility of different indexing languages. High input cost. The possibility of inadequate vocabulary.

Authority List / Subject Authority List a related group of words or phrase adopted by a particular group of people. Examples: Library of Congress Subject Headings Sears List of Subject Headings Dewey Decimal Classification Types of Controlled Vocabulary

Thesaurus Latin word means ‘treasure’, & is used to control indexing vocabulary It is a set of terms structured using a small set of semantic relationships between the term/ concepts. Poly-hierachical Examples: The Art & Architecture Thesaurus* ERIC (Education Resouces Information Center) Thesaurus*

Similarities between Authority Lists and Thesauri Both attempts to provide subject access to information resources by providing terminology that can be consistent rather than uncontrolled and unpredictable. Both choose preferred terms and make references from non-used terms. Both provide hierarchies so that terms are presented in relation to their broader, narrower, and related terms.

Difference between Authority Lists and Thesauri Thesauri are made up of single terms and bound terms representing single concepts. Subject heading lists have phrases and other pre-coordinated terms in addition to single terms. Thesauri are more strictly hierarchical. Thesauri are narrow in scope. Thesauri are more likely multilingual.

Relationships of Terms INTELLIGENCE BT: Ability NT: Comprehension RT: Talent Aptitude Broader term (BT) reference shows hierarchical relationship upward in the classification tree. Narrower term (NT) reference is similar to the broader term reference, except it goes down in the classification tree. Related term (RT) reference refers to a descriptor that can be used in addition to the basic term but is not in a hierarchical relationship.

Use for (UF) reference deals primarily with synonymous or variant forms of the preferred descriptor. It is also used to lead the indexer to more general terms. TREES UF Pecan trees PROMOTION POLICIES UF Automatic promotion

Use reference refers to a preferred descriptor from a non-usable term. Examples: Pecan trees USE TREES Oak trees USE TREES

Scope Note (SN) is used to give the users about the descriptor’s usage restrictions or to clarify ambiguity. Example: CULTURAL BACKGROUND SN: The total social heritage and experience of an individual or group including institutions, folkways, literature, mores, and communal experience.

Construction of a Thesaurus 1. Identify the subject field. 2. Identify the nature of literature to be indexed. 3. Identify the users. 4. Identify the file structure. Will this be a pre- coordinate or post-coordinate system? 5. Consult published indexes, glossaries, dictionaries, and other tools in the subject areas for the raw vocabulary. 6. Cluster the terms. 7. Establish term relationships.