Developing a Bilingual Thesaurus of Information Studies.

Defining requirements, evaluating applications, and testing functionality

Defining requirements

Basically
– add terms, definitions, sources
– bilingual
– link equivalencies
– multiple terms and definitions
– preferred and variant terms

But also
– online
– searchable
– multiple contributors
– free/open source
– not just A-Z
– contextualized, related to concepts
– faceted, hierarchy, ontology
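As a rough illustration of how these requirements could map onto a data model, here is a minimal Python sketch. It is not taken from the presentation or from any of the evaluated applications: the class names `Term` and `Concept`, their fields, and the French label "terme candidat" are all assumptions made for the example. The English pair "candidate term" / "provisional term" comes from the course-materials glossary.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Term:
    """One label for a concept, in one language (hypothetical structure)."""
    label: str
    language: str            # "en" or "fr"
    preferred: bool = False  # False means variant (non-preferred) term
    definition: str = ""
    source: str = ""


@dataclass
class Concept:
    """A concept carrying all of its terms, in both languages."""
    concept_id: str
    terms: List[Term] = field(default_factory=list)
    broader: List[str] = field(default_factory=list)  # ids of parent concepts

    def preferred_term(self, language: str) -> Optional[Term]:
        # Return the first preferred term in the given language, if any.
        for term in self.terms:
            if term.language == language and term.preferred:
                return term
        return None

    def variants(self, language: str) -> List[str]:
        # Labels of all non-preferred terms in the given language.
        return [t.label for t in self.terms
                if t.language == language and not t.preferred]


# Example entry; the French label is illustrative only.
candidate = Concept(
    concept_id="c001",
    terms=[
        Term("candidate term", "en", preferred=True,
             definition="A term under consideration for admission "
                        "into a controlled vocabulary."),
        Term("provisional term", "en"),
        Term("terme candidat", "fr", preferred=True),
    ],
)
```

Attaching every term to a shared concept, rather than pairing English and French words directly, is one way to keep multiple terms, definitions, and equivalencies linked without forcing a one-to-one match between the languages.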

Evaluating applications

Round one
1. TermWiki
2. OmegaT
3. Transiterm
4. ForeignDesk
5. Google Translation Tool
6. Wikispaces
7. Document in Google Drive
8. Blackboard
9. Database-driven website

Glossary software – not versatile enough
Wiki – too unstructured
Relational database – too much work
Thesaurus software – only one open source option, others too costly

Round two
10. a.k.a.
11. Thesaurus Builder
12. Thesaurus Master
13. Synaptica
14. Pool Party
15. MultiTes
16. TemaTres

Testing functionality

Strengths
– easy to add terms and notes
– can subordinate and link terms
– can create candidate terms
– can add internal notes
– online searchable
– supports images
– specify complete or partial match
– handle complex equivalences

Quirks
– two types of account
– two types of multilingual thesaurus

Limitations
– unusual interface
– not much to look at
– lack of documentation
– French skin poorly translated
– separate accounts for each vocabulary
– no backup maintained

Testing functionality

one-to-one
– Anglais (Langue)--Manuels pour allophones
– CSH: English language--Textbooks for second language learners

one-to-many
– Couverture en patchwork
– LCSH: Quilts
– LCSH: Afghans (Coverlets)

many-to-one
– Canadiennes françaises
– Canadiens d'origine français
– Canadiens français
– CSH: Canadians, French-speaking

Exact equivalence: Eye (MeSH) = Eye (LCSH)

Notes required to specify
– Lexical variant: Bicuspid (MeSH) = Bicuspids (LCSH)
– Syntactic variant: Liver Regeneration (MeSH) = Liver--Regeneration (LCSH)
– Single-to-multiple equivalence: Neoplasms (MeSH) = Cancer and Tumors (LCSH)
– No equivalent: Night Blindness (MeSH) = ? (LCSH)
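The one-to-one, one-to-many, and many-to-one cases above can be told apart mechanically once the equivalences are stored as pairs. The following Python sketch shows one way to do that. It is an illustration under stated assumptions, not a feature of any application tested: the function name `classify_mappings` and the pair representation are hypothetical, and the "--" subdivision separator in the headings is assumed. The headings themselves are the ones from the slide.

```python
from collections import defaultdict


def classify_mappings(pairs):
    """Label each (source, target) pair by the cardinality of its mapping."""
    targets_of = defaultdict(set)  # source heading -> set of target headings
    sources_of = defaultdict(set)  # target heading -> set of source headings
    for source, target in pairs:
        targets_of[source].add(target)
        sources_of[target].add(source)

    labels = {}
    for source, target in pairs:
        many_targets = len(targets_of[source]) > 1
        many_sources = len(sources_of[target]) > 1
        if many_targets and many_sources:
            labels[(source, target)] = "many-to-many"
        elif many_targets:
            labels[(source, target)] = "one-to-many"
        elif many_sources:
            labels[(source, target)] = "many-to-one"
        else:
            labels[(source, target)] = "one-to-one"
    return labels


# French headings paired with their English equivalents from the slide.
pairs = [
    ("Anglais (Langue)--Manuels pour allophones",
     "English language--Textbooks for second language learners"),
    ("Couverture en patchwork", "Quilts"),
    ("Couverture en patchwork", "Afghans (Coverlets)"),
    ("Canadiennes françaises", "Canadians, French-speaking"),
    ("Canadiens d'origine français", "Canadians, French-speaking"),
    ("Canadiens français", "Canadians, French-speaking"),
]
labels = classify_mappings(pairs)
```

A check like this only detects cardinality. Distinctions such as lexical versus syntactic variant still depend on the notes a contributor attaches to each link.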

Culling concepts from source documents

Course descriptions

ISI6310 ETHICS, VALUES AND INFORMATION DILEMMAS (3cr.)
An exploration of major ethical concerns currently confronting our information society. The course examines the moral and ethical values involved in information and technology-related situations faced by today's information professionals and agencies, and provides an opportunity to apply ethical theories to situations involving issues such as freedom of expression, censorship, intellectual property rights, equitable access, and privacy.

ISI6710 ÉTHIQUE, VALEURS ET DILEMMES RELATIFS À L'INFORMATION (3cr.)
Exploration des principales questions d'éthique qui se posent à notre société de l'information. Le cours examine les valeurs morales et éthiques qui entrent en jeu pour les professionnels de l'information et les agences d'information lorsqu'ils font face à certaines situations ayant trait à la technologie et au domaine de l'information. Il offre la possibilité d'appliquer les théories de l'éthique à des contextes touchant divers enjeux, comme la liberté d'expression, la censure, les droits de propriété intellectuelle, l'accès équitable à l'information et la vie privée.

Core competencies

2. Information Resources
2A. Concepts and issues related to the lifecycle of recorded knowledge and information, from creation through various stages of use to disposition.
2B. Concepts, issues, and methods related to the acquisition and disposition of resources, including evaluation, selection, purchasing, processing, storing, and deselection.
2C. Concepts, issues, and methods related to the management of various collections.
2D. Concepts, issues, and methods related to the maintenance of collections, including preservation and conservation.

Research agenda

Application (intended / potential)
– Systems design
– Process design
– Facilities design
– Learning
– Social marketing
– Political mobilization
– Citizen engagement

Perspective (lens / orientation / discipline)
– Social/Cultural/Linguistic
– Technological
– Administrative
– Pedagogical
– Cognitive science

Output
– Statistics
– Profile
– Best practice
– Template
– Methodology
– Toolbox
– Schema
– Model
– Software

Methodology
– Observation (behaviour, etc.)
– Inquiry (motive, opinion, etc.)
– Content analysis
– Vector analysis
– Transaction analysis
– Process analysis
– Computer simulation

Phenomenon
– Information object
– Information agent
– Information process
– Information system
– Information facility

Course materials

candidate term
A term under consideration for admission into a controlled vocabulary because of its potential usefulness. Also known as provisional term.

category
A grouping of terms that are semantically or statistically associated, but which do not constitute a strict hierarchy based on genus/species, parent/child, or part/whole relationships. See also tree structure.

classification scheme
A method of organization according to a set of pre-established principles, usually characterized by a notation system and a hierarchical structure of relationships among the entities.

controlled vocabulary
A list of terms that have been enumerated explicitly. This list is controlled by and is available from a controlled vocabulary registration authority. All terms in a controlled vocabulary must have an unambiguous, non-redundant definition.

Source: National Information Standards Organization. (2005). Guidelines for the construction, format, and management of monolingual controlled vocabularies (ANSI/NISO Z ).

Constructing the thesaurus backbone

Using the thesaurus in the ÉSIS program

Course assignments
– adding terms, definitions, English/French equivalencies
– designating preferred and non-preferred terms
– critiquing/modifying structure (facets/hierarchies)
– indexing documents using terms from the thesaurus

Faculty research
– adding terms and definitions from research projects

Curriculum review/development
– analyzing program scope and coverage of concepts
– balancing coverage of English and French sources