IAEA International Atomic Energy Agency International Nuclear Information System (INIS) CAI, Thesaurus, Subject Categories and Metadata Extraction Tool.

Slides:



Advertisements
Similar presentations
Taxonomy as Content Outline, Site Map and Search Aid SLA NWR Vancouver October 6, 2006 Marjorie M.K. Hlava President
Advertisements

34th Consultative Meeting of INIS Liaison Officers 3 – 5 November 2008 Vienna - Austria INIS User Needs Study (Summary of Japan’s study) Junichi KURAKAMI.
INIS and Fukushima Nuclear Accident Archive Minoru Yonezawa Japan Atomic Energy Agency 37 th Consultative Meeting of INIS Liaison Officers 14 – 15 October.
WorldWideEnergy: A paradigm shift in advancing energy information access Ms. Deborah Cutler International Program Manager Office of Scientific and Technical.
IAEA International Atomic Energy Agency 13th Joint INIS/ETDE Technical Committee Meeting October 2011, Vienna, Austria Dobrica Savić, INIS Unit Head.
IAEA International Atomic Energy Agency International Nuclear Information System (INIS) INIS/ETDE THESAURUS MAINTENANCE & USE OF COMPUTER-ASSISTED INDEXING.
IAEA International Atomic Energy Agency United Nations Library and Information Network for Knowledge Sharing (UN-LINKS) September 2013, Geneva.
International Nuclear Information System (INIS) INIS - 43 Years of International Cooperation in the Field of Nuclear Information Head, INIS Unit NIS Section.
International Atomic Energy Agency The Role of the National INIS Centre and the INIS Secretariat INIS Training Seminar October 2013, Vienna, Austria.
IAEA International Atomic Energy Agency INIS Progress and Activity Report 12th Joint INIS/ETDE Technical Committee Meeting October 2009, Vienna,
IAEA International Atomic Energy Agency INIS Collection Search: Introduction and main features INIS Training Seminar 7-11 October 2013, Vienna Domenico.
IAEA International Atomic Energy Agency ICSTI 2013 Annual Members’ Meeting March 2013.
International Atomic Energy Agency INIS : International Nuclear Information System Yves Turgeon Head, INIS Unit International Atomic Energy Agency.
International Atomic Energy Agency INIS Training Seminar Principles of Information Retrieval and Query Formulation 07 – 11 October 2013 Vienna, Austria.
IAEA International Atomic Energy Agency INIS Progress and Activities Report Highlights of Activities 2006/2007.
International Atomic Energy Agency 1 International Nuclear Information System 35 Years of Successful International Co-operation T.Atieh, C.Krieger-Levine,
0 © WIPO – 2003 PF & CJF CLAIMS Computer-Assisted Categorisation of Patent Documents in the International Patent Classification Patrick Fiévet, CLAIMS.
International Atomic Energy Agency 2-5 Nov th ILO Meeting1 3.1 INIS/ETDE Reference Series 12th INIS/ETDE Joint Technical Committee Meeting
June 2003INIS Training Seminar1 INIS Training Seminar 2-6 June 2003 Subject Scope and Document Selection Alexander Nevyjel Subject Control Unit INIS Section,
IAEA International Atomic Energy Agency Agenda item 3.3 INIS IT developments 13th INIS/ETDE Joint Technical Committee Meeting October 2011, Vienna,
Agenda Item 3.5 ETDE Web 2.0 Activities Debbie Cutler 12th INIS/ETDE Joint Technical Committee INIS.
Highlights of Main Activities in China Hou Huiqun INIS LO for China Director of CINIE 1.
IAEA International Atomic Energy Agency Agenda item 2.6 INIS Collection Search 36 th Consultative Meeting of INIS Liaison Officers 4-5 October 2012, Vienna,
Experience of cooperation of the Ukrainian Center INIS with INIS Secretariat of the IAEA ( ) Zh.I.Pysanko, O.M.Kuprava, A.I.Lyps’ka, L.N.Lamonova.
IAEA International Atomic Energy Agency The 34th Consultative Meeting of the INIS Liaison Officers (2008) Actions and Recommendations Status Report - October.
June Overview of Operations & the INIS Record INIS Training Seminar 2-6 June 2003 Vienna, Austria Seyda RIEDER INIS Section Supervisor, Bibliographic.
International Nuclear Information System (INIS)
Report on UNSD activities since the last meeting of the Expert Group on International Economic and Social Classifications Meeting of the Expert Group on.
International Atomic Energy Agency 1 International Partnerships in Managing INIS and Nuclear Knowledge Anatoli Tolstenkov Head, INIS Unit INIS & Nuclear.
13th INIS/ETDE Joint Technical Committee Meeting IAEA Agenda Item 1.5 ETDE Progress and Activity Report and Database Developments 13th Joint INIS/ETDE.
ICS-FORTH January 11, Thesaurus Mapping Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Bath, UK, January.
35 th Consultative INIS Liaison Officers Meeting Vienna, Austria October 2010 Usage Metrics Subgroup Status Report Debbie Cutler (augmented by Taghrid.
IAEA International Atomic Energy Agency Agenda item 2.2 INIS Collection Search 13th INIS/ETDE Joint Technical Committee Meeting October 2011, Vienna.
IAEA International Atomic Energy Agency Special Characters Implementation Zbigniew Majewski 12th Joint INIS/ETDE Technical Committee Meeting October.
11 th INIS/ETDE Joint Technical Committee Meeting Agenda Item 3.2 Update of common manuals – responsibilities and timeframe Debbie Cutler, ETDE OA 6-8.
Librarians vs. Automation Carolyn Weber Lucio Campanelli Will Hohyon Ryu.
Michigan Educational Assessment Program MEAP. Fall Purpose The Michigan Educational Assessment Program (MEAP) is Michigan’s general assessment.
The UNESCO Thesaurus Meeting for Managers of UNESCO Documentation Networks Meron Ewketu UNESCO Library June
International Atomic Energy Agency 1 Highlights of the 11th INIS – ETDE Joint Technical Committee Meeting The 34 th Consultative Meeting of INIS Liaison.
IAEA International Atomic Energy Agency INIS Training Seminar Review of INIS Activities Dobrica Savić Head of INIS Unit 23 November 2009 Vienna, Austria.
1 11th Joint INIS/ETDE Technical Committee Meeting – Vienna, November 6-11, 2007 CEA/DSM/DPI/STI – C. BRULET Some considerations for future developments.
International Atomic Energy Agency October th ILO Meeting1 2.3 INIS Bibliographic Database Production and Trends Alexander Nevyjel Head, Content.
IAEA International Atomic Energy Agency International Nuclear Information System (INIS) INIS SUBJECT ANALYSIS: Subject Indexing INIS Training Seminar
35th Consultative Meeting of INIS Liaison Officers October 2010, Vienna, Austria INIS input preparation Renate Eder Database Production & Imaging.
IAEA International Atomic Energy Agency International Nuclear Information System (INIS) INIS bibliographic input preparation 13th Joint INIS/ETDE Technical.
Computer Assisted Indexing (CAI) Erika Kancsar October 2015.
Librarians vs. Automation Carolyn Weber Lucio Campanelli Will Hohyon Ryu.
International Atomic Energy Agency INIS Promotion & Outreach INIS Training Seminar 7-11 October 2013, Vienna, Austria Taghrid ATIEH NIS Section Department.
IAEA International Atomic Energy Agency International Nuclear Information System (INIS) INIS SUBJECT ANALYSIS: Introduction and Tools INIS Training Seminar.
International Atomic Energy Agency The Role of the National INIS Centre and the INIS Secretariat INIS Training Seminar November 2011, Vienna, Austria.
Journal Records October 2015 Vienna, Austria Martin Bohatschek INIS Unit International Atomic Energy Agency
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
11 th INIS/ETDE Joint Technical Committee Meeting Agenda Item 5.2 Usage of CAI for ETDE – report of experiences so far and development of ETDE vocabulary.
ABAP Dictionary Introduction Tables in the ABAP Dictionary Performance in Table Access Consistency through Input Check Dependencies of ABAP Dictionary.
June 2003INIS Training Seminar1 INIS Training Seminar 2-6 June 2003 Subject Analysis Thesaurus and Indexing Alexander Nevyjel Subject Control Unit INIS.
International Atomic Energy Agency Oct th ILO Meeting1 2.5 Thesaurus and Subject Categories Alexander Nevyjel Head, Content Management Group.
IAEA International Atomic Energy Agency INIS Collection Search: Introduction and main features The Role of the International Nuclear Information System.
35-th Consultative Meeting of INIS Liaison Officers, October, 2010, Vienna Austria BULGARIAN INIS CENTRE INIS INPUT AND PRODUCTION Ms. Albena Georgieva.
IAEA International Atomic Energy Agency Agenda item 1.6 Highlights of the 13th INIS/ETDE Joint Technical Committee Meeting Actions and Recommendations.
International Atomic Energy Agency Sources of National Literature INIS Training Seminar November 2011, Vienna, Austria Taghrid ATIEH Leader, Capacity.
Controlled Vocabulary & Thesaurus Design Associative Relationships & Thesauri.
IAEA International Atomic Energy Agency 13th Joint INIS/ETDE Technical Committee Meeting October 2011, Vienna, Austria Dobrica Savić, INIS Unit Head.
International Atomic Energy Agency 2-5 Nov th ILO Meeting1 Thesaurus and Subject Categories Alexander Nevyjel 34 th Consultative Meeting of INIS.
The business process models and quality issues at the Hungarian Central Statistical Office (HCSO) Mr. Csaba Ábry, HCSO, Methodological Department Geneva,
Jean-Yves Le Meur - CERN Geneva Switzerland - GL'99 Conference 1.
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Structural and reference metadata in the European Statistical System
Committee of Experts World Intellectual Property Organization
Taxonomies, Lexicons and Organizing Knowledge
CLAIMS CLassification Automated InforMation System
Presentation transcript:

IAEA International Atomic Energy Agency International Nuclear Information System (INIS) CAI, Thesaurus, Subject Categories and Metadata Extraction Tool (MET) 13th Joint INIS/ETDE Technical Committee Meeting October 2011, Vienna, Austria Neviana Rashkova INIS Subject Specialist

IAEA CONTENT 13 th INIS/ETDE Joint Technical Committee Meeting, October  COMPUTER ASSISTED INDEXING – CAI  INIS/ETDE THESAURUS  SUBJECT CATEGORIES  INIS INPUT QUALITY CONTROL - UPDATE in co-operation with L. Iliev, Computer Support Group

IAEA COMPUTER ASSISTED INDEXING – CAI Assists the indexer to choose subject category and descriptors based on the text analysis of abstract and title Offers an opportunity for off-line work – batch indexing Incorporates the latest version of INIS Thesaurus Uses “hidden terms” pointing to a valid Thesaurus term Currently we have: 28 accounts created for Member states 19 countries with access to CAI 6 accounts created for external users This year documents indexed - 55% of the input from: Springer, ELSEVIER, ANS, IOPP, IAEA, MemSt, AIP 13 th INIS/ETDE Joint Technical Committee Meeting, October 20113

IAEA INIS/ETDE THESAURUS Thesaurus is “a controlled and dynamic vocabulary of semantically and generically related terms which covers a specific domain of knowledge“ (part of UNISCO definition) Types of relations for terms: BT (level1,2…10); NT (1,2…10); RT – related term; UF(+) – used for, SF seen for Contains: valid terms 8677 forbidden terms total 13 th INIS/ETDE Joint Technical Committee Meeting, October 20114

IAEA INIS/ETDE THESAURUS Maintaining the INIS/ETDE Thesaurus Regularly updated simultaneously at INIS and ETDE New terms proposed by Member States Terms revised if needed Discussion Group of experts – for new proposals and updates Translations Original - in English Other languages: German, French, Arabic, Russian, Chinese INIS Liaison Officer of the respective countries provide translations with yearly updates for the new terms 13 th INIS/ETDE Joint Technical Committee Meeting, October 20115

IAEA USES OF INIS/ETDE THESAURUS For indexing WinFibre CAI – hidden terms Independent use For retrieval Incorporated in INIS search For independent advanced search For establishing of search strategy As a dictionary 13 th INIS/ETDE Joint Technical Committee Meeting, October 20116

IAEA USES OF INIS/ETDE THESAURUS Other potential applications Retrieval – for navigation search together with subject classification Automation in text analysis – provides multiple level taxonomy Learning tool – give immediate structured information about the terms and their relations 13 th INIS/ETDE Joint Technical Committee Meeting, October BRUCE-1 REACTOR Tiverton, Ontario, Canada. *BT1 candu type reactors *BT1 natural uranium reactors *BT1 phwr type reactors RT bruce site BUBBLE CHAMBERS *BT1 gas track detectors NT1 cryogenic bubble chambers NT1 heavy liquid bubble chambers NT1 ultrasonic bubble chambers RT digitizers

IAEA INIS/ETDE SUBJECT CATEGORIES INIS/ETDE subject categories update Review the existing subject categories to include newer concepts and/or areas of research and development Make the "ETDE only" categories available for INIS Consider the introduction of new categories Four new Subject categories S77 NANOSCIENCE AND NANOTECHNOLOGY S79 ASTROPHYSICS, COSMOLOGY AND ASTRONOMY S96 KNOWLEDGE MANAGEMENT AND PRESERVATION S97 MATHEMATICAL METHODS AND COMPUTING 13 th INIS/ETDE Joint Technical Committee Meeting, October 20118

IAEA INIS/ETDE SUBJECT CATEGORIES ETDE/INIS Joint Reference Series No. 2 (Rev. 1) INIS Scope Descriptions The current categorization scheme contains 49 subject categories, both for INIS and ETDE. The categories have three-character alphanumeric codes The document defines the subject categories and provides the scope descriptions Subject Index is included as an aid to subject classifiers Cross references to other categories are provided where appropriate The tool is provided to Member States to assist in subject indexing 13 th INIS/ETDE Joint Technical Committee Meeting, October 20119

IAEA INIS INPUT QUALITY CONTROL UPDATE INTRODUCTION The general goal of the procedure is to improve the quality of input Identifies documents with errors in input and extracts them for manual check by a specialist Knowledge Base created using a large number of expert decisions made by human indexers - intellectual choices for usage of a specific SC/D combination Implemented in a computer program, currently in use Uses documents from immediately preceding time period At the time of implementation – 75% of identified records were proved to be real errors 13 th INIS/ETDE Joint Technical Committee Meeting, October

IAEA CURRENT PROCEDURE Based on old statistics period documents used Subject categories changed several times new categories added artificially adjusted values to replace the real statistics Thesaurus updated many times new descriptors new concepts 13 th INIS/ETDE Joint Technical Committee Meeting, October

IAEA CURRENT PROCEDURE THE RESULTS FROM THE QA PROCEDURE DO NOT REFLECT THE REAL SITUATION Too many false warnings (~ 50% of all documents) More bad records allowed in production Not relevant any more- no consistent approach for all pairs categories/descriptors THE OLD QA PROCEDURE NEEDS REVISION 13 th INIS/ETDE Joint Technical Committee Meeting, October

IAEA UPDATED PROCEDURE Based on real statistics using the whole INIS database Takes in account all subject categories Takes in account the accumulated experience about specific error usage of category/descriptor combinations Flexible towards changes of descriptors weights UPDATED PROCEDURE IS EXPECTED TO IMPROVE QUALITY AND SAVE TIME 13 th INIS/ETDE Joint Technical Committee Meeting, October

IAEA PRELIMINARY ANALYSES Analysis of the documentation on procedure for category match value (CMV) calculation An Expert System for Quality Control in Bibliographic Databases* Claudio Todeschini International Nuclear information System, international Atomic Energy Agency, Wagramerstrasse 5, A-7400 Vienna, Austria Michael P. Farrell Carbon Dioxide Information Center, Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee U.S.A. *Based on work performed at Oak Ridge National Laboratory, operated for the U.S. Department of Energy under Contract No. DE-ACOS- 840R21400 with Martin Marietta Energy Systems, Inc. Work was partially supported by the Carbon Dioxide Research Division, U.S. Department of Energy. Analysis of the program for quality control and testing the formula Criteria for category/descriptor combination 13 th INIS/ETDE Joint Technical Committee Meeting, October

IAEA WORK DONE Conversion of all existing categories to the currently used set of categories Calculation of frequencies – table category/descriptor Comparison between two statistics new/all SC Decision about which period to use for the statistics Adjustment to avoid expected errors Identification of known combinations giving nearly100% errors Creating a table for “bad” combinations - assigned different weight (to reach very low CMV) Possibility to manually change weights 13 th INIS/ETDE Joint Technical Committee Meeting, October

IAEA FINE TUNNING EXPECTED ERRORS – examples: Material Science GROWTH - CRYSTAL GROWTH Plasma physics IGNITION – THERMONUCLEAR IGNITION Physics of Elementary Particles and Fields PRODUCTION – PARTICLE PRODUCTION COLOR, FLAVOR, HOLOGRAPHY, TRANSPORT, CAVITIES,…etc. 17 descriptors in 18 subject categories have been adjusted 13 th INIS/ETDE Joint Technical Committee Meeting, October

IAEA TOOLS DEVELOPED Tools were developed to perform the steps: Scanning the records from the Reference DB to make full statistics for the subject category-descriptor pairs Report to show difference between table and the one to replace it A table for manual “tuning” some pairs. Unfinished report to show the effect of changing the table on raw (unprocessed) and processed records 13 th INIS/ETDE Joint Technical Committee Meeting October

IAEA COMPARISON WITH IRPS (processed records) 13 th INIS/ETDE Joint Technical Committee Meeting, October

IAEA COMPARISON WITH IRPS (unprocessed records) 13 th INIS/ETDE Joint Technical Committee Meeting, October

IAEA TRESHOLD DETERMINATION 13 th INIS/ETDE Joint Technical Committee Meeting, October

IAEA TRESHOLD DETERMINATION 13 th INIS/ETDE Joint Technical Committee Meeting, October

IAEA TRESHOLD DETERMINATION 13 th INIS/ETDE Joint Technical Committee Meeting, October

IAEA TRESHOLD DETERMINATION 13 th INIS/ETDE Joint Technical Committee Meeting, October

IAEA DISCUSSION First analyses suggest a natural threshold value CMV ∈ (1,2) Analysis of the number of documents to be scanned for different threshold CMV is necessary Tests to assess errors if choose the threshold value in the different intervals are necessary Further testing over different sets of records is required before implementation Possibility for integration in WinFibre 13 th INIS/ETDE Joint Technical Committee Meeting, October

IAEA 13 th INIS/ETDE Joint Technical Committee Meeting, October Thank you!