International Atomic Energy Agency 2-5 Nov th ILO Meeting1 Workshop on Computer-assisted Indexing Alexander Nevyjel 34 th Consultative Meeting of INIS Liaison Officers 2-5 November 2008, Vienna, Austria
International Atomic Energy Agency 2-5 Nov th ILO Meeting2Agenda Review CAI procedures (workflow, formats, conventions) Thesaurus extension: Hidden terms tables Problems and how to overcome Discussion and exchange of experiences Hands-on training by INIS Subject Specialists (in their offices, open end for this afternoon) Tips, tricks, recommendations
International Atomic Energy Agency 2-5 Nov th ILO Meeting3 Objectives of Computer-assisted Indexing Maintaining database quality Saving of subject analysis manpower Improving indexing consistency
International Atomic Energy Agency 2-5 Nov th ILO Meeting4 CAI-Workflow Interactive CAI Processing Batch Mode Conventional Processing
International Atomic Energy Agency 2-5 Nov th ILO Meeting5 CAI Batch and Online Processing Input:MemSt-CC-yymmdd-xxxxxxxxxxx Output:_MemSt-CC-yymmdd-xxxxxxxxxxx MemSt is a standard prefix (meaning “member state”) CC is the country code yymmdd is the date when the file was generated xxxxxxxxxxx is any additional identification Examples MemSt-AR thisismytestfile MemSt-FR fileidentification
International Atomic Energy Agency 2-5 Nov th ILO Meeting6 CAI Batch Processing Output:_MemSt-CC-yymmdd-xxxxxxxxxxx These files will carry the CAI suggested descriptors in tag 800, preceded by the string ##CAI suggestions##; Example: 800^##CAI suggestions##; DESCRIPTOR1; DESCRIPTOR2; DESCRIPTOR3; ……. sent back to the member state for reviewing
International Atomic Energy Agency 2-5 Nov th ILO Meeting7 CAI Online File loaded to CAI online All files of a Member State appear on the queue page as batch MemSt-XX Please open only your own batch, do not touch other queues Files in a queue will be opened one after the other, in the sequence as they have been loaded
International Atomic Energy Agency 2-5 Nov th ILO Meeting8 CAI Batch Processing Reviewing Process Delete all suggested descriptors which are too general Add relevant descriptors which were not found numerical values, e.g. pressure ranges, temperature ranges, etc nuclear reactions chemical compounds, alloys, etc. CAI is cleaning up BT/NTs clean up BT/NTs from manual additions Clean up suggestions from homographic terms Delete “##CAI suggestions## “ Submit file to “INIS Input Box”
International Atomic Energy Agency 2-5 Nov th ILO Meeting9 CAI Online Reviewing Process Delete all suggested descriptors which are too general Add relevant descriptors which were not found numerical values, e.g. pressure ranges, temperature ranges, etc nuclear reactions chemical compounds, alloys, etc. CAI is cleaning up BT/NTs will give warnings for BT/NTs from manual additions Clean up suggestions from homographic terms Export file when finished File will be exported to INIS Production System (or send back to MS for reviewing if requested)
International Atomic Energy Agency 2-5 Nov th ILO Meeting10 CAI Thesaurus extension “Hidden terms” are character patterns representing the different appearances of a concept in the free text, which is indexed by one or more descriptors. handled similar to “forbidden terms” with one or more USE relations CAI internal only not exported to INIS production system not exported to FIBRE not printed in any appearance of the thesaurus support identification of descriptors in the free text
International Atomic Energy Agency 2-5 Nov th ILO Meeting11 Hidden Terms: Compounds Descriptorhidden termfree text MAGNESIUM BORIDESMgB_2MgB 2 MAGNESIUM CARBONATESMgCO_3MgCO 3 MAGNESIUM HYDRIDESMgH_2MgH 2 MAGNESIUM HYDROXIDESMg(OH)_2Mg(OH) 2 IRON BROMIDESiron dibromide IRON BROMIDESiron tribromide ARSENIC IONSAs"3"-As 3- ACETYLENEC_2H_2C 2 H 2 ACETALDEHYDEC_2H_4OC 2 H 4 O ACETIC ACIDC_2H_4O_2C 2 H 4 O 2 approx hidden terms (expected 3000)
International Atomic Energy Agency 2-5 Nov th ILO Meeting12 Hidden Terms: Isotopes Descriptorhidden termfree text CESIUM 137Cesium 137, Cesium-137 "1"3"7cs 137 Cs 137 caesium137 Caesium, 137-Caesium caesium 137Caesium 137, Caesium cesium137 Cesium, 137-Cesium 137 cs137 Cs, 137-Cs 137cs137Cs cs 137Cs 137, Cs-137 cs"1"3"7Cs 137 cs137Cs137 CESIUM 138"1"3"8"mcs 138m Cs cs"1"3"8"mCs 138m approx hidden terms
International Atomic Energy Agency 2-5 Nov th ILO Meeting13 Hidden Terms: Elementary Particles Descriptorhidden termfree text B QUARKSbottom quarks T QUARKStop quarks ELECTRON NEUTRINOS#nu#_eν e MUON NEUTRINOS#nu#_#mu#ν μ TAU NEUTRINOS#nu#_#tau#ν τ RHO-770 MESONS#rho#(770)ρ(770) RHO-770 MESONS#rho#-770ρ-770 OMEGA-782 MESONS#omega#(782)ω(782) OMEGA-782 MESONS#omega#-782ω-782 KAONS NEUTRALK"0K 0 KAONS NEUTRAL SHORT-LIVEDK"0_SK 0 S KAONS NEUTRAL LONG-LIVEDK"0_LK 0 L approx. 300 hidden terms
International Atomic Energy Agency 2-5 Nov th ILO Meeting14 Hidden Terms: UK/US Spellings Descriptorhidden term A CENTERSa centres ACTIVITY METERSactivity metres ANALOG COMPUTERSanalogue computers ANALOG SYSTEMSanalogue systems ANESTHESIAanaesthesia ARCHAEOLOGYarcheology AUSTRIAN ORGANIZATIONSaustrian organisations BALLISTIC MISSILE DEFENSEballistic missile defence BAYARD-ALPERT GAGESbayard-alpert gauges BEAM ANALYZERSbeam analysers BEHAVIORbehaviour CATALOGScatalogues approx. 800 hidden terms
International Atomic Energy Agency 2-5 Nov th ILO Meeting15 Hidden Terms: Diacritics and Countries Descriptorhidden term Diacritics: BAECKLUND TRANSFORMATIONbacklund transformation BRUECKNER METHODbruckner method BRUECKNER MODELbruckner model BRUNSBUETTEL REACTORbrunsbuttel reactor MOESSBAUER EFFECTmossbauer effect Country Names: CAMBODIAkampuchea COTE D'IVOIREivory coast GREECEhellas MYANMARburma SYRIAsyrian arab republic THAILANDsiam approx. 250 hidden terms
International Atomic Energy Agency 2-5 Nov th ILO Meeting16 Hidden Terms: Other Spellings Descriptorhidden term Singular/Plural FUNGIfungus FUNGIfunguses G MATRIXg matrices G MATRIXg matrixes Reverse Sequence ATOM-MOLECULE COLLISIONSmolecule-atom collisions ATOM-MOLECULE COLLISIONSatom-molecule scattering ATOM-MOLECULE COLLISIONSmolecule-atom scattering ATOM-MOLECULE COLLISIONSatom-molecule reactions ATOM-MOLECULE COLLISIONSmolecule-atom reactions ATOM-MOLECULE COLLISIONSatom-molecule interactions ATOM-MOLECULE COLLISIONSmolecule-atom interactions approx. 900 hidden terms
International Atomic Energy Agency 2-5 Nov th ILO Meeting17 Hidden Terms: Other Spellings Descriptorhidden term Grammatical Variations PERIODICITYperiodic PERIODICITYperiodical PERIODICITYperiodically Phrases versus compound terms RADIOWAVE RADIATIONradio wave SPACE-TIMEspacetime WAVE FUNCTIONSwavefunction Terminology GAMMA SPECTROMETERS#gamma#ray spectrometer GAMMA SPECTROMETERS#gamma#-ray spectrometer GAMMA SPECTROMETERSgammaray spectrometer GAMMA SPECTROMETERSgamma-ray spectrometer
International Atomic Energy Agency 2-5 Nov th ILO Meeting18 Hidden Terms: Other Spellings Descriptorhidden term Terminology SU-2 GROUPSsu(2) theory SU-2 GROUPSsu(2) symmetry SU-3 GROUPSsu(3) theory SU-3 GROUPSsu(3) symmetry Abbreviations CARBON DIOXIDE LASERSCO_2 laser CARBON DIOXIDE LASERSCO2 laser KOBAYASHI-MASKAWA MATRIXCKM matrix KORTEWEG-DE VRIES EQUATIONkdv equation Numerical Values KEV RANGEkev MEV RANGEmev GEV RANGEgev
International Atomic Energy Agency 2-5 Nov th ILO Meeting19 CAI Thesaurus Extension Thesaurus Valid Descriptors Forbidden Terms CAI Hidden Terms Total Terminological Knowledge Base
International Atomic Energy Agency 2-5 Nov th ILO Meeting20 Terms which need special attention Numerical values, ranges ENERGY RANGES MEV RANGE MEV RANGE MEV RANGE MEV RANGE PESSURE RANGES Recognize pressure ranges Translate from atm, bar, torr to Pascal TEMPERATURE RANGES Recognize temperature ranges Translate from Celsius, Fahrenheit to Kelvin Attention: the forbidden term (since 1992) high temperature USE TEMPERATURE RANGE K is leading often to wrong results
International Atomic Energy Agency 2-5 Nov th ILO Meeting21 Terms which need special attention Multi-meaning “+” and “-“ signs K + KAONS PLUS, KAONS MINUS, POTASSIUM IONS Case sensitivity TiN TIN (instead of TITANIUM NITRIDES) …this can be … CaN CALCIUM NITRIDES gas GALLIUM SULFIDES “…who is the …” WHO (World Health Organization) Verbs versus Nouns “… this leads us to …” LEAD “… this leaves it ….” LEAVES
International Atomic Energy Agency 2-5 Nov th ILO Meeting22 Terms which need special attention Multi-meaning MPA MAXIMUM PERMISSIBLE ACTIVITY Mega Pascal (MPa) GDP GROSS DOMESTIC PRODUCT GADOLINIUM PHOSPHIDES (GdP) COBRA SNAKES COBRA REACTOR KBR-1 REACTOR … in isotopes….. INDIUM ISOTOPES …at 195 deg K… ASTATINE 195
International Atomic Energy Agency 2-5 Nov th ILO Meeting23 Terms which need special attention Homographic terms Solutions SOLUTIONS or MATHEMATICAL SOLUTIONS Color COLOR, COLOR CENTRES, COLOR MODEL Flavor FLAVOR, FLAVOR MODELS Tunnel TUNNELS, TUNNELING, TUNNEL EFFECT Nuclear Reactions, e.g. 14 N(γ,α) 10 B Targets Beams Reactions
International Atomic Energy Agency 2-5 Nov th ILO Meeting24 Terms which need special attention Terms which are often wrong Production BEAM PRODUCTION HEAT PRODUCTION HYDROGEN PRODUCTION ISOTOPE PRODUCTION PARTICLE PRODUCTION PLASMA PRODUCTION PRODUCTION Transport AIR TRANSPORT ATOM TRANSPORT BEAM TRANSPORT CHARGED-PARTICLE TRANSPORT ENVIRONMENTAL TRANSPORT PHOTON TRANSPORT RADIOACTIVITY TRANSPORT TRANSPORT Decay NUCLEAR DECAY ALPHA DECAY BETA DECAY ……. PARTICLE DECAY ELECTROMAGNETIC… HADRONIC… RADIATIVE… WEAK…
International Atomic Energy Agency 2-5 Nov th ILO Meeting25 CAI Hands-on training by Subject Specialists PhysicsMarija Sejmenova-Gichevska A2477 ChemistryChristine Krieger-LevineA2478 ReactorsNeviana RashkovaA2479 Live ScienceBekele NegeriA2480