Download presentation
Presentation is loading. Please wait.
Published byCaren Baker Modified over 8 years ago
1
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting1 Workshop on Computer-assisted Indexing Alexander Nevyjel 34 th Consultative Meeting of INIS Liaison Officers 2-5 November 2008, Vienna, Austria
2
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting2Agenda Review CAI procedures (workflow, formats, conventions) Thesaurus extension: Hidden terms tables Problems and how to overcome Discussion and exchange of experiences Hands-on training by INIS Subject Specialists (in their offices, open end for this afternoon) Tips, tricks, recommendations
3
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting3 Objectives of Computer-assisted Indexing Maintaining database quality Saving of subject analysis manpower Improving indexing consistency
4
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting4 CAI-Workflow Interactive CAI Processing Batch Mode Conventional Processing
5
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting5 CAI Batch and Online Processing Input:MemSt-CC-yymmdd-xxxxxxxxxxx Output:_MemSt-CC-yymmdd-xxxxxxxxxxx MemSt is a standard prefix (meaning “member state”) CC is the country code yymmdd is the date when the file was generated xxxxxxxxxxx is any additional identification Examples MemSt-AR-041203-thisismytestfile MemSt-FR-041212-fileidentification
6
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting6 CAI Batch Processing Output:_MemSt-CC-yymmdd-xxxxxxxxxxx These files will carry the CAI suggested descriptors in tag 800, preceded by the string ##CAI suggestions##; Example: 800^##CAI suggestions##; DESCRIPTOR1; DESCRIPTOR2; DESCRIPTOR3; ……. sent back to the member state for reviewing
7
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting7 CAI Online File loaded to CAI online All files of a Member State appear on the queue page as batch MemSt-XX Please open only your own batch, do not touch other queues Files in a queue will be opened one after the other, in the sequence as they have been loaded
8
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting8 CAI Batch Processing Reviewing Process Delete all suggested descriptors which are too general Add relevant descriptors which were not found numerical values, e.g. pressure ranges, temperature ranges, etc nuclear reactions chemical compounds, alloys, etc. CAI is cleaning up BT/NTs clean up BT/NTs from manual additions Clean up suggestions from homographic terms Delete “##CAI suggestions## “ Submit file to “INIS Input Box”
9
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting9 CAI Online Reviewing Process Delete all suggested descriptors which are too general Add relevant descriptors which were not found numerical values, e.g. pressure ranges, temperature ranges, etc nuclear reactions chemical compounds, alloys, etc. CAI is cleaning up BT/NTs will give warnings for BT/NTs from manual additions Clean up suggestions from homographic terms Export file when finished File will be exported to INIS Production System (or send back to MS for reviewing if requested)
10
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting10 CAI Thesaurus extension “Hidden terms” are character patterns representing the different appearances of a concept in the free text, which is indexed by one or more descriptors. handled similar to “forbidden terms” with one or more USE relations CAI internal only not exported to INIS production system not exported to FIBRE not printed in any appearance of the thesaurus support identification of descriptors in the free text
11
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting11 Hidden Terms: Compounds Descriptorhidden termfree text MAGNESIUM BORIDESMgB_2MgB 2 MAGNESIUM CARBONATESMgCO_3MgCO 3 MAGNESIUM HYDRIDESMgH_2MgH 2 MAGNESIUM HYDROXIDESMg(OH)_2Mg(OH) 2 IRON BROMIDESiron dibromide IRON BROMIDESiron tribromide ARSENIC IONSAs"3"-As 3- ACETYLENEC_2H_2C 2 H 2 ACETALDEHYDEC_2H_4OC 2 H 4 O ACETIC ACIDC_2H_4O_2C 2 H 4 O 2 approx. 2000 hidden terms (expected 3000)
12
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting12 Hidden Terms: Isotopes Descriptorhidden termfree text CESIUM 137Cesium 137, Cesium-137 "1"3"7cs 137 Cs 137 caesium137 Caesium, 137-Caesium caesium 137Caesium 137, Caesium-137 137 cesium137 Cesium, 137-Cesium 137 cs137 Cs, 137-Cs 137cs137Cs cs 137Cs 137, Cs-137 cs"1"3"7Cs 137 cs137Cs137 CESIUM 138"1"3"8"mcs 138m Cs cs"1"3"8"mCs 138m approx. 26.000 hidden terms
13
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting13 Hidden Terms: Elementary Particles Descriptorhidden termfree text B QUARKSbottom quarks T QUARKStop quarks ELECTRON NEUTRINOS#nu#_eν e MUON NEUTRINOS#nu#_#mu#ν μ TAU NEUTRINOS#nu#_#tau#ν τ RHO-770 MESONS#rho#(770)ρ(770) RHO-770 MESONS#rho#-770ρ-770 OMEGA-782 MESONS#omega#(782)ω(782) OMEGA-782 MESONS#omega#-782ω-782 KAONS NEUTRALK"0K 0 KAONS NEUTRAL SHORT-LIVEDK"0_SK 0 S KAONS NEUTRAL LONG-LIVEDK"0_LK 0 L approx. 300 hidden terms
14
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting14 Hidden Terms: UK/US Spellings Descriptorhidden term A CENTERSa centres ACTIVITY METERSactivity metres ANALOG COMPUTERSanalogue computers ANALOG SYSTEMSanalogue systems ANESTHESIAanaesthesia ARCHAEOLOGYarcheology AUSTRIAN ORGANIZATIONSaustrian organisations BALLISTIC MISSILE DEFENSEballistic missile defence BAYARD-ALPERT GAGESbayard-alpert gauges BEAM ANALYZERSbeam analysers BEHAVIORbehaviour CATALOGScatalogues approx. 800 hidden terms
15
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting15 Hidden Terms: Diacritics and Countries Descriptorhidden term Diacritics: BAECKLUND TRANSFORMATIONbacklund transformation BRUECKNER METHODbruckner method BRUECKNER MODELbruckner model BRUNSBUETTEL REACTORbrunsbuttel reactor MOESSBAUER EFFECTmossbauer effect Country Names: CAMBODIAkampuchea COTE D'IVOIREivory coast GREECEhellas MYANMARburma SYRIAsyrian arab republic THAILANDsiam approx. 250 hidden terms
16
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting16 Hidden Terms: Other Spellings Descriptorhidden term Singular/Plural FUNGIfungus FUNGIfunguses G MATRIXg matrices G MATRIXg matrixes Reverse Sequence ATOM-MOLECULE COLLISIONSmolecule-atom collisions ATOM-MOLECULE COLLISIONSatom-molecule scattering ATOM-MOLECULE COLLISIONSmolecule-atom scattering ATOM-MOLECULE COLLISIONSatom-molecule reactions ATOM-MOLECULE COLLISIONSmolecule-atom reactions ATOM-MOLECULE COLLISIONSatom-molecule interactions ATOM-MOLECULE COLLISIONSmolecule-atom interactions approx. 900 hidden terms
17
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting17 Hidden Terms: Other Spellings Descriptorhidden term Grammatical Variations PERIODICITYperiodic PERIODICITYperiodical PERIODICITYperiodically Phrases versus compound terms RADIOWAVE RADIATIONradio wave SPACE-TIMEspacetime WAVE FUNCTIONSwavefunction Terminology GAMMA SPECTROMETERS#gamma#ray spectrometer GAMMA SPECTROMETERS#gamma#-ray spectrometer GAMMA SPECTROMETERSgammaray spectrometer GAMMA SPECTROMETERSgamma-ray spectrometer
18
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting18 Hidden Terms: Other Spellings Descriptorhidden term Terminology SU-2 GROUPSsu(2) theory SU-2 GROUPSsu(2) symmetry SU-3 GROUPSsu(3) theory SU-3 GROUPSsu(3) symmetry Abbreviations CARBON DIOXIDE LASERSCO_2 laser CARBON DIOXIDE LASERSCO2 laser KOBAYASHI-MASKAWA MATRIXCKM matrix KORTEWEG-DE VRIES EQUATIONkdv equation Numerical Values KEV RANGEkev MEV RANGEmev GEV RANGEgev
19
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting19 CAI Thesaurus Extension Thesaurus Valid Descriptors21.147 Forbidden Terms 9.114 CAI Hidden Terms34.105 Total64.366 Terminological Knowledge Base
20
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting20 Terms which need special attention Numerical values, ranges ENERGY RANGES MEV RANGE MEV RANGE 01-10 MEV RANGE 10-100 MEV RANGE 100-1000 PESSURE RANGES Recognize pressure ranges Translate from atm, bar, torr to Pascal TEMPERATURE RANGES Recognize temperature ranges Translate from Celsius, Fahrenheit to Kelvin Attention: the forbidden term (since 1992) high temperature USE TEMPERATURE RANGE 0400-1000 K is leading often to wrong results
21
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting21 Terms which need special attention Multi-meaning “+” and “-“ signs K + KAONS PLUS, KAONS MINUS, POTASSIUM IONS Case sensitivity TiN TIN (instead of TITANIUM NITRIDES) …this can be … CaN CALCIUM NITRIDES gas GALLIUM SULFIDES “…who is the …” WHO (World Health Organization) Verbs versus Nouns “… this leads us to …” LEAD “… this leaves it ….” LEAVES
22
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting22 Terms which need special attention Multi-meaning MPA MAXIMUM PERMISSIBLE ACTIVITY Mega Pascal (MPa) GDP GROSS DOMESTIC PRODUCT GADOLINIUM PHOSPHIDES (GdP) COBRA SNAKES COBRA REACTOR KBR-1 REACTOR … in isotopes….. INDIUM ISOTOPES …at 195 deg K… ASTATINE 195
23
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting23 Terms which need special attention Homographic terms Solutions SOLUTIONS or MATHEMATICAL SOLUTIONS Color COLOR, COLOR CENTRES, COLOR MODEL Flavor FLAVOR, FLAVOR MODELS Tunnel TUNNELS, TUNNELING, TUNNEL EFFECT Nuclear Reactions, e.g. 14 N(γ,α) 10 B Targets Beams Reactions
24
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting24 Terms which need special attention Terms which are often wrong Production BEAM PRODUCTION HEAT PRODUCTION HYDROGEN PRODUCTION ISOTOPE PRODUCTION PARTICLE PRODUCTION PLASMA PRODUCTION PRODUCTION Transport AIR TRANSPORT ATOM TRANSPORT BEAM TRANSPORT CHARGED-PARTICLE TRANSPORT ENVIRONMENTAL TRANSPORT PHOTON TRANSPORT RADIOACTIVITY TRANSPORT TRANSPORT Decay NUCLEAR DECAY ALPHA DECAY BETA DECAY ……. PARTICLE DECAY ELECTROMAGNETIC… HADRONIC… RADIATIVE… WEAK…
25
International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting25 CAI Hands-on training by Subject Specialists PhysicsMarija Sejmenova-Gichevska A2477 ChemistryChristine Krieger-LevineA2478 ReactorsNeviana RashkovaA2479 Live ScienceBekele NegeriA2480
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.