Presentation is loading. Please wait.

Presentation is loading. Please wait.

International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting1 Workshop on Computer-assisted Indexing Alexander Nevyjel 34 th Consultative Meeting of.

Similar presentations


Presentation on theme: "International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting1 Workshop on Computer-assisted Indexing Alexander Nevyjel 34 th Consultative Meeting of."— Presentation transcript:

1 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting1 Workshop on Computer-assisted Indexing Alexander Nevyjel 34 th Consultative Meeting of INIS Liaison Officers 2-5 November 2008, Vienna, Austria

2 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting2Agenda Review CAI procedures (workflow, formats, conventions) Thesaurus extension: Hidden terms tables Problems and how to overcome Discussion and exchange of experiences Hands-on training by INIS Subject Specialists (in their offices, open end for this afternoon) Tips, tricks, recommendations

3 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting3 Objectives of Computer-assisted Indexing  Maintaining database quality  Saving of subject analysis manpower  Improving indexing consistency

4 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting4 CAI-Workflow Interactive CAI Processing Batch Mode Conventional Processing

5 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting5 CAI Batch and Online Processing Input:MemSt-CC-yymmdd-xxxxxxxxxxx Output:_MemSt-CC-yymmdd-xxxxxxxxxxx MemSt is a standard prefix (meaning “member state”) CC is the country code yymmdd is the date when the file was generated xxxxxxxxxxx is any additional identification Examples MemSt-AR-041203-thisismytestfile MemSt-FR-041212-fileidentification

6 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting6 CAI Batch Processing Output:_MemSt-CC-yymmdd-xxxxxxxxxxx These files will carry the CAI suggested descriptors in tag 800, preceded by the string ##CAI suggestions##; Example: 800^##CAI suggestions##; DESCRIPTOR1; DESCRIPTOR2; DESCRIPTOR3; ……. sent back to the member state for reviewing

7 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting7 CAI Online File loaded to CAI online All files of a Member State appear on the queue page as batch MemSt-XX Please open only your own batch, do not touch other queues Files in a queue will be opened one after the other, in the sequence as they have been loaded

8 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting8 CAI Batch Processing Reviewing Process Delete all suggested descriptors which are too general Add relevant descriptors which were not found numerical values, e.g. pressure ranges, temperature ranges, etc nuclear reactions chemical compounds, alloys, etc. CAI is cleaning up BT/NTs  clean up BT/NTs from manual additions Clean up suggestions from homographic terms Delete “##CAI suggestions## “ Submit file to “INIS Input Box”

9 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting9 CAI Online Reviewing Process Delete all suggested descriptors which are too general Add relevant descriptors which were not found numerical values, e.g. pressure ranges, temperature ranges, etc nuclear reactions chemical compounds, alloys, etc. CAI is cleaning up BT/NTs  will give warnings for BT/NTs from manual additions Clean up suggestions from homographic terms Export file when finished File will be exported to INIS Production System (or send back to MS for reviewing if requested)

10 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting10 CAI Thesaurus extension “Hidden terms” are character patterns representing the different appearances of a concept in the free text, which is indexed by one or more descriptors. handled similar to “forbidden terms” with one or more USE relations CAI internal only not exported to INIS production system not exported to FIBRE not printed in any appearance of the thesaurus support identification of descriptors in the free text

11 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting11 Hidden Terms: Compounds Descriptorhidden termfree text MAGNESIUM BORIDESMgB_2MgB 2 MAGNESIUM CARBONATESMgCO_3MgCO 3 MAGNESIUM HYDRIDESMgH_2MgH 2 MAGNESIUM HYDROXIDESMg(OH)_2Mg(OH) 2 IRON BROMIDESiron dibromide IRON BROMIDESiron tribromide ARSENIC IONSAs"3"-As 3- ACETYLENEC_2H_2C 2 H 2 ACETALDEHYDEC_2H_4OC 2 H 4 O ACETIC ACIDC_2H_4O_2C 2 H 4 O 2  approx. 2000 hidden terms (expected 3000)

12 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting12 Hidden Terms: Isotopes Descriptorhidden termfree text CESIUM 137Cesium 137, Cesium-137 "1"3"7cs 137 Cs 137 caesium137 Caesium, 137-Caesium caesium 137Caesium 137, Caesium-137 137 cesium137 Cesium, 137-Cesium 137 cs137 Cs, 137-Cs 137cs137Cs cs 137Cs 137, Cs-137 cs"1"3"7Cs 137 cs137Cs137 CESIUM 138"1"3"8"mcs 138m Cs cs"1"3"8"mCs 138m  approx. 26.000 hidden terms

13 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting13 Hidden Terms: Elementary Particles Descriptorhidden termfree text B QUARKSbottom quarks T QUARKStop quarks ELECTRON NEUTRINOS#nu#_eν e MUON NEUTRINOS#nu#_#mu#ν μ TAU NEUTRINOS#nu#_#tau#ν τ RHO-770 MESONS#rho#(770)ρ(770) RHO-770 MESONS#rho#-770ρ-770 OMEGA-782 MESONS#omega#(782)ω(782) OMEGA-782 MESONS#omega#-782ω-782 KAONS NEUTRALK"0K 0 KAONS NEUTRAL SHORT-LIVEDK"0_SK 0 S KAONS NEUTRAL LONG-LIVEDK"0_LK 0 L  approx. 300 hidden terms

14 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting14 Hidden Terms: UK/US Spellings Descriptorhidden term A CENTERSa centres ACTIVITY METERSactivity metres ANALOG COMPUTERSanalogue computers ANALOG SYSTEMSanalogue systems ANESTHESIAanaesthesia ARCHAEOLOGYarcheology AUSTRIAN ORGANIZATIONSaustrian organisations BALLISTIC MISSILE DEFENSEballistic missile defence BAYARD-ALPERT GAGESbayard-alpert gauges BEAM ANALYZERSbeam analysers BEHAVIORbehaviour CATALOGScatalogues  approx. 800 hidden terms

15 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting15 Hidden Terms: Diacritics and Countries Descriptorhidden term Diacritics: BAECKLUND TRANSFORMATIONbacklund transformation BRUECKNER METHODbruckner method BRUECKNER MODELbruckner model BRUNSBUETTEL REACTORbrunsbuttel reactor MOESSBAUER EFFECTmossbauer effect Country Names: CAMBODIAkampuchea COTE D'IVOIREivory coast GREECEhellas MYANMARburma SYRIAsyrian arab republic THAILANDsiam  approx. 250 hidden terms

16 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting16 Hidden Terms: Other Spellings Descriptorhidden term Singular/Plural FUNGIfungus FUNGIfunguses G MATRIXg matrices G MATRIXg matrixes Reverse Sequence ATOM-MOLECULE COLLISIONSmolecule-atom collisions ATOM-MOLECULE COLLISIONSatom-molecule scattering ATOM-MOLECULE COLLISIONSmolecule-atom scattering ATOM-MOLECULE COLLISIONSatom-molecule reactions ATOM-MOLECULE COLLISIONSmolecule-atom reactions ATOM-MOLECULE COLLISIONSatom-molecule interactions ATOM-MOLECULE COLLISIONSmolecule-atom interactions  approx. 900 hidden terms

17 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting17 Hidden Terms: Other Spellings Descriptorhidden term Grammatical Variations PERIODICITYperiodic PERIODICITYperiodical PERIODICITYperiodically Phrases versus compound terms RADIOWAVE RADIATIONradio wave SPACE-TIMEspacetime WAVE FUNCTIONSwavefunction Terminology GAMMA SPECTROMETERS#gamma#ray spectrometer GAMMA SPECTROMETERS#gamma#-ray spectrometer GAMMA SPECTROMETERSgammaray spectrometer GAMMA SPECTROMETERSgamma-ray spectrometer

18 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting18 Hidden Terms: Other Spellings Descriptorhidden term Terminology SU-2 GROUPSsu(2) theory SU-2 GROUPSsu(2) symmetry SU-3 GROUPSsu(3) theory SU-3 GROUPSsu(3) symmetry Abbreviations CARBON DIOXIDE LASERSCO_2 laser CARBON DIOXIDE LASERSCO2 laser KOBAYASHI-MASKAWA MATRIXCKM matrix KORTEWEG-DE VRIES EQUATIONkdv equation Numerical Values KEV RANGEkev MEV RANGEmev GEV RANGEgev

19 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting19 CAI Thesaurus Extension Thesaurus Valid Descriptors21.147 Forbidden Terms 9.114 CAI Hidden Terms34.105 Total64.366  Terminological Knowledge Base

20 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting20 Terms which need special attention Numerical values, ranges ENERGY RANGES MEV RANGE MEV RANGE 01-10 MEV RANGE 10-100 MEV RANGE 100-1000 PESSURE RANGES Recognize pressure ranges Translate from atm, bar, torr to Pascal TEMPERATURE RANGES Recognize temperature ranges Translate from Celsius, Fahrenheit to Kelvin Attention: the forbidden term (since 1992) high temperature USE TEMPERATURE RANGE 0400-1000 K is leading often to wrong results

21 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting21 Terms which need special attention Multi-meaning “+” and “-“ signs K +  KAONS PLUS, KAONS MINUS, POTASSIUM IONS Case sensitivity TiN  TIN (instead of TITANIUM NITRIDES) …this can be …  CaN  CALCIUM NITRIDES gas  GALLIUM SULFIDES “…who is the …”  WHO (World Health Organization) Verbs versus Nouns “… this leads us to …”  LEAD “… this leaves it ….”  LEAVES

22 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting22 Terms which need special attention Multi-meaning MPA MAXIMUM PERMISSIBLE ACTIVITY Mega Pascal (MPa) GDP GROSS DOMESTIC PRODUCT GADOLINIUM PHOSPHIDES (GdP) COBRA  SNAKES COBRA REACTOR  KBR-1 REACTOR … in isotopes…..  INDIUM ISOTOPES …at 195 deg K…  ASTATINE 195

23 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting23 Terms which need special attention Homographic terms Solutions  SOLUTIONS or MATHEMATICAL SOLUTIONS Color  COLOR, COLOR CENTRES, COLOR MODEL Flavor  FLAVOR, FLAVOR MODELS Tunnel  TUNNELS, TUNNELING, TUNNEL EFFECT Nuclear Reactions, e.g. 14 N(γ,α) 10 B Targets Beams Reactions

24 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting24 Terms which need special attention Terms which are often wrong Production BEAM PRODUCTION HEAT PRODUCTION HYDROGEN PRODUCTION ISOTOPE PRODUCTION PARTICLE PRODUCTION PLASMA PRODUCTION PRODUCTION Transport AIR TRANSPORT ATOM TRANSPORT BEAM TRANSPORT CHARGED-PARTICLE TRANSPORT ENVIRONMENTAL TRANSPORT PHOTON TRANSPORT RADIOACTIVITY TRANSPORT TRANSPORT Decay NUCLEAR DECAY ALPHA DECAY BETA DECAY ……. PARTICLE DECAY ELECTROMAGNETIC… HADRONIC… RADIATIVE… WEAK…

25 International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting25 CAI Hands-on training by Subject Specialists PhysicsMarija Sejmenova-Gichevska A2477 ChemistryChristine Krieger-LevineA2478 ReactorsNeviana RashkovaA2479 Live ScienceBekele NegeriA2480


Download ppt "International Atomic Energy Agency 2-5 Nov 200834th ILO Meeting1 Workshop on Computer-assisted Indexing Alexander Nevyjel 34 th Consultative Meeting of."

Similar presentations


Ads by Google