Presentation is loading. Please wait.

Presentation is loading. Please wait.

Basics of Information Retrieval and Query Formulation Bekele Negeri Duresa Nuclear Information Specialist.

Similar presentations


Presentation on theme: "Basics of Information Retrieval and Query Formulation Bekele Negeri Duresa Nuclear Information Specialist."— Presentation transcript:

1 Basics of Information Retrieval and Query Formulation Bekele Negeri Duresa Nuclear Information Specialist

2 Outline  Information Retrieval  The INIS Collection  Planning searches  Formulating queries  Measure of Search effectiveness  Using search results and queries 2

3 Information Retrieval  Information Retrieval (IR) is finding material (usually documents) that satisfies an information need from within large collections (usually stored on computers).  Today we frequently think first of “web search”, but includes:  E-mail search  Searching your laptop  Corporate knowledge bases  Structured bibliographic databases 3

4 The INIS Collection  INIS Collection is comprised of  The INIS bibliographic Database (3,842,516 records)  The INIS Nonconventional Literature (NCL) collection (510,458 total and 369,140 publicly available)  A bibliographic database consists of data (records) whose attribute are described in fields and a means by which to search these fields, a search engine.  Databases may look different on screen but the underlying principles for searching and formulating search strategies are common to all. 4

5 Bibliographic Databases (Types)  Bibliographic Databases: provide only citation or reference author(s), title, subject(s) and publisher..) and with this information you should be able to locate the item in the Library.  Bibliographic with some full text content: these enhanced databases include keywords and abstracts and often, but not always, include the full text of a set of records.  Bibliographic databases with full text content: these databases include the entire full text for all articles and other documents indexed. 5

6 Planning your search  “Understanding the Problem is Half of the Solution”  Define precisely the information you are seeking  Identify the concepts that represent the problem 6

7 Planning a search: example Risk of medical radiation exposure?  To whom: to patients or doctors and medical technicians?  If to patients: are you concerned about exposure due to radiodiagnostics (CT, X-ray..) or due to radiation therapy or both?  If radiotherapy: are you concerned about radionuclide therapy or external irradiation therapy?  If about personnel: are you interested about safety policy of medical establishements? 7

8 Planning your search (Cont’d)  Topic: Risk of radiation exposure of medical staff in a radiotherapy department  Concepts:  Exposure to radiation  Medical staff  Radiotherapy  Are we looking for documents  In certain language, from certain country..  Latest publications  Only records with full text documents, or journal citations..? 8

9 IINIS Database Fields  Numerical (Exact or Range search)  Year of publication (PY)  Reference Number (RN)  Free Text  Title (TI) Authors (AU)  Source (SO)Abstract (AB)  Controlled Vocabulary  Language (LA)  Country of Input (CO)  Descriptors (DE)  Indexer-assigned descriptors (DEI)  Computer-upposted descriptors (DEC) 9

10 Search Strategy  Simple search  single search term or phrase  “Oncology”, “nuclear safety”  Advanced search (combining concepts)  Boolean Operators:OR, AND, NOT  Text Operators any (includes any), all (includes all), exact phrase  Numeric Operators equal, more, less, more or equal, less or equal  Truncation, Wildcard  Multilingual Search 10

11 Query Syntax Google Search Appliance 11

12 Query Formulation  Translating your search concepts into proper search syntax  For the Topic: Risk of radiation exposure of medical staff in a radiotherapy department  Simple to complex  MEDICAL PERSONNEL (is a BT for RADIOLOGICAL PERSONNEL)  OCCUPATIONAL EXPOSURE or OCCUPATIONAL SAFETY or RADIATION PROTECTION  RADIOTHERAPY  Try to search for individual terms and explore the database; you may identify other key concepts like radiation doses, ALARA, dose limits…  Then combine them using boolean operators (and, or, not)  Some databases allow you to combine searches while others allow you to combine your results during selection of records 12

13 Measuring Search Effectiveness  Precision & Recall  Recall: the ratio of the number of relevant records retrieved to the total number of relevant records in the database. 13

14  Precision: the ratio of the number of relevant records retrieved to the total number of irrelevant and relevant records retrieved.  Precision and recall are Inversely related High recall = comprehensive retrieval but high noise High Precision = only relevant records but miss out good records Source: http://www.creighton.edu/fileadmin/user/HSL/docs/ref/Searching_-_Recall_Precision.pdf 14

15 Optimize your search strategy  Precision  Search in particular field  Search in DEI (indexer assigned descriptor)  Use exact Phrase  Combine using “AND”  Recall  Search across fields  Combine synonyms, related terms, broad or general terms  Use “any” or “all” words  Optimise  Use your best judgment  From simple to complex 15

16 Using your Query and search results  Selecting relevant records  select format (pdf/ html/excel..)  Printing/saving  Email search results  Storing query  Save and run query  Subscribe Feeds 16

17 Thank you! 17


Download ppt "Basics of Information Retrieval and Query Formulation Bekele Negeri Duresa Nuclear Information Specialist."

Similar presentations


Ads by Google