Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Relevance Feedback Retrieval of Time Series Data Eamonn J. Keogh & Michael J. Pazzani Prepared By/ Fahad Al-jutaily Supervisor/ Dr. Mourad Ykhlef IS531.
UCLA : GSE&IS : Department of Information StudiesJF : 276lec1.ppt : 5/2/2015 : 1 I N F S I N F O R M A T I O N R E T R I E V A L S Y S T E M S Week.
Dialogue – Driven Intranet Search Suma Adindla School of Computer Science & Electronic Engineering 8th LANGUAGE & COMPUTATION DAY 2009.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
1 CBioC: Collaborative Bio- Curation Chitta Baral Department of Computer Science and Engineering Arizona State University.
Reference Collections: Task Characteristics. TREC Collection Text REtrieval Conference (TREC) –sponsored by NIST and DARPA (1992-?) Comparing approaches.
Columbia University Dept of Computer Science Center for Research on Info Access University of So. Calif Information Sciences Institute (ISI)
Presented by Zeehasham Rasheed
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
CSE 730 Information Retrieval of Biomedical Data The use of medical lexicon in biomedical IR.
Common Core State Standards Professional Learning Module Series
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Research Methods for Computer Science CSCI 6620 Spring 2014 Dr. Pettey CSCI 6620 Spring 2014 Dr. Pettey.
GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.
NLM-Semantic Medline Data Science Data Publication Commons Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data.
Educator’s Guide Using Instructables With Your Students.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Universität Stuttgart Universitätsbibliothek Information Retrieval on the Grid? Results and suggestions from Project GRACE Werner Stephan Stuttgart University.
Claudia Marzi Institute for Computational Linguistics, “Antonio Zampolli” – Italian National Research Council University of Pavia – Dept. of Theoretical.
Information Need Question Understanding Selecting Sources Information Retrieval and Extraction Answer Determina tion Answer Presentation This work is supported.
Survey of Semantic Annotation Platforms
Learning Object Metadata Mining Masoud Makrehchi Supervisor: Prof. Mohamed Kamel.
Linking Diseases and Genes through Informatics Knowledge Bases and Ontologies Joyce A. Mitchell, Ph.D. National Library of Medicine University of Missouri.
Annual reports and feedback from UMLS licensees Kin Wah Fung MD, MSc, MA The UMLS Team National Library of Medicine Workshop on the Future of the UMLS.
Ontology Summit2007 Survey Response Analysis -- Issues Ken Baclawski Northeastern University.
Survey of Medical Informatics CS 493 – Fall 2004 September 27, 2004.
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.
Shelly Warwick, MLS, Ph.D – Permission is granted to reproduce and edit this work for non-commercial educational use as long as attribution is provided.
Ontology Summit2007 Survey Response Analysis Ken Baclawski Northeastern University.
UMLS Unified Medical Language System. What is UMLS? A Unified knowledge representation system Project of NLM Large scale Distributed First launched in.
Grade 8 – Writing Standards Text Types and Purposes (1b) Write arguments to support claims with clear reasons and relevant evidence. Support claim(s) with.
Search Engine Architecture
Common Core Reading Standards for Science. CCSS.ELA-Literacy.RST CITE specific textual evidence to support analysis of science and technical texts.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Christoph F. Eick University of Houston Organization 1. What are Ontologies? 2. What are they good for? 3. Ontologies and.
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Mining the Biomedical Research Literature Ken Baclawski.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Chapter 7 K NOWLEDGE R EPRESENTATION, O NTOLOGICAL E NGINEERING, AND T OPIC M APS L EO O BRST AND H OWARD L IU.
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
5. 2Object-Oriented Analysis and Design with the Unified Process Objectives  Describe the activities of the requirements discipline  Describe the difference.
Jarg Corporation Seeks Sponsors/Partners, Who: Identify Solutions To Problems With Our Pilot (life science) Demonstrations of: Effective Semantic Use of.
MEDLINE®/PubMed® PubMed for Trainers, Fall 2015 U.S. National Library of Medicine (NLM) and NLM Training Center An introduction.
The Agricultural Ontology Server (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Food and Agriculture Organization.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
GUIDE. P UB M ED
Searching the Web for academic information Ruth Stubbings.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Search Engine Architecture
Lecture 12: Data Wrangling
Information Retrieval
Exploring Scholarly Data with Rexplore
CSE 635 Multimedia Information Retrieval
PubMed.
Search Engine Architecture
Information Retrieval and Web Design
Presentation transcript:

Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major † Bipin Indurkhya ‡ * Northeastern University † Jarg Corporation ‡ Tokyo University of Agriculture and Technology

Purpose  Biomedical Information Searches  Ontologies & the UMLS  Knowledge Representation Input - Natural Language Processing Retrieval - Ontologies & Semantic Frameworks Information Visualization - Keynets  Results of Usability Studies

Problem: Low Quality Search  Searching using keyword matching often has high volume and low precision.  Discrete keywords do not represent knowledge.  Result of a search are not be arranged in a semantically relevant way.  Examining search results is often tedious.  Search results include only textual documents. Introduction

Solution: Ontologies Model for knowledge extraction/management using a domain-specific vocabulary and theories expressing the meaning of the vocabulary within the community using the vocabulary.

Advantages of Ontologies  Allows semantically correct retrieval based on domain specific criteria.  No limit to the depth of knowledge that can be represented, managed and retrieved.  Multiplicity of information objects retrieved: images, video, sound, etc. as well as text.  Results of a search are grouped by how documents are relevant to the whole query.  The ontology can be updated as new terminology and relationships are introduced.

UMLS  US National Library of Medicine since 1986  Overcomes retrieval problems –Differences in terminology –Distributed database sources  Develops machine-readable “knowledge sources”  Allow researchers and health professionals to retrieve and integrate electronically available biomedical information.

 Free  Iteratively refined and expanded from feedback  Maps many different names for the same concept  Grateful Med and PubMed are applications of the UMLS

 Semantic Categories – > 130 semantic categories  Semantic Relationships –“ is a “, “ part of”, “disrupts”  Semantic Concepts (Vocabulary) – > 1,000,000 concepts map to categories

Natural Language Processing using an Ontology syntactic semantic

Keynets A technique for representing information in a visual manner that can be manipulated into meaningful associations for refinement of the knowledge extracted.  Exploits human – computer interactivity inherent in knowledge processing.  Based on Information Visualization Concept (Schneiderman, 1998)

Knowledge Representation using the UMLS and Keynets  Acyclic directed graph.  Provides a consistent categorization for all concepts.  Shows the important relationships between the concepts.  NLP using the UMLS produces Keynets, a new search strategy for knowledge processing of biomedical information. “Fc-receptors on NK cells”

Usability Study  The purpose was to explore the reactions of users to different representations of biomedical information –Keywords: Fc-receptors, cells, NK cells –Keynet:  Sample: n = 11; MD, PhD, Biomedical engineers, Pharmacologists - individuals who would typically be required to search for biomedical information

UMLS Keywords and Keynets

Survey Format Three Sections I.Demographics. II.9 semantic differential focused questions. III.Open ended questions to assess subjects overall impressions of using keynets and information visualization for knowledge representation,

Semantic Differential Question  scale 1-9, 0 = N/A e.g. confusing/clear 1 most like first word or “confusing” 9 most like the last word “clear” confusing clear

Semantic Differential Question  1a How would you rate the Keynet version in its ability to represent the biomedical text given? confusing clear  1b How would you rate the Keyword version in its ability to represent the biomedical text given? confusing clear

UMLS Keywords and Keynets

Question Survey Results n=11 Score

Results of Usability Study  Level of Understanding of Keynets –Remarkably high given short time to complete study, population diversity, different examples used. –Example – missing relationship detected (7 of 11)  Limit Complexity –Representations should be concise drilling down only at the user’s request  Keywords versus Keynet –No statistical difference, Keynets are as least as useful as Keywords in representation of biomedical information retrieval.

Summary  A new strategy is suggested for searching and retrieving biomedical information using NLP, the UMLS and Keynet displays of the retrieved results.  Issues of semantic versus syntactic representations for biomedical information retrieval.  Issues relating information visualization for the processing of biomedical information retrieval.

Conclusion Consider the computer- human interactivity issues A picture is worth a thousand keywords!!

Acknowledgement  This project was performed as part of the “Biomedical Science Information Retrieval and Management” project supported by grant # 1 R43 LM from the National Institute of Health (NIH). Its contents are solely the responsibility of the authors and do not necessarily represent the official views of the NIH.  A portion of this study was conducted in part at Jarg Corporation, 332b Second Ave., Waltham, MA  Travel expenses for this presentation were provided by a grant from the Dept. of Energy.

Addendum  Technical information related to Keynets