© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Lexically Suggest, Logically Define: QA of Qualifiers &

Slides:



Advertisements
Similar presentations
Semantic Interoperability in Health Informatics: Lessons Learned 10 January 2008Semantic Interoperability in Health Informatics: Lessons Learned 1 Medical.
Advertisements

Quality Assurance of the Content of a Large DL-based Terminology using Mixed Lexical and Semantic Criteria: Experience with SNOMED CT Alan Rector, Luigi.
Semantics Static semantics Dynamic semantics attribute grammars
Terminologies How do they impact my world? Heather Grain Presented at HIMAA Symposium 2008.
Copyright © 2001 College of American Pathologists Pseudophakic corneal edema Sample Hierarchy for Corneal Edema Idiopathic corneal edema Phakic corneal.
ECO R European Centre for Ontological Research Ontology-based Error Detection in SNOMED-CT ® Werner Ceusters European Centre for Ontological Research Universität.
Catalina Martínez-Costa, Stefan Schulz: Ontology-based reinterpretation of the SNOMED CT context model Ontology-based reinterpretation of the SNOMED CT.
SNOMED CT’s Ontological Commitment Stefan Schulz University Medical Center, Freiburg, Germany Ronald Cornet Academic Medical Center, Amsterdam, The Netherlands.
Gergely Héja - SMCS2006 Copyright Notice Usage of semanticHEALTH public presentations Each slide is copied 1 exactly as it is presented by the.
Recheck Examinations and Results Of course it’s hard. That’s what makes it difficult.
Copyright © 2001 College of American Pathologists Sample Hierarchy for Fine Needle Biopsy of Abdominal Wall Mass Tru cut biopsy of lesion of abdominal.
A Probabilistic Framework for Information Integration and Retrieval on the Semantic Web by Livia Predoiu, Heiner Stuckenschmidt Institute of Computer Science,
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
Language Specfication and Implementation - PART II: Semantics of Procedural Programming Languages Lee McCluskey Department of Computing and Mathematical.
Visualizing Heart Data from Pulse Intervals By Juan Gabriel Estrada Alvarez.
Copyright © 2001 College of American Pathologists Sample Hierarchy for Acute MI Acute MI of lateral wall Acute MI of apical-lateral wall Acute Myocardial.
Copyright © 2001 College of American Pathologists Sample Hierarchy for Hepatitis Chronic viral hepatitis Chronic viral hepatitis C About SNOMED Relationships.
AICT5 – eProject Project Planning for ICT. Process Centre receives Scenario Group Work Scenario on website in October Assessment Window Individual Work.
Ontology-based Convergence of Medical Terminologies: SNOMED CT and ICD-11 Institute for Medical Informatics, Statistics and Documentation, Medical University.
Karen Gibson.  Significant investment in eHealth is underway  Clinical records: ◦ Not only a record for the author ◦ Essential to inform the next person.
OpenEHR Terminology binding Thomas Beale. © Ocean Informatics 2010 Problem overview.
Concept Model for observables, investigations, and observation results For the IHTSDO Observables Project Group and LOINC Coordination Project Revision.
Working Together to Advance Terminology Tooling Presentation to OHT Board, Birmingham Jennifer Zelmer & Karen Gibson.
A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,
Query Languages for SNOMED: Use Cases and Issues for Binding to Health Records and to ICD & background for comments on DRAFT SNOMED Query language spec.
Overview of Formal Methods. Topics Introduction and terminology FM and Software Engineering Applications of FM Propositional and Predicate Logic Program.
Overview, Benefits and how to approach Implementing - Robyn Richards (NEHTA)
Review of OWL for Biomedicine Alan Rector & CO-ODE/NIBHI University of Manchester OpenGALEN BioHealth Informatics Group © University.
Copyright © 2001 College of American Pathologists Sample Hierarchy for Tularemia Disorder Zoonotic bacterial disease Enteric tularemia Glandular tularemia.
This material was developed by Duke University, funded by the Department of Health and Human Services, Office of the National Coordinator for Health Information.
WHO – IHTSDO: SNOMED CT – ICD-11 coordination: Conditions vs. Situations Stefan Schulz IHTSDO conference Copenhagen, Apr 24, 2012.
Copyright © 2001 College of American Pathologists Sample Hierarchy for Blood Vessel Injury to the Leg Injury of blood vessels at ankle and foot level Popliteal.
2nd Sept 2004UK e-Science all hands meeting1 Designing User Interfaces to Minimise Common Errors in Ontology Development Alan Rector, Nick Drummond, Matthew.
Some Thoughts to Consider 8 How difficult is it to get a group of people, or a group of companies, or a group of nations to agree on a particular ontology?
Approach to building ontologies A high-level view Chris Wroe.
Pre- and Post-Coordination in Biomedical Ontologies Stefan Schulz Daniel Schober Djamila Raufie Martin Boeker Medical Informatics Research Group University.
Detection of underspecifications in SNOMED CT concept definitions using language processing 1 Federal Technical University of Paraná (UTFPR), Curitiba,
Ontologies for Terminologies, Knowledge Representation & Software: Benefits & Gaps (“Don’t make the tea”) (Only a part of Knowledge Representation) Alan.
Oncologic Pathology in Biomedical Terminologies Challenges for Data Integration Olivier Bodenreider National Library of Medicine Bethesda, Maryland -
© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Quality Assurance, Ontology Engineering, and Semantic Interoperability.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Nursing Occupations Proposed by: Cynthia Lundberg, BSN Judith Warren, PhD, RN.
EBM --- Journal Reading Presenter :黃美琴 Date : 2005/10/27.
1 Alberta Health Services Capital Health Palliative Care Program Clinical Vocabulary Pilot Project Project Update Friday April 24, 2009 Dennis Lee & Francis.
Essential SNOMED. Simplifying S-CT. Supporting integration with health
Oncology in SNOMED CT NCI Workshop The Role of Ontology in Big Cancer Data Session 3: Cancer big data and the Ontology of Disease Bethesda, Maryland May.
© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Quality Assurance, Ontology Engineering, and Semantic Interoperability.
Frequent Criticisms too ambitious recreation of a Textbook of Medicine competing with SNOMED-CT replicates the work done elsewhere: DSM, ICPC, too academic.
A Proposed Approach to Binding SNOMED CT to HL7 FHIR Dr Linda Bird Senior Implementation Specialist.
Certification of Reusable Software Artifacts
SNOMED CT and Surgical Pathology
Revising the Anatomy Model to Improve Usability and Correctness
W. Scott Campbell, MBA, PhD James R. Campbell, MD
SNOMED CT Logic Profile Enhancement Yongsheng Gao, 2017/07/11
IHTSDO Observables Project Call
Scope The scope of this test is to make a preliminary assessment of the fitness of the Draft Observables and Investigation Model (Observables model)
SNOMED CT and Surgical Pathology
Can SNOMED CT be harmonized with an upper-level ontology?
CIMI Semantics Roundup
February 17, 2017 Bruce Goldberg, MD, PhD
MedDRA and Ontology Discussion of Strategy
Kin Wah Fung, MD, MS, MA National Library of Medicine, NIH
Alan Rector, Luigi Iannone, Robert Stevens
Architecture for ICD 11 and SNOMED CT Harmonization
A bit more about Read Codes and SNOMED CT
Harmonizing SNOMED CT with BioTopLite
Lexical ambiguity in SNOMED CT
Animating the reference terminology – showing classifiers at work
Test-Driven Ontology Development in Protégé
AICT5 – eProject Project Planning for ICT
Presentation transcript:

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Lexically Suggest, Logically Define: QA of Qualifiers & Expected Results of Post-Coordination in SNOMED CT Lexically Suggest, Logically Define: QA of Qualifiers & Expected Results of Post-Coordination in SNOMED CT Alan Rector & Luigi Iannone with thanks to Robert Stevens BioHealth Informatics Group School of Computer Science & Northwest Institute of BioHealth Informatics University of Manchester, Manchester M13 9PL

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Pre-coordination and post-coordination ►Pre-coordination ►SNOMED authors define “Acute bronchitis” Classifier creates correct hierarchy ►Clinical user enters “Acute bronchitis” (or its code) ►Post-coordination ►Clinical user enters “Bronchitis” + “Acute” ►Classifier finds any equivalent term or places the expression in the right place in the hierarchy Concept does not need to exist beforehand, e.g. Might define “Acute” + “Bronchitis” + “Right main stem bronchus” ‣ Would still be in the correct place in hierarchy even if no term exists. 2

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license 3 Can SNOMED post-coordination work? Do SNOMED authors pre-coordinate consistently? ►Two related questions? ►Are SNOMED qualified expressions expressed consistently? If SNOMED authors don’t do it consistently, can anyone else? ►Proxies: In either case ►The definitions should allow the description logic classifier to organize the hierarchies correctly Includes determining equivalence between pre- and post- coordinated forms ‣ Necessary but not sufficient for post-coordination to work ►For post coordination, must be well defined consistent patterns that users & software develpers understand

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license 4 First try ►Take a simple case: “acute” and “chronic” ►Look at the pattern SNOMED uses to define Acute disease and Chronic disease ►Follow Campbell, Tuttle, & Spackman and see how many diseases named “Acute…” or “Chronic …” are retrieved under the pattern

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Definition of acute & chronic ►Chronic disease == Disease & (RoleGroup some (Clinical course some Chronic)) broaden to ►Chronic finding == Clinical finding & (RoleGroup some (Clinical course some Chronic)) ►… similarly for Acute ► fully specified name: “Sudden onset AND/OR short duration” 5

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Write a script to check for candidates in OPPL2 ►Requires ►Lexical match ►Description logic/OWL semantics -- open world, negation as provably false DL Reasoner ►Query semantics -- closed world, negation as failure over concepts in corpus ►Procedural semantics – add things to ontology ►?C:CLASS=MATCH("'Chronic.*")  Lexical SELECT ?C SubClassOf 'Clinical finding (finding)'  DL Semantics WHERE FAIL ?C SubClassOf ‘Chronic clinical finding (finding)’  Query Semantics BEGIN ADD ?C SubClassOf Candidate  Procedural END; 6

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Next, classify candidates; only top-level ones need be examined ►If a concept’s definition is changed, the change will be inherited by all descendants ►What did we find? ►25%-30% of all lexical matches were “Candidate” errors, but there were cases where “Acute” and “Chronic” clearly no longer can be taken literally ‣ Chronic and acute leukemias and myeloproliferative disorders ‣ So exclude them from candidates 7

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Then remaining candidates not classified as Chronic findings: ►Why? ►Systematic? …or… ►Accidental? 8

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Look at definitions ►Systematic ►Chronic duodenal ulcer == Duodenal ulcer disease and RoleGroup some ( Associated morphology some Chronic ulcer (morphologic abnormality)  and Finding site some Duodenal structure))) ►Compare with ►Chronic disease == Disease & (RoleGroup some (Clinical course some Chronic))  ►Different qualifiers ►Associated morphology ►Clinical course 9

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Different qualifiers ►User guide says: ►Acute & chronic may be morphological Chronic inflammation means mononuclear cell infiltration Acute inflammation means polymorphonuclear cell infiltration ►For ulcers… ►Chronic ulcer (morphological abnormality) is a kind of Chronic inflammation (morphological abnormality) ►But users must understand ►Acute and chronic ulcers are defined by Associated morphology, ► Acute obstruction is defined by Clinical course, ►Chronic cholecystitis by both! ►Are these the consequences we want? ►Does this correspond to use in clinical care? Do we have evidence? Should pathology take precedence over clinical observation? 10

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Late discovery: Chronic inflammatory disease is defined as have both qualifiers! ►Chronic inflammatory disease == Chronic disease & RoleGroup some ( Associated morphology some Chronic inflammatory morphology) & RoleGroup some ( Clinical course some Chronic ) ►Means: ►Classifier will chronic inflammatory disease only if you have both Or that author asserts directly is a descendant of Chronic inflammatory disease ►To get post-coordination to work you have to use both! Will anyone remember to do so? Obviously not all SNOMED authors, 11

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license … but even authors don’t, so Many inflammations (…itis) are missed ►Authors have done some directly and not others ►“Helter skelter” / “Mish mash” modelling Systematic inconsistency What using a description logic is meant to avoid 12

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license One solution ►Change the axioms so that any disease with chronic inflammatory morphology has a chronic course ►Still within SNOMED’s DL EL++/OWL-EL SNOROCKET still classifies it efficiently ►Or vice versa for all inflammatory diseases with chronic course ►Chronic course & inflammatory morphology  Chronic inflammatory morphology 13

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license How should the decision be made? How monitored? ►New axiom may or may not be strictly “true”, but… ►What are the consequences? ►For accuracy of authoring? ►For accuracy of retrieval? ►For consistency of setting value sets? ►For post-coordination? ►For meaninful use? ►Base decisions on evidence of consequences ►Evidence-based terminologies / ontologies ►Whatever the decision, need a QA process to enforce and check it 14

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license How big is the problem? ►In a “module” based on the UMLS CORE Problem list subset: ►368 total chronic; 450 total acute ►103 (28%) chronic / 92 ( 20%) Acute were “candidates”, of these: Due to use of morphology only 85 (83%) chronic / 92 (85%) Acute Due to simple errors and omissions 18 (17%) chronic / 14 (15%) 15

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Other issues (See paper) ►Hierarchy of qualifiers ►Should Intermittent (course) be a kind of chronic (course)? What about “intermittent acute pain”? ►Pressure ulcers and decubitous ulcers are all chronic by definition Can there be an acute pressure ulcer? ►Odd anatomy ►Lower back pain is a kind of Abdominal pain Because the lower back is part of the abdominal wall is part of the abdomen ‣ (Anatomy under review by SNOMED) 16

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license You have to use a classifier ►This work can only be done by using a classifier to find inferences ►Post-coordination depends on the classifier ►To work efficiently, the classifier must be fast ►For iterative analysis, < 1 min ►SNOROCKET in Protege is very fast and reliable ►But still works better on modules than all of SNOMED 17

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Use of “modules” makes this possible ►A “signature” is a subset of the entities in a description logic/OWL KB ►A “module” for a “signature” is a subset of the axioms & entities in the KB such that ►All inferences amongst entities in the signature can be inferred from the module ►For the UMLS CORE Problem List Subset ►SNOMED Size ~300,000 Classification time 2-8 minutes ►Signature (UMLS CORE Subset) ~8500 ►Module extracted ~35,000 Classification time.25 – 2 minutes ►Also methods for extracting the changes and applying them to the whole ►Re-apply final methods to whole corpus if require ►Total effort for this study =< 2 person weeks 18

© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 licenseSummary ►Lexical suggest, semantically define works to raise issues ►Post coordination of acute and chronic unlikely to work reliably, unless ►SNOMED makes pattern consistent ►Bases decisions on consequences for use in patient care Are patient care clinicians likely to align with pathology in the ED? ►Other Findings ►Working on modules makes analysis of SNOMED practical ►There are problems in the anatomy and qualifier hierarchies ►Questions ►How many other such problems are there? How do they affect post-coordination? ►How to establish QA procedures to find out and prevent recurrence? 19