The work proposed in this study is an attempt to use Semantic Web technologies for integrating patient clinical data derived from Electronic Health Records.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Building FHIR Servers on Existing Applications
Consistent and standardized common model to support large-scale vocabulary use and adoption Robust, scalable, and common API to reduce variation in clinical.
©2011 MFMER | slide-1 The Linked Clinical Data Project Jyotishman Pathak, PhD HCLS TMO October 27, 2010.
Semantic Web Introduction
Rockefeller Phenotyping Initiative Translational Key Function Committee 8/3/2010 Laboratory of Blood and Vascular Biology Laboratory of Human Genetics.
Knowledge Graph: Connecting Big Data Semantics
Common Terminology Services 2 (CTS2)
Searching Patient Data: A Role for Librarians in the Improvement of Healthcare Margaret Henderson, MLIS, AHIP Tompkins-McCaw Library.
JSI Sensor Middleware. Slide 2 of x Embedded vs. Midleware based Architecture for Sensor Metadata Management Embedded approach assign an IP address to.
Amy Sheide Clinical Informaticist 3M Health Information Systems USA Achieving Data Standardization in Health Information Exchange and Quality Measurement.
©2013 MFMER | slide-1 Building A Knowledge Base of Severe Adverse Drug Events Based On AERS Reporting Data Using Semantic Web Technologies Guoqian Jiang,
Guoqian Jiang, MD, PhD Mayo Clinic
™ Suggestions for Semantic Web Interfaces to Relational Databases Mike Dean W3C Workshop on RDF Access to Relational Databases Cambridge,
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
Ontology Notes are from:
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
EleMAP: An Online Tool for Harmonizing Data Elements using Standardized Metadata Registries and Biomedical Vocabularies Jyotishman Pathak, PhD 1 Janey.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
The Role of Standard Terminologies in Facilitating Integration James J. Cimino, M.D. Departments of Biomedical Informatics and Medicine Columbia University.
Integrating Complementary Tools with PopMedNet TM 27 July 2015 Rich Schaaf
Semantic Web Technologies: A Paradigm for Medical Informatics Chimezie Ogbuji (Owner, Metacognition LLC.)
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Publishing data on the Web (with.
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
Managing & Integrating Enterprise Data with Semantic Technologies Susie Stephens Principal Product Manager, Oracle
From Web 1.0  Web 3.0: Is RDF access to RDB enough? Vipul Kashyap Senior Medical Informatician, Clinical Informatics R&D Partners.
Butte Lab Journal Club 16 Aug 2010 Alexander A. Morgan.
1 CSE 2102 CSE 2102 Ph.D. Proposal A Process Framework For Ontology Modeling, Design, And Development Realized By Extending OWL and ODM Candidate: Rishi.
Entity Recognition via Querying DBpedia ElShaimaa Ali.
© 2015 Mayo Foundation for Medical Education and Research Guoqian Jiang, MD, PhD 1, Richard Kiefer 1, Luke V. Rasmussen 2 ; Huan Mo, MD, MS 3, Jennifer.
The Semantic Web Web Science Systems Development Spring 2015.
A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,
Digital Enterprise Research Institute HADA – An Access Controlled Application for Publishing and Discovering Linked Government Data Owen Sacco.
Advancing translational research with the Semantic Web Ruttenberg, Clark, Bug, Samwald, Bodenreider, Chen, Doherty, Forsberg, Gao, Kashyap, Kinoshita,
 Yingjie Hu, PhD student  Space and Time Knowledge Organization Lab  Department of Geography, UCSB  Summer intern, APL  Sathya Prasad  Lead and.
Survey of Medical Informatics CS 493 – Fall 2004 September 27, 2004.
On the Semantics of R2RML and its Relationship with the Direct Mapping Juan F. Sequeda Research in Bioinformatics and Semantic Web (RiBS) Lab Department.
LexRDF: A Semantic-Web Compatible Extension of LexGrid Cui Tao Jyotishman Pathak Harold R. Solbrig Wei-Qi Wei Christopher G. Chute Division of Biomedical.
Value Set Resolution: Build generalizable data normalization pipeline using LexEVS infrastructure resources Explore UIMA framework for implementing semantic.
Modeling and Representing National Climate Assessment Information using Linked Data Jin Guang Zheng 1 Curt Tilmes 2
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
Using Semantic Mapping to Manage Heterogeneity in XLIFF Interoperability by Dave Lewis, Rob Brennan, Alan Meehan, Declan O’Sullivan CNGL Centre for Global.
A Semantic-Web Representation of Clinical Element Models
LexGrid Philosophy, Model and Interfaces Harold R Solbrig Division of Biomedical Statistics and Informatics Mayo Clinic.
PHS / Department of General Practice Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Knowledge representation in TRANSFoRm AMIA.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
BBN Technologies Copyright 2009 Slide 1 The S*QL Plugin for Cytoscape Visual Analytics on the Web of Linked Data Rusty (Robert J.) Bobrow Jeff Berliner,
12/7/2015Page 1 Service-enabling Biomedical Research Enterprise Chapter 5 B. Ramamurthy.
RDF Access to Relational Databases Ashok Malhotra Oracle Corporation.
Kaiser Permanente Convergent Medical Terminology (CMT) Using Oxford RDFox and SNOMED for Quality Measures.
Toward a framework for statistical data integration Ba-Lam Do, Peb Ruswono Aryan, Tuan-Dat Trinh, Peter Wetz, Elmar Kiesling, A Min Tjoa Linked Data Lab,
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Semantic Web Portal: A Platform for Better Browsing and Visualizing Semantic Data Ying Ding et al. Jin Guang Zheng, Tetherless World Constellation.
Semantic Web COMS 6135 Class Presentation Jian Pan Department of Computer Science Columbia University Web Enhanced Information Management.
©2011 MFMER | slide-1 The Linked Clinical Data Project Jyotishman Pathak, PhD Rick Kiefer SemTIG November 4, 2011.
Chapter 04 Semantic Web Application Architecture 23 November 2015 A Team 오혜성, 조형헌, 권윤, 신동준, 이인용.
Linked Open Data for European Earth Observation Products Carlo Matteo Scalzo CTO, Epistematica epistematica.
Genomic Medicine Grid Juan Pedro Sánchez Merino Instituto de Salud Carlos III
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
Semantic and geographic information system for MCDA: review and user interface building Christophe PAOLI*, Pascal OBERTI**, Marie-Laure NIVET* University.
Java Web 应用开发: J2EE 和 Tomcat 蔡 剑, Ph.D.. 本讲内容 网络系统设计模式 综合案例分析.
Kaiser Permanente Convergent Medical Terminology (CMT)
Data.gov: Web, Data Web, Social Data Web 7/22/2010 #health2stat.
LexRDF: An Approach for Representing Biomedical Ontologies in RDF
LOD reference architecture
Helena F. Deus and Jonas S. Almeida
Service-enabling Biomedical Research Enterprise
Presentation transcript:

The work proposed in this study is an attempt to use Semantic Web technologies for integrating patient clinical data derived from Electronic Health Records (EHRs) with large-scale genomics data to study genotype-phenotype associations. This aim is achieved via:  RDF-based representation of clinical data from Mayo Clinic EHR systems exposed via multiple SPARQL endpoints  Patient demographics, diagnoses, procedures and medications  Coded with Meaningful Use terminologies  RDF-based representation of genetic data from Mayo Clinic biobank repository exposed via a SPARQL endpoint  Patient single nucleotide polymorphism (SNP) genotype data  Coded with gene and sequence ontologies  Federated SPARQL 1.1 queries integrating genotype data with patient clinical data  Perform a Phenome-Wide Association Study (PheWAS) that allows a systematic study of associations between a number of common genetic variations and variety of large number of clinical phenotypes From Relational Data Model to RDF Mapping to Querying via SPARQL SELECT ?ClinicNumber ?Diagnosis WHERE { SERVICE { ?s1 snomedct: ?clinicId. ?s1 gc:mayogcid ?mayogcId. ?s2 snomedct: ?patientId. ?s2 so:SO_ ?rsId. ?s2 so:SO_ ?genotype. FILTER (?patientId =?clinicId ) } SERVICE { ?s3 mclss: internalKey ?table1Key. ?s3 tmo:TMO_0031 ?Diagnosis. ?s4 mclss: internalKey ?table2Key. ?s4 snomedct: ?ClinicNumber. FILTER (?table1Key = ?table2Key ). } FILTER(?ClinicNumber = ?mayogcid). FILTER(regex(str(?rsId), "rs5219", "i")). FILTER(regex(str(?genotype), “T:T", "i")). so:. mayogc:PatientsMap a rr:TriplesMapClass; rr:tableName "patients_hypothyroidism"; rr:subjectMap [ rr:template " ]; rr:predicateObjectMap [ rr:predicateMap [ rr:predicate snomedct: ]; rr:objectMap [ rr:column "clinicId" ] ]; rr:predicateObjectMap [ rr:predicateMap [ rr:predicate mayogc:mayogid ]; rr:objectMap [ rr:column "mayogid" ] ]. mayogc:GenesMap a rr:TriplesMapClass; rr:tableName "patient_genotypes"; rr:subjectMap [ rr:template " ]; rr:predicateObjectMap [ rr:predicateMap [ rr:predicate snomedct: ]; rr:objectMap [ rr:column "patientId" ] ]; rr:predicateObjectMap [ rr:predicateMap [ rr:predicate so:SO_ ]; rr:objectMap [ rr:column "rsId" ] ]; rr:predicateObjectMap [ rr:predicateMap [ rr:predicate so:SO_ ]; rr:objectMap [ rr:column "genotype" ] ]. SNP-disease associations for T2DM SNP rs5219 within the gene KCNJ11 Mining Genotype-Phenotype Associations from Electronic Health Records and Biorepositories using Semantic Web Technologies Jyotishman Pathak, PhD Richard C. Kiefer, Robert R. Freimuth, PhD Suzette J. Bielinski, PhD Christopher G. Chute, MD, DrPH Division of Biomedical Statistics and Informatics, Department of Health Sciences Research Mayo Clinic, Rochester, MN Background and Aims The Linked Clinical Data (LCD) project at aims to develop a semantics-driven framework for high-throughput phenotype representation, extraction, integration, and querying from electronic medical records using emerging Semantic Web technologies, such as the W3C’s Linking Open Data project. The main goals of the LCD project are to:  Investigate ontology-based techniques for representing and encoding phenotype data derived from EHRs;  Develop a framework for publishing and integrating ontology-encoded structured phenotype data for federated querying using Linked Data principles and technologies, and  Propose and validate semantic reasoning techniques to support rapid cohort identification in chronic diseases. Linked Data refers to a set of best practices for publishing and linking pieces of data, information and knowledge in the Web. Core technologies supporting Linked Data:  URIs for identifying entities or concepts,  RDF data model and RDFS/OWL ontologies for representing, structuring and linking descriptions of entities as resources,  An endpoint providing access to the resources through SPARQL queries and  HTTP for retrieving resources or descriptions of the resources. W3C Linked Open Data project billion RDF triples, 2 million links billion RDF triples, 504 million links Linked Data Methods For more information – rsID rs5219 genotype T:T patientId clinicId MayogcId ClinicNumber table1Key RK4748 table2Key RK4748 diagnosis Type2 diabetes patient_demographics wh_demographics patient_genotypes wh_diagnosis  Use an ontology to describe the columns of the relational database  Map the model to express the relationship between nodes/edges  Write a SPARQL query based on the mapping  Workflow diagram of how the data is traversed  Sample query results Results: Type 2 Diabetes Mellitus  A query determines all the individuals having a SNP associated with Type 2 Diabetes Mellitus and retrieves the clinical diagnoses (represented as ICD-9-CM codes) for each eligible subject  Using AHRQ’s Clinical Classification Software, clustering is done for creating a manageable number of clinically meaningful categories  Client applications send query requests  Using the Linked Data API, the request is translated into a federated SPARQL 1.1 query  Patient data stored in RDBMS are surfaced as an endpoint  SPARQL queries are automatically translated into SQL statements using applications, such as Spyder  Results are returned in XML, RDF or JSON formats Architecture