©2011 MFMER | slide-1 The Linked Clinical Data Project Jyotishman Pathak, PhD Rick Kiefer SemTIG November 4, 2011.

Slides:



Advertisements
Similar presentations
How to Set Up a System for Teaching Files, Conferences, and Clinical Trials Medical Imaging Resource Center.
Advertisements

Lecture plan Information retrieval (from week 11)
Consistent and standardized common model to support large-scale vocabulary use and adoption Robust, scalable, and common API to reduce variation in clinical.
©2011 MFMER | slide-1 The Linked Clinical Data Project Jyotishman Pathak, PhD HCLS TMO October 27, 2010.
Semantic Web Introduction
Chris Bizer, Richard Cyganiak: D2RQ – Lessons Learned ( ) W3C Workshop on RDF Access to Relational Databases October, 2007 — Boston, MA,
RDF and RDB 1 Some slides adapted from a presentation by Ivan Herman at the Semantic Technology & Business Conference, 2012.
Rockefeller Phenotyping Initiative Translational Key Function Committee 8/3/2010 Laboratory of Blood and Vascular Biology Laboratory of Human Genetics.
The National Center for Biotechnology Information (NCBI) a primary resource for molecular biology information Database Resources.
©2013 MFMER | slide-1 Building A Knowledge Base of Severe Adverse Drug Events Based On AERS Reporting Data Using Semantic Web Technologies Guoqian Jiang,
The work proposed in this study is an attempt to use Semantic Web technologies for integrating patient clinical data derived from Electronic Health Records.
© 2007 IBM Corporation IBM Emerging Technologies Enabling an Accessible Web 2.0 Becky Gibson Web Accessibility Architect.
EleMAP: An Online Tool for Harmonizing Data Elements using Standardized Metadata Registries and Biomedical Vocabularies Jyotishman Pathak, PhD 1 Janey.
MI807: Database Systems for Managers Introduction –Course Goals & Schedule –Logistics –Syllabus Review Relational DBMS Basics –RDBMS Role in Applications.
Multiple Tiers in Action
Getting Started with Microsoft SQL Server 2012 Express Edition Appendix A DAVID M. KROENKE and DAVID J. AUER DATABASE CONCEPTS, 6 th Edition.
Open Data API delivery “Open-XDX” David Webber, Information Architect, Oracle Public Sector Open Data Exchange October, 2012.
ONTOLOGY ENGINEERING Lab #9 - November 3, Linking Relational Databases to Ontologies 2  Relational databases are still a common means of storing.
©2011 Quest Software, Inc. All rights reserved. Steve Walch, Senior Product Manager Blog: November, 2011 Partner Training Webcast.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Publishing data on the Web (with.
Translational Research and Patient Safety in Europe TRANSFoRm: Requirements analysis for the learning healthcare system.
Managing & Integrating Enterprise Data with Semantic Technologies Susie Stephens Principal Product Manager, Oracle
Chris Hyzer University of Pennsylvania
PI Data Access via OLE DB/SQL
Rajashree Deka Tetherless World Constellation Rensselaer Polytechnic Institute.
THE NATIONAL CENTER FOR BIOMEDICAL ONTOLOGY BioPortal Updates and Planned Features Trish Whetzel May 3, 2012.
From Web 1.0  Web 3.0: Is RDF access to RDB enough? Vipul Kashyap Senior Medical Informatician, Clinical Informatics R&D Partners.
Information Extraction with Linked Life Data 19/04/2011.
Paul Groth VU University Amsterdam Convergence Meeting: Semantic Interoperability for Clinical Research & Patient.
Entity Recognition via Querying DBpedia ElShaimaa Ali.
Workshop – 10, December 2014, Berlin ICCS / NTUA Greece Efthymios Chondrogiannis An Intelligent Ontology Alignment Tool Dealing with Complicated Mismatches.
The Semantic Web Web Science Systems Development Spring 2015.
Digital Enterprise Research Institute HADA – An Access Controlled Application for Publishing and Discovering Linked Government Data Owen Sacco.
Fluvial Architecture Knowledge Transfer System (FAKTS): database interrogation through SQL queries Luca Colombera, Nigel P. Mountney Fluvial & Eolian Research.
September 30, 2002EON 2002Slide 1 Integrating Ontology Storage and Ontology-based Applications A lesson for better evaluation methodology Peter Mika:
SHARPn High-Throughput Phenotyping (HTP) November 18, 2013.
SQL Queries Relational database and SQL MySQL LAMP SQL queries A MySQL Tutorial and applications Database Building Assignment.
Relational Databases to RDF (a.k.a RDB2RDF) Juan F. Sequeda Dept of Computer Science University of Texas at Austin.
Master Informatique 1 Semantic Technologies Part 11Direct Mapping Werner Nutt.
Introduction to Test Director
NMED 3850 A Advanced Online Design January 12, 2010 V. Mahadevan.
 Open source RDF framework in Java.  Supports RDF Schema inferencing and querying.  Supports SPARQL 1.1 query, update, federated query.
The Development of the Ceramics and Glass website Mia Ridge Museum Systems Team Museum of London.
Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.
XML and Database.
CS453: Databases and State in Web Applications (Part 2) Prof. Tom Horton.
Processware 2016 Tech Launch. Welcome ! Technical Pre-Launch event for Processware 2016 First hotlab session Format of today Some talking and slides Break.
BBN Technologies Copyright 2009 Slide 1 The S*QL Plugin for Cytoscape Visual Analytics on the Web of Linked Data Rusty (Robert J.) Bobrow Jeff Berliner,
Marketing & Sales Projects Marketing & Sales Knows Program INTRANET WEB SITE - Tuesday, 2 nd of August of 2005 Valerian LARDILLIER.
Windows 7 WampServer 2.1 MySQL PHP 5.3 Script Apache Server User Record or Select Media Upload to Internet Return URL Forward URL Create.
RDF and Relational Databases
SDK Overview Rob DeCarlo Bechtel.
Semantic Web Portal: A Platform for Better Browsing and Visualizing Semantic Data Ying Ding et al. Jin Guang Zheng, Tetherless World Constellation.
Relational Database Systems Bartosz Zagorowicz. Flat Databases  Originally databases were flat.  All information was stored in a long text file, called.
Reportnet – progress and next steps Søren Roug European Environment Agency.
External Data Access Adam Rauch, 6/05/08 Team: Geoff Snyder, Kevin Beverly, Cory Nathe, Matthew Bellew, Mark Igra, George Snelling.
Physical Layer of a Repository. March 6, 2009 Agenda – What is a Repository? –What is meant by Physical Layer? –Data Source, Connection Pool, Tables and.
A discovery platform for translational research Núria Queralt Rosinach Integrative Biomedical Informatics Group (IBI) Research Programme on Biomedical.
External Data Access 5/29/08. Current Problems No way to load, process & analyze live Atlas data via critical analysis & programming tools (SAS, R, Perl)
Software Testing Training Online. Software testing is ruling the software business in current scenario. It provides an objective, independent view of.
Data Visualization with Tableau
Linked Data Competency Index
Accessing the Database Server: ODBC, OLE DB, and ADO
Data Virtualization Demoette… ODBC Clients
Linked Data Theatre Federated data.
Triple Stores.
Semantic Annotation service
A Scenario to Conceptually Illustrate
Construction of Enterprise Knowledge Graphs
Presentation transcript:

©2011 MFMER | slide-1 The Linked Clinical Data Project Jyotishman Pathak, PhD Rick Kiefer SemTIG November 4, 2011

©2011 MFMER | slide-2 Purpose The Linked Clinical Data (LCD) project aims to investigate emerging Semantic Web technologies for developing an ontology-driven framework for high- throughput phenotyping using Electronic Medical Records (EMRs) to analyze multi-factorial phenotypes. Investigate ontology-based techniques. Develop a framework for publishing and integrating. Propose and validate semantic reasoning techniques to support rapid cohort identification

©2011 MFMER | slide-3 LCD Architecture Med Index Virtuoso RDF View MCLSS Endpoint SPARQL SQL Linked Open Drug Data Endpoints Selector Thick Client Application Thin Client Application Mobile Client Application Health Quest MICS MRIS NRAF MCLSS Databases Web Server Virtual Server Viewer Formatter Linked Data API Response Request

©2011 MFMER | slide-4 Project – Automated SNPedia SNPedia contains a wealth of data but the information in the wiki is manually curated. The focus of this project is to automate the results using patient data. Using MCLSS, identify patients with specific conditions. Join with OMIM to determine the genetic locus associated with those conditions Join with dbSNP to identify potentially associated SNPs. Each of the joins will be done using a single federated SPARQL query. Results will then be compared to data in SNPedia

©2011 MFMER | slide-5 Disease to SNP architecture dbSNP OMIM MCLSS Databases Endpoints Patient Disease SNOMED/ICD9 Gene SNP Request Results RDF View Mapping SPARQL Query

©2011 MFMER | slide-6 dbSNP/OMIM federated query PREFIX omim: PREFIX dbsnp: SELECT DISTINCT ?rsID ?geneSymbol ?alleleName { SERVICE { SELECT ?geneSymbol ?alleleName WHERE { ?alleleVariant rdf:type omim:AllelicVariant; ?alleleName; omim:symbol ?geneSymbol. FILTER(regex(str(?alleleName), "Diabetes", "i")). } } SERVICE { SELECT ?rsID WHERE { ?s dbsnp:symbol ?geneSymbol; dbsnp:rsid ?rsID. } } }

©2011 MFMER | slide-7 Partial SPARQL results

©2011 MFMER | slide-8 Process – Creating dbSNP endpoint No endpoint could be found so one had to be created. Download dbSNP database from a Sybase dump Use Perl to filter the tables in order to isolate desired data and rewrite into tab delimited form. Create tables in mySQL and import the files. Use Virtuoso to link to the tables Create RDF views by mapping the table columns to the desired endpoint subjects

©2011 MFMER | slide-9 Hurdles Endpoints Difficult to find Unreliable up time Unknown age of data Schema documentation Environment Linux - could not find ODBC driver for Virtuoso Virtuoso Bridge did not work with db2 Virtual server – no admin permissions Windows 2008 server – bug in webDAV access

©2011 MFMER | slide-10 Hurdles Virtuoso Did not support federated queries until March. March release has bugs Unable to run SPARQL queries against non- local endpoints Federated queries of mixed location crashes the server Beta fix release has performance issues Documentation – outdated and poor navigation

©2011 MFMER | slide-11 Next steps MCLSS Identify small MCLSS views Federated query with SIDER and RxNorm Use TMO/etc for RDMS -> RDF mapping dbSNP RDF view Standardized RDMS -> RDF mapping Visual graph for dbSNP/OMIM SNPedia Alter Bob’s Perl script to download data Upload in mySQL for comparisions

©2011 MFMER | slide-12 Questions? Website Thank you! Bob Freimuth – Perl scripts to filter and transform the dbSNP database as well as invaluable sharing of genomic knowledge and advice.