Monthly Program Update January 12, 2012 Andrew J. Buckler, MS Principal Investigator WITH FUNDING SUPPORT PROVIDED BY NATIONAL INSTITUTE OF STANDARDS AND.

Presentation transcript:

Monthly Program Update January 12, 2012 Andrew J. Buckler, MS Principal Investigator WITH FUNDING SUPPORT PROVIDED BY NATIONAL INSTITUTE OF STANDARDS AND TECHNOLOGY

Agenda: Monthly snapshot in Jira (including status of installation at NIST); QIBA 3A project snapshot; Theoretical development; Architecture and SW stack

BSD-2 license. Domain is [...]. Landing page provides access to prototypes, repositories for download and development, acknowledgements, Jira issue tracking, and documentation.

QIBA 3A PROJECT SNAPSHOT (recalling that this is a testbed for us)

Basic structure of the challenges
[Figure: Investigations 1 through n, each with a Pilot and a Pivotal phase and Train and Test sets; labeled Primary and Secondary; each with a defined set of data, a defined challenge, and a defined test set policy.]
First one: Presently in pilot phase, using StudyDescription method. Used batch scripting with reference method to aid data curation participants (about 20 QI-Bench users). First participant data received. Analysis plan using N-way ANOVA in R started. Pivotal phase starting with batch-assisted curation. Will be transitioning to database schema for metadata (gradually away from spreadsheet).

1. Relative performance is computed according to descriptive statistics.
2. We determine a group value for each of the descriptive statistics, e.g., as the mean plus 1 stdev (or as wide as we think wise).
3. Results are presented using radar plots.

[Radar plot: axes Bias, Variability, Repeatability, cross-x reproducibility, cross-y reproducibility; series New Method and Group.] In this example, the new proposed method does not perform well enough to be considered a valid method since it falls outside the group values.

[Radar plot: axes Bias, Variability, Repeatability, cross-x reproducibility, cross-y reproducibility; series New Method and Group.] In this example, the new proposed method is seen to perform within group values and may even help pave the way for an improved claim.
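The group-value comparison described on these slides can be sketched in a few lines. This is only an illustration under stated assumptions: the numbers are made up, every statistic is treated as "lower is better", and the names `group_values` and `outside_group` are hypothetical rather than part of QI-Bench.

```python
from statistics import mean, stdev

# Hypothetical descriptive statistics for the methods already in the group.
# Assumption: for every statistic, a lower value means better performance.
group_results = {
    "bias": [0.10, 0.12, 0.08, 0.11],
    "variability": [0.20, 0.25, 0.22, 0.21],
    "repeatability": [0.05, 0.06, 0.04, 0.05],
}


def group_values(results, k=1.0):
    """Group value per statistic: mean plus k sample standard deviations."""
    return {name: mean(vals) + k * stdev(vals) for name, vals in results.items()}


def outside_group(method, limits):
    """Names of the statistics on which the method exceeds the group value."""
    return sorted(name for name, value in method.items() if value > limits[name])


limits = group_values(group_results)
new_method = {"bias": 0.09, "variability": 0.30, "repeatability": 0.05}
failing = outside_group(new_method, limits)
print("within group values" if not failing else f"outside group values on: {failing}")
```

Widening the group value ("as wide as we think wise") corresponds to raising `k` in this sketch.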

THEORETICAL DEVELOPMENT: progress on the use of logical and statistical inference at each of two levels: technical performance of assay methods, and qualification of the biomarker in a specific clinical context

Another way to look at what needs to happen
[Diagram: a cycle of Specify (QIBO, RDF Triple Store), Formulate (Reference Data Sets), Execute, and Analyze, with Feedback. Example terms: CT Volumetry, CT, Tumor growth, Therapeutic Efficacy; relations: obtained_by, measure_of, used_for. Analyze model: $Y = \beta_{0..n} + \beta_1(\mathrm{QIB}) + \beta_2 T + e_{ij}$]

Specify: Establish a logical specification and set up terms for mathematical analysis
Functionality: Establish a means to semantically label imaging biomarker data, with emphasis on representing both the clinical context in which an imaging biomarker is used and the specifics of the imaging protocol used to acquire the images. Set up the logistic regression model: precisely specify the dependent variable; account for covariates; enumerate independent variables and error terms (sources of variability). Establish a database for collection of terms.
Method: Provide a GUI to traverse the QIBO concepts according to their relationships and create statements represented as RDF triples and stored in an RDF store. Each set of RDF triples will be stored as a “profile.” Relationship strength is initialized based on prior estimates (if available).
[Diagram: QIBO, Specify, RDF Triple Store. Example terms: CT Volumetry, CT, Tumor growth, Therapeutic Efficacy; relations: obtained_by, measure_of, used_for.]
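As a rough illustration of what a stored “profile” might look like, the sketch below holds a few triples in plain Python and serializes them as N-Triples. The subject/object pairing of the example terms, the `http://example.org/qibo#` namespace, and the names `Triple` and `to_ntriples` are assumptions made for the example; the slide only names the terms and relations.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str


# Assumed pairing of the slide's example terms and relations (illustrative only).
triples = [
    Triple("CTVolumetry", "obtained_by", "CT"),
    Triple("CTVolumetry", "measure_of", "TumorGrowth"),
    Triple("TumorGrowth", "used_for", "TherapeuticEfficacy"),
]

# A profile is a named set of triples. Relationship strength starts from a
# prior estimate when one is available; None marks "no prior estimate".
profile = {
    "name": "ct-volumetry-example",
    "triples": triples,
    "strength": {t: None for t in triples},
}


def to_ntriples(items, base="http://example.org/qibo#"):
    """Serialize triples as N-Triples lines under a placeholder namespace."""
    return "\n".join(
        f"<{base}{t.subject}> <{base}{t.predicate}> <{base}{t.obj}> ." for t in items
    )


print(to_ntriples(profile["triples"]))
```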

Ontologies supporting Specify
Extend the QIBO to link to existing established ontologies:
1. Leverage the BFO upper ontology to align different ontologies.
2. Convert portions of BRIDG and LSDAM to ontology models in OWL.
Automated conversion would be done in two steps:
1. Convert the current Sparx Enterprise Architect XMI to EMF UML format.
2. Export the resulting EMF UML into an RDF/OWL representation using TopBraid Composer.

Formulate: advanced query framework made possible by Specify
Allow users to select the profiles (sets of RDF triples) created in Specify, execute a query, and retrieve the results in various forms.
Assemble/transform the set of RDF triples into SPARQL queries:
1. Form an uninterrupted chain linking the instance of the input class from the ontology to the desired output class.
2. Formulate/invoke the necessary SPARQL queries against the web services deployed in the SADI framework.
Interface with the query engine, with offline (asynchronous) query execution capability.
Results to be exportable as serialized objects (RDF/XML and CSV).
[Diagram: Formulate, Reference Data Sets, RDF Triple Store. Example terms: CT Volumetry, CT, Tumor growth, Therapeutic Efficacy; relations: obtained_by, measure_of, used_for.]
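The two steps above (find an uninterrupted chain from the input class to the output class, then turn it into a SPARQL query) could look roughly like this. It is a sketch only: the triples, the `q:` prefix, and the functions `find_chain` and `chain_to_sparql` are hypothetical, and nothing here calls the SADI framework or a real endpoint.

```python
from collections import deque

# Hypothetical triples from a stored profile: (subject, predicate, object).
TRIPLES = [
    ("CTVolumetry", "obtained_by", "CT"),
    ("CTVolumetry", "measure_of", "TumorGrowth"),
    ("TumorGrowth", "used_for", "TherapeuticEfficacy"),
]


def find_chain(triples, start, goal):
    """Breadth-first search for a chain of triples linking start to goal."""
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        node, path = queue.popleft()
        if node == goal:
            return path
        for s, p, o in triples:
            if s == node and o not in seen:
                seen.add(o)
                queue.append((o, path + [(s, p, o)]))
    return None  # no uninterrupted chain exists


def chain_to_sparql(chain, prefix="http://example.org/qibo#"):
    """Build a SPARQL SELECT with one variable per class along the chain."""
    if not chain:
        raise ValueError("a non-empty chain is required")
    lines = [f"PREFIX q: <{prefix}>", f"SELECT ?v0 ?v{len(chain)} WHERE {{"]
    lines.append(f"  ?v0 a q:{chain[0][0]} .")
    for i, (_, p, o) in enumerate(chain):
        lines.append(f"  ?v{i} q:{p} ?v{i + 1} .")
        lines.append(f"  ?v{i + 1} a q:{o} .")
    lines.append("}")
    return "\n".join(lines)


chain = find_chain(TRIPLES, "CTVolumetry", "TherapeuticEfficacy")
query = chain_to_sparql(chain)
print(query)
```

Breadth-first search returns a shortest chain, which keeps the generated query small; a real query engine might prefer other criteria, such as relationship strength.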

Data Services supporting Formulate
Wrap existing data services such as NBIA, caArray, caTissue, AIM and PODS using Semantic Automated Discovery and Integration (SADI). This is enabled by metadata available through the UML representations of the models exposed by these services and the CDE annotations available for them through caDSR.
Describe service I/O semantically using the extended version of QIBO.
The SADI service registry will help the query engine compose computer-interpretable queries automatically.
Example: “there is a service that returns Biological Subjects that have undergone certain Biological Interventions”
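To make the role of semantically described service I/O concrete, here is a minimal sketch of a registry lookup by input and output class. It does not use the SADI API; `ServiceDescription`, `services_returning`, and the registered service names are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ServiceDescription:
    name: str
    input_class: str   # ontology class the service consumes
    output_class: str  # ontology class the service returns


# Hypothetical registry entries; the first mirrors the slide's example.
REGISTRY = [
    ServiceDescription("subjects_by_intervention", "BiologicalIntervention", "BiologicalSubject"),
    ServiceDescription("images_by_subject", "BiologicalSubject", "ImageSeries"),
]


def services_returning(registry, output_class, input_class: Optional[str] = None):
    """Find services by output class, optionally also constrained by input class."""
    return [
        s
        for s in registry
        if s.output_class == output_class
        and (input_class is None or s.input_class == input_class)
    ]


for service in services_returning(REGISTRY, "BiologicalSubject"):
    print(f"{service.name}: {service.input_class} -> {service.output_class}")
```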

Analyze: Use annotation and image markup to support statistical inference
Support: clinical performance assessment (i.e., in addition to current technical performance); outcome studies; integrated genomic/proteomic correlation studies; group studies for biomarker qualification (e.g., set up a basic multiple regression analysis).
Intent-to-treat analysis of the primary outcome via a covariance model of the general form
$(\mathrm{QIB}_t) = \beta_{0..n} + \beta_1(\mathrm{QIB}_0) + \beta_2 T + e_{ij}$
where $\mathrm{QIB}_t$ and $\mathrm{QIB}_0$ are the QIB at a time after treatment and at randomization respectively, $T$ is a treatment group indicator, and $\beta_{0..n}$, $\beta_1$, and $\beta_2$ are model parameters. $\beta_2$ represents the effect of treatment, and its estimate is the difference between group means on the log scale, after adjustment for any imbalance between the groups in log QIB. The error terms in the model, $e_{ij}$, are assumed mutually independent and normally distributed. Depending on the nature of the QIB, the log transformation may be used instead of the direct value. Likewise, calculations may be performed using z-scores, with corresponding conversion to and from raw values.
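A minimal sketch of fitting this covariance model by ordinary least squares follows. The data are synthetic, the parameter values are made up, and a single intercept stands in for $\beta_{0..n}$; the slides mention R for analysis, so the use of NumPy here is purely illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200

# Synthetic data (assumption): log QIB at randomization and a 0/1 treatment indicator T.
log_qib0 = rng.normal(3.0, 0.5, n)
treat = np.arange(n) % 2

# Made-up "true" parameters used only to generate the example outcome.
true_b0, true_b1, true_b2 = 0.2, 0.9, -0.3
log_qibt = true_b0 + true_b1 * log_qib0 + true_b2 * treat + rng.normal(0.0, 0.05, n)

# Design matrix: intercept, baseline log QIB, treatment indicator.
X = np.column_stack([np.ones(n), log_qib0, treat])
beta, *_ = np.linalg.lstsq(X, log_qibt, rcond=None)
b0, b1, b2 = beta

# b2 estimates the treatment effect on the log scale, adjusted for baseline.
print(f"estimated treatment effect (beta_2): {b2:.3f}")
```

Including the baseline value as a covariate is what provides the adjustment for imbalance between groups described above; dropping that column would reduce `b2` to the unadjusted difference in group means.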

Examples of output at biomarker (above the assay level)
From Jack. Note: W-score is the relative score of the measured HC volume corrected for intracranial volume and compared to age- and sex-adjusted normals.
[Figure: Hypothetical model of dynamic biomarkers of the Alzheimer's pathological cascade, Jack et al.]
To inform thresholding. To substantiate surrogacy (or its weaker form of “activity”).

ARCHITECTURE AND SW STACK: So what is a cohesive architecture that maximizes leverage of best thinking and existing touchpoints, and stays current over time?

[Workflow diagram: five steps, each with its supporting components]
Specify: Specify context for use and assay methods. Use consensus terms in doing so. Components: QIBO, AIM, RadLex/SNOMED/NCIt; built using Ruby on Rails.
Formulate: Assemble applicable reference data sets. Include both imaging and non-imaging clinical data. Components: caB2B, NBIA, PODS data elements, DICOM query tools.
Execute: Compose and iterate batch analyses on reference data. Accumulate quantitative read-outs for analysis. Components: MIDAS, BatchMake, Condor Grid; built using Zend on PHP.
Analyze: Characterize the method relative to intended use. Apply the existing tools and/or extend them. Components: MVT portion of AVT, re-useable library of R scripts.
Package: Compile evidence for regulatory filings. Use standards in transfer to regulatory agencies. Components: SDTM standard of CDISC into repositories like FDA’s Janus.

MVT: Reasonable framework, but many gaps
There are multiple possibilities to deploy it as a web application, some of which we’ve considered:
1. Re-implement the existing implementation to use GWT in place of Swing, inclusive of both the XIPHost and the MVT components, retaining the WG23 concept.
2. Re-implement only those parts necessary to perform the needed MVT functions using GWT, with enough data handling to do so but without doing everything necessary to retain the full XIPHost capability.
3. Leverage the GUI design concept but otherwise implement without starting from the Swing code.
In all cases, there is the secondary design alternative of introducing a RESTful web service layer explicitly or not.
(By the way, just for fun, I performed a conversion of the current Swing code to Ajax using AjaxSwing. I got most of AVT working over the web with minimal effort, but this isn’t a serious alternative because AjaxSwing has a license fee. I did it because I wanted to see how easy such a path would be. It’s an interesting capability! But irrelevant in the end.)

Pros: optimized for DICOM, works with workstations. Cons: hard to create web apps, not optimized for semantic web.
[Architecture diagram (XIP-based). Labels: HW; OS; VTK, ITK, AIMTK, other; grid client services; DICOM services (DCMTK); other services; WG 23 system services (plug and socket); application modules; XIP Application Inventor; XIP IDE; XIP middleware; DICOM services; IVI Middleware; caGrid; caBIG (caDSR, EVS, RadLex, AIM ontology, etc.); NCIA; NCI Protégé; client access; service access; Grid Data Service; Grid Analytical Service; AIM Data Service; XIP App Service; Host (WG23); DICOM image sources.]

Alternative architectural form…
SW stack: J2SE (J2EE compliant), MySQL, caGrid, Globus. Application: JBoss, caCore.
With pros and cons “opposite” to those of the XIP-based architecture.

Functionality view annotated with architecture
[Architecture diagram. XIP labels: HW; OS; VTK, ITK, AIMTK, other; grid client services; DICOM services (DCMTK); other services; WG 23 system services (plug and socket); application modules; XIP Application Inventor; XIP IDE; XIP middleware; DICOM services; IVI Middleware; caGrid; caBIG (caDSR, EVS, RadLex, AIM ontology, etc.); NCIA; NCI Protégé; client access; service access; Grid Data Service; Grid Analytical Service; AIM Data Service; XIP App Service; Host (WG23); DICOM image sources. MIDAS labels: MIDAS Core; Apache; file system; PostgreSQL; Publication DB; MIDAS Data Server; MIDAS e-journal; MIDAS Compute Server; MIDAS Visualization; MIDAS Client; MIDAS C++ API; MIDAS Web API. Workflow labels: when annotation and markup has already been done; reference data sets; annotation and markup; AIM-enabled (e.g., ClearCanvas) workstation; RIS worklist items; DICOM Q/R.]

First step to rationalizing architecture: mash them together and see what falls out
[Architecture diagram. Labels: NCIA; RadLex; AIM; NCI Protégé; EVS; XIP middleware; DICOM services; IVI Middleware; SADI framework (e.g., wrapped caGrid); caBIG (caDSR, EVS, RadLex, AIM ontology, etc.); client access; service access; Grid Data Service; Grid Analytical Service; AIM Data Service; HW; OS; VTK, ITK, AIMTK, other; grid client services; DICOM services (DCMTK); other services; WG 23 system services (plug and socket); application modules; XIP Application Inventor; XIP IDE; XIP App Service; Host (WG23); DICOM image sources.]
This is an ongoing discussion. More to come!


Value proposition of QI-Bench
Efficiently collect and exploit evidence establishing standards for optimized quantitative imaging:
– Users want confidence in the read-outs
– Pharma wants to use them as endpoints
– Device/SW companies want to market products that produce them without huge costs
– Public wants to trust the decisions that they contribute to
By providing a verification framework to develop precompetitive specifications and support test harnesses to curate and utilize reference data.
Doing so as an accessible and open resource facilitates collaboration among diverse stakeholders.

Summary: QI-Bench Contributions
We make it practical to increase the magnitude of data for increased statistical significance.
We provide practical means to grapple with massive data sets.
We address the problem of efficient use of resources to assess limits of generalizability.
We make formal specification accessible to diverse groups of experts who are not skilled or interested in knowledge engineering.
We map both medical and technical domain expertise into representations well suited to emerging capabilities of the semantic web.
We enable a mechanism to assess compliance with standards or requirements within specific contexts for use.
We take a “toolbox” approach to statistical analysis.
We provide the capability in a manner that is accessible to varying levels of collaborative models, from individual companies or institutions to larger consortia or public-private partnerships to fully open public access.

QI-Bench Structure / Acknowledgements
Prime: BBMSC (Andrew Buckler, Gary Wernsing, Mike Sperling, Matt Ouellette)
Co-Investigators
– Kitware (Rick Avila, Patrick Reynolds, Julien Jomier, Mike Grauer)
– Stanford (David Paik, Tiffany Ting Liu)
Financial support as well as technical content: NIST (Mary Brady, Alden Dima, Guillaume Radde)
Collaborators / Colleagues / Idea Contributors
– FDA (Nick Petrick, Marios Gavrielides)
– UCLA (Grace Kim)
– UMD (Eliot Siegel, Joe Chen, Ganesh Saiprasad)
– VUmc (Otto Hoekstra)
– Northwestern (Pat Mongkolwat)
– Georgetown (Baris Suzek)
Industry
– Pharma: Novartis (Stefan Baumann), Merck (Richard Baumgartner)
– Device/Software: Definiens (Maria Athelogou), Claron Technologies (Ingmar Bitter)
Coordinating Programs
– RSNA QIBA (e.g., Dan Sullivan, Binsheng Zhao)
– Under consideration: CTMM TraIT (Andre Dekker, Jeroen Belien)