Metadata Driven Clinical Data Integration – Integral to Clinical Analytics April 11, 2016 Kalyan Gopalakrishnan, Priya Shetty Intelent Inc. Sudeep Pattnaik,

Slides:



Advertisements
Similar presentations
DCMI Workshop on Metadata and Search Vendor Panel Presentation Bradley P. Allen
Advertisements

EMRLD A RIM-based Data Integration Approach Pradeep Chowdhury Manager, Data Integration.
Managing and Analyzing Clinical Data
Building an Operational Enterprise Architecture and Service Oriented Architecture Best Practices Presented by: Ajay Budhraja Copyright 2006 Ajay Budhraja,
Copyright © 2013, SAS Institute Inc. All rights reserved. LEVERAGE THE CDISC DATA MODEL TO STREAMLINE ANALYTICAL WORKFLOWS KELCI J. MICLAUS, PH.D. RESEARCH.
ITEC 423 Data Warehousing and Data Mining Lecture 3.
Validata Release Coordinator Accelerated application delivery through automated end-to-end release management.
SAS® Data Integration Solution
Components and Architecture CS 543 – Data Warehousing.
Integration and Insight Aren’t Simple Enough Laura Haas IBM Distinguished Engineer Director, Computer Science Almaden Research Center.
Data Warehouse success depends on metadata
Page 1Prepared by Sapient for MITVersion 0.1 – August – September 2004 This document represents a snapshot of an evolving set of documents. For information.
Business Intelligence System September 2013 BI.
Data Warehouse Components
The Multi-model, Metadata-driven Approach to Content and Layout Adaptation Knowledge and Data Engineering Group (KDEG) Trinity College,
Managing Master Data with MDS and Microsoft Excel
Research team members Adaptive Complex Enterprise Data Warehousing Repository Generation Semantic Web Knowledge Extraction.
LEVERAGING THE ENTERPRISE INFORMATION ENVIRONMENT Louise Edmonds Senior Manager Information Management ACT Health.
© 2003, Prentice-Hall Chapter Chapter 2: The Data Warehouse Modern Data Warehousing, Mining, and Visualization: Core Concepts by George M. Marakas.
Cognizant Reusable Automation Framework for Testing C.R.A.F.T.
Software Architecture April-10Confidential Proprietary Master Data Management mainly inspired from Enterprise Master Data Management – An SOA approach.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | OFSAAAI: Modeling Platform Enterprise R Modeling Platform Gagan Deep Singh Director.
BUSINESS INTELLIGENCE/DATA INTEGRATION/ETL/INTEGRATION AN INTRODUCTION Presented by: Gautam Sinha.
FROM DATA STORE TO DATA SERVICES - DEVELOPING SCALABLE DATA ARCHITECTURE AT SURS Tomaž Špeh UNECE Workshop on the Modernisation of Statistical Production.
Beyond regulatory submission - Standards Metadata Management Kevin Lee CDISC NJ Meeting at 06/17/2015 We help our Clients deliver better outcomes, so.
Basic Concepts of Datawarehousing An Overview Prasanth Gurram.
JumpStart the Regulatory Review: Applying the Right Tools at the Right Time to the Right Audience Lilliam Rosario, Ph.D. Director Office of Computational.
Understanding Data Warehousing
Best Practices for Data Warehousing. 2 Agenda – Best Practices for DW-BI Best Practices in Data Modeling Best Practices in ETL Best Practices in Reporting.
Vertex and CDISC / MBC / 12March Vertex and CDISC Accomplishments and Strategy 12 March 2008 Lynn Anderson Associate Director Statistical Programming/Biometrics.
© 2008 Octagon Research Solutions, Inc. All Rights Reserved. 2 Octagon Research Solutions, Inc. Leading the Electronic Transformation of Clinical R&D ©
Antje Rossmanith, Roche 14th German CDISC User Group, 25-Sep-2012
December 15, 2011 Use of Semantic Adapter in caCIS Architecture.
Confidential - Property of Navitas Accelerate define.xml using defineReady - Saravanan June 17, 2015.
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
Using SAS® Information Map Studio
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 10: The Data Warehouse Decision Support Systems in the 21 st.
4.2.1 Programming Models Technology drivers – Node count, scale of parallelism within the node – Heterogeneity – Complex memory hierarchies – Failure rates.
Slide 1. © 2012 Invensys. All Rights Reserved. The names, logos, and taglines identifying the products and services of Invensys are proprietary marks.
Right In Time Presented By: Maria Baron Written By: Rajesh Gadodia
Research Project on Metadata Extraction, Exploration and Pooling: Challenges and Achievements Ronald Steinhau (Entimo AG - Berlin/Germany)
3 Copyright © 2009, Oracle. All rights reserved. Accessing Non-Oracle Sources.
The Use of Metadata in Creating, Transforming and Transporting Clinical Data Gregory Steffens Director, Data Management and SAS Programming ICON Development.
OLAP in DWH Ján Genči PDT. 2 Outline OLAP Definitions and Rules The term OLAP was introduced in a paper entitled “Providing On-Line Analytical.
Open GSBPM compliant data processing system in Statistics Estonia (VAIS) 2011 MSIS Conference Maia Ennok Head of Data Warehouse Service Data Processing.
Information Integration 15 th Meeting Course Name: Business Intelligence Year: 2009.
1 Copyright © Oracle Corporation, All rights reserved. Business Intelligence and Data Warehousing.
Data Management: Data Processing Types of Data Processing at USGS There are several ways to classify Data Processing activities at USGS, and here are some.
Copyright © 2006, Oracle. All rights reserved. Czinkóczki László oktató Using the Oracle Warehouse Builder.
Data Warehouse – Your Key to Success. Data Warehouse A data warehouse is a  subject-oriented  Integrated  Time-variant  Non-volatile  Restructure.
De Rigueur - Adding Process to Your Business Analytics Environment Diane Hatcher, SAS Institute Inc, Cary, NC Falko Schulz, SAS Institute Australia., Brisbane,
© 2006 Epiance, Inc. Confidential and Proprietary 1.
Dave Iberson-Hurst CDISC VP Technical Strategy
CIM Modeling for E&U - (Short Version)
Data Warehouse Components
Overview of MDM Site Hub
Accelerate define.xml using defineReady - Saravanan June 17, 2015.
Pentaho 7.1.
Patterns emerging from chaos
Data Warehouse Overview September 28, 2012 presented by Terry Bilskie
System Modeling Assessment & Roadmap Joint OMG/INCOSE Working Group
Metadata Construction in Collaborative Research Networks
THE ARCHITECTURAL COMPONENTS
Metadata The metadata contains
Remedy Integration Strategy Leverage the power of the industry’s leading service management solution via open APIs February 2018.
Data Wrangling as the key to success with Data Lake
Presentation transcript:

Metadata Driven Clinical Data Integration – Integral to Clinical Analytics April 11, 2016 Kalyan Gopalakrishnan, Priya Shetty Intelent Inc. Sudeep Pattnaik, Founder, Thoughtsphere

Copyright © 2016 Intelent Inc. All rights reserved. Agenda  Role of Dynamism and Automation in Integration  Integration – Two Approaches  Metadata Driven Data Processing  Metadata Driven Flows - Two Approaches

Copyright © 2016 Intelent Inc. All rights reserved. Role of Dynamism and Automation in Integration Drivers for Dynamism and Automation Dynamism Automation Source structure, transformation rules, target structure based on study analytical needs. Most could vary across studies. This warrants a set of dynamic transformation rules to accommodate heterogeneous needs. In addition the structure of the source, the physical storage, maturity of the data transfer mechanism and relevant data dictionaries could vastly vary as well Important to minimize and possibly avoid any code change, transformation pre-processing services in data ingestion layer. These are simply costly and time consuming, and discourages adoption within the enterprise. Storage structures at appropriate levels of hierarchy and stages of data lifecycle need to be dynamic Such dynamism needs to be planned either by leveraging existing metadata or manufactured metadata Alternatively or in addition, a robust user interface or means of configuration can address gaps. Key is to minimize code change. High availability of data to points of analysis From disparate sources: Raw source data, integrated data across CTMS, IxRS, EDCs, Labs, Reconciled, cleansed/not cleansed, aggregated data Based on use cases - interim analysis, submission, operational metrics, central monitoring, medical monitoring etc. Key is to automate data delivery in appropriately usable format with minimal manual intervention

Copyright © 2016 Intelent Inc. All rights reserved. Integration – Two Approaches Warehouse ApproachIntegration AspectsHub Approach Storage and Modeling Source Data Integration Data Processing Pre-Modeling Required Structure oriented Generic content model (schema) required based on storage technology. For Ex. Form/Domain level storage No Pre-Modeling, Loosely coupled Storage granularity preserved as per the source system. Data tagged at appropriate level after reconciliation. Requires source system adapters, Pre formatting to warehouse structure – ETL approach. Enabling dynamism and automation for transformations, requires: Availability of a repository of governed metadata – structural and transformational. Interface that allows study level mappings and leveraging existing library of rules Multiple adapter development, especially with external sources (Labs/partner data) System agnostic Integration. Data is ingested at source level granularity without pre-processing – ELT approach. Requires source feeds to adhere to input descriptions or requires setup / configuration Robust mapping user Study level which utilizes a mapping library with auto (machine) learning technologies – promotes mapping reuse across studies Post processing pipeline architecture Heavy reliance on data pre-processing before loading into the warehouse Time consuming and costly Transformations accomplished on an as- needed basis, in a post-processing layer, based on business needs. For Ex: Operational review processes need subject level data granularity Bio-statistical programming processes need SDTM +/- domain level tabulated data

Copyright © 2016 Intelent Inc. All rights reserved. Metadata Driven Data Processing Business Issue How do we provide quicker access to source and analysis ready data? How do we adapt to changes in regulatory standards rapidly and apply these changes to business and operational processes? How do we bring in more efficiency in the source to target mapping and transformation processes? Data Ingestion Framework ingests data from Diverse Sources (clinical data, operational data, reference data) Populate Structural Metadata (Source/Target/Reference) and Transformational Metadata (rules/derivations) in Metadata Repository Dynamic Process applies transformation rules on source data to generate target datasets Solution Overview Solution Impact High Availability of Data (Source, Integrated, Standardized) Reusability of Standard Algorithms Dynamic Automated Process Accelerated Path for Submissions Enhanced Support for Product Defense, Data Sharing, Data Mining Traceability

Copyright © 2016 Intelent Inc. All rights reserved. Approach 1 - Metadata Driven Dynamic SAS Engine Structural & Transformational Metadata extracted from Metadata Repository drives dynamic program for generating hybrid SDTM target datasets Dynamic SAS process leverages SAS Macros corresponding to transformational metadata Source to Target Transformations – Updates in metadata repository applied in next run, MedDRA Merge, ISO Formats

Copyright © 2016 Intelent Inc. All rights reserved. Approach 2 – ClinDAP - Thoughtsphere’s Metadata Driven Source System Agnostic Clinical Data Aggregation Framework Robust Mapping Framework – Reusable mapping library, Leverage existing SAS libraries, Specify complex study level transformations, Extensible targets – Hybrid SDTM, ADaM ClinDAP - Next Generation Data Aggregation Platform Source System Agnostic Data Aggregation Framework Proprietary algorithms to aggregate disparate data sources (EDC, CTMS, IVRS, Labs, ePro, etc.) Document-oriented database readily assembles any structured or unstructured data Robust Mapping Engine, extensible rule library reusable across studies (Hybrid SDTM) Interactive visualization-based data discovery Ability to operationalize analytics is possible when you enable automation and dynamism to integrate data and generate standardized datasets