A need for prescriptive define.xml

Slides:



Advertisements
Similar presentations
Dimitri Kutsenko (Entimo AG)
Advertisements

CDISC Open Source and low-cost Solutions
Communicating with Standards Keeping it Simple Pamela Ryley Vertex Pharmaceuticals, Inc. September 29, 2006.
Experience and process for collaborating with an outsource company to create the define file. Ganesh Sankaran TAKE Solutions.
SEND Standard for the Exchange of Nonclinical Data
Kendle Implementation of Clinical Data Acquisition Standards Harmonization Dr Elke Sennewald Kendle 9th German CDISC User Group Meeting Berlin, 28 September.
Dr Gordon Russell, Napier University Unit Data Dictionary 1 Data Dictionary Unit 5.3.
© CDISC 2014 Asthma TAUG Metadata Bundle Review – Experimental Content PhUSE Computational Science Symposium March th, 2015 Sam Hume 1.
Monika Kawohl Statistical Programming Accovion GmbH Tutorial: define.xml.
Updates on CDISC Standards Validation
Beyond regulatory submission - Standards Metadata Management Kevin Lee CDISC NJ Meeting at 06/17/2015 We help our Clients deliver better outcomes, so.
PhUSE SDE, 28-May A SAS based Solution for define.xml Monika Kawohl Statistical Programming Accovion.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
Antje Rossmanith, Roche 14th German CDISC User Group, 25-Sep-2012
Confidential - Property of Navitas Accelerate define.xml using defineReady - Saravanan June 17, 2015.
MODULE B: Case Report Forms Jane Fendl & Denise Thwing April 7, Version: Final 07-Apr-2010.
Second Annual Japan CDISC Group (JCG) Meeting 28 January 2004 Julie Evans Director, Technical Services.
SDTM Validation Delaware Valley CDISC user network Ketan Durve Johnson and Johnson Pharmaceutical Reasearch and Development May 11 th 2009.
1 What’s Next for Financial Management Line of Business (FMLoB)? AGA/GWSCPA 6 th Annual Conference Dianne Copeland, Director, FSIO May 8, 2007.
1 Collection Specific Vocabularies March Terminology CB - abbreviation for collection builder CV - abbreviation for controlled vocabulary.
WG4: Standards Implementation Issues with CDISC Data Models Data Guide Subteam Summary of Review of Proposed Templates and Next Steps July 23, 2012.
NIEM Information Exchange Package Documentation (IEPD) Mini Kanwal NIEM Technical Advisor Department of Homeland Security September, 7 th 2006.
AL-MAAREFA COLLEGE FOR SCIENCE AND TECHNOLOGY INFO 232: DATABASE SYSTEMS CHAPTER 1 DATABASE SYSTEMS Instructor Ms. Arwa Binsaleh.
CDISC User Group in Deutschland/Japan Hajime Shimizu (nickname: Akiba) CDISC Japan User Group introduction to team activity.
Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary.
ISO 191** Overview A “Family” of Standards. Resources ISO Standards Web Page – Technical.
Data Management in Clinical Research Rosanne M. Pogash, MPA Manager, PHS Data Management Unit January 12,
Public Libraries Survey Data File Overview. What We’ll Talk About PLS: Public Libraries Survey State level data Public library data (Administrative Entities)
Public Libraries Survey Data File Overview. 2 What We’ll Talk About PLS: Public Library Survey State level data Public library data (Administrative Entities)
How good is your SEND data? Timothy Kropp FDA/CDER/OCS 1.
© CDISC 2015 Paul Houston CDISC Europe Foundation Head of European Operations 1 CTR 2 Protocol Representation Implementation Model Clinical Trial Registration.
How Good is Your SDTM Data? Perspectives from JumpStart Mary Doi, M.D., M.S. Office of Computational Science Office of Translational Sciences Center for.
Most Common Issues in Define.xml files
Database Principles: Fundamentals of Design, Implementation, and Management Chapter 1 The Database Approach.
Practical Considerations for Data Validation
Dave Iberson-Hurst CDISC VP Technical Strategy
Monika Kawohl Statistical Programming Accovion GmbH
Validation of CDISC data sets, current practice and future
Modern Systems Analysis and Design Third Edition
Chapter 18 Maintaining Information Systems
SAP University Alliances
Experience and process for collaborating with an outsource company to create the define file. Ganesh Sankaran TAKE Solutions.
Updates on CDISC Standards Validation
Accelerate define.xml using defineReady - Saravanan June 17, 2015.
Definition SpecIfIcatIons
Beyond regulatory submission - Standards Metadata Management Kevin Lee CDISC NJ Meeting at 06/17/2015 We help our Clients deliver better outcomes, so.
Accenture Accelerated R&D Standards Metadata Management – version control and its governance Kevin Lee CDISC NJ Meeting at 01/28/2015 We help our Clients.
Verification and Validation
Template library tool and Kestrel training
Maintaining the Clinical & Nonclinical Study Data Reviewer’s Guides
MAKE SDTM EASIER START WITH CDASH !
Quality Control of SDTM Domain Mappings from Electronic Case Report Forms Noga Meiry Lewin, MS Senior SAS Programmer The Emmes Corporation Target: 38th.
Understanding Inheritance
Freundschaft Konferenz
CDISC SHARE API v1.0 CAC Update 22 February 2018
SDMX: A brief introduction
Modern Systems Analysis and Design Third Edition
Practical Considerations for Data Validation
CDISC UK Network Jun-16 – Welwyn
Definition SpecIfIcatIons
To change this title, go to Notes Master
Fabienne NOEL CDISC – 2013, December 18th
Modern Systems Analysis and Design Third Edition
WG4: Data Guide/Data Description Work Group Meeting
SDTM and ADaM Implementation FAQ
Metadata The metadata contains
The ultimate in data organization
The role of metadata in census data dissemination
An exploration of quality gaps in SDTM implementation activities and ideas on how to address these gaps through appropriate resourcing Dianne Weatherall:
WHERE TO FIND IT – Accessing the Inventory
Presentation transcript:

A need for prescriptive define.xml NJ CDISC User Group Bayer HealthCare Pharmaceuticals Inc., Whippany, NJ Sergiy Sirichenko April 11, 2016

Abbreviations CT – CDISC Control Terminology EDC – Electronic Data Capture MDR – Metadata Repository SDRG – Study Data Reviewers Guide TCG – FDA Submission Data Technical Conformance Guide VLM – Value Level Metadata

Highlights Current problems with define.xml What is metadata? Use it or lose it Benefits of prescriptive approach Implementation considerations

Major problems in define.xml Usage of outdated Define-XML v1.0 Lack of expertise Missing study specific metadata (a major purpose of define.xml)

Outdated Define-XML v1.0 10+ years old The first versions are never perfect Value Level metadata cannot be clearly specified No reference to applied variable Multiple conditions are not available Lack of structural consistency Origin and other related attributes Codelists

FDA Request for V2.0 Define-XML v2.0 released in 2013 Resolves most of the prior version’s limitations The industry has been to slow to implement v2.0 New FDA TCG recommends v2.0 as “preferred version” FDA support for v1.0 will end for studies that starts 12 month after March 15, 2017 (Federal Register. March 2016)

Lack of expertise Technical compliance issues ISO8601 format is defined as a codelist Broken XML structure Confusion about correct Origins Assigned or Derived? Confusion about correct implementation of codeLIsts New PhUSE working group for developing Define.xml Completion Guide

Missing study specific metadata Study specific information is crucial for reviewers However in most submission packages it’s missing Value of define.xml, SDRG, aCRFs is to explain what is unique in this particular study

Missing codelists Codelists are limited to variables which are assigned to standard CT Commonly missing study specific codelists for variables Category (--CAT), Subcategory (--SCAT) EXTRT, ARMCD, --TESTCD/--TEST, QNAM, TPT RDOMAIN in CO and RELREC domains XXTOX, …

Merged codelists Due to confusion between Standard CT Codelist and study Variable Codelist Example: Define.xml has one codelist (UNIT) assigned to all --DOSU, --VAMTU, --ORRESU, --STRESU variables This codelist includes all unique terms across all study “units” variables and have 450 items, while for example EXDOSU variable is populated with one “mg” term only A reference to 450-terms codelist is not relevant

What is define.xml codelist? Define.xml codelist describes data collection process and should be limited to all terms used for data collection of specific data element (a particular Variable or Value Level) For example, LBSTRESU, EGORRESU, EXDOSU usually have separate codelists based on the same (UNIT) standard CT If data is collected as a free text, then codelist may be not applicable Common example is CMDOSU, CMDOSFRQ, CELOC, etc.

Missing terms in codelist Term is present in data SD0037 check Programming error Due to misspelling, leading space characters, etc. For example, “ M” Due to missing Decoded value for some items CodeList vs. EnumaretedItem Codelist was populated based on collected data, but some options from CRF page are not included Example: Only race “WHITE” is collected, while 6 options are present on CRF

Why do we have problems with study metadata? There are some obvious challenges Lack of expertise Lack of good tools Education and technology can help But what about missing study specific metadata? Tools and expertise will be not enough

What is Metadata? Common definition: “data about data” “physical data and knowledge-containing info about business, tech processes, and data , used by corporation” Physical data: stored in software and machine-readable media Knowledge: retained by employees and other various media Highly utilized internal knowledge is often not documented

Process for descriptive define.xml Most commonly used approach Based on finalized data For and before regulatory submission Advantages Can be easily automated No need to worry until submission

Process for descriptive define.xml

Define file is not used internally Study metadata is usually stored as Non-machine readable mapping specs in excel with free text entry format Different specs from multiple systems People knowledge Define.xml is not utilized internally and created only as a “required” file for submission Considered as extra burden Focus on populating a file rather than its high quality content

Reviewer vs. Programmer FDA reviewer is limited to submitted study metadata Define.xml, aCRF, Reviewer’s Guide Company programmer uses different sources of study knowledge EDC specs Excel specs for mapping and programming Emails and discussions with team members Descriptive Define.xml is not used in internal processes

Process for prescriptive define.xml Study metadata is defined in advance Based on Protocol and SAP Utilizing industry and company standards “Begin with the end in mind” approach Prescriptive define.xml is used as study data spec and input for study specific data validation

Process for prescriptive define.xml

Benefits of prescriptive define.xml Done in advance Avoids “too late to fix” problems Early planning allows implementation oriented to regulatory expectations rather than driven by already collected data with potential issues

Benefits of prescriptive define.xml (2) Utilization of company standards and best practices Controlled by company rather than the result of inconsistent implementations by different teams and vendors Prescriptive define.xml is used as study data spec and input for study specific data validation Utilization of define.xml in data processing increases the quality of study metadata

Data Standards Library Finalize for Publishing Standards Management Establish Data Standards Library Define Company based CDISC Standards Define Company based Non-CDISC Standards Define Company Terminology EDC, Labs, IVRS, ePRO, etc ... Custom codelists Existing codelists Subsets Extensions Custom Domains, Methods Value Level, etc. Create Study Spec Select study specific elements from library Domains Variables Code lists Methods Value Level Documents Comments Finalize for Publishing Compare to - Company standards - CDISC Standards - Other Studies - Other versions Export Spec as - Define.xml - Excel Validate Execute business rules against the study spec itself Modify spec based on issues 4 blocks Define standard library Define terminology library

Company standardized metadata Define-XML (v2.0) is a good, simple and well structural metadata standard If you use metadata for managing internal processes, why not base it on Define-XML? Industry standard Simple, logical, structural, process oriented Can be easily extended for particular needs Define-XML+ Easy/automatic import of company specs into define.xml

Technical requirements Prescriptive approach needs Metadata repository Standards management Simple user interface Version control

What is wrong with Excel? Common practice is a usage of Excel for specs Free text entry does not allow entry control Misspelling Inconsistency within specs and across studies Single user (personal) document rather than a controlled multiuser environment Missing features Versioning Comparison On-line validation / edit checks Centralized

Summary Descriptive define.xml is not used in data processing, so it’s unlikely to result in high quality Utilization of prescriptive define.xml in data processing increases quality of study metadata Prescriptive metadata requires a new process and special tools for standards management Prescriptive define.xml can serve as a backbone for company data standardization process including standards management, programming specifications and validation

Contact info: Sergiy Sirichenko ssirichenko@pinnacle21.net