A SAS macro to check SDTM domains against controlled terminology

Slides:



Advertisements
Similar presentations
Dimitri Kutsenko (Entimo AG)
Advertisements

Principal Statistical Programmer Accovion GmbH, Marburg, Germany
ADaM Implementation Guide: It’s Almost Here. Are You Ready?
Experience and process for collaborating with an outsource company to create the define file. Ganesh Sankaran TAKE Solutions.
Axio Research E-Compare A Tool for Data Review Bill Coar.
CDISC ADaM 2.1 Implementation: A Challenging Next Step in the Process Presented by Tineke Callant
Quick tour of CDISC’s ADaM standard
Foundations of Relational Implementation n Defining Relational Data n Relational Data Manipulation n Relational Algebra.
Kendle Implementation of Clinical Data Acquisition Standards Harmonization Dr Elke Sennewald Kendle 9th German CDISC User Group Meeting Berlin, 28 September.
© The McGraw-Hill Companies, 2006 Chapter 5 Arrays.
Geography 465 Overview Geoprocessing in ArcGIS. MODELING Geoprocessing as modeling.
Metadata Management – Our Journey Thus Far
© 2008 Octagon Research Solutions, Inc. All Rights Reserved. 1 PhUSE 2010 Berlin * Accessing the metadata from the define.xml using XSLT transformations.
7. German CDISC User Group Meeting Define.xml Generator ODM Validator (define.xml validation) 2010/03/11 Dimitri Kutsenko Marianne Neumann.
PhP Tutorial (3). Working with Arrays – Strings in PHP What is an Array.
Updates on CDISC Standards Validation
CDISC and how Stata can help us implement it
SDTM Validation Rules Sub-team CDISC INTRAchange Feb 26 th, 2014.
Accenture Accelerated R&D Standards Metadata Management – version control and its governance Kevin Lee CDISC NJ Meeting at 01/28/2015 We help our Clients.
23 August 2015Michael Knoessl1 PhUSE 2008 Manchester / Michael Knoessl Implementing CDISC at Boehringer Ingelheim.
Dominic, age 8, living with epilepsy SDTM Implementation Guide : Clear as Mud Strategies for Developing Consistent Company Standards PhUSE 2011 – CD02.
© 2011 Octagon Research Solutions, Inc. All Rights Reserved. The contents of this document are confidential and proprietary to Octagon Research Solutions,
Qian Zhao, J&J Consumer Companies, Inc. Jun (John) Wang, J&J Consumer China Ltd Ruofei Hao, J&J Consumer Companies, Inc. PharmaSUG 2015 Paper #BB11.
Antje Rossmanith, Roche 14th German CDISC User Group, 25-Sep-2012
Confidential - Property of Navitas Accelerate define.xml using defineReady - Saravanan June 17, 2015.
© Copyright 2008 ADaM Validation and Integrity Checks Wednesday 12 th October 2011 Louise Cross ICON Clinical Research, Marlow, UK.
Copyright © 2011, SAS Institute Inc. All rights reserved. Using the SAS ® Clinical Standards Toolkit 1.4 to work with the CDISC ODM model Lex Jansen SAS.
SDTM Validation Delaware Valley CDISC user network Ketan Durve Johnson and Johnson Pharmaceutical Reasearch and Development May 11 th 2009.
RCRIM Projects: Protocol Representation and CDISC Message(s) January 2007.
1. © CDISC 2014 SDS ELT Rules Team Update Stetson Line 08 Dec
The Use of Metadata in Creating, Transforming and Transporting Clinical Data Gregory Steffens Director, Data Management and SAS Programming ICON Development.
Writing and Reading XML files with SAS (Statistical Analysis System) What is SAS ? SAS Institute (or SAS, pronounced "sass") is an American developer of.
April ADaM define.xml - Metadata Design Analysis Results Metadata List of key analyses (as defined in change order) Analysis Results Metadata per.
How Good is Your SDTM Data? Perspectives from JumpStart Mary Doi, M.D., M.S. Office of Computational Science Office of Translational Sciences Center for.
Generation of real-time SDTM datasets and metadata through a generic SDTM converter mechanism CDISC (CDASH/SDTM) integration into OC/RDC Peter Van Reusel.
ADaM or SDTM? A Comparison of Pooling Strategies for Integrated Analyses in the Age of CDISC Joerg Guettner, Lead Statistical Analyst, Bayer Pharma, Wuppertal,
Mark Wheeldon, Formedix CDISC UK Network June 7, 2016 PRACTICAL IMPLEMENTATION OF DEFINE.XML.
Submission Standards: The Big Picture Gary G. Walker Associate Director, Programming Standards, Global Data Solutions, Global Data Management.
More SQL: Complex Queries, Triggers, Views, and Schema Modification
A need for prescriptive define.xml
Domain Testing Functional testing which tests the application by giving inputs and evaluating its appropriate outputs. system does not accept invalid and.
Greg Steffens Noumena Solutions
Validation of CDISC data sets, current practice and future
7. German CDISC User Group Meeting Define
Experience and process for collaborating with an outsource company to create the define file. Ganesh Sankaran TAKE Solutions.
Views, Stored Procedures, Functions, and Triggers
Accelerate define.xml using defineReady - Saravanan June 17, 2015.
Accenture Accelerated R&D Standards Metadata Management – version control and its governance Kevin Lee CDISC NJ Meeting at 01/28/2015 We help our Clients.
Metadata Curator's Assessments on CDISC CT Mappings
SAS Data Set Utility with Excel Output
MAKE SDTM EASIER START WITH CDASH !
Creating ADaM Friendly Analysis Data from SDTM Using Meta-data by Erik Brun & Rico Schiller (CD ) H. Lundbeck A/S 13-Oct
Traceability between SDTM and ADaM converted analysis datasets
Patterns emerging from chaos
SDTM and ADaM Implementation FAQ
To change this title, go to Notes Master
Fabienne NOEL CDISC – 2013, December 18th
Review of Week 1 Database DBMS File systems vs. database systems
German Speaking CDISC User Network
Lab 2 and Merging Data (with SQL)
Lab 2 HRP223 – 2010 October 18, 2010 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected.
Review for Test1.
Generating Define.xml at Kendle using DefinedocTM
Standard Scripts Project 2
Generating Define.xml at Kendle using DefinedocTM
Arrays & Loops.
Arrays & Loops.
Work Stream Templates Basel, September 2, 2008.
PhilaSUG Spring Meeting June 05, 2019
German Speaking CDISC User Network
Presentation transcript:

A SAS macro to check SDTM domains against controlled terminology

When do we need this? During SAS development of SDTM domains Checking overall compliance to Controlled terminology (CT) for a study. Checking CT of one domain across multiple studies e.g. for pooling.

Process Flow: The metadata Let‘s focus on the Meta first.

Controlled terminology (CDISC) The first Metadata is the CT itself. Notice that the structure of the SAS dataset is slightly different: Headline information is maintained as rows.

CDISC SDTM 3.1.2 Validation Rules for controlled terminology (CT) …

Derive variables that have to comply to CT + In the next step I show how to determine the variables that have to comply to CT (ctvarsel) For simplicity of presentation I focus on one domain: AE

Intersection of two lists of values Assume we are looking at a single dataset AE. Which variables need to comply to CT? This comes down to creating the intersection of variables

Join metadata information (and create instructions) I will show how instructions are built for one domain ae and one variable

Build values list This is the end result of the following step

Creating a list iteratively For variables in ctvarsel.sas7bdat multiple observations could be present with different where conditions (e.g. TSPARMCD values in the TS domain). Therefore we need maintain all combinations of “variable” and “where”. Furthermore, assuming they are always allowed, empty values are added to the end of each list.

+ The violations dataset is the major dataset required for reporting

Creating the violations dataset

Violations dataset: One OBS per violation The violations dataset combines the SDTM dataset and the metadata needed to formulate the CT violation conditions (ctvarmd.sas7bdat). This is implemented via a %syscall set() statement to convert the values of the data step variables of dataset ctvarmd.sas7bdat into macro variables. These macro variables are then used in subsetting if statements to only keep observations of the SDTM dataset that contain a CT violation. Note that the where condition is only added to the if-statement if the variable “where” contains a value.

Extension: Multiple studies and multiple domains domains/datasets Studies (libnames) LIBNAMES = List of Libnames DSETLIST = List of domains/datasets Index _J Index _I

Extentsion: Multiple domains/datasets &DSETLIST.= ae eg vs … e.g.: &_dset1. = ae &_dset2. = eg &_dset3. = vs etc. How to create list of elements for later looping. The extensions are implemented applying appropriate macro-looping, starting with splitting up the lists (libnames and datasets) into single elements, e.g. where &DSETLIST is the list of user-specified datasets:

Extension: Multiple libnames When setting together SDTM Domain datasets their common variable attributes - label, type, length - have to be accounted for. This paper focuses on the length of the variables as this is probably the most critical attribute when dealing with SDTM domains. The following statements were used to create a macro variable &_lengthv that contains a length statement. It accounts for the maximum length of the variable observed across all libnames. This statement occurs within the looping through the datasets indicated by loop variable &_i; It assigns to all variables the maximum length across the all libnames.

Reporting and tabulation

Xml-Output (in Excel) http://support.sas.com/rnd/base/ods/odsmarkup/excelxp_help.html

XML Output

Conclusion Can be used within SAS programs ADaM or SDTM Multiple studies can be checked simultaneously Users can control and modify input metadata (sponsor or study specific codelists) Scope is addressed via macro parameters All domains of current study Specific domains across various studies etc.

Questions?