Download presentation
Presentation is loading. Please wait.
1
Clinical Study Design Henrik Ekberg, MD, PhD Malmö, Sweden Associate Editor: American Journal of Transplantation 2003- Editorial Board Member: Transplantation 2004 - Transplant International 2004 - Clinical Transplantation 2008 - Journal of Transplantation 2008 - Guangzhou October 9, 2010
2
Rejection of submitted manuscript - various reasons Rejected on priority grounds: Maybe a good study –but not a topic of interest, or done before Rejected, not allowed resubmission: –a bad study; design problems, cannot be re- written in a good way Rejected but allowed resubmission: –no serious design problems, interesting topic, but needs to be rewritten for language, discussion, figures, tables, etc.
3
Rejection of submitted manuscript - various reasons Rejected on priority grounds: Maybe a good study –but not a topic of interest, or done before Rejected, not allowed resubmission: –a bad study; design problems, cannot be re- written in a good way Rejected but allowed resubmission: –no serious design problems, interesting topic, but needs to be rewritten for language, discussion, figures, tables, etc.
4
Study design alternatives Retrospective studies = Using medical charts of existing data Uncontrolled Case-controlled Hypothesis generating Prospective studies = Protocol directives for Rx and F/u Uncontrolled, one-arm, pilot Randomized Controlled Trial (RCT) Hypothesis testing
5
Clinical study design phases Phase 1 Drug action, metabolism, PK, PD, safety Phase 2 Limited (un)controlled study for efficacy and safety Phase 3 Large randomized multicenter study Determine efficacy and safety for FDA and EMEA Phase 4 After drug release: new uses of the drug Marketing
6
Hypothesis Appropriate population Clinically relevant achievement Adequately-powered End points Comparison group (placebo) Randomized Double-blind Intent-to-treat analysis Protocol Analysis plan Key Elements of Trial Quality
7
Experimental Hypothesis May be based on a pilot or retrospective study or on hopes for a new drug Drug A > drug B (or placebo) with regards to … Null hypothesis (H 0 ): A B (no difference) A < B (non-inferiority) Key Elements of Trial Quality
8
Appropriate population Include: Normal risk kidney transplant recipients from living or deceased donors Exclude: High risk patients, such as PRA > 20% (50%?) Retransplants (all?) High donor age ? Expanded donor criteria? Cold ischemia time ? HLA- DR mismatch ? Key Elements of Trial Quality
9
Hypothesis Appropriate population Clinically relevant achievement Adequately-powered End points Comparison group (placebo) Randomized Double-blind Intent-to-treat analysis Protocol Analysis plan Key Elements of Trial Quality
10
With one-year graft survival > 90% and acute rejection rates < 20% The Success
11
With one-year graft survival > 90% and acute rejection rates < 20% we have a high level of success and further improvement is difficult to achieve and demonstrate we need very large studies! The Problem
12
Primary end point The parameter on which 1. the hypothesis is based, to be verified or rejected 2. the sample size is calculated Secondary end points Additional parameters which may 1. describe the patients, events and results 2. be used for formulations of new hypotheses End Points and Sample Size
13
1. Select the primary end point 2. Clinically relevant achievement regarding end point = Difference between control and experimental groups e.g.: GFR increased by 10 ml/min AR rate reduced by 10% 3. Determine the number of patients in each group needed to verify that the difference between the groups most likely is true (<5% risk of mistake). 4. With a certain power and p-value. End Points and Sample Size
14
“We chose to study 313 patients in each group in order to have 80% power of detecting a 33% reduction in AR rate from a baseline rate of 30% with a significance level of 0.05”. End point: Acute Rejection Clinically relevant achievement: 33% reduction (from 30% to 20%) Power: 80% Significance level: 5% Therefore: Number of patients in each group: 313 End Points and Sample Size
15
“We chose to study 313 patients in each group in order to have 80% power of detecting a 33% reduction in AR rate from a baseline rate of 30% with a significance level of 0.05”. = p-value = 0.05; means a 5% risk of obtaining a group difference by chance (although there is no true difference). = 0.20 means a Power of 80%; that is 80% chance of obtaining a group difference and 20% risk of missing it (when there is a true difference). P 1 = 0.30 and ∆ = 0.10 (33% of p 1 ) End Points and Sample Size
16
“We chose to study 313 patients in each group in order to have 80% power of detecting a 33% reduction in AR rate from a baseline rate of 30% with a significance level of 0.05”. = p-value = 0.05; means a 5% risk of obtaining a group difference by chance (although there is no true difference). = 0.20 means a Power of 80%; that is 80% chance of obtaining a group difference and 20% risk of missing it (when there is a true difference). P 1 = 0.30 and ∆ = 0.10 (33% of p 1 ) End Points and Sample Size Question: If there is a true difference between the groups and we do 100 studies with 313 patients in each group How many studies will result in a group difference, that is at least a 33% reduction of AR? 1.5 studies 2.20 studies 3.80 studies 4.95 studies
17
“We chose to study 313 patients in each group in order to have 80% power of detecting a 33% reduction in AR rate from a baseline rate of 30% with a significance level of 0.05”. = p-value = 0.05; means a 5% risk of obtaining a group difference by chance (although there is no true difference). = 0.20 means a Power of 80%; that is 80% chance of obtaining a group difference and 20% risk of missing it (when there is a true difference). P 1 = 0.30 and ∆ = 0.10 (33% of p 1 ) End Points and Sample Size Question: If there is a true difference between the groups and we do 100 studies with 313 patients in each group How many studies will result in a group difference, that is at least a 33% reduction of AR? 1.5 studies 2.20 studies 3.80 studies 4.95 studies
18
“We chose to study 313 patients in each group in order to have 80% power of detecting a 33% reduction in AR rate from a baseline rate of 30% with a significance level of 0.05”. = p-value = 0.05; means a 5% risk of obtaining a group difference by chance (although there is no true difference). = 0.20 means a Power of 80%; that is 80% chance of obtaining a group difference and 20% risk of missing it (when there is a true difference). P 1 = 0.30 and ∆ = 0.10 (33% of p 1 ) End Points and Sample Size 80 studies will show a significant difference and 20 studies will not. Comment: 20% risk of not seeing a true difference is quite high
19
“We chose to study 313 patients in each group in order to have 80% power of detecting a 33% reduction in AR rate from a baseline rate of 30% with a significance level of 0.05”. = p-value = 0.05; means a 5% risk of obtaining a group difference by chance (although there is no true difference). = 0.20 means a Power of 80%; that is 80% chance of obtaining a group difference and 20% risk of missing it (when there is a true difference). P 1 = 0.30 and ∆ = 0.10 (33% of p 1 ) End Points and Sample Size Question: If there is not a true difference between the groups and we do 100 studies with 313 patients in each group. How many studies will result in a group difference? 1. 5 studies 2. 20 studies 3. 80 studies 4. 95 studies
20
“We chose to study 313 patients in each group in order to have 80% power of detecting a 33% reduction in AR rate from a baseline rate of 30% with a significance level of 0.05”. = p-value = 0.05; means a 5% risk of obtaining a group difference by chance (although there is no true difference). = 0.20 means a Power of 80%; that is 80% chance of obtaining a group difference and 20% risk of missing it (when there is a true difference). P 1 = 0.30 and ∆ = 0.10 (33% of p 1 ) End Points and Sample Size Question: If there is not a true difference between the groups and we do 100 studies with 313 patients in each group. How many studies will result in a group difference? 1. 5 studies 2. 20 studies 3. 80 studies 4. 95 studies
21
“We chose to study 313 patients in each group in order to have 80% power of detecting a 33% reduction in AR rate from a baseline rate of 30% with a significance level of 0.05”. = p-value = 0.05; means a 5% risk of obtaining a group difference by chance (although there is no true difference). = 0.20 means a Power of 80%; that is 80% chance of obtaining a group difference and 20% risk of missing it (when there is a true difference). P 1 = 0.30 and ∆ = 0.10 (33% of p 1 ) End Points and Sample Size Question: If there is not a true difference between the groups and we do 100 studies with 313 patients in each group. How many studies will result in a group difference? 5 studies will show a group difference although this is not true
22
“We chose to study 313 patients in each group in order to have 80% power of detecting a 33% reduction in AR rate from a baseline rate of 30% with a significance level of 0.05”. P-value = 5%; The risk of seeing a difference which is not true Power = 80%; The chance of seeing a difference which is true P 1 = 0.30 and ∆ = 0.10 (33% of p 1 ) End Points and Sample Size
23
p=0.05 Sample Size for Acute Rejection AR Treatment AR Control (P 1 ) PowerSample size 0.200.3080%313
24
p=0.05 Sample Size for Acute Rejection AR Treatment AR Control (P 1 ) PowerSample size 0.200.3080%313 0.150.2080%700
25
p=0.05 Sample Size for Acute Rejection AR Treatment AR Control (P 1 ) PowerSample size 0.200.3080%313 0.150.2080%700 0.150.2090%954 We need to do large multicenter studies !!!
26
Question: The primary end point (PEP) and 10 secondary end points (SEP) were analysed; SEP in two ways each. The PEP was NS, one of the SEP was stat sign (P<0.05). Why is the analysis more reliable for PEP than SEP? Is this significant result of the SEP reliable? 10 x 2 = 20 tests What is the probability of a “significant finding” by chance? End Points and Sample Size
27
The trap of multiple tests No. of independent tests 25102050 Probability of one or more p < 0.05 by chance 10%23%40%64%92% To keep = 0.05 accept as significant only p less than
28
The trap of multiple tests No. of independent tests 25102050 Probability of one or more p < 0.05 by chance 10%23%40%64%92% To keep = 0.05 accept as significant only p less than 0.0250.0100.0050.0020.001 Use p = 0.05 / no. of tests
29
Hypothesis Appropriate population Clinically relevant achievement Adequately-powered End points Comparison group (placebo) Randomized Double-blind Intent-to-treat analysis Protocol Analysis plan Key Elements of Trial Quality
30
Clinical End Points We want to achieve improvement in patient survival and graft survival These are the Clinical end points
31
Five cadaver kidney transplant recipients received azathioprine One patient survived 365 days, becoming the first successful cadaveric transplant Uncontrolled Trial: Patient Survival (n=5) Murray, et al. New Engl J Med 1963; 268:1315
32
1-year graft survival CsA.......72% Aza........52% 1-year patient survival CsA.......94% Aza........92% European Multicentre Trial Group. Lancet 1983; 2:986 p=0.001 RCT: Graft & Patient Survival (n=232) NS
33
Acute rejection is associated with graft survival Acute rejection became the surrogate end point for graft survival Where Did We Go From Here?
34
Acute rejection at 6 mo. MMF 2g...........20% MMF 3g...........17% Pla/Aza.............41% 1-year graft survival MMF 2g...........90% MMF 3g...........89% Pla/Aza............88% Halloran, et al. Transplantation 1996; 63:39 p<0.01 RCT: Acute Rejection (n=1493) NS
35
Conclusion of MMF trials: “Acute rejection was reduced but graft survival was not improved” Was this true - or a question of insufficient power of the study? What difference in graft survival should have been expected? Where Did We Go From Here?
36
Sample size and power to verify true differences in graft survival of 4% or 5%. Graft survival in treatment groups Difference in Graft survival Sample size at 80% power Power at sample size 150 86 %90 %4 %1 03719 % 75 %80 %5 %1 09118 % Ekberg H. Transpl Rev 2003; 17: 187
37
Surrogate Endpoint Definitions Clinical endpoint: A characteristic or variable that reflects how a patient feels, functions or survives. Surrogate endpoint: A biomarker that is intended to substitute for a clinical endpoint, and predict clinical benefit … Biomarkers Definitions Working Group. Clin Pharmacol Ther 2001; 69:89
38
Risk factors after transplantation Acute rejection Graft function New onset of diabetes mellitus Cholesterol levels Treatment failure (drug toxicity) Malignancy Do they predict graft or patient survival? Risk factors and potential End points
39
Possible Surrogate Endpoints Acute rejection Acute rejection + 1/Cr return to baseline 1-year graft function Composite end point Association or Prediciton ?
40
Acute Rejection with 1/Cr return to baseline Transplants 1995–2002 Log-rank P value for equality of strata ≤0.0001. Meier-Kriesche et al. ATC 2003. Time Posttransplantation (mo) 1.0 Graft Survival (%) 0 61218243036424854606672 0.9 0.8 0.7 0.6 0.5 0.4 AR-1/SCr worse than 5% from baseline 49.4% n = 55,092 n = 4,061 n = 2,782 n = 22,212 n = 2,669 n = 1,455 n = 2,891 n = 414 n = 221 AR-1/SCr within 5% from baseline 73.4% 73.1% No acute rejection
41
Predictive Quality for Graft Loss: AR vs. AR Without Return to Baseline 6 years 2 years Follow-up 38.527.6 15.89.2 Positive Predictive Value Acute Rejection No Return to Baseline Acute Rejection Meier-Kriesche et al. ATC 2003. Conclusion: AR and AR with return to baseline are associated but not predictive of graft survival
42
“Post-transplant Renal Function at 1 Year Predicts Long-Term Kidney Transplant Survival” 0 20 40 60 80 100 01224364860 Months Posttransplantation <1.0 1.0 - 1.5 1.6 - 2.0 2.1 - 2.5 2.6 - 3.0 >3 N = 61,157 Graft Survival (%) Hariharan S et al. Kidney Int. 2002; 62: 311.
43
ROC Plot for 7-Year Overall Graft Loss From 1-Year Creatinine Baseline Level AUC = 0.624 Sensitivity 1 - Specificity 0.0 1.00.9 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 0.10.20.30.40.50.60.7 0.8 3.0 2.5 2.3 2.1 2.0 1.9 1.8 1.7 1.6 1.5 1.4 1.3 1.2 1.0 1.1 ROC = receiver operator curve. H-U Meier-Kriesche
44
ROC Plot for 7-Year Overall Graft Loss From 1-Year Creatinine Baseline Level AUC = 0.624 Sensitivity 1 - Specificity 0.0 1.00.9 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 0.10.20.30.40.50.60.7 0.8 3.0 2.5 2.3 2.1 2.0 1.9 1.8 1.7 1.6 1.5 1.4 1.3 1.2 1.0 1.1 ROC = receiver operator curve. H-U Meier-Kriesche
45
Prediction Diagnostics for Seven Year Overall Graft Loss from One Year Creatinine Level Patient population: Adult first transplant recipients from USRDS database after 1988 with minimum seven years follow up Prediction Diagnostics SensitivitySpecificityPPVNPV Creatinine Cutoff Level 1.6 62%55%53%64% 1.8 48%71%58%62% 2.0 36%82%63%61% H-U Meier-Kriesche
46
Possible Surrogate Endpoints Acute rejection Acute rejection + 1/Cr return to baseline 1-year graft function Composite end point
47
Composite end point (CEP) 1,389 KTx at Univ of Minnesota 1985-1997 Creat at 1 year (Cr 12 ) Cr 12 3.0 -> 10 yr GS from 75% to 25% Suggested Composite End Point: Graft loss 2.0 Reduction of CEP incidence by 33% 626 patients in total needed in such study Paraskevas et al Transplantation 2003; 75: 1256
48
Composite end point (CEP) CEP definition: Occurrence of at least one Acute rejection, Graft loss, Death or S-Creat > 1.5 UNOS data base 1995-2000: 59,000 patients 61.2% met the CEP - Margin for improvement - Less number of patients needed Siddiqi et al ATC 2003; #1160 Hariharan et al AJT 2003; 3: 933
49
Composite end point (CEP) CEP: Not a surrogate end point – no prediction Not a clinical end point – incl ‘surrogate’ factors Weighted score: Death1.0 x proportion Graft loss0.5 x proportion Acute rej0.25 x proportion S-crea>1.5 0.25 x proportion Hariharan et al AJT 2003; 3: 933
50
Clinical end point (short term only) Alternatively; Clinical end point (“how the patient functions …”) without prediction of long-term patient or graft survival e.g. GFR (Cockcroft-Gault formula) at 12 mo. Symphony study e.g. New Onset of Diabetes After Transplantation (NODAT) according to American Diabetes Association (ADA) definitions
51
Conclusions on End Points What are the best end points? Acute rejection Acute rejection + 1/Cr return to baseline 1-year graft function Composite end point NODAT
52
Hypothesis Appropriate population Clinically relevant achievement Adequately-powered End points Comparison group Randomized Placebo-controlled Double-blind Intent-to-treat analysis Protocol Analysis plan Key Elements of Trial Quality
53
Question: We are designing a study on CNI nephrotoxicity and are discussing the treatment of the control group. It was decided to give them CsA with trough levels 200-400 ng/ml first 2 months and then 100-200 ng/ml months 3-12. OK? The Comparison Group
54
The benefits, risks, burdens and effectiveness of a new method should be tested against those of the best current prophylactic, diagnostic, and therapeutic methods. World Medical Association Declaration of Helsinki The Comparison Group
55
The new drug or method should hypothetically and potentially be better than the best known current treatment (= standard of care) - but not yet proven to be so The Study Group
56
Placebo vs Study Drug Study drug in addition to the best current regimen e.g. placebo vs daclizumab Old Drug vs New Drug Either drug in addition to the best current regimen e.g. Aza vs MMF Controlled trial
57
Hypothesis Appropriate population Clinically relevant achievement Adequately-powered End points Comparison group (placebo) Randomized Double-blind Intent-to-treat analysis Protocol Analysis plan Key Elements of Trial Quality
58
Random Assignment of Treatment Parameters associated with outcome should be similarly distributed between study and comparison groups Methods for example: computerized and via telephone 1:1 or 2:1 Stratification (per center or LD/DD) Randomized Controlled Trial
59
Double Blind Physician not knowing which treatment Patient not knowing Problems:drug administration drug monitoring Labs and visits the same in both groups Sometimes extra blood sampling in controls (ethics?) The Blind Treating The Blind
60
Hypothesis Appropriate population Clinically relevant achievement Adequately-powered End points Comparison group (placebo) Randomized Double-blind Intent-to-treat analysis Protocol Analysis plan Key Elements of Trial Quality
61
ITT analysis – the Standard method = All participating patients are included Does not exclude treatment failures Conclusion: “With this intention, we had the results...” Limitation of the ITT Analysis In a long-term study (e.g. 3 yrs), many patients would have switched therapy or been withdrawn Physicians regard the fate of the patient more important than the study -> Reduced differences between treatment groups Intention-to-Treat Analysis
62
Per Protocol (PP) Analysis = On-treatment analysis Emphasis on the positive results of treatment Excludes premature withdrawals (“failures”) Limitation of the PP analysis Conclusion: “Only in successful cases, we had these results...” “Only patients who could follow this protocol, …” -> Seriously biased results when excluding failures Per Protocol Analysis
63
Hypothesis Appropriate population Clinically relevant achievement Adequately-powered End points Comparison group (placebo) Randomized Double-blind Intent-to-treat analysis Protocol Analysis plan Key Elements of Trial Quality
64
Synopsis and Protocol Synopsis A short summary of the study protocol Used to invite investigators to participate Protocol A detailed description of all relevant aspects of the study Used to make sure all centers perform the study correctly Used for approval of Ethical Committee and Health Authorities Patient Information and Consent
65
Hypothesis Appropriate population Clinically relevant achievement Adequately-powered End points Comparison group (placebo) Randomized Double-blind Intent-to-treat analysis Protocol Analysis plan Key Elements of Trial Quality
66
Analysis Plan A detailed description of all analyses that are planned; statistical methods, outlines of tables and graphs Including: Primary end point to verify or reject the null hypothesis Secondary end points to further describe the data and formulate new hypotheses Secondary analyses (ad hoc, made after viewing the results and not part of the analysis plan) should be avoided Interim analyses - confidentially for Data Safety Monitoring Board (DSMB). To report in public interim results during the study should not be done!
67
Further Reading A Uniform Clinical Trial Registration Policy for Journals of Kidney Disease, Dialysis and transplantation Couser WG, AJT 2005; 5: 643 www.clinicaltrials.gov Design and Analysis of Clinical Trials in Transplantation Schold JD, AJT 2008; 8: 1779
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.