High Resolution studies

Slides:



Advertisements
Similar presentations
TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST
Advertisements

Sampling: Theory and Methods
Quantitative Methods Topic 5 Probability Distributions
Detection of Hydrological Changes – Nonparametric Approaches
High Resolution Italian studies Pamela Minicozzi Descriptive Studies and Health Planning Unit, Department of Preventive and Predictive Medicine, Fondazione.
The Application of Propensity Score Analysis to Non-randomized Medical Device Clinical Studies: A Regulatory Perspective Lilly Yue, Ph.D.* CDRH, FDA,
BUS 220: ELEMENTARY STATISTICS
/4/2010 Box and Whisker Plots Objective: Learn how to read and draw box and whisker plots Starter: Order these numbers.
1 Matching EPIET introductory course Mahón, 2011.
1 1  1 =.
1  1 =.
MULTIPLICATION EQUATIONS 1. SOLVE FOR X 3. WHAT EVER YOU DO TO ONE SIDE YOU HAVE TO DO TO THE OTHER 2. DIVIDE BY THE NUMBER IN FRONT OF THE VARIABLE.
Addition Facts
Year 6 mental test 10 second questions Numbers and number system Numbers and the number system, fractions, decimals, proportion & probability.
C82MST Statistical Methods 2 - Lecture 2 1 Overview of Lecture Variability and Averages The Normal Distribution Comparing Population Variances Experimental.
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
ELSA English Longitudinal Study of Ageing The English Longitudinal Study of Ageing Rebecca Taylor National Centre for Social.
Stratified Sampling Module 3 Session 6.
Assumptions underlying regression analysis
SADC Course in Statistics Overview of Sampling Methods I (Session 03)
Chapter 7 Sampling and Sampling Distributions
Chapter 4: Basic Estimation Techniques
1. 2 GUIDELINES 1. Identify the variable(s) of interest (the focus) and the population of the study. 2. Develop a detailed plan for collecting data. If.
ORGANISING AN ORAL HEALTH SURVEY WHO ORAL HEALTH SURVEYS Basic Methods
Mental Math Math Team Skills Test 20-Question Sample.
CHAPTER 16 Life Tables.
How would you explain the smoking paradox. Smokers fair better after an infarction in hospital than non-smokers. This apparently disagrees with the view.
Business and Economics 6th Edition
1 Epidemiologic Measures of Association Saeed Akhtar, PhD Associate Professor, Epidemiology Division of Epidemiology and Biostatistics Aga Khan University,
Chapter 4 Inference About Process Quality
Study Size Planning for Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare Research and Quality (AHRQ)
Hours Listening To Music In A Week! David Burgueño, Nestor Garcia, Rodrigo Martinez.
Lecture 3 Validity of screening and diagnostic tests
AADAPT Workshop South Asia Goa, December 17-21, 2009 Kristen Himelein 1.
Addition 1’s to 20.
Week 1.
Number bonds to 10,
©2006 Prentice Hall Business Publishing, Auditing 11/e, Arens/Beasley/Elder Audit Sampling for Tests of Controls and Substantive Tests of Transactions.
Chapter 11: The t Test for Two Related Samples
1 Chapter 20: Statistical Tests for Ordinal Data.
January Structure of the book Section 1 (Ch 1 – 10) Basic concepts and techniques Section 2 (Ch 11 – 15): Inference for quantitative outcomes Section.
UK Renal Registry 17th Annual Report Figure 5.1. Trend in one year after 90 day incident patient survival by first modality, 2003–2012 cohorts (adjusted.
Significance Testing.  A statistical method that uses sample data to evaluate a hypothesis about a population  1. State a hypothesis  2. Use the hypothesis.
Commonly Used Distributions
Departments of Medicine and Biostatistics
Elementary Statistics MOREHEAD STATE UNIVERSITY
Selection of Research Participants: Sampling Procedures
BIOST 536 Lecture 3 1 Lecture 3 – Overview of study designs Prospective/retrospective  Prospective cohort study: Subjects followed; data collection in.
Sampling Prepared by Dr. Manal Moussa. Sampling Prepared by Dr. Manal Moussa.
11 Populations and Samples.
STAT 4060 Design and Analysis of Surveys Exam: 60% Mid Test: 20% Mini Project: 10% Continuous assessment: 10%
Course Content Introduction to the Research Process
Sampling Moazzam Ali.
Key terms in Sampling Sample: A fraction or portion of the population of interest e.g. consumers, brands, companies, products, etc Population: All the.
Multiple Choice Questions for discussion
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
CHAPTER 12 DETERMINING THE SAMPLE PLAN. Important Topics of This Chapter Differences between population and sample. Sampling frame and frame error. Developing.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
SAMPLING TECHNIQUES. Definitions Statistical inference: is a conclusion concerning a population of observations (or units) made on the bases of the results.
Sampling Techniques 19 th and 20 th. Learning Outcomes Students should be able to design the source, the type and the technique of collecting data.
Sample Size Determination
STATISTICAL DATA GATHERING: Sampling a Population.
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
Formulation of the Research Methods A. Selecting the Appropriate Design B. Selecting the Subjects C. Selecting Measurement Methods & Techniques D. Selecting.
Sample Size Determination
April 18 Intro to survival analysis Le 11.1 – 11.2
The Nature of Probability and Statistics
CS639: Data Management for Data Science
Data Collection and Sampling Techniques
Presentation transcript:

High Resolution studies Sampling and power analysis in the High Resolution studies Pamela Minicozzi Descriptive Studies and Health Planning Unit, Department of Preventive and Predictive Medicine, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan

High Resolution studies collected detailed data from patients’ clinical records, so that the influence of non-routinely collected factors (tumour molecular characteristics, diagnostic investigations, treatment, relapse) on survival and differences in standard care could be analysed

Problem Solution In each country, the population of incident cases for a particular cancer consists of N subjects N is large (so, rare cancers are not considered here) Since N is large, not all cases can be investigated use a representative sample to derive valid conclusions that are applicable to the entire original population Solution

Two questions What kind of probability sampling should we use? What sample size should we use?

Sampling

Previous High Resolution studies Samples were representative of 1-year incidence a time interval (e.g. 6 months) within the study period, provided that incidence was complete an administratively defined area covered by cancer registration

Present High Resolution studies Main types of probability sampling We want to eliminate variations in types of sampling between countries and within a single country This implies more sophisticated sampling Main types of probability sampling

Simple random sampling assign a unique number to each element of the study population determine the sample size randomly select the population elements using a table of random numbers a list of numbers generated randomly by a computer Advantage: - auxiliary information on subjects is not required Disadvantage: - if subgroups of the population are of particular interest, they may not be included in sufficient numbers in the sample

Stratified sampling identify stratification variable(s) and determine the number of strata to be used (e.g. day and month of birth, year of diagnosis, cancer registry, etc.) divide the population into strata and determine the sample size of each stratum randomly select the population elements in each stratum Advantage: - a more representative sample is obtained Disadvantage: - requires information on the proportion of the total population belonging to each stratum

Systematic sampling determine the sample size (n); thus the sampling interval “i” is n/N randomly select a number “r” from 1 to “i” select all the other subjects in the following positions: r, r+ i, r+ 2*i, etc, until the sample is exhausted Advantage: - eliminate the possibility of autocorrelation Disadvantage: - only the first element is selected on a probability basis  pseudo-random sampling

many subjects do we need? How many subjects do we need?

Hypothesis test and significance level The main elements Statistical power Hypothesis test and significance level Previous pilot study to determine the minimum sample size required to get a significant result (or to detect a meaningful effect) the probability that the difference will be detected (e.g. 80%, 90%) the probability that a positive finding is due to chance alone (e.g. 1%, 5%) Previous pilot studies they explored whether some variables can be measured with sufficient precision (or available) and checked the study vision

Previous High Resolution studies Number of patients was defined based on: observed differences in survival and risk of death incidence of the cancer under study difficulties in collecting clinical information available economic resources Notwithstanding that ... we were able to identify statistically significant relative excess risks of death up to 1.60 among European countries up to 1.40 among Italian areas for breast cancer for which differences in survival are small.  Applicable to other cancers for which survival differences are larger

Example for breast cancer (diagnosis 95-99) Plot power as a function of hazard ratio for a 5% two-sided log-rank test with 80% power over sample sizes ranging from 100 and 1000 Assume 75% survival as reference (the overall survival in Europe, range: 65-90%) 45%

Example for colorectal cancer (diagnosis 95-99) Plot power as a function of hazard ratio for a 5% two-sided log-rank test with 80% power over sample sizes ranging from 100 and 1000 Assume 50% survival as reference (the overall survival in Europe, range: 30-70%) 32%

Example for lung cancer (diagnosis 95-99) Plot power as a function of hazard ratio for a 5% two-sided log-rank test with 80% power over sample sizes ranging from 100 and 1000 Assume 10% survival as reference (the overall survival in Europe, range: 5-20%) 30%

Present High Resolution studies We want to analyse both differences in survival and adherence to standard care Power analysis for both logistic regression analysis (to analyse the odds of receiving one type of care (typically standard care)) and relative survival analysis (to analyse differences in relative survival and relative excess risks of death)

Conclusions Taking into account existing samplings and power methodology experience from previous studies different coverage of Cancer Registries available economic resources We want to standardize the selection of data include a minimum number of cases that satisfies statistical considerations related to all aims of our studies Prof. JS Long1 (Regression Models for Categorical and Limited Dependent,1997) suggests that sample sizes of less than 100 cases should be avoided and that 500 observations should be adequate for almost any situation. 1Professor of Sociology and Statistics at Indiana University

Thank you for your attention And… What about your experience?