Regression To The Mean 林 建 甫 C.F. Jeff Lin, MD. PhD.

Slides:



Advertisements
Similar presentations
Experimental and Ex Post Facto Designs
Advertisements

Hypothesis testing 5th - 9th December 2011, Rome.
Experimental and Quasi-Experimental Research
Regression Towards the Mean David A. Kenny University of Connecticut
Designing Clinical Research Studies An overview S.F. O’Brien.
Defining Characteristics
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Inadequate Designs and Design Criteria
GROUP-LEVEL DESIGNS Chapter 9.
Correlation AND EXPERIMENTAL DESIGN
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Matching level of measurement to statistical procedures
Correlations and T-tests
Chapter 9: Introduction to the t statistic
Chapters 10 and 11: Using Regression to Predict Math 1680.
EVALUATING YOUR RESEARCH DESIGN EDRS 5305 EDUCATIONAL RESEARCH & STATISTICS.
Section 9.3 Sample Means.
Choosing Statistical Procedures
The following lecture has been approved for University Undergraduate Students This lecture may contain information, ideas, concepts and discursive anecdotes.
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
Applying Science Towards Understanding Behavior in Organizations Chapters 2 & 3.
Evidence Based Medicine
Sample size determination Nick Barrowman, PhD Senior Statistician Clinical Research Unit, CHEO Research Institute March 29, 2010.
Jan 17,  Hypothesis, Null hypothesis Research question Null is the hypothesis of “no relationship”  Normal Distribution Bell curve Standard normal.
Sample Size Determination Donna McClish. Issues in sample size determination Sample size formulas depend on –Study design –Outcome measure Dichotomous.
Reliability of Screening Tests RELIABILITY: The extent to which the screening test will produce the same or very similar results each time it is administered.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Quantitative and Qualitative Approaches
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
CORRELATION: Correlation analysis Correlation analysis is used to measure the strength of association (linear relationship) between two quantitative variables.
Regression to the Mean.
Chapter 6 Research Validity. Research Validity: Truthfulness of inferences made from a research study.
Chapter 10 Finding Relationships Among Variables: Non-Experimental Research.
Data Analysis.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Inferential Statistics Inferential statistics allow us to infer the characteristic(s) of a population from sample data Slightly different terms and symbols.
Experimental Control Definition Is a predictable change in behavior (dependent variable) that can be reliably produced by the systematic manipulation.
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
STATISTICS STATISTICS Numerical data. How Do We Make Sense of the Data? descriptively Researchers use statistics for two major purposes: (1) descriptively.
EC 827 Module 2 Forecasting a Single Variable from its own History.
Definition Slides Unit 1.2 Research Methods Terms.
Data Descriptions.
Educational Research Experimental Research Chapter 9 (8 th Edition) Chapter 13 (7 th Edition) Gay and Airasian.
Methods of Presenting and Interpreting Information Class 9.
Practical Research: Planning and Design, Ninth Edition Paul D. Leedy and Jeanne Ellis Ormrod © 2010 Pearson Education, Inc. All rights reserved. 1 Chapter.
Experimental Research
GS/PPAL Section N Research Methods and Information Systems
Statistical analysis.
Dependent-Samples t-Test
Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016
Statistical analysis.
Experimental Research Designs
Relative Values.
Change Score Analysis versus ANCOVA in Pretest/Posttest Designs:
Design (3): quasi-experimental and non-experimental designs
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Internal Validity and Confounding Variables
Making Causal Inferences and Ruling out Rival Explanations
Introduction to Design
Analysis based on normal distributions
Chapter 6 Research Validity.
Correlational Studies
Correcting for Regression to the Mean in Behavior and Ecology
Sampling Distributions
RESEARCH METHODS Lecture 33
An Introduction to Correlational Research
Hypothesis Testing and Confidence Intervals (Part 2): Cohen’s d, Logic of Testing, and Confidence Intervals Lecture 9 Justin Kern April 9, 2018.
RESEARCH METHODS Lecture 33
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Regression To The Mean 林 建 甫 C.F. Jeff Lin, MD. PhD. 2019/2/25 Regression To The Mean 林 建 甫 C.F. Jeff Lin, MD. PhD. 台 北 大 學 統 計 研 究 所 助 理 教 授 台 北 榮 民 總 醫 院 生 物 統 計 顧 問 美 國 密 西 根 大 學 生 物 統 計 博 士 2019/2/25 Jeff Lin, MD. PhD.

Pre-test Post-test Comparison Regression to the Mean 2019/2/25 Pre-test Post-test Comparison Regression to the Mean 2019/2/25 Jeff Lin, MD. PhD.

Regression Toward the Mean (Definition) A variable extreme on its first measurement will be closer to the center of the distribution for a later measurement (on the average). Results from within person variability (temporal variability and measurement error), i.e., . 2 e  2019/2/25 Jeff Lin, MD. PhD.

Examples Studies Comparing Ambulatory and Office BP Measurements Pickering et al. “How common is white coat hypertension.” JAMA, 1988. Office DBP Awake DBP Difference Normal volunteers 77 78 -1 (<90 mmHg) “Borderline Hypertensives” 96 93 3 (90 - 104 mmHg) “Established Hypertensives” 111 101 10 (≥105 mmHg) 2019/2/25 Jeff Lin, MD. PhD.

The Simple Explanation... when you select a group from the extreme end of a distribution... selected group’s mean overall mean ...they will do better on a subsequent measure their mean on the first measure appears to “regress toward the mean” of the second measure overall mean regression to the mean 2019/2/25 Jeff Lin, MD. PhD.

Example I: pretest posttest if the first measure is a pretest and you select the low scorers... ...and the second measure is a posttest regression to the mean will make it appear as though they gained from pre to post posttest pseudo-effect 2019/2/25 Jeff Lin, MD. PhD.

Example II: pretest posttest if the first measure is a pretest and you select the high scorers... ...and the second measure is a posttest regression to the mean will make it appear as though they lost from pre to post posttest pseudo-effect 2019/2/25 Jeff Lin, MD. PhD.

Some facts this is purely a statistical phenomenon 2019/2/25 this is purely a statistical phenomenon this is a group phenomenon some individuals will move opposite to this group trend 2019/2/25 Jeff Lin, MD. PhD.

Why Does It Happen? 2019/2/25 for low scorers, you have taken the lowest x%; what are the chances they will be the lowest x% on the second measure? for high scorers, you have taken the highest x%; what are the chances they will be the highest x% on the second measure? 2019/2/25 Jeff Lin, MD. PhD.

Why Does It Happen? 2019/2/25 regression artifacts occur whenever we sample asymmetrically from a distribution regression artifacts occur with any two variables (not just pre and posttest) and even backwards in time! 2019/2/25 Jeff Lin, MD. PhD.

the absolute amount of regression to the mean depends on two factors: What Does It Depend On? the absolute amount of regression to the mean depends on two factors: the degree of asymmetry (i.e., how far from the overall mean of the first measure the selected group's mean is) the correlation between the two measures 2019/2/25 Jeff Lin, MD. PhD.

The percent of regression to the mean is: A Simple Formula The percent of regression to the mean is: 2019/2/25 Jeff Lin, MD. PhD.

The percent of regression to the mean is: A Simple Formula The percent of regression to the mean is: Prm = 100(1 - r) 2019/2/25 Jeff Lin, MD. PhD.

A Simple Formula The percent of regression to the mean is: Prm = 100(1 - r) where r is the correlation between the two measures 2019/2/25 Jeff Lin, MD. PhD.

A Simple Formula The percent of regression to the mean is: Prm = 100(1 - r) where r is the correlation between the two measures the formula tells the %, but the actual amount depends on how far the group mean is from the overall mean on the selection variable 2019/2/25 Jeff Lin, MD. PhD.

For Example: Prm = 100(1 - r) if r = 1, there is no (i.e., 0%) regression to the mean if r = 0, there is 100% regression to the mean if r = .2, there is 80% regression to the mean if r = .5, there is 50% regression to the mean 2019/2/25 Jeff Lin, MD. PhD.

assume a standardized test with a mean of 50 Example assume a standardized test with a mean of 50 pretest 50 2019/2/25 Jeff Lin, MD. PhD.

Example pretest assume a standardized test with a mean of 50 you give your program to the lowest scorers and their mean is 30 30 50 2019/2/25 Jeff Lin, MD. PhD.

Example pretest posttest assume a standardized test with a mean of 50 you give your program to the lowest scorers and their mean is 30 assume that the correlation of pre-post is .5 30 50 posttest 2019/2/25 Jeff Lin, MD. PhD.

Example pretest posttest assume a standardized test with a mean of 50 you give your program to the lowest scorers and their mean is 30 assume that the correlation of pre-post is .5 30 50 the formula is: posttest 2019/2/25 Jeff Lin, MD. PhD.

Example pretest posttest assume a standardized test with a mean of 50 you give your program to the lowest scorers and their mean is 30 assume that the correlation of pre-post is .5 30 50 the formula is: Prm = 100(1 - r) = 100(1-.5) = 50% posttest 2019/2/25 Jeff Lin, MD. PhD. 50%

Example pretest posttest assume a standardized test with a mean of 50 you give your program to the lowest scorers and their mean is 30 assume that the correlation of pre-post is .5 30 50 the formula is: Prm = 100(1 - r) = 100(1-.5) = 50% Therefore the mean will regress up 50% (from 30 to 50), leaving a final mean of 40 and a 10 point pseudo-gain 40 posttest pseudo-effect 2019/2/25 Jeff Lin, MD. PhD.

Regression Toward the Mean Measurable characteristics of an individual do not have constant values Vary above and below the average value If two sets of measurements are made on individuals, the correlation between the first and the second series of measurements will not be perfect. It is often the case that the more distant a measured characteristics from the population means of that characteristic, the more variable the measurement tends to be. 2019/2/25 Jeff Lin, MD. PhD.

Regression Toward the Mean 2019/2/25 The more extreme the initial selection criterion (that is, the further from the population mean), the greater will be the regression toward the mean at the time of the next measurement. Whenever participants are selected from a population on the basis of some measured characteristic, the mean of a subsequent measurement will be closer to the population mean than is the first measurement mean. 2019/2/25 Jeff Lin, MD. PhD.

Regression Towards the Mean 1. Regression towards the mean complicates the interpretation of many uncontrolled studies; frequently it is not recognized 2. In randomized clinical trials regression towards the mean can influence: Cost, amount and nature of screening Choice of baseline for within group comparisons Hypothesized treatment effect based on risk factor change 2019/2/25 Jeff Lin, MD. PhD.

Regression Toward the Mean An investigator cannot simply compare pre-intervention and post-intervention values in the intervention group. She must compare post-intervention values in the intervention group with values obtained at similar times in the control group. 2019/2/25 Jeff Lin, MD. PhD.

Regression Toward the Mean Use a more extreme value than the entrance criterion when the investigators screened people before baseline. Enroll participant had his or her pressure recorded three times. Only those whose second and third measure averaged 95 mmHg or greater were invited to the clinic for further evaluation. 2019/2/25 Jeff Lin, MD. PhD.

Strategies for Reducing Regression Toward the Mean Multiple measurements; multiple visits Baseline free of selection Standardization of methods; training 2019/2/25 Jeff Lin, MD. PhD.

Thanks ! 2019/2/25 Jeff Lin, MD. PhD.