Assessment of Reliable Change: Methods and Assumptions Michael Basso, Ph.D. Associate Professor and Director of Clinical Training Department of Psychology—University.

Slides:



Advertisements
Similar presentations
ASSESSING RESPONSIVENESS OF HEALTH MEASUREMENTS. Link validity & reliability testing to purpose of the measure Some examples: In a diagnostic instrument,
Advertisements

ADVANCED STATISTICS FOR MEDICAL STUDIES Mwarumba Mwavita, Ph.D. School of Educational Studies Research Evaluation Measurement and Statistics (REMS) Oklahoma.
Assessing Change in Clinical Practice Richard Bost, Ph.D., ABPP Frances Wen, Ph.D. Michael Basso, Ph.D.
Review Find the mean square (MS) based on these two samples. A.26.1 B.32.7 C.43.6 D.65.3 E M A = 14 M B = Flood.
Objectives (BPS chapter 24)
SAMPLE SIZE ESTIMATION
Using Statistics in Research Psych 231: Research Methods in Psychology.
PSY 307 – Statistics for the Behavioral Sciences
Concept of Measurement
Topics: Inferential Statistics
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 10: Hypothesis Tests for Two Means: Related & Independent Samples.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 9: Hypothesis Tests for Means: One Sample.
The Basics of Regression continued
Topic 3: Regression.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview Central Limit Theorem The Normal Distribution The Standardised Normal.
Today Concepts underlying inferential statistics
Chapter 9 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 What is a Perfect Positive Linear Correlation? –It occurs when everyone has the.
Confidence Interval A confidence interval (or interval estimate) is a range (or an interval) of values used to estimate the true value of a population.
Chemometrics Method comparison
1 8/14/2015 Evaluating the Significance of Health-Related Quality of Life Change in Individual Patients Ron Hays October 8, 2004 UCLA GIM/HSR.
1 Psych 5500/6500 Statistics and Parameters Fall, 2008.
Standard Error of the Mean
ECONOMETRICS I CHAPTER 5: TWO-VARIABLE REGRESSION: INTERVAL ESTIMATION AND HYPOTHESIS TESTING Textbook: Damodar N. Gujarati (2004) Basic Econometrics,
1 G Lect 5b G Lecture 5b A research question involving means The significance test approach »The problem of s 2 »Student’s t distribution.
T tests comparing two means t tests comparing two means.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Chapter 1: Research Methods
Individual values of X Frequency How many individuals   Distribution of a population.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Exploring the SF-36, V2: Measuring the Quality of Life of AgrAbility Program Participants Ronald C. Jester Robert A. Wilson University of Delaware National.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Assessing Change in Clinical Practice Richard Bost, Ph.D., ABPP Frances Wen, Ph.D. Michael Basso, Ph.D Oklahoma Psychological Association 11/6/04.
Statistical analysis Prepared and gathered by Alireza Yousefy(Ph.D)
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Chapter 9 Three Tests of Significance Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.
CHAPTER SEVEN ESTIMATION. 7.1 A Point Estimate: A point estimate of some population parameter is a single value of a statistic (parameter space). For.
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
Testing Differences between Means, continued Statistics for Political Science Levin and Fox Chapter Seven.
Chapter 10 The t Test for Two Independent Samples
T tests comparing two means t tests comparing two means.
Assessing Responsiveness of Health Measurements Ian McDowell, INTA, Santiago, March 20, 2001.
Measurement MANA 4328 Dr. Jeanne Michalski
Reliability performance on language tests is also affected by factors other than communicative language ability. (1) test method facets They are systematic.
Chapter 11 Intelligence. Mental quality consisting of the ability to learn from experience, solve problems, and use knowledge to adapt to new situations.
Comparing the Means of Two Dependent Populations.
From the population to the sample The sampling distribution FETP India.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
5. Evaluation of measuring tools: reliability Psychometrics. 2011/12. Group A (English)
Vocab Unit 11. = a method of assessing an individual's mental aptitudes and comparing them with those of others, using numerical scores.
Unit 11 Vocabulary Individual Differences and Intelligence.
Clinical practice involves measuring quantities for a variety of purposes, such as: aiding diagnosis, predicting future patient outcomes, serving as endpoints.
T-Tests and ANOVA I Class 15.
Statistical analysis.
Comparing Two Means Prof. Andy Field.
Effect Size 10/15.
Statistical analysis.
Inference: Conclusion with Confidence
Effect Size.
Virtual COMSATS Inferential Statistics Lecture-26
12 Inferential Analysis.
Definition Slides.
IN THE NAME OF GOD.
Evaluation of measuring tools: reliability
12 Inferential Analysis.
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Presentation transcript:

Assessment of Reliable Change: Methods and Assumptions Michael Basso, Ph.D. Associate Professor and Director of Clinical Training Department of Psychology—University of Tulsa Clinical Associate Professor Department of Psychiatry—University of Oklahoma

Objectives Provide background concerning methods of assessing reliable change Describe assumptions and applications of reliable change scores Illustrate use of reliable change scores

Assessment of Clinical Change Two Basic Approaches –Assessment of Group Differences Across Time –Assessment of Individual Differences Across Time

Assessment of Group Differences Across Time Assessment of statistically reliable change –“Does the treatment yield significant benefits for groups of patients?” »i.e., do average scores at T1 and T2 come from different distributions –This approach describes the average rate of change over groups primarily –It is accomplished with repeated measure ANOVA –Problem: You could have a statistically significant difference with a very small effect size, but it might not be a clinically meaningful change

Assessment of Clinically Meaningful Change –“Did the patient’s change in performance at T2 exceed base rates of change?” »i.e., did the individual show change that exceeded expectations based on measurement error, practice effects, and regression to the mean? –This method describes the base rate of change »Change that is exceeds the base rate is not normal, and is therefore clinically meaningful –Our focus is on the assessment of clinically meaningful change in individuals, but this method can be applied to group data as well Assessment of Individual Differences Across Time

How do you establish the base-rate of change? Bear into consideration that: –It would be improbable to obtain the exact same score twice –There is no perfect test-retest correspondence because of »measurement error »regression to the mean »practice effects Assessment of Clinical Change for Individuals

Reliable change Index scores –“Does change exceed what would be expected based on measurement error alone?” –This method is based on Reliability of measurement –It is used for typical performance tests »i.e., attitude, personality, psychopathology, etc. Standardized Regression-Based Change Scores –“Does change in scores exceed expectations based on T1 (baseline) scores?” –This method is based on a validity coefficient (i.e., what T2 score is predicted by the T1 score) –It is used for maximal performance tests »i.e., IQ, neuropsychological, etc. Two Methods of Assessing Base Rates of Change

Elaborated by Jacobson and Truax (1991) –Based on the standard error of the difference »Which in turn is based on the reliability coefficient –This reflects the sampling distribution of difference scores –it implies the magnitude of differences between two test scores that vary by chance alone Assumptions –Error components are mutually independent and independent of true pretest and posttest scores –Error is normally distributed with a mean of 0 –SE of error is equal for all subjects –These assumptions are questionable in clinical settings (cf. Maassen, 2004) Reliable Change Index Scores

Standard Normal Curve—Distribution of Difference Scores

To use the RCI, you must compute the SE of difference between two scores SEdiff=(2(SD(1-r xx ) 1/2 ) 2 ) 1/2 Then, compute a confidence interval for change scores for 95% confidence, you multiply 1.96 * SEdiff for 90% confidence, you multiply 1.60 * Sediff Does the raw score change between T2 and T1 exceed the confidence interval? –If so, it represents change that exceeds the base rate expected based on measurement error »Thus, clinically meaningful change has occurred –If not, then the change is consistent with the base rate expected based on measurement error »Thus, no clinically meaningful change has occurred Reliable Change Index Scores--Method

Ferguson, Robinson, & Splaine (2002) –SF-36 in 200 patients who had undergone a Coronary Artery Bypass Grafting (CABG) surgery –SF-36 contains 8 scales »Physical Functioning »Role Functioning Physical »Bodily Pain »General Health »Vitality »Social Functioning »Role Functioning-Emotional »Mental Health Reliable Change Index Scores—An Example

Ferguson, Robinson, & Splaine (2002) –Physical Functioning »Reliability=.93 (from normative sample of 2474) »Mean of normative sample=84.15 »SD of normative sample=23.28 »SEdiff=(2(SD(1-r xx ) 1/2 ) 2 ) 1/2 SEdiff=(2(23.28(1-.93) 1/2 ) 2 ) ) 1/2 = % CI: (SEdiff)*1.96=19.32 T1 Mean=40.97 T2 Mean=63.39 Mean Diff=22.42 The mean difference exceeds Thus, clinically meaningful change has occurred as a result of surgery Reliable Change Index Scores—An Example

Ferguson, Robinson, & Splaine (2002) –Mental Health »Reliability=..84 (from normative sample of 2474) »Mean of normative sample=75.01 »SD of normative sample=21.40 »SEdiff=(2(SD(1-r xx ) 1/2 ) 2 ) 1/2 SEdiff=(2(21.40(1-..84) 1/2 ) 2 ) ) 1/2 = % CI: (SEdiff)*1.96=21.40 T1 Mean=72.08 T2 Mean=71.84 Mean Diff=-0.24 The mean difference fails to exceed Thus, no clinically meaningful change has occurred as a result of surgery Reliable Change Index Scores—An Example

Elaborated by Charter (1996) Used primarily for maximal performance tests The RCI of Jacobsen and Truax is used for typical performance tests –It assumes that errors between test scores at baseline and time 2 are uncorrelated –This assumption is untenable in maximal performance tests because of practice effects Standardized Regression Based Change Scores

Based on the standard error of prediction –SE pred =SD Y2 ((1-r Y1Y2 2 ) 1/2 ) The SE reflects the sampling distribution of predicted scores It implies the range of scores that might be expected at time two that may be expected from the baseline score and prediction error This method requires you to compute the estimated true score –Y2 True =M+((r Y1Y2 )(Y1-M)) The T2 score is prone to error, and this formula permits an unbiased estimate of the true score The SEpred is used to compute a confidence interval around the estimated true score Standardized Regression Based Change Scores

Standard Normal Curve—Distribution of Standard Error of Prediction Around Estimated True Score

To use the SRB, you must compute the estimated true T2 score Compute the confidence interval around this estimated true T2 score For 95% confidence, you multiply 1.96 * SEpred For 90% confidence, you multiply 1.60 * SEpred Does the obtained T2 score fall outside the confidence interval around the estimated true score for T2? –If so, it represents change that exceeds the base rate expected based on measurement error, regression to the mean, and practice »Thus, clinically meaningful change has occurred –If not, then the change is consistent with the base rate expected based on measurement error, practice, and regression to the mean »Thus, no clinically meaningful change has occurred Standardized Regression Based Change Scores--Method

Basso, Carona, Lowery, & Axelrod (2002) –WAIS-III re-tested in a group of control subjects over a 3-6 month interval –FSIQ »Test-Retest Reliability=.90 »T1 Mean T1=109.4 (11.6) »T2 Mean T2=115.0 (12.1) »SE pred =SD Y2 ((1-r Y1Y2 2 ) 1/2 ) SE pred =(12.1(( ) 1/2 ))= % CI: (SEdiff)*1.96=10.36 Mean Diff=5.60 –The mean difference fails to exceed the 95% CI –No individual had a score exceeding the 95% CI To apply the SRB, the T2 True Score is estimated –If the obtained score falls within the CI around the T2 True score, then no clinically meaningful change has occurred Standardized Regression Based Change Scores--An Example

Basso, Carona, Lowery, & Axelrod (2002) –An example application: –T1 obtained score=104 –T2 obtained score=116 –Estimated True T2 Score »Y True =M+((r Y1Y2 )(Y1-M)) »Y True =100+(.90)( )=103.6 »116 exceeds points from »Thus, meaningful change has occurred Standardized Regression Based Change Scores--An Example

Basso, Carona, Lowery, & Axelrod (2002) –An example application: –T1 obtained score=103 –T2 obtained score=106 –Estimated True T2 Score »Y True =M+((r Y1Y2 )(Y1-M)) »Y True =100+(.90)( )=105.4 »105 falls within points of 106 »Thus, no meaningful change has occurred Standardized Regression Based Change Scores--An Example

Questions?