Methods for Evaluating the Performance of Diagnostic Tests in the Absence of a “Gold Standard:” A Latent Class Model Approach Elizabeth S. Garrett Division.

Presentation transcript:

Methods for Evaluating the Performance of Diagnostic Tests in the Absence of a “Gold Standard”: A Latent Class Model Approach
Elizabeth S. Garrett, Division of Biostatistics, Johns Hopkins University
December 9, 2002

Evaluating Diagnostic Criteria
Relatively few areas of medicine have true “gold standard” tests, where the test is perfectly accurate.
– “Pathognomonic indicators”
– When the indicator is present, disease is present
– When the indicator is absent, disease is absent
Other situations:
– A combination of signs and symptoms provides a very accurate diagnosis.
– The disease process is not well understood: controversy exists about how to define the diagnosis.
– The disease process is well understood, but measuring disease via signs and symptoms is difficult.

Diagnostic Criteria in Psychiatry
Currently, the DSM (Diagnostic and Statistical Manual of Mental Disorders) is the standard for defining mental disorders.
Diagnostic algorithms are provided with which the presence or absence of a disorder can be determined.
Examples: major depression, schizophrenia, autism, alcoholism, generalized anxiety disorder.

Major Depressive Episode, as diagnosed by the DSM-IV (APA, 1994)
A. A person who suffers from major depressive disorder must have either depressed mood or a loss of interest or pleasure in daily activities for at least a 2-week period.
B. The disorder is characterized by the presence of five or more of the following nine symptoms:
1. Depressed mood most of the day, nearly every day, as indicated by either subjective report or observation made by others.
2. Markedly diminished interest or pleasure in all, or almost all, activities most of the day, nearly every day.
3. Significant weight loss when not dieting or weight gain, or decrease or increase in appetite nearly every day.
4. Insomnia or hypersomnia nearly every day.
5. Psychomotor agitation or retardation nearly every day.
6. Fatigue or loss of energy nearly every day.
7. Feelings of worthlessness or excessive or inappropriate guilt nearly every day.
8. Diminished ability to think or concentrate, or indecisiveness, nearly every day.
9. Recurrent thoughts of death, recurrent suicidal ideation without a specific plan, or a suicide attempt or a specific plan for committing suicide.
The symptoms are not better accounted for by bereavement; that is, the symptoms persist for longer than 2 months or are characterized by marked functional impairment, morbid preoccupation with worthlessness, suicidal ideation, psychotic symptoms, or psychomotor retardation.
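The symptom-count part of the criteria above can be written as a short rule. This is a minimal sketch, assuming the nine symptoms are supplied as booleans in the order listed; the function name `meets_dsm_iv_symptom_rule` is hypothetical, and the duration and bereavement conditions are deliberately left out.

```python
def meets_dsm_iv_symptom_rule(symptoms):
    """Return True if the symptom-count rule is met.

    symptoms: sequence of 9 truthy/falsy values, ordered as symptoms 1-9 above.
    Assumption: each value already means "present nearly every day during the
    same 2-week period"; duration and bereavement checks are NOT modelled here.
    """
    if len(symptoms) != 9:
        raise ValueError("expected exactly 9 symptom indicators")
    # Criterion A: depressed mood (1) or loss of interest/pleasure (2).
    has_core_symptom = bool(symptoms[0]) or bool(symptoms[1])
    # Criterion B: five or more of the nine symptoms.
    n_present = sum(1 for s in symptoms if s)
    return has_core_symptom and n_present >= 5
```

A pattern with five symptoms but neither depressed mood nor loss of interest is therefore not classified as a major depressive episode under this rule.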

How do we validate the DSM criteria?
How can we be sure that these definitions are valid measures?
How can we determine the sensitivity and specificity of these measures?
Is there a gold standard?
Is a psychiatrist’s diagnosis a gold standard?
What types of individuals are the diagnostic criteria diagnosing as depressed?
How often are individuals misdiagnosed?
What are the implications of a positive or negative diagnosis?

Example: Major Depression
Epidemiologic Catchment Area Study (ECA): collected mental health data on individuals in 5 cities, beginning in […].
Our sample: an epidemiologic sample of 1322 individuals in the East Baltimore area, collected in 1993 (wave 3).
Depression questions are from the Diagnostic Interview Schedule, which has been shown to be valid and reliable (Robins et al., 1981).
Questions about symptom presence were asked for the DSM major depression symptoms and coded as “present” if the symptoms occurred in the same two-week period.
Symptom groups: some questions ask about the same type of symptom:
– Have you had trouble sleeping?
– Have you had trouble waking?
– Do you sleep too much?
Related symptoms are categorized into the same symptom group.

Distribution of Symptoms (ECA Wave 3, N = 1322)

Group  Symptoms                                                      Prevalence
1      Depressed mood                                                0.06
2      Disinterest in sex; less fun; loss of enjoyment               0.08
3      Reduced energy/fatigued
4      Reduced concentration; slow thoughts; indecisive              0.04
5*     Feel inferior; lacking self-confidence                        0.03
6      Guilty/sinful                                                 0.02
7      Ideas of self-harm; want to die; suicidal thoughts;           0.05
       suicide attempts
8      Trouble falling asleep; trouble waking; sleep too much        0.09
9      Loss of appetite; weight loss; increased appetite;            0.08
       weight gain
10     Slow movement; fast movement; fidgety                         0.04

Evaluating the DSM Criteria
Without an available gold standard, we resort to other methods.
Suppose that the proposed symptoms (or symptom groups) define depression.
Without relying on the DSM definition of depression, but imposing model assumptions, what types of symptom patterns are observed in the data?
Do individuals tend to “cluster” into categories based on symptom response patterns?
We can evaluate this using a latent class model, the categorical analog of factor analysis.

The Latent Class Model
Assumes that each individual in the population is a member of one of M latent classes.
Each class is defined by a vector of symptom prevalences, $p_m = (p_{1m}, p_{2m}, \ldots, p_{Km})$, where there are K symptoms and $m = 1, \ldots, M$.
The vector $y_i = (y_{i1}, y_{i2}, \ldots, y_{iK})$ is individual i’s binary vector of symptom responses, $i = 1, \ldots, N$.
The proportion of individuals in class m is denoted by $\pi_m$.
The true, yet unobserved, latent class of individual i is denoted by $\eta_i$, where $\eta_i \in \{1, 2, \ldots, M\}$.
The symptoms “define” the latent variable of interest.
M is fixed.
Conditional independence: given class membership, symptoms are independent.

Graphical Depiction of the Latent Class Model
[Figure: three latent classes (η = 1, 2, 3). Each class m has its own prevalence vector $(p_{1m}, p_{2m}, \ldots, p_{Km})$, which generates the response vectors $(y_{i1}, y_{i2}, \ldots, y_{iK})$ of the individuals in that class.]

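Statistical Details
Probability distribution of $Y_i$:
$P(Y_i = y_i) = \sum_{m=1}^{M} \pi_m \prod_{k=1}^{K} p_{km}^{y_{ik}} (1 - p_{km})^{1 - y_{ik}}$
Likelihood function:
$L(\pi, p \mid Y) = \prod_{i=1}^{N} \sum_{m=1}^{M} \pi_m \prod_{k=1}^{K} p_{km}^{y_{ik}} (1 - p_{km})^{1 - y_{ik}}$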
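Under the conditional independence assumption, the probability of a response pattern is a mixture over classes of products of Bernoulli terms, and the likelihood is the product of these probabilities over individuals. The sketch below evaluates both; the function names are hypothetical, and the prevalences are stored as `p[m][k]` (class first), which is the transpose of the $p_{km}$ subscript order used in the slides.

```python
import math

def pattern_prob(y, pi, p):
    """P(Y = y) = sum_m pi[m] * prod_k p[m][k]^y[k] * (1 - p[m][k])^(1 - y[k]).

    y  : binary response pattern of length K
    pi : class proportions, length M (should sum to 1)
    p  : p[m][k] = prevalence of symptom k in class m (illustrative layout)
    """
    total = 0.0
    for pi_m, p_m in zip(pi, p):
        prob = pi_m
        for y_k, p_km in zip(y, p_m):
            prob *= p_km if y_k else (1.0 - p_km)
        total += prob
    return total

def log_likelihood(data, pi, p):
    """Log of the product over individuals of pattern_prob."""
    return sum(math.log(pattern_prob(y, pi, p)) for y in data)
```

Working on the log scale avoids numerical underflow when N is large, as with the 1322 individuals in this sample.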

Interpretation
Two class model:
– A non-depressed class which reports on average no symptoms (93% of sample)
– A depressed class which reports on average 4 to 5 of the 10 symptoms
Three class model:
– A non-depressed class which reports on average no symptoms (88% of sample)
– A mildly depressed class which reports on average 2 to 3 of the 10 symptoms (9% of sample)
– A severely depressed class which reports on average 6 to 7 of the 10 symptoms (3% of sample)
The three class model is deemed more appropriate from a statistical standpoint (model fit, adherence to model assumptions).

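Results of Estimation
1. p matrix
2. $\pi$ vector
3. Posterior probability of class membership, which tells us the probability that individual i is in each of the classes, given his or her response pattern:
$P(\eta_i = m \mid y_i) = \dfrac{\pi_m \prod_{k=1}^{K} p_{km}^{y_{ik}} (1 - p_{km})^{1 - y_{ik}}}{\sum_{m'=1}^{M} \pi_{m'} \prod_{k=1}^{K} p_{km'}^{y_{ik}} (1 - p_{km'})^{1 - y_{ik}}}$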
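Given estimates of p and π, the posterior probability of class membership follows from Bayes' theorem. A minimal sketch, assuming the same illustrative layout `p[m][k]` (class first) and a hypothetical function name:

```python
def posterior_class_probs(y, pi, p):
    """Return [P(eta = m | y) for each class m] via Bayes' theorem.

    y  : binary response pattern of length K
    pi : class proportions, length M
    p  : p[m][k] = prevalence of symptom k in class m (illustrative layout)
    """
    joint = []
    for pi_m, p_m in zip(pi, p):
        prob = pi_m  # prior probability of class m
        for y_k, p_km in zip(y, p_m):
            prob *= p_km if y_k else (1.0 - p_km)  # conditional independence
        joint.append(prob)
    total = sum(joint)  # marginal probability of the pattern
    return [j / total for j in joint]
```

For example, with made-up values `pi = [0.9, 0.1]` and `p = [[0.1, 0.1], [0.8, 0.8]]`, a respondent reporting both symptoms is more likely to belong to the second class, even though that class is rare.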

Examples (assume M = 2)
Individual reports absence of all symptoms:

Examples (assume M = 2)
Individual reports only fatigue and sleep problems:
Individual reports all symptoms except self-esteem and guilt:

Estimation Options
Maximum Likelihood Approach
– Widely available
– Accepted approach
Bayesian Approach
– Markov chain Monte Carlo (MCMC) estimation
– Easily implemented in WinBUGS (Imperial College of Science, Technology and Medicine)
– Benefits:
  – Model checking methods
  – “Identifiability” can be assessed (Garrett and Zeger, 2000)
  – The MCMC approach allows estimation of ANY function of the parameters, together with its standard error.

Bayesian Estimation Approach
The Gibbs sampler is an iterative process used to estimate posterior distributions of parameters.
– We sample parameters from their conditional distributions, e.g. $P(\pi_1 \mid Y, p, \eta, \pi_2, \pi_3)$.
– At each iteration, we get “sampled” values of p, $\pi$, and $\eta$.
– We use the samples from the iterations to estimate posterior distributions by averaging over other parameter values.
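The iteration described above can be sketched in a few lines. This is an illustration only, not the WinBUGS model used in the talk: it assumes a flat Dirichlet(1, …, 1) prior on π and independent Beta(1, 1) priors on each prevalence, neither of which is stated in the slides. The function name `gibbs_lcm` is hypothetical, classes are indexed from 0, and label switching and convergence checks are ignored.

```python
import random

def gibbs_lcm(data, M, n_iter, seed=0):
    """Gibbs sampler sketch for a latent class model with binary items.

    Assumed priors (not from the source): pi ~ Dirichlet(1,...,1),
    p[m][k] ~ Beta(1, 1). Returns one dict of sampled values per iteration.
    """
    rng = random.Random(seed)
    K = len(data[0])
    pi = [1.0 / M] * M
    p = [[rng.uniform(0.2, 0.8) for _ in range(K)] for _ in range(M)]
    draws = []
    for _ in range(n_iter):
        # Step 1: draw each eta_i from P(eta_i | y_i, pi, p).
        eta = []
        for y in data:
            weights = []
            for m in range(M):
                w = pi[m]
                for k in range(K):
                    w *= p[m][k] if y[k] else (1.0 - p[m][k])
                weights.append(w)
            eta.append(rng.choices(range(M), weights=weights)[0])
        # Step 2: draw pi | eta ~ Dirichlet(1 + class counts),
        # generated as normalized gamma variates.
        counts = [eta.count(m) for m in range(M)]
        gammas = [rng.gammavariate(1.0 + c, 1.0) for c in counts]
        total = sum(gammas)
        pi = [g / total for g in gammas]
        # Step 3: draw p[m][k] | eta, Y ~ Beta(1 + present, 1 + absent).
        for m in range(M):
            for k in range(K):
                present = sum(y[k] for y, e in zip(data, eta) if e == m)
                p[m][k] = rng.betavariate(1.0 + present,
                                          1.0 + counts[m] - present)
        draws.append({"pi": list(pi),
                      "p": [row[:] for row in p],
                      "eta": list(eta)})
    return draws
```

In practice an initial run of iterations would be discarded as burn-in before the draws are summarized.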

Evaluating Depression Diagnosis
Assumption: treat the latent class model as our “gold standard” definition of depression.
We can use the symptom responses to evaluate the DSM-IV diagnosis of depression.
Assume two classes of depression:
– Class 1 is the non-depressed class
– Class 2 is the depressed class
Compare the DSM diagnosis to the latent class diagnosis using the standard definitions:
– Sensitivity (SE) = P(DSM diagnosis positive | $\eta = 2$)
– Specificity (SP) = P(DSM diagnosis negative | $\eta = 1$)

More specifically…
where $\{y_r : r \in R\}$ is the set of symptom patterns that are classified as a diagnosis by the DSM-IV.
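The equations on this slide were not preserved. One reading consistent with the definition of R is that sensitivity sums the class-2 probabilities of all DSM-positive patterns, $SE = \sum_{r \in R} P(y_r \mid \eta = 2)$, and specificity is $SP = 1 - \sum_{r \in R} P(y_r \mid \eta = 1)$. The sketch below computes both by enumerating all $2^K$ patterns; the function names and the `is_positive` rule are hypothetical, and the formulas are a reconstruction rather than the author's stated ones.

```python
from itertools import product

def class_pattern_prob(y, p_m):
    """P(y | eta = m) under conditional independence; p_m[k] is the
    prevalence of symptom k in class m."""
    prob = 1.0
    for y_k, p_km in zip(y, p_m):
        prob *= p_km if y_k else (1.0 - p_km)
    return prob

def sensitivity_specificity(p_nondep, p_dep, is_positive):
    """Sum class-conditional probabilities over the diagnosed patterns.

    is_positive(y) -> bool is a stand-in for the diagnostic rule that
    defines the set R (assumption: it depends only on the pattern y).
    """
    se = 0.0              # P(diagnosed | depressed class)
    false_positive = 0.0  # P(diagnosed | non-depressed class)
    for y in product((0, 1), repeat=len(p_dep)):
        if is_positive(y):
            se += class_pattern_prob(y, p_dep)
            false_positive += class_pattern_prob(y, p_nondep)
    return se, 1.0 - false_positive
```

With K = 10 symptom groups there are only 1024 patterns, so full enumeration is cheap.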

Predictive Values
Positive and negative predictive values are transformations of sensitivity (SE) and specificity (SP), together with the class prevalences $\pi$, via Bayes' theorem.
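The slide's formulas were not preserved, but for a two-class model the standard Bayes' theorem expressions are $PPV = \dfrac{SE \cdot \pi_2}{SE \cdot \pi_2 + (1 - SP) \cdot \pi_1}$ and $NPV = \dfrac{SP \cdot \pi_1}{SP \cdot \pi_1 + (1 - SE) \cdot \pi_2}$, where $\pi_2$ is the prevalence of the depressed class and $\pi_1 = 1 - \pi_2$. A minimal sketch with a hypothetical function name:

```python
def predictive_values(se, sp, prevalence):
    """Return (PPV, NPV) from sensitivity, specificity and prevalence.

    prevalence is the proportion in the depressed class (pi_2 in the
    two-class model); 1 - prevalence plays the role of pi_1.
    Assumes the test yields at least some positives and some negatives,
    so neither denominator is zero.
    """
    true_pos = se * prevalence
    false_pos = (1.0 - sp) * (1.0 - prevalence)
    true_neg = sp * (1.0 - prevalence)
    false_neg = (1.0 - se) * prevalence
    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv
```

Because prevalence enters both formulas, the same SE and SP give very different predictive values in a rare-disease population than in a clinical sample.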

Class assignment?
Complication: the latent class model provides us with “posterior probabilities” of class membership.
We don’t know the true latent classes, η, for individuals in the dataset.
Example: M = 3
– Posterior probabilities of class membership for a particular symptom pattern are 0.48, 0.48, and 0.04 (the three must sum to 1).
– To which class should this individual be assigned?
– How do we account for the uncertainty in the assignment?

One Approach to Class Assignment
“Pseudo-classes” (Maximum Likelihood)
– Assign individuals to “pseudo-classes” based on posterior probability of class membership (Bandeen-Roche et al., 1997)
– Recall that the posterior probability is based on the observed pattern
– E.g. an individual with posterior probabilities 0.20, 0.05, 0.75 has a better chance of being in class 3, but is not necessarily in class 3
Using the class assignment, we can calculate sensitivity and specificity.
We can repeat the assignment procedure T times, where T is large.
On average, the sensitivity and specificity estimates will be correct.
Drawback: we don’t get the precision associated with the estimates.
– The standard deviation of the repeated estimates does not account for imprecision in the estimates of p and $\pi$.
– Confidence intervals based on the T repeats will be too narrow.
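The pseudo-class procedure can be sketched as repeated random assignment followed by averaging. This is an illustration for the two-class case under stated assumptions: `posteriors[i]` holds the fitted posterior probabilities for individual i, `dsm_positive[i]` is that individual's DSM diagnosis, and the function name is hypothetical.

```python
import random

def pseudo_class_estimates(posteriors, dsm_positive, T=1000, seed=0):
    """Average sensitivity and specificity over T pseudo-class assignments.

    posteriors[i]   : [P(class 1 | y_i), P(class 2 | y_i)], class 2 = depressed
    dsm_positive[i] : True if individual i meets the diagnostic criteria
    Repeats in which a class happens to be empty are skipped.
    """
    rng = random.Random(seed)
    se_values, sp_values = [], []
    for _ in range(T):
        tp = fn = tn = fp = 0
        for post, positive in zip(posteriors, dsm_positive):
            # Random assignment according to the posterior probability.
            depressed = rng.random() < post[1]
            if depressed and positive:
                tp += 1
            elif depressed:
                fn += 1
            elif positive:
                fp += 1
            else:
                tn += 1
        if (tp + fn) > 0 and (tn + fp) > 0:
            se_values.append(tp / (tp + fn))
            sp_values.append(tn / (tn + fp))
    if not se_values:
        raise ValueError("no repeat produced both classes")
    return sum(se_values) / len(se_values), sum(sp_values) / len(sp_values)
```

The spread of `se_values` across repeats reflects only assignment uncertainty, with p and π held fixed at their estimates, which is why intervals built from it are too narrow.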

MCMC Approach to Class Assignment
η is a vector of parameters.
At each iteration of the Gibbs sampler, each parameter is drawn from its conditional distribution.
– At each iteration, individuals are automatically assigned to classes, so there is no need to assign them “manually.”
For each of the W iterations of the chain, we can calculate sensitivity and specificity.
Sensitivity and specificity are simply additional parameters.
– Due to the nature of the MCMC approach, the standard deviation of the posterior draws of sensitivity serves as its standard error.
– Precision estimates for sensitivity and specificity are valid.
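Treating sensitivity and specificity as functions of the sampled η can be sketched as follows. This assumes the sampler's class assignments have been saved per iteration as `eta_draws[w][i]`, with classes coded 0 (non-depressed) and 1 (depressed); the function names and the crude percentile interval are illustrative choices, not taken from the source.

```python
import math
import statistics

def sens_spec_draws(eta_draws, dsm_positive, depressed_class=1):
    """Compute sensitivity and specificity at every saved MCMC iteration.

    eta_draws[w][i] : sampled class of individual i at iteration w
    dsm_positive[i] : True if individual i meets the diagnostic criteria
    Iterations in which either group is empty are skipped.
    """
    se_draws, sp_draws = [], []
    for eta in eta_draws:
        n_dep = sum(1 for e in eta if e == depressed_class)
        n_non = len(eta) - n_dep
        tp = sum(1 for e, d in zip(eta, dsm_positive)
                 if e == depressed_class and d)
        tn = sum(1 for e, d in zip(eta, dsm_positive)
                 if e != depressed_class and not d)
        if n_dep > 0 and n_non > 0:
            se_draws.append(tp / n_dep)
            sp_draws.append(tn / n_non)
    return se_draws, sp_draws

def summarize(draws):
    """Posterior mean, standard deviation, and a crude 95% interval
    taken from the sorted draws (needs at least two draws)."""
    ordered = sorted(draws)
    n = len(ordered)
    lower = ordered[int(0.025 * (n - 1))]
    upper = ordered[math.ceil(0.975 * (n - 1))]
    return statistics.mean(ordered), statistics.stdev(ordered), (lower, upper)
```

Because p and π are redrawn at every iteration along with η, the spread of these draws carries the parameter uncertainty that the pseudo-class approach leaves out.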

Operating Characteristics of Depression Diagnoses
Several definitions of depression:
– DSM-III
– DSM-IV
– ICD-10a (mild)
– ICD-10b (moderate)
– ICD-10c (severe)
We calculate sensitivity and specificity for each of the five diagnoses (above) for models with M = 2 and M = 3.
We do the same for PPV and NPV.
Vertical lines represent 95% posterior intervals.

Interpreting results from the three class model
Diagnoses only have two possibilities: depressed or not depressed.
The two class model also has two possibilities.
– The three class model has a non-depressed class and two depression classes (mild and severe).
Should we think of BOTH depression classes or just SEVERE as the “treatment class”?
Why does it matter?
– Clinical decision making
– “Pre-clinical” depression?
Which is better?

Misclassification probabilities for identifying “severe depression” using the DSM-IV criteria

                        Two-class model    Three-class model
P(false positive)       <
P(false negative)
P(misclassification)

Misclassification probabilities for identifying “any depression” using the DSM-IV criteria

                        Two-class model    Three-class model
P(false positive)       <
P(false negative)
P(misclassification)

Revisiting questions…
Recall that the three class model was chosen over the two class model as more appropriate.
We answer the questions posed earlier by examining the agreement of DSM-IV and the three class model.

What types of individuals are the diagnostic criteria diagnosing as depressed?
DSM-IV tends to diagnose individuals who are in “class 3” of the three class model (i.e. our severe depression class).
The mildly depressed class tends to be ignored.
Not necessarily a bad thing:
– DSM criteria are developed for deciding treatment.
– If mild depression does not require any treatment, then the DSM-IV diagnosis is adequate.
But what if:
– Class 2 individuals (i.e. mildly depressed) would benefit from treatment?
– Class 2 is a “pre-clinical” class, so that intervention could prevent transition to severe depression?

How often are individuals misdiagnosed?
Assuming that diagnosis of severely depressed individuals is the intent of DSM-IV, there is a LOW probability of misclassification: P(misclassification) =
If the intent is to diagnose ANY depression (i.e., mild or severe), then there is a much higher probability of misclassification: P(misclassification) =
(Note that of these 8%, almost all are false negatives.)

What are the implications of a positive or negative diagnosis?
The DSM-IV has high PPV for severe depression: PPV(3) ≈ 0.90
High NPV for no depression: NPV(1) ≈ 0.90
Essentially no information is provided as to an individual’s likelihood of mild depression given either a negative or a positive diagnosis: PPV(2) ≈ 0.10, NPV(2) ≈ 0.10

Issues and Concerns
Operating characteristics assume that the two types of diagnosis being compared are determined independently.
– Methods of assessment are different
– But there is a large overlap of symptoms
– Possibly/probably not truly independent
Conditional independence of tests given simply presence or absence of disease is a common problem.
– Tests may be independent given a “continuum” level of disease, but not when disease status is simply categorized.
– However, the latent class model does not definitively assign individuals to classes. Instead, a posterior probability is estimated.
– Because individuals are assigned posterior probabilities, we can more easily think of a “continuum” of disease.
– This is true even in the case of classes which are not ordinal in nature, because the posterior probabilities for each class will be continuous.

Conclusions
DSM-IV appears to be a valid approach for diagnoses of “severe” depression.
There appears to be another class of milder depression that is not identified by any of the depression definitions.
By using an MCMC approach to latent class model estimation, we can estimate operating characteristics of tests and their standard errors in a straightforward way.
This approach can be used quite generally for other medical diagnoses:
– Psychiatric diagnoses
– Arthritis