Presentation is loading. Please wait.

Presentation is loading. Please wait.

Michael R. Elliott2, Xiaobi (Shelby) Huang1, Sioban Harlow3

Similar presentations


Presentation on theme: "Michael R. Elliott2, Xiaobi (Shelby) Huang1, Sioban Harlow3"— Presentation transcript:

1 Bayesian Change Point Models for Analysis of Menstrual Diary Data at the Approach of Menopause
Michael R. Elliott2, Xiaobi (Shelby) Huang1, Sioban Harlow3 1. Genzyme, a Sanofi Company 2. Department of Biostatistics, University of Michigan 3. Department of Epidemiology, University of Michigan Modern Model Methods 2016

2

3 Introduction Goal for women’s menstrual studies: identify associations between women’s menstrual characters and women’s health Why? Menstrual cycles are the most easily observed markers of ovarian functions. Alterations in bleeding are a significant source of gyne- cologic morbidity, especially in late reproductive life. Menopausal transition is a period of critical change in women's biology and health status.

4 How do we define and identify ONSET of the transition?
1981: Metcalf and Livesey. Pituitary–ovarian function in normal women during the menopausal transition. 1994: Brambilla et al. Defining the Perimenopause for Application in Epidemiologic Investigations. 2000: Mitchell et al. Three stages of the menopausal transition from the Seattle Midlife Women's Health Study: Toward a more precise definition. 2001: Soules et al. Executive summary: Stages of reproductive aging workshop (STRAW). 2002: Taffe and Dennerstein. Menstrual patterns leading to the final menstrual period. 2007: The ReSTAGE Collaboration. Recommendations from a multi-study evaluation of proposed criteria for Staging Reproductive Aging.

5 Visual browsing of menstruation patterns.
Previous Approaches Visual browsing of menstruation patterns. Summary statistics of sliding windows over age. Linear mixed model. Major problem: lack of precision - Traditional longitudinal models tend to underutilize information from subject-level in clinical and epidemiological research settings, at least in part because of the lack of methods for such analyses. Tranditional longitudinal models tend to underutilize information from subject-level in clinical and epidemiological research settings, at least in part because of the lack of methods for such analyses.

6 Goals Initial goal: compare menstrual pattern changes between two generations of women Subsequent goals: Model how menstrual cycle length and variability change when women approach menopause. Develop method to impute various types of missingness. Find potential biomarkers for women’s menopausal transition. Define subgroups of menstruation patterns. Our initial goal is to… However, we only have one generation’ s data… so we are more focused on the subsequent goals at this time and the model will be compatible when we have 2nd generation’s data.

7 Outline TREMIN Trust Data Bayesian Changepoint Model Missing Data Imputation Menstruation Patterns

8 TREMIN Trust Data TREMIN:
Ongoing 70 year longitudinal menstrual calendar study Initiated by Dr. Alan Treloar of University of Minnesota in 1934 Cohort I: , 2350 U. Minnesota undergraduates Cohort II: , 1367 U. Minnesota undergraduates One of the only two data sets worldwide for individual women’s menstrual diary data across their reproductive life span.

9 Data in analysis 2350 Women in TREMIN Cohort I
<25 years old at enrollment, participated >5 yrs, have at least 15 consecutive segments after 35 yrs old 735 Women Used hormone < 4 yrs, no gynecological surgery before 40 yrs old 617 Women In Analysis; Each Has Observations 105 (17.0%) with complete data 313(50.7%) have observed final menstruation periods (FMP)

10 Missing due to hormone use
Missing Data Missing due to hormone use Hysterectomy or bilateral oophorectomy surgeries Non-reporting or withdrawal from the study Non- menstrual intervals are not treated as missing: Pregnancy intervals First two cycles after a birth First cycle after a spontaneous abortion

11 Four Typical Women in TREMIN Cohort
-Blue line: cycle lengths (on log scale). -Black dot (●): Observed FMP. -Red dot (●): Truncated by surgery. -Green bars (||): Pregnancy interval. -Red bars (||): Missing due to hormone use. -Black bars (||)/circle (○): Intermittent missingness due to nonreporting.

12 Outline TREMIN Trust Data Bayesian Changepoint Model Missing Data Imputation Menstruation Patterns Subgroups of Menstruation Patterns

13 Patterns of Menstruation Cycle Lengths
Regular cycling Premenopausal irregularity (plot form Lisabeth et al. 2004)

14 Thoughts of Modeling Common pattern: how menstrual cycle length changes over age Variability has the same pattern Despite the overall pattern, individual women have their unique change points, intercepts and slopes

15 Bayesian Changepoint Model for Mean and Variance
Subject Level: Population Level: Some notations: - ith subject’s tth cycle length. - age of ith subject’s tth menstruation cycle. - covariates of ith subject.

16 Inference Joint posterior distribution:

17 Outline TREMIN Trust Data Bayesian Changepoint Model Missing Data Imputation Menstruation Patterns Subgroups of Menstruation Patterns

18 Imputation of Missing Data - Complexities
Large amount of missingness Various reasons of missing: hormone, surgery, loss of follow up Cycle lengths and ages should match When to stop if FMP was not observed? How to impute FMP?

19 Imputation Procedure Step 1: Obtain initial parameters from complete data analysis: subjects with complete cases, assign subjects with missing data, draw Step 2: Impute the missing data using : Imputation draws are from the model prediction: Update ages and cycle lengths together:

20 Imputation: How to fill the missing gaps
Age End Start (L) Original data Imputed age Imputed cycle length (year) (L’) Imputation Cut the last segment length to fit the gap length Adjusted imputed age Adjusted imputed cycle length (year) of one set Adjusting Find 50 sets of imputations and perform importance sampling 20

21 Imputation: Final Menstruation Periods
If FMPs are not observed: impute and update the data until imputed FMP or when , whichever happens first. Model the age at FMP as a piecewise exponential distribution with hazard , for Knots are set at one year or 0.5 year gaps between age 40 and 60, assuming the risk of having FMP before age 40 is zero. Find the probability of FMP occurring between time interval , given the event has not occurred before

22 Imputation: Gaps till FMPs
Every time after a segment is imputed, draw a bernoulli variable to judge whether it is the final menstruation period. If any imputed cycle is longer than 365 days or an imputed age is larger than 60, stop imputing and treat the corresponding age as FMP. Censoring FMP Age 48.0 48.07 48.16 48.28 52.3 W=0 W=0 W=0 W=1

23 Imputation Procedure Step 3: Update parameters using Gibbs steps based on the imputed data set we obtained in step 2. Step 4: Using the updated parameters in 3 to impute another imputed data set using method stated in step 2. Step 5: Repeat step 3 and 4 for many times until we obtain converged MCMC chains.

24 Posterior Model Check Convergence: Model fit:
Two MCMC chains with different starting values;10,000 iterations each after “burn-in”. Gelman and Rubin statistic: 99.2% individual level parameters and all population level parameters achieved convergence. Model fit: Posterior predictive Chi-square test for cycle lengths. Compare observed FMP with replicated FMPs.

25 Outline TREMIN Trust Data Bayesian Changepoint Model Missing Data Imputation Menstruation Patterns Subgroups of Menstruation Patterns

26 Results: Individual Level Parameters
Histogram of

27 Individual Level Parameters
Posterior mean and associated 95% posterior predictive interval of the cycle length mean and the upper and lower 2.5 percentiles for the cycle distribution:

28 Population Level Parameters
Posterior mean and 95% predictive intervals for mean population level parameters :

29 Menstruation Pattern Characteristics
Mean cycle length declines slightly until changepoint, then increases rapidly. Cycle lengths are stable on average until change- point, then variability explodes. Variability begins increasing well in advance (3 years) of longer cycle lengths.

30 Population Level Parameters
Posterior mean for correlations: Mean intercept Mean slope before change-point slope after change- point Change-point Log-Var intercept Log-Var Point Var 1 -0.13 -0.01 0.29 0.17 -0.14 0.27 Mean slope before changepoint -0.02 0.00 -0.07 0.03 -0.00 0.01 Mean slope after changepoint 0.25 0.08 -0.08 0.33 0.24 Changepoint for mean 0.15 -0.25 0.43 0.79 Log-Variance intercept -0.69 0.44 0.02 Log-Variance slope before changepoint -0.74 0.09 Log-Variance slope after changepoint 0.34 Variance changepoint

31 Correlations Among Characteristics
Later change points for variance are highly associated with later change points for mean. Later change points for both mean and variance are also correlated with longer and more variable segment lengths, and more rapid increases in mean and variance after the change point; consequently mean and variance slopes after change points are positively correlated. Greater mean length at age 35 is associated with greater declines in variability before the variance change point and greater increases in variability after. Larger segment variability is associated with longer mean segment length. Larger segment variability is highly associated with more rapid declines in variability before but larger increases in variability after the variance change point: thus change in variability before and after the variance change point is negatively correlated.

32 Menstruation Patterns and Menopause
Accelerated failure time model with gaussian link: Age of FMP ~ pattern parameters Women with late menopause have: Later changepoints Smaller variance of cycle lengths at age of 35 Less rapid decrease in variance of cycle lengths before changepoints Less rapid increase in mean and variance of cycle lengths after changepoints Less abrupt changes of variance slopes before and after changepoints

33 Publication and Related Work
Publication of this work: "Modeling Menstrual Cycle Length and Variability at the Approach of Menopause Using Bayesian Changepoint Model," X. Huang, S. D. Harlow, M. R. Elliott, 2014, Journal of the Royal Statistical Society C: Applied Statistics, 63(3): Comparing changepoints to previously defined transition markers. Publication: "Distinguishing 6 Population Subgroups by Timing and Characteristics of the Menopausal Transition," X. Huang, S. D. Harlow, M. R. Elliott, American Journal of Epidemiology, 175(1): Include data from cohort II and study the difference of women’s menstruation patterns between cohort I and cohort II.

34 Acknowledgement Grant R01HD from the National Institute of Child Health and Development. Data from TREMIN Trust.

35 Thank You!

36 Additional Literatures
1987: Davidian and Caroll. Variance function estimation. 2000: Harlow et al. Analysis of menstrual diary data across the reproductive life span: Application of the bipartite model approach and the importance of within-woman variance. 2001: Thum and Bhattacharya. Detecting a change in school performance: a Bayesian analysis for a multilevel joint point problem. 2003: Hall et al. Bayesian and profile likelihood changepoint methods for modeling cognitive function over time. 2004: Lisabeth et al. A new statistical approach demonstrated menstrual patterns during the menopausal transition did not vary by age at menopause 2007: Crainiceanu et al. Spatially adaptive Bayesian penalized splines with heteroscedastic errors.

37 Appendix: Gibbs Sampling
are the corresponding part of prior multivariate normal mean and covariance matrix conditional on other parameters

38 Gibbs Sampling - Continue

39 Gibbs Sampling - Continued

40 Gibbs Sampling - Continued

41 Appendix – Survival Model of FMPs
Assume that last observed ages of all subjects are from piecewise exponential distribution Use prior: The posterior distribution is

42 Appendix – Predict FMPs
The cumulative hazard and survival function: Conditional and unconditional distribution of FMP occurrence by time

43 Posterior Predictive Model Check -Cycle Length
Posterior predictive Chi-square test: Created histogram of p-values of Chi-square tests for all subjects, each test based on 200 replications.

44 Observed and Predicted FMPs

45 Observed and Imputed FMPs
- observed FMP - imputed FMP and 95% predictive interval x - age at censoring To consider the appropriateness of the final menstrual period modeling, we plot the observed and predicted FMPs together with the censoring ages for 100 randomly selected women in Figure 6. The method for estimating FMP when not observed appears to have worked well, with the distribution for the predicted FMPs corresponding closely to the observed FMPs when the censoring age is relatively early and little information is usually available to predict FMP.

46 Posterior Model Check – FMPs
Replicate imputations for FMPs for subjects with observed FMPs Compare each observed FMPs with corresponding 200 draws of predicted FMPs Histogram of proportion of mean(FMPrep) > Observed FMP For subjects with observed FMPs, we imputed their cycle lengths from the beginning to get their predicted FMPs using the method described previously. We then compared the each subject's observed FMP to corresponding 200 replicated FMPs and summarized the proportion of mean replicated FMPs Larger than oserved FMP in Figure 5.

47 Changepoints

48 Principle Component Analysis of Pattern Measures

49 Sensitivity Analysis


Download ppt "Michael R. Elliott2, Xiaobi (Shelby) Huang1, Sioban Harlow3"

Similar presentations


Ads by Google