Study Context:
I am studying the relationship between a biomarker of cellular aging (telomere length) and menopause.
We have 2 datapoints for a subset of the women. Therefore, our participants could fall into the following categories:
- Pre-menopausal at baseline and post-menopausal at follow-up
- Post-menopausal at baseline and post-menopausal at follow-up
- Only one measurement (baseline)
- Only one measurement (follow-up)
We also have the timing of menopause, and when the samples were taken. Thus, one of our questions is "Does going through menopause (and how long since) affect the rate of change in telomere length between baseline and follow-up?"
Modeling:
I've come across 3 approaches that might work.
- Autoregression
- Difference Score
- Linear mixed effects model (with participant as a random factor).
I've been convinced about the advantages of linear mixed effects models to model change over time. However, our data only has 2 time points. My understanding from Singer and Willett 2003 is that fitting LMEs to 2 time points doesn't really make sense (because the intra-individual correlation will always be 1 and the error 0). For this reason, I've sort of abandoned thoughts of using LMEs.
I found good introductions to autoregression and difference-score methods here. This led me to believe that the difference-score approach is superior, because it attempts to model questions about intraindividual change (as we trying to do).
However, there is published work showing that 'controlling' for baseline telomere length in delta models (autoregression and difference-score) overestimates non-zero estimates of measurement error and can lead to false positives (if the baseline values are already quite different). Based on this study, LMEs seem to perform well.
Stats questions:
- Are LMEs a bad choice to model data with only two time points like this?
- If LMEs are acceptable, how would I model the timing of menopause relative to the first and second measurement? Because each participant has 2 rows, I assume I would need to factor the timing of menopause into the model for both rows somehow.
- If LMEs are not acceptable, is there a way to model deltas (change) without including baseline measurement in the model, which appears to bias estimates.
I've used nlme and lmer, so bonus for helping me figure this out using R!