ANCOVA correction for regression to the mean in a repeated-measures wellness monitoring system — is this sufficient?
I have a consumer health monitoring system where users take blood tests every 4-12 weeks and get health scores. Classic selection bias: users who start monitoring because they feel unwell have worse baselines. On retest, scores improve even without intervention (regression to the mean).
My proposed correction: ANCOVA-based: Corrected_gain = Observed_gain - (1 - r_test_retest) × (Baseline - Population_mean)
Where r_test_retest is the ICC for each health domain score (estimated from pilot repeated-measures data).
Questions:
- Is ANCOVA sufficient here, or does Lord's paradox apply? (The "treatment" isn't randomized — users self-select into a lifestyle program.)
- Should I use the population mean from my reference dataset (N=7,840 general population) or the mean of my user cohort (biased toward health-conscious)?
- In the user-facing UI: I plan to show the trend with a caveat ("Your improvement trend becomes more reliable after 2-3 test cycles") rather than suppressing it. Is this honest, or is it misleading for a consumer audience?
- After how many test cycles does the regression effect become negligible for practical purposes? My gut says 2-3, but I'd like a citation or formula.