Voß Sabrina, Hoyer Annika, Brinks Ralph
Chair for Medical Biometry and Epidemiology, Faculty of Health/School of Medicine, Witten/Herdecke University, Witten, Germany.
Biostatistics and Medical Biometry, Medical School OWL, Bielefeld University, Bielefeld, Germany.
PLoS One. 2025 May 14;20(5):e0321924. doi: 10.1371/journal.pone.0321924. eCollection 2025.
Usually, age-specific incidence rates of chronic diseases are estimated from longitudinal studies that follow participants over time and record incident cases. However, these studies can be cost- and time-expensive and are prone to loss to follow up. An alternative method allows incidence estimation based on aggregated data from (cross-sectional) prevalence and mortality studies using relations between incidence, prevalence and mortality described by the illness-death model and a related partial differential equation. Currently, adequate options for the assessment of the accuracy of the achieved incidence estimates are missing and bootstrap resampling methods are used instead. Therefore, we developed novel ways to estimate incidence rates based on the maximum likelihood principle with corresponding confidence intervals. Historical data about breathlessness in British coal miners and diabetes in Germany are used to illustrate the applicability of this method in scenarios with non-differential and differential mortality. We have two scenarios of available data in the case of differential mortality: mortality of diseased and all-cause mortality, or all-cause mortality and mortality rate ratio. Our results show that estimation of incidence rates and corresponding confidence intervals of chronic conditions based on aggregated data with the maximum likelihood method using a binomial likelihood function is possible and can replace resampling techniques.
通常,慢性病的年龄特异性发病率是通过纵向研究估计得出的,这些研究对参与者进行长期跟踪并记录新发病例。然而,这些研究可能成本高昂且耗时,并且容易出现失访情况。另一种方法是基于(横断面)患病率和死亡率研究的汇总数据,利用疾病-死亡模型及相关偏微分方程所描述的发病率、患病率和死亡率之间的关系来估计发病率。目前,缺乏评估所得发病率估计准确性的适当方法,因此使用了自助重采样方法。所以,我们基于最大似然原理开发了新的发病率估计方法及相应的置信区间。以英国煤矿工人的呼吸困难情况和德国的糖尿病情况的历史数据为例,说明该方法在非差异死亡率和差异死亡率情况下的适用性。在差异死亡率的情况下,我们有两种可用数据场景:患病死亡率和全因死亡率,或全因死亡率和死亡率比。我们的结果表明,使用二项式似然函数,基于汇总数据通过最大似然法估计慢性病的发病率及相应置信区间是可行的,并且可以替代重采样技术。