German Federal Institute for Risk Assessment, Max-Dohrn-Str. 8-10, Berlin, 10589, Germany.
University of Veterinary Medicine Hannover, Foundation, Bünteweg 2, Hannover, 30559, Germany.
BMC Public Health. 2020 Jul 20;20(1):1135. doi: 10.1186/s12889-020-09177-4.
Various methods exist for statistical inference about a prevalence that consider misclassifications due to an imperfect diagnostic test. However, traditional methods are known to suffer from truncation of the prevalence estimate and the confidence intervals constructed around the point estimate, as well as from under-performance of the confidence intervals' coverage.
In this study, we used simulated data sets to validate a Bayesian prevalence estimation method and compare its performance to frequentist methods, i.e. the Rogan-Gladen estimate for prevalence, RGE, in combination with several methods of confidence interval construction. Our performance measures are (i) error distribution of the point estimate against the simulated true prevalence and (ii) coverage and length of the confidence interval, or credible interval in the case of the Bayesian method.
Across all data sets, the Bayesian point estimate and the RGE produced similar error distributions with slight advantages of the former over the latter. In addition, the Bayesian estimate did not suffer from the RGE's truncation problem at zero or unity. With respect to coverage performance of the confidence and credible intervals, all of the traditional frequentist methods exhibited strong under-coverage, whereas the Bayesian credible interval as well as a newly developed frequentist method by Lang and Reiczigel performed as desired, with the Bayesian method having a very slight advantage in terms of interval length.
The Bayesian prevalence estimation method should be prefered over traditional frequentist methods. An acceptable alternative is to combine the Rogan-Gladen point estimate with the Lang-Reiczigel confidence interval.
存在多种用于统计推断患病率的方法,这些方法考虑了由于诊断测试不完善而导致的误诊。然而,传统方法存在点估计和置信区间截断的问题,并且置信区间的覆盖性能也较差。
在本研究中,我们使用模拟数据集验证了一种贝叶斯患病率估计方法,并将其性能与频率派方法(即 Rogan-Gladen 患病率估计,RGE)进行了比较,同时还比较了几种置信区间构建方法。我们的性能衡量标准是(i)点估计对模拟真实患病率的误差分布,以及(ii)置信区间或贝叶斯方法中的可信区间的覆盖范围和长度。
在所有数据集上,贝叶斯点估计和 RGE 产生了相似的误差分布,前者略优于后者。此外,贝叶斯估计不会受到 RGE 在零或单位处截断的问题的影响。就置信区间和可信区间的覆盖性能而言,所有传统的频率派方法都表现出严重的低估,而贝叶斯可信区间以及 Lang 和 Reiczigel 新开发的频率派方法则表现良好,贝叶斯方法在区间长度方面略有优势。
应优先选择贝叶斯患病率估计方法而不是传统的频率派方法。一个可接受的替代方法是将 Rogan-Gladen 点估计与 Lang-Reiczigel 置信区间相结合。