Effect of dependent errors in the assessment of diagnostic or screening test accuracy when the reference standard is imperfect.

Affiliation

Clinical Epidemiology and Biostatistics, McMaster University, Hamilton, Ontario, Canada.

Publication information

Stat Med. 2012 May 20;31(11-12):1129-38. doi: 10.1002/sim.4444. Epub 2012 Feb 21.

Abstract

When no gold standard is available to evaluate a diagnostic or screening test, as is often the case, an imperfect reference standard test must be used instead. Furthermore, the errors of the test and its reference standard may not be independent. Some authors have opined that positively dependent errors will lead to overestimation of test performance. Although positive dependence does increase agreement between the test and the reference standard, it is not clear if test accuracy will necessarily be overestimated in this situation, and the case of negatively associated test errors is even less clear. To examine this issue in more detail, we derive the apparent sensitivity, specificity, and overall accuracy of a test relative to an imperfect reference standard and the bias in these parameters. We demonstrate that either positive or negative bias can occur if the reference standard is imperfect. The type and magnitude of bias depend on several components: the disease prevalence, the true test sensitivity and specificity, the covariance between the false-negative test errors among the true disease cases, and the covariance between the false-positive test errors among the true noncases. If, for example, sensitivity and specificity are 0.8 for both the test and reference standard and the errors have a moderate positive dependence, test sensitivity is then underestimated at low prevalence but overestimated at high prevalence, while the opposite occurs for specificity. We illustrate these ideas through general numerical calculations and an empirical example of screening for breast cancer with magnetic resonance imaging and mammography.
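The abstract does not reproduce the paper's formulas, so the following is only a minimal sketch of the kind of calculation it describes, under a standard conditional-dependence parameterization. The function name `apparent_accuracy`, the argument names, and the covariance value 0.08 (half the maximum possible when both sensitivities are 0.8) are illustrative assumptions and are not taken from the paper. Here `cov_d` is the covariance between the two tests' errors among true cases, and `cov_n` is the covariance among true noncases.

```python
def apparent_accuracy(prev, se_t, sp_t, se_r, sp_r, cov_d=0.0, cov_n=0.0):
    """Apparent sensitivity and specificity of test T judged against an
    imperfect reference R (illustrative parameterization, not the paper's code).

    prev         : true disease prevalence
    se_t, sp_t   : true sensitivity and specificity of the test
    se_r, sp_r   : true sensitivity and specificity of the reference
    cov_d, cov_n : error covariance among true cases and true noncases
    """
    # Joint probabilities among true cases
    both_pos_d = se_t * se_r + cov_d               # T+ and R+ (both correct)
    both_neg_d = (1 - se_t) * (1 - se_r) + cov_d   # T- and R- (both false negative)
    # Joint probabilities among true noncases
    both_pos_n = (1 - sp_t) * (1 - sp_r) + cov_n   # T+ and R+ (both false positive)
    both_neg_n = sp_t * sp_r + cov_n               # T- and R- (both correct)

    # Marginal probability that the reference is positive or negative
    ref_pos = prev * se_r + (1 - prev) * (1 - sp_r)
    ref_neg = 1 - ref_pos

    app_se = (prev * both_pos_d + (1 - prev) * both_pos_n) / ref_pos  # P(T+ | R+)
    app_sp = (prev * both_neg_d + (1 - prev) * both_neg_n) / ref_neg  # P(T- | R-)
    return app_se, app_sp


# Scenario from the abstract: all four accuracy parameters equal 0.8,
# with an assumed moderate positive covariance of 0.08 in both groups.
for prev in (0.01, 0.5, 0.99):
    app_se, app_sp = apparent_accuracy(prev, 0.8, 0.8, 0.8, 0.8, 0.08, 0.08)
    print(f"prevalence={prev:.2f}  apparent Se={app_se:.3f}  apparent Sp={app_sp:.3f}")
```

Under these assumed values, the apparent sensitivity falls below the true 0.8 at low prevalence and rises above it at high prevalence, with the reverse pattern for specificity, which matches the direction of bias the abstract reports.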

Similar articles

Bias in estimating accuracy of a binary screening test with differential disease verification.

Stat Med. 2011 Jul 10;30(15):1852-64. doi: 10.1002/sim.4232. Epub 2011 Apr 15.

Bias due to composite reference standards in diagnostic accuracy studies.

Stat Med. 2016 Apr 30;35(9):1454-70. doi: 10.1002/sim.6803. Epub 2015 Nov 10.

Some issues in resolution of diagnostic tests using an imperfect gold standard.

Stat Med. 2001 Jul 15;20(13):1987-2001. doi: 10.1002/sim.819.

Diagnostic accuracy of mammography, clinical examination, US, and MR imaging in preoperative assessment of breast cancer.

Radiology. 2004 Dec;233(3):830-49. doi: 10.1148/radiol.2333031484. Epub 2004 Oct 14.

Magnetic resonance imaging: the evolution of breast imaging.

Breast. 2013 Aug;22 Suppl 2:S77-82. doi: 10.1016/j.breast.2013.07.014.

A Bayesian approach to simultaneously adjusting for verification and reference standard bias in diagnostic test studies.

Stat Med. 2010 Oct 30;29(24):2532-43. doi: 10.1002/sim.4018.

Prevalence of latex allergy may be vastly overestimated when determined by in vitro assays.

Ann Allergy Asthma Immunol. 2000 Jun;84(6):628-32. doi: 10.1016/S1081-1206(10)62415-5.

MRI and mammography surveillance of women at increased risk for breast cancer: recommendations using an evidence-based approach.

Acad Radiol. 2008 Dec;15(12):1590-5. doi: 10.1016/j.acra.2008.06.006.

Estimation of test sensitivity and specificity when disease confirmation is limited to positive results.

Epidemiology. 1999 Jan;10(1):67-72.

Cited by

Utilizing a capture-recapture strategy to accelerate infectious disease surveillance.

Ann Appl Stat. 2024 Dec;18(4):3130-3145. doi: 10.1214/24-aoas1927. Epub 2024 Oct 31.

Detection and measurements of apical lesions in the upper jaw by cone beam computed tomography and panoramic radiography as a function of cortical bone thickness.

Clin Oral Investig. 2019 Nov;23(11):4067-4073. doi: 10.1007/s00784-019-02843-x. Epub 2019 Feb 22.

Different latent class models were used and evaluated for assessing the accuracy of campylobacter diagnostic tests: overcoming imperfect reference standards?

Epidemiol Infect. 2018 Sep;146(12):1556-1564. doi: 10.1017/S0950268818001723. Epub 2018 Jun 27.

Biomarker validation with an imperfect reference: Issues and bounds.

Stat Methods Med Res. 2018 Oct;27(10):2933-2945. doi: 10.1177/0962280216689806. Epub 2017 Feb 6.

Personalized prostate cancer screening among men with high risk genetic predisposition- study protocol for a prospective cohort study.

BMC Cancer. 2014 Jul 21;14:528. doi: 10.1186/1471-2407-14-528.

Estimation of diagnostic test accuracy without full verification: a review of latent class methods.

Stat Med. 2014 Oct 30;33(24):4141-69. doi: 10.1002/sim.6218. Epub 2014 Jun 9.

Using a web-based application to define the accuracy of diagnostic tests when the gold standard is imperfect.

PLoS One. 2013 Nov 12;8(11):e79489. doi: 10.1371/journal.pone.0079489. eCollection 2013.
