对具有差异疾病验证的二项筛选试验准确性的估计存在偏倚。

Bias in estimating accuracy of a binary screening test with differential disease verification.

机构信息

Division of Biostatistics, University of Southern California, Keck School of Medicine, Arcadia, CA 91006, U.S.A..

出版信息

Stat Med. 2011 Jul 10;30(15):1852-64. doi: 10.1002/sim.4232. Epub 2011 Apr 15.

DOI:10.1002/sim.4232

PMID:21495059

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3115446/

Abstract

Sensitivity, specificity, positive and negative predictive value are typically used to quantify the accuracy of a binary screening test. In some studies, it may not be ethical or feasible to obtain definitive disease ascertainment for all subjects using a gold standard test. When a gold standard test cannot be used, an imperfect reference test that is less than 100 per cent sensitive and specific may be used instead. In breast cancer screening, for example, follow-up for cancer diagnosis is used as an imperfect reference test for women where it is not possible to obtain gold standard results. This incomplete ascertainment of true disease, or differential disease verification, can result in biased estimates of accuracy. In this paper, we derive the apparent accuracy values for studies subject to differential verification. We determine how the bias is affected by the accuracy of the imperfect reference test, the percent who receive the imperfect reference standard test not receiving the gold standard, the prevalence of the disease, and the correlation between the results for the screening test and the imperfect reference test. It is shown that designs with differential disease verification can yield biased estimates of accuracy. Estimates of sensitivity in cancer screening trials may be substantially biased. However, careful design decisions, including selection of the imperfect reference test, can help to minimize bias. A hypothetical breast cancer screening study is used to illustrate the problem.

摘要

灵敏度、特异性、阳性预测值和阴性预测值通常用于量化二分类筛查试验的准确性。在某些研究中，使用金标准对所有受试者进行明确的疾病确定可能在伦理上不可行或不切实际。当无法使用金标准时，可以使用灵敏度和特异性均不足 100%的不完美参考测试来替代。例如，在乳腺癌筛查中，对无法获得金标准结果的女性，使用癌症诊断随访作为不完美的参考测试。这种对真正疾病的不完全确定或差异疾病验证会导致准确性的估计产生偏差。在本文中，我们推导出了受差异验证影响的研究的表观准确性值。我们确定了偏倚如何受到不完美参考测试的准确性、未接受金标准测试的接受不完美参考标准测试的百分比、疾病的流行率以及筛查测试和不完美参考测试结果之间的相关性的影响。结果表明，具有差异疾病验证的设计可能会产生有偏差的准确性估计。癌症筛查试验中灵敏度的估计可能会产生很大的偏差。然而，精心的设计决策，包括不完美参考测试的选择，可以帮助最小化偏差。使用一个假设的乳腺癌筛查研究来说明这个问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa51/3115446/d99daa49f208/nihms279636f1.jpg

相似文献

Bias in estimating accuracy of a binary screening test with differential disease verification.对具有差异疾病验证的二项筛选试验准确性的估计存在偏倚。

Stat Med. 2011 Jul 10;30(15):1852-64. doi: 10.1002/sim.4232. Epub 2011 Apr 15.

Effect of dependent errors in the assessment of diagnostic or screening test accuracy when the reference standard is imperfect.当参考标准不完善时，评估诊断或筛选试验准确性时的依赖性误差的影响。

Stat Med. 2012 May 20;31(11-12):1129-38. doi: 10.1002/sim.4444. Epub 2012 Feb 21.

Estimates of sensitivity and specificity can be biased when reporting the results of the second test in a screening trial conducted in series.在系列进行的筛检试验中报告第二项检测的结果时，敏感度和特异度的估计可能存在偏倚。

BMC Med Res Methodol. 2010 Jan 11;10:3. doi: 10.1186/1471-2288-10-3.

Bias in trials comparing paired continuous tests can cause researchers to choose the wrong screening modality.比较配对连续检测的试验中的偏倚可能会导致研究人员选择错误的筛查方式。

BMC Med Res Methodol. 2009 Jan 20;9:4. doi: 10.1186/1471-2288-9-4.

Reducing decision errors in the paired comparison of the diagnostic accuracy of screening tests with Gaussian outcomes.降低具有正态分布结果的筛检试验诊断准确性配对比较中的决策误差。

BMC Med Res Methodol. 2014 Mar 5;14:37. doi: 10.1186/1471-2288-14-37.

Use of artificial intelligence for image analysis in breast cancer screening programmes: systematic review of test accuracy.人工智能在乳腺癌筛查计划中的图像分析应用：测试准确性的系统评价。

BMJ. 2021 Sep 1;374:n1872. doi: 10.1136/bmj.n1872.

Simultaneous alleviation of verification and reference standard biases in a community-based tuberculosis screening study using Bayesian latent class analysis.基于贝叶斯潜在类别分析的社区结核病筛查研究中同时缓解验证偏倚和参考标准偏倚。

PLoS One. 2024 Jun 10;19(6):e0305126. doi: 10.1371/journal.pone.0305126. eCollection 2024.

A new method to address verification bias in studies of clinical screening tests: cervical cancer screening assays as an example.一种解决临床筛查试验研究中验证偏倚的新方法：以宫颈癌筛查试验为例。

J Clin Epidemiol. 2014 Mar;67(3):343-53. doi: 10.1016/j.jclinepi.2013.09.013. Epub 2013 Dec 12.

Estimating diagnostic accuracy of multiple binary tests with an imperfect reference standard.使用不完善的参考标准评估多个二元测试的诊断准确性。

Stat Med. 2009 Feb 28;28(5):780-97. doi: 10.1002/sim.3514.

引用本文的文献

Estimating Cancer Screening Sensitivity and Specificity Using Healthcare Utilization Data: Defining the Accuracy Assessment Interval.利用医疗保健利用数据估计癌症筛查的敏感性和特异性：定义准确性评估区间。

Cancer Epidemiol Biomarkers Prev. 2022 Aug 2;31(8):1517-1520. doi: 10.1158/1055-9965.EPI-22-0232.

Adjusting for verification bias in diagnostic accuracy measures when comparing multiple screening tests - an application to the IP1-PROSTAGRAM study.当比较多个筛查测试时，调整诊断准确性测量中的验证偏倚——以 IP1-PROSTAGRAM 研究为例。

BMC Med Res Methodol. 2022 Mar 18;22(1):70. doi: 10.1186/s12874-021-01481-w.

Diagnostic test evaluation methodology: A systematic review of methods employed to evaluate diagnostic tests in the absence of gold standard - An update.诊断测试评估方法学：系统综述在缺乏金标准的情况下用于评估诊断测试的方法 - 更新版。

PLoS One. 2019 Oct 11;14(10):e0223832. doi: 10.1371/journal.pone.0223832. eCollection 2019.

Anticipating missing reference standard data when planning diagnostic accuracy studies.在规划诊断准确性研究时预测缺失的参考标准数据。

BMJ. 2016 Feb 9;352:i402. doi: 10.1136/bmj.i402.

Exploring the Underdiagnosis and Prevalence of Autism Spectrum Conditions in Beijing.探索北京自闭症谱系障碍的诊断不足及患病率

Autism Res. 2015 Jun;8(3):250-60. doi: 10.1002/aur.1441. Epub 2015 May 6.

Personalized prostate cancer screening among men with high risk genetic predisposition- study protocol for a prospective cohort study.具有高风险遗传易感性男性的个性化前列腺癌筛查——一项前瞻性队列研究的研究方案

BMC Cancer. 2014 Jul 21;14:528. doi: 10.1186/1471-2407-14-528.

Estimation of diagnostic test accuracy without full verification: a review of latent class methods.未进行全面验证时诊断试验准确性的估计：潜在类别方法综述

Stat Med. 2014 Oct 30;33(24):4141-69. doi: 10.1002/sim.6218. Epub 2014 Jun 9.

本文引用的文献

A Bayesian approach to simultaneously adjusting for verification and reference standard bias in diagnostic test studies.贝叶斯方法在同时调整诊断测试研究中的验证偏倚和参考标准偏倚中的应用。

Stat Med. 2010 Oct 30;29(24):2532-43. doi: 10.1002/sim.4018.

BMC Med Res Methodol. 2010 Jan 11;10:3. doi: 10.1186/1471-2288-10-3.

Improving the biomarker pipeline to develop and evaluate cancer screening tests.改进生物标志物流程以开发和评估癌症筛查测试。

J Natl Cancer Inst. 2009 Aug 19;101(16):1116-9. doi: 10.1093/jnci/djp186. Epub 2009 Jul 2.

A review of solutions for diagnostic accuracy studies with an imperfect or missing reference standard.对使用不完美或缺失参考标准的诊断准确性研究的解决方案综述。

J Clin Epidemiol. 2009 Aug;62(8):797-806. doi: 10.1016/j.jclinepi.2009.02.005. Epub 2009 May 17.

BMC Med Res Methodol. 2009 Jan 20;9:4. doi: 10.1186/1471-2288-9-4.

Adjusting for verification bias in diagnostic test evaluation: a Bayesian approach.诊断试验评价中验证偏倚的校正：一种贝叶斯方法。

Stat Med. 2008 Jun 15;27(13):2453-73. doi: 10.1002/sim.3099.

Randomized trial of screen-film versus full-field digital mammography with soft-copy reading in population-based screening program: follow-up and final results of Oslo II study.基于人群的筛查项目中，屏-片乳腺摄影与全视野数字化乳腺摄影软读片的随机试验：奥斯陆II研究的随访及最终结果

Radiology. 2007 Sep;244(3):708-17. doi: 10.1148/radiol.2443061478.

Evidence of bias and variation in diagnostic accuracy studies.诊断准确性研究中的偏倚和变异证据。

CMAJ. 2006 Feb 14;174(4):469-76. doi: 10.1503/cmaj.050090.

Diagnostic performance of digital versus film mammography for breast-cancer screening.数字化乳腺摄影与传统胶片乳腺摄影在乳腺癌筛查中的诊断性能

N Engl J Med. 2005 Oct 27;353(17):1773-83. doi: 10.1056/NEJMoa052911. Epub 2005 Sep 16.

The incremental contribution of clinical breast examination to invasive cancer detection in a mammography screening program.临床乳腺检查在乳腺钼靶筛查项目中对浸润性癌检测的增量贡献。

AJR Am J Roentgenol. 2005 Feb;184(2):428-32. doi: 10.2214/ajr.184.2.01840428.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验