• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将专家小组用作参考标准时诊断试验准确性估计中偏倚的驱动因素:一项模拟研究

Drivers of bias in diagnostic test accuracy estimates when using expert panels as a reference standard: a simulation study.

作者信息

Kellerhuis B E, Jenniskens K, Schuit E, Hooft L, Moons K G M, Reitsma J B

机构信息

Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The Netherlands.

出版信息

BMC Med Res Methodol. 2025 Apr 23;25(1):106. doi: 10.1186/s12874-025-02557-7.

DOI:10.1186/s12874-025-02557-7
PMID:40269705
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12016307/
Abstract

BACKGROUND

Expert panels are often used as a reference standard when no gold standard is available in diagnostic test accuracy research. It is often unclear what study and expert panel characteristics produce the best estimates of diagnostic test accuracy. We simulated a large range of scenarios to assess the impact of study and expert panel characteristics on index test diagnostic accuracy estimates.

METHODS

Simulations were performed in which an expert panel was the reference standard to estimate the sensitivity and specificity of an index diagnostic test. Diagnostic accuracy was determined by combining probability estimates of target condition presence, provided by experts using four component reference tests, through a predefined threshold. Study and panel characteristics were varied in several scenarios: target condition prevalence, accuracy of component reference tests, expert panel size, study population size, and random or systematic differences between expert's probability estimates. The total bias in each scenario was quantified using mean squared error.

RESULTS

When estimating an index test with 80% sensitivity and 70% specificity, bias in estimates was hardly affected by the study population size or the number of experts. Prevalence had a large effect on bias, scenarios with a prevalence of 0.5 estimated sensitivity between 63.3% and 76.7% and specificity between 56.1% and 68.7%, whereas scenarios with a prevalence of 0.2 estimated sensitivity between 48.5% and 73.3% and specificity between 65.5% and 68.7%. Improved reference tests also reduced bias. Scenarios with four component tests of 80% sensitivity and specificity estimated index test sensitivity between 60.1% and 77.4% and specificity between 62.9% and 69.1%, whereas scenarios with four component tests of 70% sensitivity and specificity estimated index test sensitivity between 48.5% and 73.4% and specificity between 56.1% and 67.0%.

CONCLUSIONS

Bias in accuracy estimates when using an expert panel will increase if the component reference tests are less accurate. Prevalence, the true value of the index test accuracy, and random or systematic differences between experts can also impact the amount of bias, but the amount and even direction will vary between scenarios.

摘要

背景

在诊断试验准确性研究中,当没有金标准可用时,专家小组常被用作参考标准。通常不清楚哪些研究和专家小组特征能产生对诊断试验准确性的最佳估计。我们模拟了大量场景,以评估研究和专家小组特征对指标试验诊断准确性估计的影响。

方法

进行模拟,其中专家小组作为参考标准来估计指标诊断试验的敏感性和特异性。诊断准确性通过使用四个组成参考试验由专家提供的目标疾病存在概率估计值,通过预定义阈值进行组合来确定。在几种场景中改变研究和小组特征:目标疾病患病率、组成参考试验的准确性、专家小组规模、研究人群规模以及专家概率估计值之间的随机或系统差异。使用均方误差对每个场景中的总偏差进行量化。

结果

在估计敏感性为80%、特异性为70%的指标试验时,估计偏差几乎不受研究人群规模或专家数量的影响。患病率对偏差有很大影响,患病率为0.5的场景估计敏感性在63.3%至76.7%之间,特异性在56.1%至68.7%之间,而患病率为0.2的场景估计敏感性在48.5%至73.3%之间,特异性在65.5%至68.7%之间。改进的参考试验也减少了偏差。四个组成试验敏感性和特异性均为80%的场景估计指标试验敏感性在60.1%至77.4%之间,特异性在62.9%至69.1%之间,而四个组成试验敏感性和特异性均为70%的场景估计指标试验敏感性在48.5%至73.4%之间,特异性在56.1%至67.0%之间。

结论

如果组成参考试验准确性较低,使用专家小组时准确性估计中的偏差将会增加。患病率、指标试验准确性的真实值以及专家之间的随机或系统差异也会影响偏差量,但偏差量甚至方向在不同场景之间会有所不同。

相似文献

1
Drivers of bias in diagnostic test accuracy estimates when using expert panels as a reference standard: a simulation study.将专家小组用作参考标准时诊断试验准确性估计中偏倚的驱动因素:一项模拟研究
BMC Med Res Methodol. 2025 Apr 23;25(1):106. doi: 10.1186/s12874-025-02557-7.
2
Forcing dichotomous disease classification from reference standards leads to bias in diagnostic accuracy estimates: A simulation study.强制将疾病分类为二分类会导致诊断准确性估计的偏差:一项模拟研究。
J Clin Epidemiol. 2019 Jul;111:1-10. doi: 10.1016/j.jclinepi.2019.03.002. Epub 2019 Mar 20.
3
Partial verification bias and incorporation bias affected accuracy estimates of diagnostic studies for biomarkers that were part of an existing composite gold standard.部分验证偏倚和纳入偏倚影响了作为现有综合金标准一部分的生物标志物诊断研究的准确性估计。
J Clin Epidemiol. 2016 Oct;78:73-82. doi: 10.1016/j.jclinepi.2016.03.022. Epub 2016 Apr 21.
4
Bias due to composite reference standards in diagnostic accuracy studies.诊断准确性研究中因复合参考标准导致的偏倚。
Stat Med. 2016 Apr 30;35(9):1454-70. doi: 10.1002/sim.6803. Epub 2015 Nov 10.
5
Diagnostic test accuracy of nutritional tools used to identify undernutrition in patients with colorectal cancer: a systematic review.用于识别结直肠癌患者营养不良的营养评估工具的诊断测试准确性:一项系统综述
JBI Database System Rev Implement Rep. 2015 May 15;13(4):141-87. doi: 10.11124/jbisrir-2015-1673.
6
Rapid, point-of-care antigen and molecular-based tests for diagnosis of SARS-CoV-2 infection.用于 SARS-CoV-2 感染诊断的快速、即时抗原和基于分子的检测。
Cochrane Database Syst Rev. 2021 Mar 24;3(3):CD013705. doi: 10.1002/14651858.CD013705.pub2.
7
Estimation of diagnostic test accuracy without full verification: a review of latent class methods.未进行全面验证时诊断试验准确性的估计:潜在类别方法综述
Stat Med. 2014 Oct 30;33(24):4141-69. doi: 10.1002/sim.6218. Epub 2014 Jun 9.
8
Tests for diagnosis of postpartum haemorrhage at vaginal birth.经阴道分娩产后出血的诊断测试。
Cochrane Database Syst Rev. 2025 Jan 17;1(1):CD016134. doi: 10.1002/14651858.CD016134.
9
Thoracic imaging tests for the diagnosis of COVID-19.用于诊断新型冠状病毒肺炎的胸部影像学检查
Cochrane Database Syst Rev. 2020 Nov 26;11:CD013639. doi: 10.1002/14651858.CD013639.pub3.
10
A general framework for comparative Bayesian meta-analysis of diagnostic studies.诊断研究比较贝叶斯荟萃分析的一般框架。
BMC Med Res Methodol. 2015 Aug 28;15:70. doi: 10.1186/s12874-015-0061-7.

本文引用的文献

1
Diagnostic test accuracy of preoperative nutritional screening tools in adults for malnutrition: a systematic review and network meta-analysis.术前营养筛查工具在成人营养不良诊断中的准确性:系统评价和网络荟萃分析。
Int J Surg. 2024 Feb 1;110(2):1090-1098. doi: 10.1097/JS9.0000000000000845.
2
Synthetic magnetic resonance imaging for primary prostate cancer evaluation: Diagnostic potential of a non-contrast-enhanced bi-parametric approach enhanced with relaxometry measurements.用于原发性前列腺癌评估的合成磁共振成像:通过弛豫测量增强的非对比增强双参数方法的诊断潜力
Eur J Radiol Open. 2022 Feb 15;9:100403. doi: 10.1016/j.ejro.2022.100403. eCollection 2022.
3
Validation of an obstetric fistula screening questionnaire: a case-control study with clinical examination.
产科瘘筛查问卷的验证:一项基于临床检查的病例对照研究。
Reprod Health. 2022 Jan 18;19(1):12. doi: 10.1186/s12978-021-01317-2.
4
Identifying functional cognitive disorder: a proposed diagnostic risk model.识别功能性认知障碍:一种拟议的诊断风险模型。
CNS Spectr. 2022 Dec;27(6):754-763. doi: 10.1017/S1092852921000845. Epub 2021 Sep 17.
5
Glaucoma Expert-Level Detection of Angle Closure in Goniophotographs With Convolutional Neural Networks: The Chinese American Eye Study.卷积神经网络在眼前照片中对房角关闭的青光眼专家级检测:中美眼病研究。
Am J Ophthalmol. 2021 Jun;226:100-107. doi: 10.1016/j.ajo.2021.02.004. Epub 2021 Feb 9.
6
Diagnostic test accuracy of self-reported screening instruments in identifying frailty in community-dwelling older people: A systematic review.社区居住老年人中自我报告的筛查工具在识别虚弱方面的诊断测试准确性:系统评价。
Geriatr Gerontol Int. 2020 Jan;20(1):14-24. doi: 10.1111/ggi.13810. Epub 2019 Nov 14.
7
Diagnostic value of transcranial ultrasonography for selecting subjects with large vessel occlusion: a systematic review.经颅超声检查对选择大血管闭塞患者的诊断价值:一项系统评价
Ultrasound J. 2019 Oct 22;11(1):29. doi: 10.1186/s13089-019-0143-6.
8
Expert panel diagnosis demonstrated high reproducibility as reference standard in infectious diseases.专家组诊断作为传染病的参考标准,表现出了较高的可重复性。
J Clin Epidemiol. 2019 Aug;112:20-27. doi: 10.1016/j.jclinepi.2019.03.010. Epub 2019 Mar 28.
9
Forcing dichotomous disease classification from reference standards leads to bias in diagnostic accuracy estimates: A simulation study.强制将疾病分类为二分类会导致诊断准确性估计的偏差:一项模拟研究。
J Clin Epidemiol. 2019 Jul;111:1-10. doi: 10.1016/j.jclinepi.2019.03.002. Epub 2019 Mar 20.
10
Using simulation studies to evaluate statistical methods.运用模拟研究评估统计方法。
Stat Med. 2019 May 20;38(11):2074-2102. doi: 10.1002/sim.8086. Epub 2019 Jan 16.