基于多读者诊断数据的ROC曲线下面积的检验性能。

Performance of tests based on the area under the ROC curve for multireader diagnostic data.

作者信息

Hwang Yi-Ting, Hsu Ya-Ru, Su Nan-Cheng

机构信息

Department of Statistics, National Taipei University, Sancia, New Taipei City, Taiwan.

出版信息

J Appl Stat. 2024 Jul 14;52(3):555-577. doi: 10.1080/02664763.2024.2374931. eCollection 2025.

DOI:10.1080/02664763.2024.2374931

PMID:39950016

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11816630/

Abstract

One of the main objectives of disease prevention is to lower the healthcare costs and improve the quality of life. To achieve this, reliable diagnostic tools are needed. The diagnostic performance of a tool can be measured by the ROC curve and the AUC. However, some diagnostic tools such as MRI images are not objective, but depend on the interpretation of experts. Therefore, the accuracy of these tools may vary depending on who is interpreting them. To account for possible correlations when multiple readers collect data, Dorfman, Berbaum and Metz (1992) proposed using AUC pseudovalues from the jackknife sampling method and applying them to the mixed model to analyze the diagnostic reagent's accuracy. However, pseudovalues may go beyond the AUC range. Also, the random effect estimate may be negative due to a small number of readers. This paper develops tests based on AUC estimates and gives their asymptotic distribution. Moreover, a two-stage test is suggested to correct for negative random effect estimates. Four tests are created in total and their performance is evaluated by Monte Carlo simulations. The distributional assumption's robustness of these tests is checked, and their applicability is demonstrated by two real data sets.

摘要

疾病预防的主要目标之一是降低医疗成本并提高生活质量。为实现这一目标，需要可靠的诊断工具。工具的诊断性能可以通过ROC曲线和AUC来衡量。然而，一些诊断工具，如MRI图像，并不客观，而是依赖于专家的解读。因此，这些工具的准确性可能因解读人员的不同而有所差异。为了在多个读者收集数据时考虑可能的相关性，多尔夫曼、伯鲍姆和梅茨（1992年）提出使用刀切抽样法的AUC伪值，并将其应用于混合模型以分析诊断试剂的准确性。然而，伪值可能超出AUC范围。此外，由于读者数量较少，随机效应估计可能为负。本文基于AUC估计值开发了检验方法，并给出了它们的渐近分布。此外，还建议采用两阶段检验来校正负随机效应估计值。总共创建了四个检验，并通过蒙特卡罗模拟评估了它们的性能。检查了这些检验的分布假设的稳健性，并通过两个真实数据集证明了它们的适用性。

相似文献

Performance of tests based on the area under the ROC curve for multireader diagnostic data.

J Appl Stat. 2024 Jul 14;52(3):555-577. doi: 10.1080/02664763.2024.2374931. eCollection 2025.

A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.

Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.

Sexual Harassment and Prevention Training

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].

Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.

Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.

Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.

Automated devices for identifying peripheral arterial disease in people with leg ulceration: an evidence synthesis and cost-effectiveness analysis.

Health Technol Assess. 2024 Aug;28(37):1-158. doi: 10.3310/TWCG3912.

Intraoperative frozen section analysis for the diagnosis of early stage ovarian cancer in suspicious pelvic masses.

Cochrane Database Syst Rev. 2016 Mar 1;3(3):CD010360. doi: 10.1002/14651858.CD010360.pub2.

Education support services for improving school engagement and academic performance of children and adolescents with a chronic health condition.

Cochrane Database Syst Rev. 2023 Feb 8;2(2):CD011538. doi: 10.1002/14651858.CD011538.pub2.

本文引用的文献

Sample size determination for comparing accuracies between two diagnostic tests under a paired design.

Biom J. 2022 Apr;64(4):771-804. doi: 10.1002/bimj.202000036. Epub 2022 Jan 23.

Multireader sample size program for diagnostic studies: demonstration and methodology.

J Med Imaging (Bellingham). 2018 Oct;5(4):045503. doi: 10.1117/1.JMI.5.4.045503. Epub 2018 Nov 30.

Novel tests for evaluating two ROC curves under paired samples.

J Biopharm Stat. 2018;28(3):501-517. doi: 10.1080/10543406.2017.1333997. Epub 2017 Sep 5.

Panels of tumor-derived RNA markers in peripheral blood of patients with non-small cell lung cancer: their dependence on age, gender and clinical stages.

Oncotarget. 2016 Aug 2;7(31):50582-50595. doi: 10.18632/oncotarget.10558.

Use of picture archiving and communication system for imaging of radiological films in cardiac surgical intensive care unit.

J Anaesthesiol Clin Pharmacol. 2014 Jul;30(3):447-8. doi: 10.4103/0970-9185.137306.

Comparison of Paired ROC Curves through a Two-Stage Test.

J Biopharm Stat. 2015;25(5):881-902. doi: 10.1080/10543406.2014.920874. Epub 2014 Jun 6.

Comparing the areas under two correlated ROC curves: parametric and non-parametric approaches.

Biom J. 2006 Aug;48(5):745-57. doi: 10.1002/bimj.200610223.

A comparison of denominator degrees of freedom methods for multiple observer ROC analysis.

Stat Med. 2007 Feb 10;26(3):596-619. doi: 10.1002/sim.2532.

A permutation test sensitive to differences in areas for comparing ROC curves from a paired design.

Stat Med. 2005 Sep 30;24(18):2873-93. doi: 10.1002/sim.2149.

A comparison of the Dorfman-Berbaum-Metz and Obuchowski-Rockette methods for receiver operating characteristic (ROC) data.

Stat Med. 2005 May 30;24(10):1579-607. doi: 10.1002/sim.2024.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr
超能文献

基于多读者诊断数据的ROC曲线下面积的检验性能。

Performance of tests based on the area under the ROC curve for multireader diagnostic data.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

Suppr超能文献

基于多读者诊断数据的ROC曲线下面积的检验性能。

Performance of tests based on the area under the ROC curve for multireader diagnostic data.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

Suppr
超能文献