Computer-aided diagnosis of renal obstruction: utility of log-linear modeling versus standard ROC and kappa analysis.

Author information

Manatunga Amita K, Binongo José Nilo G, Taylor Andrew T

Affiliations

Department of Biostatistics and Bioinformatics, Emory University School of Public Health, 1364 Clifton Road NE, Atlanta, GA 30322, USA.

Publication information

EJNMMI Res. 2011 Jun 20;1(5):1-8. doi: 10.1186/2191-219X-1-5.

Abstract

BACKGROUND

The accuracy of computer-aided diagnosis (CAD) software is best evaluated by comparison to a gold standard that represents the true status of disease. In many settings, however, the true status of disease cannot be known, and accuracy is evaluated against the interpretations of an expert panel. Common statistical approaches to evaluating accuracy include receiver operating characteristic (ROC) and kappa analysis, but both methods have significant limitations and cannot answer the question of equivalence: is the CAD performance equivalent to that of an expert? The goal of this study is to show the strength of log-linear analysis over standard ROC and kappa statistics in evaluating the accuracy of computer-aided diagnosis of renal obstruction compared to the diagnosis provided by expert readers.

METHODS

Log-linear modeling was used to analyze a previously published database that had been evaluated with ROC and kappa statistics. That database compared diuresis renography scan interpretations (non-obstructed, equivocal, or obstructed) generated by a renal expert system (RENEX) in 185 kidneys (95 patients) with the independent and consensus scan interpretations of three experts. The experts were blinded to clinical information and prospectively and independently graded each kidney as obstructed, equivocal, or non-obstructed.

RESULTS

Log-linear modeling showed that RENEX and the expert consensus had beyond-chance agreement in both non-obstructed and obstructed readings (both p < 0.0001). Moreover, pairwise agreement between experts and pairwise agreement between each expert and RENEX were not significantly different (p = 0.41, 0.95, 0.81 for the non-obstructed, equivocal, and obstructed categories, respectively). Similarly, the three-way agreement of the three experts and the three-way agreement of two experts and RENEX were not significantly different for the non-obstructed (p = 0.79) and obstructed (p = 0.49) categories.

CONCLUSION

Log-linear modeling showed that RENEX was equivalent to any expert in rating kidneys, particularly in the obstructed and non-obstructed categories. This conclusion, which could not be derived from the original ROC and kappa analysis, illustrates the importance of log-linear modeling in the absence of a gold standard. The log-linear analysis also provides additional evidence that RENEX has the potential to assist in the interpretation of diuresis renography studies.
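
ILLUSTRATIVE SKETCH

To make the contrast between kappa and log-linear modeling concrete, the following is a minimal sketch and not the authors' analysis. It assumes a 3 x 3 table of counts cross-classifying two raters (non-obstructed, equivocal, obstructed) and fits one common form of log-linear agreement model, the quasi-independence model with a separate agreement parameter delta_i for each category; the model actually fitted in the study may differ. The counts in `hypothetical_table` are invented for illustration, and the function names `cohen_kappa` and `fit_agreement_model` are hypothetical helpers, not part of any library.

```python
import math


def cohen_kappa(table):
    """Cohen's kappa for a square table of counts (rows: rater A, columns: rater B)."""
    k = len(table)
    n = sum(sum(row) for row in table)
    p_obs = sum(table[i][i] for i in range(k)) / n
    row_tot = [sum(table[i]) for i in range(k)]
    col_tot = [sum(table[i][j] for i in range(k)) for j in range(k)]
    p_exp = sum(row_tot[i] * col_tot[i] for i in range(k)) / n ** 2
    return (p_obs - p_exp) / (1 - p_exp)


def fit_agreement_model(table, tol=1e-10, max_iter=10_000):
    """Fit the log-linear model m_ij = a_i * b_j * exp(delta_i * [i == j]).

    Each diagonal cell has its own parameter, so it is fitted exactly and the
    row/column effects are estimated from the off-diagonal cells alone, here by
    iterative proportional fitting. Returns the list of delta_i; delta_i > 0
    means agreement on category i exceeds what the raters' marginal tendencies
    would produce by chance.

    Assumes positive diagonal counts and enough nonzero off-diagonal counts
    that no scaling sum below is zero.
    """
    k = len(table)
    row_off = [sum(table[i][j] for j in range(k) if j != i) for i in range(k)]
    col_off = [sum(table[i][j] for i in range(k) if i != j) for j in range(k)]
    a = [1.0] * k
    b = [1.0] * k
    for _ in range(max_iter):
        # Rescale row effects to match off-diagonal row totals, then columns.
        new_a = [row_off[i] / sum(b[j] for j in range(k) if j != i)
                 for i in range(k)]
        new_b = [col_off[j] / sum(new_a[i] for i in range(k) if i != j)
                 for j in range(k)]
        change = max(abs(x - y) for x, y in zip(new_a + new_b, a + b))
        a, b = new_a, new_b
        if change < tol:
            break
    # a_i * b_i is the diagonal count expected from the chance pattern alone.
    return [math.log(table[i][i] / (a[i] * b[i])) for i in range(k)]


# Hypothetical counts (NOT the study data): rows = software, columns = expert.
hypothetical_table = [
    [70, 8, 2],   # non-obstructed
    [6, 10, 5],   # equivocal
    [3, 7, 60],   # obstructed
]

labels = ["non-obstructed", "equivocal", "obstructed"]
print("kappa:", round(cohen_kappa(hypothetical_table), 3))
for label, delta in zip(labels, fit_agreement_model(hypothetical_table)):
    print(f"delta[{label}]:", round(delta, 3))
```

Kappa collapses agreement into a single number, whereas the agreement parameters are category-specific. In a full analysis, comparing such parameters between expert-expert and expert-software pairs would call for a Poisson regression fit with standard errors, which this sketch omits.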




