Wu Qian, Guo Hui, Li Ruihan, Han Jinhuan
Department of Medical Imaging Center, The Fourth Clinical Medical College of Xinjiang Medical University, Urumqi, Xinjian 830000, China.
Department of Medical Imaging Center, The Fourth Clinical Medical College of Xinjiang Medical University, Urumqi, Xinjian 830000, China.
Int J Med Inform. 2025 Apr;196:105812. doi: 10.1016/j.ijmedinf.2025.105812. Epub 2025 Jan 30.
With advancements in medical technology and science, chronic obstructive pulmonary disease (COPD), one of the world's three major chronic diseases, has seen numerous remarkable outcomes when combined with artificial intelligence, particularly in disease diagnosis. However, the diagnostic performance of these AI models still lacks comprehensive evidence. Therefore, this study quantitatively analyzed the diagnostic performance of AI models in CT images of COPD patients, aiming to promote the development of related research in the future.
PubMed, Cochrane Library, Web of Science, and Embase were retrieved up to September 1, 2024. The QUADAS-2 evaluation tool was used to assess the quality of the included studies. Meta-analysis of the included researches was performed using Stata18, RevMan 5.4, and Meta-Disc 1.4 software to merge sensitivity, specificity and plot a summary receiver operating characteristic curve (SROC). Heterogeneity was assessed using the Q statistic, and sources of inter-study heterogeneity were explored through meta-regression analysis.
Twenty-two of 3280 identified studies were eligible. Meta-analysis was performed on 15 of these studies, encompassing a total of 22,817 patients for which statistical metrics were reported or could be calculated. Seven studies were based on deep learning (DL) model, three on machine learning (ML) model, and five on DL model with multiple-instance learning (MIL) mechanisms. One study evaluated both DL and ML models. The meta-analysis results showed that the pooled sensitivity of all DL and ML models was 86 % (95 %CI 78-91 %), specificity was 87 % (95 %CI 83-91 %), and area under the curve was 93 % (95 %CI 90-95 %). Subgroup analyses revealed no significant difference in diagnostic sensitivity and specificity between DL and ML models (sensitivity 82 % (95 %CI 76-87 %), 93 % (95 %CI 85-97 %); specificity 87 % (95 %CI 79-91 %), 84 % (95 %CI 79-88 %), and the DL model with MIL (sensitivity 87 % (95 %CI 61-96 %); specificity 89 % (95 %CI 78-95 %) improved the performance of DL model, but this improvement was not statistically significant (p > 0.05).
Both DL and ML models for diagnosing COPD using CT images exhibited high accuracy. There was no significant difference in diagnostic efficacy between the two types of AI models, and the addition of the MIL mechanism may enhance the performance of the DL model.
随着医学技术和科学的进步,慢性阻塞性肺疾病(COPD)作为世界三大主要慢性病之一,与人工智能结合已取得了众多显著成果,尤其是在疾病诊断方面。然而,这些人工智能模型的诊断性能仍缺乏全面的证据。因此,本研究对COPD患者CT图像中人工智能模型的诊断性能进行了定量分析,旨在推动未来相关研究的发展。
检索截至2024年9月1日的PubMed、Cochrane图书馆、科学引文索引和Embase数据库。使用QUADAS - 2评估工具评估纳入研究的质量。运用Stata18、RevMan 5.4和Meta - Disc 1.4软件对纳入研究进行荟萃分析,合并敏感性、特异性并绘制汇总受试者工作特征曲线(SROC)。使用Q统计量评估异质性,并通过荟萃回归分析探索研究间异质性的来源。
在3280项已识别的研究中,有22项符合条件。对其中15项研究进行了荟萃分析,这些研究共纳入22817例患者,报告了或可计算其统计指标。7项研究基于深度学习(DL)模型,3项基于机器学习(ML)模型,5项基于具有多实例学习(MIL)机制的DL模型。1项研究同时评估了DL和ML模型。荟萃分析结果显示,所有DL和ML模型的合并敏感性为86%(95%CI 78 - 91%),特异性为87%(95%CI 83 - 91%),曲线下面积为93%(95%CI 90 - 95%)。亚组分析显示,DL和ML模型在诊断敏感性和特异性方面无显著差异(敏感性分别为82%(95%CI 76 - 87%)、93%(95%CI 85 - 97%);特异性分别为87%(95%CI 79 - 91%)、84%(95%CI 79 - 88%)),且具有MIL机制的DL模型(敏感性87%(95%CI 61 - 96%);特异性89%(95%CI 78 - 95%))虽提高了DL模型的性能,但这种提高无统计学意义(p > 0.05)。
使用CT图像诊断COPD的DL和ML模型均表现出较高的准确性。两种类型的人工智能模型在诊断效能上无显著差异,且MIL机制的加入可能会提高DL模型的性能。