Gürsoy Çoruh Ayşegül, Yenigün Bülent, Uzun Çağlar, Kahya Yusuf, Büyükceran Emre Utkan, Elhan Atilla, Orhan Kaan, Kayı Cangır Ayten
Department of Radiology, School of Medicine, Ankara University, Ankara, Turkey.
Department of Thoracic Surgery, School of Medicine, Ankara University, Ankara, Turkey.
Br J Radiol. 2021 Jul 1;94(1123):20210222. doi: 10.1259/bjr.20210222. Epub 2021 Jun 11.
To compare the diagnostic performance of a newly developed artificial intelligence (AI) algorithm derived from the fusion of convolution neural networks (CNN) versus human observers in the estimation of malignancy risk in pulmonary nodules.
The study population consists of 158 nodules from 158 patients. All nodules (81 benign and 77 malignant) were determined to be malignant or benign by a radiologist based on pathologic assessment and/or follow-up imaging. Two radiologists and an AI platform analyzed the nodules based on the Lung-RADS classification. The two observers also noted the size, location, and morphologic features of the nodules. An intraclass correlation coefficient was calculated for both observers and the AI; ROC curve analysis was performed to determine diagnostic performances.
Nodule size, presence of spiculation, and presence of fat were significantly different between the malignant and benign nodules ( < 0.001, for all three). Eighteen (11.3%) nodules were not detected and analyzed by the AI. Observer 1, observer 2, and the AI had an AUC of 0.917 ± 0.023, 0.870 ± 0.033, and 0.790 ± 0.037 in the ROC analysis of malignity probability, respectively. The observers were in almost perfect agreement for localization, nodule size, and lung-RADS classification [κ (95% CI)=0.984 (0.961-1.000), 0.978 (0.970-0.984), and 0.924 (0.878-0.970), respectively].
The performance of the fusion AI algorithm in estimating the risk of malignancy was slightly lower than the performance of the observers. Fusion AI algorithms might be applied in an assisting role, especially for inexperienced radiologists.
In this study, we proposed a fusion model using four state-of-art object detectors for lung nodule detection and discrimination. The use of fusion of deep learning neural networks might be used in a supportive role for radiologists when interpreting lung nodule discrimination.
比较一种新开发的、由卷积神经网络(CNN)融合衍生而来的人工智能(AI)算法与人类观察者在评估肺结节恶性风险方面的诊断性能。
研究人群包括来自158名患者的158个结节。所有结节(81个良性和77个恶性)均由放射科医生根据病理评估和/或随访影像确定为恶性或良性。两名放射科医生和一个AI平台根据Lung-RADS分类对结节进行分析。两名观察者还记录了结节的大小、位置和形态特征。计算两名观察者和AI的组内相关系数;进行ROC曲线分析以确定诊断性能。
恶性和良性结节在结节大小、毛刺征的存在和脂肪的存在方面存在显著差异(所有三项均P<0.001)。AI未检测和分析18个(11.3%)结节。在恶性概率的ROC分析中,观察者1、观察者2和AI的AUC分别为0.917±0.023、0.870±0.033和0.790±0.037。观察者在定位、结节大小和Lung-RADS分类方面几乎完全一致[κ(95%CI)分别为0.984(0.961-1.000)、0.978(0.970-0.984)和0.924(0.878-0.970)]。
融合AI算法在估计恶性风险方面的性能略低于观察者。融合AI算法可能起到辅助作用,尤其是对于经验不足的放射科医生。
在本研究中,我们提出了一种使用四种先进目标检测器的融合模型用于肺结节检测和鉴别。深度学习神经网络的融合在放射科医生解读肺结节鉴别时可能起到支持作用。