Wongveerasin Pootipong, Tongdee Trongtum, Saiviroonporn Pairash
Department of Radiology, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand.
Eur J Radiol Open. 2024 Jul 29;13:100593. doi: 10.1016/j.ejro.2024.100593. eCollection 2024 Dec.
Artificial intelligence (AI) has been proven useful for the assessment of tubes and lines on chest radiographs of general patients. However, validation on intensive care unit (ICU) patients remains imperative.
This retrospective case-control study evaluated the performance of deep learning (DL) models for tubes and lines classification on both an external public dataset and a local dataset comprising 303 films randomly sampled from the ICU database. The endotracheal tubes (ETTs), central venous catheters (CVCs), and nasogastric tubes (NGTs) were classified into "Normal," "Abnormal," or "Borderline" positions by DL models with and without rule-based modification. Their performance was evaluated using an experienced radiologist as the standard reference.
The algorithm showed decreased performance on the local ICU dataset, compared to that of the external dataset, decreasing from the Area Under the Curve of Receiver (AUC) of 0.967 (95 % CI 0.965-0.973) to the AUC of 0.70 (95 % CI 0.68-0.77). Significant improvement in the ETT classification task was observed after modifications were made to the model to allow the use of the spatial relationship between line tips and reference anatomy with the improvement of the AUC, increasing from 0.71 (95 % CI 0.70 - 0.75) to 0.86 (95 % CI 0.83 - 0.94).
The externally trained model exhibited limited generalizability on the local ICU dataset. Therefore, evaluating the performance of externally trained AI before integrating it into critical care routine is crucial. Rule-based algorithm may be used in combination with DL to improve results.
人工智能(AI)已被证明可用于评估普通患者胸部X光片上的管道。然而,对重症监护病房(ICU)患者的验证仍然至关重要。
这项回顾性病例对照研究评估了深度学习(DL)模型在外部公共数据集和从ICU数据库中随机抽取的303张胶片组成的本地数据集上对管道分类的性能。通过有无基于规则修改的DL模型,将气管插管(ETT)、中心静脉导管(CVC)和鼻胃管(NGT)分类为“正常”、“异常”或“临界”位置。以经验丰富的放射科医生作为标准参考来评估其性能。
与外部数据集相比,该算法在本地ICU数据集上的性能有所下降,受试者操作特征曲线下面积(AUC)从0.967(95%CI 0.965 - 0.973)降至0.70(95%CI 0.68 - 0.77)。对模型进行修改以允许使用管线尖端与参考解剖结构之间的空间关系后,ETT分类任务有显著改善,AUC从0.71(95%CI 0.70 - 0.75)提高到0.86(95%CI 0.83 - 0.94)。
外部训练的模型在本地ICU数据集上的通用性有限。因此,在将外部训练的AI整合到重症监护常规流程之前评估其性能至关重要。基于规则的算法可与DL结合使用以改善结果。