Suppr超能文献

提高深度学习模型预测入院时头部CT血肿扩大的稳健性。

Improving the Robustness of Deep-Learning Models in Predicting Hematoma Expansion from Admission Head CT.

作者信息

Tran Anh T, Karam Gaby Abou, Zeevi Dorin, Qureshi Adnan I, Malhotra Ajay, Majidi Shahram, Murthy Santosh B, Park Soojin, Kontos Despina, Falcone Guido J, Sheth Kevin N, Payabvash Seyedmehdi

机构信息

From the Department of Radiology (A.T.T., D.Z., D.K., S. Payabvash) and Neurology (S. Park), NewYork-Presbyterian/Columbia University Irving Medical Center, Columbia University, New York, NY; Department of Radiology and Biomedical Imaging (G.A., A.M.) and Neurology (G.J.F., K.N.S.), Yale School of Medicine, New Haven, CT; Zeenat Qureshi Stroke Institute and Department of Neurology (A.I.Q.), University of Missouri, Columbia, MO; Department of Neurosurgery (S.M.), Icahn School of Medicine at Mount Sinai, Mount Sinai Hospital, New York, NY; and Department of Neurology (S.B.M.), Weill Cornell Medical College, Cornell University, New York, NY.

出版信息

AJNR Am J Neuroradiol. 2025 Jan 10. doi: 10.3174/ajnr.A8650.

Abstract

BACKGROUND AND PURPOSE

Robustness against input data perturbations is essential for deploying deep-learning models in clinical practice. Adversarial attacks involve subtle, voxel-level manipulations of scans to increase deep-learning models' prediction errors. Testing deep-learning model performance on examples of adversarial images provides a measure of robustness, and including adversarial images in the training set can improve the model's robustness. In this study, we examined adversarial training and input modifications to improve the robustness of deep-learning models in predicting hematoma expansion (HE) from admission head CTs of patients with acute intracerebral hemorrhage (ICH).

MATERIALS AND METHODS

We used a multicenter cohort of n=890 patients for cross-validation/training, and a cohort of n=684 consecutive ICH patients from two stroke centers for independent validation. Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD) adversarial attacks were applied for training and testing. We developed and tested four different models to predict ≥3mL, ≥6mL, ≥9mL, and ≥12mL HE in independent validation cohort applying Receiver Operating Characteristics (ROC) Area Under the Curve (AUC). We examined varying mixtures of adversarial and non-perturbed (clean) scans for training as well as including additional input from the hyperparameter-free Otsu multi-threshold segmentation for model.

RESULTS

When deep-learning models trained solely on clean scans were tested with PGD and FGSM adversarial images, the average HE prediction AUC dropped from 0.8 to 0.67 and 0.71, respectively. Overall, the best performing strategy to improve model robustness was training with 5-to-3 mix of clean and PGD adversarial scans and addition of Otsu multi-threshold segmentation to model input, increasing the average AUC to 0.77 against both PGD and FGSM adversarial attacks. Adversarial training with FGSM improved robustness against similar type attack but offered limited cross-attack robustness against PGD-type images.

CONCLUSIONS

Adversarial training and inclusion of threshold-based segmentation as an additional input can improve deep-learning model robustness in prediction of HE from admission head CTs in acute ICH.

ABBREVIATIONS

ATACH-2= Antihypertensive Treatment of Acute Cerebral Hemorrhage; AUC= Area Under the Curve; Dice=Dice coefficient; CNN= Convolutional Neural Network; FGSM= Fast Gradient Sign Method; ICH= Intracerebral hemorrhage; HD= Hausdorff distance; HE= Hematoma expansion; PGD= Projected Gradient Descent; ROC= Receiver Operating Characteristics; VS= Volume similarity.

摘要

背景与目的

在临床实践中部署深度学习模型时,对输入数据扰动的鲁棒性至关重要。对抗性攻击涉及对扫描进行细微的体素级操作,以增加深度学习模型的预测误差。在对抗性图像示例上测试深度学习模型性能可提供一种鲁棒性度量,并且在训练集中包含对抗性图像可以提高模型的鲁棒性。在本研究中,我们研究了对抗性训练和输入修改,以提高深度学习模型从急性脑出血(ICH)患者入院时的头部CT预测血肿扩大(HE)的鲁棒性。

材料与方法

我们使用了一个n = 890例患者的多中心队列进行交叉验证/训练,并使用来自两个卒中中心的n = 684例连续ICH患者的队列进行独立验证。应用快速梯度符号法(FGSM)和投影梯度下降(PGD)对抗性攻击进行训练和测试。我们开发并测试了四种不同的模型,以在独立验证队列中预测≥3mL、≥6mL、≥9mL和≥12mL的HE,应用受试者操作特征(ROC)曲线下面积(AUC)。我们研究了对抗性和未扰动(干净)扫描的不同混合比例用于训练,以及将无超参数的大津多阈值分割的额外输入纳入模型。

结果

当仅在干净扫描上训练的深度学习模型用PGD和FGSM对抗性图像进行测试时,平均HE预测AUC分别从0.8降至0.67和0.71。总体而言,提高模型鲁棒性的最佳策略是用干净扫描与PGD对抗性扫描5比3的混合比例进行训练,并将大津多阈值分割添加到模型输入中,在针对PGD和FGSM对抗性攻击时,平均AUC提高到0.77。用FGSM进行对抗性训练提高了对类似类型攻击的鲁棒性,但对PGD类型图像的交叉攻击鲁棒性有限。

结论

对抗性训练以及将基于阈值的分割作为额外输入纳入,可以提高深度学习模型从急性ICH患者入院时的头部CT预测HE的鲁棒性。

缩写

ATACH - 2 = 急性脑出血的降压治疗;AUC = 曲线下面积;Dice = Dice系数;CNN = 卷积神经网络;FGSM = 快速梯度符号法;ICH = 脑出血;HD = 豪斯多夫距离;HE = 血肿扩大;PGD = 投影梯度下降;ROC = 受试者操作特征;VS = 体积相似性

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验