CHU de Saint-Etienne, Médecine Vasculaire et Thérapeutique, Saint-Etienne, France.
Université Jean Monnet, Laboratoire Hubert Curien, Saint-Etienne, France.
Microvasc Res. 2025 Jan;157:104753. doi: 10.1016/j.mvr.2024.104753. Epub 2024 Oct 9.
To evaluate the performance of machine learning and then deep learning to detect a systemic scleroderma (SSc) landscape from the same set of nailfold capillaroscopy (NC) images from the French prospective multicenter observational study SCLEROCAP.
NC images from the first 100 SCLEROCAP patients were analyzed to assess the performance of machine learning and then deep learning in identifying the SSc landscape, the NC images having previously been independently and consensually labeled by expert clinicians. Images were divided into a training set (70 %) and a validation set (30 %). After features extraction from the NC images, we tested six classifiers (random forests (RF), support vector machine (SVM), logistic regression (LR), light gradient boosting (LGB), extreme gradient boosting (XGB), K-nearest neighbors (KNN)) on the training set with five different combinations of the images. The performance of each classifier was evaluated by the F1 score. In the deep learning section, we tested three pre-trained models from the TIMM library (ResNet-18, DenseNet-121 and VGG-16) on raw NC images after applying image augmentation methods.
With machine learning, performance ranged from 0.60 to 0.73 for each variable, with Hu and Haralick moments being the most discriminating. Performance was highest with the RF, LGB and XGB models (F1 scores: 0.75-0.79). The highest score was obtained by combining all variables and using the LGB model (F1 score: 0.79 ± 0.05, p < 0.01). With deep learning, performance reached a minimum accuracy of 0.87. The best results were obtained with the DenseNet-121 model (accuracy 0.94 ± 0.02, F1 score 0.94 ± 0.02, AUC 0.95 ± 0.03) as compared to ResNet-18 (accuracy 0.87 ± 0.04, F1 score 0.85 ± 0.03, AUC 0.87 ± 0.04) and VGG-16 (accuracy 0.90 ± 0.03, F1 score 0.91 ± 0.02, AUC 0.91 ± 0.04).
By using machine learning and then deep learning on the same set of labeled NC images from the SCLEROCAP study, the highest performances to detect SSc landscape were obtained with deep learning and in particular DenseNet-121. This pre-trained model could therefore be used to automatically interpret NC images in case of suspected SSc. This result nevertheless needs to be confirmed on a larger number of NC images.
评估机器学习和深度学习在识别系统性硬皮病(SSc)特征方面的性能,这些特征均来自法国前瞻性多中心观察性研究 SCLEROCAP 的同一组甲襞毛细血管镜(NC)图像。
对 SCLEROCAP 的前 100 例患者的 NC 图像进行分析,以评估机器学习和深度学习在识别 SSc 特征方面的性能,这些 NC 图像先前已由专家临床医生独立和一致地标记。图像被分为训练集(70%)和验证集(30%)。在对 NC 图像进行特征提取后,我们在训练集上使用了六个分类器(随机森林(RF)、支持向量机(SVM)、逻辑回归(LR)、轻梯度提升机(LGB)、极端梯度提升机(XGB)、K 最近邻(KNN)),并进行了五次不同的图像组合。通过 F1 分数评估每个分类器的性能。在深度学习部分,我们在应用图像增强方法后,使用 TIMM 库中的三个预训练模型(ResNet-18、DenseNet-121 和 VGG-16)对原始 NC 图像进行了测试。
使用机器学习,每个变量的性能范围为 0.60 到 0.73,其中 Hu 和 Haralick 矩最具鉴别力。RF、LGB 和 XGB 模型的性能最高(F1 分数:0.75-0.79)。通过结合所有变量并使用 LGB 模型,获得了最高分数(F1 分数:0.79±0.05,p<0.01)。使用深度学习,性能达到了最低精度 0.87。DenseNet-121 模型的结果最佳(准确率 0.94±0.02,F1 分数 0.94±0.02,AUC 0.95±0.03),优于 ResNet-18(准确率 0.87±0.04,F1 分数 0.85±0.03,AUC 0.87±0.04)和 VGG-16(准确率 0.90±0.03,F1 分数 0.91±0.02,AUC 0.91±0.04)。
通过在 SCLEROCAP 研究的同一组经标记的 NC 图像上使用机器学习和深度学习,使用深度学习和特别是 DenseNet-121 获得了识别 SSc 特征的最高性能。因此,该预训练模型可用于自动解释疑似 SSc 的 NC 图像。然而,这一结果需要在更多的 NC 图像上进行验证。