Espis Andrea, Marzi Chiara, Diciotti Stefano
Department of Electrical, Electronic, and Information Engineering "Guglielmo Marconi" - DEI, University of Bologna, Via dell'Università 50, 47521, Cesena, Italy.
Department of Statistics, Computer Science and Applications "Giuseppe Parenti", University of Florence, 50134, Florence, Italy.
Sci Rep. 2025 Sep 2;15(1):32345. doi: 10.1038/s41598-025-99000-0.
Self-supervised learning (SSL) in computer vision has shown its potential to reduce reliance on labeled data. However, most studies focused on balanced, large, broad-domain datasets like ImageNet, whereas, in real-world medical applications, dataset size is typically limited. This study compares the performance of SSL versus supervised learning (SL) on small, imbalanced medical imaging datasets. We experimented with four binary classification tasks: age prediction and diagnosis of Alzheimer's disease from brain magnetic resonance imaging scans, pneumonia from chest radiograms, and retinal diseases associated with choroidal neovascularization from optical coherence tomography with a mean size of training sets of 843 images, 771 images, 1,214 images, and 33,484 images, respectively. We tested various combinations of label availability and class frequency distribution, repeating the training with different random seeds to assess result uncertainty. In most experiments involving small training sets, SL outperformed the selected SSL paradigms, even when a limited portion of labeled data was available. Our findings highlight the importance of carefully selecting learning paradigms based on specific application requirements, which are influenced by factors such as training set size, label availability, and class frequency distribution.
计算机视觉中的自监督学习(SSL)已显示出其减少对标记数据依赖的潜力。然而,大多数研究集中在像ImageNet这样平衡、大型、广泛领域的数据集上,而在实际的医学应用中,数据集大小通常是有限的。本研究比较了SSL与监督学习(SL)在小型、不平衡医学成像数据集上的性能。我们对四个二分类任务进行了实验:从脑磁共振成像扫描预测年龄和诊断阿尔茨海默病、从胸部X光片诊断肺炎,以及从光学相干断层扫描诊断与脉络膜新生血管相关的视网膜疾病,训练集的平均大小分别为843张图像、771张图像、1214张图像和33484张图像。我们测试了标签可用性和类别频率分布的各种组合,使用不同的随机种子重复训练以评估结果的不确定性。在大多数涉及小型训练集的实验中,即使只有有限部分的标记数据可用,SL的表现也优于所选的SSL范式。我们的研究结果强调了根据特定应用需求仔细选择学习范式的重要性,这些需求会受到训练集大小、标签可用性和类别频率分布等因素的影响。