IEEE Trans Med Imaging. 2024 Jan;43(1):335-350. doi: 10.1109/TMI.2023.3302473. Epub 2024 Jan 2.
In the real world, medical datasets often exhibit a long-tailed data distribution (i.e., a few classes occupy the majority of the data, while most classes have only a limited number of samples), which results in a challenging long-tailed learning scenario. Some recently published datasets in ophthalmology AI consist of more than 40 kinds of retinal diseases with complex abnormalities and variable morbidity. Nevertheless, more than 30 conditions are rarely seen in global patient cohorts. From a modeling perspective, most deep learning models trained on these datasets may lack the ability to generalize to rare diseases where only a few available samples are presented for training. In addition, there may be more than one disease for the presence of the retina, resulting in a challenging label co-occurrence scenario, also known as multi-label, which can cause problems when some re-sampling strategies are applied during training. To address the above two major challenges, this paper presents a novel method that enables the deep neural network to learn from a long-tailed fundus database for various retinal disease recognition. Firstly, we exploit the prior knowledge in ophthalmology to improve the feature representation using a hierarchy-aware pre-training. Secondly, we adopt an instance-wise class-balanced sampling strategy to address the label co-occurrence issue under the long-tailed medical dataset scenario. Thirdly, we introduce a novel hybrid knowledge distillation to train a less biased representation and classifier. We conducted extensive experiments on four databases, including two public datasets and two in-house databases with more than one million fundus images. The experimental results demonstrate the superiority of our proposed methods with recognition accuracy outperforming the state-of-the-art competitors, especially for these rare diseases.
在现实世界中,医学数据集通常呈现长尾数据分布(即少数类别占据了大部分数据,而大多数类别只有有限数量的样本),这导致了具有挑战性的长尾学习场景。一些最近发表的眼科人工智能数据集包含 40 多种视网膜疾病,具有复杂的异常和不同的发病率。然而,全球患者群体中很少见到 30 多种疾病。从建模的角度来看,大多数在这些数据集上训练的深度学习模型可能缺乏泛化到罕见疾病的能力,因为这些罕见疾病只有少数可用样本进行训练。此外,可能存在不止一种疾病会影响视网膜,从而导致具有挑战性的标签共现场景,也称为多标签,这在训练过程中应用某些重采样策略时可能会导致问题。为了解决上述两个主要挑战,本文提出了一种新方法,使深度神经网络能够从各种视网膜疾病识别的长尾眼底数据库中学习。首先,我们利用眼科领域的先验知识,通过层次感知预训练来提高特征表示能力。其次,我们采用实例级别的类别平衡采样策略来解决长尾医学数据集场景下的标签共现问题。第三,我们引入了一种新颖的混合知识蒸馏方法,以训练出偏差较小的表示和分类器。我们在四个数据库上进行了广泛的实验,包括两个公共数据集和两个拥有超过一百万张眼底图像的内部数据库。实验结果表明,我们提出的方法具有优越性,识别准确率优于最先进的竞争对手,特别是对于这些罕见疾病。