Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:2181-2184. doi: 10.1109/EMBC48229.2022.9871617.
Convolutional Neural Networks (CNNs) are an emerging research area for detection of Diabetic Retinopathy (DR) development in fundus images with highly reliable results. However, its accuracy depends on the availability of big datasets to train such a deep network. Due to the privacy concerns, the strict rules on medical data limit accessibility of images in publicly available datasets. In this paper, we propose a collaborative learning approach to train CNN models with multiple datasets while preserving the privacy of datasets in a distributed learning environment without sharing them. First, CNN networks are trained with private datasets, and tested with the same publicly available images. Based on their initial accuracies, the CNN model with the lowest performance among datasets is forwarded to second lowest performed dataset to retrain it using the transfer learning approach. Then, the retrained network is forwarded to next dataset. This procedure is repeated for each dataset from the lowest performed dataset to the highest. With this ascending chain order fashion, the network is retrained again and again using different datasets and its performance is improved over the time. Based on our experimental results on five different retina image datasets, DR detection accuracy is increased to 93.5% compared with the accuracies of merged datasets (84%) and individual datasets (73%, 78%, 83%, 85%).
卷积神经网络(CNNs)是眼底图像中糖尿病视网膜病变(DR)发展检测的一个新兴研究领域,其结果具有高度可靠性。然而,其准确性取决于是否有足够的大型数据集来训练这样的深度网络。由于隐私问题,对医疗数据的严格规定限制了公共数据集的图像获取。在本文中,我们提出了一种协作学习方法,在分布式学习环境中在不共享的情况下使用多个数据集来训练 CNN 模型,同时保护数据集的隐私。首先,使用私有数据集训练 CNN 网络,并使用相同的公开可用图像进行测试。根据初始准确性,在数据集之间表现最低的 CNN 模型将被转发到第二个表现最低的数据集,使用迁移学习方法对其进行重新训练。然后,将重新训练的网络转发到下一个数据集。对于每个数据集,从表现最低的数据集到表现最高的数据集,重复此过程。按照这种递增的链式顺序,网络会使用不同的数据集进行反复重新训练,其性能会随着时间的推移而提高。基于我们在五个不同视网膜图像数据集上的实验结果,与合并数据集(84%)和单个数据集(73%、78%、83%、85%)相比,DR 检测准确率提高到 93.5%。