De Bois Maxime, El Yacoubi Mounîm A, Ammi Mehdi
CNRS-LIMSI and the Université Paris-Saclay, Orsay, France.
Samovar, CNRS, Télécom SudParis, Institut Polytechnique de Paris, Évry, France.
Comput Methods Programs Biomed. 2021 Feb;199:105874. doi: 10.1016/j.cmpb.2020.105874. Epub 2020 Nov 30.
Deep learning has yet to revolutionize general practices in healthcare, despite promising results for some specific tasks. This is partly due to data being in insufficient quantities hurting the training of the models. To address this issue, data from multiple health actors or patients could be combined by capitalizing on their heterogeneity through the use of transfer learning.
To improve the quality of the transfer between multiple sources of data, we propose a multi-source adversarial transfer learning framework that enables the learning of a feature representation that is similar across the sources, and thus more general and more easily transferable. We apply this idea to glucose forecasting for diabetic people using a fully convolutional neural network. The evaluation is done by exploring various transfer scenarios with three datasets characterized by their high inter and intra variability.
While transferring knowledge is beneficial in general, we show that the statistical and clinical accuracies can be further improved by using of the adversarial training methodology, surpassing the current state-of-the-art results. In particular, it shines when using data from different datasets, or when there is too little data in an intra-dataset situation. To understand the behavior of the models, we analyze the learnt feature representations and propose a new metric in this regard. Contrary to a standard transfer, the adversarial transfer does not discriminate the patients and datasets, helping the learning of a more general feature representation.
The adversarial training framework improves the learning of a general feature representation in a multi-source environment, enhancing the knowledge transfer to an unseen target. The proposed method can help improve the efficiency of data shared by different health actors in the training of deep models.
尽管深度学习在一些特定任务中取得了令人瞩目的成果,但尚未彻底改变医疗保健领域的常规做法。部分原因在于数据量不足影响了模型的训练。为解决这一问题,可以利用迁移学习,通过利用多健康参与者或患者数据的异质性来整合这些数据。
为提高多源数据之间迁移的质量,我们提出了一种多源对抗迁移学习框架,该框架能够学习跨源相似的特征表示,从而更具通用性且更易于迁移。我们将此理念应用于使用全卷积神经网络的糖尿病患者血糖预测。通过使用三个具有高组间和组内变异性的数据集探索各种迁移场景来进行评估。
虽然一般而言知识迁移是有益的,但我们表明,使用对抗训练方法可进一步提高统计和临床准确性,超越当前的最先进结果。特别是在使用来自不同数据集的数据时,或者在数据集内数据过少的情况下,效果尤为显著。为了解模型的行为,我们分析了学习到的特征表示,并在此方面提出了一种新的度量标准。与标准迁移不同,对抗迁移不会区分患者和数据集,有助于学习更通用的特征表示。
对抗训练框架改进了多源环境中通用特征表示的学习,增强了向未知目标的知识迁移。所提出的方法有助于提高不同健康参与者在深度模型训练中共享数据的效率。