Li Chenxin, Lin Xin, Mao Yijin, Lin Wei, Qi Qi, Ding Xinghao, Huang Yue, Liang Dong, Yu Yizhou
School of Informatics, Xiamen University, Xiamen, 361005, China.
School of Informatics, Xiamen University, Xiamen, 361005, China.
Comput Biol Med. 2022 Feb;141:105144. doi: 10.1016/j.compbiomed.2021.105144. Epub 2021 Dec 24.
Medical imaging datasets usually exhibit domain shift due to the variations of scanner vendors, imaging protocols, etc. This raises the concern about the generalization capacity of machine learning models. Domain generalization (DG), which aims to learn a model from multiple source domains such that it can be directly generalized to unseen test domains, seems particularly promising to medical imaging community. To address DG, recent model-agnostic meta-learning (MAML) has been introduced, which transfers the knowledge from previous training tasks to facilitate the learning of novel testing tasks. However, in clinical practice, there are usually only a few annotated source domains available, which decreases the capacity of training task generation and thus increases the risk of overfitting to training tasks in the paradigm. In this paper, we propose a novel DG scheme of episodic training with task augmentation on medical imaging classification. Based on meta-learning, we develop the paradigm of episodic training to construct the knowledge transfer from episodic training-task simulation to the real testing task of DG. Motivated by the limited number of source domains in real-world medical deployment, we consider the unique task-level overfitting and we propose task augmentation to enhance the variety during training task generation to alleviate it. With the established learning framework, we further exploit a novel meta-objective to regularize the deep embedding of training domains. To validate the effectiveness of the proposed method, we perform experiments on histopathological images and abdominal CT images.
医学成像数据集通常会因扫描仪供应商、成像协议等的差异而呈现出领域偏移。这引发了对机器学习模型泛化能力的担忧。领域泛化(DG)旨在从多个源域学习模型,使其能够直接泛化到未见的测试域,这对医学成像领域似乎特别有前景。为了解决领域泛化问题,最近引入了模型无关元学习(MAML),它从前序训练任务中转移知识,以促进新测试任务的学习。然而,在临床实践中,通常只有少数带注释的源域可用,这降低了训练任务生成的能力,从而增加了在该范式中过度拟合训练任务的风险。在本文中,我们提出了一种用于医学成像分类的带有任务增强的情节训练的新型领域泛化方案。基于元学习,我们开发了情节训练范式,以构建从情节训练任务模拟到领域泛化的实际测试任务的知识转移。受现实世界医学部署中源域数量有限的启发,我们考虑了独特的任务级过拟合问题,并提出任务增强来增加训练任务生成期间的多样性以缓解该问题。借助已建立的学习框架,我们进一步利用一种新型元目标来规范训练域的深度嵌入。为了验证所提方法的有效性,我们在组织病理学图像和腹部CT图像上进行了实验。