Guo Zuwei, Islam Nahid Ul, Gotway Michael B, Liang Jianming
Arizona State University, Tempe, AZ 85281, USA.
Mayo Clinic, Scottsdale, AZ 85259, USA.
Domain Adapt Represent Transf (2022). 2022 Sep;13542:66-76. doi: 10.1007/978-3-031-16852-9_7. Epub 2022 Sep 15.
Uniting three self-supervised learning (SSL) ingredients (discriminative, restorative, and adversarial learning) enables collaborative representation learning and yields three transferable components: a discriminative encoder, a restorative decoder, and an adversary encoder. To leverage this advantage, we have redesigned five prominent SSL methods, including Rotation, Jigsaw, Rubik's Cube, Deep Clustering, and TransVW, and formulated each in a United framework for 3D medical imaging. However, such a United framework increases model complexity and pretraining difficulty. To overcome this difficulty, we develop a stepwise incremental pretraining strategy: a discriminative encoder is first trained via discriminative learning; the pretrained discriminative encoder is then attached to a restorative decoder, forming a skip-connected encoder-decoder, for further joint discriminative and restorative learning; finally, the pretrained encoder-decoder is associated with an adversary encoder for full discriminative, restorative, and adversarial learning. Our extensive experiments demonstrate that stepwise incremental pretraining stabilizes the training of United models, yielding significant performance gains and annotation cost reductions via transfer learning on five target tasks, encompassing both classification and segmentation, across diseases, organs, datasets, and modalities. This performance is attributed to the synergy of the three SSL ingredients in our United framework, unleashed via stepwise incremental pretraining. All code and pretrained models are available at GitHub.com/JLiangLab/StepwisePretraining.
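To make the three-step strategy concrete, below is a minimal PyTorch-style sketch of stepwise incremental pretraining. All names here (Encoder3D, Decoder3D, Discriminator3D, the data loaders, and the loss weights lam and mu) are hypothetical placeholders chosen for illustration, not the authors' released implementation (see the GitHub link above for that). Step 1 trains the discriminative encoder alone on a pretext classification task (e.g., rotation prediction); step 2 attaches a skip-connected restorative decoder for joint training; step 3 adds an adversary encoder (a discriminator) for full three-ingredient training.

import torch
import torch.nn as nn
import torch.nn.functional as F

class Encoder3D(nn.Module):
    # Discriminative encoder; also exposes a skip feature map for the decoder.
    def __init__(self, in_ch=1, n_classes=4):
        super().__init__()
        self.conv1 = nn.Sequential(nn.Conv3d(in_ch, 32, 3, 2, 1), nn.ReLU())
        self.conv2 = nn.Sequential(nn.Conv3d(32, 64, 3, 2, 1), nn.ReLU())
        self.head = nn.Linear(64, n_classes)  # e.g., 4 rotation classes

    def forward(self, x):
        s = self.conv1(x)                     # skip feature
        z = self.conv2(s)
        return self.head(z.mean(dim=(2, 3, 4))), z, s

class Decoder3D(nn.Module):
    # Restorative decoder with one skip connection from the encoder.
    # Assumes patch sides divisible by 4 so shapes line up.
    def __init__(self, out_ch=1):
        super().__init__()
        self.up1 = nn.ConvTranspose3d(64, 32, 2, 2)
        self.up2 = nn.ConvTranspose3d(64, out_ch, 2, 2)  # 64 = 32 upsampled + 32 skip

    def forward(self, z, s):
        return self.up2(torch.cat([F.relu(self.up1(z)), s], dim=1))

class Discriminator3D(nn.Module):
    # Adversary encoder scoring real originals vs. restored patches.
    def __init__(self, in_ch=1):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv3d(in_ch, 32, 3, 2, 1), nn.LeakyReLU(0.2),
            nn.Conv3d(32, 1, 3, 2, 1))

    def forward(self, x):
        return self.net(x).mean(dim=(1, 2, 3, 4))  # one logit per patch

def step1_discriminative(enc, loader, epochs):
    # Step 1: discriminative learning only (pretext labels, e.g., rotations).
    opt = torch.optim.Adam(enc.parameters(), lr=1e-4)
    for _ in range(epochs):
        for x, label in loader:
            logits, _, _ = enc(x)
            loss = F.cross_entropy(logits, label)
            opt.zero_grad(); loss.backward(); opt.step()

def step2_joint(enc, dec, loader, epochs, lam=1.0):
    # Step 2: joint discriminative + restorative learning with the
    # pretrained encoder attached to a skip-connected decoder.
    params = list(enc.parameters()) + list(dec.parameters())
    opt = torch.optim.Adam(params, lr=1e-4)
    for _ in range(epochs):
        for x_in, x_orig, label in loader:   # distorted input, clean target
            logits, z, s = enc(x_in)
            recon = dec(z, s)
            loss = F.cross_entropy(logits, label) + lam * F.mse_loss(recon, x_orig)
            opt.zero_grad(); loss.backward(); opt.step()

def step3_full(enc, dec, adv, loader, epochs, lam=1.0, mu=0.1):
    # Step 3: full discriminative + restorative + adversarial learning.
    g_opt = torch.optim.Adam(list(enc.parameters()) + list(dec.parameters()), lr=1e-4)
    d_opt = torch.optim.Adam(adv.parameters(), lr=1e-4)
    for _ in range(epochs):
        for x_in, x_orig, label in loader:
            logits, z, s = enc(x_in)
            recon = dec(z, s)
            real = torch.ones(x_orig.size(0))
            fake = torch.zeros(x_orig.size(0))
            # Update the adversary on originals vs. detached reconstructions.
            d_loss = (F.binary_cross_entropy_with_logits(adv(x_orig), real)
                      + F.binary_cross_entropy_with_logits(adv(recon.detach()), fake))
            d_opt.zero_grad(); d_loss.backward(); d_opt.step()
            # Update the encoder-decoder with all three SSL ingredients.
            g_loss = (F.cross_entropy(logits, label)
                      + lam * F.mse_loss(recon, x_orig)
                      + mu * F.binary_cross_entropy_with_logits(adv(recon), real))
            g_opt.zero_grad(); g_loss.backward(); g_opt.step()

A possible usage, with hypothetical loaders that yield 3D patches, their pretext labels, and (for steps 2 and 3) the undistorted originals; each step starts from the weights the previous step produced, which is what stabilizes United model training:

enc, dec, adv = Encoder3D(), Decoder3D(), Discriminator3D()
step1_discriminative(enc, disc_loader, epochs=10)
step2_joint(enc, dec, joint_loader, epochs=10)
step3_full(enc, dec, adv, joint_loader, epochs=10)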