
Rethinking Domain-Specific Pretraining by Supervised or Self-Supervised Learning for Chest Radiograph Classification: A Comparative Study Against ImageNet Counterparts in Cold-Start Active Learning.

Authors

Yuan Han, Zhu Mingcheng, Yang Rui, Liu Han, Li Irene, Hong Chuan

Affiliations

Duke-NUS Medical School, Centre for Quantitative Medicine, Singapore, Singapore.

Department of Engineering Science, University of Oxford, Oxford, UK.

Publication

Health Care Sci. 2025 Apr 6;4(2):110-143. doi: 10.1002/hcs2.70009. eCollection 2025 Apr.

Abstract

OBJECTIVE

Deep learning (DL) has become the prevailing method in chest radiograph analysis, yet its performance depends heavily on large quantities of annotated images. To mitigate this annotation cost, cold-start active learning (AL), comprising an initialization stage followed by subsequent learning, selects a small subset of informative data points for labeling. Recent pretrained models tailored to chest radiographs, built by supervised or self-supervised learning, have shown broad applicability to diverse downstream tasks. However, their potential in cold-start AL remains unexplored.
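To make the two-stage pipeline above concrete, the following is a minimal sketch of a cold-start AL loop: a one-shot initialization batch, followed by iterative rounds that query the most uncertain unlabeled points. The synthetic data, logistic-regression classifier, batch size of 20, and five rounds are illustrative assumptions, not the study's configuration.

    # Minimal cold-start AL loop on synthetic data (illustrative assumptions
    # throughout; the paper uses chest radiographs and DL classifiers).
    import numpy as np
    from sklearn.linear_model import LogisticRegression

    rng = np.random.default_rng(0)
    X = rng.normal(size=(1000, 64))  # stand-in for pretrained-model embeddings
    y = (X[:, 0] + 0.5 * rng.normal(size=1000) > 0).astype(int)  # synthetic labels

    # Initialization: a one-shot batch (random here; the paper studies
    # pretrained-model-based diversity, uncertainty, and hybrid strategies).
    labeled = list(rng.choice(len(X), size=20, replace=False))
    unlabeled = [i for i in range(len(X)) if i not in labeled]

    # Subsequent learning: iterative uncertainty-based acquisition.
    for _round in range(5):
        clf = LogisticRegression(max_iter=1000).fit(X[labeled], y[labeled])
        proba = clf.predict_proba(X[unlabeled])
        entropy = -(proba * np.log(proba + 1e-12)).sum(axis=1)  # predictive entropy
        picks = np.argsort(entropy)[-20:]      # query the 20 most uncertain points
        newly = [unlabeled[i] for i in picks]
        labeled += newly                       # oracle labels are simulated by y
        unlabeled = [i for i in unlabeled if i not in newly]

    print(f"labeled pool after AL: {len(labeled)} samples")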

METHODS

To validate the efficacy of domain-specific pretraining, we compared two foundation models, the supervised TXRV and the self-supervised REMEDIS, with their general-domain counterparts pretrained on ImageNet. Model performance was evaluated at both the initialization and subsequent learning stages on two diagnostic tasks: pediatric pneumonia and COVID-19. For initialization, we assessed the integration of each pretrained model with three strategies: diversity, uncertainty, and hybrid sampling (sketched below). For subsequent learning, we focused on uncertainty sampling powered by the different pretrained models. We also conducted statistical tests to compare the foundation models with their ImageNet counterparts, investigate the relationship between initialization and subsequent learning, examine the performance of one-shot initialization against the full AL process, and assess the influence of class balance in initialization samples on both stages.
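The three initialization strategies can be illustrated on embeddings extracted by a pretrained model. In the sketch below, diversity is formulated as k-means coverage, uncertainty as a small softmax-probability margin, and hybrid as cluster-then-most-uncertain; these are common formulations assumed for concreteness, and the paper's exact definitions may differ.

    # Illustrative initialization strategies over pretrained-model embeddings.
    import numpy as np
    from sklearn.cluster import KMeans

    def diversity_sampling(emb, budget, seed=0):
        """Cluster embeddings and return the point nearest each centroid."""
        km = KMeans(n_clusters=budget, n_init=10, random_state=seed).fit(emb)
        dists = np.linalg.norm(emb - km.cluster_centers_[km.labels_], axis=1)
        return np.array([np.where(km.labels_ == c)[0][np.argmin(dists[km.labels_ == c])]
                         for c in range(budget)])

    def uncertainty_sampling(probs, budget):
        """Return points with the smallest top-two class-probability margin."""
        sorted_p = np.sort(probs, axis=1)
        margin = sorted_p[:, -1] - sorted_p[:, -2]
        return np.argsort(margin)[:budget]

    def hybrid_sampling(emb, probs, budget, seed=0):
        """Cluster for coverage, then take the most uncertain point per cluster."""
        km = KMeans(n_clusters=budget, n_init=10, random_state=seed).fit(emb)
        sorted_p = np.sort(probs, axis=1)
        margin = sorted_p[:, -1] - sorted_p[:, -2]
        picks = []
        for c in range(budget):
            members = np.where(km.labels_ == c)[0]
            picks.append(members[np.argmin(margin[members])])
        return np.array(picks)

    # Example usage with synthetic embeddings and class probabilities:
    rng = np.random.default_rng(0)
    emb = rng.normal(size=(500, 128))
    probs = rng.dirichlet(np.ones(2), size=500)
    print(diversity_sampling(emb, budget=20)[:5])
    print(uncertainty_sampling(probs, budget=20)[:5])
    print(hybrid_sampling(emb, probs, budget=20)[:5])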

RESULTS

First, the domain-specific foundation models failed to outperform their ImageNet counterparts in six of eight experiments on informative sample selection. In seven of the eight scenarios, neither the domain-specific nor the general pretrained models generated representations that could substitute for the original images as model inputs. However, pretrained model-based initialization surpassed random sampling, the default approach in cold-start AL. Second, initialization performance was positively correlated with subsequent learning performance, highlighting the importance of initialization strategies. Third, one-shot initialization performed comparably to the full AL process, demonstrating the potential to reduce experts' repeated waiting during AL iterations. Finally, a U-shaped correlation was observed between the class balance of initialization samples and model performance, suggesting that class balance is more strongly associated with performance at middle budget levels than at low or high budgets.
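The comparisons and correlations reported above can be illustrated with standard tests; the sketch below pairs a paired Wilcoxon signed-rank test (domain-specific vs. ImageNet AUCs across repeated runs) with a Spearman correlation between initialization and subsequent-learning performance. All arrays are synthetic placeholders, and the study's actual test choices and run counts are not reproduced here.

    # Illustrative statistical checks with placeholder data.
    import numpy as np
    from scipy.stats import spearmanr, wilcoxon

    rng = np.random.default_rng(0)
    auc_domain = rng.uniform(0.80, 0.90, size=10)                # per-run AUCs (placeholder)
    auc_imagenet = auc_domain + rng.normal(0.01, 0.01, size=10)  # placeholder counterpart

    stat, p = wilcoxon(auc_domain, auc_imagenet)                 # paired comparison
    print(f"Wilcoxon signed-rank: stat={stat:.3f}, p={p:.3f}")

    init_perf = rng.uniform(0.6, 0.8, size=10)                   # initialization AUCs (placeholder)
    subseq_perf = init_perf + rng.normal(0.05, 0.02, size=10)    # subsequent-learning AUCs
    rho, p_rho = spearmanr(init_perf, subseq_perf)               # monotone association
    print(f"Spearman correlation: rho={rho:.3f}, p={p_rho:.3f}")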

CONCLUSIONS

In this study, we highlighted the limitations of medical pretraining compared with general pretraining in the context of cold-start AL. We also identified promising outcomes for cold-start AL, including initialization based on pretrained models, the positive influence of initialization on subsequent learning, the potential of one-shot initialization, and the influence of class balance on middle-budget AL. Researchers are encouraged to improve medical pretraining toward versatile DL foundation models and to explore novel AL methods.


Graphical abstract: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d371/11997468/f488b6b93498/HCS2-4-110-g005.jpg
