基于图的转换器和数据增强主动学习框架，具有可解释的特征，用于多标签胸部 X 光分类。

GANDALF: Graph-based transformer and Data Augmentation Active Learning Framework with interpretable features for multi-label chest Xray classification.

机构信息

Inception Institute of AI, Abu Dhabi, United Arab Emirates; Faculty of IT, Monash University, Melbourne, Australia.

École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland; Lausanne University Hospital (CHUV), Lausanne, Switzerland.

出版信息

Med Image Anal. 2024 Apr;93:103075. doi: 10.1016/j.media.2023.103075. Epub 2024 Jan 6.

DOI:10.1016/j.media.2023.103075

PMID:38199069

Abstract

Informative sample selection in an active learning (AL) setting helps a machine learning system attain optimum performance with minimum labeled samples, thus reducing annotation costs and boosting performance of computer-aided diagnosis systems in the presence of limited labeled data. Another effective technique to enlarge datasets in a small labeled data regime is data augmentation. An intuitive active learning approach thus consists of combining informative sample selection and data augmentation to leverage their respective advantages and improve the performance of AL systems. In this paper, we propose a novel approach called GANDALF (Graph-based TrANsformer and Data Augmentation Active Learning Framework) to combine sample selection and data augmentation in a multi-label setting. Conventional sample selection approaches in AL have mostly focused on the single-label setting where a sample has only one disease label. These approaches do not perform optimally when a sample can have multiple disease labels (e.g., in chest X-ray images). We improve upon state-of-the-art multi-label active learning techniques by representing disease labels as graph nodes and use graph attention transformers (GAT) to learn more effective inter-label relationships. We identify the most informative samples by aggregating GAT representations. Subsequently, we generate transformations of these informative samples by sampling from a learned latent space. From these generated samples, we identify informative samples via a novel multi-label informativeness score, which beyond the state of the art, ensures that (i) generated samples are not redundant with respect to the training data and (ii) make important contributions to the training stage. We apply our method to two public chest X-ray datasets, as well as breast, dermatology, retina and kidney tissue microscopy MedMNIST datasets, and report improved results over state-of-the-art multi-label AL techniques in terms of model performance, learning rates, and robustness.

摘要

在主动学习（AL）环境中进行信息样本选择有助于机器学习系统在使用最少标注样本的情况下达到最佳性能，从而降低标注成本，并在有限的标注数据情况下提高计算机辅助诊断系统的性能。另一种在小标注数据环境中扩充数据集的有效技术是数据增强。因此，一种直观的主动学习方法是将信息样本选择和数据增强相结合，以利用它们各自的优势并提高 AL 系统的性能。在本文中，我们提出了一种名为 GANDALF（基于图的 Transformer 和数据增强主动学习框架）的新方法，用于在多标签环境中结合样本选择和数据增强。传统的 AL 中的样本选择方法主要集中在单标签设置上，其中一个样本只有一个疾病标签。当一个样本可以有多个疾病标签（例如，在胸部 X 射线图像中）时，这些方法不能达到最佳性能。我们通过将疾病标签表示为图节点，并使用图注意力转换器（GAT）来学习更有效的标签间关系，从而改进了最先进的多标签主动学习技术。我们通过聚合 GAT 表示来识别最有信息的样本。然后，我们通过从学习的潜在空间中采样来生成这些有信息样本的变换。从这些生成的样本中，我们通过一种新的多标签信息量得分来识别有信息的样本，该得分不仅超越了现有技术，还确保了（i）生成的样本相对于训练数据不是冗余的，并且（ii）对训练阶段做出了重要贡献。我们将我们的方法应用于两个公共的胸部 X 射线数据集，以及乳腺、皮肤科、视网膜和肾脏组织显微镜 MedMNIST 数据集，并报告了在模型性能、学习率和鲁棒性方面优于最先进的多标签 AL 技术的结果。

相似文献

GANDALF: Graph-based transformer and Data Augmentation Active Learning Framework with interpretable features for multi-label chest Xray classification.

Med Image Anal. 2024 Apr;93:103075. doi: 10.1016/j.media.2023.103075. Epub 2024 Jan 6.

Graph Node Based Interpretability Guided Sample Selection for Active Learning.

IEEE Trans Med Imaging. 2023 Mar;42(3):661-673. doi: 10.1109/TMI.2022.3215017. Epub 2023 Mar 2.

Label correlation transformer for automated chest X-ray diagnosis with reliable interpretability.

Radiol Med. 2023 Jun;128(6):726-733. doi: 10.1007/s11547-023-01647-0. Epub 2023 May 26.

Chest x-ray diagnosis via spatial-channel high-order attention representation learning.

Phys Med Biol. 2024 Feb 13;69(4). doi: 10.1088/1361-6560/ad2014.

CheXGAT: A disease correlation-aware network for thorax disease diagnosis from chest X-ray images.

Artif Intell Med. 2022 Oct;132:102382. doi: 10.1016/j.artmed.2022.102382. Epub 2022 Aug 27.

Label Co-Occurrence Learning With Graph Convolutional Networks for Multi-Label Chest X-Ray Image Classification.

IEEE J Biomed Health Inform. 2020 Aug;24(8):2292-2302. doi: 10.1109/JBHI.2020.2967084. Epub 2020 Jan 16.

Modeling global and local label correlation with graph convolutional networks for multi-label chest X-ray image classification.

Med Biol Eng Comput. 2022 Sep;60(9):2567-2588. doi: 10.1007/s11517-022-02604-1. Epub 2022 Jul 4.

Interpretability-Driven Sample Selection Using Self Supervised Learning for Disease Classification and Segmentation.

IEEE Trans Med Imaging. 2021 Oct;40(10):2548-2562. doi: 10.1109/TMI.2021.3061724. Epub 2021 Sep 30.

Multi-Label Generalized Zero Shot Chest X-Ray Classification by Combining Image-Text Information With Feature Disentanglement.

IEEE Trans Med Imaging. 2025 Jan;44(1):31-43. doi: 10.1109/TMI.2024.3429471. Epub 2025 Jan 2.

ImageGCN: Multi-Relational Image Graph Convolutional Networks for Disease Identification With Chest X-Rays.

IEEE Trans Med Imaging. 2022 Aug;41(8):1990-2003. doi: 10.1109/TMI.2022.3153322. Epub 2022 Aug 1.

引用本文的文献

Rethinking Domain-Specific Pretraining by Supervised or Self-Supervised Learning for Chest Radiograph Classification: A Comparative Study Against ImageNet Counterparts in Cold-Start Active Learning.

Health Care Sci. 2025 Apr 6;4(2):110-143. doi: 10.1002/hcs2.70009. eCollection 2025 Apr.

Three-Stage Framework for Accurate Pediatric Chest X-ray Diagnosis Using Self-Supervision and Transfer Learning on Small Datasets.

Diagnostics (Basel). 2024 Jul 29;14(15):1634. doi: 10.3390/diagnostics14151634.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于图的转换器和数据增强主动学习框架，具有可解释的特征，用于多标签胸部 X 光分类。

GANDALF: Graph-based transformer and Data Augmentation Active Learning Framework with interpretable features for multi-label chest Xray classification.

机构信息

Inception Institute of AI, Abu Dhabi, United Arab Emirates; Faculty of IT, Monash University, Melbourne, Australia.

École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland; Lausanne University Hospital (CHUV), Lausanne, Switzerland.

出版信息

Med Image Anal. 2024 Apr;93:103075. doi: 10.1016/j.media.2023.103075. Epub 2024 Jan 6.

DOI:10.1016/j.media.2023.103075

PMID:38199069

Abstract

摘要

基于图的转换器和数据增强主动学习框架，具有可解释的特征，用于多标签胸部 X 光分类。

GANDALF: Graph-based transformer and Data Augmentation Active Learning Framework with interpretable features for multi-label chest Xray classification.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

基于图的转换器和数据增强主动学习框架，具有可解释的特征，用于多标签胸部 X 光分类。

GANDALF: Graph-based transformer and Data Augmentation Active Learning Framework with interpretable features for multi-label chest Xray classification.

机构信息

出版信息

相似文献

引用本文的文献