Suppr超能文献

利用基础多任务模型克服生物医学成像中的数据匮乏问题。

Overcoming data scarcity in biomedical imaging with a foundational multi-task model.

机构信息

Fraunhofer Institute for Digital Medicine MEVIS, Bremen, Germany.

Institute of Image Analysis and Computer Vision, Faculty of Informatics and Data Science, University of Regensburg, Regensburg, Germany.

出版信息

Nat Comput Sci. 2024 Jul;4(7):495-509. doi: 10.1038/s43588-024-00662-z. Epub 2024 Jul 19.

Abstract

Foundational models, pretrained on a large scale, have demonstrated substantial success across non-medical domains. However, training these models typically requires large, comprehensive datasets, which contrasts with the smaller and more specialized datasets common in biomedical imaging. Here we propose a multi-task learning strategy that decouples the number of training tasks from memory requirements. We trained a universal biomedical pretrained model (UMedPT) on a multi-task database including tomographic, microscopic and X-ray images, with various labeling strategies such as classification, segmentation and object detection. The UMedPT foundational model outperformed ImageNet pretraining and previous state-of-the-art models. For classification tasks related to the pretraining database, it maintained its performance with only 1% of the original training data and without fine-tuning. For out-of-domain tasks it required only 50% of the original training data. In an external independent validation, imaging features extracted using UMedPT proved to set a new standard for cross-center transferability.

摘要

基础模型经过大规模预训练,已在非医学领域取得了显著成功。然而,这些模型的训练通常需要大型、全面的数据集,而生物医学成像中常见的数据集则更小且更专业。在这里,我们提出了一种从记忆需求中分离出训练任务数量的多任务学习策略。我们在一个多任务数据库上训练了一个通用的生物医学预训练模型 (UMedPT),该数据库包括断层扫描、显微镜和 X 射线图像,以及各种标记策略,如分类、分割和目标检测。UMedPT 基础模型的表现优于 ImageNet 预训练和以前的最先进模型。对于与预训练数据库相关的分类任务,它只需原始训练数据的 1%,并且无需微调,即可保持其性能。对于域外任务,它只需要原始训练数据的 50%。在外部独立验证中,使用 UMedPT 提取的成像特征被证明为跨中心可转移性设定了新的标准。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bc4/11288886/7fdc34aeba4c/43588_2024_662_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验