大规模长尾疾病在放射影像中的诊断。

Large-scale long-tailed disease diagnosis on radiology images.

机构信息

Shanghai Jiao Tong University, Shanghai, China.

Shanghai Artificial Intelligence Laboratory, Shanghai, China.

出版信息

Nat Commun. 2024 Nov 22;15(1):10147. doi: 10.1038/s41467-024-54424-6.

DOI:10.1038/s41467-024-54424-6

PMID:39578456

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11584732/

Abstract

Developing a generalist radiology diagnosis system can greatly enhance clinical diagnostics. In this paper, we introduce RadDiag, a foundational model supporting 2D and 3D inputs across various modalities and anatomies, using a transformer-based fusion module for comprehensive disease diagnosis. Due to patient privacy concerns and the lack of large-scale radiology diagnosis datasets, we utilize high-quality, clinician-reviewed radiological images available online with diagnosis labels. Our dataset, RP3D-DiagDS, contains 40,936 cases with 195,010 scans covering 5568 disorders (930 unique ICD-10-CM codes). Experimentally, our RadDiag achieves 95.14% AUC on internal evaluation with the knowledge-enhancement strategy. Additionally, RadDiag can be zero-shot applied or fine-tuned to external diagnosis datasets sourced from various medical centers, demonstrating state-of-the-art results. In conclusion, we show that publicly shared medical data on the Internet is a tremendous and valuable resource that can potentially support building strong models for image understanding in healthcare.

摘要

开发通用放射诊断系统可以极大地提高临床诊断水平。在本文中，我们介绍了 RadDiag，这是一个基础模型，支持 2D 和 3D 输入，涵盖各种模态和解剖结构，使用基于转换器的融合模块进行全面的疾病诊断。由于患者隐私问题和缺乏大规模放射诊断数据集，我们利用在线提供的高质量、经过临床医生审查的放射图像，并附有诊断标签。我们的数据集 RP3D-DiagDS 包含 40936 个病例，195010 个扫描，涵盖 5568 种疾病（930 个独特的 ICD-10-CM 代码）。在实验中，我们的 RadDiag 在具有知识增强策略的内部评估中达到了 95.14%的 AUC。此外，RadDiag 可以零样本应用或微调来自不同医疗中心的外部诊断数据集，展示了最先进的结果。总之，我们表明互联网上共享的公共医疗数据是一个巨大而有价值的资源，它有可能支持在医疗保健领域构建强大的图像理解模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2466/11584732/ab28d789c765/41467_2024_54424_Fig1_HTML.jpg

相似文献

Large-scale long-tailed disease diagnosis on radiology images.

Nat Commun. 2024 Nov 22;15(1):10147. doi: 10.1038/s41467-024-54424-6.

A 3D hierarchical cross-modality interaction network using transformers and convolutions for brain glioma segmentation in MR images.

Med Phys. 2024 Nov;51(11):8371-8389. doi: 10.1002/mp.17354. Epub 2024 Aug 13.

[Fully Automatic Glioma Segmentation Algorithm of Magnetic Resonance Imaging Based on 3D-UNet With More Global Contextual Feature Extraction: An Improvement on Insufficient Extraction of Global Features].

Sichuan Da Xue Xue Bao Yi Xue Ban. 2024 Mar 20;55(2):447-454. doi: 10.12182/20240360208.

MedMNIST v2 - A large-scale lightweight benchmark for 2D and 3D biomedical image classification.

Sci Data. 2023 Jan 19;10(1):41. doi: 10.1038/s41597-022-01721-8.

PMFSNet: Polarized multi-scale feature self-attention network for lightweight medical image segmentation.

Comput Methods Programs Biomed. 2025 Apr;261:108611. doi: 10.1016/j.cmpb.2025.108611. Epub 2025 Jan 25.

Seeking an optimal approach for Computer-aided Diagnosis of Pulmonary Embolism.

Med Image Anal. 2024 Jan;91:102988. doi: 10.1016/j.media.2023.102988. Epub 2023 Oct 13.

UNesT: Local spatial representation learning with hierarchical transformer for efficient medical segmentation.

Med Image Anal. 2023 Dec;90:102939. doi: 10.1016/j.media.2023.102939. Epub 2023 Aug 25.

Image-level supervision and self-training for transformer-based cross-modality tumor segmentation.

Med Image Anal. 2024 Oct;97:103287. doi: 10.1016/j.media.2024.103287. Epub 2024 Jul 31.

Hand Pose Understanding With Large-Scale Photo-Realistic Rendering Dataset.

IEEE Trans Image Process. 2021;30:4275-4290. doi: 10.1109/TIP.2021.3070439. Epub 2021 Apr 14.

Embracing Large Natural Data: Enhancing Medical Image Analysis via Cross-Domain Fine-Tuning.

IEEE J Biomed Health Inform. 2024 Aug;28(8):4512-4521. doi: 10.1109/JBHI.2023.3343518. Epub 2024 Aug 6.

引用本文的文献

Large-vocabulary segmentation for medical images with text prompts.

NPJ Digit Med. 2025 Sep 2;8(1):566. doi: 10.1038/s41746-025-01964-w.

CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray.

Med Image Anal. 2025 Jul 29;106:103739. doi: 10.1016/j.media.2025.103739.

CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray.

ArXiv. 2025 Jun 9:arXiv:2506.07984v1.

Comparative Evaluation of Large Language and Multimodal Models in Detecting Spinal Stabilization Systems on X-Ray Images.

J Clin Med. 2025 May 8;14(10):3282. doi: 10.3390/jcm14103282.

本文引用的文献

AUCReshaping: improved sensitivity at high-specificity.

Sci Rep. 2023 Nov 30;13(1):21097. doi: 10.1038/s41598-023-48482-x.

MedCPT: Contrastive Pre-trained Transformers with large-scale PubMed search logs for zero-shot biomedical information retrieval.

Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad651.

Knowledge-enhanced visual-language pre-training on chest radiology images.

Nat Commun. 2023 Jul 28;14(1):4542. doi: 10.1038/s41467-023-40260-7.

VinDr-Mammo: A large-scale benchmark dataset for computer-aided diagnosis in full-field digital mammography.

Sci Data. 2023 May 12;10(1):277. doi: 10.1038/s41597-023-02100-7.

PediCXR: An open, large-scale chest radiograph dataset for interpretation of common thoracic diseases in children.

Sci Data. 2023 Apr 27;10(1):240. doi: 10.1038/s41597-023-02102-5.

Explainable artificial intelligence for mental health through transparency and interpretability for understandability.

NPJ Digit Med. 2023 Jan 18;6(1):6. doi: 10.1038/s41746-023-00751-9.

RadImageNet: An Open Radiologic Deep Learning Research Dataset for Effective Transfer Learning.

Radiol Artif Intell. 2022 Jul 27;4(5):e210315. doi: 10.1148/ryai.210315. eCollection 2022 Sep.

Expert-level detection of pathologies from unannotated chest X-ray images via self-supervised learning.

Nat Biomed Eng. 2022 Dec;6(12):1399-1406. doi: 10.1038/s41551-022-00936-9. Epub 2022 Sep 15.

VinDr-CXR: An open dataset of chest X-rays with radiologist's annotations.

Sci Data. 2022 Jul 20;9(1):429. doi: 10.1038/s41597-022-01498-w.

Vision transformer and explainable transfer learning models for auto detection of kidney cyst, stone and tumor from CT-radiography.

Sci Rep. 2022 Jul 6;12(1):11440. doi: 10.1038/s41598-022-15634-4.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

大规模长尾疾病在放射影像中的诊断。

Large-scale long-tailed disease diagnosis on radiology images.

机构信息

Shanghai Jiao Tong University, Shanghai, China.

Shanghai Artificial Intelligence Laboratory, Shanghai, China.

出版信息

Nat Commun. 2024 Nov 22;15(1):10147. doi: 10.1038/s41467-024-54424-6.

DOI:10.1038/s41467-024-54424-6

PMID:39578456

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11584732/

Abstract

摘要

大规模长尾疾病在放射影像中的诊断。

Large-scale long-tailed disease diagnosis on radiology images.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

大规模长尾疾病在放射影像中的诊断。

Large-scale long-tailed disease diagnosis on radiology images.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献