• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

统一大脑:基于分层知识增强预训练的通用脑部磁共振成像诊断

UniBrain: Universal Brain MRI diagnosis with hierarchical knowledge-enhanced pre-training.

作者信息

Lei Jiayu, Dai Lisong, Jiang Haoyun, Wu Chaoyi, Zhang Xiaoman, Zhang Yao, Yao Jiangchao, Xie Weidi, Zhang Yanyong, Li Yuehua, Zhang Ya, Wang Yanfeng

机构信息

School of Computer Science and Technology, University of Science and Technology of China, Hefei, Anhui, 230026, China; Shanghai Artificial Intelligence Laboratory, Shanghai, 200232, China.

Shanghai Sixth People's Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, 200233, China.

出版信息

Comput Med Imaging Graph. 2025 Jun;122:102516. doi: 10.1016/j.compmedimag.2025.102516. Epub 2025 Mar 7.

DOI:10.1016/j.compmedimag.2025.102516
PMID:40073706
Abstract

Magnetic Resonance Imaging (MRI) has become a pivotal tool in diagnosing brain diseases, with a wide array of computer-aided artificial intelligence methods being proposed to enhance diagnostic accuracy. However, early studies were often limited by small-scale datasets and a narrow range of disease types, which posed challenges in model generalization. This study presents UniBrain, a hierarchical knowledge-enhanced pre-training framework designed for universal brain MRI diagnosis. UniBrain leverages a large-scale dataset comprising 24,770 imaging-report pairs from routine diagnostics for pre-training. Unlike previous approaches that either focused solely on visual representation learning or used brute-force alignment between vision and language, the framework introduces a hierarchical alignment mechanism. This mechanism extracts structured knowledge from free-text clinical reports at multiple granularities, enabling vision-language alignment at both the sequence and case levels, thereby significantly improving feature learning efficiency. A coupled vision-language perception module is further employed for text-guided multi-label classification, which facilitates zero-shot evaluation and fine-tuning of downstream tasks without modifying the model architecture. UniBrain is validated on both in-domain and out-of-domain datasets, consistently surpassing existing state-of-the-art diagnostic models and demonstrating performance on par with radiologists in specific disease categories. It shows strong generalization capabilities across diverse tasks, highlighting its potential for broad clinical application. The code is available at https://github.com/ljy19970415/UniBrain.

摘要

磁共振成像(MRI)已成为诊断脑部疾病的关键工具,人们提出了各种各样的计算机辅助人工智能方法来提高诊断准确性。然而,早期研究往往受限于小规模数据集和狭窄的疾病类型范围,这给模型泛化带来了挑战。本研究提出了UniBrain,这是一个为通用脑部MRI诊断设计的分层知识增强预训练框架。UniBrain利用一个包含来自常规诊断的24770个影像-报告对的大规模数据集进行预训练。与以往要么只专注于视觉表征学习,要么在视觉和语言之间使用蛮力对齐的方法不同,该框架引入了一种分层对齐机制。这种机制从自由文本临床报告中以多种粒度提取结构化知识,实现序列和病例级别的视觉-语言对齐,从而显著提高特征学习效率。还采用了一个耦合的视觉-语言感知模块进行文本引导的多标签分类,这有助于在不修改模型架构的情况下对下游任务进行零样本评估和微调。UniBrain在域内和域外数据集上均得到验证,始终超越现有的最先进诊断模型,并在特定疾病类别中表现出与放射科医生相当的性能。它在各种任务中都显示出强大的泛化能力,突出了其广泛临床应用的潜力。代码可在https://github.com/ljy19970415/UniBrain获取。

相似文献

1
UniBrain: Universal Brain MRI diagnosis with hierarchical knowledge-enhanced pre-training.统一大脑:基于分层知识增强预训练的通用脑部磁共振成像诊断
Comput Med Imaging Graph. 2025 Jun;122:102516. doi: 10.1016/j.compmedimag.2025.102516. Epub 2025 Mar 7.
2
ATOMMIC: An Advanced Toolbox for Multitask Medical Imaging Consistency to facilitate Artificial Intelligence applications from acquisition to analysis in Magnetic Resonance Imaging.ATOMMIC:一个高级的多任务医学成像一致性工具箱,旨在促进磁共振成像从采集到分析的人工智能应用。
Comput Methods Programs Biomed. 2024 Nov;256:108377. doi: 10.1016/j.cmpb.2024.108377. Epub 2024 Aug 22.
3
A Foundation Language-Image Model of the Retina (FLAIR): encoding expert knowledge in text supervision.视网膜的基础语言-图像模型(FLAIR):在文本监督中编码专家知识。
Med Image Anal. 2025 Jan;99:103357. doi: 10.1016/j.media.2024.103357. Epub 2024 Oct 1.
4
BrainSegFounder: Towards 3D foundation models for neuroimage segmentation.BrainSegFounder:迈向神经影像分割的 3D 基础模型。
Med Image Anal. 2024 Oct;97:103301. doi: 10.1016/j.media.2024.103301. Epub 2024 Aug 8.
5
Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像(MRI)中进行脑肿瘤分割与检测
Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.
6
Brain tumor segmentation using multi-scale attention U-Net with EfficientNetB4 encoder for enhanced MRI analysis.使用带有EfficientNetB4编码器的多尺度注意力U-Net进行脑肿瘤分割以增强MRI分析
Sci Rep. 2025 Mar 22;15(1):9914. doi: 10.1038/s41598-025-94267-9.
7
Lightweight MRI Brain Tumor Segmentation Enhanced by Hierarchical Feature Fusion.基于层次特征融合的轻量化 MRI 脑肿瘤分割。
Tomography. 2024 Oct 1;10(10):1577-1590. doi: 10.3390/tomography10100116.
8
DBAII-Net with multiscale feature aggregation and cross-modal attention for enhancing infant brain injury classification in MRI.基于多尺度特征聚合和跨模态注意力的 DBAII-Net 用于增强 MRI 中婴儿脑损伤分类。
Phys Med Biol. 2024 Oct 14;69(20). doi: 10.1088/1361-6560/ad80f7.
9
Prototype-guided multi-scale domain adaptation for Alzheimer's disease detection.基于原型引导的多尺度领域自适应阿尔茨海默病检测。
Comput Biol Med. 2023 Mar;154:106570. doi: 10.1016/j.compbiomed.2023.106570. Epub 2023 Jan 23.
10
AttriPrompter: Auto-Prompting With Attribute Semantics for Zero-Shot Nuclei Detection via Visual-Language Pre-Trained Models.AttriPrompter:通过视觉语言预训练模型进行基于属性语义的零样本细胞核检测自动提示
IEEE Trans Med Imaging. 2025 Feb;44(2):982-993. doi: 10.1109/TMI.2024.3473745. Epub 2025 Feb 4.

引用本文的文献

1
Large-vocabulary segmentation for medical images with text prompts.基于文本提示的医学图像大词汇量分割
NPJ Digit Med. 2025 Sep 2;8(1):566. doi: 10.1038/s41746-025-01964-w.
2
Multimodal contrastive learning for enhanced explainability in pediatric brain tumor molecular diagnosis.用于增强小儿脑肿瘤分子诊断可解释性的多模态对比学习
Sci Rep. 2025 Mar 30;15(1):10943. doi: 10.1038/s41598-025-94806-4.