• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于表型的人工智能流程在使用电子健康记录对罕见疾病进行鉴别诊断方面比人类专家表现更出色。

A phenotype-based AI pipeline outperforms human experts in differentially diagnosing rare diseases using EHRs.

作者信息

Mao Xiaohao, Huang Yu, Jin Ye, Wang Lun, Chen Xuanzhong, Liu Honghong, Yang Xinglin, Xu Haopeng, Luan Xiaodong, Xiao Ying, Feng Siqin, Zhu Jiahao, Zhang Xuegong, Jiang Rui, Zhang Shuyang, Chen Ting

机构信息

Department of Computer Science and Technology & Institute for Artificial Intelligence & BNRist, Tsinghua University, Beijing, China.

Tencent Jarvis Lab, Shenzhen, China.

出版信息

NPJ Digit Med. 2025 Jan 28;8(1):68. doi: 10.1038/s41746-025-01452-1.

DOI:10.1038/s41746-025-01452-1
PMID:39875532
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11775211/
Abstract

Rare diseases, affecting ~350 million people worldwide, pose significant challenges in clinical diagnosis due to the lack of experienced physicians and the complexity of differentiating between numerous rare diseases. To address these challenges, we introduce PhenoBrain, a fully automated artificial intelligence pipeline. PhenoBrain utilizes a BERT-based natural language processing model to extract phenotypes from clinical texts in EHRs and employs five new diagnostic models for differential diagnoses of rare diseases. The AI system was developed and evaluated on diverse, multi-country rare disease datasets, comprising 2271 cases with 431 rare diseases. In 1936 test cases, PhenoBrain achieved an average predicted top-3 recall of 0.513 and a top-10 recall of 0.654, surpassing 13 leading prediction methods. In a human-computer study with 75 cases, PhenoBrain exhibited exceptional performance with a top-3 recall of 0.613 and a top-10 recall of 0.813, surpassing the performance of 50 specialist physicians and large language models like ChatGPT and GPT-4. Combining PhenoBrain's predictions with specialists increased the top-3 recall to 0.768, demonstrating its potential to enhance diagnostic accuracy in clinical workflows.

摘要

罕见病影响着全球约3.5亿人,由于缺乏经验丰富的医生以及区分众多罕见病的复杂性,在临床诊断中面临重大挑战。为应对这些挑战,我们引入了PhenoBrain,这是一个全自动的人工智能流程。PhenoBrain利用基于BERT的自然语言处理模型从电子健康记录(EHR)中的临床文本中提取表型,并采用五种新的诊断模型对罕见病进行鉴别诊断。该人工智能系统是在包含2271例431种罕见病的多国家、多样的罕见病数据集上开发和评估的。在1936个测试病例中,PhenoBrain的平均预测前3召回率为0.513,前10召回率为0.654,超过了13种领先的预测方法。在一项针对75个病例的人机研究中,PhenoBrain表现出色,前3召回率为0.613,前10召回率为0.813,超过了50名专科医生以及ChatGPT和GPT - 4等大型语言模型的表现。将PhenoBrain的预测结果与专家的判断相结合,可将前3召回率提高到0.768,证明了其在临床工作流程中提高诊断准确性的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2acb/11775211/98f285208846/41746_2025_1452_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2acb/11775211/91476508457d/41746_2025_1452_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2acb/11775211/0964a661c5af/41746_2025_1452_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2acb/11775211/98f285208846/41746_2025_1452_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2acb/11775211/91476508457d/41746_2025_1452_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2acb/11775211/0964a661c5af/41746_2025_1452_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2acb/11775211/98f285208846/41746_2025_1452_Fig3_HTML.jpg

相似文献

1
A phenotype-based AI pipeline outperforms human experts in differentially diagnosing rare diseases using EHRs.一种基于表型的人工智能流程在使用电子健康记录对罕见疾病进行鉴别诊断方面比人类专家表现更出色。
NPJ Digit Med. 2025 Jan 28;8(1):68. doi: 10.1038/s41746-025-01452-1.
2
Performance of DeepSeek, Qwen 2.5 MAX, and ChatGPT Assisting in Diagnosis of Corneal Eye Diseases, Glaucoma, and Neuro-Ophthalmology Diseases Based on Clinical Case Reports.基于临床病例报告,DeepSeek、通义千问2.5 MAX和ChatGPT在角膜眼病、青光眼和神经眼科疾病诊断中的性能表现。
medRxiv. 2025 Mar 17:2025.03.14.25323836. doi: 10.1101/2025.03.14.25323836.
3
Learning to Make Rare and Complex Diagnoses With Generative AI Assistance: Qualitative Study of Popular Large Language Models.利用生成式人工智能辅助学习罕见且复杂的诊断:对流行的大型语言模型的定性研究。
JMIR Med Educ. 2024 Feb 13;10:e51391. doi: 10.2196/51391.
4
: Empowering artificial intelligence for respiratory diseases diagnosis based on electronic health records, a multicenter study.基于电子健康记录的人工智能助力呼吸系统疾病诊断:一项多中心研究
MedComm (2020). 2025 Jan 12;6(1):e70043. doi: 10.1002/mco2.70043. eCollection 2025 Jan.
5
Evaluating ChatGPT-4's Accuracy in Identifying Final Diagnoses Within Differential Diagnoses Compared With Those of Physicians: Experimental Study for Diagnostic Cases.评估ChatGPT-4在鉴别诊断中识别最终诊断的准确性与医生的准确性比较:诊断病例的实验研究
JMIR Form Res. 2024 Jun 26;8:e59267. doi: 10.2196/59267.
6
Diagnostic accuracy of large language models in psychiatry.精神科大语言模型的诊断准确性。
Asian J Psychiatr. 2024 Oct;100:104168. doi: 10.1016/j.ajp.2024.104168. Epub 2024 Jul 25.
7
Evaluating ChatGPT-4's Diagnostic Accuracy: Impact of Visual Data Integration.评估ChatGPT-4的诊断准确性:视觉数据整合的影响。
JMIR Med Inform. 2024 Apr 9;12:e55627. doi: 10.2196/55627.
8
Performance of ChatGPT in Ophthalmic Registration and Clinical Diagnosis: Cross-Sectional Study.ChatGPT 在眼科注册和临床诊断中的表现:横断面研究。
J Med Internet Res. 2024 Nov 14;26:e60226. doi: 10.2196/60226.
9
Evaluation of ChatGPT-Generated Differential Diagnosis for Common Diseases With Atypical Presentation: Descriptive Research.评估 ChatGPT 生成的不典型表现常见疾病的鉴别诊断:描述性研究。
JMIR Med Educ. 2024 Jun 21;10:e58758. doi: 10.2196/58758.
10
ChatGPT With GPT-4 Outperforms Emergency Department Physicians in Diagnostic Accuracy: Retrospective Analysis.ChatGPT 联合 GPT-4 在诊断准确率上优于急诊科医生:回顾性分析。
J Med Internet Res. 2024 Jul 8;26:e56110. doi: 10.2196/56110.

引用本文的文献

1
A large language model improves clinicians' diagnostic performance in complex critical illness cases.一个大语言模型提高了临床医生在复杂重症病例中的诊断表现。
Crit Care. 2025 Jun 6;29(1):230. doi: 10.1186/s13054-025-05468-7.

本文引用的文献

1
Enhancing phenotype recognition in clinical notes using large language models: PhenoBCBERT and PhenoGPT.使用大语言模型增强临床笔记中的表型识别:PhenoBCBERT和PhenoGPT。
Patterns (N Y). 2023 Dec 5;5(1):100887. doi: 10.1016/j.patter.2023.100887. eCollection 2024 Jan 12.
2
Phen2Disease: a phenotype-driven model for disease and gene prioritization by bidirectional maximum matching semantic similarities.Phen2Disease:一种基于表型驱动的疾病和基因优先级排序模型,通过双向最大匹配语义相似性实现。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad172.
3
Developing a Framework of Cost Elements of Socioeconomic Burden of Rare Disease: A Scoping Review.
开发罕见病社会经济负担成本要素框架:范围综述。
Pharmacoeconomics. 2023 Jul;41(7):803-818. doi: 10.1007/s40273-023-01262-x. Epub 2023 Apr 7.
4
Benefits, Limits, and Risks of GPT-4 as an AI Chatbot for Medicine.GPT-4作为医学人工智能聊天机器人的益处、局限性和风险
N Engl J Med. 2023 Mar 30;388(13):1233-1239. doi: 10.1056/NEJMsr2214184.
5
PhenoBERT: A Combined Deep Learning Method for Automated Recognition of Human Phenotype Ontology.PhenoBERT:一种用于自动识别人类表型本体的深度学习组合方法。
IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):1269-1277. doi: 10.1109/TCBB.2022.3170301. Epub 2023 Apr 3.
6
The national economic burden of rare disease in the United States in 2019.2019 年美国罕见病的国家经济负担。
Orphanet J Rare Dis. 2022 Apr 12;17(1):163. doi: 10.1186/s13023-022-02299-5.
7
A guide for the diagnosis of rare and undiagnosed disease: beyond the exome.罕见病和不明原因疾病诊断指南:超越外显子组。
Genome Med. 2022 Feb 28;14(1):23. doi: 10.1186/s13073-022-01026-w.
8
National Rare Diseases Registry System (NRDRS): China's first nation-wide rare diseases demographic analyses.国家罕见病注册系统(NRDRS):中国首次全国范围的罕见病人口学分析。
Orphanet J Rare Dis. 2021 Dec 18;16(1):515. doi: 10.1186/s13023-021-02130-7.
9
PhenoTagger: a hybrid method for phenotype concept recognition using human phenotype ontology.PhenoTagger:一种使用人类表型本体进行表型概念识别的混合方法。
Bioinformatics. 2021 Jul 27;37(13):1884-1890. doi: 10.1093/bioinformatics/btab019.
10
The case for open science: rare diseases.开放科学的案例:罕见病
JAMIA Open. 2020 Sep 11;3(3):472-486. doi: 10.1093/jamiaopen/ooaa030. eCollection 2020 Oct.