• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于罕见遗传病患者表型驱动诊断的少样本学习。

Few shot learning for phenotype-driven diagnosis of patients with rare genetic diseases.

作者信息

Alsentzer Emily, Li Michelle M, Kobren Shilpa N, Noori Ayush, Kohane Isaac S, Zitnik Marinka

机构信息

Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.

Program in Health Sciences and Technology, Massachusetts Institute of Technology, Cambridge, MA, USA.

出版信息

NPJ Digit Med. 2025 Jun 20;8(1):380. doi: 10.1038/s41746-025-01749-1.

DOI:10.1038/s41746-025-01749-1
PMID:40542121
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12181314/
Abstract

There are over 7000 rare diseases, some affecting 3500 or fewer patients in the United States. Due to clinicians' limited experience with such diseases and the heterogeneity of clinical presentations, ~70% of individuals seeking a diagnosis remain undiagnosed. Deep learning has demonstrated success in aiding the diagnosis of common diseases. However, existing approaches require labeled datasets with thousands of diagnosed patients per disease. We present SHEPHERD, a few-shot learning approach for multi-faceted rare disease diagnosis. SHEPHERD performs deep learning over a knowledge graph enriched with rare disease information and is trained on a dataset of simulated rare disease patients. We demonstrate SHEPHERD's effectiveness across diverse diagnostic tasks, performing causal gene discovery, retrieving "patients-like-me", and characterizing novel disease presentations, using real-world cohorts from the Undiagnosed Diseases Network (N = 465), MyGene2 (N = 146), and the Deciphering Developmental Disorders study (N = 1431). SHEPHERD demonstrates the potential of knowledge-grounded deep learning to accelerate rare disease diagnosis.

摘要

有超过7000种罕见疾病,其中一些在美国影响的患者人数为3500人或更少。由于临床医生对这些疾病的经验有限,且临床表现存在异质性,约70%寻求诊断的个体仍未得到诊断。深度学习已在辅助常见疾病诊断方面取得成功。然而,现有方法需要每个疾病有数千名已确诊患者的标记数据集。我们提出了SHEPHERD,一种用于多方面罕见疾病诊断的少样本学习方法。SHEPHERD在富含罕见疾病信息的知识图谱上进行深度学习,并在模拟罕见疾病患者的数据集上进行训练。我们使用来自未确诊疾病网络(N = 465)、MyGene2(N = 146)和发育障碍解读研究(N = 1431)的真实队列,展示了SHEPHERD在各种诊断任务中的有效性,包括因果基因发现、检索“像我这样的患者”以及表征新的疾病表现。SHEPHERD证明了基于知识的深度学习在加速罕见疾病诊断方面的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4d8/12181314/9bda64fb860f/41746_2025_1749_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4d8/12181314/b0cb5ddbbae4/41746_2025_1749_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4d8/12181314/c742464c95a2/41746_2025_1749_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4d8/12181314/320412a426d6/41746_2025_1749_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4d8/12181314/2d99a9db1028/41746_2025_1749_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4d8/12181314/08f0be9fca55/41746_2025_1749_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4d8/12181314/9bda64fb860f/41746_2025_1749_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4d8/12181314/b0cb5ddbbae4/41746_2025_1749_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4d8/12181314/c742464c95a2/41746_2025_1749_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4d8/12181314/320412a426d6/41746_2025_1749_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4d8/12181314/2d99a9db1028/41746_2025_1749_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4d8/12181314/08f0be9fca55/41746_2025_1749_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b4d8/12181314/9bda64fb860f/41746_2025_1749_Fig6_HTML.jpg

相似文献

1
Few shot learning for phenotype-driven diagnosis of patients with rare genetic diseases.用于罕见遗传病患者表型驱动诊断的少样本学习。
NPJ Digit Med. 2025 Jun 20;8(1):380. doi: 10.1038/s41746-025-01749-1.
2
Ablative and non-surgical therapies for early and very early hepatocellular carcinoma: a systematic review and network meta-analysis.早期和极早期肝细胞癌的消融和非手术治疗:系统评价和网络荟萃分析。
Health Technol Assess. 2023 Dec;27(29):1-172. doi: 10.3310/GK5221.
3
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of topotecan for ovarian cancer.拓扑替康治疗卵巢癌的临床有效性和成本效益的快速系统评价。
Health Technol Assess. 2001;5(28):1-110. doi: 10.3310/hta5280.
4
A systematic review of speech, language and communication interventions for children with Down syndrome from 0 to 6 years.对0至6岁唐氏综合征儿童言语、语言和沟通干预措施的系统评价。
Int J Lang Commun Disord. 2022 Mar;57(2):441-463. doi: 10.1111/1460-6984.12699. Epub 2022 Feb 22.
5
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
6
NIH Consensus Statement on Management of Hepatitis C: 2002.美国国立卫生研究院关于丙型肝炎管理的共识声明:2002年。
NIH Consens State Sci Statements. 2002;19(3):1-46.
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
8
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
9
Effectiveness and safety of vitamin D in relation to bone health.维生素D对骨骼健康的有效性与安全性。
Evid Rep Technol Assess (Full Rep). 2007 Aug(158):1-235.
10
Interventions for central serous chorioretinopathy: a network meta-analysis.中心性浆液性脉络膜视网膜病变的干预措施:一项网状Meta分析
Cochrane Database Syst Rev. 2025 Jun 16;6(6):CD011841. doi: 10.1002/14651858.CD011841.pub3.

引用本文的文献

1
Biomni: A General-Purpose Biomedical AI Agent.Biomni:一种通用的生物医学人工智能代理。
bioRxiv. 2025 Jun 2:2025.05.30.656746. doi: 10.1101/2025.05.30.656746.

本文引用的文献

1
A foundation model for clinician-centered drug repurposing.一种以临床医生为中心的药物重新利用基础模型。
Nat Med. 2024 Dec;30(12):3601-3613. doi: 10.1038/s41591-024-03233-x. Epub 2024 Sep 25.
2
Assessing the utility of large language models for phenotype-driven gene prioritization in the diagnosis of rare genetic disease.评估大型语言模型在罕见遗传疾病诊断中基于表型的基因优先级排序中的效用。
Am J Hum Genet. 2024 Oct 3;111(10):2190-2202. doi: 10.1016/j.ajhg.2024.08.010. Epub 2024 Sep 9.
3
Evaluating large language models on medical, lay-language, and self-reported descriptions of genetic conditions.
评估大型语言模型在医学、非专业语言和遗传状况的自我报告描述方面的表现。
Am J Hum Genet. 2024 Sep 5;111(9):1819-1833. doi: 10.1016/j.ajhg.2024.07.011. Epub 2024 Aug 14.
4
Outpatient reception via collaboration between nurses and a large language model: a randomized controlled trial.护士与大语言模型合作的门诊接待:一项随机对照试验。
Nat Med. 2024 Oct;30(10):2878-2885. doi: 10.1038/s41591-024-03148-7. Epub 2024 Jul 15.
5
AI-MARRVEL - A Knowledge-Driven AI System for Diagnosing Mendelian Disorders.AI-MARRVEL——一种用于诊断孟德尔疾病的知识驱动型人工智能系统。
NEJM AI. 2024 May;1(5). doi: 10.1056/aioa2300009. Epub 2024 Apr 25.
6
Adapted large language models can outperform medical experts in clinical text summarization.经过改编的大型语言模型在临床文本总结方面的表现优于医学专家。
Nat Med. 2024 Apr;30(4):1134-1142. doi: 10.1038/s41591-024-02855-5. Epub 2024 Feb 27.
7
Simulation of undiagnosed patients with novel genetic conditions.模拟患有新型遗传疾病的未确诊患者。
Nat Commun. 2023 Oct 12;14(1):6403. doi: 10.1038/s41467-023-41980-6.
8
Generating synthetic mixed-type longitudinal electronic health records for artificial intelligent applications.为人工智能应用生成合成混合型纵向电子健康记录。
NPJ Digit Med. 2023 May 27;6(1):98. doi: 10.1038/s41746-023-00834-7.
9
Building a knowledge graph to enable precision medicine.构建知识图谱以实现精准医学。
Sci Data. 2023 Feb 2;10(1):67. doi: 10.1038/s41597-023-01960-3.
10
Evaluation of a whole-exome sequencing pipeline and benchmarking of causal germline variant prioritizers.全外显子测序流程评估及致病种系变异优先级排序工具的基准测试
Hum Mutat. 2022 Dec;43(12):2010-2020. doi: 10.1002/humu.24459. Epub 2022 Sep 12.