• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于对电子健康记录中的纵向生存数据进行聚类的深度表示学习

Deep representation learning for clustering longitudinal survival data from electronic health records.

作者信息

Qiu Jiajun, Hu Yao, Li Li, Erzurumluoglu Abdullah Mesut, Braenne Ingrid, Whitehurst Charles, Schmitz Jochen, Arora Jatin, Bartholdy Boris Alexander, Gandhi Shrey, Khoueiry Pierre, Mueller Stefanie, Noyvert Boris, Ding Zhihao, Jensen Jan Nygaard, de Jong Johann

机构信息

Global Computational Biology and Digital Sciences, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach an der Riβ, Germany.

Immunology & Respiratory Diseases, Boehringer-Ingelheim, Ridgefield, CT, USA.

出版信息

Nat Commun. 2025 Mar 14;16(1):2534. doi: 10.1038/s41467-025-56625-z.

DOI:10.1038/s41467-025-56625-z
PMID:40087274
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11909183/
Abstract

Precision medicine requires accurate identification of clinically relevant patient subgroups. Electronic health records provide major opportunities for leveraging machine learning approaches to uncover novel patient subgroups. However, many existing approaches fail to adequately capture complex interactions between diagnosis trajectories and disease-relevant risk events, leading to subgroups that can still display great heterogeneity in event risk and underlying molecular mechanisms. To address this challenge, we implemented VaDeSC-EHR, a transformer-based variational autoencoder for clustering longitudinal survival data as extracted from electronic health records. We show that VaDeSC-EHR outperforms baseline methods on both synthetic and real-world benchmark datasets with known ground-truth cluster labels. In an application to Crohn's disease, VaDeSC-EHR successfully identifies four distinct subgroups with divergent diagnosis trajectories and risk profiles, revealing clinically and genetically relevant factors in Crohn's disease. Our results show that VaDeSC-EHR can be a powerful tool for discovering novel patient subgroups in the development of precision medicine approaches.

摘要

精准医学需要准确识别临床上相关的患者亚组。电子健康记录为利用机器学习方法发现新的患者亚组提供了重大机遇。然而,许多现有方法未能充分捕捉诊断轨迹与疾病相关风险事件之间的复杂相互作用,导致亚组在事件风险和潜在分子机制方面仍表现出很大的异质性。为应对这一挑战,我们实施了VaDeSC-EHR,这是一种基于Transformer的变分自编码器,用于对从电子健康记录中提取的纵向生存数据进行聚类。我们表明,在具有已知真实聚类标签的合成和真实世界基准数据集上,VaDeSC-EHR均优于基线方法。在克罗恩病的应用中,VaDeSC-EHR成功识别出四个具有不同诊断轨迹和风险特征的不同亚组,揭示了克罗恩病中临床和遗传相关因素。我们的结果表明,VaDeSC-EHR可以成为在精准医学方法开发中发现新的患者亚组的有力工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5164/11909183/a67aad24ac77/41467_2025_56625_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5164/11909183/08866d79d982/41467_2025_56625_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5164/11909183/ffb846395b44/41467_2025_56625_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5164/11909183/e5bf98d308c8/41467_2025_56625_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5164/11909183/5ac8506d83e9/41467_2025_56625_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5164/11909183/ec8b5bfec831/41467_2025_56625_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5164/11909183/a67aad24ac77/41467_2025_56625_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5164/11909183/08866d79d982/41467_2025_56625_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5164/11909183/ffb846395b44/41467_2025_56625_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5164/11909183/e5bf98d308c8/41467_2025_56625_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5164/11909183/5ac8506d83e9/41467_2025_56625_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5164/11909183/ec8b5bfec831/41467_2025_56625_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5164/11909183/a67aad24ac77/41467_2025_56625_Fig6_HTML.jpg

相似文献

1
Deep representation learning for clustering longitudinal survival data from electronic health records.用于对电子健康记录中的纵向生存数据进行聚类的深度表示学习
Nat Commun. 2025 Mar 14;16(1):2534. doi: 10.1038/s41467-025-56625-z.
2
Deep learning for clustering of multivariate clinical patient trajectories with missing values.深度学习在缺失值的多变量临床患者轨迹聚类中的应用。
Gigascience. 2019 Nov 1;8(11). doi: 10.1093/gigascience/giz134.
3
DySurv: dynamic deep learning model for survival analysis with conditional variational inference.DySurv:用于生存分析的动态深度学习模型及条件变分推理
J Am Med Inform Assoc. 2024 Nov 21. doi: 10.1093/jamia/ocae271.
4
Semi-supervised Double Deep Learning Temporal Risk Prediction (SeDDLeR) with Electronic Health Records.基于电子健康记录的半监督双深度学习时间风险预测(SeDDLeR)
J Biomed Inform. 2024 Sep;157:104685. doi: 10.1016/j.jbi.2024.104685. Epub 2024 Jul 14.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
Deep representation learning of patient data from Electronic Health Records (EHR): A systematic review.电子健康记录(EHR)中患者数据的深度表征学习:一项系统综述。
J Biomed Inform. 2021 Mar;115:103671. doi: 10.1016/j.jbi.2020.103671. Epub 2020 Dec 31.
7
DeepMPM: a mortality risk prediction model using longitudinal EHR data.DeepMPM:基于纵向电子健康记录数据的死亡率风险预测模型。
BMC Bioinformatics. 2022 Oct 14;23(1):423. doi: 10.1186/s12859-022-04975-6.
8
Applying interpretable deep learning models to identify chronic cough patients using EHR data.应用可解释的深度学习模型,利用电子病历数据识别慢性咳嗽患者。
Comput Methods Programs Biomed. 2021 Oct;210:106395. doi: 10.1016/j.cmpb.2021.106395. Epub 2021 Sep 4.
9
Automated feature selection of predictors in electronic medical records data.电子病历数据中预测指标的自动特征选择
Biometrics. 2019 Mar;75(1):268-277. doi: 10.1111/biom.12987. Epub 2019 Apr 2.
10
Identification of heart failure subtypes using transformer-based deep learning modelling: a population-based study of 379,108 individuals.使用基于Transformer的深度学习模型识别心力衰竭亚型:一项基于379,108人的人群研究。
EBioMedicine. 2025 Apr;114:105657. doi: 10.1016/j.ebiom.2025.105657. Epub 2025 Mar 19.

引用本文的文献

1
Joint models in big data: simulation-based guidelines for required data quality in longitudinal electronic health records.大数据中的联合模型:基于模拟的纵向电子健康记录所需数据质量指南。
BioData Min. 2025 May 13;18(1):35. doi: 10.1186/s13040-025-00450-z.

本文引用的文献

1
Deep learning-based patient stratification for prognostic enrichment of clinical dementia trials.基于深度学习的患者分层,用于临床痴呆症试验的预后富集。
Brain Commun. 2024 Dec 16;6(6):fcae445. doi: 10.1093/braincomms/fcae445. eCollection 2024.
2
Large Language Model Capabilities in Perioperative Risk Prediction and Prognostication.大语言模型在围手术期风险预测和预后中的应用。
JAMA Surg. 2024 Aug 1;159(8):928-937. doi: 10.1001/jamasurg.2024.1621.
3
Rare coding variants in CHRNB2 reduce the likelihood of smoking.CHRNB2 中的罕见编码变异可降低吸烟的可能性。
Nat Genet. 2023 Jul;55(7):1138-1148. doi: 10.1038/s41588-023-01417-8. Epub 2023 Jun 12.
4
A deep learning algorithm to predict risk of pancreatic cancer from disease trajectories.一种基于深度学习算法的胰腺癌风险预测方法。
Nat Med. 2023 May;29(5):1113-1122. doi: 10.1038/s41591-023-02332-5. Epub 2023 May 8.
5
PRSet: Pathway-based polygenic risk score analyses and software.PRSet:基于通路的多基因风险评分分析及软件。
PLoS Genet. 2023 Feb 7;19(2):e1010624. doi: 10.1371/journal.pgen.1010624. eCollection 2023 Feb.
6
Role of pulmonary function testing in inflammatory bowel diseases (Review).肺功能测试在炎症性肠病中的作用(综述)
Med Int (Lond). 2022 Aug 2;2(4):25. doi: 10.3892/mi.2022.50. eCollection 2022 Jul-Aug.
7
Mapping of UK Biobank clinical codes: Challenges and possible solutions.英国生物银行临床代码映射:挑战与可能的解决方案。
PLoS One. 2022 Dec 16;17(12):e0275816. doi: 10.1371/journal.pone.0275816. eCollection 2022.
8
The next-generation Open Targets Platform: reimagined, redesigned, rebuilt.下一代开放靶点平台:重新构想、重新设计、重新构建。
Nucleic Acids Res. 2023 Jan 6;51(D1):D1353-D1359. doi: 10.1093/nar/gkac1046.
9
Micronutrients and Their Role in Inflammatory Bowel Disease: Function, Assessment, Supplementation, and Impact on Clinical Outcomes Including Muscle Health.微量营养素及其在炎症性肠病中的作用:功能、评估、补充以及对包括肌肉健康在内的临床结局的影响
Inflamm Bowel Dis. 2023 Mar 1;29(3):487-501. doi: 10.1093/ibd/izac223.
10
Transforming and evaluating the UK Biobank to the OMOP Common Data Model for COVID-19 research and beyond.将英国生物银行转化并评估为 OMOP 通用数据模型,用于 COVID-19 研究及其他领域。
J Am Med Inform Assoc. 2022 Dec 13;30(1):103-111. doi: 10.1093/jamia/ocac203.