• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MULTIPAR:用于计算表型分析的多任务学习监督不规则张量分解

MULTIPAR: Supervised Irregular Tensor Factorization with Multi-task Learning for Computational Phenotyping.

作者信息

Ren Yifei, Lou Jian, Xiong Li, Ho Joyce C, Jiang Xiaoqian, Bhavani Sivasubramanium Venkatraman

机构信息

Emory University, United States.

Zhejiang University, China.

出版信息

Proc Mach Learn Res. 2023 Dec;225:498-511.

PMID:39624658
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11611252/
Abstract

Tensor factorization has received increasing interest due to its intrinsic ability to capture latent factors in multi-dimensional data with many applications including Electronic Health Records (EHR) mining. PARAFAC2 and its variants have been proposed to address irregular tensors where one of the tensor modes is not aligned, e.g., different patients in EHRs may have different length of records. PARAFAC2 has been successfully applied to EHRs for extracting meaningful medical concepts (phenotypes). Despite recent advancements, current models' predictability and interpretability are not satisfactory, which limits its utility for downstream analysis. In this paper, we propose MULTIPAR: a supervised irregular tensor factorization with multi-task learning for computational phenotyping. MULTIPAR is flexible to incorporate both static (e.g. in-hospital mortality prediction) and continuous or dynamic (e.g. the need for ventilation) tasks. By supervising the tensor factorization with downstream prediction tasks and leveraging information from multiple related predictive tasks, MULTIPAR can yield not only more meaningful phenotypes but also better predictive performance for downstream tasks. We conduct extensive experiments on two real-world temporal EHR datasets to demonstrate that MULTIPAR is scalable and achieves better tensor fit with more meaningful subgroups and stronger predictive performance compared to existing state-of-the-art methods. The implementation of MULTIPAR is available.

摘要

张量分解因其能够捕捉多维数据中的潜在因素而受到越来越多的关注,其应用广泛,包括电子健康记录(EHR)挖掘。PARAFAC2及其变体已被提出用于处理不规则张量,其中张量模式之一未对齐,例如,EHR中的不同患者可能有不同长度的记录。PARAFAC2已成功应用于EHR,以提取有意义的医学概念(表型)。尽管最近取得了进展,但当前模型的可预测性和可解释性并不令人满意,这限制了其在下游分析中的效用。在本文中,我们提出了MULTIPAR:一种用于计算表型分析的具有多任务学习的监督不规则张量分解方法。MULTIPAR可以灵活地纳入静态任务(例如院内死亡率预测)和连续或动态任务(例如通气需求)。通过用下游预测任务监督张量分解并利用来自多个相关预测任务的信息,MULTIPAR不仅可以产生更有意义的表型,还可以为下游任务提供更好的预测性能。我们在两个真实世界的时间EHR数据集上进行了广泛的实验,以证明MULTIPAR具有可扩展性,与现有的最先进方法相比,它能实现更好的张量拟合,产生更有意义的子组,并具有更强的预测性能。MULTIPAR的实现是可用的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4761/11611252/0a1539195539/nihms-2037251-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4761/11611252/106cbdf49658/nihms-2037251-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4761/11611252/de0cc4b3fa39/nihms-2037251-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4761/11611252/5aba2447a8e4/nihms-2037251-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4761/11611252/52d477fbf1e6/nihms-2037251-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4761/11611252/26649d8d3820/nihms-2037251-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4761/11611252/2e6558856141/nihms-2037251-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4761/11611252/0a1539195539/nihms-2037251-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4761/11611252/106cbdf49658/nihms-2037251-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4761/11611252/de0cc4b3fa39/nihms-2037251-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4761/11611252/5aba2447a8e4/nihms-2037251-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4761/11611252/52d477fbf1e6/nihms-2037251-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4761/11611252/26649d8d3820/nihms-2037251-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4761/11611252/2e6558856141/nihms-2037251-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4761/11611252/0a1539195539/nihms-2037251-f0006.jpg

相似文献

1
MULTIPAR: Supervised Irregular Tensor Factorization with Multi-task Learning for Computational Phenotyping.MULTIPAR:用于计算表型分析的多任务学习监督不规则张量分解
Proc Mach Learn Res. 2023 Dec;225:498-511.
2
Short-Term Memory Impairment短期记忆障碍
3
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.
4
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
5
Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理(2025年结石病专家共识)
Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.
6
Systemic Inflammatory Response Syndrome全身炎症反应综合征
7
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
8
Unraveling Complex Temporal Patterns in EHRs via Robust Irregular Tensor Factorization.通过稳健的不规则张量分解揭示电子健康记录中的复杂时间模式。
AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:451-460. eCollection 2025.
9
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
10
Sexual Harassment and Prevention Training性骚扰与预防培训

引用本文的文献

1
Integrative, high-resolution analysis of single-cell gene expression across experimental conditions with PARAFAC2-RISE.利用PARAFAC2-RISE对跨实验条件的单细胞基因表达进行综合、高分辨率分析。
Cell Syst. 2025 Jun 18;16(6):101294. doi: 10.1016/j.cels.2025.101294. Epub 2025 May 15.
2
Integrative, high-resolution analysis of single cell gene expression across experimental conditions with PARAFAC2-RISE.使用PARAFAC2-RISE对跨实验条件的单细胞基因表达进行综合、高分辨率分析。
bioRxiv. 2025 Mar 22:2024.07.29.605698. doi: 10.1101/2024.07.29.605698.

本文引用的文献

1
Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark.异构网络表示学习:一个包含综述与基准测试的统一框架
IEEE Trans Knowl Data Eng. 2022 Oct;34(10):4854-4873. doi: 10.1109/tkde.2020.3045924. Epub 2020 Dec 21.
2
Harmonization of Multi-Center Diffusion Tensor Tractography in Neonates with Congenital Heart Disease: Optimizing Post-Processing and Application of ComBat.先天性心脏病新生儿多中心扩散张量纤维束成像的一致性:优化ComBat后处理及应用
Neuroimage Rep. 2022 Sep;2(3). doi: 10.1016/j.ynirp.2022.100114. Epub 2022 Jun 20.
3
LogPar: Logistic PARAFAC2 Factorization for Temporal Binary Data with Missing Values.
LogPar:用于处理带有缺失值的时态二元数据的逻辑PARAFAC2分解
KDD. 2020 Aug;2020:1625-1635. doi: 10.1145/3394486.3403213.
4
TASTE: Temporal and Static Tensor Factorization for Phenotyping Electronic Health Records.TASTE:用于电子健康记录表型分析的时间和静态张量分解
Proc ACM Conf Health Inference Learn (2020). 2020 Apr;2020:193-203. doi: 10.1145/3368555.3384464.
5
Joint calibrationless reconstruction of highly undersampled multicontrast MR datasets using a low-rank Hankel tensor completion framework.基于低秩 Hankel 张量补全框架的超高密度多对比度磁共振数据集的无标联合重建。
Magn Reson Med. 2021 Jun;85(6):3256-3271. doi: 10.1002/mrm.28674. Epub 2021 Feb 3.
6
COPA: Constrained PARAFAC2 for Sparse & Large Datasets.COPA:用于稀疏和大型数据集的约束PARAFAC2
Proc ACM Int Conf Inf Knowl Manag. 2018 Oct;2018:793-802. doi: 10.1145/3269206.3271775.
7
CovidSens: a vision on reliable social sensing for COVID-19.CovidSens:关于COVID-19可靠社会感知的愿景。
Artif Intell Rev. 2021;54(1):1-25. doi: 10.1007/s10462-020-09852-3. Epub 2020 Jun 12.
8
Rubik: Knowledge Guided Tensor Factorization and Completion for Health Data Analytics.鲁比克:用于健康数据分析的知识引导张量分解与补全
KDD. 2015 Aug;2015:1265-1274. doi: 10.1145/2783258.2783395.
9
Multitask learning and benchmarking with clinical time series data.多任务学习与临床时间序列数据的基准测试。
Sci Data. 2019 Jun 17;6(1):96. doi: 10.1038/s41597-019-0103-9.
10
The eICU Collaborative Research Database, a freely available multi-center database for critical care research.eICU 协作研究数据库,一个免费的多中心重症监护研究数据库。
Sci Data. 2018 Sep 11;5:180178. doi: 10.1038/sdata.2018.178.