Sequential Data-Based Patient Similarity Framework for Patient Outcome Prediction: Algorithm Development.

Affiliations

School of Biomedical Engineering, Capital Medical University, Beijing, China.

Beijing Advanced Innovation Center for Big Data-based Precision Medicine, Capital Medical University, Beijing, China.

Publication information

J Med Internet Res. 2022 Jan 6;24(1):e30720. doi: 10.2196/30720.

DOI:10.2196/30720

PMID:34989682

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8778569/

Abstract

BACKGROUND

Sequential information in electronic medical records is valuable and helpful for patient outcome prediction but is rarely used for patient similarity measurement because of its unevenness, irregularity, and heterogeneity.

OBJECTIVE

We aimed to develop a patient similarity framework for patient outcome prediction that makes use of sequential and cross-sectional information in electronic medical record systems.

METHODS

Sequence similarity was calculated from timestamped event sequences using edit distance, and trend similarity was calculated from time series using dynamic time warping and Haar decomposition. We also extracted cross-sectional information, namely, demographic, laboratory test, and radiological report data, for additional similarity calculations. We validated the effectiveness of the framework by constructing k-nearest neighbors classifiers to predict mortality and readmission for acute myocardial infarction patients, using data from (1) a public data set and (2) a private data set, at 3 time points (at admission, on Day 7, and at discharge) to provide early warning of patient outcomes. For comparison, we also constructed state-of-the-art Euclidean-distance k-nearest neighbors, logistic regression, random forest, long short-term memory network, and recurrent neural network models.
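The three sequence measures named above can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the unit edit costs, the absolute-difference local cost in dynamic time warping, and the unnormalized (average and difference) form of the Haar step are all assumptions chosen for illustration, and the abstract does not say how the measures were parameterized or combined.

```python
from math import inf


def edit_distance(a, b):
    """Levenshtein distance between two event sequences (unit costs assumed)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # delete x
                            curr[j - 1] + 1,      # insert y
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def dtw_distance(s, t):
    """Classic dynamic time warping with |s_i - t_j| as the local cost (assumed)."""
    n, m = len(s), len(t)
    d = [[inf] * (m + 1) for _ in range(n + 1)]
    d[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(s[i - 1] - t[j - 1])
            d[i][j] = cost + min(d[i - 1][j], d[i][j - 1], d[i - 1][j - 1])
    return d[n][m]


def haar_step(series):
    """One level of an unnormalized Haar decomposition of an even-length series.

    Returns (approximation, detail): pairwise averages and half-differences.
    """
    pairs = list(zip(series[0::2], series[1::2]))
    approx = [(a + b) / 2 for a, b in pairs]
    detail = [(a - b) / 2 for a, b in pairs]
    return approx, detail


# Hypothetical inputs: event codes and lab-value series are made up.
events_a = ["admit", "ecg", "pci", "discharge"]
events_b = ["admit", "pci", "discharge"]
print(edit_distance(events_a, events_b))
print(dtw_distance([1.0, 2.0, 3.0], [1.0, 2.0, 2.0, 3.0]))
print(haar_step([2.0, 4.0, 6.0, 6.0]))
```

Edit distance tolerates event sequences of different lengths, and dynamic time warping aligns series sampled at different rates, which is presumably why such measures suit uneven and irregular records.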

RESULTS

With all available information during a hospitalization episode, predictive models using the similarity model outperformed the baseline models on both the public and private data sets. For mortality predictions, all models except for the logistic regression model showed improved performance over time. There were no such increasing trends in predictive performance for readmission predictions. The random forest and logistic regression models performed best for mortality and readmission predictions, respectively, when using information from the first week after admission.

CONCLUSIONS

For patient outcome predictions, the patient similarity framework facilitated sequential similarity calculations for uneven electronic medical record data and helped improve predictive performance.
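As a rough illustration of how a similarity measure can drive outcome prediction, the sketch below runs a k-nearest neighbors majority vote over an arbitrary patient distance function. The function name `knn_predict`, the toy one-dimensional patient representation, and the choice of k are hypothetical; the abstract does not describe how the framework weights or combines its similarity components.

```python
from collections import Counter


def knn_predict(query, patients, labels, distance, k=3):
    """Predict a label by majority vote among the k patients closest to `query`.

    `distance` is any callable returning a non-negative dissimilarity, so a
    sequence-based measure could be plugged in without changing this function.
    """
    ranked = sorted(range(len(patients)),
                    key=lambda i: distance(query, patients[i]))
    votes = Counter(labels[i] for i in ranked[:k])
    return votes.most_common(1)[0][0]


# Toy data (made up): each patient is a single numeric feature,
# and label 1 stands for the adverse outcome.
patients = [1.0, 1.2, 5.0, 5.5, 6.0]
labels = [0, 0, 1, 1, 1]


def toy_distance(a, b):
    """Placeholder dissimilarity; stands in for a patient similarity measure."""
    return abs(a - b)


print(knn_predict(5.2, patients, labels, toy_distance, k=3))
print(knn_predict(1.1, patients, labels, toy_distance, k=3))
```

Keeping the distance as a parameter means the classifier itself stays unchanged when the similarity definition is swapped, for example from Euclidean distance to a sequence-based measure.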

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8029/8778569/ae2701784cbe/jmir_v24i1e30720_fig1.jpg
