• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在高维异质协变量和事件时间结局框架下,早期再入院预测方法的比较。

Comparison of methods for early-readmission prediction in a high-dimensional heterogeneous covariates and time-to-event outcome framework.

机构信息

Laboratoire de Probabilités Statistique et Modélisation (LPSM), UMR 8001, Sorbonne University, 4 Place Jussieu, Paris, 75005, France.

Assistance Publique-Hôpitaux de Paris, Biomedical Informatics and Public Health Department, European Georges Pompidou Hospital, 20 Rue Leblanc, Paris, 75015, France.

出版信息

BMC Med Res Methodol. 2019 Mar 6;19(1):50. doi: 10.1186/s12874-019-0673-4.

DOI:10.1186/s12874-019-0673-4
PMID:30841867
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6404305/
Abstract

BACKGROUND

Choosing the most performing method in terms of outcome prediction or variables selection is a recurring problem in prognosis studies, leading to many publications on methods comparison. But some aspects have received little attention. First, most comparison studies treat prediction performance and variable selection aspects separately. Second, methods are either compared within a binary outcome setting (where we want to predict whether the readmission will occur within an arbitrarily chosen delay or not) or within a survival analysis setting (where the outcomes are directly the censored times), but not both. In this paper, we propose a comparison methodology to weight up those different settings both in terms of prediction and variables selection, while incorporating advanced machine learning strategies.

METHODS

Using a high-dimensional case study on a sickle-cell disease (SCD) cohort, we compare 8 statistical methods. In the binary outcome setting, we consider logistic regression (LR), support vector machine (SVM), random forest (RF), gradient boosting (GB) and neural network (NN); while on the survival analysis setting, we consider the Cox Proportional Hazards (PH), the CURE and the C-mix models. We also propose a method using Gaussian Processes to extract meaningfull structured covariates from longitudinal data.

RESULTS

Among all assessed statistical methods, the survival analysis ones obtain the best results. In particular the C-mix model yields the better performances in both the two considered settings (AUC =0.94 in the binary outcome setting), as well as interesting interpretation aspects. There is some consistency in selected covariates across methods within a setting, but not much across the two settings.

CONCLUSIONS

It appears that learning withing the survival analysis setting first (so using all the temporal information), and then going back to a binary prediction using the survival estimates gives significantly better prediction performances than the ones obtained by models trained "directly" within the binary outcome setting.

摘要

背景

在预后研究中,选择在预测结果或变量选择方面表现最优的方法是一个常见的问题,这导致了许多关于方法比较的文献。但有些方面却很少受到关注。首先,大多数比较研究将预测性能和变量选择方面分开来处理。其次,方法要么在二元结局设置中进行比较(我们希望预测再入院是否会在任意选择的延迟内发生),要么在生存分析设置中进行比较(结局直接是删失时间),但不是两者都比较。在本文中,我们提出了一种比较方法,可以综合考虑预测和变量选择方面的不同设置,同时结合先进的机器学习策略。

方法

我们使用镰状细胞病(SCD)队列的高维案例研究来比较 8 种统计方法。在二元结局设置中,我们考虑逻辑回归(LR)、支持向量机(SVM)、随机森林(RF)、梯度提升(GB)和神经网络(NN);而在生存分析设置中,我们考虑 Cox 比例风险(PH)、CURE 和 C-mix 模型。我们还提出了一种使用高斯过程从纵向数据中提取有意义的结构化协变量的方法。

结果

在所有评估的统计方法中,生存分析方法的表现最好。特别是 C-mix 模型在两个考虑的设置中都取得了更好的结果(二元结局设置中的 AUC =0.94),同时也具有有趣的解释方面。在一个设置中,不同方法之间的协变量选择存在一定的一致性,但在两个设置之间则不太一致。

结论

似乎首先在生存分析设置中进行学习(即使用所有时间信息),然后使用生存估计值回到二元预测,可以显著提高预测性能,优于直接在二元结局设置中训练的模型获得的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe15/6404305/bfdf8cd4994e/12874_2019_673_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe15/6404305/d68eb76396b6/12874_2019_673_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe15/6404305/269ba2edb125/12874_2019_673_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe15/6404305/225aafd69b05/12874_2019_673_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe15/6404305/a11eadfab936/12874_2019_673_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe15/6404305/e0c4eff46935/12874_2019_673_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe15/6404305/bfdf8cd4994e/12874_2019_673_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe15/6404305/d68eb76396b6/12874_2019_673_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe15/6404305/269ba2edb125/12874_2019_673_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe15/6404305/225aafd69b05/12874_2019_673_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe15/6404305/a11eadfab936/12874_2019_673_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe15/6404305/e0c4eff46935/12874_2019_673_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe15/6404305/bfdf8cd4994e/12874_2019_673_Fig6_HTML.jpg

相似文献

1
Comparison of methods for early-readmission prediction in a high-dimensional heterogeneous covariates and time-to-event outcome framework.在高维异质协变量和事件时间结局框架下,早期再入院预测方法的比较。
BMC Med Res Methodol. 2019 Mar 6;19(1):50. doi: 10.1186/s12874-019-0673-4.
2
A comparative study of forest methods for time-to-event data: variable selection and predictive performance.森林方法在生存时间数据中的比较研究:变量选择和预测性能。
BMC Med Res Methodol. 2021 Sep 25;21(1):193. doi: 10.1186/s12874-021-01386-8.
3
Analysis of Machine Learning Techniques for Heart Failure Readmissions.心力衰竭再入院的机器学习技术分析
Circ Cardiovasc Qual Outcomes. 2016 Nov;9(6):629-640. doi: 10.1161/CIRCOUTCOMES.116.003039. Epub 2016 Nov 8.
4
Outcome prediction of intracranial aneurysm treatment by flow diverters using machine learning.使用机器学习对颅内动脉瘤血管内治疗的结果预测。
Neurosurg Focus. 2018 Nov 1;45(5):E7. doi: 10.3171/2018.8.FOCUS18332.
5
Comparison of Unplanned 30-Day Readmission Prediction Models, Based on Hospital Warehouse and Demographic Data.基于医院仓库和人口统计学数据的非计划30天再入院预测模型比较
Stud Health Technol Inform. 2020 Jun 16;270:547-551. doi: 10.3233/SHTI200220.
6
Predictive models for hospital readmission risk: A systematic review of methods.预测医院再入院风险的模型:方法的系统评价。
Comput Methods Programs Biomed. 2018 Oct;164:49-64. doi: 10.1016/j.cmpb.2018.06.006. Epub 2018 Jun 28.
7
A hospital wide predictive model for unplanned readmission using hierarchical ICD data.基于 ICD 数据的层级结构的全院范围预测性模型,用于预测非计划性再入院。
Comput Methods Programs Biomed. 2019 May;173:177-183. doi: 10.1016/j.cmpb.2019.02.007. Epub 2019 Feb 13.
8
Predicting readmission risk with institution-specific prediction models.使用特定机构的预测模型预测再入院风险。
Artif Intell Med. 2015 Oct;65(2):89-96. doi: 10.1016/j.artmed.2015.08.005. Epub 2015 Aug 22.
9
Predicting 30-day Hospital Readmission with Publicly Available Administrative Database. A Conditional Logistic Regression Modeling Approach.利用公开可用的行政数据库预测30天再入院情况。一种条件逻辑回归建模方法。
Methods Inf Med. 2015;54(6):560-7. doi: 10.3414/ME14-02-0017. Epub 2015 Nov 9.
10
A comparative evaluation of the generalised predictive ability of eight machine learning algorithms across ten clinical metabolomics data sets for binary classification.八种机器学习算法在十个临床代谢组学数据集上进行二进制分类的广义预测能力的比较评估。
Metabolomics. 2019 Nov 15;15(12):150. doi: 10.1007/s11306-019-1612-4.

引用本文的文献

1
Survival models and longitudinal medical events for hospital readmission forecasting.用于预测医院再入院的生存模型和纵向医疗事件。
BMC Health Serv Res. 2024 Nov 13;24(1):1394. doi: 10.1186/s12913-024-11771-w.
2
Development and validation of a prospective study to predict the risk of readmission within 365 days of respiratory failure: based on a random survival forest algorithm combined with COX regression modeling.基于随机生存森林算法与 COX 回归模型相结合的方法,开发和验证一项预测呼吸衰竭后 365 天内再入院风险的前瞻性研究。
BMC Pulm Med. 2024 Feb 14;24(1):82. doi: 10.1186/s12890-024-02862-9.
3
Prognostic analysis and risk assessment based on RNA editing in hepatocellular carcinoma.

本文引用的文献

1
C-mix: A high-dimensional mixture model for censored durations, with applications to genetic data.C-mix:一种用于删失持续时间的高维混合模型,及其在遗传数据中的应用。
Stat Methods Med Res. 2019 May;28(5):1523-1539. doi: 10.1177/0962280218766389. Epub 2018 Apr 15.
2
Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent.通过坐标下降法求解Cox比例风险模型的正则化路径
J Stat Softw. 2011 Mar;39(5):1-13. doi: 10.18637/jss.v039.i05.
3
Reviewing 741 patients records in two hours with FASTVISU.使用FASTVISU在两小时内查看741份患者记录。
基于 RNA 编辑的肝细胞癌预后分析与风险评估。
J Appl Genet. 2024 Sep;65(3):519-530. doi: 10.1007/s13353-023-00819-y. Epub 2024 Jan 13.
4
Leveraging deep survival models to predict quality of care risk in diverse hospital readmissions.利用深度生存模型预测不同医院再入院的护理质量风险。
Sci Rep. 2023 Jun 28;13(1):10479. doi: 10.1038/s41598-023-37477-3.
5
Prediction of Hospital Readmission from Longitudinal Mobile Data Streams.基于纵向移动数据流预测医院再入院情况。
Sensors (Basel). 2021 Nov 12;21(22):7510. doi: 10.3390/s21227510.
6
Cohort and Trajectory Analysis in Multi-Agent Support Systems for Cancer Survivors.队列和轨迹分析在癌症幸存者的多代理支持系统中。
J Med Syst. 2021 Nov 11;45(12):109. doi: 10.1007/s10916-021-01770-3.
AMIA Annu Symp Proc. 2015 Nov 5;2015:553-9. eCollection 2015.
4
Comparison of predictive modeling approaches for 30-day all-cause non-elective readmission risk.30天全因非择期再入院风险预测建模方法的比较
BMC Med Res Methodol. 2016 Feb 27;16:26. doi: 10.1186/s12874-016-0128-0.
5
Assessment of performance of survival prediction models for cancer prognosis.癌症预后生存预测模型性能评估。
BMC Med Res Methodol. 2012 Jul 23;12:102. doi: 10.1186/1471-2288-12-102.
6
Artificial neural networks versus proportional hazards Cox models to predict 45-year all-cause mortality in the Italian Rural Areas of the Seven Countries Study.人工神经网络与比例风险 Cox 模型预测 7 国农村地区研究中 45 年全因死亡率。
BMC Med Res Methodol. 2012 Jul 23;12:100. doi: 10.1186/1471-2288-12-100.
7
Hospital readmissions and the Affordable Care Act: paying for coordinated quality care.医院再入院与《平价医疗法案》:为协调的优质护理付费。
JAMA. 2011 Oct 26;306(16):1794-5. doi: 10.1001/jama.2011.1561.
8
Relationship between patient satisfaction with inpatient care and hospital readmission within 30 days.患者对住院护理的满意度与 30 天内再次住院之间的关系。
Am J Manag Care. 2011 Jan;17(1):41-8.
9
Methodology of integration of a clinical data warehouse with a clinical information system: the HEGP case.临床数据仓库与临床信息系统集成的方法:巴黎公共救助中心案例
Stud Health Technol Inform. 2010;160(Pt 1):193-7.
10
Acute care utilization and rehospitalizations for sickle cell disease.镰状细胞病的急性护理利用和再住院情况。
JAMA. 2010 Apr 7;303(13):1288-94. doi: 10.1001/jama.2010.378.