基于机器学习的老年人群死亡率风险评分预测。

Mortality risk score prediction in an elderly population using machine learning.

机构信息

Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD 21205, USA.

出版信息

Am J Epidemiol. 2013 Mar 1;177(5):443-52. doi: 10.1093/aje/kws241. Epub 2013 Jan 29.

Abstract

Standard practice for prediction often relies on parametric regression methods. Interesting new methods from the machine learning literature have been introduced in epidemiologic studies, such as random forest and neural networks. However, a priori, an investigator will not know which algorithm to select and may wish to try several. Here I apply the super learner, an ensembling machine learning approach that combines multiple algorithms into a single algorithm and returns a prediction function with the best cross-validated mean squared error. Super learning is a generalization of stacking methods. I used super learning in the Study of Physical Performance and Age-Related Changes in Sonomans (SPPARCS) to predict death among 2,066 residents of Sonoma, California, aged 54 years or more during the period 1993-1999. The super learner for predicting death (risk score) improved upon all single algorithms in the collection of algorithms, although its performance was similar to that of several algorithms. Super learner outperformed the worst algorithm (neural networks) by 44% with respect to estimated cross-validated mean squared error and had an R2 value of 0.201. The improvement of super learner over random forest with respect to R2 was approximately 2-fold. Alternatives for risk score prediction include the super learner, which can provide improved performance.

摘要

标准预测方法通常依赖于参数回归方法。来自机器学习文献中的一些有趣的新方法已在流行病学研究中得到应用，例如随机森林和神经网络。然而，研究人员事先并不知道应该选择哪种算法，可能希望尝试几种。在这里，我应用了超级学习者，这是一种集成机器学习方法，它将多种算法组合成一个单一的算法，并返回一个具有最佳交叉验证均方误差的预测函数。超级学习是堆叠方法的推广。我在物理性能和 Sonomans 年龄相关变化研究（SPPARCS）中使用超级学习者来预测加利福尼亚州 Sonoma 的 2066 名 54 岁及以上居民在 1993-1999 年期间的死亡情况。用于预测死亡（风险评分）的超级学习者在所有算法集合中都优于所有单个算法，尽管它的性能与几个算法相似。超级学习者在估计的交叉验证均方误差方面比最差算法（神经网络）高出 44%，R2 值为 0.201。超级学习者在 R2 方面相对于随机森林的改进约为 2 倍。风险评分预测的替代方法包括超级学习者，它可以提供更好的性能。

相似文献

Mortality risk score prediction in an elderly population using machine learning.基于机器学习的老年人群死亡率风险评分预测。

Am J Epidemiol. 2013 Mar 1;177(5):443-52. doi: 10.1093/aje/kws241. Epub 2013 Jan 29.

Predicting the outcome of patients with subarachnoid hemorrhage using machine learning techniques.使用机器学习技术预测蛛网膜下腔出血患者的预后。

IEEE Trans Inf Technol Biomed. 2009 Sep;13(5):794-801. doi: 10.1109/TITB.2009.2020434. Epub 2009 Apr 14.

Super learning: an application to the prediction of HIV-1 drug resistance.超级学习：在预测HIV-1耐药性方面的应用。

Stat Appl Genet Mol Biol. 2007;6:Article7. doi: 10.2202/1544-6115.1240. Epub 2007 Feb 23.

Simple point-of-care risk stratification in acute coronary syndromes: the AMIS model.急性冠状动脉综合征的简易床旁风险分层：AMIS模型

Heart. 2009 Apr;95(8):662-8. doi: 10.1136/hrt.2008.145904. Epub 2008 Dec 9.

Stacked generalization: an introduction to super learning.堆叠泛化：超级学习导论。

Eur J Epidemiol. 2018 May;33(5):459-464. doi: 10.1007/s10654-018-0390-z. Epub 2018 Apr 10.

Can Hyperparameter Tuning Improve the Performance of a Super Learner?: A Case Study.超参数调优能否提高超级学习者的性能？：一项案例研究。

Epidemiology. 2019 Jul;30(4):521-531. doi: 10.1097/EDE.0000000000001027.

Supervised machine learning algorithms for protein structure classification.用于蛋白质结构分类的监督式机器学习算法。

Comput Biol Chem. 2009 Jun;33(3):216-23. doi: 10.1016/j.compbiolchem.2009.04.004. Epub 2009 May 3.

The Balance Super Learner: A robust adaptation of the Super Learner to improve estimation of the average treatment effect in the treated based on propensity score matching.平衡超级学习者：超级学习者的稳健自适应方法，可提高基于倾向评分匹配的处理组平均处理效应估计的稳健性。

Stat Methods Med Res. 2018 Aug;27(8):2504-2518. doi: 10.1177/0962280216682055. Epub 2016 Dec 15.

Super learner.超级学习者。

Stat Appl Genet Mol Biol. 2007;6:Article25. doi: 10.2202/1544-6115.1309. Epub 2007 Sep 16.

A hybrid super ensemble learning model for the early-stage prediction of diabetes risk.一种用于糖尿病风险早期预测的混合超级集成学习模型。

Med Biol Eng Comput. 2023 Mar;61(3):785-797. doi: 10.1007/s11517-022-02749-z. Epub 2023 Jan 5.

引用本文的文献

Finding the Optimal Number of Splits and Repetitions in Double Cross-Fitting Targeted Maximum Likelihood Estimators.在双重交叉拟合目标最大似然估计器中寻找最优分割数和重复次数

Pharm Stat. 2025 Sep-Oct;24(5):e70022. doi: 10.1002/pst.70022.

Perspectives of family medicine residents on artificial intelligence for survival estimation in patients with serious illness.家庭医学住院医师对人工智能用于危重病患者生存预估的看法。

PLOS Digit Health. 2025 Jul 1;4(7):e0000917. doi: 10.1371/journal.pdig.0000917. eCollection 2025 Jul.

How Effective Are Machine Learning and Doubly Robust Estimators in Incorporating High-Dimensional Proxies to Reduce Residual Confounding?在纳入高维代理变量以减少残余混杂方面，机器学习和双重稳健估计器的效果如何？

Pharmacoepidemiol Drug Saf. 2025 May;34(5):e70155. doi: 10.1002/pds.70155.

Application of Machine Learning Ensemble Super Learner for analysis of the cytokines transported by high density lipoproteins (HDL) of smokers and nonsmokers.机器学习集成超级学习者在分析吸烟者和非吸烟者高密度脂蛋白（HDL）转运的细胞因子中的应用。

Proc (Int Conf Comput Sci Comput Intell). 2021 Dec;2021:370-375. doi: 10.1109/csci54926.2021.00133. Epub 2022 Jun 22.

Neural network based estimates of the climate impact on mortality in Germany: application to storyline climate simulations.基于神经网络的德国气候变化对死亡率影响的估算：在故事情节气候模拟中的应用。

Sci Rep. 2024 Oct 30;14(1):26074. doi: 10.1038/s41598-024-77398-3.

Predicting the risk of diabetes complications using machine learning and social administrative data in a country with ethnic inequities in health: Aotearoa New Zealand.利用机器学习和社会行政数据预测在一个存在健康不平等的国家中糖尿病并发症的风险：新西兰。

BMC Med Inform Decis Mak. 2024 Sep 27;24(1):274. doi: 10.1186/s12911-024-02678-x.

Drug Burden Index Is a Modifiable Predictor of 30-Day Hospitalization in Community-Dwelling Older Adults With Complex Care Needs: Machine Learning Analysis of InterRAI Data.药物负担指数是具有复杂护理需求的社区居住老年人 30 天住院的可修正预测指标：InterRAI 数据分析的机器学习

J Gerontol A Biol Sci Med Sci. 2024 Aug 1;79(8). doi: 10.1093/gerona/glae130.

Identifying dementia from cognitive footprints in hospital records among Chinese older adults: a machine-learning study.通过中国老年人医院记录中的认知足迹识别痴呆症：一项机器学习研究。

Lancet Reg Health West Pac. 2024 Apr 12;46:101060. doi: 10.1016/j.lanwpc.2024.101060. eCollection 2024 May.

A machine learning approach to predicting vascular calcification risk of type 2 diabetes: A retrospective study.机器学习方法预测 2 型糖尿病血管钙化风险：一项回顾性研究。

Clin Cardiol. 2024 Apr;47(4):e24264. doi: 10.1002/clc.24264.

Identifying low acuity Emergency Department visits with a machine learning approach: The low acuity visit algorithms (LAVA).使用机器学习方法识别低 acuity 急诊科就诊：低 acuity 就诊算法 (LAVA)。

Health Serv Res. 2024 Aug;59(4):e14305. doi: 10.1111/1475-6773.14305. Epub 2024 Mar 30.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于机器学习的老年人群死亡率风险评分预测。

Mortality risk score prediction in an elderly population using machine learning.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献