• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

应用随机森林模型预测脑卒中患者 90 天内的居家时间。

Using random forests to model 90-day hometime in people with stroke.

机构信息

Department of Clinical Neurosciences, Cumming School of Medicine, University of Calgary, 3330 Hospital Drive NW, Calgary, AB, T2N4N1, Canada.

ICES, Toronto, ON, Canada.

出版信息

BMC Med Res Methodol. 2021 May 10;21(1):102. doi: 10.1186/s12874-021-01289-8.

DOI:10.1186/s12874-021-01289-8
PMID:33971827
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8112132/
Abstract

BACKGROUND

Ninety-day hometime, the number of days a patient is living in the community in the first 90 after stroke, exhibits a non-normal bucket-shaped distribution, with lower and upper constraints making its analysis difficult. In this proof-of-concept study we evaluated the performance of random forests regression in the analysis of hometime.

METHODS

Using administrative data we identified stroke hospitalizations between 2010 and 2017 in Ontario, Canada. We used random forests regression to predict 90-day hometime using 15 covariates. Model accuracy was determined using the r-squared statistic. Variable importance in prediction and the marginal effects of each covariate were explored.

RESULTS

We identified 75,745 eligible patients. Median 90-day hometime was 59 days (Q1: 2, Q3: 83). Random forests predicted hometime with reasonable accuracy (adjusted r-squared 0.3462); no implausible values were predicted but extreme values were predicted with low accuracy. Frailty, stroke severity, and age exhibited inverse non-linear relationships with hometime and patients arriving by ambulance had less hometime than those who did not.

CONCLUSIONS

Random forests may be a useful method for analyzing 90-day hometime and capturing the complex non-linear relationships which exist between predictors and hometime. Future work should compare random forests to other models and focus on improving the accuracy of predictions of extreme values of hometime.

摘要

背景

90 天居家时间,即患者在中风后 90 天内居住在社区中的天数,呈非正态桶形分布,下限和上限的存在使其分析变得困难。在这项概念验证研究中,我们评估了随机森林回归在居家时间分析中的表现。

方法

我们利用行政数据,在加拿大安大略省识别了 2010 年至 2017 年的中风住院患者。我们使用随机森林回归,使用 15 个协变量来预测 90 天居家时间。使用 r 平方统计量来确定模型的准确性。探索了预测中的变量重要性和每个协变量的边际效应。

结果

我们确定了 75745 名符合条件的患者。中位数 90 天居家时间为 59 天(Q1:2,Q3:83)。随机森林对居家时间的预测具有合理的准确性(调整 r 平方 0.3462);没有预测到不合理的值,但对极端值的预测准确性较低。脆弱性、中风严重程度和年龄与居家时间呈负相关的非线性关系,乘坐救护车到达的患者比未乘坐救护车的患者居家时间短。

结论

随机森林可能是分析 90 天居家时间和捕捉预测因子与居家时间之间存在的复杂非线性关系的有用方法。未来的工作应该将随机森林与其他模型进行比较,并专注于提高居家时间极端值预测的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f1/8112132/11ca4ca9e371/12874_2021_1289_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f1/8112132/7210f31ae217/12874_2021_1289_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f1/8112132/d682b06bcde2/12874_2021_1289_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f1/8112132/9569f6d5c174/12874_2021_1289_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f1/8112132/e2135dd177f1/12874_2021_1289_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f1/8112132/cff81738d7af/12874_2021_1289_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f1/8112132/b4d1b35cb84b/12874_2021_1289_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f1/8112132/11ca4ca9e371/12874_2021_1289_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f1/8112132/7210f31ae217/12874_2021_1289_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f1/8112132/d682b06bcde2/12874_2021_1289_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f1/8112132/9569f6d5c174/12874_2021_1289_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f1/8112132/e2135dd177f1/12874_2021_1289_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f1/8112132/cff81738d7af/12874_2021_1289_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f1/8112132/b4d1b35cb84b/12874_2021_1289_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38f1/8112132/11ca4ca9e371/12874_2021_1289_Fig7_HTML.jpg

相似文献

1
Using random forests to model 90-day hometime in people with stroke.应用随机森林模型预测脑卒中患者 90 天内的居家时间。
BMC Med Res Methodol. 2021 May 10;21(1):102. doi: 10.1186/s12874-021-01289-8.
2
Comparing regression modeling strategies for predicting hometime.比较预测出院时间的回归建模策略。
BMC Med Res Methodol. 2021 Jul 7;21(1):138. doi: 10.1186/s12874-021-01331-9.
3
Deriving a Passive Surveillance Stroke Severity Indicator From Routinely Collected Administrative Data: The PaSSV Indicator.从常规收集的行政数据中推导被动监测卒中严重程度指标:PaSSV指标。
Circ Cardiovasc Qual Outcomes. 2020 Feb;13(2):e006269. doi: 10.1161/CIRCOUTCOMES.119.006269. Epub 2020 Feb 14.
4
Unbiased split variable selection for random survival forests using maximally selected rank statistics.使用最大选择秩统计量对随机生存森林进行无偏分裂变量选择。
Stat Med. 2017 Apr 15;36(8):1272-1284. doi: 10.1002/sim.7212. Epub 2017 Jan 15.
5
Variable importance-weighted Random Forests.可变重要性加权随机森林
Quant Biol. 2017 Dec;5(4):338-351. Epub 2017 Nov 6.
6
Improving random forest predictions in small datasets from two-phase sampling designs.改进两阶段抽样设计中小数据集的随机森林预测。
BMC Med Inform Decis Mak. 2021 Nov 22;21(1):322. doi: 10.1186/s12911-021-01688-3.
7
Evaluating Random Forests for Survival Analysis using Prediction Error Curves.使用预测误差曲线评估随机森林用于生存分析
J Stat Softw. 2012 Sep;50(11):1-23. doi: 10.18637/jss.v050.i11.
8
Application of random survival forests in understanding the determinants of under-five child mortality in Uganda in the presence of covariates that satisfy the proportional and non-proportional hazards assumption.在存在满足比例和非比例风险假设的协变量的情况下,随机生存森林在理解乌干达五岁以下儿童死亡率的决定因素中的应用。
BMC Res Notes. 2017 Sep 7;10(1):459. doi: 10.1186/s13104-017-2775-6.
9
A comparison of the conditional inference survival forest model to random survival forests based on a simulation study as well as on two applications with time-to-event data.基于模拟研究以及对两个事件发生时间数据应用的情况,对条件推断生存森林模型与随机生存森林进行比较。
BMC Med Res Methodol. 2017 Jul 28;17(1):115. doi: 10.1186/s12874-017-0383-8.
10
Do little interactions get lost in dark random forests?微小的相互作用会在黑暗的随机森林中消失吗?
BMC Bioinformatics. 2016 Mar 31;17:145. doi: 10.1186/s12859-016-0995-8.

引用本文的文献

1
GABP Promotes Mesangial Cell Proliferation and Renal Fibrosis Through GLI1 in Diabetic Nephropathy.在糖尿病肾病中,GABP通过GLI1促进系膜细胞增殖和肾纤维化。
Adv Sci (Weinh). 2025 Apr;12(15):e2407462. doi: 10.1002/advs.202407462. Epub 2025 Feb 22.
2
How individuals' opinions influence society's resistance to epidemics: an agent-based model approach.个体意见如何影响社会的防疫抵制:基于主体的模型方法。
BMC Public Health. 2024 Mar 20;24(1):863. doi: 10.1186/s12889-024-18310-6.
3
Predicting the behavioral intentions of hospice and palliative care providers from real-world data using supervised learning: A cross-sectional survey study.

本文引用的文献

1
Deriving a Passive Surveillance Stroke Severity Indicator From Routinely Collected Administrative Data: The PaSSV Indicator.从常规收集的行政数据中推导被动监测卒中严重程度指标:PaSSV指标。
Circ Cardiovasc Qual Outcomes. 2020 Feb;13(2):e006269. doi: 10.1161/CIRCOUTCOMES.119.006269. Epub 2020 Feb 14.
2
One-Year Home-Time and Mortality After Thrombolysis Compared With Nontreated Patients in a Propensity-Matched Analysis.溶栓治疗后与未治疗患者的 1 年居家时间和死亡率的倾向匹配分析。
Stroke. 2019 Dec;50(12):3488-3493. doi: 10.1161/STROKEAHA.119.026922. Epub 2019 Oct 1.
3
Home-Time Is a Feasible and Valid Stroke Outcome Measure in National Datasets.
利用监督学习从真实世界数据预测临终关怀和姑息治疗提供者的行为意向:一项横断面调查研究。
Front Public Health. 2022 Sep 30;10:927874. doi: 10.3389/fpubh.2022.927874. eCollection 2022.
4
Comparing regression modeling strategies for predicting hometime.比较预测出院时间的回归建模策略。
BMC Med Res Methodol. 2021 Jul 7;21(1):138. doi: 10.1186/s12874-021-01331-9.
家庭时间是国家数据集中风预后的一种可行和有效的衡量标准。
Stroke. 2019 May;50(5):1282-1285. doi: 10.1161/STROKEAHA.118.023916.
4
Home-Time as a Surrogate Marker for Functional Outcome After Aneurysmal Subarachnoid Hemorrhage.家时间作为动脉瘤性蛛网膜下腔出血后功能结局的替代标志物。
Stroke. 2018 Dec;49(12):3081-3084. doi: 10.1161/STROKEAHA.118.022808.
5
Identifying diabetes cases from administrative data: a population-based validation study.从行政数据中识别糖尿病病例:一项基于人群的验证研究。
BMC Health Serv Res. 2018 May 2;18(1):316. doi: 10.1186/s12913-018-3148-0.
6
Development and validation of a Hospital Frailty Risk Score focusing on older people in acute care settings using electronic hospital records: an observational study.开发和验证一个专注于急性护理环境中老年人的医院衰弱风险评分,使用电子医院记录:一项观察性研究。
Lancet. 2018 May 5;391(10132):1775-1782. doi: 10.1016/S0140-6736(18)30668-8. Epub 2018 Apr 26.
7
Population-based study of home-time by stroke type and correlation with modified Rankin score.基于人群的卒中类型居家时间研究及其与改良Rankin量表评分的相关性
Neurology. 2017 Nov 7;89(19):1970-1976. doi: 10.1212/WNL.0000000000004631. Epub 2017 Oct 11.
8
Factors Associated With Ischemic Stroke Survival and Recovery in Older Adults.老年人缺血性中风存活与康复的相关因素。
Stroke. 2017 Jul;48(7):1818-1826. doi: 10.1161/STROKEAHA.117.016726. Epub 2017 May 19.
9
Hospital Variation in Home-Time After Acute Ischemic Stroke: Insights From the PROSPER Study (Patient-Centered Research Into Outcomes Stroke Patients Prefer and Effectiveness Research).急性缺血性脑卒中后家庭时间的医院差异:来自 PROSPER 研究(以患者为中心的脑卒中患者偏好研究和有效性研究)的见解。
Stroke. 2016 Oct;47(10):2627-33. doi: 10.1161/STROKEAHA.116.013563. Epub 2016 Sep 13.
10
Charlson comorbidity index as a predictor of in-hospital death in acute ischemic stroke among very old patients: a single-cohort perspective study.查尔森合并症指数作为高龄急性缺血性卒中患者院内死亡的预测指标:一项单队列前瞻性研究
Neurol Sci. 2016 Sep;37(9):1443-8. doi: 10.1007/s10072-016-2602-1. Epub 2016 May 11.