• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

随机森林变量重要性度量在医学中的正确使用和误用:通过中风事件预测进行演示。

Use and misuse of random forest variable importance metrics in medicine: demonstrations through incident stroke prediction.

机构信息

Department of Psychiatry, University of Pittsburgh, 3811 O'Hara Street, Pittsburgh, PA, 15231, USA.

Department of Statistics, University of Pittsburgh, Pittsburgh, PA, USA.

出版信息

BMC Med Res Methodol. 2023 Jun 19;23(1):144. doi: 10.1186/s12874-023-01965-x.

DOI:10.1186/s12874-023-01965-x
PMID:37337173
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10280951/
Abstract

BACKGROUND

Machine learning tools such as random forests provide important opportunities for modeling large, complex modern data generated in medicine. Unfortunately, when it comes to understanding why machine learning models are predictive, applied research continues to rely on 'out of bag' (OOB) variable importance metrics (VIMPs) that are known to have considerable shortcomings within the statistics community. After explaining the limitations of OOB VIMPs - including bias towards correlated features and limited interpretability - we describe a modern approach called 'knockoff VIMPs' and explain its advantages.

METHODS

We first evaluate current VIMP practices through an in-depth literature review of 50 recent random forest manuscripts. Next, we recommend organized and interpretable strategies for analysis with knockoff VIMPs, including computing them for groups of features and considering multiple model performance metrics. To demonstrate methods, we develop a random forest to predict 5-year incident stroke in the Sleep Heart Health Study and compare results based on OOB and knockoff VIMPs.

RESULTS

Nearly all papers in the literature review contained substantial limitations in their use of VIMPs. In our demonstration, using OOB VIMPs for individual variables suggested two highly correlated lung function variables (forced expiratory volume, forced vital capacity) as the best predictors of incident stroke, followed by age and height. Using an organized analytic approach that considered knockoff VIMPs of both groups of features and individual features, the largest contributions to model sensitivity were medications (especially cardiovascular) and measured medical risk factors, while the largest contributions to model specificity were age, diastolic blood pressure, self-reported medical risk factors, polysomnography features, and pack-years of smoking. Thus, we reach very different conclusions about stroke risk factors using OOB VIMPs versus knockoff VIMPs.

CONCLUSIONS

The near-ubiquitous reliance on OOB VIMPs may provide misleading results for researchers who use such methods to guide their research. Given the rapid pace of scientific inquiry using machine learning, it is essential to bring modern knockoff VIMPs that are interpretable and unbiased into widespread applied practice to steer researchers using random forest machine learning toward more meaningful results.

摘要

背景

机器学习工具,如随机森林,为在医学中建模大型、复杂的现代数据提供了重要机会。不幸的是,在理解为什么机器学习模型具有预测性时,应用研究仍然依赖于“袋外”(OOB)变量重要性指标(VIMPs),这些指标在统计学界被认为存在相当大的缺陷。在解释了 OOB VIMPs 的局限性后——包括对相关特征的偏见和有限的可解释性——我们描述了一种称为“knockoff VIMPs”的现代方法,并解释了它的优势。

方法

我们首先通过对 50 篇近期随机森林手稿的深入文献回顾,评估当前 VIMP 实践。接下来,我们建议使用 knockoff VIMPs 进行分析的有组织和可解释的策略,包括为特征组计算它们并考虑多个模型性能指标。为了演示方法,我们开发了一个随机森林来预测睡眠心脏健康研究中的 5 年中风事件,并比较基于 OOB 和 knockoff VIMPs 的结果。

结果

文献综述中的几乎所有论文在使用 VIMPs 方面都存在重大局限性。在我们的演示中,使用 OOB VIMPs 对单个变量进行分析表明,两个高度相关的肺功能变量(用力呼气量、用力肺活量)是中风事件的最佳预测指标,其次是年龄和身高。使用一种有组织的分析方法,同时考虑了特征组和单个特征的 knockoff VIMPs,对模型敏感性贡献最大的是药物(特别是心血管药物)和测量的医疗风险因素,而对模型特异性贡献最大的是年龄、舒张压、自我报告的医疗风险因素、多导睡眠图特征和吸烟包年数。因此,我们使用 OOB VIMPs 和 knockoff VIMPs 得出了关于中风风险因素的非常不同的结论。

结论

由于研究人员使用这些方法来指导他们的研究,因此近乎普遍依赖 OOB VIMPs 可能会提供误导性的结果。鉴于使用机器学习进行科学研究的快速步伐,必须将可解释和无偏的现代 knockoff VIMPs 广泛应用于应用实践中,以使使用随机森林机器学习的研究人员能够得出更有意义的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86af/10280951/6a4a8e49cca8/12874_2023_1965_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86af/10280951/d7378f37cbda/12874_2023_1965_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86af/10280951/7af68099b265/12874_2023_1965_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86af/10280951/6a4a8e49cca8/12874_2023_1965_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86af/10280951/d7378f37cbda/12874_2023_1965_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86af/10280951/7af68099b265/12874_2023_1965_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86af/10280951/6a4a8e49cca8/12874_2023_1965_Fig3_HTML.jpg

相似文献

1
Use and misuse of random forest variable importance metrics in medicine: demonstrations through incident stroke prediction.随机森林变量重要性度量在医学中的正确使用和误用:通过中风事件预测进行演示。
BMC Med Res Methodol. 2023 Jun 19;23(1):144. doi: 10.1186/s12874-023-01965-x.
2
Exploring the variable importance in random forests under correlations: a general concept applied to donor organ quality in post-transplant survival.探讨相关性下随机森林中的变量重要性:一个应用于移植后存活中供体器官质量的通用概念。
BMC Med Res Methodol. 2023 Sep 19;23(1):209. doi: 10.1186/s12874-023-02023-2.
3
Multidimensional Sleep and Mortality in Older Adults: A Machine-Learning Comparison With Other Risk Factors.多维睡眠与老年人死亡率:与其他风险因素的机器学习比较。
J Gerontol A Biol Sci Med Sci. 2019 Nov 13;74(12):1903-1909. doi: 10.1093/gerona/glz044.
4
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
A Machine Learning Approach to Support Urgent Stroke Triage Using Administrative Data and Social Determinants of Health at Hospital Presentation: Retrospective Study.一种基于机器学习的方法,利用医院就诊时的行政数据和健康社会决定因素来支持紧急脑卒中分诊:回顾性研究。
J Med Internet Res. 2023 Jan 30;25:e36477. doi: 10.2196/36477.
7
Feature Selection Algorithm based on Random Forest applied to Sleep Apnea Detection.基于随机森林的特征选择算法在睡眠呼吸暂停检测中的应用
Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul;2019:2580-2583. doi: 10.1109/EMBC.2019.8856582.
8
The Impact of Time Horizon on Classification Accuracy: Application of Machine Learning to Prediction of Incident Coronary Heart Disease.时间范围对分类准确性的影响:机器学习在预测冠心病发病中的应用。
JMIR Cardio. 2022 Nov 2;6(2):e38040. doi: 10.2196/38040.
9
Novel learning framework (knockoff technique) to evaluate metric ranking algorithms to describe human response to injury.用于评估度量排序算法以描述人类对损伤反应的新型学习框架(仿冒技术)。
Traffic Inj Prev. 2018;19(sup2):S121-S126. doi: 10.1080/15389588.2018.1519805. Epub 2018 Dec 20.
10
Knockoff boosted tree for model-free variable selection.无模型变量选择的仿射提升树。
Bioinformatics. 2021 May 17;37(7):976-983. doi: 10.1093/bioinformatics/btaa770.

引用本文的文献

1
Machine learning integration of bulk and single-cell RNA-seq data reveals glycolytic heterogeneity in colorectal cancer.批量和单细胞RNA测序数据的机器学习整合揭示了结直肠癌中的糖酵解异质性。
Med Oncol. 2025 Aug 30;42(10):458. doi: 10.1007/s12032-025-03007-6.
2
Serial Serum Immunoglobulin G Levels and Correlation with Outcomes in Children with Guillain Barré Syndrome: Authors' Reply-2.格林-巴利综合征患儿血清免疫球蛋白G水平系列及其与预后的相关性:作者回复-2
Indian J Pediatr. 2025 Aug 29. doi: 10.1007/s12098-025-05709-1.
3
Integrating bulk and single-cell RNA data with machine learning to explore mitophagy in periodontitis.

本文引用的文献

1
A general framework for inference on algorithm-agnostic variable importance.一种用于推断与算法无关的变量重要性的通用框架。
J Am Stat Assoc. 2023;118(543):1645-1658. doi: 10.1080/01621459.2021.2003200. Epub 2022 Jan 5.
2
Physiological sleep measures predict time to 15-year mortality in community adults: Application of a novel machine learning framework.生理睡眠测量可预测社区成年人 15 年的死亡率:一种新型机器学习框架的应用。
J Sleep Res. 2021 Dec;30(6):e13386. doi: 10.1111/jsr.13386. Epub 2021 May 15.
3
Risk Factors for Incident Stroke and Its Subtypes in China: A Prospective Study.
整合批量和单细胞RNA数据并结合机器学习以探索牙周炎中的线粒体自噬。
Medicine (Baltimore). 2025 Aug 22;104(34):e44002. doi: 10.1097/MD.0000000000044002.
4
Distilling knowledge from graph neural networks trained on cell graphs to non-neural student models.从在细胞图上训练的图神经网络中提取知识,用于非神经学生模型。
Sci Rep. 2025 Aug 10;15(1):29274. doi: 10.1038/s41598-025-13697-7.
5
Machine learning based classification of catastrophic health expenditures: a cross-sectional study of Korean low-income households.基于机器学习的灾难性卫生支出分类:韩国低收入家庭的横断面研究
BMC Health Serv Res. 2025 Aug 7;25(1):1040. doi: 10.1186/s12913-025-13139-0.
6
Predicting ICU Delirium in Critically Ill COVID-19 Patients Using Demographic, Clinical, and Laboratory Admission Data: A Machine Learning Approach.利用人口统计学、临床和实验室入院数据预测重症 COVID-19 患者的 ICU 谵妄:一种机器学习方法。
Life (Basel). 2025 Jun 30;15(7):1045. doi: 10.3390/life15071045.
7
Prediction of birthweight with early and mid-pregnancy antenatal markers utilising machine learning and explainable artificial intelligence.利用机器学习和可解释人工智能,通过孕早期和孕中期产前标志物预测出生体重。
Sci Rep. 2025 Jul 19;15(1):26223. doi: 10.1038/s41598-025-11837-7.
8
An international multi-cohort investigation of self-reported sleep and future depressive symptoms in older adults.一项关于老年人自我报告的睡眠情况与未来抑郁症状的国际多队列调查。
Sci Rep. 2025 Jul 4;15(1):23918. doi: 10.1038/s41598-025-07864-z.
9
Ranking Nursing Diagnoses by Predictive Relevance for Intensive Care Unit Transfer Risk in Adult and Pediatric Patients: A Machine Learning Approach with Random Forest.通过预测相关性对成人和儿科患者重症监护病房转运风险进行护理诊断排序:一种基于随机森林的机器学习方法
Healthcare (Basel). 2025 Jun 4;13(11):1339. doi: 10.3390/healthcare13111339.
10
Predicting the risk of postoperative gastrointestinal bleeding in patients with Type A aortic dissection based on an interpretable machine learning model.基于可解释机器学习模型预测A型主动脉夹层患者术后胃肠道出血风险
Front Med (Lausanne). 2025 May 19;12:1554579. doi: 10.3389/fmed.2025.1554579. eCollection 2025.
中国首发卒中及其亚型的危险因素:一项前瞻性研究。
J Am Heart Assoc. 2020 Nov 3;9(21):e016352. doi: 10.1161/JAHA.120.016352. Epub 2020 Oct 26.
4
Sleep apnea and stroke.睡眠呼吸暂停与中风。
Curr Opin Neurol. 2020 Feb;33(1):4-9. doi: 10.1097/WCO.0000000000000781.
5
The bidirectional impact of sleep and circadian rhythm dysfunction in human ischaemic stroke: A systematic review.睡眠和昼夜节律紊乱对人类缺血性脑卒中的双向影响:系统评价。
Sleep Med Rev. 2019 Jun;45:54-69. doi: 10.1016/j.smrv.2019.03.003. Epub 2019 Mar 20.
6
Sleep Apnea Is a Risk Factor for Stroke and Vascular Dementia.睡眠呼吸暂停是中风和血管性痴呆的一个危险因素。
Curr Neurol Neurosci Rep. 2018 Jun 25;18(8):53. doi: 10.1007/s11910-018-0855-1.
7
Sleep disorders and the risk of stroke.睡眠障碍与中风风险。
Expert Rev Neurother. 2018 Jul;18(7):523-531. doi: 10.1080/14737175.2018.1489239. Epub 2018 Jun 25.
8
The National Sleep Research Resource: towards a sleep data commons.国家睡眠研究资源:迈向睡眠数据共享。
J Am Med Inform Assoc. 2018 Oct 1;25(10):1351-1358. doi: 10.1093/jamia/ocy064.
9
Hypertension and diabetes mellitus as a predictive risk factors for stroke.高血压和糖尿病作为中风的预测性风险因素。
Diabetes Metab Syndr. 2018 Jul;12(4):577-584. doi: 10.1016/j.dsx.2018.03.009. Epub 2018 Mar 19.
10
Stroke Epidemiology and Risk Factor Management.中风流行病学与风险因素管理
Continuum (Minneap Minn). 2017 Feb;23(1, Cerebrovascular Disease):15-39. doi: 10.1212/CON.0000000000000416.