• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

存在缺失数据时通过随机森林评估预测因子的重要性:利用实验室数据进行肝脏手术的风险预测

Estimation of a predictor's importance by Random Forests when there is missing data: risk prediction in liver surgery using laboratory data.

作者信息

Hapfelmeier Alexander, Hothorn Torsten, Riediger Carina, Ulm Kurt

出版信息

Int J Biostat. 2014;10(2):165-83. doi: 10.1515/ijb-2013-0038.

DOI:10.1515/ijb-2013-0038
PMID:24914728
Abstract

Abstract In the last few decades, new developments in liver surgery have led to an expanded applicability and an improved safety. However, liver surgery is still associated with postoperative morbidity and mortality, especially in extended resections. We analyzed a large liver surgery database to investigate whether laboratory parameters like haemoglobin, leucocytes, bilirubin, haematocrit and lactate might be relevant preoperative predictors. It is not uncommon to observe missing values in such data. This also holds for many other data sources and research fields. For analysis, one can make use of imputation methods or approaches that are able to deal with missing values in the predictor variables. A representative of the latter are Random Forests which also provide variable importance measures to assess a variable's relevance for prediction. Applied to the liver surgery data, we observed divergent results for the laboratory parameters, depending on the method used to cope with missing values. We therefore performed an extensive simulation study to investigate the properties of each approach. Findings and recommendations: Complete case analysis should not be used as it distorts the relevance of completely observed variables in an undesirable way. The estimation of a variable's importance by a self-contained measure that can deal with missing values appropriately reflects the decreased relevance of variables with missing values. It can therefore be used to obtain insight into Random Forests which are commonly fit without preprocessing of missing values in the data. By contrast, multiple imputation allows for the assessment of a variable's relevance one would potentially observe in complete-data situations, if imputation performs well. For the laboratory data, lactate and bilirubin seem to be associated with the risk of liver failure and postoperative complications. These relations should be investigated by future studies in more detail. However, it is important to carefully consider the method used for analysis when there are missing values in the predictor variables.

摘要

摘要 在过去几十年中,肝脏手术的新进展使其适用性得以扩大,安全性也有所提高。然而,肝脏手术仍与术后发病率和死亡率相关,尤其是在扩大切除术中。我们分析了一个大型肝脏手术数据库,以研究诸如血红蛋白、白细胞、胆红素、血细胞比容和乳酸等实验室参数是否可能是相关的术前预测指标。在这类数据中观察到缺失值并不罕见。许多其他数据源和研究领域也是如此。对于分析,可以使用插补方法或能够处理预测变量中缺失值的方法。后者的一个代表是随机森林,它还提供变量重要性度量来评估变量与预测的相关性。应用于肝脏手术数据时,我们根据用于处理缺失值的方法观察到实验室参数的结果存在差异。因此,我们进行了一项广泛的模拟研究,以研究每种方法的特性。研究结果与建议:不应使用完整病例分析,因为它会以不良方式扭曲完全观察到的变量的相关性。通过一种能够适当处理缺失值的独立度量来估计变量的重要性,能恰当地反映具有缺失值的变量相关性的降低。因此,它可用于深入了解通常在不对数据中的缺失值进行预处理的情况下拟合的随机森林。相比之下,如果插补效果良好,多重插补可以评估在完整数据情况下可能观察到的变量相关性。对于实验室数据,乳酸和胆红素似乎与肝衰竭风险和术后并发症相关。这些关系应在未来的研究中进行更详细的调查。然而,当预测变量中存在缺失值时,仔细考虑用于分析的方法非常重要。

相似文献

1
Estimation of a predictor's importance by Random Forests when there is missing data: risk prediction in liver surgery using laboratory data.存在缺失数据时通过随机森林评估预测因子的重要性:利用实验室数据进行肝脏手术的风险预测
Int J Biostat. 2014;10(2):165-83. doi: 10.1515/ijb-2013-0038.
2
Missing data in the American College of Surgeons National Surgical Quality Improvement Program are not missing at random: implications and potential impact on quality assessments.美国外科医师学会国家手术质量改进计划中的缺失数据并非随机缺失:对质量评估的影响和潜在影响。
J Am Coll Surg. 2010 Feb;210(2):125-139.e2. doi: 10.1016/j.jamcollsurg.2009.10.021.
3
Using the outcome for imputation of missing predictor values was preferred.使用结果来插补缺失的预测变量值是更可取的。
J Clin Epidemiol. 2006 Oct;59(10):1092-101. doi: 10.1016/j.jclinepi.2006.01.009. Epub 2006 Jun 19.
4
Missing data and interpretation of cancer surgery outcomes at the American College of Surgeons National Surgical Quality Improvement Program.美国外科医师学会国家外科质量改进计划中癌症手术结果的缺失数据和解释。
J Am Coll Surg. 2011 Sep;213(3):379-91. doi: 10.1016/j.jamcollsurg.2011.05.010. Epub 2011 Jun 22.
5
Missing data and imputation: a practical illustration in a prognostic study on low back pain.缺失数据与插补:腰痛预后研究中的实际例证
J Manipulative Physiol Ther. 2012 Jul;35(6):464-71. doi: 10.1016/j.jmpt.2012.07.002.
6
Collateral missing value imputation: a new robust missing value estimation algorithm for microarray data.并行缺失值插补:一种用于微阵列数据的新型稳健缺失值估计算法。
Bioinformatics. 2005 May 15;21(10):2417-23. doi: 10.1093/bioinformatics/bti345. Epub 2005 Feb 24.
7
Sequential imputation for missing values.缺失值的顺序插补法。
Comput Biol Chem. 2007 Oct;31(5-6):320-7. doi: 10.1016/j.compbiolchem.2007.07.001. Epub 2007 Jul 10.
8
Dealing with missing data in a multi-question depression scale: a comparison of imputation methods.处理多问题抑郁量表中的缺失数据:插补方法比较
BMC Med Res Methodol. 2006 Dec 13;6:57. doi: 10.1186/1471-2288-6-57.
9
DNA microarray data imputation and significance analysis of differential expression.DNA微阵列数据插补与差异表达的显著性分析
Bioinformatics. 2005 Nov 15;21(22):4155-61. doi: 10.1093/bioinformatics/bti638. Epub 2005 Aug 23.
10
[Roaming through methodology. XVI. What to do about missing data].[方法学漫谈。第十六篇。如何处理缺失数据]
Ned Tijdschr Geneeskd. 1999 Oct 2;143(40):1996-2000.

引用本文的文献

1
Data Loss Reconstruction Method for a Bridge Weigh-in-Motion System Using Generative Adversarial Networks.基于生成对抗网络的桥载称重系统数据丢失重构方法。
Sensors (Basel). 2022 Jan 23;22(3):858. doi: 10.3390/s22030858.
2
Accessing habitat suitability and connectivity for the westernmost population of Asian black bear (Ursus thibetanus gedrosianus, Blanford, 1877) based on climate changes scenarios in Iran.基于伊朗气候变化情景的亚洲黑熊(Ursus thibetanus gedrosianus,Blanford,1877)最西部种群的生境适宜性和连通性评估。
PLoS One. 2020 Nov 18;15(11):e0242432. doi: 10.1371/journal.pone.0242432. eCollection 2020.