• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

新型数据插补方法可用于 ICU 中多种类型的缺失数据。

Novel Data Imputation for Multiple Types of Missing Data in Intensive Care Units.

出版信息

IEEE J Biomed Health Inform. 2019 May;23(3):1243-1250. doi: 10.1109/JBHI.2018.2883606. Epub 2019 Apr 16.

DOI:10.1109/JBHI.2018.2883606
PMID:30998482
Abstract

The diversity and number of parameters monitored in an intensive care unit (ICU) make the resulting databases highly susceptible to quality issues, such as missing information and erroneous data entry, which adversely affect the downstream processing and predictive modeling. Missing data interpolation and imputation techniques, such as multiple imputation, expectation maximization, and hot-deck imputation techniques do not account for the type of missing data, which can lead to bias. In our study, we first model the missing data as three types: "neglectable" also known as a.k.a "missing completely at random," "recoverable" a.k.a. "missing at random," and "not easily recoverable" a.k.a. "missing not at random." We then design imputation techniques for each type of missing data. We use a publicly available database (MIMIC II) to demonstrate how these imputations perform with random forests for prediction. Our results indicate that these novel imputation techniques outperformed standard mean filling techniques and expectation maximization with a statistical significance p ≤ 0.01 in predicting ICU mortality.

摘要

重症监护病房 (ICU) 中监测的参数种类繁多,数量庞大,这使得由此产生的数据库非常容易出现质量问题,如信息缺失和数据录入错误,这会对下游处理和预测建模产生不利影响。缺失数据插补和估算技术(如多重插补、期望最大化和热插补技术)并没有考虑缺失数据的类型,这可能会导致偏差。在我们的研究中,我们首先将缺失数据建模为三种类型:“可忽略”也称为“完全随机缺失”,“可恢复”也称为“随机缺失”,以及“不易恢复”也称为“非随机缺失”。然后,我们为每种类型的缺失数据设计了估算技术。我们使用一个公开的数据库(MIMIC II)来演示这些估算方法如何与随机森林一起用于预测。我们的结果表明,这些新的估算技术在预测 ICU 死亡率方面优于标准均值填充技术和期望最大化技术,具有统计学意义 p ≤ 0.01。

相似文献

1
Novel Data Imputation for Multiple Types of Missing Data in Intensive Care Units.新型数据插补方法可用于 ICU 中多种类型的缺失数据。
IEEE J Biomed Health Inform. 2019 May;23(3):1243-1250. doi: 10.1109/JBHI.2018.2883606. Epub 2019 Apr 16.
2
DeepTSE: A Time-Sensitive Deep Embedding of ICU Data for Patient Modeling and Missing Data Imputation.DeepTSE:一种 ICU 数据的时间敏感深度嵌入方法,用于患者建模和缺失数据插补。
Stud Health Technol Inform. 2023 May 18;302:237-241. doi: 10.3233/SHTI230110.
3
Strategies for handling missing clinical data for automated surgical site infection detection from the electronic health record.从电子健康记录中自动检测手术部位感染时处理缺失临床数据的策略。
J Biomed Inform. 2017 Apr;68:112-120. doi: 10.1016/j.jbi.2017.03.009. Epub 2017 Mar 16.
4
Performance of Multiple Imputation Using Modern Machine Learning Methods in Electronic Health Records Data.基于现代机器学习方法在电子健康记录数据中的应用表现。
Epidemiology. 2023 Mar 1;34(2):206-215. doi: 10.1097/EDE.0000000000001578. Epub 2022 Dec 9.
5
Multiple imputation for handling missing outcome data when estimating the relative risk.采用多重插补处理估计相对危险度时丢失的结局数据。
BMC Med Res Methodol. 2017 Sep 6;17(1):134. doi: 10.1186/s12874-017-0414-5.
6
Use of the mean, hot deck and multiple imputation techniques to predict outcome in intensive care unit patients in Colombia.使用均值、热卡填充法和多重填补技术预测哥伦比亚重症监护病房患者的预后。
Stat Med. 2002 Dec 30;21(24):3885-96. doi: 10.1002/sim.1391.
7
[Simulation study on missing data imputation methods for longitudinal data in cohort studies].队列研究中纵向数据缺失值插补方法的模拟研究
Zhonghua Liu Xing Bing Xue Za Zhi. 2021 Oct 10;42(10):1889-1894. doi: 10.3760/cma.j.cn112338-20201130-01363.
8
Missing data approaches in eHealth research: simulation study and a tutorial for nonmathematically inclined researchers.电子健康研究中的缺失数据处理方法:模拟研究及面向非数学专业研究人员的教程
J Med Internet Res. 2010 Dec 19;12(5):e54. doi: 10.2196/jmir.1448.
9
Implementing Multiple Imputation for Missing Data in Longitudinal Studies When Models are Not Feasible: An Example Using the Random Hot Deck Approach.当模型不可行时,在纵向研究中对缺失数据实施多重填补:使用随机热卡方法的一个示例。
Clin Epidemiol. 2022 Nov 15;14:1387-1403. doi: 10.2147/CLEP.S368303. eCollection 2022.
10
Predicting missing quality of life data that were later recovered: an empirical comparison of approaches.预测后来恢复的缺失生活质量数据:方法的实证比较。
Clin Trials. 2010 Aug;7(4):333-42. doi: 10.1177/1740774510374626. Epub 2010 Jun 24.

引用本文的文献

1
Entering the new digital era of intensive care medicine: an overview of interdisciplinary approaches to use artificial intelligence for patients' benefit.进入重症监护医学的新数字时代:利用人工智能造福患者的跨学科方法概述。
Eur J Anaesthesiol Intensive Care. 2022 Dec 21;2(1):e0014. doi: 10.1097/EA9.0000000000000014. eCollection 2023 Feb.
2
Moving Beyond Medical Statistics: A Systematic Review on Missing Data Handling in Electronic Health Records.超越医学统计学:电子健康记录中缺失数据处理的系统评价
Health Data Sci. 2024 Dec 4;4:0176. doi: 10.34133/hds.0176. eCollection 2024.
3
Current status and trends in researches based on public intensive care databases: A scientometric investigation.
基于公共重症监护数据库的研究现状和趋势:一项科学计量学研究。
Front Public Health. 2022 Sep 15;10:912151. doi: 10.3389/fpubh.2022.912151. eCollection 2022.
4
Domain Adaptation Using Convolutional Autoencoder and Gradient Boosting for Adverse Events Prediction in the Intensive Care Unit.使用卷积自动编码器和梯度提升进行重症监护病房不良事件预测的域适应
Front Artif Intell. 2022 Apr 11;5:640926. doi: 10.3389/frai.2022.640926. eCollection 2022.
5
Combination of static and temporal data analysis to predict mortality and readmission in the intensive care.结合静态和时态数据分析以预测重症监护中的死亡率和再入院率。
Annu Int Conf IEEE Eng Med Biol Soc. 2017 Jul;2017:2570-2573. doi: 10.1109/EMBC.2017.8037382.