Sung MinDong, Hahn Sangchul, Han Chang Hoon, Lee Jung Mo, Lee Jayoung, Yoo Jinkyu, Heo Jay, Kim Young Sam, Chung Kyung Soo
Department of Biomedical Systems Informatics, Yonsei University College of Medicine, Seoul, Republic of Korea.
AITRICS. Inc, Seoul, Republic of Korea.
JMIR Med Inform. 2021 Nov 4;9(11):e26426. doi: 10.2196/26426.
In the era of artificial intelligence, event prediction models are abundant. However, considering the limitation of the electronic medical record-based model, including the temporally skewed prediction and the record itself, these models could be delayed or could yield errors.
In this study, we aim to develop multiple event prediction models in intensive care units to overcome their temporal skewness and evaluate their robustness against delayed and erroneous input.
A total of 21,738 patients were included in the development cohort. Three events-death, sepsis, and acute kidney injury-were predicted. To overcome the temporal skewness, we developed three models for each event, which predicted the events in advance of three prespecified timepoints. Additionally, to evaluate the robustness against input error and delays, we added simulated errors and delayed input and calculated changes in the area under the receiver operating characteristic curve (AUROC) values.
Most of the AUROC and area under the precision-recall curve values of each model were higher than those of the conventional scores, as well as other machine learning models previously used. In the error input experiment, except for our proposed model, an increase in the noise added to the model lowered the resulting AUROC value. However, the delayed input did not show the performance decreased in this experiment.
For a prediction model that was applicable in the real world, we considered not only performance but also temporal skewness, delayed input, and input error.
在人工智能时代,事件预测模型众多。然而,考虑到基于电子病历的模型存在局限性,包括时间上的偏态预测以及病历本身的问题,这些模型可能会出现延迟或产生错误。
在本研究中,我们旨在开发重症监护病房中的多种事件预测模型,以克服其时间偏态,并评估其对延迟和错误输入的鲁棒性。
共有21738例患者纳入开发队列。预测了三种事件——死亡、脓毒症和急性肾损伤。为了克服时间偏态,我们针对每个事件开发了三种模型,这些模型在三个预先指定的时间点之前预测事件。此外,为了评估对输入错误和延迟的鲁棒性,我们添加了模拟错误和延迟输入,并计算了受试者操作特征曲线(AUROC)值下面积的变化。
每个模型的大多数AUROC和精确召回率曲线下面积值均高于传统评分以及先前使用的其他机器学习模型。在错误输入实验中,除了我们提出的模型外,添加到模型中的噪声增加会降低所得的AUROC值。然而,在该实验中延迟输入并未显示性能下降。
对于适用于现实世界的预测模型,我们不仅考虑了性能,还考虑了时间偏态、延迟输入和输入错误。