Tu Zheyan, Jeffries Sean D, Morse Joshua, Hemmerling Thomas M
Department of Surgical and Interventional Sciences, McGill University Health Center, Montreal, Canada.
Department of Anesthesia, McGill University, Montreal, Canada.
J Clin Monit Comput. 2025 Jun;39(3):595-605. doi: 10.1007/s10877-024-01237-z. Epub 2024 Oct 29.
This study presents a comprehensive comparison of multiple time-series models applied to physiological metric predictions. It aims to explore the effectiveness of both statistical prediction models and pharmacokinetic-pharmacodynamic prediction model and modern deep learning approaches. Specifically, the study focuses on predicting the bispectral index (BIS), a vital metric in anesthesia used to assess the depth of sedation during surgery, using datasets collected from real-life surgeries. The goal is to evaluate and compare model performance considering both univariate and multivariate schemes. Accurate BIS prediction is essential for avoiding under- or over-sedation, which can lead to adverse outcomes. The study investigates a range of models: The traditional mathematical models include the pharmacokinetic-pharmacodynamic model and statistical models such as autoregressive integrated moving average (ARIMA) and vector autoregression (VAR). The deep learning models encompass recurrent neural networks (RNNs), specifically Long Short-Term Memory (LSTM) and Gated Recurrent Units (GRU), as well as Temporal Convolutional Networks (TCNs) and Transformer models. The analysis focuses on evaluating model performance in predicting the BIS using two distinct datasets of physiological metrics collected from actual surgical procedures. It explores both univariate and multivariate prediction schemes and investigates how different combinations of features and input sequence lengths impact model accuracy. The experimental findings reveal significant performance differences among the models: In univariate prediction scenarios for predicting BIS, the LSTM model demonstrates a 2.88% improvement over the second-best performing model. For multivariate predictions, the LSTM model outperforms others by 6.67% compared to the next best model. Furthermore, the addition of Electromyography (EMG) and Mean Arterial Pressure (MAP) brings significant accuracy improvement when predicting BIS. The study emphasizes the importance of selecting and building appropriate time-series models to achieve accurate predictions in biomedical applications. This research provides insights to guide future efforts in improving vital sign prediction methodologies for clinical and research purposes. Clinically, with improvements in the prediction of physiological parameters, clinicians can be informed of interventions if an anomaly is detected or predicted.
本研究对应用于生理指标预测的多个时间序列模型进行了全面比较。其目的是探索统计预测模型、药代动力学 - 药效学预测模型以及现代深度学习方法的有效性。具体而言,该研究聚焦于利用从实际手术中收集的数据集来预测脑电双频指数(BIS),这是麻醉中用于评估手术期间镇静深度的一项重要指标。目标是在考虑单变量和多变量方案的情况下评估和比较模型性能。准确的BIS预测对于避免镇静不足或过度至关重要,这可能导致不良后果。该研究调查了一系列模型:传统数学模型包括药代动力学 - 药效学模型以及诸如自回归积分移动平均(ARIMA)和向量自回归(VAR)等统计模型。深度学习模型包括递归神经网络(RNN),特别是长短期记忆(LSTM)和门控循环单元(GRU),以及时间卷积网络(TCN)和Transformer模型。分析重点在于使用从实际手术过程中收集的两个不同的生理指标数据集来评估模型在预测BIS方面的性能。它探索了单变量和多变量预测方案,并研究了特征和输入序列长度的不同组合如何影响模型准确性。实验结果揭示了模型之间存在显著的性能差异:在预测BIS的单变量预测场景中,LSTM模型比表现第二好的模型提高了2.88%。对于多变量预测,与次优模型相比,LSTM模型的表现优于其他模型6.67%。此外,在预测BIS时,添加肌电图(EMG)和平均动脉压(MAP)可显著提高准确性。该研究强调了选择和构建合适的时间序列模型以在生物医学应用中实现准确预测 的重要性。这项研究提供了见解,以指导未来为临床和研究目的改进生命体征预测方法的工作。在临床上,随着生理参数预测的改进,如果检测到或预测到异常情况,临床医生可以得到干预的通知。