Dept. of Systems Engineering and Engineering Management, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon Tong, Hong Kong.
Comput Methods Programs Biomed. 2018 Feb;154:191-203. doi: 10.1016/j.cmpb.2017.11.003. Epub 2017 Nov 7.
The Accident and Emergency Department (A&ED) is the frontline for providing emergency care in hospitals. Unfortunately, relative A&ED resources have failed to keep up with continuously increasing demand in recent years, which leads to overcrowding in A&ED. Knowing the fluctuation of patient arrival volume in advance is a significant premise to relieve this pressure. Based on this motivation, the objective of this study is to explore an integrated framework with high accuracy for predicting A&ED patient flow under different triage levels, by combining a novel feature selection process with deep neural networks.
Administrative data is collected from an actual A&ED and categorized into five groups based on different triage levels. A genetic algorithm (GA)-based feature selection algorithm is improved and implemented as a pre-processing step for this time-series prediction problem, in order to explore key features affecting patient flow. In our improved GA, a fitness-based crossover is proposed to maintain the joint information of multiple features during iterative process, instead of traditional point-based crossover. Deep neural networks (DNN) is employed as the prediction model to utilize their universal adaptability and high flexibility. In the model-training process, the learning algorithm is well-configured based on a parallel stochastic gradient descent algorithm. Two effective regularization strategies are integrated in one DNN framework to avoid overfitting. All introduced hyper-parameters are optimized efficiently by grid-search in one pass.
As for feature selection, our improved GA-based feature selection algorithm has outperformed a typical GA and four state-of-the-art feature selection algorithms (mRMR, SAFS, VIFR, and CFR). As for the prediction accuracy of proposed integrated framework, compared with other frequently used statistical models (GLM, seasonal-ARIMA, ARIMAX, and ANN) and modern machine models (SVM-RBF, SVM-linear, RF, and R-LASSO), the proposed integrated "DNN-I-GA" framework achieves higher prediction accuracy on both MAPE and RMSE metrics in pairwise comparisons.
The contribution of our study is two-fold. Theoretically, the traditional GA-based feature selection process is improved to have less hyper-parameters and higher efficiency, and the joint information of multiple features is maintained by fitness-based crossover operator. The universal property of DNN is further enhanced by merging different regularization strategies. Practically, features selected by our improved GA can be used to acquire an underlying relationship between patient flows and input features. Predictive values are significant indicators of patients' demand and can be used by A&ED managers to make resource planning and allocation. High accuracy achieved by the present framework in different cases enhances the reliability of downstream decision makings.
急诊科(A&ED)是医院提供紧急护理的第一线。不幸的是,相对的 A&ED 资源近年来未能跟上不断增长的需求,导致 A&ED 人满为患。提前了解患者到达量的波动是缓解这种压力的重要前提。基于这一动机,本研究的目的是通过结合一种新颖的特征选择过程和深度神经网络,探索一种具有高精度的综合框架,以预测不同分诊水平下的 A&ED 患者流量。
从实际的 A&ED 中收集行政数据,并根据不同的分诊水平将其分为五组。为了解决这个时间序列预测问题,我们改进并实现了一种基于遗传算法(GA)的特征选择算法作为预处理步骤,以探索影响患者流量的关键特征。在我们改进的 GA 中,提出了一种基于适应度的交叉操作,以在迭代过程中保持多个特征的联合信息,而不是传统的基于点的交叉操作。深度神经网络(DNN)被用作预测模型,以利用其通用适应性和高灵活性。在模型训练过程中,根据并行随机梯度下降算法对学习算法进行了很好的配置。在一个 DNN 框架中集成了两种有效的正则化策略,以避免过拟合。所有引入的超参数都通过一次网格搜索进行高效优化。
就特征选择而言,我们改进的基于 GA 的特征选择算法优于典型的 GA 和四种最先进的特征选择算法(mRMR、SAFS、VIFR 和 CFR)。就所提出的集成框架的预测准确性而言,与其他常用的统计模型(GLM、季节性-ARIMA、ARIMAX 和 ANN)和现代机器模型(SVM-RBF、SVM-线性、RF 和 R-LASSO)相比,所提出的集成“DNN-I-GA”框架在 MAPE 和 RMSE 度量上的预测准确性更高在成对比较中。
我们的研究贡献有两个方面。从理论上讲,改进了传统的基于 GA 的特征选择过程,使其具有更少的超参数和更高的效率,并通过基于适应度的交叉操作保持多个特征的联合信息。DNN 的通用属性通过合并不同的正则化策略得到进一步增强。从实践上讲,我们改进的 GA 选择的特征可用于获取患者流量与输入特征之间的潜在关系。预测值是患者需求的重要指标,可用于 A&ED 管理人员进行资源规划和分配。本框架在不同情况下实现的高精度提高了下游决策的可靠性。