School of Management, Guangxi University of Science and Technology, Liuzhou, Guangxi Province 545006, China.
Business School, Sichuan University, Chengdu, Sichuan Province 610000, China.
J Healthc Eng. 2021 Feb 20;2021:6247652. doi: 10.1155/2021/6247652. eCollection 2021.
This study aimed to provide effective methods for the identification of surgeries with high cancellation risk based on machine learning models and analyze the key factors that affect the identification performance. The data covered the period from January 1, 2013, to December 31, 2014, at West China Hospital in China, which focus on elective urologic surgeries. All surgeries were scheduled one day in advance, and all cancellations were of institutional resource- and capacity-related types. Feature selection strategies, machine learning models, and sampling methods are the most discussed topic in general machine learning researches and have a direct impact on the performance of machine learning models. Hence, they were considered to systematically generate complete schemes in machine learning-based identification of surgery cancellations. The results proved the feasibility and robustness of identifying surgeries with high cancellation risk, with the considerable maximum of area under the curve (AUC) (0.7199) for random forest model with original sampling using backward selection strategy. In addition, one-side Delong test and sum of square error analysis were conducted to measure the effects of feature selection strategy, machine learning model, and sampling method on the identification of surgeries with high cancellation risk, and the selection of machine learning model was identified as the key factors that affect the identification of surgeries with high cancellation risk. This study offers methodology and insights for identifying the key experimental factors for identifying surgery cancellations, and it is helpful to further research on machine learning-based identification of surgeries with high cancellation risk.
本研究旨在提供基于机器学习模型识别高取消风险手术的有效方法,并分析影响识别性能的关键因素。数据涵盖了中国华西医院 2013 年 1 月 1 日至 2014 年 12 月 31 日期间的择期泌尿科手术。所有手术均提前一天安排,所有取消均为机构资源和能力相关类型。特征选择策略、机器学习模型和采样方法是机器学习研究中最常讨论的话题,直接影响机器学习模型的性能。因此,它们被认为是系统地生成基于机器学习的手术取消识别完整方案的关键因素。研究结果证明了识别高取消风险手术的可行性和稳健性,使用后向选择策略的原始采样的随机森林模型的曲线下面积(AUC)最大值(0.7199)相当可观。此外,进行了单边 Delong 检验和平方和误差分析,以衡量特征选择策略、机器学习模型和采样方法对识别高取消风险手术的影响,确定机器学习模型的选择是影响识别高取消风险手术的关键因素。本研究为识别手术取消的关键实验因素提供了方法和见解,有助于进一步研究基于机器学习的高取消风险手术识别。