1 Savvysherpa, Inc., Minneapolis, MN, USA.
2 Pharmacy Services, Mayo Clinic, Rochester, MN, USA.
Stat Methods Med Res. 2018 Nov;27(11):3271-3285. doi: 10.1177/0962280217696115. Epub 2017 Mar 1.
Hospital-specific electronic health record systems are used to inform clinical practice about best practices and quality improvements. Many surgical centers have developed deterministic clinical decision rules to discover adverse events (e.g. postoperative complications) using electronic health record data. However, these data provide opportunities to use probabilistic methods for early prediction of adverse health events, which may be more informative than deterministic algorithms. Electronic health record data from a set of 9598 colorectal surgery cases from 2010 to 2014 were used to predict the occurrence of selected complications including surgical site infection, ileus, and bleeding. Consistent with previous studies, we find a high rate of missing values for both covariates and complication information (4-90%). Several machine learning classification methods are trained on an 80% random sample of cases and tested on a remaining holdout set. Predictive performance varies by complication, although an area under the receiver operating characteristic curve as high as 0.86 on testing data was achieved for bleeding complications, and accuracy for all complications compares favorably to existing clinical decision rules. Our results confirm that electronic health records provide opportunities for improved risk prediction of surgical complications; however, consideration of data quality and consistency standards is an important step in predictive modeling with such data.
医院特定的电子健康记录系统用于为临床实践提供最佳实践和质量改进的信息。许多外科中心已经开发了确定性临床决策规则,以使用电子健康记录数据发现不良事件(例如术后并发症)。然而,这些数据提供了使用概率方法对不良健康事件进行早期预测的机会,这可能比确定性算法更具信息量。使用 2010 年至 2014 年 9598 例结肠直肠手术病例的电子健康记录数据来预测选定并发症(包括手术部位感染、肠梗阻和出血)的发生。与之前的研究一致,我们发现协变量和并发症信息的缺失值率都很高(4-90%)。在 80%的病例随机样本上训练了几种机器学习分类方法,并在剩余的保留集上进行了测试。预测性能因并发症而异,尽管在测试数据上达到了高达 0.86 的接收者操作特征曲线下面积,但出血并发症的准确性与现有的临床决策规则相比具有优势。我们的结果证实,电子健康记录为手术并发症的风险预测提供了机会;然而,考虑数据质量和一致性标准是使用此类数据进行预测建模的重要步骤。