Bioengineering Laboratory, Department of Orthopaedic Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts.
J Arthroplasty. 2023 Oct;38(10):1973-1981. doi: 10.1016/j.arth.2023.01.065. Epub 2023 Feb 9.
Nonhome discharge disposition following primary total knee arthroplasty (TKA) is associated with a higher rate of complications and constitutes a socioeconomic burden on the health care system. While existing algorithms predicting nonhome discharge disposition varied in degrees of mathematical complexity and prediction power, their capacity to generalize predictions beyond the development dataset remains limited. Therefore, this study aimed to establish the machine learning model generalizability by performing internal and external validations using nation-scale and institutional cohorts, respectively.
Four machine learning models were trained using the national cohort. Recursive feature elimination and hyper-parameter tuning were applied. Internal validation was achieved through five-fold cross-validation during model training. The trained models' performance was externally validated using the institutional cohort and assessed by discrimination, calibration, and clinical utility.
The national (424,354 patients) and institutional (10,196 patients) cohorts had non-home discharge rates of 19.4 and 36.4%, respectively. The areas under the receiver operating curve of the model predictions were 0.83 to 0.84 during internal validation and increased to 0.88 to 0.89 during external validation. Artificial neural network and histogram-based gradient boosting elicited the best performance with a mean area under the receiver operating curve of 0.89, calibration slope of 1.39, and Brier score of 0.14, which indicated that the two models were robust in distinguishing non-home discharge and well-calibrated with accurate predictions of the probabilities. The low inter-dataset similarity indicated reliable external validation. Length of stay, age, body mass index, and sex were the strongest predictors of discharge destination after primary TKA.
The machine learning models demonstrated excellent predictive performance during both internal and external validations, supporting their generalizability across different patient cohorts and potential applicability in the clinical workflow.
初次全膝关节置换术(TKA)后非居家出院处置与更高的并发症发生率相关,并对医疗保健系统构成社会经济负担。虽然现有的预测非居家出院处置的算法在数学复杂性和预测能力方面存在差异,但它们在超出开发数据集的范围进行预测的能力仍然有限。因此,本研究旨在通过分别使用全国范围和机构队列进行内部和外部验证来建立机器学习模型的可泛化性。
使用全国队列训练了四个机器学习模型。应用递归特征消除和超参数调优。在模型训练过程中通过五折交叉验证进行内部验证。使用机构队列对训练模型的性能进行外部验证,并通过区分度、校准和临床实用性进行评估。
全国(424354 例患者)和机构(10196 例患者)队列的非居家出院率分别为 19.4%和 36.4%。模型预测的内部验证中接收器操作曲线下面积为 0.83 至 0.84,外部验证中增加至 0.88 至 0.89。人工神经网络和基于直方图的梯度提升产生了最佳性能,接收器操作曲线下面积的平均值为 0.89,校准斜率为 1.39,Brier 得分为 0.14,这表明这两个模型在区分非居家出院方面稳健,并且校准良好,对概率的预测准确。数据集之间的低相似性表明了可靠的外部验证。住院时间、年龄、体重指数和性别是初次 TKA 后出院目的地的最强预测因素。
机器学习模型在内部和外部验证中均表现出出色的预测性能,支持其在不同患者队列中的可泛化性和在临床工作流程中的潜在适用性。