Paranjape Pearl R, Thai-Paquette Van, Miamidian John L, Parr Jim, Kazin Eyal A, McLaren Alex, Toler Krista, Deirmengian Carl
Department of Diagnostics Research and Development, Zimmer Biomet, Warsaw, USA.
Department of Data Science and Machine Learning, Zimmer Biomet, Swindon, GBR.
Cureus. 2023 Dec 24;15(12):e51036. doi: 10.7759/cureus.51036. eCollection 2023 Dec.
Background and objective The current periprosthetic joint infection (PJI) diagnostic guidelines require clinicians to interpret and integrate multiple criteria into a complex scoring system. Also, PJI classifications are often inconclusive, failing to provide a clinical diagnosis. Machine learning (ML) models could be leveraged to reduce reliance on these complex systems and thereby reduce diagnostic uncertainty. This study aimed to develop an ML algorithm using synovial fluid (SF) test results to establish a PJI probability score. Methods We used a large clinical laboratory's dataset of SF samples, aspirated from patients with hip or knee arthroplasty as part of a PJI evaluation. Patient age and SF biomarkers [white blood cell count, neutrophil percentage (%PMN), red blood cell count, absorbance at 280 nm wavelength, C-reactive protein (CRP), alpha-defensin (AD), neutrophil elastase, and microbial antigen (MID) tests] were used for model development. Data preprocessing, principal component analysis, and unsupervised clustering (K-means) revealed four clusters of samples that naturally aggregated based on biomarker results. Analysis of the characteristics of each of these four clusters revealed three clusters (n=13,133) with samples having biomarker results typical of a PJI-negative classification and one cluster (n=4,032) with samples having biomarker results typical of a PJI-positive classification. A decision tree model, trained and tested independently of external diagnostic rules, was then developed to match the classification determined by the unsupervised clustering. The performance of the model was assessed versus a modified 2018 International Consensus Meeting (ICM) criteria, in both the test cohort and an independent unlabeled validation set of 5,601 samples. The SHAP (SHapley Additive exPlanations) method was used to explore feature importance. Results The ML model showed an area under the curve of 0.993, with a sensitivity of 98.8%, specificity of 97.3%, positive predictive value (PPV) of 92.9%, and negative predictive value (NPV) of 99.8% in predicting the modified 2018 ICM diagnosis among test set samples. The model maintained its diagnostic accuracy in the validation cohort, yielding 99.1% sensitivity, 97.1% specificity, 91.9% PPV, and 99.9% NPV. The model's inconclusive rate (diagnostic probability between 20-80%) in the validation cohort was only 1.3%, lower than that observed with the modified 2018 ICM PJI classification (7.4%; p<0.001). The SHAP analysis found that AD was the most important feature in the model, exhibiting dominance among >95% of "infected" and "not infected" diagnoses. Other important features were the sum of the MID test panel, %PMN, and SF-CRP. Conclusions Although defined methods and tools for diagnosis of PJI using multiple biomarker criteria are available, they are not consistently applied or widely implemented. There is a need for algorithmic interpretation of these biomarkers to enable consistent interpretation of the results to drive treatment decisions. The new model, using clinical parameters measured from a patient's SF sample, renders a preoperative probability score for PJI which performs well compared to a modified 2018 ICM definition. Taken together with other clinical signs, this model has the potential to increase the accuracy of clinical evaluations and reduce the rate of inconclusive classification, thereby enabling more appropriate and expedited downstream treatment decisions.
背景与目的 当前的人工关节周围感染(PJI)诊断指南要求临床医生将多个标准进行解读并整合到一个复杂的评分系统中。此外,PJI的分类往往不明确,无法提供临床诊断。机器学习(ML)模型可用于减少对这些复杂系统的依赖,从而降低诊断的不确定性。本研究旨在开发一种利用滑液(SF)检测结果的ML算法,以建立PJI概率评分。方法 我们使用了一个大型临床实验室的SF样本数据集,这些样本取自接受髋关节或膝关节置换术的患者,作为PJI评估的一部分。患者年龄和SF生物标志物[白细胞计数、中性粒细胞百分比(%PMN)、红细胞计数、280nm波长处的吸光度、C反应蛋白(CRP)、α-防御素(AD)、中性粒细胞弹性蛋白酶和微生物抗原(MID)检测]用于模型开发。数据预处理、主成分分析和无监督聚类(K均值)揭示了基于生物标志物结果自然聚集的四个样本簇。对这四个簇中每个簇的特征分析显示,三个簇(n=13133)的样本具有典型的PJI阴性分类生物标志物结果,一个簇(n=4032)的样本具有典型的PJI阳性分类生物标志物结果。然后开发了一个独立于外部诊断规则进行训练和测试的决策树模型,以匹配无监督聚类确定的分类。在测试队列和一个包含5601个样本的独立未标记验证集中,根据修改后的2018年国际共识会议(ICM)标准评估模型的性能。使用SHAP(SHapley Additive exPlanations)方法探索特征重要性。结果 在测试集样本中,ML模型在预测修改后的2018年ICM诊断时,曲线下面积为0.993,灵敏度为98.8%,特异性为97.3%,阳性预测值(PPV)为92.9%,阴性预测值(NPV)为99.8%。该模型在验证队列中保持了其诊断准确性,灵敏度为99.1%,特异性为97.1%,PPV为91.9%,NPV为99.9%。该模型在验证队列中的不确定率(诊断概率在20%-80%之间)仅为1.3%,低于修改后的2018年ICM PJI分类的不确定率(7.4%;p<0.001)。SHAP分析发现,AD是模型中最重要的特征,在>95%的“感染”和“未感染”诊断中占主导地位。其他重要特征是MID检测组的总和、%PMN和SF-CRP。结论 虽然有使用多种生物标志物标准诊断PJI的既定方法和工具,但它们并未得到一致应用或广泛实施。需要对这些生物标志物进行算法解读,以便对结果进行一致解读,从而推动治疗决策。新模型使用从患者SF样本中测量的临床参数,得出PJI的术前概率评分,与修改后的2018年ICM定义相比表现良好。结合其他临床体征,该模型有可能提高临床评估的准确性,降低不确定分类率,从而实现更合适、更快速的下游治疗决策。