Department of Obstetrics and Gynecology, Shunyi Women's and Children's Hospital of Beijing Children's Hospital, No. 1 of shun Hong Road, Shunyi District, Beijing, 101300, China.
BMC Womens Health. 2024 Sep 5;24(1):491. doi: 10.1186/s12905-024-03334-2.
The aim of this study is to assess the use of machine learning methodologies in the diagnosis of endometriosis (EM).
This study included a total of 106 patients with EM and 203 patients with non-EM conditions (like simple cysts and simple uterine fibroids), all admitted to the Shunyi Women's and Children's Hospital of Beijing Children's Hospital between January 2017 and September 2022. All participants were free of comorbidities and their diagnoses were confirmed via postoperative pathology. Comparative analysis was conducted between the EM and non-EM groups. Baseline data were assessed, including white blood cell count, neutrophil-to-lymphocyte ratio (NLR), platelet-to-lymphocyte ratio, lymphocyte-to-monocyte ratio, mean platelet volume, hemoglobin, carbohydrate antigen 125 (CA125), carbohydrate antigen 199, coagulation parameters, and other serologic indicators. An optimal predictive model was developed using an artificial intelligence algorithm to determine the presence of EM. The objective is to provide new insights for the clinical diagnosis and treatment of EM.
The random forest algorithm demonstrated superior performance when compared to decision trees, LogitBoost, artificial neural networks, naïve Bayes, support vector machines, and linear regression in machine learning methods. Combining CA125 with the NLR yielded a better prediction of EM than using CA125 alone when applying the random forest algorithm. The accuracy of predicting EM with CA125 combined with NLR was 78.16%, with a sensitivity of 86.21% and an area under the curve (AUC) of 0.85 (P < 0.05). In contrast, using CA125 alone resulted in an EM prediction accuracy of 75.8%, with a sensitivity of 79.3% and an AUC of 0.82 (P < 0.05).
The diagnostic value of serum CA125 combined with the NLR for EM is higher than that of serum CA125 alone. This finding indicates that NLR could serve as a new supplementary biomarker along with serum CA125 in the diagnosis of EM.
本研究旨在评估机器学习方法在子宫内膜异位症(EM)诊断中的应用。
本研究共纳入 2017 年 1 月至 2022 年 9 月期间在北京儿童医院顺义妇儿医院就诊的 106 例 EM 患者和 203 例非 EM 患者(如单纯囊肿和单纯子宫肌瘤)。所有患者均无合并症,且术后病理证实诊断。对 EM 组和非 EM 组进行对比分析。评估基线数据,包括白细胞计数、中性粒细胞与淋巴细胞比值(NLR)、血小板与淋巴细胞比值、淋巴细胞与单核细胞比值、平均血小板体积、血红蛋白、糖抗原 125(CA125)、糖抗原 199、凝血参数和其他血清学指标。采用人工智能算法建立最佳预测模型,以确定 EM 的存在。目的是为 EM 的临床诊断和治疗提供新的思路。
在机器学习方法中,随机森林算法的性能优于决策树、LogitBoost、人工神经网络、朴素贝叶斯、支持向量机和线性回归。与单独使用 CA125 相比,应用随机森林算法时,将 CA125 与 NLR 结合使用可更好地预测 EM。CA125 联合 NLR 预测 EM 的准确率为 78.16%,敏感度为 86.21%,曲线下面积(AUC)为 0.85(P<0.05)。相比之下,单独使用 CA125 预测 EM 的准确率为 75.8%,敏感度为 79.3%,AUC 为 0.82(P<0.05)。
血清 CA125 联合 NLR 对 EM 的诊断价值高于单独使用血清 CA125。这表明 NLR 可作为 CA125 以外的新的补充生物标志物,用于 EM 的诊断。