Chen Mingyang, Wang Jiaxu, Xue Peng, Li Qing, Jiang Yu, Qiao Youlin
School of Population Medicine and Public Health, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100005, China.
Diagnosis and Treatment for Cervical Lesions Center, Shenzhen Maternity & Child Healthcare Hospital, Shenzhen 518028, China.
Diagnostics (Basel). 2022 Dec 6;12(12):3066. doi: 10.3390/diagnostics12123066.
Colposcopy plays an essential role in cervical cancer control, but its performance remains unsatisfactory. This study evaluates the feasibility of machine learning (ML) models for predicting high-grade squamous intraepithelial lesions or worse (HSIL+) in patients referred for colposcopy by combining colposcopic findings with demographic and screening results. In total, 7485 patients who underwent colposcopy examination in seven hospitals in mainland China were used to train, internally validate, and externally validate six commonly used ML models, including logistic regression, decision tree, naïve bayes, support vector machine, random forest, and extreme gradient boosting. Nine variables, including age, gravidity, parity, menopause status, cytological results, high-risk human papillomavirus (HR-HPV) infection type, HR-HPV multi-infection, transformation zone (TZ) type, and colposcopic impression, were used for model construction. Colposcopic impression, HR-HPV results, and cytology results were the top three variables that determined model performance among all included variables. In the internal validation set, six ML models that integrated demographics, screening results, and colposcopic impression showed significant improvements in the area under the curve (AUC) (0.067 to 0.099) and sensitivity (11.55% to 14.88%) compared with colposcopists. Greater increases in AUC (0.087 to 0.119) and sensitivity (17.17% to 22.08%) were observed in the six models with the external validation set. By incorporating demographics, screening results, and colposcopic impressions, ML improved the AUC and sensitivity for detecting HSIL+ in patients referred for colposcopy. Such models could transform the subjective experience into objective judgments to help clinicians make decisions at the time of colposcopy examinations.
阴道镜检查在宫颈癌防治中发挥着重要作用,但其表现仍不尽人意。本研究通过将阴道镜检查结果与人口统计学和筛查结果相结合,评估机器学习(ML)模型预测阴道镜检查转诊患者高级别鳞状上皮内病变或更严重病变(HSIL+)的可行性。总共7485名在中国大陆七家医院接受阴道镜检查的患者被用于训练、内部验证和外部验证六种常用的ML模型,包括逻辑回归、决策树、朴素贝叶斯、支持向量机、随机森林和极端梯度提升。九个变量,包括年龄、妊娠次数、产次、绝经状态、细胞学结果、高危型人乳头瘤病毒(HR-HPV)感染类型、HR-HPV多重感染、转化区(TZ)类型和阴道镜印象,被用于模型构建。在所有纳入变量中,阴道镜印象、HR-HPV结果和细胞学结果是决定模型性能的前三个变量。在内部验证集中,与阴道镜检查医师相比,整合了人口统计学、筛查结果和阴道镜印象的六种ML模型在曲线下面积(AUC)(0.067至0.099)和敏感性(11.55%至14.88%)方面有显著改善。在外部验证集中,六种模型的AUC(0.087至0.119)和敏感性(17.17%至22.08%)有更大幅度的提高。通过纳入人口统计学、筛查结果和阴道镜印象,ML提高了检测阴道镜检查转诊患者HSIL+的AUC和敏感性。此类模型可将主观经验转化为客观判断,以帮助临床医生在阴道镜检查时做出决策。