Soe Nyi Nyi, Latt Phyu Mon, Yu Zhen, Lee David, Kim Cham-Mill, Tran Daniel, Ong Jason J, Ge Zongyuan, Fairley Christopher K, Zhang Lei
Melbourne Sexual Health Centre, Alfred Health, Melbourne, Australia; Central Clinical School, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia.
Central Clinical School, Faculty of Medicine, Nursing and Health Sciences, Monash University, Melbourne, Australia; Monash e-Research Centre, Faculty of Engineering, Airdoc Research, Nvidia AI Technology Research Centre, Monash University, Melbourne, Australia.
J Infect. 2024 Apr;88(4):106128. doi: 10.1016/j.jinf.2024.106128. Epub 2024 Mar 5.
Many sexual health services are overwhelmed and cannot cater for all the individuals who present with sexually transmitted infections (STIs). Digital health software that separates STIs from non-STIs could improve the efficiency of clinical services. We developed and evaluated a machine learning model that predicts whether patients have an STI based on their clinical features.
We manually extracted 25 demographic features and clinical features from 1315 clinical records in the electronic health record system at Melbourne Sexual Health Center. We examined 16 machine learning models to predict a binary outcome of an STI or a non-STI diagnosis. We evaluated the models' performance with the area under the ROC curve (AUC), accuracy and F1-scores.
Our study included 1315 consultations, of which 36.8% (484/1315) were diagnosed with STIs and 63.2% (831/1315) had non-STI conditions. The study population predominantly consisted of heterosexual men (49.5%, 651/1315), followed by gay, bisexual and other men who have sex with men (GBMSM) (25.7%), women (21.6%) and unknown gender (3.2%). The median age was 31 years (intra-quartile range (IQR) 26-39). The top 5 performing models were CatBoost (AUC 0.912), Random Forest (AUC 0.917), LightGBM (AUC 0.907), Gradient Boosting (AUC 0.905) and XGBoost (AUC 0.900). The best model, CatBoost, achieved an accuracy of 0.837, sensitivity of 0.776, specificity of 0.831, precision of 0.782 and F1-score of 0.778. The key important features were lesion duration, type of skin lesions, age, gender, history of skin disorders, number of lesions, dysuria duration, anorectal pain and itchiness.
Our best model demonstrates a reasonable performance in distinguishing STIs from non-STIs. However, to be clinically useful, more detailed information such as clinical images, may be required to reach sufficient accuracy.
许多性健康服务不堪重负,无法满足所有出现性传播感染(STIs)的个体需求。将性传播感染与非性传播感染区分开来的数字健康软件可以提高临床服务效率。我们开发并评估了一种基于患者临床特征预测其是否患有性传播感染的机器学习模型。
我们从墨尔本性健康中心电子健康记录系统中的1315份临床记录中手动提取了25个人口统计学特征和临床特征。我们研究了16种机器学习模型,以预测性传播感染或非性传播感染诊断的二元结果。我们使用ROC曲线下面积(AUC)、准确率和F1分数评估模型的性能。
我们的研究包括1315次会诊,其中36.8%(484/1315)被诊断为性传播感染,63.2%(831/1315)患有非性传播感染疾病。研究人群主要由异性恋男性(49.5%,651/1315)组成,其次是男同性恋、双性恋和其他与男性发生性关系的男性(GBMSM)(25.7%)、女性(21.6%)和性别不明者(3.2%)。年龄中位数为31岁(四分位间距(IQR)26 - 39)。表现最佳的前5个模型是CatBoost(AUC 0.912)、随机森林(AUC 0.917)、LightGBM(AUC 0.907)、梯度提升(AUC 0.905)和XGBoost(AUC 0.900)。最佳模型CatBoost的准确率为0.837,灵敏度为0.776,特异性为0.831,精确率为0.782,F1分数为0.778。关键的重要特征是病损持续时间、皮肤病变类型、年龄、性别、皮肤疾病史、病损数量、排尿困难持续时间、肛门直肠疼痛和瘙痒。
我们的最佳模型在区分性传播感染与非性传播感染方面表现出合理的性能。然而,要在临床上有用,可能需要更详细的信息,如临床图像,以达到足够的准确性。