Suppr超能文献

预测男男性行为者感染艾滋病毒和性传播疾病的风险:使用多种机器学习方法的横断面研究

Predicting the Risk of HIV Infection and Sexually Transmitted Diseases Among Men Who Have Sex With Men: Cross-Sectional Study Using Multiple Machine Learning Approaches.

作者信息

Lin Bing, Liu Jiaxiu, Li Kangjie, Zhong Xiaoni

机构信息

School of Public Health, Chongqing Medical University, Chongqing, China.

Research Center for Medicine and Social Development, Chongqing Medical University, Chongqing, China.

出版信息

J Med Internet Res. 2025 Feb 20;27:e59101. doi: 10.2196/59101.

Abstract

BACKGROUND

Men who have sex with men (MSM) are at high risk for HIV infection and sexually transmitted diseases (STDs). However, there is a lack of accurate and convenient tools to assess this risk.

OBJECTIVE

This study aimed to develop machine learning models and tools to predict and assess the risk of HIV infection and STDs among MSM.

METHODS

We conducted a cross-sectional study that collected individual characteristics of 1999 MSM with negative or unknown HIV serostatus in Western China from 2013 to 2023. MSM self-reported their STD history and were tested for HIV. We compared the accuracy of 6 machine learning methods in predicting the risk of HIV infection and STDs using 7 parameters for a comprehensive assessment, ranking the methods according to their performance in each parameter. We selected data from the Sichuan MSM for external validation.

RESULTS

Of the 1999 MSM, 72 (3.6%) tested positive for HIV and 146 (7.3%) self-reported a history of previous STD infection. After taking the results of the intersection of the 3 feature screening methods, a total of 7 and 5 predictors were screened for predicting HIV infection and STDs, respectively, and multiple machine learning prediction models were constructed. Extreme gradient boost models performed optimally in predicting the risk of HIV infection and STDs, with area under the curve values of 0.777 (95% CI 0.639-0.915) and 0.637 (95% CI 0.541-0.732), respectively, demonstrating stable performance in both internal and external validation. The highest combined predictive performance scores of HIV and STD models were 33 and 39, respectively. Interpretability analysis showed that nonadherence to condom use, low HIV knowledge, multiple male partners, and internet dating were risk factors for HIV infection. Low degree of education, internet dating, and multiple male and female partners were risk factors for STDs. The risk stratification analysis showed that the optimal model effectively distinguished between high- and low-risk MSM. MSM were classified into HIV (predicted risk score <0.506 and ≥0.506) and STD (predicted risk score <0.479 and ≥0.479) risk groups. In total, 22.8% (114/500) were in the HIV high-risk group, and 43% (215/500) were in the STD high-risk group. HIV infection and STDs were significantly higher in the high-risk groups (P<.001 and P=.05, respectively), with higher predicted probabilities (P<.001 for both). The prediction results of the optimal model were displayed in web applications for probability estimation and interactive computation.

CONCLUSIONS

Machine learning methods have demonstrated strengths in predicting the risk of HIV infection and STDs among MSM. Risk stratification models and web applications can facilitate clinicians in accurately assessing the risk of infection in individuals with high risk, especially MSM with concealed behaviors, and help them to self-monitor their risk for targeted, timely diagnosis and interventions to reduce new infections.

摘要

背景

男男性行为者(MSM)感染人类免疫缺陷病毒(HIV)和性传播疾病(STD)的风险很高。然而,缺乏准确且便捷的工具来评估这种风险。

目的

本研究旨在开发机器学习模型和工具,以预测和评估男男性行为者感染HIV和性传播疾病的风险。

方法

我们进行了一项横断面研究,收集了2013年至2023年中国西部1999名HIV血清学状态为阴性或未知的男男性行为者的个体特征。男男性行为者自行报告其性传播疾病史并接受HIV检测。我们使用7个参数比较了6种机器学习方法在预测HIV感染和性传播疾病风险方面的准确性,以便进行综合评估,并根据它们在每个参数上的表现对方法进行排名。我们从四川的男男性行为者中选取数据进行外部验证。

结果

在1999名男男性行为者中,72人(3.6%)HIV检测呈阳性,146人(7.3%)自行报告有既往性传播疾病感染史。在采用3种特征筛选方法的交集结果后,分别共筛选出7个和5个预测因子用于预测HIV感染和性传播疾病,并构建了多个机器学习预测模型。极端梯度提升模型在预测HIV感染和性传播疾病风险方面表现最佳,曲线下面积值分别为0.777(95%置信区间0.639 - 0.915)和0.637(95%置信区间0.541 - 0.732),在内部和外部验证中均表现出稳定的性能。HIV和性传播疾病模型的最高综合预测性能得分分别为33和39。可解释性分析表明,不坚持使用安全套、HIV知识水平低、多个男性性伴侣以及网络交友是HIV感染的风险因素。低教育程度、网络交友以及多个男性和女性性伴侣是性传播疾病的风险因素。风险分层分析表明,最佳模型有效地将高风险和低风险的男男性行为者区分开来。男男性行为者被分为HIV(预测风险评分<0.506和≥0.506)和性传播疾病(预测风险评分<0.479和≥0.479)风险组。共有22.8%(114/500)属于HIV高风险组,43%(215/500)属于性传播疾病高风险组。高风险组中的HIV感染和性传播疾病发生率显著更高(分别为P<0.001和P = 0.05),预测概率也更高(两者均为P<0.001)。最佳模型的预测结果显示在网络应用程序中,用于概率估计和交互式计算。

结论

机器学习方法在预测男男性行为者感染HIV和性传播疾病的风险方面已展现出优势。风险分层模型和网络应用程序可以帮助临床医生准确评估高风险个体,尤其是有隐蔽行为的男男性行为者的感染风险,并帮助他们自我监测风险,以便进行有针对性的、及时的诊断和干预,从而减少新感染病例。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a31/11888048/4f29b72c8502/jmir_v27i1e59101_fig1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验