Pediatric Infectious Diseases Unit, Fundación para la Investigación Biomédica del Hospital 12 de Octubre, Madrid, Spain.
PENTA Foundation, Padova, Italy.
PLoS One. 2022 Oct 14;17(10):e0276116. doi: 10.1371/journal.pone.0276116. eCollection 2022.
Logistic regression (LR) is the most common prediction model in medicine. In recent years, supervised machine learning (ML) methods have gained popularity. However, there are many concerns about the utility of ML with small sample sizes. In this study, we aim to compare the performance of 7 algorithms in predicting 1-year mortality and clinical progression to AIDS in a small cohort of infants living with HIV from South Africa and Mozambique. The data set (n = 100) was randomly split into a 70% training set and a 30% validation set. Seven algorithms (LR, Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbor (KNN), Naïve Bayes (NB), Artificial Neural Network (ANN), and Elastic Net) were compared. The same predictors were used across all models and included sociodemographic, virologic, immunologic, and maternal status features. For each model, hyperparameters were tuned using 10-fold cross-validation repeated 5 times, and the best-performing values were selected. A confusion matrix was built to assess accuracy, sensitivity, and specificity. RF ranked as the best algorithm in terms of accuracy (82.8%), sensitivity (78%), and AUC (0.73). In the external validation, RF also showed better specificity and sensitivity than the other algorithms, along with the highest AUC. LR performed worse than RF, SVM, or KNN. The outcome of children living with perinatally acquired HIV can be predicted with considerable accuracy using ML algorithms. Better models would help less specialized staff in resource-limited countries to refer children promptly when the risk of clinical progression is high.
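The evaluation scheme described in the abstract (a 70/30 split, 10-fold cross-validation repeated 5 times, and confusion-matrix metrics) can be illustrated with a minimal sketch. The abstract does not state which software the authors used or whether the splits were stratified, so this is not their pipeline: it uses only the Python standard library, the function names `train_validation_split`, `repeated_kfold`, and `confusion_metrics` are hypothetical, and the toy labels are invented for illustration rather than taken from the study.

```python
import random


def train_validation_split(n, train_fraction=0.7, seed=0):
    """Shuffle indices 0..n-1 and split them into training and validation lists."""
    rng = random.Random(seed)
    indices = list(range(n))
    rng.shuffle(indices)
    cut = round(n * train_fraction)
    return indices[:cut], indices[cut:]


def repeated_kfold(indices, k=10, repeats=5, seed=0):
    """Yield (fit, holdout) index lists for k-fold CV repeated `repeats` times.

    Assumption: plain (unstratified) folds, reshuffled before each repeat.
    """
    rng = random.Random(seed)
    for _ in range(repeats):
        shuffled = indices[:]
        rng.shuffle(shuffled)
        folds = [shuffled[i::k] for i in range(k)]
        for i, holdout in enumerate(folds):
            fit = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
            yield fit, holdout


def confusion_metrics(y_true, y_pred):
    """Accuracy, sensitivity, and specificity for binary labels (1 = event)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn) if (tp + fn) else float("nan"),
        "specificity": tn / (tn + fp) if (tn + fp) else float("nan"),
    }


# n = 100 as in the abstract: 70 training and 30 validation indices.
train_idx, valid_idx = train_validation_split(100)

# 10 folds x 5 repeats = 50 (fit, holdout) pairs for hyperparameter tuning.
splits = list(repeated_kfold(train_idx, k=10, repeats=5))

# Toy labels (invented, not study data) to show the metric calculation.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 0, 1, 0, 0, 1, 0, 1]
metrics = confusion_metrics(y_true, y_pred)
print(len(train_idx), len(valid_idx), len(splits), metrics)
```

With only about 70 training records, each hold-out fold contains 7 infants, which is one reason repeating the 10-fold procedure is useful: averaging over 50 resamples gives a steadier estimate for choosing hyperparameters than a single pass would.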