Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia.
Institute for Child and Youth Health Care of Vojvodina, 21000 Novi Sad, Serbia.
Viruses. 2023 Jul 8;15(7):1522. doi: 10.3390/v15071522.
In order to limit the spread of the novel betacoronavirus (SARS-CoV-2), it is necessary to detect positive cases as soon as possible and isolate them. For this purpose, machine-learning algorithms, as a field of artificial intelligence, have been recognized as a promising tool. The aim of this study was to assess the utility of the most common machine-learning algorithms in the rapid triage of children with suspected COVID-19 using easily accessible and inexpensive laboratory parameters. A cross-sectional study was conducted on 566 children treated for respiratory diseases: 280 children with PCR-confirmed SARS-CoV-2 infection and 286 children with respiratory symptoms who were SARS-CoV-2 PCR-negative (control group). Six machine-learning algorithms, based on the blood laboratory data, were tested: random forest, support vector machine, linear discriminant analysis, artificial neural network, k-nearest neighbors, and decision tree. The training set was validated through stratified cross-validation, while the performance of each algorithm was confirmed by an independent test set. Random forest and support vector machine models demonstrated the highest accuracy of 85% and 82.1%, respectively. The models demonstrated better sensitivity than specificity and better negative predictive value than positive predictive value. The F1 score was higher for the random forest than for the support vector machine model, 85.2% and 82.3%, respectively. This study might have significant clinical applications, helping healthcare providers identify children with COVID-19 in the early stage, prior to PCR and/or antigen testing. Additionally, machine-learning algorithms could improve overall testing efficiency with no extra costs for the healthcare facility.
为了限制新型贝塔冠状病毒(SARS-CoV-2)的传播,有必要尽快发现并隔离阳性病例。为此,机器学习算法作为人工智能的一个领域,已被认为是一种很有前途的工具。本研究旨在评估最常见的机器学习算法在使用易于获取和廉价的实验室参数对疑似 COVID-19 的儿童进行快速分诊中的效用。在一项针对 566 名因呼吸系统疾病接受治疗的儿童的横断面研究中,包括 280 名经 PCR 确诊的 SARS-CoV-2 感染儿童和 286 名 SARS-CoV-2 PCR 阴性的呼吸系统症状儿童(对照组)。使用基于血液实验室数据的六种机器学习算法进行了测试:随机森林、支持向量机、线性判别分析、人工神经网络、k-最近邻和决策树。训练集通过分层交叉验证进行验证,而每个算法的性能通过独立测试集进行确认。随机森林和支持向量机模型的准确性最高,分别为 85%和 82.1%。这些模型的敏感性优于特异性,阴性预测值优于阳性预测值。随机森林模型的 F1 评分高于支持向量机模型,分别为 85.2%和 82.3%。这项研究可能具有重要的临床应用价值,有助于医疗保健提供者在 PCR 和/或抗原检测之前尽早识别 COVID-19 儿童。此外,机器学习算法可以提高整体检测效率,而不会给医疗机构带来额外的成本。