Ialongo Cristiano, Ciotti Marco, Giovannelli Alfredo, Tomassetti Flaminia, Pelagalli Martina, Di Carlo Stefano, Bernardini Sergio, Pieri Massimo, Nicolai Eleonora
Department of Experimental Medicine, Policlinico Umberto I, 'Sapienza' University, 00161 Rome, Italy.
Laboratory of Clinical Microbiology and Virology, Tor Vergata University Hospital, 00133 Rome, Italy.
Antibiotics (Basel). 2025 Jul 30;14(8):768. doi: 10.3390/antibiotics14080768.
Urine microbial analysis is a frequently requested test that is often associated with contamination during specimen collection or storage, which leads to false-positive diagnoses and delayed reporting. In the era of digitalization, machine learning (ML) can serve as a valuable tool to support clinical decision-making. This study investigates the application of a simple artificial neural network (ANN) to pre-identify negative and contaminated (false-positive) specimens. An ML model was developed using 8181 urine samples, including cytology, dipstick tests, and culture results. The dataset was randomly split 2:1 for training and testing a multilayer perceptron (MLP). Input variables with a normalized importance below 0.2 were excluded. The final model used only microbial and either urine color or urobilinogen pigment analysis as inputs; other physical, chemical, and cellular parameters were omitted. The frequency of positive and negative specimens for bacteria was 6.9% and 89.6%, respectively. Contaminated specimens represented 3.5% of cases and were predominantly misclassified as negative by the MLP. Thus, the negative predictive value (NPV) was 96.5% and the positive predictive value (PPV) was 87.2%, leading to 0.82% of the cultures being unnecessary microbial cultures (UMC). These results suggest that the MLP is reliable for screening out negative specimens but less effective at identifying positive ones. In conclusion, ANN models can effectively support the screening of negative urine samples, detect clinically significant bacteriuria, and potentially reduce unnecessary cultures. Incorporating morphological information data could further improve the accuracy of our model and minimize false negatives.
尿液微生物分析是一项经常被要求进行的检测,在标本采集或储存过程中常与污染相关,这会导致假阳性诊断和报告延迟。在数字化时代,机器学习(ML)可作为支持临床决策的宝贵工具。本研究调查了一种简单的人工神经网络(ANN)在预识别阴性和污染(假阳性)标本方面的应用。使用8181份尿液样本开发了一个ML模型,包括细胞学、试纸条检测和培养结果。数据集以2:1的比例随机拆分用于训练和测试多层感知器(MLP)。归一化重要性低于0.2的输入变量被排除。最终模型仅使用微生物以及尿液颜色或尿胆原色素分析作为输入;其他物理、化学和细胞参数被省略。细菌阳性和阴性标本的频率分别为6.9%和89.6%。污染标本占病例的3.5%,主要被MLP误分类为阴性。因此,阴性预测值(NPV)为96.5%,阳性预测值(PPV)为87.2%,导致0.82%的培养为不必要的微生物培养(UMC)。这些结果表明,MLP在筛选阴性标本方面可靠,但在识别阳性标本方面效果较差。总之,ANN模型可以有效地支持阴性尿液样本的筛选,检测具有临床意义的菌尿,并可能减少不必要的培养。纳入形态学信息数据可以进一步提高我们模型的准确性并最大限度地减少假阴性。