Nguyen Thi Bang-Suong, Nguyen Hoang-Bac, Le Thi Xuan-Thao, Bui Thi Hong-Chau, Nguyen Le Song-Toan, Nguyen Thao-Huong, Nguyen Truong Cong-Minh
University Medical Center Ho Chi Minh City, Ho Chi Minh City, 700000, Vietnam.
University of Medicine and Pharmacy at Ho Chi Minh City, Ho Chi Minh City, 700000, Vietnam.
Sci Rep. 2025 May 31;15(1):19171. doi: 10.1038/s41598-025-04626-9.
Vaginitis is a prevalent gynecological condition that impacts women's quality of life, with most women likely to experience it at least once. Traditional diagnosis involves manually observing vaginal discharge samples under a microscope. This process relies heavily on the technician's expertise and is vulnerable to subjective biases. The study aimed to improve diagnostic accuracy by applying machine learning, specifically the MobileNetV2 model, to automate the classification of vaginal discharge samples. This model supports doctors in identifying causative agents of vaginitis, including Gardnerella vaginalis, fungi, and other pathogens like bacteria or Trichomonas vaginalis. A dataset of 3,164 images from 1,582 vaginal discharge samples of women aged 18 and over was analyzed. Images were taken under a 40x optical microscope with a resolution of 800 × 800 pixels and classified into three groups: Group B (mixed bacteria or Trichomonas vaginalis), Group C (Gardnerella vaginalis, identified by clue cells), and Group F (fungi, e.g., Candida albicans, which appear as hyphae or yeast cells in samples). The model was trained using 80% of data for training, 10% for validation, and 10% for testing. Performance was evaluated using two statistical metrics: the F1 score (a measure of accuracy balancing precision and recall) and the AUC-PR (Area Under the Curve of the Precision-Recall curve, a measure of model reliability for imbalanced datasets). The MobileNetV2 model performed well across all datasets, achieving an F1 score > 0.75 and an AUC-PR > 0.80. It demonstrated the best performance in identifying Gardnerella vaginalis (Group C), with both metrics exceeding 0.90. In conclusion, this study highlights MobileNetV2's potential as a rapid screening tool for vaginitis, particularly in identifying Gardnerella vaginalis (F1 score and AUC-PR > 0.90). While challenges have remained in classifying co-infections (e.g., Groups B vs. F), the model's stability across datasets underscores its practical utility. Integrating AI into vaginitis diagnosis could enhance efficiency, reduce human error, and improve early detection, ultimately advancing patient care.
阴道炎是一种常见的妇科疾病,会影响女性的生活质量,大多数女性可能至少经历过一次。传统诊断方法是在显微镜下人工观察阴道分泌物样本。这个过程严重依赖技术人员的专业知识,并且容易受到主观偏差的影响。该研究旨在通过应用机器学习,特别是MobileNetV2模型,来实现阴道分泌物样本分类的自动化,从而提高诊断准确性。该模型可帮助医生识别阴道炎的病原体,包括阴道加德纳菌、真菌以及其他病原体,如细菌或滴虫。分析了来自1582名18岁及以上女性阴道分泌物样本的3164张图像数据集。图像是在40倍光学显微镜下拍摄的,分辨率为800×800像素,并分为三组:B组(混合细菌或滴虫)、C组(通过线索细胞鉴定出的阴道加德纳菌)和F组(真菌,如白色念珠菌,在样本中表现为菌丝或酵母细胞)。使用80%的数据进行模型训练,10%用于验证,10%用于测试。使用两个统计指标评估性能:F1分数(一种平衡精确率和召回率的准确性度量)和AUC-PR(精确率-召回率曲线下面积,一种用于不平衡数据集的模型可靠性度量)。MobileNetV2模型在所有数据集中表现良好,F1分数>0.75,AUC-PR>0.80。它在识别阴道加德纳菌(C组)方面表现最佳,两个指标均超过0.90。总之,本研究突出了MobileNetV2作为阴道炎快速筛查工具的潜力,特别是在识别阴道加德纳菌方面(F1分数和AUC-PR>0.90)。虽然在对合并感染(如B组与F组)进行分类时仍存在挑战,但该模型在不同数据集上的稳定性强调了其实用价值。将人工智能整合到阴道炎诊断中可以提高效率、减少人为误差并改善早期检测,最终推动患者护理的发展。