Rickard Declan, Kabir Muhammad Ashad, Homaira Nusrat
School of Clinical Medicine, UNSW Sydney, Kensington, NSW, 2052, Australia.
School of Computing, Mathematics and Engineering, Charles Sturt University, Bathurst, NSW, 2795, Australia; Artificial Intelligence and Cyber Futures Institute, Charles Sturt University, Bathurst, NSW, 2795, Australia.
Comput Methods Programs Biomed. 2025 Aug;268:108802. doi: 10.1016/j.cmpb.2025.108802. Epub 2025 May 8.
Pneumonia is the leading cause of hospitalisation and mortality among children under five, particularly in low-resource settings. Accurate differentiation between viral and bacterial pneumonia is essential for guiding appropriate treatment, yet it remains challenging due to overlapping clinical and radiographic features. Advances in machine learning (ML), particularly deep learning (DL), have shown promise in classifying pneumonia using chest X-ray (CXR) images. This scoping review summarises the evidence on ML techniques for classifying viral and bacterial pneumonia using CXR images in paediatric patients.
This scoping review was conducted following the Joanna Briggs Institute methodology and the PRISMA-ScR guidelines. A comprehensive search was performed in PubMed, Embase, and Scopus to identify studies involving children (0-18 years) with pneumonia diagnosed through CXR, using ML models for binary or multiclass classification. Data extraction included ML models, dataset characteristics, and performance metrics.
A total of 35 studies, published between 2018 and 2025, were included in this review. Of these, 31 studies used the publicly available Kermany dataset, raising concerns about overfitting and limited generalisability to broader, real-world clinical populations. Most studies (n=33) used convolutional neural networks (CNNs) for pneumonia classification. While many models demonstrated promising performance, significant variability was observed due to differences in methodologies, dataset sizes, and validation strategies, complicating direct comparisons. For binary classification (viral vs bacterial pneumonia), a median accuracy of 92.3% (range: 80.8% to 97.9%) was reported. For multiclass classification (healthy, viral pneumonia, and bacterial pneumonia), the median accuracy was 91.8% (range: 76.8% to 99.7%).
Current evidence is constrained by a predominant reliance on a single dataset and variability in methodologies, which limit the generalisability and clinical applicability of findings. To address these limitations, future research should focus on developing diverse and representative datasets while adhering to standardised reporting guidelines. Such efforts are essential to improve the reliability, reproducibility, and translational potential of machine learning models in clinical settings.
肺炎是五岁以下儿童住院和死亡的主要原因,在资源匮乏地区尤其如此。准确区分病毒性肺炎和细菌性肺炎对于指导恰当治疗至关重要,但由于临床和影像学特征重叠,这仍然具有挑战性。机器学习(ML)的进展,尤其是深度学习(DL),在利用胸部X光(CXR)图像对肺炎进行分类方面显示出了前景。本范围综述总结了关于使用CXR图像对儿科患者的病毒性和细菌性肺炎进行分类的ML技术的证据。
本范围综述按照乔安娜·布里格斯研究所的方法和PRISMA-ScR指南进行。在PubMed、Embase和Scopus中进行了全面检索,以识别涉及通过CXR诊断为肺炎的儿童(0至18岁)的研究,这些研究使用ML模型进行二元或多类分类。数据提取包括ML模型、数据集特征和性能指标。
本综述共纳入了2018年至2025年间发表的35项研究。其中,31项研究使用了公开可用的克曼尼数据集,这引发了对过度拟合以及对更广泛的真实世界临床人群的可推广性有限的担忧。大多数研究(n = 33)使用卷积神经网络(CNN)进行肺炎分类。虽然许多模型表现出了有前景的性能,但由于方法、数据集大小和验证策略的差异,观察到了显著的变异性,这使得直接比较变得复杂。对于二元分类(病毒性肺炎与细菌性肺炎),报告的中位准确率为92.3%(范围:80.8%至97.9%)。对于多类分类(健康、病毒性肺炎和细菌性肺炎),中位准确率为91.8%(范围:76.8%至99.7%)。
当前证据受到对单一数据集的主要依赖以及方法变异性的限制,这限制了研究结果的可推广性和临床适用性。为解决这些局限性,未来研究应专注于开发多样化和有代表性的数据集,同时遵循标准化报告指南。这些努力对于提高机器学习模型在临床环境中的可靠性、可重复性和转化潜力至关重要。