Sebastian Neenu, Ankayarkanni B
Department of Computer Science and Engineering, School of Computing, Sathyabama Institute of Science and Technology (Deemed to be University), Chennai 600119, India.
Diagnostics (Basel). 2025 Aug 14;15(16):2041. doi: 10.3390/diagnostics15162041.
Pneumonia is a critical lung infection that demands timely and precise diagnosis, particularly during the evaluation of chest X-ray images. Deep learning is widely used for pneumonia detection but faces challenges such as poor denoising, limited feature diversity, low interpretability, and class imbalance issues. This study aims to develop an optimized ResNet-50 based framework for accurate pneumonia detection. The proposed approach integrates Multiscale Curvelet Filtering with Directional Denoising (MCF-DD) as a preprocessing step to suppress noise while preserving diagnostic details. Multi-feature fusion is performed by combining deep features extracted from ResNet-50 with handcrafted texture descriptors such as Local Binary Patterns (LBPs), leveraging both semantic and structural information. Precision attention mechanisms are incorporated to enhance interpretability by highlighting diagnostically relevant regions. Validation on the Kaggle chest radiograph dataset demonstrates that the proposed model achieves higher accuracy, sensitivity, specificity, and other performance metrics compared to existing methods. The inclusion of MCF-DD preprocessing, multi-feature fusion, and precision attention contributes to improved robustness and diagnostic reliability. The optimized ResNet-50 framework, enhanced by noise suppression, multi-feature fusion, and attention mechanisms, offers a more accurate and interpretable solution for pneumonia detection from chest X-ray images, addressing key challenges in existing deep learning approaches.
肺炎是一种严重的肺部感染,需要及时、准确的诊断,尤其是在胸部X光图像评估期间。深度学习广泛用于肺炎检测,但面临着诸如去噪效果差、特征多样性有限、可解释性低和类别不平衡等问题。本研究旨在开发一种基于优化的ResNet-50框架,用于准确的肺炎检测。所提出的方法将多尺度曲波滤波与方向去噪(MCF-DD)集成作为预处理步骤,以抑制噪声同时保留诊断细节。通过将从ResNet-50提取的深度特征与手工纹理描述符(如局部二值模式(LBP))相结合来进行多特征融合,利用语义和结构信息。纳入精确注意力机制,通过突出诊断相关区域来增强可解释性。在Kaggle胸部X光数据集上的验证表明,与现有方法相比,所提出的模型实现了更高的准确率、灵敏度、特异性和其他性能指标。MCF-DD预处理、多特征融合和精确注意力的纳入有助于提高鲁棒性和诊断可靠性。通过噪声抑制、多特征融合和注意力机制增强的优化ResNet-50框架,为从胸部X光图像中检测肺炎提供了一种更准确、可解释的解决方案,解决了现有深度学习方法中的关键挑战。