Shaik Ayesha, Dutta Shivanya Shomir, Sawant Ishaan Milind, Kumar Shreyas, Balasundaram Ananthakrishnan, De Kanjar
Centre for Cyber Physical Systems, Vellore Institute of Technology (VIT), Chennai, 600127, India.
School of Computer Science and Engineering, Vellore Institute of Technology (VIT), Chennai, 600127, India.
Sci Rep. 2025 May 5;15(1):15680. doi: 10.1038/s41598-025-00025-2.
Skin lesions remain a significant global health issue, with their incidence rising steadily over the past few years. Early and accurate detection is crucial for effective treatment and improving patient outcomes. This work explores the integration of advanced Convolutional Neural Networks (CNNs) with Bidirectional Long Short Term Memory (BiLSTM) enhanced by spatial, channel, and temporal attention mechanisms to improve the classification of skin lesions. The hybrid model is trained to distinguish between various skin lesions with high precision. Among the models evaluated, the CNN (original architecture) with BiLSTM and attention mechanisms model achieved the highest performance, with an accuracy of 92.73%, precision of 92.84%, F1 score of 92.70%, recall of 92.73%, Jaccard Index (JAC) of 87.08%, Dice Coefficient (DIC) of 92.70%, and Matthews Correlation Coefficient (MCC) of 91.55%. The proposed model was compared to other configurations, including CNN with Gated Recurrent Units (GRU) and attention mechanisms, CNN with LSTM and attention mechanisms, CNN with BiGRU and attention mechanisms, CNN with BiLSTM, CNN with LSTM, CNN with BiGRU, CNN with GRU, standalone CNN, InceptionV3, Visual Geometry Group-16 (VGG16), and Xception, to highlight the efficacy of the proposed approach. This research aims to empower healthcare professionals by providing a robust diagnostic tool that enhances accuracy and supports proactive management strategies. The model's ability to analyze high-resolution images and capture complex features of skin lesions promises significant advancements in early detection and personalized treatment. This work not only seeks to advance the technological capabilities in skin lesion diagnostics but also aims to mitigate the disease's impact through timely interventions and improved healthcare outcomes, ultimately enhancing public health resilience on a global scale.
皮肤病变仍然是一个重大的全球健康问题,在过去几年中其发病率稳步上升。早期准确检测对于有效治疗和改善患者预后至关重要。这项工作探索了将先进的卷积神经网络(CNN)与通过空间、通道和时间注意力机制增强的双向长短期记忆(BiLSTM)相结合,以改善皮肤病变的分类。该混合模型经过训练,能够高精度地区分各种皮肤病变。在评估的模型中,具有BiLSTM和注意力机制的CNN(原始架构)模型表现最佳,准确率为92.73%,精确率为92.84%,F1分数为92.70%,召回率为92.73%,杰卡德指数(JAC)为87.08%,骰子系数(DIC)为92.70%,马修斯相关系数(MCC)为91.55%。将所提出的模型与其他配置进行了比较,包括具有门控循环单元(GRU)和注意力机制的CNN、具有长短期记忆(LSTM)和注意力机制的CNN、具有双向门控循环单元(BiGRU)和注意力机制的CNN、具有BiLSTM的CNN、具有LSTM的CNN、具有BiGRU的CNN、具有GRU的CNN、独立的CNN、InceptionV3、视觉几何组16(VGG16)和Xception,以突出所提出方法的有效性。本研究旨在通过提供一种强大的诊断工具来增强医疗保健专业人员的能力,该工具可提高准确性并支持积极的管理策略。该模型分析高分辨率图像并捕捉皮肤病变复杂特征的能力有望在早期检测和个性化治疗方面取得重大进展。这项工作不仅旨在提高皮肤病变诊断的技术能力,还旨在通过及时干预和改善医疗保健结果来减轻疾病的影响,最终在全球范围内增强公众健康复原力。