Abdullah Alohali Manal, Alqahtani Hamed, Ebad Shouki A, Alotaibi Faiz Abdullah, K Venkatachalam, Cho Jaehyuk
Department of Information Systems, Princess Nourah Bint Abdulrahman University, Riyadh, Saudi Arabia.
Department of Information Systems, King Khalid University, Abha, Saudi Arabia.
PeerJ Comput Sci. 2025 May 27;11:e2853. doi: 10.7717/peerj-cs.2853. eCollection 2025.
Lung cancer remains one of the most prevalent and life-threatening diseases, often diagnosed at an advanced stage due to the challenges in early detection. Contributory factors include genetic mutations, smoking, alcohol consumption, and exposure to hazardous environmental conditions. Computer-aided diagnosis (CAD) systems have significantly improved early cancer detection, but limitations such as high-dimensional feature sets and overfitting issues persist. This study presents an optimised deep learning approach for lung cancer classification, integrating flying fox optimization (FFXO) for feature selection and bidirectional generative adversarial networks (Bi-GAN) for classification. The methodology consists of three key phases: (1) Data preprocessing, where missing values are handled using the multiple imputations by chain equation (MICE) technique and feature scaling is applied using standard and min-max scalers; (2) Feature selection, where the FFXO algorithm reduces feature dimensionality to enhance classification efficiency; and (3) Lung tumor classification, utilizing Bi-GAN to improve predictive accuracy. The proposed system was evaluated using key performance metrics-accuracy, precision, recall, and F1-score-and demonstrated superior performance to conventional models. Experimental results on a publicly available lung cancer dataset showed an accuracy of 98.7% highlighting the approach's robustness in precise lung tumor classification. This study provides a novel framework for improving the reliability and efficiency of lung cancer detection, offering significant potential for clinical applications.
肺癌仍然是最普遍且危及生命的疾病之一,由于早期检测存在挑战,往往在晚期才被诊断出来。促成因素包括基因突变、吸烟、饮酒以及暴露于有害环境条件下。计算机辅助诊断(CAD)系统显著改善了癌症的早期检测,但诸如高维特征集和过拟合问题等局限性仍然存在。本研究提出了一种优化的深度学习方法用于肺癌分类,集成了用于特征选择的狐蝠优化算法(FFXO)和用于分类的双向生成对抗网络(Bi-GAN)。该方法包括三个关键阶段:(1)数据预处理,使用链式方程多重插补(MICE)技术处理缺失值,并使用标准和最小-最大缩放器进行特征缩放;(2)特征选择,其中FFXO算法降低特征维度以提高分类效率;(3)肺肿瘤分类,利用Bi-GAN提高预测准确性。所提出的系统使用关键性能指标——准确率、精确率、召回率和F1分数进行评估,并表现出优于传统模型的性能。在一个公开可用的肺癌数据集上的实验结果显示准确率为98.7%,突出了该方法在精确肺肿瘤分类中的稳健性。本研究为提高肺癌检测的可靠性和效率提供了一个新颖的框架,在临床应用中具有巨大潜力。