Malik Varun, Mittal Ruchi, Gupta Deepali, Juneja Sapna, Mohiuddin Khalid, Kumari Swati
Chitkara University Institute of Engineering and Technology, Chitkara University, Rajpura, Punjab, India.
KIET Group of Institutions, Ghaziabad, India.
PLoS One. 2025 Apr 29;20(4):e0319499. doi: 10.1371/journal.pone.0319499. eCollection 2025.
Non-small cell lung cancer (NSCLC) accounts for the majority of lung cancer cases, making it the most fatal diseases worldwide. Predicting NSCLC patients' survival outcomes accurately remains a significant challenge despite advancements in treatment. The difficulties in developing effective drug therapies, which are frequently hampered by severe side effects, drug resistance, and limited effectiveness across diverse patient populations, highlight the complexity of NSCLC. The machine learning (ML) and deep learning (DL) modelsare starting to reform the field of NSCLC drug disclosure. These methodologies empower the distinguishing proof of medication targets and the improvement of customized treatment techniques that might actually upgrade endurance results for NSCLC patients. Using cutting-edge methods of feature extraction and transfer learning, we present a drug discovery model for the identification of therapeutic targets in this paper. For the purpose of extracting features from drug and protein sequences, we make use of a hybrid UNet transformer. This makes it possible to extract deep features that address the issue of false alarms. For dimensionality reduction, the modified Rime optimization (MRO) algorithm is used to select the best features among multiples. In addition, we design the deep transfer learning (DTransL) model to boost the drug discovery accuracy for NSCLC patients' therapeutic targets. Davis, KIBA, and Binding-DB are examples of benchmark datasets that are used to validate the proposed model. Results exhibit that the MRO+DTransL model outflanks existing cutting edge models. On the Davis dataset, the MRO+DTransL model performed better than the LSTM model by 9.742%, achieved an accuracy of 98.398%. It reached 98.264% and 97.344% on the KIBA and Binding-DB datasets, respectively, indicating improvements of 8.608% and 8.957% over baseline models.
非小细胞肺癌(NSCLC)占肺癌病例的大多数,是全球最致命的疾病。尽管治疗方面有所进步,但准确预测NSCLC患者的生存结果仍然是一项重大挑战。开发有效的药物疗法困难重重,常因严重的副作用、耐药性以及在不同患者群体中效果有限而受阻,这凸显了NSCLC的复杂性。机器学习(ML)和深度学习(DL)模型正开始变革NSCLC药物研发领域。这些方法有助于识别药物靶点并改进定制化治疗技术,这实际上可能改善NSCLC患者的生存结果。本文利用前沿的特征提取和迁移学习方法,提出了一种用于识别治疗靶点的药物发现模型。为了从药物和蛋白质序列中提取特征,我们使用了一种混合UNet变压器。这使得提取能够解决误报问题的深度特征成为可能。为了进行降维,使用改进的Rime优化(MRO)算法在多个特征中选择最佳特征。此外,我们设计了深度迁移学习(DTransL)模型,以提高NSCLC患者治疗靶点的药物发现准确性。Davis、KIBA和Binding-DB等是用于验证所提出模型的基准数据集示例。结果表明,MRO+DTransL模型优于现有的前沿模型。在Davis数据集上,MRO+DTransL模型比LSTM模型表现好9.742%,准确率达到98.398%。在KIBA和Binding-DB数据集上分别达到98.264%和97.344%,表明比基线模型分别提高了8.608%和8.957%。