Mishra Priya, Vipsita Swati
Department of Computer Science and Engineering, IIIT Bhubaneswar, Odisha, India.
J Comput Aided Mol Des. 2025 Jun 16;39(1):31. doi: 10.1007/s10822-025-00605-4.
In drug discovery, virtual screening and repositioning rely on accurate Drug-Target Binding Affinity (DTBA) prediction to develop effective therapies. However, DTBA prediction remains challenging due to limited annotated datasets, high-dimensional biochemical data, and heterogeneous data sources, including chemical structures, biological sequences, and molecular interactions. These complexities hinder the development of unified deep-learning frameworks. To address these challenges, we propose DTBA-Net, a novel hybrid neural network model that enhances DTBA prediction accuracy and efficiency. DTBA-Net integrates optimal feature selection within a CNN architecture to predict DTBA. Protein sequences and compound structures are processed through a hybrid CNN that includes convolutional layers, a flattened layer, a Modified JAYA Algorithm for optimal feature selection, and dense blocks. The Modified JAYA algorithm selects relevant features, reduces computational overhead, and improves predictive performance. DTBA-Net was evaluated on two benchmark datasets, KIBA and DAVIS. On the DAVIS dataset, DTBA-Net attained an R-squared value of 0.95 and a Mean Absolute Error (MAE) of 0.17. Further validation using the drug Nirmatrelvir resulted in an R-squared value of 0.96, showcasing the model's robustness and scalability. Integrating a hybrid neural network with an optimized feature selection process accelerates model training and enhances prediction accuracy. DTBA-Net demonstrates promising potential for scalable, efficient, and accurate DTBA prediction, facilitating faster and more reliable drug discovery.
在药物发现中,虚拟筛选和重新定位依赖于准确的药物-靶点结合亲和力(DTBA)预测来开发有效的治疗方法。然而,由于注释数据集有限、高维生化数据以及包括化学结构、生物序列和分子相互作用在内的异质数据源,DTBA预测仍然具有挑战性。这些复杂性阻碍了统一深度学习框架的发展。为了应对这些挑战,我们提出了DTBA-Net,一种新颖的混合神经网络模型,可提高DTBA预测的准确性和效率。DTBA-Net在CNN架构中集成了最优特征选择以预测DTBA。蛋白质序列和化合物结构通过一个混合CNN进行处理,该混合CNN包括卷积层、展平层、用于最优特征选择的改进JAYA算法和密集块。改进的JAYA算法选择相关特征,减少计算开销,并提高预测性能。DTBA-Net在两个基准数据集KIBA和DAVIS上进行了评估。在DAVIS数据集上,DTBA-Net的决定系数R²值为0.95,平均绝对误差(MAE)为0.17。使用药物奈玛特韦进行的进一步验证得到的R²值为0.96,展示了该模型的稳健性和可扩展性。将混合神经网络与优化的特征选择过程相结合可加速模型训练并提高预测准确性。DTBA-Net在可扩展、高效且准确的DTBA预测方面显示出有前景的潜力,有助于更快且更可靠的药物发现。