Jaiswal Ravishankar, Bhati Girdhar, Ahmed Shakil, Siddiqi Mohammad Imran
Biochemistry and Structural Biology Division, CSIR-Central Drug Research Institute, Sector 10, Jankipuram Extension, Sitapur Road, Lucknow, 226031, India.
Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India.
Mol Divers. 2024 Dec 8. doi: 10.1007/s11030-024-11055-9.
Triple-negative breast cancer (TNBC) lacks estrogen, progesterone, and HER2 expression, accounting for 15-20% of breast cancer cases. It is challenging due to low therapeutic response, heterogeneity, and aggressiveness. The PI3Ka isoform is a promising therapeutic target, often hyperactivated in TNBC, contributing to uncontrolled growth and cancer cell formation. We have proposed an interpretable deep convolutional neural network prediction (iDCNNPred) system using 2D molecular images to classify bioactivity and identify potential PI3Ka inhibitors. We built Custom-DCNN models and pre-trained models such as AlexNet, SqueezeNet, and VGG19 by using the Bayesian optimization algorithm, and found that our Custom-DCNN model performed better than a pre-trained model with lower complexity and memory usage. All top-performed models were screened with the Maybridge Chemical library to find predictive hit molecules. The screened molecules were further evaluated for protein-ligand interaction with molecular docking and finally 12 promising hits were shortlisted for biological validation using in-vitro PI3K inhibition studies. After biological evaluation, 4 potent molecules with different structural moieties were identified, and these molecules present new starting scaffolds for further improvement in terms of their potency and selectivity as PI3K inhibitors with the help of medicinal chemistry efforts. Furthermore, we also showed the significance of the interpretation and visualization of the model's predictions by the Grad-CAM technique, enhancing the robustness, transparency, and interpretability of the model's predictions. The data and script files and prediction run of models used for this study to reproduce the experiment are available in the GitHub repository at https://github.com/ravishankar1307/iDCNNPred.git .
三阴性乳腺癌(TNBC)缺乏雌激素、孕激素和HER2表达,占乳腺癌病例的15 - 20%。由于治疗反应低、异质性和侵袭性,它具有挑战性。PI3Ka亚型是一个有前景的治疗靶点,在TNBC中常被过度激活,导致不受控制的生长和癌细胞形成。我们提出了一种可解释的深度卷积神经网络预测(iDCNNPred)系统,使用二维分子图像对生物活性进行分类并识别潜在的PI3Ka抑制剂。我们通过贝叶斯优化算法构建了自定义深度卷积神经网络(Custom - DCNN)模型以及AlexNet、SqueezeNet和VGG19等预训练模型,发现我们的Custom - DCNN模型比复杂度和内存使用较低的预训练模型表现更好。所有表现最佳的模型都用Maybridge化学文库进行筛选以找到预测命中分子。对筛选出的分子进行进一步评估,通过分子对接研究蛋白质 - 配体相互作用,最后筛选出12个有前景的命中分子用于体外PI3K抑制研究的生物学验证。经过生物学评估,鉴定出4种具有不同结构部分的有效分子,借助药物化学研究,这些分子为进一步提高其作为PI3K抑制剂的效力和选择性提供了新的起始骨架。此外,我们还通过Grad - CAM技术展示了模型预测解释和可视化的重要性,增强了模型预测的稳健性、透明度和可解释性。本研究用于重现实验的模型的数据和脚本文件以及预测运行可在GitHub仓库https://github.com/ravishankar1307/iDCNNPred.git获取。