Truong Gia-Bao, Pham Thanh-An, To Van-Thinh, Le Hoang-Son Lai, Van Nguyen Phuoc-Chung, Trinh The-Chuong, Phan Tieu-Long, Truong Tuyen Ngoc
Faculty of Pharmacy, University of Medicine and Pharmacy at Ho Chi Minh City, 41 Dinh Tien Hoang, District 1, Ho Chi Minh City 700000, Vietnam.
Faculty of Pharmacy, Grenoble Alpes University, La Tronche 38700, France.
ACS Omega. 2024 Nov 12;9(47):47180-47193. doi: 10.1021/acsomega.4c07689. eCollection 2024 Nov 26.
In the development of anticancer medications, vascular endothelial growth factor receptor 2 (VEGFR-2), which belongs to the protein tyrosine kinase family, emerges as one of the most significant targets of interest. The ongoing Food and Drug Administration (FDA) approval of novel therapeutic medicines toward VEGFR-2 emphasizes the urgent need to discover sophisticated molecular structures that are capable of reliably limiting VEGFR-2 activity. Recognizing the huge potential of deep-learning-based molecular model advancements, we focused our study on exploring the chemical space to find small molecules potentially inhibiting VEGFR-2. To achieve this goal, we utilized the junction tree variational autoencoder in combination with two optimization approaches on the latent space: the local Bayesian optimization on the initial data set and the gradient ascent on nine FDA-approved drugs targeting VEGFR-2. The optimization results yielded a set of 493 uncharted small molecules. Quantitative structure-activity relationship (QSAR) models and molecular docking were used to assess the generated molecules for their inhibitory potential using their predicted pIC and binding affinity. The QSAR model constructed on RDK7 fingerprints using the CatBoost algorithm achieved remarkable coefficients of determination ( ) of 0.792 ± 0.075 and 0.859 with respect to internal and external validation. Molecular docking was implemented using the 4ASD complex with optimistic retrospective control results (the ROC-AUC value was 0.710 and the binding activity threshold was -7.90 kcal/mol). Newly generated molecules possessing acceptable results corresponding to both assessments were shortlisted and checked for interactions with the protein at the binding site on important residues, including Cys919, Asp1046, and Glu885.
在抗癌药物的研发中,属于蛋白酪氨酸激酶家族的血管内皮生长因子受体2(VEGFR - 2)成为最受关注的重要靶点之一。美国食品药品监督管理局(FDA)对针对VEGFR - 2的新型治疗药物的持续批准,凸显了发现能够可靠限制VEGFR - 2活性的精密分子结构的迫切需求。认识到基于深度学习的分子模型进步的巨大潜力,我们将研究重点放在探索化学空间以寻找可能抑制VEGFR - 2的小分子上。为实现这一目标,我们将联合树变分自编码器与潜在空间上的两种优化方法:对初始数据集进行局部贝叶斯优化以及对九种FDA批准的靶向VEGFR - 2的药物进行梯度上升。优化结果产生了一组493个未知的小分子。使用定量构效关系(QSAR)模型和分子对接,通过预测的pIC和结合亲和力来评估所生成分子的抑制潜力。使用CatBoost算法基于RDK7指纹构建的QSAR模型在内部和外部验证方面分别取得了显著的决定系数( ),分别为0.792±0.075和0.859。使用4ASD复合物进行分子对接,得到了乐观的回顾性对照结果(ROC - AUC值为0.710,结合活性阈值为 - 7.90 kcal/mol)。筛选出在两项评估中均具有可接受结果的新生成分子,并检查它们与结合位点上重要残基(包括Cys919、Asp1046和Glu885)处的蛋白质的相互作用。