Ai Daiqiao, Wu Jingxing, Cai Hanxuan, Zhao Duancheng, Chen Yihao, Wei Jiajia, Xu Jianrong, Zhang Jiquan, Wang Ling
School of Biology and Biological Engineering, Guangdong Provincial Key Laboratory of Fermentation and Enzyme Engineering, Joint International Research Laboratory of Synthetic Biology and Medicine, Guangdong Provincial Engineering and Technology Research Center of Biopharmaceuticals, South China University of Technology, Guangzhou, China.
Department of Pharmacology and Chemical Biology, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
Front Pharmacol. 2022 Oct 11;13:971369. doi: 10.3389/fphar.2022.971369. eCollection 2022.
PARP (poly ADP-ribose polymerase) family is a crucial DNA repair enzyme that responds to DNA damage, regulates apoptosis, and maintains genome stability; therefore, PARP inhibitors represent a promising therapeutic strategy for the treatment of various human diseases including COVID-19. In this study, a multi-task FP-GNN (Fingerprint and Graph Neural Networks) deep learning framework was proposed to predict the inhibitory activity of molecules against four PARP isoforms (PARP-1, PARP-2, PARP-5A, and PARP-5B). Compared with baseline predictive models based on four conventional machine learning methods such as RF, SVM, XGBoost, and LR as well as six deep learning algorithms such as DNN, Attentive FP, MPNN, GAT, GCN, and D-MPNN, the evaluation results indicate that the multi-task FP-GNN method achieves the best performance with the highest average BA, F1, and AUC values of 0.753 ± 0.033, 0.910 ± 0.045, and 0.888 ± 0.016 for the test set. In addition, Y-scrambling testing successfully verified that the model was not results of chance correlation. More importantly, the interpretability of the multi-task FP-GNN model enabled the identification of key structural fragments associated with the inhibition of each PARP isoform. To facilitate the use of the multi-task FP-GNN model in the field, an online webserver called PARPi-Predict and its local version software were created to predict whether compounds bear potential inhibitory activity against PARPs, thereby contributing to design and discover better selective PARP inhibitors.
聚(ADP - 核糖)聚合酶(PARP)家族是一种关键的DNA修复酶,可对DNA损伤作出反应、调节细胞凋亡并维持基因组稳定性;因此,PARP抑制剂是治疗包括COVID - 19在内的各种人类疾病的一种有前景的治疗策略。在本研究中,提出了一种多任务指纹与图神经网络(FP - GNN)深度学习框架,用于预测分子对四种PARP亚型(PARP - 1、PARP - 2、PARP - 5A和PARP - 5B)的抑制活性。与基于随机森林(RF)、支持向量机(SVM)、极端梯度提升(XGBoost)和逻辑回归(LR)等四种传统机器学习方法以及深度神经网络(DNN)、注意力指纹(Attentive FP)、消息传递神经网络(MPNN)、图注意力网络(GAT)、图卷积网络(GCN)和深度消息传递神经网络(D - MPNN)等六种深度学习算法的基线预测模型相比,评估结果表明,多任务FP - GNN方法在测试集上取得了最佳性能,平均平衡准确率(BA)、F1值和曲线下面积(AUC)最高,分别为0.753±0.033、0.910±0.045和0.888±0.016。此外,Y - 扰乱测试成功验证了该模型并非偶然相关性的结果。更重要的是,多任务FP - GNN模型的可解释性使得能够识别与每种PARP亚型抑制相关的关键结构片段。为便于该多任务FP - GNN模型在该领域的应用,创建了一个名为PARPi - Predict的在线网络服务器及其本地版本软件,以预测化合物是否具有针对PARP的潜在抑制活性,从而有助于设计和发现更好的选择性PARP抑制剂。