基于图神经网络的多源迁移学习在靶向孤儿 G 蛋白偶联受体的配体生物活性建模中的优异表现。

Multi-source transfer learning with Graph Neural Network for excellent modelling the bioactivities of ligands targeting orphan G protein-coupled receptors.

机构信息

College of Physics and Information Engineering, Fuzhou University, Fuzhou 350116, China.

VeriMake Innovation Lab, Nanjing Renmian Integrated Circuit Co., Ltd., Nanjing 210088, China.

出版信息

Math Biosci Eng. 2023 Jan;20(2):2588-2608. doi: 10.3934/mbe.2023121. Epub 2022 Nov 25.

DOI:10.3934/mbe.2023121

PMID:36899548

Abstract

G protein-coupled receptors (GPCRs) have been the targets for more than 40% of the currently approved drugs. Although neural networks can effectively improve the accuracy of prediction with the biological activity, the result is undesirable in the limited orphan GPCRs (oGPCRs) datasets. To this end, we proposed Multi-source Transfer Learning with Graph Neural Network, called MSTL-GNN, to bridge this gap. Firstly, there are three ideal sources of data for transfer learning, oGPCRs, experimentally validated GPCRs, and invalidated GPCRs similar to the former one. Secondly, the SIMLEs format GPCRs convert to graphics, and they can be the input of Graph Neural Network (GNN) and ensemble learning for improving prediction accuracy. Finally, our experiments show that MSTL-GNN remarkably improves the prediction of GPCRs ligand activity value compared with previous studies. On average, the two evaluation indexes we adopted, R2 and Root-mean-square deviation (RMSE). Compared with the state-of-the-art work MSTL-GNN increased up to 67.13% and 17.22%, respectively. The effectiveness of MSTL-GNN in the field of GPCR Drug discovery with limited data also paves the way for other similar application scenarios.

摘要

G 蛋白偶联受体（GPCRs）是目前批准的 40%以上药物的靶点。虽然神经网络可以有效地提高生物活性的预测准确性，但在有限的孤儿 GPCR（oGPCR）数据集上，结果并不理想。为此，我们提出了一种基于图神经网络的多源迁移学习方法，称为 MSTL-GNN，以弥合这一差距。首先，对于迁移学习，有三种理想的数据来源，即 oGPCR、经过实验验证的 GPCR 和类似于前者的无效 GPCR。其次，将 SIMLEs 格式的 GPCR 转换为图形，它们可以作为图神经网络（GNN）和集成学习的输入，以提高预测准确性。最后，我们的实验表明，与之前的研究相比，MSTL-GNN 显著提高了 GPCR 配体活性值的预测。平均而言，我们采用的两个评估指标，R2 和均方根偏差（RMSE），与最先进的工作 MSTL-GNN 相比，分别提高了 67.13%和 17.22%。MSTL-GNN 在数据有限的 GPCR 药物发现领域的有效性也为其他类似的应用场景铺平了道路。

相似文献

Multi-source transfer learning with Graph Neural Network for excellent modelling the bioactivities of ligands targeting orphan G protein-coupled receptors.基于图神经网络的多源迁移学习在靶向孤儿 G 蛋白偶联受体的配体生物活性建模中的优异表现。

Math Biosci Eng. 2023 Jan;20(2):2588-2608. doi: 10.3934/mbe.2023121. Epub 2022 Nov 25.

Transfer learning with molecular graph convolutional networks for accurate modeling and representation of bioactivities of ligands targeting GPCRs without sufficient data.基于分子图卷积网络的迁移学习，实现了对 GPCR 靶点配体生物活性的精确建模和表示，即使在数据不足的情况下也能得到良好的效果。

Comput Biol Chem. 2022 Jun;98:107664. doi: 10.1016/j.compbiolchem.2022.107664. Epub 2022 Mar 9.

Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors.精确建模和解释靶向 G 蛋白偶联受体的配体的生物活性。

Bioinformatics. 2019 Jul 15;35(14):i324-i332. doi: 10.1093/bioinformatics/btz336.

MD-GNN: A mechanism-data-driven graph neural network for molecular properties prediction and new material discovery.MD-GNN：一种基于机制数据的图神经网络，用于分子性质预测和新材料发现。

J Mol Graph Model. 2023 Sep;123:108506. doi: 10.1016/j.jmgm.2023.108506. Epub 2023 May 9.

Integrated Transfer Learning and Multitask Learning Strategies to Construct Graph Neural Network Models for Predicting Bioaccumulation Parameters of Chemicals.集成迁移学习和多任务学习策略，构建用于预测化学品生物积累参数的图神经网络模型。

Environ Sci Technol. 2024 Sep 3;58(35):15650-15660. doi: 10.1021/acs.est.4c02421. Epub 2024 Jul 25.

Multiphysical graph neural network (MP-GNN) for COVID-19 drug design.多物理图神经网络（MP-GNN）在 COVID-19 药物设计中的应用。

Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac231.

Visualizing Graph Neural Networks With CorGIE: Corresponding a Graph to Its Embedding.用 CorGIE 可视化图神经网络：将图与其嵌入对应。

IEEE Trans Vis Comput Graph. 2022 Jun;28(6):2500-2516. doi: 10.1109/TVCG.2022.3148197. Epub 2022 May 2.

Homologous G Protein-Coupled Receptors Boost the Modeling and Interpretation of Bioactivities of Ligand Molecules.同源 G 蛋白偶联受体增强配体分子生物活性的建模和解释。

J Chem Inf Model. 2020 Mar 23;60(3):1865-1875. doi: 10.1021/acs.jcim.9b01000. Epub 2020 Feb 18.

Prediction of GPCR activity using machine learning.使用机器学习预测G蛋白偶联受体（GPCR）活性。

Comput Struct Biotechnol J. 2022 May 18;20:2564-2573. doi: 10.1016/j.csbj.2022.05.016. eCollection 2022.

Boosting-GNN: Boosting Algorithm for Graph Networks on Imbalanced Node Classification.增强型图神经网络（Boosting-GNN）：用于不平衡节点分类的图网络增强算法。

Front Neurorobot. 2021 Nov 25;15:775688. doi: 10.3389/fnbot.2021.775688. eCollection 2021.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于图神经网络的多源迁移学习在靶向孤儿 G 蛋白偶联受体的配体生物活性建模中的优异表现。

Multi-source transfer learning with Graph Neural Network for excellent modelling the bioactivities of ligands targeting orphan G protein-coupled receptors.

机构信息

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献