• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GGN-GO:基于多尺度结构特征预测蛋白质功能的几何图网络。

GGN-GO: geometric graph networks for predicting protein function by multi-scale structure features.

机构信息

The College of Information Science and Technology, Beijing University of Chemical Technology, Beijing.

The College of Life Science and Technology, Beijing University of Chemical Technology, Beijing.

出版信息

Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae559.

DOI:10.1093/bib/bbae559
PMID:39487084
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11530295/
Abstract

Recent advances in high-throughput sequencing have led to an explosion of genomic and transcriptomic data, offering a wealth of protein sequence information. However, the functions of most proteins remain unannotated. Traditional experimental methods for annotation of protein functions are costly and time-consuming. Current deep learning methods typically rely on Graph Convolutional Networks to propagate features between protein residues. However, these methods fail to capture fine atomic-level geometric structural features and cannot directly compute or propagate structural features (such as distances, directions, and angles) when transmitting features, often simplifying them to scalars. Additionally, difficulties in capturing long-range dependencies limit the model's ability to identify key nodes (residues). To address these challenges, we propose a geometric graph network (GGN-GO) for predicting protein function that enriches feature extraction by capturing multi-scale geometric structural features at the atomic and residue levels. We use a geometric vector perceptron to convert these features into vector representations and aggregate them with node features for better understanding and propagation in the network. Moreover, we introduce a graph attention pooling layer captures key node information by adaptively aggregating local functional motifs, while contrastive learning enhances graph representation discriminability through random noise and different views. The experimental results show that GGN-GO outperforms six comparative methods in tasks with the most labels for both experimentally validated and predicted protein structures. Furthermore, GGN-GO identifies functional residues corresponding to those experimentally confirmed, showcasing its interpretability and the ability to pinpoint key protein regions. The code and data are available at: https://github.com/MiJia-ID/GGN-GO.

摘要

高通量测序技术的最新进展导致基因组和转录组数据呈爆炸式增长,提供了丰富的蛋白质序列信息。然而,大多数蛋白质的功能仍然没有被注释。传统的蛋白质功能注释实验方法成本高、耗时长。当前的深度学习方法通常依赖图卷积网络在蛋白质残基之间传播特征。然而,这些方法无法捕捉精细的原子级几何结构特征,并且在传递特征时无法直接计算或传播结构特征(如距离、方向和角度),通常将它们简化为标量。此外,捕捉长程依赖关系的困难限制了模型识别关键节点(残基)的能力。为了解决这些挑战,我们提出了一种用于预测蛋白质功能的几何图网络(GGN-GO),通过捕获原子和残基水平的多尺度几何结构特征来丰富特征提取。我们使用几何向量感知机将这些特征转换为向量表示,并将其与节点特征聚合在一起,以在网络中更好地理解和传播。此外,我们引入了一个图注意力池化层,通过自适应聚合局部功能基元来捕获关键节点信息,而对比学习通过随机噪声和不同视图来增强图表示的可区分性。实验结果表明,在具有最多标签的实验验证和预测蛋白质结构任务中,GGN-GO 优于六个比较方法。此外,GGN-GO 识别出与实验证实的功能相对应的功能残基,展示了其可解释性和确定关键蛋白质区域的能力。代码和数据可在 https://github.com/MiJia-ID/GGN-GO 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f04/11530295/1e39f16dddff/bbae559f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f04/11530295/44b6f62cca83/bbae559f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f04/11530295/5b3ad48d13e1/bbae559f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f04/11530295/b27f2d55c1a8/bbae559f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f04/11530295/1e39f16dddff/bbae559f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f04/11530295/44b6f62cca83/bbae559f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f04/11530295/5b3ad48d13e1/bbae559f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f04/11530295/b27f2d55c1a8/bbae559f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f04/11530295/1e39f16dddff/bbae559f4.jpg

相似文献

1
GGN-GO: geometric graph networks for predicting protein function by multi-scale structure features.GGN-GO:基于多尺度结构特征预测蛋白质功能的几何图网络。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae559.
2
Deep learning model for protein multi-label subcellular localization and function prediction based on multi-task collaborative training.基于多任务协同训练的蛋白质多标签亚细胞定位和功能预测深度学习模型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae568.
3
Hierarchical graph transformer with contrastive learning for protein function prediction.基于对比学习的层次图转换器在蛋白质功能预测中的应用。
Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad410.
4
DSSGNN-PPI: A Protein-Protein Interactions prediction model based on Double Structure and Sequence graph neural networks.DSSGNN-PPI:一种基于双结构和序列图神经网络的蛋白质-蛋白质相互作用预测模型。
Comput Biol Med. 2024 Jul;177:108669. doi: 10.1016/j.compbiomed.2024.108669. Epub 2024 May 29.
5
PCP-GC-LM: single-sequence-based protein contact prediction using dual graph convolutional neural network and convolutional neural network.PCP-GC-LM:基于双图卷积神经网络和卷积神经网络的单序列蛋白质接触预测。
BMC Bioinformatics. 2024 Sep 2;25(1):287. doi: 10.1186/s12859-024-05914-3.
6
Exploring potential circRNA biomarkers for cancers based on double-line heterogeneous graph representation learning.基于双线性异质图表示学习的癌症潜在环状 RNA 生物标志物研究
BMC Med Inform Decis Mak. 2024 Jun 6;24(1):159. doi: 10.1186/s12911-024-02564-6.
7
TAWFN: a deep learning framework for protein function prediction.TAWFN:一种用于蛋白质功能预测的深度学习框架。
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae571.
8
Advancing cancer driver gene detection via Schur complement graph augmentation and independent subspace feature extraction.通过舒尔补图增强和独立子空间特征提取推进癌症驱动基因检测。
Comput Biol Med. 2024 May;174:108484. doi: 10.1016/j.compbiomed.2024.108484. Epub 2024 Apr 16.
9
Protein Subcellular Localization Prediction Model Based on Graph Convolutional Network.基于图卷积网络的蛋白质亚细胞定位预测模型
Interdiscip Sci. 2022 Dec;14(4):937-946. doi: 10.1007/s12539-022-00529-9. Epub 2022 Jun 17.
10
Predicting miRNA-disease association via graph attention learning and multiplex adaptive modality fusion.通过图注意力学习和多复用自适应模态融合预测 miRNA-疾病关联。
Comput Biol Med. 2024 Feb;169:107904. doi: 10.1016/j.compbiomed.2023.107904. Epub 2023 Dec 28.

本文引用的文献

1
Accurate structure prediction of biomolecular interactions with AlphaFold 3.利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。
Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.
2
Protein Engineering with Lightweight Graph Denoising Neural Networks.用光重量化图去噪神经网络进行蛋白质工程。
J Chem Inf Model. 2024 May 13;64(9):3650-3661. doi: 10.1021/acs.jcim.4c00036. Epub 2024 Apr 17.
3
Functional annotation of enzyme-encoding genes using deep learning with transformer layers.利用带有转换器层的深度学习对酶编码基因进行功能注释。
Nat Commun. 2023 Nov 14;14(1):7370. doi: 10.1038/s41467-023-43216-z.
4
Accurately identifying nucleic-acid-binding sites through geometric graph learning on language model predicted structures.通过对语言模型预测结构进行几何图学习来准确识别核酸结合位点。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad360.
5
Combining protein sequences and structures with transformers and equivariant graph neural networks to predict protein function.将蛋白质序列和结构与转换器和等变图神经网络相结合,以预测蛋白质功能。
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i318-i325. doi: 10.1093/bioinformatics/btad208.
6
Hierarchical graph transformer with contrastive learning for protein function prediction.基于对比学习的层次图转换器在蛋白质功能预测中的应用。
Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad410.
7
GKLOMLI: a link prediction model for inferring miRNA-lncRNA interactions by using Gaussian kernel-based method on network profile and linear optimization algorithm.GKLOMLI:一种基于高斯核函数的网络特征和线性优化算法的 miRNA-lncRNA 相互作用预测模型。
BMC Bioinformatics. 2023 May 8;24(1):188. doi: 10.1186/s12859-023-05309-w.
8
Enzyme function prediction using contrastive learning.使用对比学习进行酶功能预测。
Science. 2023 Mar 31;379(6639):1358-1363. doi: 10.1126/science.adf2465. Epub 2023 Mar 30.
9
Fast and accurate protein function prediction from sequence through pretrained language model and homology-based label diffusion.通过预训练语言模型和基于同源性的标签扩散,从序列快速准确地预测蛋白质功能。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad117.
10
PFresGO: an attention mechanism-based deep-learning approach for protein annotation by integrating gene ontology inter-relationships.PFresGO:一种基于注意力机制的深度学习方法,通过整合基因本体论的相互关系来进行蛋白质注释。
Bioinformatics. 2023 Mar 1;39(3). doi: 10.1093/bioinformatics/btad094.