• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PPA-GCN:一种用于原核生物途径分配的高效图卷积网络框架。

PPA-GCN: A Efficient GCN Framework for Prokaryotic Pathways Assignment.

作者信息

Lu Yuntao, Li Qi, Li Tao

机构信息

Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China.

College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, Beijing, China.

出版信息

Front Genet. 2022 Apr 4;13:839453. doi: 10.3389/fgene.2022.839453. eCollection 2022.

DOI:10.3389/fgene.2022.839453
PMID:35444686
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9013948/
Abstract

With the rapid development of sequencing technology, completed genomes of microbes have explosively emerged. For a newly sequenced prokaryotic genome, gene functional annotation and metabolism pathway assignment are important foundations for all subsequent research work. However, the assignment rate for gene metabolism pathways is lower than 48% on the whole. It is even lower for newly sequenced prokaryotic genomes, which has become a bottleneck for subsequent research. Thus, the development of a high-precision metabolic pathway assignment framework is urgently needed. Here, we developed PPA-GCN, a prokaryotic pathways assignment framework based on graph convolutional network, to assist functional pathway assignments using KEGG information and genomic characteristics. In the framework, genomic gene synteny information was used to construct a network, and ideas of self-supervised learning were inspired to enhance the framework's learning ability. Our framework is applicable to the genera of microbe with sufficient whole genome sequences. To evaluate the assignment rate, genomes from three different genera ( (65 genomes) and (100 genomes), (500 genomes)) were used. The initial functional pathway assignment rate of the three test genera were 27.7% (), 49.5% () and 30.1% (). PPA-GCN achieved excellence performance of 84.8% (), 77.0% () and 71.0% () for assignment rate. At the same time, PPA-GCN was proved to have strong fault tolerance. The framework provides novel insights into assignment for metabolism pathways and is likely to inform future deep learning applications for interpreting functional annotations and extends to all prokaryotic genera with sufficient genomes.

摘要

随着测序技术的快速发展,微生物的完整基因组大量涌现。对于新测序的原核生物基因组,基因功能注释和代谢途径分配是所有后续研究工作的重要基础。然而,基因代谢途径的分配率总体上低于48%。对于新测序的原核生物基因组,这一比例甚至更低,已成为后续研究的瓶颈。因此,迫切需要开发一种高精度的代谢途径分配框架。在此,我们开发了PPA-GCN,一种基于图卷积网络的原核生物途径分配框架,以利用KEGG信息和基因组特征辅助功能途径分配。在该框架中,利用基因组基因共线性信息构建网络,并启发自监督学习的思想来增强框架的学习能力。我们的框架适用于具有足够全基因组序列的微生物属。为了评估分配率,使用了来自三个不同属( (65个基因组)、 (100个基因组)、 (500个基因组))的基因组。三个测试属的初始功能途径分配率分别为27.7%( )、49.5%( )和30.1%( )。PPA-GCN在分配率方面分别达到了84.8%( )、77.0%( )和71.0%( )的优异性能。同时,PPA-GCN被证明具有很强的容错能力。该框架为代谢途径的分配提供了新的见解,可能为未来解释功能注释的深度学习应用提供参考,并扩展到所有具有足够基因组的原核生物属。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/11cb956aaacc/fgene-13-839453-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/12f8fdc56ae7/fgene-13-839453-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/7ec62f188b15/fgene-13-839453-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/23eeafafa30f/fgene-13-839453-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/cd65467ea90d/fgene-13-839453-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/5599bb56871e/fgene-13-839453-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/80d25189ac33/fgene-13-839453-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/66a058392579/fgene-13-839453-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/11cb956aaacc/fgene-13-839453-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/12f8fdc56ae7/fgene-13-839453-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/7ec62f188b15/fgene-13-839453-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/23eeafafa30f/fgene-13-839453-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/cd65467ea90d/fgene-13-839453-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/5599bb56871e/fgene-13-839453-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/80d25189ac33/fgene-13-839453-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/66a058392579/fgene-13-839453-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ec7/9013948/11cb956aaacc/fgene-13-839453-g008.jpg

相似文献

1
PPA-GCN: A Efficient GCN Framework for Prokaryotic Pathways Assignment.PPA-GCN:一种用于原核生物途径分配的高效图卷积网络框架。
Front Genet. 2022 Apr 4;13:839453. doi: 10.3389/fgene.2022.839453. eCollection 2022.
2
3
A safe semi-supervised graph convolution network.一种安全的半监督图卷积网络。
Math Biosci Eng. 2022 Aug 31;19(12):12677-12692. doi: 10.3934/mbe.2022592.
4
MVS-GCN: A prior brain structure learning-guided multi-view graph convolution network for autism spectrum disorder diagnosis.MVS-GCN:一种基于先验脑结构学习的多视图图卷积网络自闭症谱系障碍诊断方法。
Comput Biol Med. 2022 Mar;142:105239. doi: 10.1016/j.compbiomed.2022.105239. Epub 2022 Jan 19.
5
Predicting the hosts of prokaryotic viruses using GCN-based semi-supervised learning.基于图卷积网络的半监督学习预测原核病毒宿主。
BMC Biol. 2021 Nov 24;19(1):250. doi: 10.1186/s12915-021-01180-4.
6
Hi-GCN: A hierarchical graph convolution network for graph embedding learning of brain network and brain disorders prediction.Hi-GCN:一种用于脑网络图嵌入学习和脑疾病预测的层次图卷积网络。
Comput Biol Med. 2020 Dec;127:104096. doi: 10.1016/j.compbiomed.2020.104096. Epub 2020 Nov 3.
7
Graph Convolution Networks with manifold regularization for semi-supervised learning.图卷积网络与流形正则化的半监督学习。
Neural Netw. 2020 Jul;127:160-167. doi: 10.1016/j.neunet.2020.04.016. Epub 2020 Apr 23.
8
Pathway importance by graph convolutional network and Shapley additive explanations in gene expression phenotype of diffuse large B-cell lymphoma.基于图卷积网络和 Shapley 加法解释的弥漫性大 B 细胞淋巴瘤基因表达表型通路重要性分析。
PLoS One. 2022 Jun 24;17(6):e0269570. doi: 10.1371/journal.pone.0269570. eCollection 2022.
9
An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes.一种用于原核生物基因组中顺式调控基序识别的综合且适用的系统发育足迹分析框架。
BMC Genomics. 2016 Aug 9;17:578. doi: 10.1186/s12864-016-2982-x.
10
C-GCN: A Flexible CSI Phase Feature Extraction Network for Error Suppression in Indoor Positioning.C-GCN:一种用于室内定位中误差抑制的灵活CSI相位特征提取网络。
Entropy (Basel). 2021 Jul 31;23(8):1004. doi: 10.3390/e23081004.

引用本文的文献

1
A novel hierarchical network-based approach to unveil the complexity of functional microbial genome.一种揭示功能微生物基因组复杂性的新型层次网络方法。
BMC Genomics. 2024 Aug 14;25(1):786. doi: 10.1186/s12864-024-10692-6.

本文引用的文献

1
Bacteriophage classification for assembled contigs using graph convolutional network.基于图卷积网络的组装 contig 细菌噬菌体分类。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i25-i33. doi: 10.1093/bioinformatics/btab293.
2
HPOFiller: identifying missing protein-phenotype associations by graph convolutional network.HPOFiller:通过图卷积网络识别缺失的蛋白质-表型关联
Bioinformatics. 2021 Oct 11;37(19):3328-3336. doi: 10.1093/bioinformatics/btab224.
3
Deep neural networks identify sequence context features predictive of transcription factor binding.
深度神经网络可识别预测转录因子结合的序列上下文特征。
Nat Mach Intell. 2021 Feb;3(2):172-180. doi: 10.1038/s42256-020-00282-y. Epub 2021 Jan 18.
4
Classification of Grain Amaranths Using Chromosome-Level Genome Assembly of Ramdana, .利用Ramdana的染色体水平基因组组装对籽粒苋进行分类
Front Plant Sci. 2020 Nov 11;11:579529. doi: 10.3389/fpls.2020.579529. eCollection 2020.
5
Pseudo2GO: A Graph-Based Deep Learning Method for Pseudogene Function Prediction by Borrowing Information From Coding Genes.Pseudo2GO:一种基于图的深度学习方法,通过借鉴编码基因的信息进行假基因功能预测。
Front Genet. 2020 Aug 18;11:807. doi: 10.3389/fgene.2020.00807. eCollection 2020.
6
Representation of features as images with neighborhood dependencies for compatibility with convolutional neural networks.基于邻域依赖的特征图像表示方法,以与卷积神经网络兼容。
Nat Commun. 2020 Sep 1;11(1):4391. doi: 10.1038/s41467-020-18197-y.
7
Drug-target interaction prediction using semi-bipartite graph model and deep learning.基于半二部图模型和深度学习的药物-靶点相互作用预测。
BMC Bioinformatics. 2020 Jul 6;21(Suppl 4):248. doi: 10.1186/s12859-020-3518-6.
8
Pan-Genome Analyses of spp. Reveal Genetic Characteristics and Composting Potential. spp.的泛基因组分析揭示了其遗传特征和堆肥潜力。
Int J Mol Sci. 2020 May 11;21(9):3393. doi: 10.3390/ijms21093393.
9
A Comprehensive Survey on Graph Neural Networks.图神经网络综述。
IEEE Trans Neural Netw Learn Syst. 2021 Jan;32(1):4-24. doi: 10.1109/TNNLS.2020.2978386. Epub 2021 Jan 4.
10
Deep graph embedding for prioritizing synergistic anticancer drug combinations.用于优先排序协同抗癌药物组合的深度图嵌入
Comput Struct Biotechnol J. 2020 Feb 15;18:427-438. doi: 10.1016/j.csbj.2020.02.006. eCollection 2020.