• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用图神经网络和接触图鉴定植物液泡蛋白。

Identification of plant vacuole proteins by using graph neural network and contact maps.

机构信息

School of Information Science and Engineering, University of Jinan, Jinan, China.

Laboratory of Zoology, Graduate School of Bioresource and Bioenvironmental Sciences, Kyushu University, Fukuoka-Shi, Fukuoka, Japan.

出版信息

BMC Bioinformatics. 2023 Sep 22;24(1):357. doi: 10.1186/s12859-023-05475-x.

DOI:10.1186/s12859-023-05475-x
PMID:37740195
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10517492/
Abstract

Plant vacuoles are essential organelles in the growth and development of plants, and accurate identification of their proteins is crucial for understanding their biological properties. In this study, we developed a novel model called GraphIdn for the identification of plant vacuole proteins. The model uses SeqVec, a deep representation learning model, to initialize the amino acid sequence. We utilized the AlphaFold2 algorithm to obtain the structural information of corresponding plant vacuole proteins, and then fed the calculated contact maps into a graph convolutional neural network. GraphIdn achieved accuracy values of 88.51% and 89.93% in independent testing and fivefold cross-validation, respectively, outperforming previous state-of-the-art predictors. As far as we know, this is the first model to use predicted protein topology structure graphs to identify plant vacuole proteins. Furthermore, we assessed the effectiveness and generalization capability of our GraphIdn model by applying it to identify and locate peroxisomal proteins, which yielded promising outcomes. The source code and datasets can be accessed at https://github.com/SJNNNN/GraphIdn .

摘要

植物液泡是植物生长和发育过程中的重要细胞器,准确鉴定其蛋白质对于了解其生物学特性至关重要。在这项研究中,我们开发了一种名为 GraphIdn 的新型模型,用于鉴定植物液泡蛋白。该模型使用深度表示学习模型 SeqVec 来初始化氨基酸序列。我们利用 AlphaFold2 算法获取相应植物液泡蛋白的结构信息,然后将计算得到的接触图输入到图卷积神经网络中。GraphIdn 在独立测试和五重交叉验证中的准确率分别达到了 88.51%和 89.93%,优于之前的最先进预测器。据我们所知,这是第一个使用预测的蛋白质拓扑结构图来识别植物液泡蛋白的模型。此外,我们通过应用 GraphIdn 模型来识别和定位过氧化物酶体蛋白,评估了该模型的有效性和泛化能力,取得了有前景的结果。源代码和数据集可在 https://github.com/SJNNNN/GraphIdn 上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/01654335f79d/12859_2023_5475_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/12f4a5583c3a/12859_2023_5475_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/4f3fdaa4a4fe/12859_2023_5475_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/c760aeabf309/12859_2023_5475_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/9773ee4aa146/12859_2023_5475_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/6d18e417b7de/12859_2023_5475_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/9032c571f9a2/12859_2023_5475_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/98330f6aaeab/12859_2023_5475_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/35b3ca4cb4d0/12859_2023_5475_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/01654335f79d/12859_2023_5475_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/12f4a5583c3a/12859_2023_5475_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/4f3fdaa4a4fe/12859_2023_5475_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/c760aeabf309/12859_2023_5475_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/9773ee4aa146/12859_2023_5475_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/6d18e417b7de/12859_2023_5475_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/9032c571f9a2/12859_2023_5475_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/98330f6aaeab/12859_2023_5475_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/35b3ca4cb4d0/12859_2023_5475_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff99/10517492/01654335f79d/12859_2023_5475_Fig9_HTML.jpg

相似文献

1
Identification of plant vacuole proteins by using graph neural network and contact maps.利用图神经网络和接触图鉴定植物液泡蛋白。
BMC Bioinformatics. 2023 Sep 22;24(1):357. doi: 10.1186/s12859-023-05475-x.
2
Identification of plant vacuole proteins by exploiting deep representation learning features.利用深度表征学习特征鉴定植物液泡蛋白
Comput Struct Biotechnol J. 2022 Jun 8;20:2921-2927. doi: 10.1016/j.csbj.2022.06.002. eCollection 2022.
3
Protein Subcellular Localization Prediction Model Based on Graph Convolutional Network.基于图卷积网络的蛋白质亚细胞定位预测模型
Interdiscip Sci. 2022 Dec;14(4):937-946. doi: 10.1007/s12539-022-00529-9. Epub 2022 Jun 17.
4
TAWFN: a deep learning framework for protein function prediction.TAWFN:一种用于蛋白质功能预测的深度学习框架。
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae571.
5
3D-equivariant graph neural networks for protein model quality assessment.用于蛋白质模型质量评估的 3D 等变图神经网络。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btad030.
6
Struct2GO: protein function prediction based on graph pooling algorithm and AlphaFold2 structure information.Struct2GO:基于图池化算法和 AlphaFold2 结构信息的蛋白质功能预测。
Bioinformatics. 2023 Oct 3;39(10). doi: 10.1093/bioinformatics/btad637.
7
Prediction of protein-protein interaction using graph neural networks.基于图神经网络的蛋白质-蛋白质相互作用预测。
Sci Rep. 2022 May 19;12(1):8360. doi: 10.1038/s41598-022-12201-9.
8
PEL-PVP: Application of plant vacuolar protein discriminator based on PEFT ESM-2 and bilayer LSTM in an unbalanced dataset.PEL-PVP:基于 PEFT ESM-2 和双层 LSTM 的植物液泡蛋白鉴别器在不平衡数据集上的应用。
Int J Biol Macromol. 2024 Oct;277(Pt 3):134317. doi: 10.1016/j.ijbiomac.2024.134317. Epub 2024 Jul 31.
9
DLDTI: a learning-based framework for drug-target interaction identification using neural networks and network representation.DLDTI:一种基于学习的药物-靶点相互作用识别框架,使用神经网络和网络表示法。
J Transl Med. 2020 Nov 13;18(1):434. doi: 10.1186/s12967-020-02602-7.
10
deepAMPNet: a novel antimicrobial peptide predictor employing AlphaFold2 predicted structures and a bi-directional long short-term memory protein language model.深度 AMP 网络:一种新颖的抗菌肽预测器,采用 AlphaFold2 预测结构和双向长短期记忆蛋白质语言模型。
PeerJ. 2024 Jul 19;12:e17729. doi: 10.7717/peerj.17729. eCollection 2024.

本文引用的文献

1
Genome-wide identification and analysis of the invertase gene family in tobacco () reveals NtNINV10 participating the sugar metabolism.烟草中转化酶基因家族的全基因组鉴定与分析表明NtNINV10参与糖代谢。
Front Plant Sci. 2023 Jun 2;14:1164296. doi: 10.3389/fpls.2023.1164296. eCollection 2023.
2
Golgi_DF: Golgi proteins classification with deep forest.高尔基体_DF:基于深度森林的高尔基体蛋白质分类
Front Neurosci. 2023 May 12;17:1197824. doi: 10.3389/fnins.2023.1197824. eCollection 2023.
3
Phylogenetic and AlphaFold predicted structure analyses provide insights for A1 aspartic protease family classification in .
系统发育分析和AlphaFold预测结构分析为A1天冬氨酸蛋白酶家族在……中的分类提供了见解。
Front Plant Sci. 2023 Feb 3;14:1072168. doi: 10.3389/fpls.2023.1072168. eCollection 2023.
4
DeePVP: Identification and classification of phage virion proteins using deep learning.基于深度学习的噬菌体病毒蛋白鉴定与分类
Gigascience. 2022 Aug 11;11. doi: 10.1093/gigascience/giac076.
5
Identification of plant vacuole proteins by exploiting deep representation learning features.利用深度表征学习特征鉴定植物液泡蛋白
Comput Struct Biotechnol J. 2022 Jun 8;20:2921-2927. doi: 10.1016/j.csbj.2022.06.002. eCollection 2022.
6
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
7
In-Pero: Exploiting Deep Learning Embeddings of Protein Sequences to Predict the Localisation of Peroxisomal Proteins.In-Pero:利用蛋白质序列的深度学习嵌入来预测过氧化物酶体蛋白的定位。
Int J Mol Sci. 2021 Jun 15;22(12):6409. doi: 10.3390/ijms22126409.
8
Anticancer peptides prediction with deep representation learning features.基于深度表示学习特征的抗癌肽预测。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab008.
9
Sequence representation approaches for sequence-based protein prediction tasks that use deep learning.用于基于序列的蛋白质预测任务的序列表示方法,这些任务使用深度学习。
Brief Funct Genomics. 2021 Mar 2;20(1):61-73. doi: 10.1093/bfgp/elaa030.
10
Identification of sub-Golgi protein localization by use of deep representation learning features.利用深度表征学习特征鉴定高尔基体亚结构蛋白定位
Bioinformatics. 2021 Apr 5;36(24):5600-5609. doi: 10.1093/bioinformatics/btaa1074.