• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用深度表征学习特征鉴定植物液泡蛋白

Identification of plant vacuole proteins by exploiting deep representation learning features.

作者信息

Jiao Shihu, Zou Quan

机构信息

Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China.

State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin, China.

出版信息

Comput Struct Biotechnol J. 2022 Jun 8;20:2921-2927. doi: 10.1016/j.csbj.2022.06.002. eCollection 2022.

DOI:10.1016/j.csbj.2022.06.002
PMID:35765653
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9207291/
Abstract

Plant vacuoles are the most important organelles for plant growth, development, and defense, and they play an important role in many types of stress responses. An important function of vacuole proteins is the transport of various classes of amino acids, ions, sugars, and other molecules. Accurate identification of vacuole proteins is crucial for revealing their biological functions. Several automatic and rapid computational tools have been proposed for the subcellular localization of proteins. Regrettably, they are not specific for the identification of plant vacuole proteins. To the best of our knowledge, there is only one computational software specifically trained for plant vacuolar proteins. Although its accuracy is acceptable, the prediction performance and stability of this method in practical applications can still be improved. Hence, in this study, a new predictor named iPVP-DRLF was developed to identify plant vacuole proteins specifically and effectively. This prediction software is designed using the light gradient boosting machine (LGBM) algorithm and hybrid features composed of classic sequence features and deep representation learning features. iPVP-DRLF achieved fivefold cross-validation and independent test accuracy values of 88.25 % and 87.16 %, respectively, both outperforming previous state-of-the-art predictors. Moreover, the blind dataset test results also showed that the performance of iPVP-DRLF was significantly better than the existing tools. The results of comparative experiments confirmed that deep representation learning features have an advantage over other classic sequence features in the identification of plant vacuole proteins. We believe that iPVP-DRLF would serve as an effective computational technique for plant vacuole protein prediction and facilitate related future research. The online server is freely accessible at https://lab.malab.cn/~acy/iPVP-DRLF. In addition, the source code and datasets are also accessible at https://github.com/jiaoshihu/iPVP-DRLF.

摘要

植物液泡是植物生长、发育和防御过程中最重要的细胞器,它们在多种应激反应中发挥着重要作用。液泡蛋白的一个重要功能是运输各类氨基酸、离子、糖类和其他分子。准确鉴定液泡蛋白对于揭示其生物学功能至关重要。已经提出了几种自动且快速的计算工具用于蛋白质的亚细胞定位。遗憾的是,它们并非专门用于鉴定植物液泡蛋白。据我们所知,仅有一款专门针对植物液泡蛋白进行训练的计算软件。尽管其准确性尚可,但该方法在实际应用中的预测性能和稳定性仍可提高。因此,在本研究中,开发了一种名为iPVP-DRLF的新型预测工具,以专门且有效地鉴定植物液泡蛋白。该预测软件是使用轻梯度提升机(LGBM)算法以及由经典序列特征和深度表征学习特征组成的混合特征设计而成。iPVP-DRLF在五折交叉验证和独立测试中的准确率分别达到了88.25%和87.16%,均优于先前的最先进预测工具。此外,盲数据集测试结果也表明iPVP-DRLF的性能明显优于现有工具。对比实验结果证实,在植物液泡蛋白的鉴定中,深度表征学习特征比其他经典序列特征更具优势。我们相信iPVP-DRLF将成为一种有效的植物液泡蛋白预测计算技术,并推动未来相关研究。可通过https://lab.malab.cn/~acy/iPVP-DRLF免费访问在线服务器。此外,源代码和数据集也可在https://github.com/jiaoshihu/iPVP-DRLF获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2eb/9207291/055a77af1008/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2eb/9207291/a34be6ed578b/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2eb/9207291/b5b440841969/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2eb/9207291/b256093557cf/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2eb/9207291/055a77af1008/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2eb/9207291/a34be6ed578b/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2eb/9207291/b5b440841969/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2eb/9207291/b256093557cf/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2eb/9207291/055a77af1008/gr3.jpg

相似文献

1
Identification of plant vacuole proteins by exploiting deep representation learning features.利用深度表征学习特征鉴定植物液泡蛋白
Comput Struct Biotechnol J. 2022 Jun 8;20:2921-2927. doi: 10.1016/j.csbj.2022.06.002. eCollection 2022.
2
prPred-DRLF: Plant R protein predictor using deep representation learning features.prPred-DRLF:基于深度表示学习特征的植物 R 蛋白预测器。
Proteomics. 2022 Jan;22(1-2):e2100161. doi: 10.1002/pmic.202100161. Epub 2021 Oct 14.
3
Meta-iPVP: a sequence-based meta-predictor for improving the prediction of phage virion proteins using effective feature representation.Meta-iPVP:一种基于序列的元预测器,用于使用有效的特征表示来改进噬菌体衣壳蛋白的预测。
J Comput Aided Mol Des. 2020 Oct;34(10):1105-1116. doi: 10.1007/s10822-020-00323-z. Epub 2020 Jun 16.
4
ProSE-Pero: Peroxisomal Protein Localization Identification Model Based on Self-Supervised Multi-Task Language Pre-Training Model.ProSE-Pero:基于自监督多任务语言预训练模型的过氧化物酶体蛋白定位识别模型。
Front Biosci (Landmark Ed). 2023 Dec 1;28(12):322. doi: 10.31083/j.fbl2812322.
5
Identification of plant vacuole proteins by using graph neural network and contact maps.利用图神经网络和接触图鉴定植物液泡蛋白。
BMC Bioinformatics. 2023 Sep 22;24(1):357. doi: 10.1186/s12859-023-05475-x.
6
Identification of sub-Golgi protein localization by use of deep representation learning features.利用深度表征学习特征鉴定高尔基体亚结构蛋白定位
Bioinformatics. 2021 Apr 5;36(24):5600-5609. doi: 10.1093/bioinformatics/btaa1074.
7
ATGPred-FL: sequence-based prediction of autophagy proteins with feature representation learning.ATGPred-FL:基于特征表示学习的自噬蛋白序列预测。
Amino Acids. 2022 May;54(5):799-809. doi: 10.1007/s00726-022-03145-5. Epub 2022 Mar 14.
8
Anticancer peptides prediction with deep representation learning features.基于深度表示学习特征的抗癌肽预测。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab008.
9
Identify Bitter Peptides by Using Deep Representation Learning Features.利用深度表示学习特征识别苦味肽。
Int J Mol Sci. 2022 Jul 17;23(14):7877. doi: 10.3390/ijms23147877.
10
PeNGaRoo, a combined gradient boosting and ensemble learning framework for predicting non-classical secreted proteins.PeNGaRoo,一种组合梯度提升和集成学习框架,用于预测非经典分泌蛋白。
Bioinformatics. 2020 Feb 1;36(3):704-712. doi: 10.1093/bioinformatics/btz629.

引用本文的文献

1
Diabetic peripheral neuropathy detection of type 2 diabetes using machine learning from TCM features: a cross-sectional study.基于中医特征运用机器学习检测2型糖尿病患者的糖尿病周围神经病变:一项横断面研究
BMC Med Inform Decis Mak. 2025 Feb 18;25(1):90. doi: 10.1186/s12911-025-02932-w.
2
GASIDN: identification of sub-Golgi proteins with multi-scale feature fusion.GASIDN:具有多尺度特征融合的亚高尔基体蛋白鉴定。
BMC Genomics. 2024 Oct 30;25(1):1019. doi: 10.1186/s12864-024-10954-3.
3
Identification of plant vacuole proteins by using graph neural network and contact maps.

本文引用的文献

1
In-Pero: Exploiting Deep Learning Embeddings of Protein Sequences to Predict the Localisation of Peroxisomal Proteins.In-Pero:利用蛋白质序列的深度学习嵌入来预测过氧化物酶体蛋白的定位。
Int J Mol Sci. 2021 Jun 15;22(12):6409. doi: 10.3390/ijms22126409.
2
Anticancer peptides prediction with deep representation learning features.基于深度表示学习特征的抗癌肽预测。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab008.
3
Sequence representation approaches for sequence-based protein prediction tasks that use deep learning.
利用图神经网络和接触图鉴定植物液泡蛋白。
BMC Bioinformatics. 2023 Sep 22;24(1):357. doi: 10.1186/s12859-023-05475-x.
用于基于序列的蛋白质预测任务的序列表示方法,这些任务使用深度学习。
Brief Funct Genomics. 2021 Mar 2;20(1):61-73. doi: 10.1093/bfgp/elaa030.
4
Evaluating Protein Transfer Learning with TAPE.使用TAPE评估蛋白质迁移学习。
Adv Neural Inf Process Syst. 2019 Dec;32:9689-9701.
5
Identification of sub-Golgi protein localization by use of deep representation learning features.利用深度表征学习特征鉴定高尔基体亚结构蛋白定位
Bioinformatics. 2021 Apr 5;36(24):5600-5609. doi: 10.1093/bioinformatics/btaa1074.
6
Computational identification of eukaryotic promoters based on cascaded deep capsule neural networks.基于级联深度胶囊神经网络的真核启动子计算识别。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa299.
7
VacPred: Sequence-based prediction of plant vacuole proteins using machine-learning techniques.VacPred:基于机器学习技术的植物液泡蛋白序列预测。
J Biosci. 2020;45.
8
DeepATT: a hybrid category attention neural network for identifying functional effects of DNA sequences.DeepATT:一种用于识别 DNA 序列功能效应的混合类别注意力神经网络。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa159.
9
Plant-mSubP: a computational framework for the prediction of single- and multi-target protein subcellular localization using integrated machine-learning approaches.植物微小肽:一种使用集成机器学习方法预测单靶点和多靶点蛋白质亚细胞定位的计算框架。
AoB Plants. 2019 Oct 17;12(3):plz068. doi: 10.1093/aobpla/plz068. eCollection 2020 Jun.
10
Vacuole Biogenesis in Plants: How Many Vacuoles, How Many Models?植物液泡生物发生:多少液泡,多少模型?
Trends Plant Sci. 2020 Jun;25(6):538-548. doi: 10.1016/j.tplants.2020.01.008. Epub 2020 Feb 21.