• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用进化信息和基于序列的深度学习预测蛋白质的亚细胞定位。

Predicting subcellular location of protein with evolution information and sequence-based deep learning.

机构信息

Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fujian Medical University, 1 Xuefu North Road, University Town, Fuzhou, 350122, FJ, China.

Department of Computer Science and Engineering, University of South Carolina, 550 Assembly St, Columbia, SC, 29208, USA.

出版信息

BMC Bioinformatics. 2021 Oct 22;22(Suppl 10):515. doi: 10.1186/s12859-021-04404-0.

DOI:10.1186/s12859-021-04404-0
PMID:34686152
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8539821/
Abstract

BACKGROUND

Protein subcellular localization prediction plays an important role in biology research. Since traditional methods are laborious and time-consuming, many machine learning-based prediction methods have been proposed. However, most of the proposed methods ignore the evolution information of proteins. In order to improve the prediction accuracy, we present a deep learning-based method to predict protein subcellular locations.

RESULTS

Our method utilizes not only amino acid compositions sequence but also evolution matrices of proteins. Our method uses a bidirectional long short-term memory network that processes the entire protein sequence and a convolutional neural network that extracts features from protein sequences. The position specific scoring matrix is used as a supplement to protein sequences. Our method was trained and tested on two benchmark datasets. The experiment results show that our method yields accurate results on the two datasets with an average precision of 0.7901, ranking loss of 0.0758 and coverage of 1.2848.

CONCLUSION

The experiment results show that our method outperforms five methods currently available. According to those experiments, we can see that our method is an acceptable alternative to predict protein subcellular location.

摘要

背景

蛋白质亚细胞定位预测在生物学研究中起着重要作用。由于传统方法既费力又耗时,因此已经提出了许多基于机器学习的预测方法。但是,大多数提出的方法都忽略了蛋白质的进化信息。为了提高预测精度,我们提出了一种基于深度学习的方法来预测蛋白质亚细胞位置。

结果

我们的方法不仅利用了氨基酸组成序列,还利用了蛋白质的进化矩阵。我们的方法使用双向长短时记忆网络来处理整个蛋白质序列,以及卷积神经网络来从蛋白质序列中提取特征。位置特异性评分矩阵被用作蛋白质序列的补充。我们的方法在两个基准数据集上进行了训练和测试。实验结果表明,我们的方法在两个数据集上均取得了准确的结果,平均精度为 0.7901,排名损失为 0.0758,覆盖率为 1.2848。

结论

实验结果表明,我们的方法优于目前可用的五种方法。根据这些实验,我们可以看到,我们的方法是预测蛋白质亚细胞位置的一种可接受的替代方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c699/8539821/8de16797666a/12859_2021_4404_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c699/8539821/ee6ad2713211/12859_2021_4404_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c699/8539821/7a67301551ae/12859_2021_4404_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c699/8539821/7b4093297e66/12859_2021_4404_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c699/8539821/8de16797666a/12859_2021_4404_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c699/8539821/ee6ad2713211/12859_2021_4404_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c699/8539821/7a67301551ae/12859_2021_4404_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c699/8539821/7b4093297e66/12859_2021_4404_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c699/8539821/8de16797666a/12859_2021_4404_Fig4_HTML.jpg

相似文献

1
Predicting subcellular location of protein with evolution information and sequence-based deep learning.利用进化信息和基于序列的深度学习预测蛋白质的亚细胞定位。
BMC Bioinformatics. 2021 Oct 22;22(Suppl 10):515. doi: 10.1186/s12859-021-04404-0.
2
Machine and Deep Learning for Prediction of Subcellular Localization.机器和深度学习在预测亚细胞定位中的应用。
Methods Mol Biol. 2021;2361:249-261. doi: 10.1007/978-1-0716-1641-3_15.
3
Prediction of nuclear proteins using nuclear translocation signals proposed by probabilistic latent semantic indexing.基于概率潜在语义索引的核转位信号预测核蛋白。
BMC Bioinformatics. 2012;13 Suppl 17(Suppl 17):S13. doi: 10.1186/1471-2105-13-S17-S13. Epub 2012 Dec 13.
4
Accurate prediction of multi-label protein subcellular localization through multi-view feature learning with RBRL classifier.通过使用 RBRL 分类器的多视图特征学习实现多标签蛋白质亚细胞定位的准确预测。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab012.
5
Subcellular location prediction of apoptosis proteins using two novel feature extraction methods based on evolutionary information and LDA.基于进化信息和 LDA 的两种新特征提取方法对凋亡蛋白的亚细胞定位预测
BMC Bioinformatics. 2020 May 24;21(1):212. doi: 10.1186/s12859-020-3539-1.
6
Identification of protein subcellular localization via integrating evolutionary and physicochemical information into Chou's general PseAAC.通过将进化和物理化学信息整合到 Chou 的通用 PseAAC 中,鉴定蛋白质亚细胞定位。
J Theor Biol. 2019 Feb 7;462:230-239. doi: 10.1016/j.jtbi.2018.11.012. Epub 2018 Nov 16.
7
Comprehensive Study on Enhancing Low-Quality Position-Specific Scoring Matrix with Deep Learning for Accurate Protein Structure Property Prediction: Using Bagging Multiple Sequence Alignment Learning.利用Bagging多序列比对学习,通过深度学习增强低质量位置特异性评分矩阵以进行准确蛋白质结构特性预测的综合研究
J Comput Biol. 2021 Apr;28(4):346-361. doi: 10.1089/cmb.2020.0416. Epub 2021 Feb 22.
8
Predicting Protein-Protein Interactions from Matrix-Based Protein Sequence Using Convolution Neural Network and Feature-Selective Rotation Forest.基于卷积神经网络和特征选择旋转森林的矩阵基蛋白序列预测蛋白-蛋白相互作用
Sci Rep. 2019 Jul 8;9(1):9848. doi: 10.1038/s41598-019-46369-4.
9
SCLpred-MEM: Subcellular localization prediction of membrane proteins by deep N-to-1 convolutional neural networks.SCLpred-MEM:基于深度 N 到 1 卷积神经网络的膜蛋白亚细胞定位预测。
Proteins. 2021 Oct;89(10):1233-1239. doi: 10.1002/prot.26144. Epub 2021 May 19.
10
DeepLncLoc: a deep learning framework for long non-coding RNA subcellular localization prediction based on subsequence embedding.DeepLncLoc:一种基于子序列嵌入的深度学习框架,用于长非编码 RNA 亚细胞定位预测。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab360.

引用本文的文献

1
A Comprehensive Review on RNA Subcellular Localization Prediction.RNA亚细胞定位预测综述
ArXiv. 2025 Apr 24:arXiv:2504.17162v1.
2
Impact of Alignments on the Accuracy of Protein Subcellular Localization Predictions.序列比对对蛋白质亚细胞定位预测准确性的影响。
Proteins. 2025 Mar;93(3):745-759. doi: 10.1002/prot.26767. Epub 2024 Nov 22.
3
Protein subcellular localization prediction tools.蛋白质亚细胞定位预测工具。

本文引用的文献

1
Modern deep learning in bioinformatics.生物信息学中的现代深度学习
J Mol Cell Biol. 2020 Oct 30;12(11):823-827. doi: 10.1093/jmcb/mjaa030.
2
ImPLoc: a multi-instance deep learning model for the prediction of protein subcellular localization based on immunohistochemistry images.ImPLoc:一种基于免疫组化图像的蛋白质亚细胞定位预测的多实例深度学习模型。
Bioinformatics. 2020 Apr 1;36(7):2244-2250. doi: 10.1093/bioinformatics/btz909.
3
Protein Crystallization Identification via Fuzzy Model on Linear Neighborhood Representation.基于线性邻域表示的模糊模型的蛋白质结晶鉴定。
Comput Struct Biotechnol J. 2024 Apr 15;23:1796-1807. doi: 10.1016/j.csbj.2024.04.032. eCollection 2024 Dec.
4
A Review for Artificial Intelligence Based Protein Subcellular Localization.基于人工智能的蛋白质亚细胞定位研究综述
Biomolecules. 2024 Mar 27;14(4):409. doi: 10.3390/biom14040409.
5
Recent Advancements in Subcellular Proteomics: Growing Impact of Organellar Protein Niches on the Understanding of Cell Biology.亚细胞蛋白质组学的最新进展:细胞器蛋白质龛对细胞生物学理解的影响日益增大。
J Proteome Res. 2024 Aug 2;23(8):2700-2722. doi: 10.1021/acs.jproteome.3c00839. Epub 2024 Mar 7.
6
Dual-Signal Feature Spaces Map Protein Subcellular Locations Based on Immunohistochemistry Image and Protein Sequence.基于免疫组化图像和蛋白质序列的双信号特征空间映射蛋白质亚细胞定位。
Sensors (Basel). 2023 Nov 7;23(22):9014. doi: 10.3390/s23229014.
7
Genome-wide identification and characterization of glutathione -transferase gene family in quinoa ( Willd.).藜麦(藜麦属)中谷胱甘肽-S-转移酶基因家族的全基因组鉴定与特征分析
3 Biotech. 2023 Jul;13(7):230. doi: 10.1007/s13205-023-03659-z. Epub 2023 Jun 10.
8
Recent Advances in the Prediction of Subcellular Localization of Proteins and Related Topics.蛋白质亚细胞定位预测及相关主题的最新进展
Front Bioinform. 2022 May 19;2:910531. doi: 10.3389/fbinf.2022.910531. eCollection 2022.
IEEE/ACM Trans Comput Biol Bioinform. 2021 Sep-Oct;18(5):1986-1995. doi: 10.1109/TCBB.2019.2954826. Epub 2021 Oct 7.
4
Critical evaluation of web-based prediction tools for human protein subcellular localization.基于网络的人类蛋白质亚细胞定位预测工具的批判性评价。
Brief Bioinform. 2020 Sep 25;21(5):1628-1640. doi: 10.1093/bib/bbz106.
5
Prediction of mRNA subcellular localization using deep recurrent neural networks.基于深度递归神经网络的 mRNA 亚细胞定位预测。
Bioinformatics. 2019 Jul 15;35(14):i333-i342. doi: 10.1093/bioinformatics/btz337.
6
Efficient partition of integer optimization problems with one-hot encoding.使用独热编码对整数优化问题进行高效划分。
Sci Rep. 2019 Sep 10;9(1):13036. doi: 10.1038/s41598-019-49539-6.
7
Advances in Predicting Subcellular Localization of Multi-label Proteins and its Implication for Developing Multi-target Drugs.多标签蛋白质亚细胞定位预测的进展及其对开发多靶点药物的意义。
Curr Med Chem. 2019;26(26):4918-4943. doi: 10.2174/0929867326666190507082559.
8
Identification of protein subcellular localization via integrating evolutionary and physicochemical information into Chou's general PseAAC.通过将进化和物理化学信息整合到 Chou 的通用 PseAAC 中,鉴定蛋白质亚细胞定位。
J Theor Biol. 2019 Feb 7;462:230-239. doi: 10.1016/j.jtbi.2018.11.012. Epub 2018 Nov 16.
9
Prediction of LncRNA Subcellular Localization with Deep Learning from Sequence Features.基于序列特征的深度学习预测长链非编码 RNA 的亚细胞定位
Sci Rep. 2018 Nov 6;8(1):16385. doi: 10.1038/s41598-018-34708-w.
10
pLoc_bal-mAnimal: predict subcellular localization of animal proteins by balancing training dataset and PseAAC.pLoc_bal-mAnimal:通过平衡训练数据集和 PseAAC 来预测动物蛋白质的亚细胞定位。
Bioinformatics. 2019 Feb 1;35(3):398-406. doi: 10.1093/bioinformatics/bty628.