• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

变化愈多——进化序列分歧预测蛋白质亚细胞定位信号。

Plus ça change - evolutionary sequence divergence predicts protein subcellular localization signals.

机构信息

Department of Computational Biology, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Japan.

出版信息

BMC Genomics. 2014 Jan 20;15:46. doi: 10.1186/1471-2164-15-46.

DOI:10.1186/1471-2164-15-46
PMID:24438075
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3906766/
Abstract

BACKGROUND

Protein subcellular localization is a central problem in understanding cell biology and has been the focus of intense research. In order to predict localization from amino acid sequence a myriad of features have been tried: including amino acid composition, sequence similarity, the presence of certain motifs or domains, and many others. Surprisingly, sequence conservation of sorting motifs has not yet been employed, despite its extensive use for tasks such as the prediction of transcription factor binding sites.

RESULTS

Here, we flip the problem around, and present a proof of concept for the idea that the lack of sequence conservation can be a novel feature for localization prediction. We show that for yeast, mammal and plant datasets, evolutionary sequence divergence alone has significant power to identify sequences with N-terminal sorting sequences. Moreover sequence divergence is nearly as effective when computed on automatically defined ortholog sets as on hand curated ones. Unfortunately, sequence divergence did not necessarily increase classification performance when combined with some traditional sequence features such as amino acid composition. However a post-hoc analysis of the proteins in which sequence divergence changes the prediction yielded some proteins with atypical (i.e. not MPP-cleaved) matrix targeting signals as well as a few misannotations.

CONCLUSION

We report the results of the first quantitative study of the effectiveness of evolutionary sequence divergence as a feature for protein subcellular localization prediction. We show that divergence is indeed useful for prediction, but it is not trivial to improve overall accuracy simply by adding this feature to classical sequence features. Nevertheless we argue that sequence divergence is a promising feature and show anecdotal examples in which it succeeds where other features fail.

摘要

背景

蛋白质亚细胞定位是理解细胞生物学的核心问题,也是研究的焦点。为了从氨基酸序列预测定位,人们尝试了无数的特征:包括氨基酸组成、序列相似性、特定模体或结构域的存在等。令人惊讶的是,尽管排序基序的序列保守性已被广泛用于转录因子结合位点预测等任务,但尚未将其用于预测定位。

结果

在这里,我们将问题颠倒过来,提出了一个概念验证,即缺乏序列保守性可能是定位预测的一个新特征。我们表明,对于酵母、哺乳动物和植物数据集,仅进化序列分歧就具有识别具有 N 端分选序列的序列的重要能力。此外,在自动定义的直系同源物集上计算序列分歧与在手工整理的直系同源物集上计算序列分歧一样有效。不幸的是,当与传统序列特征(如氨基酸组成)结合使用时,序列分歧不一定会提高分类性能。然而,对序列分歧改变预测的蛋白质进行的事后分析产生了一些具有非典型(即非 MPP 切割)基质靶向信号的蛋白质,以及一些错误注释。

结论

我们报告了进化序列分歧作为蛋白质亚细胞定位预测特征的有效性的首次定量研究结果。我们表明,分歧确实对预测有用,但简单地通过将此特征添加到经典序列特征中,就提高整体准确性并非易事。尽管如此,我们认为序列分歧是一种很有前途的特征,并展示了一些轶事案例,其中它在其他特征失败的地方取得了成功。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b1f/3906766/aa919f9d3099/1471-2164-15-46-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b1f/3906766/10c9aa207d2f/1471-2164-15-46-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b1f/3906766/b3f960dc0115/1471-2164-15-46-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b1f/3906766/514b89606145/1471-2164-15-46-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b1f/3906766/7b6735bcf336/1471-2164-15-46-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b1f/3906766/3aa3f86f91bb/1471-2164-15-46-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b1f/3906766/e21641572bf1/1471-2164-15-46-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b1f/3906766/aa919f9d3099/1471-2164-15-46-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b1f/3906766/10c9aa207d2f/1471-2164-15-46-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b1f/3906766/b3f960dc0115/1471-2164-15-46-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b1f/3906766/514b89606145/1471-2164-15-46-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b1f/3906766/7b6735bcf336/1471-2164-15-46-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b1f/3906766/3aa3f86f91bb/1471-2164-15-46-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b1f/3906766/e21641572bf1/1471-2164-15-46-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b1f/3906766/aa919f9d3099/1471-2164-15-46-7.jpg

相似文献

1
Plus ça change - evolutionary sequence divergence predicts protein subcellular localization signals.变化愈多——进化序列分歧预测蛋白质亚细胞定位信号。
BMC Genomics. 2014 Jan 20;15:46. doi: 10.1186/1471-2164-15-46.
2
MitoFates: improved prediction of mitochondrial targeting sequences and their cleavage sites.MitoFates:线粒体靶向序列及其切割位点的改进预测
Mol Cell Proteomics. 2015 Apr;14(4):1113-26. doi: 10.1074/mcp.M114.043083. Epub 2015 Feb 10.
3
Prediction of the subcellular localization of eukaryotic proteins using sequence signals and composition.利用序列信号和组成预测真核生物蛋白质的亚细胞定位
Proteomics. 2004 Jun;4(6):1591-6. doi: 10.1002/pmic.200300769.
4
Prediction of protein subcellular localization.蛋白质亚细胞定位预测
Proteins. 2006 Aug 15;64(3):643-51. doi: 10.1002/prot.21018.
5
ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules.ConSurf 2016:一种用于估计和可视化大分子进化保守性的改进方法。
Nucleic Acids Res. 2016 Jul 8;44(W1):W344-50. doi: 10.1093/nar/gkw408. Epub 2016 May 10.
6
Functional conservation of yeast mtTFB despite extensive sequence divergence.尽管酵母线粒体转录因子B(mtTFB)的序列存在广泛差异,但其功能保守。
Gene Expr. 1996;6(4):219-30.
7
Hidden localization motifs: naturally occurring peroxisomal targeting signals in non-peroxisomal proteins.隐藏的定位基序:非过氧化物酶体蛋白中天然存在的过氧化物酶体靶向信号
Genome Biol. 2004;5(12):R97. doi: 10.1186/gb-2004-5-12-r97. Epub 2004 Nov 30.
8
Prediction of nuclear proteins using nuclear translocation signals proposed by probabilistic latent semantic indexing.基于概率潜在语义索引的核转位信号预测核蛋白。
BMC Bioinformatics. 2012;13 Suppl 17(Suppl 17):S13. doi: 10.1186/1471-2105-13-S17-S13. Epub 2012 Dec 13.
9
Genetic structure and evolution of the Vps25 family, a yeast ESCRT-II component.酵母内体转运所需分选复合体-II(ESCRT-II)组分Vps25家族的遗传结构与进化
BMC Evol Biol. 2006 Aug 4;6:59. doi: 10.1186/1471-2148-6-59.
10
Evolution, structure and function of mitochondrial carriers: a review with new insights.线粒体载体的进化、结构和功能:综述及新见解
Plant J. 2011 Apr;66(1):161-81. doi: 10.1111/j.1365-313X.2011.04516.x.

引用本文的文献

1
Transcriptome analysis of Leucojum aestivum and identification of genes involved in norbelladine biosynthesis.文殊兰转录组分析及参与石蒜碱生物合成基因的鉴定。
Planta. 2022 Jan 3;255(2):30. doi: 10.1007/s00425-021-03741-x.
2
A Brief History of Protein Sorting Prediction.蛋白质分拣预测简史。
Protein J. 2019 Jun;38(3):200-216. doi: 10.1007/s10930-019-09838-3.
3
NommPred: Prediction of Mitochondrial and Mitochondrion-Related Organelle Proteins of Nonmodel Organisms.NommPred:非模式生物中线粒体及线粒体相关细胞器蛋白的预测

本文引用的文献

1
Evidence of evolutionary constraints that influences the sequence composition and diversity of mitochondrial matrix targeting signals.进化约束的证据影响了线粒体基质靶向信号的序列组成和多样性。
PLoS One. 2013 Jun 25;8(6):e67938. doi: 10.1371/journal.pone.0067938. Print 2013.
2
A reevaluation of dual-targeting of proteins to mitochondria and chloroplasts.对蛋白质双靶向定位于线粒体和叶绿体的重新评估。
Biochim Biophys Acta. 2013 Feb;1833(2):253-9. doi: 10.1016/j.bbamcr.2012.05.029. Epub 2012 Jun 7.
3
Processing peptidases in mitochondria and chloroplasts.
Evol Bioinform Online. 2018 Dec 23;14:1176934318819835. doi: 10.1177/1176934318819835. eCollection 2018.
4
Evolutionary Interplay between Symbiotic Relationships and Patterns of Signal Peptide Gain and Loss.共生关系与信号肽获得和丢失模式的进化相互作用。
Genome Biol Evol. 2018 Mar 1;10(3):928-938. doi: 10.1093/gbe/evy049.
5
Cytosolic and Nuclear Co-localization of Betalain Biosynthetic Enzymes in Tobacco Suggests that Betalains Are Synthesized in the Cytoplasm and/or Nucleus of Betalainic Plant Cells.甜菜红素生物合成酶在烟草中的胞质和核共定位表明,甜菜红素在含甜菜红素植物细胞的细胞质和/或细胞核中合成。
Front Plant Sci. 2017 May 18;8:831. doi: 10.3389/fpls.2017.00831. eCollection 2017.
6
Mining the Cicer arietinum genome for the mildew locus O (Mlo) gene family and comparative evolutionary analysis of the Mlo genes from Medicago truncatula and some other plant species.从鹰嘴豆基因组中挖掘白粉病抗性位点O(Mlo)基因家族,并对蒺藜苜蓿和其他一些植物物种的Mlo基因进行比较进化分析。
J Plant Res. 2017 Mar;130(2):239-253. doi: 10.1007/s10265-016-0868-2. Epub 2016 Nov 30.
7
MitoFates: improved prediction of mitochondrial targeting sequences and their cleavage sites.MitoFates:线粒体靶向序列及其切割位点的改进预测
Mol Cell Proteomics. 2015 Apr;14(4):1113-26. doi: 10.1074/mcp.M114.043083. Epub 2015 Feb 10.
线粒体和叶绿体中的加工肽酶。
Biochim Biophys Acta. 2013 Feb;1833(2):360-70. doi: 10.1016/j.bbamcr.2012.03.012. Epub 2012 Apr 1.
4
Inferring orthology and paralogy.推断直系同源和旁系同源关系。
Methods Mol Biol. 2012;855:259-79. doi: 10.1007/978-1-61779-582-4_9.
5
Presequence-dependent folding ensures MrpL32 processing by the m-AAA protease in mitochondria.前导序列依赖性折叠确保 MrpL32 通过线粒体中的 m-AAA 蛋白酶进行加工。
EMBO J. 2011 May 24;30(13):2545-56. doi: 10.1038/emboj.2011.169.
6
Mitochondrial protein turnover: role of the precursor intermediate peptidase Oct1 in protein stabilization.线粒体蛋白周转:前体中间肽酶 Oct1 在蛋白质稳定中的作用。
Mol Biol Cell. 2011 Jul 1;22(13):2135-43. doi: 10.1091/mbc.E11-02-0169. Epub 2011 Apr 27.
7
Dual role of the receptor Tom20 in specificity and efficiency of protein import into mitochondria.Tom20 受体在蛋白质导入线粒体的特异性和效率中的双重作用。
Proc Natl Acad Sci U S A. 2011 Jan 4;108(1):91-6. doi: 10.1073/pnas.1014918108. Epub 2010 Dec 20.
8
Prediction of subcellular locations of proteins: where to proceed?蛋白质亚细胞定位预测:路在何方?
Proteomics. 2010 Nov;10(22):3970-83. doi: 10.1002/pmic.201000274. Epub 2010 Nov 2.
9
Search and clustering orders of magnitude faster than BLAST.比 BLAST 快几个数量级的搜索和聚类。
Bioinformatics. 2010 Oct 1;26(19):2460-1. doi: 10.1093/bioinformatics/btq461. Epub 2010 Aug 12.
10
A comparative study of conservation and variation scores.保护和变异分数的比较研究。
BMC Bioinformatics. 2010 Jul 21;11:388. doi: 10.1186/1471-2105-11-388.