• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SARP:一种评估蛋白质序列组成性偏差的新算法。

SARP: A Novel Algorithm to Assess Compositional Biases in Protein Sequences.

机构信息

Department of Genetics and Biotechnology, St. Petersburg State University, St. Petersburg, Russia.

出版信息

Evol Bioinform Online. 2013 Jul 11;9:263-73. doi: 10.4137/EBO.S12299. Print 2013.

DOI:10.4137/EBO.S12299
PMID:23919085
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3728207/
Abstract

The composition of a defined set of subunits (nucleotides, amino acids) is one of the key features of biological sequences. Compositional biases are local shifts in amino acid or nucleotide frequencies that can occur as an adaptation of an organism to an extreme ecological niche, or as the signature of a specific function or localization of the corresponding protein. The calculation of probability is a method for annotating compositional bias and providing accurate detection of biased subsequences. Here, we present a Sequence Analysis based on the Ranking of Probabilities (SARP), a novel algorithm for the annotation of compositional biases based on ranking subsequences by their probabilities. SARP provides the same accuracy as the previously published Lower Probability Subsequences (LPS) algorithm but performs at an approximately 230-fold faster rate. It can be recommended for use when working with large datasets to reduce the time and resources required.

摘要

定义的一组亚基(核苷酸、氨基酸)的组成是生物序列的关键特征之一。组成偏差是氨基酸或核苷酸频率的局部偏移,可以作为生物体对极端生态位的适应,或者作为特定功能或相应蛋白质定位的特征。概率计算是一种注释组成偏差并提供偏置子序列的准确检测的方法。在这里,我们提出了一种基于概率排序的序列分析(SARP),这是一种基于概率对组成偏差进行注释的新算法。SARP 提供了与先前发布的低概率子序列(LPS)算法相同的准确性,但速度快约 230 倍。当处理大型数据集以减少所需的时间和资源时,推荐使用它。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3728207/42c1655d5d57/ebo-9-2013-263f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3728207/c64493a0af4b/ebo-9-2013-263f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3728207/3a58db50217e/ebo-9-2013-263f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3728207/a7f9d757d173/ebo-9-2013-263f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3728207/6224d95e4254/ebo-9-2013-263f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3728207/42c1655d5d57/ebo-9-2013-263f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3728207/c64493a0af4b/ebo-9-2013-263f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3728207/3a58db50217e/ebo-9-2013-263f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3728207/a7f9d757d173/ebo-9-2013-263f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3728207/6224d95e4254/ebo-9-2013-263f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d4f7/3728207/42c1655d5d57/ebo-9-2013-263f5.jpg

相似文献

1
SARP: A Novel Algorithm to Assess Compositional Biases in Protein Sequences.SARP:一种评估蛋白质序列组成性偏差的新算法。
Evol Bioinform Online. 2013 Jul 11;9:263-73. doi: 10.4137/EBO.S12299. Print 2013.
2
LPS-annotate: complete annotation of compositionally biased regions in the protein knowledgebase.LPS-annotate:对蛋白质知识库中组成性偏倚区域进行完整注释。
Database (Oxford). 2011 Jan 6;2011:baq031. doi: 10.1093/database/baq031. Print 2011.
3
fLPS: Fast discovery of compositional biases for the protein universe.fLPS:蛋白质宇宙组成偏差的快速发现
BMC Bioinformatics. 2017 Nov 13;18(1):476. doi: 10.1186/s12859-017-1906-3.
4
Nucleotide composition as a driving force in the evolution of retroviruses.核苷酸组成作为逆转录病毒进化的驱动力。
J Mol Evol. 1994 May;38(5):506-32. doi: 10.1007/BF00178851.
5
Revisiting the relationship between compositional sequence complexity and periodicity.重新审视组成序列复杂性与周期性之间的关系。
Comput Biol Chem. 2008 Feb;32(1):17-28. doi: 10.1016/j.compbiolchem.2007.09.001. Epub 2007 Sep 12.
6
Mitochondrial phylogenomics of early land plants: mitigating the effects of saturation, compositional heterogeneity, and codon-usage bias.早期陆地植物的线粒体系统发育基因组学:减轻饱和度、组成异质性和密码子使用偏好的影响。
Syst Biol. 2014 Nov;63(6):862-78. doi: 10.1093/sysbio/syu049. Epub 2014 Jul 28.
7
Compositional bias may affect both DNA-based and protein-based phylogenetic reconstructions.组成偏差可能会影响基于DNA和基于蛋白质的系统发育重建。
J Mol Evol. 1999 Mar;48(3):284-90. doi: 10.1007/pl00006471.
8
A novel sensitive method for the detection of user-defined compositional bias in biological sequences.一种用于检测生物序列中用户定义组成偏差的新型灵敏方法。
Bioinformatics. 2006 May 1;22(9):1055-63. doi: 10.1093/bioinformatics/btl049. Epub 2006 Feb 24.
9
A method to assess compositional bias in biological sequences and its application to prion-like glutamine/asparagine-rich domains in eukaryotic proteomes.一种评估生物序列中组成性偏差的方法及其在真核生物蛋白质组中类朊病毒谷氨酰胺/天冬酰胺富集结构域的应用。
Genome Biol. 2003;4(6):R40. doi: 10.1186/gb-2003-4-6-r40. Epub 2003 May 30.
10
Exhaustive assignment of compositional bias reveals universally prevalent biased regions: analysis of functional associations in human and Drosophila.成分偏差的详尽分析揭示了普遍存在的偏差区域:人类和果蝇功能关联分析
BMC Bioinformatics. 2006 Oct 10;7:441. doi: 10.1186/1471-2105-7-441.

引用本文的文献

1
Optimizing strategy for the discovery of compositionally-biased or low-complexity regions in proteins.优化发现蛋白质中组成偏向或低复杂度区域的策略。
Sci Rep. 2024 Jan 5;14(1):680. doi: 10.1038/s41598-023-50991-8.
2
Application of yeast to studying amyloid and prion diseases.酵母在淀粉样变性和朊病毒病研究中的应用。
Adv Genet. 2020;105:293-380. doi: 10.1016/bs.adgen.2020.01.002. Epub 2020 May 4.
3
Biomolecular Assemblies: Moving from Observation to Predictive Design.生物分子组装:从观察到预测设计的转变。

本文引用的文献

1
PrionHome: a database of prions and other sequences relevant to prion phenomena.PrionHome:一个包含朊病毒和其他与朊病毒现象相关序列的数据库。
PLoS One. 2012;7(2):e31785. doi: 10.1371/journal.pone.0031785. Epub 2012 Feb 20.
2
Polyglutamine neurodegeneration: expanded glutamines enhance native functions.多聚谷氨酰胺神经退行性变:扩展的谷氨酰胺增强了天然功能。
Curr Opin Genet Dev. 2012 Jun;22(3):251-5. doi: 10.1016/j.gde.2012.01.001. Epub 2012 Jan 25.
3
RNA-binding proteins with prion-like domains in ALS and FTLD-U.具有朊病毒样结构域的 RNA 结合蛋白在 ALS 和 FTLD-U 中的作用。
Chem Rev. 2018 Dec 26;118(24):11519-11574. doi: 10.1021/acs.chemrev.8b00038. Epub 2018 Oct 3.
4
Exploring Proteins Containing Amyloidogenic Regions in the Proteomes of Bacteria of the Order .探索该目细菌蛋白质组中含有淀粉样生成区域的蛋白质。
Evol Bioinform Online. 2018 Apr 9;14:1176934318768781. doi: 10.1177/1176934318768781. eCollection 2018.
5
M60-like metalloprotease domain of the Escherichia coli YghJ protein forms amyloid fibrils.大肠杆菌YghJ蛋白的M60样金属蛋白酶结构域形成淀粉样纤维。
PLoS One. 2018 Jan 30;13(1):e0191317. doi: 10.1371/journal.pone.0191317. eCollection 2018.
6
Predicting Amyloidogenic Proteins in the Proteomes of Plants.预测植物蛋白质组中的淀粉样蛋白形成蛋白。
Int J Mol Sci. 2017 Oct 16;18(10):2155. doi: 10.3390/ijms18102155.
7
Amyloids and prions in plants: Facts and perspectives.植物中的淀粉样蛋白和朊病毒:事实与展望。
Prion. 2017 Sep 3;11(5):300-312. doi: 10.1080/19336896.2017.1377875.
8
Prions, amyloids, and RNA: Pieces of a puzzle.朊病毒、淀粉样蛋白与RNA:谜题的碎片
Prion. 2016 May 3;10(3):182-206. doi: 10.1080/19336896.2016.1181253.
Prion. 2011 Jul-Sep;5(3):179-87. doi: 10.4161/pri.5.3.17230. Epub 2011 Jul 1.
4
GC content-independent amino acid patterns in bacteria and archaea.细菌和古菌中与 GC 含量无关的氨基酸模式。
J Basic Microbiol. 2012 Apr;52(2):195-205. doi: 10.1002/jobm.201100067. Epub 2011 Jul 21.
5
LPS-annotate: complete annotation of compositionally biased regions in the protein knowledgebase.LPS-annotate:对蛋白质知识库中组成性偏倚区域进行完整注释。
Database (Oxford). 2011 Jan 6;2011:baq031. doi: 10.1093/database/baq031. Print 2011.
6
Understanding protein non-folding.理解蛋白质的非折叠状态。
Biochim Biophys Acta. 2010 Jun;1804(6):1231-64. doi: 10.1016/j.bbapap.2010.01.017. Epub 2010 Feb 1.
7
Structural basis for the aminoacid composition of proteins from halophilic archea.嗜盐古菌蛋白质的氨基酸组成的结构基础。
PLoS Biol. 2009 Dec;7(12):e1000257. doi: 10.1371/journal.pbio.1000257. Epub 2009 Dec 15.
8
STUDIES IN THE PHYSICAL CHEMISTRY OF THE PROTEINS : III. THE RELATION BETWEEN THE AMINO ACID COMPOSITION OF CASEIN AND ITS CAPACITY TO COMBINE WITH BASE.蛋白质物理化学研究:III.干酪素的氨基酸组成与其与碱基结合能力的关系。
J Gen Physiol. 1924 Sep 20;7(1):45-79. doi: 10.1085/jgp.7.1.45.
9
A systematic survey identifies prions and illuminates sequence features of prionogenic proteins.一项系统性调查识别出了朊病毒,并阐明了朊病毒生成蛋白的序列特征。
Cell. 2009 Apr 3;137(1):146-58. doi: 10.1016/j.cell.2009.02.044.
10
The SR protein family of splicing factors: master regulators of gene expression.剪接因子的SR蛋白家族:基因表达的主要调控因子。
Biochem J. 2009 Jan 1;417(1):15-27. doi: 10.1042/BJ20081501.