Is there a bias in the codon frequency corresponding to homo-repeats found in human proteins?

Affiliations

Gamaleya Research Centre of Epidemiology and Microbiology, 123098, Moscow, Russia; Institute of Protein Research, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia.

Institute of Protein Research, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia.

Publication information

Biosystems. 2024 Dec;246:105357. doi: 10.1016/j.biosystems.2024.105357. Epub 2024 Oct 21.

DOI: 10.1016/j.biosystems.2024.105357
PMID: 39442908
Abstract

It is well known that there is a codon usage bias in genomes, that is, some codons are observed more often than others. Codons implicated in the homo-repeats regions in human proteins are no exception. In this work, we analyzed the codon usage bias for all amino acid residues in homo-repeats larger than 4 in 3753 human proteins from 20447 protein sequences from the canonically reviewed human proteome. We have discovered that almost all homo-repeats in the human proteome, most of which encode Ala, Glu, Gly, Leu, Pro, and Ser (∼80% of all homo-repeats), have a codon usage bias, i.e. are mainly encoded by one codon. Moreover, there is a strong shift in homo-repeats in favor of the content of GC rich codons. Homo-repeats with Ala, Glu, Gly, Leu, Pro, and Ser predominate in the PDB, which has both ordered and disordered status. Examining the distribution of splicing sites, we found that about 15% of homo-repeats either contain or are located within 10 nucleotides of the splicing site, and Glu and Leu predominate in these homo-repeats. Our data is important for future study of the functions of homo-repeats, protein-protein interactions, and evolutionary fitness.
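
The kind of counting the abstract describes can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names, the toy protein and coding sequence, and the reading of "larger than 4" as runs of at least 5 identical residues are all assumptions made for illustration. The sketch finds homo-repeats in a protein sequence, tallies the codons that encode them in an aligned coding sequence, and reports the dominant codon and the GC fraction of those codons.

```python
import re
from collections import Counter, defaultdict


def find_homo_repeats(protein, min_len=5):
    """Yield (residue, start, end) for each run of >= min_len identical residues."""
    # (.) captures one residue; \1{n,} requires at least n further copies of it.
    pattern = re.compile(r"(.)\1{%d,}" % (min_len - 1))
    for match in pattern.finditer(protein):
        yield match.group(1), match.start(), match.end()


def codon_counts_in_repeats(protein, cds, min_len=5):
    """Count the codons used inside homo-repeats, grouped by amino acid.

    Assumes cds is the in-frame coding sequence for protein, so that
    residue i is encoded by cds[3*i : 3*i + 3].
    """
    if len(cds) < 3 * len(protein):
        raise ValueError("coding sequence is too short for the protein")
    counts = defaultdict(Counter)
    for residue, start, end in find_homo_repeats(protein, min_len):
        for i in range(start, end):
            counts[residue][cds[3 * i:3 * i + 3]] += 1
    return counts


def dominant_codon(codon_counter):
    """Return the most frequent codon and its share of all counted codons."""
    codon, n = codon_counter.most_common(1)[0]
    return codon, n / sum(codon_counter.values())


def gc_fraction(codon_counter):
    """Return the fraction of G and C bases among all counted codon positions."""
    total_bases = 3 * sum(codon_counter.values())
    gc_bases = sum(
        n * sum(base in "GC" for base in codon)
        for codon, n in codon_counter.items()
    )
    return gc_bases / total_bases


# Toy input (hypothetical, not taken from the paper): a run of five Ala residues.
protein = "MAAAAAQK"
cds = "ATG" + "GCC" * 4 + "GCG" + "CAG" + "AAG"

counts = codon_counts_in_repeats(protein, cds)
print(dict(counts["A"]))             # {'GCC': 4, 'GCG': 1}
print(dominant_codon(counts["A"]))   # ('GCC', 0.8)
print(gc_fraction(counts["A"]))      # 1.0
```

In this toy case the Ala repeat is encoded mainly by a single GC-rich codon, the pattern the abstract reports for most homo-repeats. A real analysis would additionally need proteome-wide sequences with matched coding sequences and a background codon frequency to compare against.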

Similar articles

1
Is there a bias in the codon frequency corresponding to homo-repeats found in human proteins?
Biosystems. 2024 Dec;246:105357. doi: 10.1016/j.biosystems.2024.105357. Epub 2024 Oct 21.
2
Is there codon usage bias for poly-Q stretches in the human proteome?
J Bioinform Comput Biol. 2019 Feb;17(1):1950010. doi: 10.1142/S0219720019500100.
3
[An Overlap between Splicing Sites in RNA and Homo-Repeats in Human Proteins].
Mol Biol (Mosk). 2019 May-Jun;53(3):524-528. doi: 10.1134/S0026898419030066.
4
Genomic and evolutionary insights into genes encoding proteins with single amino acid repeats.
Mol Biol Evol. 2006 Jul;23(7):1357-69. doi: 10.1093/molbev/msk022. Epub 2006 Apr 17.
5
Non-random distribution of homo-repeats: links with biological functions and human diseases.
Sci Rep. 2016 Jun 3;6:26941. doi: 10.1038/srep26941.
6
Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights.
PLoS One. 2016 Nov 28;11(11):e0166854. doi: 10.1371/journal.pone.0166854. eCollection 2016.
7
Evolutionary Forces and Codon Bias in Different Flavors of Intrinsic Disorder in the Human Proteome.
J Mol Evol. 2020 Mar;88(2):164-178. doi: 10.1007/s00239-019-09921-4. Epub 2019 Dec 10.
8
RCPdb: An evolutionary classification and codon usage database for repeat-containing proteins.
Genome Res. 2007 Jul;17(7):1118-27. doi: 10.1101/gr.6255407. Epub 2007 Jun 13.
9
Codon Usage Optimization in the Prokaryotic Tree of Life: How Synonymous Codons Are Differentially Selected in Sequence Domains with Different Expression Levels and Degrees of Conservation.
mBio. 2020 Jul 21;11(4):e00766-20. doi: 10.1128/mBio.00766-20.
10
Comparison of codon usage and tRNAs in mitochondrial genomes of Candida species.
Biosystems. 2007 Sep-Oct;90(2):362-70. doi: 10.1016/j.biosystems.2006.09.039. Epub 2006 Oct 5.

Cited by

1
What Can Be Learned by Knowing Only the Amino Acid Composition of Proteins?
Int J Mol Sci. 2024 Dec 21;25(24):13680. doi: 10.3390/ijms252413680.