Evolutionary patterns of amino acid substitutions in 12 Drosophila genomes.

Author information

Department of Biological Sciences, East Tennessee State University, Johnson City, TN 37614, USA.

Publication information

BMC Genomics. 2010 Dec 2;11 Suppl 4(Suppl 4):S10. doi: 10.1186/1471-2164-11-S4-S10.

Abstract

BACKGROUND

Harnessing the vast amounts of genomic data produced by massive sequencing of multiple closely related genomes, and analyzing them in a phylogenetic context, requires new tools and approaches. We present a tool for genome-wide analysis of the frequencies and patterns of amino acid substitutions in multiple alignments of genes' coding regions, and a database of amino acid substitutions in the phylogeny of 12 Drosophila genomes. We illustrate the use of these resources to address three types of evolutionary genomics questions: fluxes in the amino acid composition of proteins, asymmetries in amino acid substitutions, and patterns of molecular evolution in duplicated genes.

RESULTS

We demonstrate that the amino acid composition of Drosophila proteins underwent a significant shift over the last 70 million years encompassed by the studied phylogeny, with less common amino acids (Cys, Met, His) increasing in frequency and more common ones (Ala, Leu, Glu) becoming less frequent. These fluxes are strongly correlated with the polarity of the source and destination amino acids, resulting in an overall systematic decrease in the mean polarity of amino acids found in Drosophila proteins. The frequency and radicality of amino acid substitutions are higher in paralogs than in orthologous single-copy genes, and higher in gene families with paralogs than in gene families without surviving duplications. The rate and radicality of substitutions, as expected, are negatively correlated with the overall level and uniformity of gene expression. However, these correlations are not observed for substitutions occurring in duplicated genes, indicating a different selective constraint on the evolution of paralogous sequences. Clades resulting from duplications show a marked asymmetry in the rate and radicality of amino acid substitutions, possibly a signal of widespread neofunctionalization. These patterns differ among protein families of different functionality: genes coding for RNA-binding proteins differ from most other functional groups in their amino acid substitution patterns in duplicated and single-copy genes.
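The notion of an amino acid "flux" used above can be illustrated with a minimal sketch. It assumes that directed substitutions are read off an aligned ancestor/descendant sequence pair and that the net flux of an amino acid is its gains (as a destination) minus its losses (as a source). The function names `count_substitutions` and `net_flux` and the toy sequences are hypothetical, introduced only for illustration. They are not the authors' tool or data, and the study itself works across a 12-species phylogeny, not a single pair.

```python
from collections import Counter


def count_substitutions(ancestor: str, descendant: str) -> Counter:
    """Count directed (source, destination) substitutions between two aligned sequences."""
    if len(ancestor) != len(descendant):
        raise ValueError("aligned sequences must have equal length")
    counts = Counter()
    for src, dst in zip(ancestor, descendant):
        # Skip alignment gaps and positions where the residue is unchanged.
        if src == "-" or dst == "-" or src == dst:
            continue
        counts[(src, dst)] += 1
    return counts


def net_flux(counts: Counter) -> dict:
    """Net flux per amino acid: gains (as destination) minus losses (as source)."""
    flux = Counter()
    for (src, dst), n in counts.items():
        flux[dst] += n
        flux[src] -= n
    return dict(flux)


# Toy aligned pair, invented for illustration only (not data from the study).
toy_ancestor = "ALEAK-LE"
toy_descendant = "CLEMKHLA"
toy_counts = count_substitutions(toy_ancestor, toy_descendant)
toy_flux = net_flux(toy_counts)
```

In this toy pair, Cys and Met each gain one residue while Ala and Glu each lose one. Because every substitution is one loss and one gain, the net fluxes always sum to zero.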

CONCLUSIONS

We demonstrate that deep phylogenetic analysis of amino acid substitutions can reveal interesting genome-wide patterns. Amino acid composition of drosophilid proteins is shaped by fluxes similar to those previously observed in prokaryotic, yeast and mammalian genomes, indicating globally present patterns. Increased frequency and radicality of amino acid substitutions in duplicated genes and the presence of asymmetry of these parameters between paralogous clades indicate widespread neofunctionalization among paralogs as the mechanism of duplication retention.


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d51/3005911/1ffd89411416/1471-2164-11-S4-S10-1.jpg
