Ab Initio Construction and Evolutionary Analysis of Protein-Coding Gene Families with Partially Homologous Relationships: Closely Related Drosophila Genomes as a Case Study.

Affiliation

State Key Laboratory of Earth Surface Processes and Resource Ecology, Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, China.

Publication information

Genome Biol Evol. 2020 Mar 1;12(3):185-202. doi: 10.1093/gbe/evaa041.

Abstract

How have genes evolved within a well-known genome phylogeny? Many protein-coding genes should have evolved as a whole at the gene level, and some should have evolved partly through fragments at the subgene level. To comprehensively explore such complex homologous relationships and better understand gene family evolution, here, with de novo-identified modules, the subgene units which could consecutively cover proteins within a set of closely related species, we applied a new phylogeny-based approach that considers evolutionary models with partial homology to classify all protein-coding genes in nine Drosophila genomes. Compared with two other popular methods for gene family construction, our approach improved practical gene family classifications with a more reasonable view of homology and provided a much more complete landscape of gene family evolution at the gene and subgene levels. In the case study, we found that most expanded gene families might have evolved mainly through module rearrangements rather than gene duplications and mainly generated single-module genes through partial gene duplication, suggesting that there might be pervasive subgene rearrangement in the evolution of protein-coding gene families. The use of a phylogeny-based approach with partial homology to classify and analyze protein-coding gene families may provide us with a more comprehensive landscape depicting how genes evolve within a well-known genome phylogeny.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c62/7144356/14ecf5490e44/evaa041f1.jpg
