Suppr超能文献

全基因组计算预测串联基因阵列:在酵母中的应用。

Genome-wide computational prediction of tandem gene arrays: application in yeasts.

机构信息

Université de Strasbourg, CNRS UMR7156, Strasbourg, France.

出版信息

BMC Genomics. 2010 Jan 21;11:56. doi: 10.1186/1471-2164-11-56.

Abstract

BACKGROUND

This paper describes an efficient in silico method for detecting tandem gene arrays (TGAs) in fully sequenced and compact genomes such as those of prokaryotes or unicellular eukaryotes. The originality of this method lies in the search of protein sequence similarities in the vicinity of each coding sequence, which allows the prediction of tandem duplicated gene copies independently of their functionality.

RESULTS

Applied to nine hemiascomycete yeast genomes, this method predicts that 2% of the genes are involved in TGAs and gene relics are present in 11% of TGAs. The frequency of TGAs with degenerated gene copies means that a significant fraction of tandem duplicated genes follows the birth-and-death model of evolution. A comparison of sequence identity distributions between sets of homologous gene pairs shows that the different copies of tandem arrayed paralogs are less divergent than copies of dispersed paralogs in yeast genomes. It suggests that paralogs included in tandem structures are more recent or more subject to the gene conversion mechanism than other paralogs.

CONCLUSION

The method reported here is a useful computational tool to provide a database of TGAs composed of functional or nonfunctional gene copies. Such a database has obvious applications in the fields of structural and comparative genomics. Notably, a detailed study of the TGA catalog will make it possible to tackle the fundamental questions of the origin and evolution of tandem gene clusters.

摘要

背景

本文描述了一种高效的计算方法,用于检测完全测序的和紧凑的基因组(如原核生物或单细胞真核生物)中的串联基因阵列(TGA)。该方法的创新性在于在每个编码序列附近搜索蛋白质序列相似性,从而可以独立于其功能预测串联重复基因副本。

结果

将该方法应用于 9 个半子囊菌酵母基因组,预测 2%的基因涉及 TGA,并且在 11%的 TGA 中存在基因遗迹。具有退化基因副本的 TGA 的频率意味着,大量串联重复基因遵循着诞生和死亡的进化模型。同源基因对集之间的序列同一性分布的比较表明,酵母基因组中,串联排列的直系同源基因的不同副本比分散的直系同源基因的副本分歧度更小。这表明,包含在串联结构中的直系同源基因比其他直系同源基因更新或更受基因转换机制的影响。

结论

本文报道的方法是一种有用的计算工具,可提供由功能或非功能基因副本组成的 TGA 数据库。这种数据库在结构和比较基因组学领域具有明显的应用。值得注意的是,对 TGA 目录的详细研究将使解决串联基因簇起源和进化的基本问题成为可能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20ca/2822764/97f531d1011d/1471-2164-11-56-1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验