A cautionary note for retrocopy identification: DNA-based duplication of intron-containing genes significantly contributes to the origination of single exon genes.

Author information

Department of Ecology and Evolution, The University of Chicago, 1101 E 57th Street, Chicago, IL 60637, USA.

Publication information

Bioinformatics. 2011 Jul 1;27(13):1749-53. doi: 10.1093/bioinformatics/btr280. Epub 2011 May 5.

Abstract

MOTIVATION

Retrocopies are important genes in the genomes of almost all higher eukaryotes. However, the annotation of such genes is a non-trivial task. Intronless genes have often been considered to be retroposed copies of intron-containing paralogs. Such categorization relies on the implicit premise that alignable regions of the duplicates should be long enough to cover exon-exon junctions of the intron-containing genes, and thus intron loss events can be inferred. Here, we examined the alternative possibility that intronless genes could be generated by partial DNA-based duplication of intron-containing genes in the fruitfly genome.

RESULTS

By building pairwise protein-, transcript- and genome-level DNA alignments between intronless genes and their corresponding intron-containing paralogs, we found that alignments do not cover exon-exon junctions in 40% of cases and thus no intron loss could be inferred. For these cases, the candidate parental proteins tend to be partially duplicated, and intergenic sequences or neighboring genes are included in the intronless paralog. Moreover, we observed that it is significantly less likely for these paralogs to show inter-chromosomal duplication and testis-dominant transcription, compared to the remaining 60% of cases with evidence of clear intron loss (retrogenes). These lines of analysis reveal that DNA-based duplication contributes significantly to the 40% of cases of single exon gene duplication. Finally, we performed an analogous survey in the human genome and the result is similar, wherein 34% of the cases do not cover exon-exon junctions. Thus, genome annotation for retrogene identification should discard candidates without clear evidence of intron loss.
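
The junction-coverage criterion described above can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names, the transcript-coordinate convention, and the `min_flank` parameter are all assumptions introduced here for illustration. Given the exon lengths of an intron-containing paralog and the aligned interval on its transcript, the sketch reports which exon-exon junctions the alignment spans. If none is spanned, intron loss cannot be inferred from that alignment.

```python
from itertools import accumulate


def junction_positions(exon_lengths):
    """Return exon-exon junction positions in 0-based transcript coordinates.

    A junction at position p lies between transcript bases p-1 and p.
    The end of the last exon is not a junction, so it is dropped.
    """
    return list(accumulate(exon_lengths))[:-1]


def covered_junctions(exon_lengths, aln_start, aln_end, min_flank=1):
    """List junctions spanned by the half-open alignment [aln_start, aln_end).

    min_flank is a hypothetical parameter: the number of aligned bases
    required on each side of a junction before it counts as covered.
    """
    return [
        p
        for p in junction_positions(exon_lengths)
        if aln_start + min_flank <= p <= aln_end - min_flank
    ]


def classify(exon_lengths, aln_start, aln_end, min_flank=1):
    """Label an alignment by whether intron loss could be assessed from it."""
    if covered_junctions(exon_lengths, aln_start, aln_end, min_flank):
        return "junction_covered"
    return "no_junction_covered"


if __name__ == "__main__":
    exons = [100, 200, 150]  # hypothetical parental transcript, junctions at 100 and 300
    print(classify(exons, 120, 280))  # alignment lies within the second exon only
    print(classify(exons, 50, 350))   # alignment spans both junctions
```

Under this sketch, candidates labeled `no_junction_covered` correspond to the cases the abstract says should not be annotated as retrogenes without further evidence. A real analysis would also need to map protein or genomic alignment coordinates onto the transcript first.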

CONTACT

mlong@uchicago.edu; zhangy@uchicago.edu
