Computational finishing of large sequence contigs reveals interspersed nested repeats and gene islands in the rf1-associated region of maize.

Authors

Kronmiller Brent A, Wise Roger P

Affiliation

Bioinformatics and Computational Biology, Iowa State University, Ames, Iowa 50011-1020, USA.

Publication

Plant Physiol. 2009 Oct;151(2):483-95. doi: 10.1104/pp.109.143370. Epub 2009 Aug 12.

Abstract

The architecture of grass genomes varies on multiple levels. Large long terminal repeat retrotransposon clusters occupy significant portions of the intergenic regions, and islands of protein-encoding genes are interspersed among the repeat clusters. Hence, advanced assembly techniques are required to obtain completely finished genomes as well as to investigate gene and transposable element distributions. To characterize the organization and distribution of repeat clusters and gene islands across large grass genomes, we present 961- and 594-kb contiguous sequence contigs associated with the rf1 (for restorer of fertility1) locus in the near-centromeric region of maize (Zea mays) chromosome 3. We present two methods for computational finishing of highly repetitive bacterial artificial chromosome clones that proved successful in closing all sequence gaps caused by transposable element insertions. Sixteen repeat clusters were observed, ranging in length from 23 to 155 kb. These repeat clusters are almost exclusively long terminal repeat retrotransposons, whose insertion paleontology varies throughout the cluster. Gene islands contain from one to four predicted genes, resulting in a gene density of one gene per 16 kb in gene islands and one gene per 111 kb over the entire sequenced region. The two sequence contigs, when compared with the rice (Oryza sativa) and sorghum (Sorghum bicolor) genomes, retain gene colinearity of 50% and 71%, respectively, rising to 70% and 100%, respectively, for high-confidence gene models. Colinear genes on single gene islands show that while most expansion of the maize genome has occurred in the repeat clusters, gene islands are not immune and have experienced growth in both intragene and intergene locations.
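The two gene densities quoted above can be related by simple arithmetic. The following is a minimal sketch, not a calculation taken from the paper: it assumes the quoted densities are rounded values and that every predicted gene lies inside a gene island. The names `implied_gene_count`, `genes`, `island_kb`, and `island_fraction` are hypothetical and introduced only for illustration.

```python
# Figures quoted in the abstract (kb = kilobases).
CONTIG_LENGTHS_KB = (961, 594)
KB_PER_GENE_OVERALL = 111    # one gene per 111 kb over the entire sequenced region
KB_PER_GENE_IN_ISLANDS = 16  # one gene per 16 kb inside gene islands


def implied_gene_count(total_kb: float, kb_per_gene: float) -> float:
    """Number of genes implied by a density of one gene per `kb_per_gene` kb."""
    return total_kb / kb_per_gene


# Combined length of the two contigs.
total_kb = sum(CONTIG_LENGTHS_KB)

# Gene count implied by the overall density (an estimate, not a reported value).
genes = implied_gene_count(total_kb, KB_PER_GENE_OVERALL)

# Assumption: all genes sit in gene islands, so the island density
# gives the approximate total length occupied by gene islands.
island_kb = genes * KB_PER_GENE_IN_ISLANDS
island_fraction = island_kb / total_kb

print(f"total sequenced length: {total_kb} kb")
print(f"implied gene count: {genes:.1f}")
print(f"implied gene-island length: {island_kb:.0f} kb")
print(f"implied gene-island share of the region: {island_fraction:.1%}")
```

Under these assumptions the share of the region occupied by gene islands reduces to the ratio of the two densities (16/111), so it does not depend on the contig lengths; the remainder would correspond to the repeat clusters and other intergenic sequence.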
