Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.
Genome Biol. 2010;11(2):R12. doi: 10.1186/gb-2010-11-2-r12. Epub 2010 Feb 3.
Miscanthus x giganteus (Mxg) is a perennial grass that produces superior biomass yields in temperate environments. The essentially uncharacterized triploid genome (3n = 57, x = 19) of Mxg is likely critical for the rapid growth of this vegetatively propagated interspecific hybrid.
A survey of the complex Mxg genome was conducted using 454 pyrosequencing of genomic DNA and Illumina sequencing-by-synthesis of small RNA. We found that the coding fraction of the Mxg genome has a high level of sequence identity to that of other grasses. Highly repetitive sequences representing the great majority of the Mxg genome were predicted using non-cognate assembly for de novo repeat detection. Twelve abundant families of repeat were observed, with those related to either transposons or centromeric repeats likely to comprise over 95% of the genome. Comparisons of abundant repeat sequences to a small RNA survey of three Mxg organs (leaf, rhizome, inflorescence) revealed that the majority of observed 24-nucleotide small RNAs are derived from these repetitive sequences. We show that high-copy-number repeats match more of the small RNA, even when the amount of the repeat sequence in the genome is accounted for.
We show that major repeats are present within the triploid Mxg genome and are actively producing small RNAs. We also confirm the hypothesized origins of Mxg, and suggest that while the repeat content of Mxg differs from sorghum, the sorghum genome is likely to be of utility in the assembly of a gene-space sequence of Mxg.
芒属(Miscanthus x giganteus,Mxg)是一种多年生草本植物,在温带环境下能产生优质的生物量。芒属的三倍体基因组(3n = 57,x = 19)尚未被充分研究,但它很可能对这种营养繁殖的种间杂种的快速生长至关重要。
利用基因组 DNA 的 454 焦磷酸测序和小 RNA 的 Illumina 合成测序对复杂的芒属基因组进行了调查。我们发现,芒属基因组的编码区与其他禾本科植物具有高度的序列同一性。利用非同源组装进行从头重复检测,预测了代表芒属基因组绝大多数的高度重复序列。观察到 12 个丰富的重复家族,其中与转座子或着丝粒重复相关的重复序列可能占基因组的 95%以上。将丰富的重复序列与对三个芒属器官(叶片、根茎、花序)的小 RNA 调查进行比较,结果表明,大多数观察到的 24 核苷酸小 RNA 来源于这些重复序列。我们表明,高拷贝数的重复序列匹配更多的小 RNA,即使考虑到基因组中重复序列的数量。
我们表明,主要重复序列存在于三倍体芒属基因组中,并能积极产生小 RNA。我们还证实了芒属的假定起源,并提出尽管芒属的重复含量与高粱不同,但高粱基因组可能对组装芒属的基因空间序列有用。