CRIBI Biotechnology Center, University of Padova, Padova, Italy.
PLoS One. 2011;6(10):e26421. doi: 10.1371/journal.pone.0026421. Epub 2011 Oct 18.
Wheat is one of the world's most important crops and is characterized by a large polyploid genome. One way to reduce genome complexity is to isolate single chromosomes using flow cytometry. Low coverage DNA sequencing can provide a snapshot of individual chromosomes, allowing a fast characterization of their main features and comparison with other genomes. We used massively parallel 454 pyrosequencing to obtain a 2x coverage of wheat chromosome 5A. The resulting sequence assembly was used to identify TEs, genes and miRNAs, as well as to infer a virtual gene order based on the synteny with other grass genomes. Repetitive elements account for more than 75% of the genome. Gene content was estimated considering non-redundant reads showing at least one match to ESTs or proteins. The results indicate that the coding fraction represents 1.08% and 1.3% of the short and long arm respectively, projecting the number of genes of the whole chromosome to approximately 5,000. 195 candidate miRNA precursors belonging to 16 miRNA families were identified. The 5A genes were used to search for syntenic relationships between grass genomes. The short arm is closely related to Brachypodium chromosome 4, sorghum chromosome 8 and rice chromosome 12; the long arm to regions of Brachypodium chromosomes 4 and 1, sorghum chromosomes 1 and 2 and rice chromosomes 9 and 3. From these similarities it was possible to infer the virtual gene order of 392 (5AS) and 1,480 (5AL) genes of chromosome 5A, which was compared to, and found to be largely congruent with the available physical map of this chromosome.
小麦是世界上最重要的作物之一,其基因组具有高度的多倍体特征。一种降低基因组复杂性的方法是使用流式细胞术分离单个染色体。低覆盖度的 DNA 测序可以提供单个染色体的快照,从而快速描述其主要特征并与其他基因组进行比较。我们使用大规模平行 454 焦磷酸测序技术对小麦 5A 染色体进行了 2x 覆盖。所得序列组装用于鉴定转座元件、基因和 miRNA,以及根据与其他禾本科基因组的共线性推断虚拟基因顺序。重复元件占基因组的 75%以上。基因含量是通过考虑至少与 EST 或蛋白质有一次匹配的非冗余读数来估计的。结果表明,编码部分分别占短臂和长臂的 1.08%和 1.3%,预计整条染色体的基因数约为 5000 个。鉴定出属于 16 个 miRNA 家族的 195 个候选 miRNA 前体。5A 基因用于在禾本科基因组之间搜索共线性关系。短臂与短柄草 4 号染色体、高粱 8 号染色体和水稻 12 号染色体密切相关;长臂与短柄草 4 号和 1 号染色体、高粱 1 号和 2 号染色体以及水稻 9 号和 3 号染色体的区域密切相关。根据这些相似性,可以推断出 5A 染色体 392 个(5AS)和 1480 个(5AL)基因的虚拟基因顺序,并与该染色体的现有物理图谱进行比较,发现它们在很大程度上是一致的。