Co-Innovation Center for Sustainable Forestry in Southern China, College of Forestry, Nanjing Forestry University, Nanjing, China.
J Hered. 2021 May 24;112(3):303-310. doi: 10.1093/jhered/esab010.
Populus deltoides has important ecological and economic values, widely used in poplar breeding programs due to its superior characteristics such as rapid growth and resistance to disease. Although the genome sequence of P. deltoides WV94 is available, the assembly is fragmented. Here, we reported an improved chromosome-level assembly of the P. deltoides cultivar I-69 by combining Nanopore sequencing and chromosome conformation capture (Hi-C) technologies. The assembly was 429.3 Mb in size and contained 657 contigs with a contig N50 length of 2.62 Mb. Hi-C scaffolding of the contigs generated 19 chromosome-level sequences, which covered 97.4% (418 Mb) of the total assembly size. Moreover, repetitive sequences annotation showed that 39.28% of the P. deltoides genome was composed of interspersed elements, including retroelements (23.66%), DNA transposons (6.83%), and unclassified elements (8.79%). We also identified a total of 44 362 protein-coding genes in the current P. deltoides assembly. Compared with the previous genome assembly of P. deltoides WV94, the current assembly had some significantly improved qualities: the contig N50 increased 3.5-fold and the proportion of gaps decreased from 3.2% to 0.08%. This high-quality, well-annotated genome assembly provides a reliable genomic resource for identifying genome variants among individuals, mining candidate genes that control growth and wood quality traits, and facilitating further application of genomics-assisted breeding in populations related to P. deltoides.
杨树种质 I-69 的染色体水平组装和基因注释
杨属(Populus)具有重要的生态和经济价值,由于其生长迅速、抗病等优良特性,被广泛应用于杨树的选育计划中。虽然已经获得了 P. deltoides WV94 的基因组序列,但组装结果较为碎片化。在这里,我们结合纳米孔测序和染色体构象捕获(Hi-C)技术,报道了对 P. deltoides 品种 I-69 的改进的染色体水平组装。组装大小为 429.3Mb,包含 657 个 contigs,其 contig N50 长度为 2.62Mb。Hi-C 支架技术将 contigs 构建成 19 条染色体水平的序列,覆盖了总组装大小的 97.4%(418Mb)。此外,重复序列注释表明,杨树基因组的 39.28%由散布元件组成,包括反转元件(23.66%)、DNA 转座子(6.83%)和未分类元件(8.79%)。我们还在当前的 P. deltoides 组装中鉴定了总共 44362 个蛋白质编码基因。与之前的 P. deltoides WV94 基因组组装相比,当前的组装具有一些显著的改进:contig N50 增加了 3.5 倍,间隙的比例从 3.2%下降到 0.08%。这个高质量、注释良好的基因组组装为鉴定个体间的基因组变异、挖掘控制生长和木材质量性状的候选基因,以及促进基于基因组学的辅助育种在与 P. deltoides 相关的群体中的进一步应用提供了可靠的基因组资源。