College of Horticulture, Shenyang Agricultural University, Shenyang, China.
College of Bioscience and Biotechnology, Shenyang Agricultural University, Shenyang, China.
Mol Ecol Resour. 2022 Aug;22(6):2396-2410. doi: 10.1111/1755-0998.13616. Epub 2022 Apr 13.
Mongolian oak (Quercus mongolica Fisch.) is an ecologically and economically important white oak species native to and widespread in the temperate zone of East Asia. Here, we present a chromosome-scale reference genome assembly of Q. mongolica, a representative white oak species, by combining Illumina and PacBio data with Hi-C mapping technologies that is the first reference genome created for an Asian oak. Our results showed that the PacBio draft genome size was 809.84 Mb, with a BUSCO complete gene percentage of 92.71%. Hi-C scaffolding anchored 774.59 Mb contigs (95.65% of draft assembly) onto 12 pseudochromosomes. The contig N50 and scaffold N50 were 2.64 and 66.74 Mb, respectively. Of the 36,553 protein-coding genes predicted in the study, approximately 95% had functional annotations in public databases. A total of 435.34 Mb (53.75% of the genome) of repetitive sequences were predicted in the assembled genome. Genome evolution analysis showed that Q. mongolica is closely related to Q. robur from Europe, and they shared a common ancestor ~11.8 million years ago (Ma). Gene family evolution analysis of Q. mongolica revealed that the nucleotide-binding site (NBS)-encoding gene family related to disease resistance was significantly contracted, whereas the ECERIFERUM 1 (CER1) homologous genes related to cuticular wax biosynthesis was significantly expanded. This pioneering Asian oak genome resource represents an important supplement to the oak genomics community and will improve our understanding of Asian white oak biology and evolution.
蒙古栎(Quercus mongolica Fisch.)是一种生态和经济上重要的白栎物种,原产于东亚温带地区,并广泛分布于此。在这里,我们通过结合 Illumina 和 PacBio 数据以及 Hi-C 图谱技术,为一个代表白栎物种的蒙古栎提供了一个染色体水平的参考基因组组装,这是第一个为亚洲栎属植物创建的参考基因组。我们的结果表明,PacBio 草案基因组大小为 809.84 Mb,具有 92.71%的完整 BUSCO 基因百分比。Hi-C 支架将 774.59 Mb 的重叠群(草案组装的 95.65%)锚定到 12 个假染色体上。重叠群 N50 和支架 N50 分别为 2.64 Mb 和 66.74 Mb。在研究中预测的 36553 个蛋白质编码基因中,约 95%在公共数据库中有功能注释。在组装的基因组中预测了总共 435.34 Mb(基因组的 53.75%)的重复序列。基因组进化分析表明,蒙古栎与欧洲的 Q. robur 关系密切,它们大约在 1180 万年前(Ma)拥有共同的祖先。蒙古栎基因家族进化分析表明,与抗病性相关的核苷酸结合位点(NBS)编码基因家族显著收缩,而与角质层蜡生物合成相关的 ECERIFERUM 1(CER1)同源基因显著扩张。这个开创性的亚洲栎属基因组资源是栎属植物基因组学领域的一个重要补充,将提高我们对亚洲白栎生物学和进化的认识。