Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, BC, V5Z 4S6, Canada.
Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada.
Plant J. 2015 Jul;83(2):189-212. doi: 10.1111/tpj.12886. Epub 2015 Jun 19.
White spruce (Picea glauca), a gymnosperm tree, has been established as one of the models for conifer genomics. We describe the draft genome assemblies of two white spruce genotypes, PG29 and WS77111, innovative tools for the assembly of very large genomes, and the conifer genomics resources developed in this process. The two white spruce genotypes originate from distant geographic regions of western (PG29) and eastern (WS77111) North America, and represent elite trees in two Canadian tree-breeding programs. We present an update (V3 and V4) for a previously reported PG29 V2 draft genome assembly and introduce a second white spruce genome assembly for genotype WS77111. Assemblies of the PG29 and WS77111 genomes confirm the reconstructed white spruce genome size in the 20 Gbp range, and show broad synteny. Using the PG29 V3 assembly and additional white spruce genomics and transcriptomics resources, we performed MAKER-P annotation and meticulous expert annotation of very large gene families of conifer defense metabolism, the terpene synthases and cytochrome P450s. We also comprehensively annotated the white spruce mevalonate, methylerythritol phosphate and phenylpropanoid pathways. These analyses highlighted the large extent of gene and pseudogene duplications in a conifer genome, in particular for genes of secondary (i.e. specialized) metabolism, and the potential for gain and loss of function for defense and adaptation.
白云杉(Picea glauca)是一种裸子植物,已被确立为针叶树基因组学的模型之一。我们描述了两个白云杉基因型 PG29 和 WS77111 的基因组草案组装,这是组装非常大基因组的创新工具,以及在这个过程中开发的针叶树基因组学资源。这两个白云杉基因型来自北美西部(PG29)和东部(WS77111)的遥远地理区域,代表了两个加拿大树木育种计划中的优秀树木。我们展示了之前报道的 PG29 V2 草案基因组组装的更新版本(V3 和 V4),并引入了第二个白云杉基因型 WS77111 的基因组组装。PG29 和 WS77111 基因组的组装证实了重建的白云杉基因组大小在 200 亿碱基对范围内,并显示出广泛的同线性。使用 PG29 V3 组装和其他白云杉基因组学和转录组学资源,我们进行了 MAKER-P 注释和对针叶树防御代谢、萜烯合酶和细胞色素 P450s 的非常大的基因家族的细致专家注释。我们还全面注释了白云杉甲羟戊酸、甲基赤藓醇磷酸和苯丙烷途径。这些分析突出了针叶树基因组中基因和假基因重复的广泛程度,特别是对于次生(即特化)代谢的基因,以及防御和适应功能获得和丧失的潜力。