Clinomics, Ulsan National Institute of Science and Technology (UNIST), Ulsan, Korea.
Personal Genomics Institute (PGI), Genome Research Foundation, Cheongju, Korea.
Mol Ecol Resour. 2022 Apr;22(3):1168-1177. doi: 10.1111/1755-0998.13537. Epub 2021 Nov 5.
Cymbidium goeringii, commonly known as the spring orchid, has long been favoured for horticultural purposes in Asian countries. It is a popular orchid with much demand for improvement and development for its valuable varieties. Until now, its reference genome has not been published despite its popularity and conservation efforts. Here, we report the de novo assembly of the C. goeringii genome, which is the largest among the orchids published to date, using a strategy that combines short- and long-read sequencing and chromosome conformation capture (Hi-C) information. The total length of all scaffolds is 3.99 Gb, with an N50 scaffold size of 178.2 Mb. A total of 29,556 protein-coding genes were annotated and 3.55 Gb (88.87% of genome) repetitive sequences were identified. We constructed pseudomolecular chromosomes using Hi-C, incorporating 89.4% of the scaffolds in 20 chromosomes. We identified 220 expanded and 106 contracted genes families in C. goeringii after divergence from its close relative. We also identified new gene families, resistance gene analogues and changes within the MADS-box genes, which control a diverse set of developmental processes during orchid evolution. Our high quality chromosomal-level assembly of C. goeringii can provide a platform for elucidating the genomic evolution of orchids, mining functional genes for agronomic traits and for developing molecular markers for accelerated breeding as well as accelerating conservation efforts.
大花蕙兰,俗称春兰,长期以来一直是亚洲国家园艺的宠儿。它是一种受欢迎的兰花,因其有价值的品种而有很大的改进和发展需求。尽管它很受欢迎且有保护措施,但直到现在,它的参考基因组仍未公布。在这里,我们报告了大花蕙兰基因组的从头组装,这是迄今为止发表的兰花中最大的基因组,该组装结合了短读长和长读长测序以及染色体构象捕获(Hi-C)信息。所有支架的总长度为 39.9Gb,N50 支架大小为 178.2Mb。共注释了 29556 个蛋白质编码基因,鉴定了 3.55Gb(基因组的 88.87%)重复序列。我们使用 Hi-C 构建了拟分子染色体,将 20 条染色体中的 89.4%的支架纳入其中。在与近亲分化后,我们在大花蕙兰中鉴定出了 220 个扩展和 106 个收缩基因家族。我们还鉴定了新的基因家族、抗病基因类似物和 MADS 盒基因内的变化,这些变化控制着兰花进化过程中多样化的发育过程。我们高质量的大花蕙兰染色体水平组装可以为阐明兰花的基因组进化、挖掘农艺性状的功能基因以及开发加速育种的分子标记以及加速保护工作提供一个平台。