Zhang Xinhua, Li MingZhi, Bian Zhan, Chen Xiaohong, Li Yuan, Xiong Yuping, Fang Lin, Wu Kunlin, Zeng Songjun, Jian Shuguang, Wang Rujiang, Ren Hai, Teixeira da Silva Jaime A, Ma Guohua
Key Laboratory of South China Agricultural Plant Molecular Analysis and Genetic Improvement & Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China.
Bio&Data Biotechnologies Co. Ltd., Guangzhou, 510700, China.
Sci Data. 2023 Dec 21;10(1):921. doi: 10.1038/s41597-023-02849-x.
Santalum album is a well-known aromatic and medicinal plant that is highly valued for the essential oil (EO) extracted from its heartwood. In this study, we present a high-quality chromosome-level genome assembly of S. album after integrating PacBio Sequel, Illumina HiSeq paired-end and high-throughput chromosome conformation capture sequencing technologies. The assembled genome size is 207.39 M with a contig N50 of 7.33 M and scaffold N50 size of 18.31 M. Compared with three previously published sandalwood genomes, the N50 length of the genome assembly was longer. In total, 94.26% of the assembly was assigned to 10 pseudo-chromosomes, and the anchor rate far exceeded that of a recently released value. BUSCO analysis yielded a completeness score of 94.91%. In addition, we predicted 23,283 protein-coding genes, 89.68% of which were functionally annotated. This high-quality genome will provide a foundation for sandalwood functional genomics studies, and also for elucidating the genetic basis of EO biosynthesis in S. album.
檀香是一种著名的芳香药用植物,其心材中提取的精油(EO)具有很高的价值。在本研究中,我们整合了PacBio Sequel、Illumina HiSeq双末端测序和高通量染色体构象捕获测序技术,展示了檀香的高质量染色体水平基因组组装。组装后的基因组大小为207.39 M,重叠群N50为7.33 M,支架N50大小为18.31 M。与之前发表的三个檀香基因组相比,该基因组组装的N50长度更长。总共94.26%的组装序列被分配到10条假染色体上,其锚定率远远超过最近公布的值。BUSCO分析得出的完整性评分为94.91%。此外,我们预测了23283个蛋白质编码基因,其中89.68%在功能上得到了注释。这个高质量的基因组将为檀香功能基因组学研究提供基础,也有助于阐明檀香中精油生物合成的遗传基础。