Yan Kuan, Zhu Hui, Cao Guiling, Meng Lina, Li Junqiang, Zhang Jian, Liu Sicen, Wang Yujie, Feng Ruizhang, Soaud Salma A, Elhamid Mohamed A Abd, Heakel Rania M Y, Wei Qin, El-Sappah Ahmed H, Ru Dafu
Faculty of Agriculture, Forestry and Food Engineering, Yibin University, Yibin, China.
Sichuan Oil Cinnamon Engineering Technology Research Center, Yibin University, Yibin, China.
Front Plant Sci. 2024 Jun 26;15:1372127. doi: 10.3389/fpls.2024.1372127. eCollection 2024.
, a crucial commercial crop and a fundamental component of traditional Chinese medicine, is renowned for its abundant production of volatile terpenoids. However, the lack of available genomic information has hindered pertinent research efforts in the past.
To bridge this gap, the present study aimed to use PacBio HiFi, short-read, and highthroughput chromosome conformation capture sequencing to construct a chromosome-level assembly of the genome.
With twelve chromosomes accounting for 99.82% (766.69 Mb) of the final genome assembly, which covered 768.10 Mb, it was very complete. Remarkably, the assembly's contig and scaffold N50 values are exceptional as well-41.12 and 63.78 Mb, respectively-highlighting its excellent quality and intact structure. Furthermore, a total of 39,173 protein-coding genes were predicted, with 38,766 (98.96%) of them being functionally annotated. The completeness of the genome was confirmed by the Benchmarking Universal Single-Copy Ortholog evaluation, which revealed 99.01% of highly conserved plant genes. As the first comprehensive assembly of the genome, it provides a crucial starting point for deciphering the complex pathways involved in terpenoid production. Furthermore, this excellent genome serves as a vital resource for upcoming research on the breeding and genetics of .
[植物名称]是一种重要的经济作物,也是中药的重要组成部分,以大量产生挥发性萜类化合物而闻名。然而,过去缺乏可用的基因组信息阻碍了相关研究工作。
为了弥补这一差距,本研究旨在使用PacBio HiFi、短读长和高通量染色体构象捕获测序来构建[植物名称]基因组的染色体水平组装。
最终的基因组组装包含12条染色体,占768.10 Mb的99.82%(766.69 Mb),非常完整。值得注意的是,组装的重叠群和支架N50值也非常出色,分别为41.12 Mb和63.78 Mb,突出了其优异的质量和完整的结构。此外,总共预测了39173个蛋白质编码基因,其中38766个(98.96%)具有功能注释。通过基准通用单拷贝直系同源物评估证实了基因组的完整性,该评估揭示了99.01%的高度保守植物基因。作为[植物名称]基因组的首次全面组装,它为破译萜类化合物产生所涉及的复杂途径提供了关键的起点。此外,这个出色的基因组是[植物名称]未来育种和遗传学研究的重要资源。