Yunnan Key Laboratory for Integrative Conservation of Plant Species with Extremely Small Populations, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, China.
CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, China.
Sci Data. 2023 Dec 15;10(1):901. doi: 10.1038/s41597-023-02821-9.
Microcos paniculata is a shrub used traditionally as folk medicine and to make herbal teas. Previous research into this species has mainly focused on its chemical composition and medicinal value. However, the lack of a reference genome limits the study of the molecular mechanisms of active compounds in this species. Here, we assembled a haplotype-resolved chromosome-level genome of M. paniculata based on PacBio HiFi and Hi-C data. The assembly contains two haploid genomes with sizes 399.43 Mb and 393.10 Mb, with contig N50 lengths of 43.44 Mb and 30.17 Mb, respectively. About 99.93% of the assembled sequences could be anchored to 18 pseudo-chromosomes. Additionally, a total of 482 Mb repeat sequences were identified, accounting for 60.76% of the genome. A total of 49,439 protein-coding genes were identified, of which 48,979 (99%) were functionally annotated. This haplotype-resolved chromosome-level assembly and annotation of M. paniculata will serve as a valuable resource for investigating the biosynthesis and genetic basis of active compounds in this species, as well as advancing evolutionary phylogenomic studies in Malvales.
微毛黄皮树是一种灌木,传统上被用作民间药物和草药茶。以前对该物种的研究主要集中在其化学成分和药用价值上。然而,缺乏参考基因组限制了对该物种中活性化合物的分子机制的研究。在这里,我们基于 PacBio HiFi 和 Hi-C 数据组装了一个单倍型解析的染色体水平基因组。组装包含两个大小分别为 399.43 Mb 和 393.10 Mb 的单倍体基因组,其 contig N50 长度分别为 43.44 Mb 和 30.17 Mb。大约 99.93%的组装序列可以锚定到 18 个假染色体上。此外,总共鉴定出 482 Mb 的重复序列,占基因组的 60.76%。总共鉴定出 49439 个蛋白质编码基因,其中 48979 个(99%)具有功能注释。这个单倍型解析的染色体水平组装和微毛黄皮树的注释将作为研究该物种中活性化合物生物合成和遗传基础的有价值资源,并推进 Malvales 进化系统基因组学研究。