Hu Yongle, Ma Dongna, Ning Shuju, Ye Qi, Zhao Xuanxuan, Ding Qiansu, Liang Pingping, Cai Guoqian, Ma Xiaomao, Qin Xia, Wei Daozhi
College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China.
College of Ecology and Resource Engineering, Wuyi University, Wuyishan, China.
Front Plant Sci. 2021 Sep 30;12:742420. doi: 10.3389/fpls.2021.742420. eCollection 2021.
(Nees) Kuntze is an important plant used to process the traditional Chinese herbal medicines "Qingdai" and "Nanbanlangen". The key active ingredients are indole alkaloids (IAs) that exert antibacterial, antiviral, and antitumor pharmacological activities and serve as natural dyes. We assembled the genome at the chromosome level through combined PacBio circular consensus sequencing (CCS) and Hi-C sequencing data. Hi-C data revealed a draft genome size of 913.74 Mb, with 904.18 Mb contigs anchored into 16 pseudo-chromosomes. Contig N50 and scaffold N50 were 35.59 and 68.44 Mb, respectively. Of the 32,974 predicted protein-coding genes, 96.52% were functionally annotated in public databases. We predicted 675.66 Mb repetitive sequences, 47.08% of sequences were long terminal repeat (LTR) retrotransposons. Moreover, 983 -specific genes (SSGs) were identified for the first time, accounting for ~2.98% of all protein-coding genes. Further, 245 putative centromeric and 29 putative telomeric fragments were identified. The transcriptome analysis identified 2,975 differentially expressed genes (DEGs) enriched in phenylpropanoid, flavonoid, and triterpenoid biosynthesis. This systematic characterization of key enzyme-coding genes associated with the IA pathway and basic helix-loop-helix (bHLH) transcription factor family formed a network from the shikimate pathway to the indole alkaloid synthesis pathway in . The high-quality genome presented herein is an essential resource for the traditional Chinese medicine genomics studies and understanding the genetic underpinning of IA biosynthesis.
(菘蓝)菘蓝是一种重要的植物,用于加工传统中药“青黛”和“南板蓝根”。其关键活性成分是吲哚生物碱(IAs),具有抗菌、抗病毒和抗肿瘤药理活性,并可作为天然染料。我们通过结合PacBio环形一致序列测序(CCS)和Hi-C测序数据,在染色体水平上组装了基因组。Hi-C数据显示基因组草图大小为913.74 Mb,其中904.18 Mb的重叠群被锚定到16条假染色体上。重叠群N50和支架N50分别为35.59 Mb和68.44 Mb。在32974个预测的蛋白质编码基因中,96.52%在公共数据库中得到了功能注释。我们预测了675.66 Mb的重复序列,其中47.08%的序列是长末端重复(LTR)反转录转座子。此外,首次鉴定出983个特异性基因(SSGs),占所有蛋白质编码基因的约2.98%。此外,还鉴定出245个推定的着丝粒片段和29个推定的端粒片段。转录组分析确定了2975个差异表达基因(DEGs),这些基因富集于苯丙烷类、黄酮类和三萜类生物合成途径。这种与IA途径相关的关键酶编码基因和基本螺旋-环-螺旋(bHLH)转录因子家族的系统表征,在菘蓝中形成了一个从莽草酸途径到吲哚生物碱合成途径的网络。本文呈现的高质量基因组是中药基因组学研究以及理解IA生物合成遗传基础的重要资源。