Du Yuhui, Song Wei, Yin Zhiqiu, Wu Shengbo, Liu Jiaheng, Wang Ning, Jin Hua, Qiao Jianjun, Huo Yi-Xin
Key Laboratory of Molecular Medicine and Biotherapy, School of Life Sciences, Beijing Institute of Technology, Beijing, China.
National Engineering Laboratory for Efficient Utilization of Soil and Fertilizer Resources, College of Resources and Environment, Shandong Agricultural University, Tai'an, China.
Front Plant Sci. 2022 Apr 18;13:853861. doi: 10.3389/fpls.2022.853861. eCollection 2022.
(neem), an evergreen tree of the Meliaceae family, is a source of the potent biopesticide azadirachtin. The lack of a chromosome-level assembly impedes an in-depth understanding of its genome architecture and the comparative genomic analysis of . Here, a high-quality genome assembly of was constructed using a combination of data from Illumina, PacBio, and Hi-C technology, which is the first chromosome-scale genome assembly of . Based on the length of our assembly, the genome size of is estimated to be 281 Mb anchored to 14 chromosomes (contig N50 = 6 Mb and scaffold N50 = 19 Mb). The genome assembly contained 115 Mb repetitive elements and 25,767 protein-coding genes. Evolutional analysis revealed that didn't experience any whole-genome duplication (WGD) event after the core eudicot γ event, but some genes and genome segment might likely experienced recent duplications. The secondary metabolite clusters, TPS genes, and CYP genes were also identified. Comparative genomic analysis revealed that most of the -specific TPS genes and CYP genes were located on the terpene-related clusters on chromosome 13. It is suggested that chromosome 13 may play an important role in the specific terpene biosynthesis of . The gene duplication events may be responsible for the terpene biosynthesis expansion in . The genomic dataset and genomic analysis created for will shed light on terpene biosynthesis in and facilitate comparative genomic research of the family Meliaceae.
印楝是楝科的一种常绿乔木,是强效生物农药印楝素的来源。缺乏染色体水平的组装阻碍了对其基因组结构的深入理解以及对其进行比较基因组分析。在此,利用Illumina、PacBio和Hi-C技术的数据构建了高质量的印楝基因组组装,这是印楝的首个染色体级基因组组装。根据我们组装的长度,印楝的基因组大小估计为281 Mb,锚定在14条染色体上(重叠群N50 = 6 Mb,支架N50 = 19 Mb)。该基因组组装包含115 Mb重复元件和25,767个蛋白质编码基因。进化分析表明,印楝在核心真双子叶植物γ事件之后没有经历任何全基因组复制(WGD)事件,但一些基因和基因组片段可能经历了近期的复制。还鉴定了次生代谢物簇、TPS基因和CYP基因。比较基因组分析表明,大多数印楝特异性TPS基因和CYP基因位于13号染色体上与萜烯相关的簇上。表明13号染色体可能在印楝特定萜烯生物合成中起重要作用。基因复制事件可能是印楝中萜烯生物合成扩展的原因。为印楝创建的基因组数据集和基因组分析将为印楝中的萜烯生物合成提供线索,并促进楝科的比较基因组研究。