Institute for Information Transmission Problems of the Russian Academy of Sciences, Moscow, Russia.
Skolkovo Institute of Science and Technology, Moscow, Russia.
Plant J. 2024 Jan;117(2):449-463. doi: 10.1111/tpj.16500. Epub 2023 Oct 17.
Heracleum sosnowskyi, a member of the giant hogweed group, is a plant with a major impact on ecosystems and human health. It is an invasive species that contributes to the deterioration of grassland ecosystems. Its ability to produce linear furanocoumarins (FCs), which are photosensitizing compounds, makes it very dangerous. At the same time, linear FCs have high pharmaceutical value and are used in skin disease therapies. Despite this importance, H. sosnowskyi has not been the focus of genetic and genomic studies. Here, we report a chromosome-scale assembly of the Sosnowsky's hogweed genome. Genomic analysis revealed an unusually high number of genes (55,106) in the hogweed genome, in contrast to the 25-35 thousand found in most plants. However, we did not find any traces of recent whole-genome duplications that are not shared with its confamiliar, Daucus carota (carrot), which has approximately thirty thousand genes. Analysis of the genomic proximity of duplicated genes points to tandem duplications as the main reason for this increase. We performed a genome-wide search for genes of the FC biosynthesis pathway and surveyed their expression in aboveground plant parts. Using a combination of expression data and phylogenetic analysis, we identified candidate genes for psoralen synthase and experimentally demonstrated the activity of one of them using a heterologous yeast expression system. These findings expand our knowledge of the evolution of gene space in plants and lay a foundation for further analysis of hogweed as an invasive plant and as a source of FCs.