Key Laboratory of Vegetation Restoration and Management of Degraded Ecosystems, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China.
Key Laboratory of National Forestry and Grassland Administration on Plant Conservation and Utilization in Southern China, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China.
BMC Genom Data. 2024 May 23;25(1):48. doi: 10.1186/s12863-024-01230-0.
Ottelia Pers. is in the Hydrocharitaceae family. Species in the genus are aquatic, and China is their centre of origin in Asia. Ottelia alismoides (L.) Pers., which is distributed worldwide, is a distinguishing element in China, while other species of this genus are endemic to China. However, O. alismoides is also considered endangered due to habitat loss and pollution in some Asian countries. Ottelia alismoides is the only submerged macrophyte that contains three carbon dioxide-concentrating mechanisms, i.e. bicarbonate (HCO) use, crassulacean acid metabolism and the C4 pathway. In this study, we present its first genome assembly to help illustrate the various carbon metabolism mechanisms and to enable genetic conservation in the future.
Using DNA and RNA extracted from one O. alismoides leaf, this work produced ∼ 73.4 Gb HiFi reads, ∼ 126.4 Gb whole genome sequencing short reads and ∼ 21.9 Gb RNA-seq reads. The de novo genome assembly was 6,455,939,835 bp in length, with 11,923 scaffolds/contigs and an N50 of 790,733 bp. Genome assembly completeness assessment with Benchmarking Universal Single-Copy Orthologs revealed a score of 94.4%. The repetitive sequence in the assembly was 4,875,817,144 bp (75.5%). A total of 116,176 genes were predicted. The protein sequences were functionally annotated against multiple databases, facilitating comparative genomic analysis.
茨藻科中的茨藻属物种均为水生植物,中国是该属在亚洲的起源中心。分布于全球的菹草(Ottelia alismoides (L.) Pers.)是中国的特有种,而该属的其他物种则是中国特有的。然而,由于一些亚洲国家的栖息地丧失和污染,菹草也被认为处于濒危状态。菹草是唯一含有三种二氧化碳浓缩机制的沉水植物,即碳酸氢盐(HCO)利用、景天酸代谢和 C4 途径。在本研究中,我们首次组装了其基因组,以帮助阐明各种碳代谢机制,并为未来的遗传保护提供支持。
本研究从一片菹草叶片中提取 DNA 和 RNA,生成了约 73.4 Gb 的 HiFi reads、约 126.4 Gb 的全基因组测序短 reads和约 21.9 Gb 的 RNA-seq reads。从头组装的基因组大小为 6,455,939,835bp,由 11,923 个 scaffolds/contigs 组成,N50 为 790,733bp。使用基准通用单拷贝直系同源物评估基因组组装完整性,得到 94.4%的得分。组装中重复序列的长度为 4,875,817,144bp(75.5%)。共预测到 116,176 个基因。蛋白质序列被注释到多个数据库中,有助于进行比较基因组分析。