Northing Poppy C, Pelosi Jessie A, Venable D Lawrence, Dlugosch Katrina M
Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA.
Appl Plant Sci. 2025 May 21;13(3):e70008. doi: 10.1002/aps3.70008. eCollection 2025 May-Jun.
The focal species of this study (Boraginaceae, subfamily Cynoglossoideae), native to the Sonoran Desert (North America), has served as a model system for a suite of ecological and evolutionary studies. However, no reference genomes are currently available in Cynoglossoideae. A high-quality reference genome for this species would be valuable for addressing questions in this system and across broader taxonomic scales.
Using PacBio HiFi sequencing, we assembled a reference genome for the species and annotated coding regions with full-length transcripts from an Iso-Seq library. We assessed genome completeness with BUSCO and k-mer analysis, and estimated the genome size of six individuals using flow cytometry.
The chromosome-scale genome assembly was 216.0 Mbp long (N50 = 12.1 Mbp). Previous observations indicated that the species is 2n = 24. Our assembly included 12 primary contigs (158.3 Mbp), matching the haploid chromosome number implied by 2n = 24; these contigs contained 30,655 genes and had telomeres at 23 of their 24 ends. Flow cytometry measurements from the same population included two plants with 1C = 196.9 Mbp, the smallest measured for Boraginaceae, and four with 1C = 385.8 Mbp, roughly double the smaller value, which is consistent with tetraploidy in this population.
The genome assembly and annotation provide a high-quality genomic resource in a sparsely represented area of the angiosperm phylogeny. This new reference genome will facilitate answering open questions in ecophysiology, biogeography, and systematics.
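For readers unfamiliar with the summary statistics quoted above, the following is a minimal sketch of how an N50 value is computed and of the arithmetic behind the tetraploidy inference. It is not the authors' pipeline: the function name `n50` and the toy contig lengths are hypothetical, and only the Mbp figures are taken from the abstract.

```python
def n50(lengths):
    """Return the N50 of a list of contig lengths.

    N50 is the length of the shortest contig in the smallest set of
    longest contigs that together cover at least half of the assembly.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Hypothetical toy contig lengths (arbitrary units), for illustration only.
toy_contigs = [50, 40, 30, 20, 10]
toy_n50 = n50(toy_contigs)

# Figures reported in the abstract (Mbp).
assembly_total = 216.0
primary_contigs_total = 158.3
small_1c = 196.9
large_1c = 385.8

# Share of the assembly contained in the 12 primary contigs.
primary_fraction = primary_contigs_total / assembly_total

# Ratio of the two flow cytometry size classes; a value near 2 is what
# one would expect if the larger class had twice the ploidy of the smaller.
size_ratio = large_1c / small_1c

print(f"toy N50: {toy_n50}")
print(f"primary contig fraction: {primary_fraction:.2f}")
print(f"1C ratio (large/small): {size_ratio:.2f}")
```

The ratio of the two measured 1C values comes out just under 2, which is the basis for reading the larger size class as a doubled genome.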