Zeng Zhefei, Mao Chunmin, Zhang Jiayin, Tian Zhongqiong, Xu Min, Li Wei, Wang Junwei, Zhang Wenju, Qiong La
Key Laboratory of Biodiversity and Environment on the Qinghai-Tibetan Plateau, Ministry of Education, School of Ecology and Environment, Xizang University, Lasa, China.
Yani Observation and Research Station for Wetland Ecosystem of the Tibet (Xizang) Autonomous Region, Xizang University, Linzhi, China.
Sci Data. 2025 Aug 28;12(1):1503. doi: 10.1038/s41597-025-05844-6.
Hippophae salicifolia, a dioecious small tree species endemic to the Himalayan region, holds great potential in both ecological conservation and industrial applications. In this study, we employed PacBio HiFi long reads, Illumina short reads, and Hi-C technology to construct a high-quality, chromosome-level reference genome. The assembled genome is approximately 1.11 Gb in size, with a scaffold N50 of 95.29 Mb, and 99.94% of the sequences were successfully anchored to 12 pseudo-chromosomes. A total of 42,547 protein-coding genes were predicted, and approximately 85% of these genes obtained functional annotations. Repetitive elements constituted about 45.25% of the genome, with Long Terminal Repeat (LTR) being the most abundant (32.54%). BUSCO analysis indicated that both the assembly and annotation are highly complete. This high-quality genomic resource provides a valuable foundation for investigating sex determination mechanisms, adaptive evolution, and genomic diversity in H. salicifolia and related species, as well as for advancing genetic improvement, resource conservation, and utilization efforts.
沙棘(Hippophae salicifolia)是喜马拉雅地区特有的雌雄异株小乔木物种,在生态保护和工业应用方面都具有巨大潜力。在本研究中,我们利用PacBio HiFi长读长、Illumina短读长和Hi-C技术构建了一个高质量的染色体水平参考基因组。组装后的基因组大小约为1.11 Gb,支架N50为95.29 Mb,99.94%的序列成功锚定到12条假染色体上。共预测到42,547个蛋白质编码基因,其中约85%的基因获得了功能注释。重复元件约占基因组的45.25%,其中长末端重复序列(LTR)最为丰富(32.54%)。BUSCO分析表明组装和注释都非常完整。这一高质量的基因组资源为研究沙棘及其相关物种的性别决定机制、适应性进化和基因组多样性,以及推进遗传改良、资源保护和利用工作提供了宝贵的基础。