Da Xinwei, Liu Yanrui, Jin Xun, Lu Xin
Key Laboratory of Biodiversity and Environment on the Qinghai-Tibetan Plateau of the Ministry of Education, College of Life Sciences, Wuhan University, Wuhan, China.
Department of Ecology, College of Life Sciences, Henan Normal University, Xinxiang, China.
Sci Data. 2025 May 15;12(1):799. doi: 10.1038/s41597-025-05171-w.
Pseudopodoces humilis is a small passerine bird found predominantly in the mid-latitude regions of the Tibetan Plateau in Asia. We generated a chromosome-level reference genome assembly for P. humilis using PacBio continuous long reads (CLR) combined with Hi-C data. The final assembly spans approximately 1.096 Gb in 1,968 contigs, with a contig N50 of 32.246 Mb, and BUSCO assessed it as 95.60% complete. Hi-C scaffolding ordered and oriented 329 contigs into 33 chromosome sequences, which range from 2.08 Mb to 152.13 Mb in length and cover 95.85% of the total genome sequence. Repetitive sequences account for 144.91 Mb of the genome. We identified 381 tRNAs, 507 non-coding RNAs (ncRNAs), and 205 rRNAs, along with 17,108 protein-coding genes encoding 29,473 proteins that together comprise 17,236,726 amino acids. This high-quality assembly provides a genomic foundation for studies of evolutionary genetics, phylogenomics, and the molecular mechanisms of adaptation, all of which are key to understanding biodiversity and species resilience in changing environments.
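For readers less familiar with the assembly statistics quoted above, the following minimal Python sketch shows how a contig N50 is conventionally computed and derives a few ratios from the reported totals. The function name `n50`, the toy contig lengths, and the derived ratios are illustrative assumptions; the ratios are not values reported by the authors, and the 1.096 Gb assembly size is itself approximate.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together cover at least half of the total assembly length.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("lengths must be non-empty")


# Toy contig lengths (hypothetical, not from the P. humilis assembly).
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
toy_n50 = n50(toy_contigs)

# Totals quoted in the abstract.
assembly_mb = 1096.0          # approximately 1.096 Gb
repeat_mb = 144.91
protein_coding_genes = 17_108
proteins = 29_473
amino_acids = 17_236_726

# Derived ratios (our arithmetic, not figures reported by the authors).
repeat_fraction = repeat_mb / assembly_mb
mean_protein_length = amino_acids / proteins
proteins_per_gene = proteins / protein_coding_genes

print(f"toy N50: {toy_n50}")
print(f"repeat fraction: {repeat_fraction:.1%}")
print(f"mean protein length: {mean_protein_length:.1f} aa")
print(f"proteins per gene: {proteins_per_gene:.2f}")
```

Under these assumptions, repeats make up roughly 13% of the assembly, and the annotation averages about 585 amino acids per protein and about 1.7 proteins per gene.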