Conservation Science Wildlife Health, San Diego Zoo Wildlife Alliance, Escondido, CA, USA.
The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.
Genome Biol Evol. 2022 Aug 3;14(8). doi: 10.1093/gbe/evac122.
High-quality reference genomes are fundamental tools for understanding population history and can provide estimates of genetic and demographic parameters relevant to the conservation of biodiversity. The federally endangered Pacific pocket mouse (PPM), which persists in three small, isolated populations in southern California, is a promising model for studying how demographic history shapes genetic diversity and how diversity, in turn, may influence extinction risk. To facilitate these studies in PPM, we combined PacBio HiFi long reads with Omni-C and Hi-C data to generate a de novo genome assembly, and annotated the genome using RNA-seq. The assembly comprised 28 chromosome-length scaffolds (N50 = 72.6 Mb) and the complete mitochondrial genome, and included a long heterochromatic region on chromosome 18 that was not represented in the previously available short-read assembly. Heterozygosity was highly variable across the genome of the reference individual, with 18% of windows falling in runs of homozygosity (ROH) >1 Mb and nearly 9% in tracts spanning >5 Mb. Yet outside of ROH, heterozygosity was relatively high (0.0027), and historical estimates of effective population size (Ne) were large. These patterns of genetic variation suggest recent inbreeding in a formerly large population. Currently the most contiguous assembly for a heteromyid rodent, this reference genome provides insight into the past and recent demographic history of the population, and will be a critical tool for management and for future studies of outbreeding depression, inbreeding depression, and genetic load.
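Two of the summary statistics quoted in the abstract, the scaffold N50 and the share of genomic windows falling in long runs of homozygosity, can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, the fixed window size, and the heterozygosity cutoff used to call a window "homozygous" are all hypothetical choices made for illustration, and the input values are toy numbers rather than data from the study.

```python
from typing import Sequence


def n50(lengths: Sequence[int]) -> int:
    """Return the length L such that scaffolds of length >= L
    together cover at least half of the total assembly length."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("no scaffold lengths given")


def fraction_in_long_roh(
    het: Sequence[float],
    window_bp: int,
    min_roh_bp: int,
    het_threshold: float,
) -> float:
    """Fraction of windows on ONE scaffold that lie in runs of consecutive
    low-heterozygosity windows spanning more than min_roh_bp.

    het_threshold is an assumed cutoff, not a value from the paper.
    """
    if not het:
        return 0.0
    windows_in_roh = 0
    run = 0
    # A sentinel value above any threshold closes a run that reaches the end.
    for h in list(het) + [float("inf")]:
        if h <= het_threshold:
            run += 1
        else:
            if run * window_bp > min_roh_bp:
                windows_in_roh += run
            run = 0
    return windows_in_roh / len(het)


# Toy inputs (hypothetical, not from the study).
toy_scaffolds = [100, 80, 60, 40, 20]
toy_het = [0.0, 0.0, 0.0, 0.003, 0.0, 0.002, 0.0, 0.0, 0.0, 0.0]

print(n50(toy_scaffolds))
print(fraction_in_long_roh(toy_het, window_bp=500_000,
                           min_roh_bp=1_000_000, het_threshold=0.0005))
```

With these toy inputs, the runs of three and four consecutive low-heterozygosity windows (1.5 Mb and 2 Mb) exceed the 1 Mb cutoff, while the isolated single window does not. A dedicated ROH caller would normally work from genotype calls rather than a simple per-window threshold, so the sketch only conveys the idea behind the reported percentages.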