State Key Laboratory of Grassland Agro-Ecosystems, School of Life Sciences, Lanzhou University, Lanzhou, China.
Research Center for Ecology and Environmental Sciences, Northwestern Polytechnical University, Xi'an, China.
Sci Data. 2020 Feb 24;7(1):66. doi: 10.1038/s41597-020-0400-3.
Vulnerable populations of wild yak (Bos mutus), the wild ancestor of the domestic yak, survive in the extremely cold, harsh and oxygen-poor regions of the Qinghai-Tibetan Plateau (QTP) and adjacent high-altitude regions. In this study, we sequenced its genome and assembled it de novo. Six libraries with different insert sizes were sequenced, generating 662 Gb of clean data. The assembled wild yak genome is 2.83 Gb in length, with a contig N50 of 63.2 kb and a scaffold N50 of 16.3 Mb. BUSCO assessment indicated that 93.8% of the highly conserved mammalian genes were completely present in the genome assembly. Annotation of the assembly identified 1.41 Gb (49.65%) of repetitive sequences and a total of 22,910 protein-coding genes, of which 20,660 (90.18%) were annotated with functional terms. This first assembly of the wild yak genome provides a valuable genetic resource that will facilitate further study of the genetic diversity of bovine species and accelerate yak breeding efforts.
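For readers unfamiliar with the N50 statistic quoted above, the following is a minimal sketch of how it is commonly computed from a list of contig or scaffold lengths: sort the lengths from longest to shortest and report the length at which the running total first reaches half of the assembly size. The function name `n50` and the toy lengths are illustrative assumptions, not taken from the study, and this is not the authors' pipeline.

```python
from typing import Iterable


def n50(lengths: Iterable[int]) -> int:
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    cover at least half of the total assembly length.
    """
    ordered = sorted(lengths, reverse=True)  # longest first
    if not ordered:
        raise ValueError("at least one sequence length is required")

    total = sum(ordered)
    running = 0
    for length in ordered:
        running += length
        # Compare 2 * running with total to avoid fractional halves.
        if 2 * running >= total:
            return length
    raise AssertionError("unreachable for non-empty input")


# Hypothetical toy assembly (lengths in kb), for illustration only.
toy_lengths = [80, 70, 50, 40, 30, 20]
print(n50(toy_lengths))
```

Applied to contig lengths this gives the contig N50, and applied to scaffold lengths it gives the scaffold N50; the two values differ because scaffolds join contigs across gaps.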