Cao Caiyun, Miao Jian, Xie Qinqin, Sun Jiabao, Cheng Hong, Zhang Zhenyang, Wu Fen, Liu Shuang, Ye Xiaowei, Gong Huanfa, Zhang Zhe, Wang Qishan, Pan Yuchun, Wang Zhen
College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China.
Hainan Institute of Zhejiang University, Building 11, Yongyou Industrial Park, Yazhou Bay Science and Technology City, Yazhou District, Sanya 572025 Hainan, China.
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf048.
Pigs are crucial sources of meat and protein, valuable animal models, and potential donors for xenotransplantation. However, the existing pig reference genome is incomplete, with thousands of segments, as well as centromeres and telomeres, missing. This limits our understanding of important traits associated with these genomic regions.
We present a near-complete genome assembly for the Jinhua pig (JH-T2T) and provide a set of diploid Jinhua reference genomes, constructed using PacBio HiFi, ONT long reads, and Hi-C reads. This assembly includes all 18 autosomes and the X and Y sex chromosomes, with only 6 gaps. Its annotation comprises 46.90% repetitive sequences, 33 telomeres, 17 centromeres, and 23,924 high-confidence genes. Compared with Sscrofa11.1, JH-T2T closes nearly all gaps, extends the sequence by 177 Mb, predicts more intact telomeres and centromeres, gains 799 genes, and loses 114 genes. Moreover, it improves the mapping rate for both Western and Chinese local pigs, outperforming Sscrofa11.1 as a reference genome. Additionally, this comprehensive genome assembly will facilitate large-scale variant detection.
This study produced a near-gapless assembly of the pig genome and provides a set of haploid Jinhua reference genomes. Our findings represent a significant advance in pig genomics, providing a robust resource that enhances genetic research, breeding programs, and biomedical applications.