Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea.
Department of Agricultural Biotechnology and Research Institute of Agriculture and Life Sciences, College of Agriculture and Life Sciences, Seoul National University, Seoul, Korea.
Sci Data. 2023 Aug 23;10(1):560. doi: 10.1038/s41597-023-02453-z.
This study presents the first chromosome-level genome assembly of Hanwoo, an indigenous Korean breed of Bos taurus taurus. It is also the first genome assembly of an Asian taurine breed. In addition, we constructed a pangenome graph of 14 B. taurus genome assemblies. The contig N50 exceeded 55 Mb and the scaffold N50 exceeded 89 Mb; together with a genome completeness of 95.8%, estimated by BUSCO using the mammalian gene set, these metrics indicate a high-quality assembly. Repetitive elements made up 48.7% of the genome, including DNA transposons, tandem repeats, long interspersed nuclear elements, and simple repeats. A total of 27,314 protein-coding genes were identified, of which 25,302 have inferred gene names and 2,012 encode unknown proteins. The pangenome graph of the autosomes of the 14 B. taurus assemblies revealed 528.47 Mb of non-reference regions in total and 61.87 Mb of Hanwoo-specific regions. Our Hanwoo assembly and pangenome graph provide valuable resources for studying B. taurus populations.
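For readers less familiar with the contiguity metric quoted above, N50 is the length of the sequence at which the cumulative length, counted from the longest contig or scaffold downward, first reaches half of the total assembly length. The following is a minimal sketch of that definition; the function name `n50` and the toy lengths are illustrative assumptions and are not taken from the Hanwoo assembly or from the authors' pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together cover at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence downward, accumulating length.
    for length in sorted(lengths, reverse=True):
        running += length
        # Stop at the first sequence that brings us to half the total.
        if 2 * running >= total:
            return length


# Hypothetical contig lengths in Mb (toy data, not the Hanwoo contigs).
toy_contigs_mb = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs_mb))
```

Because the metric depends only on the length distribution, the same function applies to contigs and scaffolds alike; a larger value means that fewer, longer sequences account for half of the assembly.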