Department of Physiology, University of Tartu, 19 Ravila Street, 50411 Tartu, Estonia.
Mamm Genome. 2013 Aug;24(7-8):309-21. doi: 10.1007/s00335-013-9464-0. Epub 2013 Jul 27.
The aim of our study was to create a high-quality Holstein cow genome reference sequence and describe the different types of variations in this genome compared to the reference Hereford breed. We generated one fragment and three mate-paired libraries from genomic DNA. Raw files were mapped and paired to the reference cow (Bos taurus) genome assemblies bosTau6/UMD_3.1. BioScope (v1.3) software was used for mapping and variant analysis. Initial sequencing resulted in 2,842,744,008 of 50-bp reads. Average mapping efficiency was 78.4 % and altogether 2,168,425,497 reads and 98,022,357,422 bp were successfully mapped, resulting in 36.7X coverage. Tertiary analysis found 5,923,230 SNPs in the bovine genome, of which 3,833,249 were heterozygous and 2,089,981 were homozygous variants. Annotation revealed that 4,241,000 of all discovered SNPs were annotated in the dbSNP database and 1,682,230 SNPs were considered as novel. Large indel variations accounted for 48,537,190 bp of the entire genome and there were 138,504 of them. The largest deletion was 18,594 bp and the largest insertion was 13,498 bp. Another group of variants, small indels (n = 458,061), accounted for the total variation of 1,839,872 nucleotides in the genome. Only 92,115 small indels were listed in the dbSNP and therefore 365,946 small indels were novel. Finally, we identified 1,876 inversions in the bovine genome. In conclusion, this is another description of the Holstein cow genome and, similar to previous studies, we found a large amount of novel variations. Better knowledge of these variations could explain significant phenotypic differences (e.g., health, production, reproduction) between different breeds.
我们的研究目的是创建一个高质量的荷斯坦奶牛基因组参考序列,并描述与参考海福特品种相比,该基因组中的不同类型变异。我们从基因组 DNA 生成了一个片段和三个 mate-paired 文库。原始文件被映射并与参考牛(Bos taurus)基因组组装 bosTau6/UMD_3.1 进行比对。使用 BioScope(v1.3)软件进行映射和变异分析。初始测序得到了 50-bp 读数的 2,842,744,008 个。平均映射效率为 78.4%,总共成功映射了 2,168,425,497 个读数和 98,022,357,422 bp,覆盖率为 36.7X。三级分析在牛基因组中发现了 5,923,230 个 SNPs,其中 3,833,249 个为杂合子,2,089,981 个为纯合子变体。注释表明,所有发现的 SNPs 中有 4,241,000 个被 dbSNP 数据库注释,1,682,230 个 SNPs 被认为是新的。大的插入缺失变异占整个基因组的 48,537,190 bp,其中有 138,504 个。最大的缺失为 18,594 bp,最大的插入为 13,498 bp。另一组变体,小插入缺失(n=458,061),占基因组中总变异的 1,839,872 个核苷酸。仅在 dbSNP 中列出了 92,115 个小插入缺失,因此 365,946 个小插入缺失是新的。最后,我们在牛基因组中鉴定了 1,876 个倒位。总之,这是对荷斯坦奶牛基因组的另一种描述,与之前的研究类似,我们发现了大量的新变异。更好地了解这些变异可以解释不同品种之间的显著表型差异(例如健康、生产、繁殖)。