提高热带适应性肉牛的插补准确性：利用全基因组测序数据对婆罗门牛和内洛尔牛的应用

Improving imputation accuracy across tropically adapted beef cattle: an application for Brahman and Nellore using whole-genome sequencing data.

作者信息

Fernandes Júnior G A, Costilla R, Carvalheiro R, Hayes B, Ross E M, Oliveira H N, Albuquerque L G

机构信息

School of Agricultural and Veterinarian Sciences, UNESP, Jaboticabal, SP 14884-900, Brazil; Acarau Valley State University, UVA, Sobral, CE 62040-370, Brazil.

Cawthron Institute, Nelson 7010, New Zealand.

出版信息

Animal. 2025 Jul 16;19(9):101601. doi: 10.1016/j.animal.2025.101601.

DOI:10.1016/j.animal.2025.101601

PMID:40773784

Abstract

Combining information of different breeds is a cost-effective strategy to increase the size and genetic diversity of reference populations, which would improve imputation and/or genomic prediction accuracies in comparison with single-breed evaluations. Here, we have evaluated the impact of combining sequence information from two of the most relevant tropically adapted beef cattle breeds (Brahman and Nellore) on imputation accuracies to the sequence level. Whole-genome sequencing data of 279 (128 Brahman and 151 Nellore) animals were used in this study. Animals were chosen based on their contribution to the respective breed, attempting to reach high imputation accuracies by maximizing the genetic variability captured in the sequencing. Ten well-designed imputation scenarios from high-density single-nucleotide polymorphism (SNP) panel (∼777 K) to whole-genome sequence, implemented using the software FImpute3, were used to study different strategies to combine Brahman and Nellore sequencing data for imputation purposes. Animal and SNP imputation accuracies were assessed by the squared correlation between observed and imputed genotypes. The analysis of the genetic structure of the sequenced animals showed that Nellore and Brahman are genetically distinct cattle breeds with similar patterns of linkage disequilibrium. Compared to single-breed evaluations, the average imputation accuracy per animal improved from 0.89 to 0.91 in Brahman and from 0.94 to 0.96 in Nellore by utilizing a multibreed model. The overall average SNP-wise imputation accuracies were also improved (from 0.78 to 0.82 in Brahman and from 0.86 to 0.92 in Nellore) by combining sequence data from Nellore and Brahman, including a considerably better imputation for the known hard-to-impute genomic regions on chromosomes 5, 10, 12, 15, and 23. This study showed that higher accuracy of imputation to whole-genome sequencing can be achieved for both Brahman and Nellore using multibreed models in comparison to the standard single-breed evaluations, especially when restricting the analysis to a reference panel that is segregating in both breeds.

摘要

整合不同品种的信息是一种经济高效的策略，可增加参考群体的规模和遗传多样性，与单一品种评估相比，这将提高基因填充和/或基因组预测的准确性。在此，我们评估了整合两个最具代表性的热带适应性肉牛品种（婆罗门牛和内洛尔牛）的序列信息对基因填充到序列水平准确性的影响。本研究使用了279头动物（128头婆罗门牛和151头内洛尔牛）的全基因组测序数据。根据动物对各自品种的贡献进行选择，试图通过最大化测序中捕获的遗传变异来达到高基因填充准确性。使用软件FImpute3实施了从高密度单核苷酸多态性（SNP）芯片（约777K）到全基因组序列的10种精心设计的基因填充方案，以研究整合婆罗门牛和内洛尔牛测序数据用于基因填充目的的不同策略。通过观察到的和填充的基因型之间的平方相关性评估动物和SNP的基因填充准确性。对测序动物的遗传结构分析表明，内洛尔牛和婆罗门牛是遗传上不同的牛品种，具有相似的连锁不平衡模式。与单一品种评估相比，通过使用多品种模型，婆罗门牛每头动物的平均基因填充准确性从0.89提高到0.91，内洛尔牛从0.94提高到0.96。通过整合内洛尔牛和婆罗门牛的序列数据，总体平均SNP水平的基因填充准确性也得到了提高（婆罗门牛从0.78提高到0.82，内洛尔牛从0.86提高到0.92），包括对5号、10号、12号、15号和2号染色体上已知难以填充的基因组区域有显著更好的填充效果。这项研究表明，与标准的单一品种评估相比，使用多品种模型可以使婆罗门牛和内洛尔牛在全基因组测序的基因填充方面都获得更高的准确性，特别是当将分析限制在两个品种中都存在分离的参考面板时。