Research Institute for Farm Animal Biology (FBN), Institute of Genetics and Biometry, Dummerstorf, Germany.
Division of Animal Sciences, Institute for Data Science and Informatics, 162 Animal Science Research Center, University of Missouri, Columbia, Missouri, USA.
Anim Genet. 2022 Aug;53(4):498-505. doi: 10.1111/age.13205. Epub 2022 May 1.
Creation of the bovine reference assembly paved the way to develop the high-throughput genotyping arrays of the single nucleotide polymorphisms (SNPs) based on the available map coordinates that facilitated major advances in gene mapping and selection programs. The assembly flaws, however, may cause false results in the downstream gene mapping studies. The most recent bovine reference genome (ARS-UCD1.2) is built on the long-read sequences that provides improved quality and continuity. By applying population genetic metrics in this study, we aimed to evaluate the map coordinates to which SNP markers were assigned. We employed a three-step approach by combining the recombination and linkage disequilibrium analyses to test if the markers fit into the assigned physical map coordinates. We applied the method to the bovine 50k array in a large pedigree of Holstein cattle and revealed a panel of 65 candidate markers, most of which were re-located either on a different chromosome or re-mapped as far as several million base pairs away on the same chromosome. This list of candidates accounts for 0.1% of the SNPs in the widely used 50k genotyping array and we foresee a reasonably larger set of markers being misplaced in the BovineHD 700K BeadChip. We suggest pre-removal of the candidate misplaced markers to reduce false signals in association mapping studies.
牛参考基因组的构建为基于可用图谱坐标开发基于单核苷酸多态性 (SNP) 的高通量基因分型阵列铺平了道路,这极大地促进了基因图谱和选择计划的发展。然而,组装缺陷可能会导致下游基因图谱研究出现错误结果。最新的牛参考基因组 (ARS-UCD1.2) 是基于长读序列构建的,这提供了更高的质量和连续性。通过在本研究中应用群体遗传指标,我们旨在评估 SNP 标记被分配到的图谱坐标。我们采用了一种三步法,结合重组和连锁不平衡分析,以测试标记是否符合指定的物理图谱坐标。我们将该方法应用于荷斯坦牛的大型家系中的牛 50k 阵列,揭示了 65 个候选标记,其中大多数被重新定位到不同的染色体上,或者在同一染色体上重新映射到几百万个碱基对之外。这组候选标记占广泛使用的 50k 基因分型阵列中 SNP 的 0.1%,我们预计在 BovineHD 700K BeadChip 中有更多的标记被错误定位。我们建议预先去除候选错误定位的标记,以减少关联图谱研究中的假信号。