INRES Pflanzenzüchtung, University of Bonn, 53115, Bonn, Germany.
Department of Mathematical Sciences and Biocenter Oulu, University of Oulu, FIN-90014, Oulu, Finland.
Heredity (Edinb). 2018 Apr;120(4):356-368. doi: 10.1038/s41437-017-0023-4. Epub 2017 Dec 14.
Single nucleotide polymorphism (SNP)-heritability estimation is an important topic in several research fields, including animal, plant and human genetics, as well as in ecology. Linear mixed model estimation of SNP-heritability uses the structures of genomic relationships between individuals, which is constructed from genome-wide sets of SNP-markers that are generally weighted equally in their contributions. Proposed methods to handle dependence between SNPs include, "thinning" the marker set by linkage disequilibrium (LD)-pruning, the use of haplotype-tagging of SNPs, and LD-weighting of the SNP-contributions. For improved estimation, we propose a new conceptual framework for genomic relationship matrix, in which Mahalanobis distance-based LD-correction is used in a linear mixed model estimation of SNP-heritability. The superiority of the presented method is illustrated and compared to mixed-model analyses using a VanRaden genomic relationship matrix, a matrix used by GCTA and a matrix employing LD-weighting (as implemented in the LDAK software) in simulated (using real human, rice and cattle genotypes) and real (maize, rice and mice) datasets. Despite of the computational difficulties, our results suggest that by using the proposed method one can improve the accuracy of SNP-heritability estimates in datasets with high LD.
单核苷酸多态性(SNP)遗传力估计是动物学、植物学和人类遗传学以及生态学等多个研究领域的重要课题。基于个体基因组关系的线性混合模型 SNP 遗传力估计,是利用全基因组 SNP 标记之间的结构构建的,这些标记在贡献方面通常是平等加权的。处理 SNP 之间相关性的方法包括通过连锁不平衡(LD)修剪来“稀疏”标记集,使用 SNP 单倍型标记,以及对 SNP 贡献进行 LD 加权。为了提高估计的准确性,我们提出了一个新的基因组关系矩阵的概念框架,其中基于 Mahalanobis 距离的 LD 校正用于 SNP 遗传力的线性混合模型估计。本文提出的方法在模拟(使用真实的人类、水稻和牛基因型)和真实(玉米、水稻和老鼠)数据集上进行了实例化,与使用 VanRaden 基因组关系矩阵、GCTA 中使用的矩阵和采用 LD 加权(如 LDAK 软件中实现的)的混合模型分析进行了比较,展示并说明了该方法的优越性。尽管存在计算上的困难,但我们的结果表明,通过使用所提出的方法,在 LD 较高的数据集可以提高 SNP 遗传力估计的准确性。