Zhu Huanhuan, Zhou Xiang
Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA.
Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA.
Comput Struct Biotechnol J. 2020 Jun 18;18:1557-1568. doi: 10.1016/j.csbj.2020.06.011. eCollection 2020.
In GWAS studies, SNP heritability measures the proportion of phenotypic variance explained by all measured SNPs. Accurate estimation of SNP heritability can help us better understand the degree to which measured genetic variants influence phenotypes. Over the last decade, a variety of statistical methods and software tools have been developed for SNP heritability estimation with different data types including genotype array data, imputed genotype data, whole-genome sequencing data, RNA sequencing data, and bisulfite sequencing data. However, a thorough technical review of these methods, especially from a statistical and computational viewpoint, is currently missing. To fill this knowledge gap, we present a comprehensive review on a broad category of recently developed and commonly used SNP heritability estimation methods. We focus on their modeling assumptions; their interconnected relationships; their applicability to quantitative, binary and count phenotypes; their use of individual level data versus summary statistics, as well as their utility for SNP heritability partitioning. We hope that this review will serve as a useful reference for both methodologists who develop heritability estimation methods and practitioners who perform heritability analysis.
在全基因组关联研究(GWAS)中,单核苷酸多态性(SNP)遗传力衡量的是所有测得的SNP所解释的表型变异比例。准确估计SNP遗传力有助于我们更好地理解测得的基因变异对表型的影响程度。在过去十年中,已经开发了多种统计方法和软件工具,用于利用不同类型的数据(包括基因型阵列数据、推算基因型数据、全基因组测序数据、RNA测序数据和亚硫酸氢盐测序数据)来估计SNP遗传力。然而,目前缺少对这些方法的全面技术综述,尤其是从统计和计算角度的综述。为了填补这一知识空白,我们对最近开发的和常用的一大类SNP遗传力估计方法进行了全面综述。我们关注它们的建模假设;它们之间的相互关系;它们对定量、二元和计数表型的适用性;它们对个体水平数据与汇总统计量的使用,以及它们在SNP遗传力划分方面的效用。我们希望这篇综述能为开发遗传力估计方法的方法学家和进行遗传力分析的从业者提供有用的参考。