Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, Colorado; Department of Psychology and NeuroscienceUniversity of Colorado Boulder, Boulder, Colorado.
Department of Psychology, University of Texas at Austin, Austin, Texas; Population Research Center, University of Texas at Austin, Austin, Texas.
Biol Psychiatry. 2023 Jan 1;93(1):29-36. doi: 10.1016/j.biopsych.2022.05.029. Epub 2022 Jun 8.
Single nucleotide polymorphism-based heritability is a fundamental quantity in the genetic analysis of complex traits. For case-control phenotypes, for which the continuous distribution of risk in the population is unobserved, observed-scale heritability estimates must be transformed to the more interpretable liability scale. This article describes how the field standard approach incorrectly performs the liability correction in that it does not appropriately account for variation in the proportion of cases across the cohorts comprising the meta-analysis. We propose a simple solution that incorporates cohort-specific ascertainment using the summation of effective sample sizes across cohorts. This solution is applied at the stage of single nucleotide polymorphism-based heritability estimation and does not require generating updated meta-analytic genome-wide association study summary statistics.
We began by performing a series of simulations to examine the ability of the standard approach and our proposed approach to recapture liability-scale heritability in the population. We went on to examine the differences in estimates obtained from these 2 approaches for real data for 12 major case-control genome-wide association studies of psychiatric and neurologic traits.
We found that the field standard approach for performing the liability conversion can downwardly bias estimates by as much as approximately 50% in simulation and approximately 30% in real data.
Prior estimates of liability-scale heritability for genome-wide association study meta-analysis may be drastically underestimated. To this end, we strongly recommend using our proposed approach of using the sum of effective sample sizes across contributing cohorts to obtain unbiased estimates.
基于单核苷酸多态性的遗传力是复杂性状遗传分析的基本数量。对于病例对照表型,由于人群中风险的连续分布无法观察到,因此必须将观察到的遗传力估计值转换为更具解释性的易感性标度。本文描述了标准方法如何在易感性校正中不正确地执行,因为它没有适当考虑构成荟萃分析的队列中病例比例的变化。我们提出了一种简单的解决方案,该方案使用跨队列的有效样本量总和来进行队列特异性确定。该解决方案应用于基于单核苷酸多态性的遗传力估计阶段,不需要生成更新的全基因组关联研究汇总统计数据。
我们首先进行了一系列模拟,以检验标准方法和我们提出的方法在人群中重新捕获易感性遗传力的能力。然后,我们检查了这两种方法从 12 项主要的病例对照全基因组关联研究的精神和神经性状的真实数据中获得的估计值之间的差异。
我们发现,标准的易感性转换方法在模拟中可以将估计值向下偏倚多达约 50%,在真实数据中可以偏倚约 30%。
先前针对全基因组关联研究荟萃分析的易感性遗传力的估计可能被大大低估了。为此,我们强烈建议使用我们提出的方法,即使用跨贡献队列的有效样本量总和来获得无偏估计值。