全基因组关联研究荟萃分析中 Liability-scale 遗传力估计的普遍向下偏差：一个简单的解决方案。

Pervasive Downward Bias in Estimates of Liability-Scale Heritability in Genome-wide Association Study Meta-analysis: A Simple Solution.

机构信息

Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, Colorado; Department of Psychology and NeuroscienceUniversity of Colorado Boulder, Boulder, Colorado.

Department of Psychology, University of Texas at Austin, Austin, Texas; Population Research Center, University of Texas at Austin, Austin, Texas.

出版信息

Biol Psychiatry. 2023 Jan 1;93(1):29-36. doi: 10.1016/j.biopsych.2022.05.029. Epub 2022 Jun 8.

DOI:10.1016/j.biopsych.2022.05.029

PMID:35973856

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10066905/

Abstract

BACKGROUND

Single nucleotide polymorphism-based heritability is a fundamental quantity in the genetic analysis of complex traits. For case-control phenotypes, for which the continuous distribution of risk in the population is unobserved, observed-scale heritability estimates must be transformed to the more interpretable liability scale. This article describes how the field standard approach incorrectly performs the liability correction in that it does not appropriately account for variation in the proportion of cases across the cohorts comprising the meta-analysis. We propose a simple solution that incorporates cohort-specific ascertainment using the summation of effective sample sizes across cohorts. This solution is applied at the stage of single nucleotide polymorphism-based heritability estimation and does not require generating updated meta-analytic genome-wide association study summary statistics.

METHODS

We began by performing a series of simulations to examine the ability of the standard approach and our proposed approach to recapture liability-scale heritability in the population. We went on to examine the differences in estimates obtained from these 2 approaches for real data for 12 major case-control genome-wide association studies of psychiatric and neurologic traits.

RESULTS

We found that the field standard approach for performing the liability conversion can downwardly bias estimates by as much as approximately 50% in simulation and approximately 30% in real data.

CONCLUSIONS

Prior estimates of liability-scale heritability for genome-wide association study meta-analysis may be drastically underestimated. To this end, we strongly recommend using our proposed approach of using the sum of effective sample sizes across contributing cohorts to obtain unbiased estimates.

摘要

背景

基于单核苷酸多态性的遗传力是复杂性状遗传分析的基本数量。对于病例对照表型，由于人群中风险的连续分布无法观察到，因此必须将观察到的遗传力估计值转换为更具解释性的易感性标度。本文描述了标准方法如何在易感性校正中不正确地执行，因为它没有适当考虑构成荟萃分析的队列中病例比例的变化。我们提出了一种简单的解决方案，该方案使用跨队列的有效样本量总和来进行队列特异性确定。该解决方案应用于基于单核苷酸多态性的遗传力估计阶段，不需要生成更新的全基因组关联研究汇总统计数据。

方法

我们首先进行了一系列模拟，以检验标准方法和我们提出的方法在人群中重新捕获易感性遗传力的能力。然后，我们检查了这两种方法从 12 项主要的病例对照全基因组关联研究的精神和神经性状的真实数据中获得的估计值之间的差异。

结果

我们发现，标准的易感性转换方法在模拟中可以将估计值向下偏倚多达约 50%，在真实数据中可以偏倚约 30%。

结论

先前针对全基因组关联研究荟萃分析的易感性遗传力的估计可能被大大低估了。为此，我们强烈建议使用我们提出的方法，即使用跨贡献队列的有效样本量总和来获得无偏估计值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7983/10066905/b4eac9b06774/nihms-1881684-f0001.jpg

相似文献

Pervasive Downward Bias in Estimates of Liability-Scale Heritability in Genome-wide Association Study Meta-analysis: A Simple Solution.全基因组关联研究荟萃分析中 Liability-scale 遗传力估计的普遍向下偏差：一个简单的解决方案。

Biol Psychiatry. 2023 Jan 1;93(1):29-36. doi: 10.1016/j.biopsych.2022.05.029. Epub 2022 Jun 8.

SumVg: Total Heritability Explained by All Variants in Genome-Wide Association Studies Based on Summary Statistics with Standard Error Estimates.SumVg：基于具有标准误差估计的汇总统计数据的全基因组关联研究中所有变异解释的总遗传力。

Int J Mol Sci. 2024 Jan 22;25(2):1347. doi: 10.3390/ijms25021347.

Accurate estimation of SNP-heritability from biobank-scale data irrespective of genetic architecture.从生物库规模数据中准确估计 SNP 遗传力，与遗传结构无关。

Nat Genet. 2019 Aug;51(8):1244-1251. doi: 10.1038/s41588-019-0465-0. Epub 2019 Jul 29.

How meaningful are heritability estimates of liability?易感性的遗传力估计值有多大意义？

Hum Genet. 2013 Dec;132(12):1351-60. doi: 10.1007/s00439-013-1334-z. Epub 2013 Jul 19.

Leveraging correlations between variants in polygenic risk scores to detect heterogeneity in GWAS cohorts.利用多基因风险评分中变异的相关性来检测 GWAS 队列中的异质性。

PLoS Genet. 2020 Sep 21;16(9):e1009015. doi: 10.1371/journal.pgen.1009015. eCollection 2020 Sep.

Identity-by-descent-based heritability analysis in the Northern Finland Birth Cohort.基于亲缘关系的遗传力分析在芬兰北部出生队列中。

Hum Genet. 2013 Feb;132(2):129-38. doi: 10.1007/s00439-012-1230-y. Epub 2012 Sep 29.

Estimating SNP heritability in presence of population substructure in biobank-scale datasets.在生物库规模数据集存在群体亚结构的情况下估计 SNP 遗传力。

Genetics. 2022 Apr 4;220(4). doi: 10.1093/genetics/iyac015.

Reevaluation of SNP heritability in complex human traits.复杂人类性状中SNP遗传力的重新评估。

Nat Genet. 2017 Jul;49(7):986-992. doi: 10.1038/ng.3865. Epub 2017 May 22.

Integrated analysis of direct and proxy genome wide association studies highlights polygenicity of Alzheimer's disease outside of the APOE region.直接和间接全基因组关联研究的综合分析突出了 APOE 区域外阿尔茨海默病的多基因性。

PLoS Genet. 2022 Jun 3;18(6):e1010208. doi: 10.1371/journal.pgen.1010208. eCollection 2022 Jun.

Meta-analysis of Genome-wide Association Studies for Neuroticism, and the Polygenic Association With Major Depressive Disorder.神经质的全基因组关联研究的荟萃分析以及与重度抑郁症的多基因关联

JAMA Psychiatry. 2015 Jul;72(7):642-50. doi: 10.1001/jamapsychiatry.2015.0554.

引用本文的文献

Age-specific childhood obesity and adult cholelithiasis: association and shared transcriptomic bases.特定年龄段儿童肥胖与成人胆石症：关联及共享的转录组学基础

Int J Obes (Lond). 2025 Aug 22. doi: 10.1038/s41366-025-01877-4.

Genome-wide association study of borderline personality disorder identifies 11 loci and highlights shared risk with mental and somatic disorders.边缘型人格障碍的全基因组关联研究确定了11个基因座，并突出了与精神和躯体疾病的共同风险。

medRxiv. 2025 Aug 12:2024.11.12.24316957. doi: 10.1101/2024.11.12.24316957.

Examining the genetic links between clusters of immune-mediated diseases and psychiatric disorders.研究免疫介导疾病集群与精神疾病之间的遗传联系。

Transl Psychiatry. 2025 Jul 21;15(1):252. doi: 10.1038/s41398-025-03470-9.

PGSFusion streamlines polygenic score construction and epidemiological applications in biobank-scale cohorts.PGSFusion简化了生物样本库规模队列中的多基因评分构建和流行病学应用。

Genome Med. 2025 Jul 14;17(1):77. doi: 10.1186/s13073-025-01505-w.

The Landscape of Shared and Divergent Genetic Influences across 14 Psychiatric Disorders.14种精神疾病中共同和不同遗传影响的情况

medRxiv. 2025 Jan 15:2025.01.14.25320574. doi: 10.1101/2025.01.14.25320574.

Protocol for finding genetic variation associated with unmeasured traits through GenomicSEM common-factor GWAS.通过基因组结构方程模型共同因素全基因组关联研究寻找与未测量性状相关的遗传变异的方案。

STAR Protoc. 2025 Jun 17;6(3):103905. doi: 10.1016/j.xpro.2025.103905.

Evaluating metabolome-wide causal effects on risk for psychiatric and neurodegenerative disorders.评估代谢组范围内对精神疾病和神经退行性疾病风险的因果效应。

BMC Med. 2025 Jun 2;23(1):326. doi: 10.1186/s12916-025-04129-4.

Genome-wide association studies of binge eating behaviour and anorexia nervosa yield insights into the unique and shared biology of eating disorder phenotypes.暴饮暴食行为和神经性厌食症的全基因组关联研究揭示了饮食失调表型的独特和共同生物学机制。

medRxiv. 2025 May 8:2025.01.31.25321397. doi: 10.1101/2025.01.31.25321397.

Polygenic risk score prediction accuracy convergence.多基因风险评分预测准确性的收敛性。

HGG Adv. 2025 May 14;6(3):100457. doi: 10.1016/j.xhgg.2025.100457.

Genome-wide analyses identify 30 loci associated with obsessive-compulsive disorder.全基因组分析确定了30个与强迫症相关的基因座。

Nat Genet. 2025 May 13. doi: 10.1038/s41588-025-02189-z.

本文引用的文献

Identifying and correcting for misspecifications in GWAS summary statistics and polygenic scores.识别并校正全基因组关联研究汇总统计数据和多基因评分中的错误设定。

HGG Adv. 2022 Aug 18;3(4):100136. doi: 10.1016/j.xhgg.2022.100136. eCollection 2022 Oct 13.

Multivariate GWAS of psychiatric disorders and their cardinal symptoms reveal two dimensions of cross-cutting genetic liabilities.精神疾病及其主要症状的多变量全基因组关联研究揭示了交叉遗传易感性的两个维度。

Cell Genom. 2022 Jun 8;2(6). doi: 10.1016/j.xgen.2022.100140.

Mapping genomic loci implicates genes and synaptic biology in schizophrenia.基因组定位研究提示精神分裂症的发病与基因及突触生物学有关。

Nature. 2022 Apr;604(7906):502-508. doi: 10.1038/s41586-022-04434-5. Epub 2022 Apr 8.

Genome-wide association study of more than 40,000 bipolar disorder cases provides new insights into the underlying biology.对超过 40000 例双相情感障碍病例的全基因组关联研究为其潜在生物学机制提供了新的见解。

Nat Genet. 2021 Jun;53(6):817-829. doi: 10.1038/s41588-021-00857-4. Epub 2021 May 17.

Systematic Review: Molecular Studies of Common Genetic Variation in Child and Adolescent Psychiatric Disorders.系统综述：儿童和青少年精神障碍常见遗传变异的分子研究。

J Am Acad Child Adolesc Psychiatry. 2022 Feb;61(2):227-242. doi: 10.1016/j.jaac.2021.03.020. Epub 2021 Apr 28.

A large-scale genome-wide association study meta-analysis of cannabis use disorder.一项大麻使用障碍的大规模全基因组关联研究荟萃分析。

Lancet Psychiatry. 2020 Dec;7(12):1032-1045. doi: 10.1016/S2215-0366(20)30339-4. Epub 2020 Oct 20.

International meta-analysis of PTSD genome-wide association studies identifies sex- and ancestry-specific genetic risk loci.国际 PTSD 全基因组关联研究的荟萃分析确定了性别和祖先特异性的遗传风险位点。

Nat Commun. 2019 Oct 8;10(1):4558. doi: 10.1038/s41467-019-12576-w.

Genome-wide association study identifies eight risk loci and implicates metabo-psychiatric origins for anorexia nervosa.全基因组关联研究确定了 8 个风险位点，并提示神经性厌食症与代谢-精神起源有关。

Nat Genet. 2019 Aug;51(8):1207-1214. doi: 10.1038/s41588-019-0439-2. Epub 2019 Jul 15.

Genomic structural equation modelling provides insights into the multivariate genetic architecture of complex traits.基因组结构方程模型为复杂性状的多变量遗传结构提供了深入的了解。

Nat Hum Behav. 2019 May;3(5):513-525. doi: 10.1038/s41562-019-0566-x. Epub 2019 Apr 8.

Genetic meta-analysis of diagnosed Alzheimer's disease identifies new risk loci and implicates Aβ, tau, immunity and lipid processing.基于诊断的阿尔茨海默病的全基因组关联荟萃分析鉴定出新的风险位点，并提示 Aβ、tau、免疫和脂类代谢过程的作用。

Nat Genet. 2019 Mar;51(3):414-430. doi: 10.1038/s41588-019-0358-2. Epub 2019 Feb 28.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验