Suppr超能文献

从全基因组关联研究中估计复杂疾病潜在的易感变异总数。

Estimating the total number of susceptibility variants underlying complex diseases from genome-wide association studies.

机构信息

Department of Psychiatry, University of Hong Kong, Hong Kong SAR, China.

出版信息

PLoS One. 2010 Nov 17;5(11):e13898. doi: 10.1371/journal.pone.0013898.

Abstract

Recently genome-wide association studies (GWAS) have identified numerous susceptibility variants for complex diseases. In this study we proposed several approaches to estimate the total number of variants underlying these diseases. We assume that the variance explained by genetic markers (Vg) follow an exponential distribution, which is justified by previous studies on theories of adaptation. Our aim is to fit the observed distribution of Vg from GWAS to its theoretical distribution. The number of variants is obtained by the heritability divided by the estimated mean of the exponential distribution. In practice, due to limited sample sizes, there is insufficient power to detect variants with small effects. Therefore the power was taken into account in fitting. Besides considering the most significant variants, we also tried to relax the significance threshold, allowing more markers to be fitted. The effects of false positive variants were removed by considering the local false discovery rates. In addition, we developed an alternative approach by directly fitting the z-statistics from GWAS to its theoretical distribution. In all cases, the "winner's curse" effect was corrected analytically. Confidence intervals were also derived. Simulations were performed to compare and verify the performance of different estimators (which incorporates various means of winner's curse correction) and the coverage of the proposed analytic confidence intervals. Our methodology only requires summary statistics and is able to handle both binary and continuous traits. Finally we applied the methods to a few real disease examples (lipid traits, type 2 diabetes and Crohn's disease) and estimated that hundreds to nearly a thousand variants underlie these traits.

摘要

最近,全基因组关联研究(GWAS)已经确定了许多复杂疾病的易感变异。在这项研究中,我们提出了几种方法来估计这些疾病背后的总变异数量。我们假设遗传标记解释的方差(Vg)遵循指数分布,这是先前关于适应理论研究的合理假设。我们的目的是将 GWAS 中观察到的 Vg 分布拟合到其理论分布。变异数量通过遗传力除以指数分布的估计均值获得。实际上,由于样本量有限,检测具有小效应的变异的能力不足。因此,在拟合过程中考虑了功效。除了考虑最显著的变异外,我们还试图放宽显著性阈值,允许拟合更多的标记。通过考虑局部假发现率来去除假阳性变异的影响。此外,我们还通过直接将 GWAS 的 z 统计量拟合到其理论分布来开发了一种替代方法。在所有情况下,都通过解析方式校正了“胜者诅咒”效应。还推导出了置信区间。进行了模拟以比较和验证不同估计器(包含各种胜者诅咒校正方法)的性能以及所提出的解析置信区间的覆盖范围。我们的方法仅需要汇总统计信息,并且能够处理二分类和连续性状。最后,我们将方法应用于一些实际疾病示例(脂质性状、2 型糖尿病和克罗恩病),并估计这些性状背后有数百到近千个变异。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32ee/2984437/a851c23e5230/pone.0013898.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验