Comparison of statistical tests for association between rare variants and binary traits.

Author information

Quantitative Sciences, GlaxoSmithKline, Research Triangle Park, North Carolina, United States of America.

Publication information

PLoS One. 2012;7(8):e42530. doi: 10.1371/journal.pone.0042530. Epub 2012 Aug 9.

DOI:10.1371/journal.pone.0042530

PMID:22912707

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3415421/

Abstract

Genome-wide association studies have found thousands of common genetic variants associated with a wide variety of diseases and other complex traits. However, a large portion of the predicted genetic contribution to many traits remains unexplained. One plausible explanation is that some of the missing variation is due to the effects of rare variants. The statistical analysis of rare variants is nonetheless challenging. A commonly used method is to contrast, within the same region (gene), the frequency of minor alleles at rare variants between cases and controls. This strategy is most useful under the assumption that the tested variants have similar effects. We previously proposed a method that can accommodate heterogeneous effects in the analysis of quantitative traits. Here we extend this method to binary traits and to the inclusion of covariates. We use simulations over a variety of causal and covariate impact scenarios to compare the performance of the proposed method with standard logistic regression, C-alpha, SKAT, and EREC. We found that (i) logistic regression methods perform well when the heterogeneity of the effects is not extreme and (ii) SKAT and EREC perform well under all tested scenarios but can be computationally intensive. Consequently, a two-step strategy is computationally preferable: (i) select promising genes with faster methods, then (ii) analyze the selected genes with SKAT/EREC. To select promising genes one can use (1) regression methods when effect heterogeneity is assumed to be low and the covariates explain a non-negligible part of trait variability, (2) C-alpha when heterogeneity is assumed to be large and the covariates explain a small fraction of trait variability, and (3) the proposed trend and heterogeneity test when heterogeneity is assumed to be non-trivial and the covariates explain a large fraction of trait variability.
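
The "commonly used method" mentioned in the abstract, contrasting rare minor alleles between cases and controls within a gene, and the screening step of the two-step strategy can be illustrated with a minimal sketch. This is not the authors' method or code. It assumes a simple carrier-collapsing rule (a subject counts as a carrier if any rare variant in the gene has a minor allele) and a two-sided Fisher exact test, and it ignores covariates. The function names, the data layout, and the screening threshold are all hypothetical, and the follow-up analysis with SKAT/EREC is deliberately left out.

```python
from math import comb


def carrier_counts(genotypes, is_case):
    """Count carriers of at least one rare minor allele in cases and controls.

    genotypes: one list per subject holding minor-allele counts (0, 1 or 2),
               one entry per rare variant in the gene (assumed layout).
    is_case:   one boolean per subject, True for cases.
    """
    case_carriers = control_carriers = n_cases = n_controls = 0
    for subject_genotype, case in zip(genotypes, is_case):
        carrier = int(any(count > 0 for count in subject_genotype))
        if case:
            n_cases += 1
            case_carriers += carrier
        else:
            n_controls += 1
            control_carriers += carrier
    return case_carriers, n_cases, control_carriers, n_controls


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    total = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x carriers in the first row.
        return comb(row1, x) * comb(row2, col1 - x) / total

    lowest, highest = max(0, col1 - row2), min(row1, col1)
    p_observed = prob(a)
    # Sum every table that is no more probable than the observed one.
    p_value = sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-9)
    )
    return min(1.0, p_value)


def burden_pvalue(genotypes, is_case):
    """Collapse a gene to carrier status and contrast cases with controls."""
    case_carriers, n_cases, control_carriers, n_controls = carrier_counts(
        genotypes, is_case
    )
    return fisher_exact_two_sided(
        case_carriers,
        n_cases - case_carriers,
        control_carriers,
        n_controls - control_carriers,
    )


def screen_genes(gene_genotypes, is_case, threshold=0.1):
    """Step 1 of a two-step strategy: keep genes passing a lenient threshold.

    The threshold is an arbitrary illustrative value. Genes returned here
    would then go to a slower test such as SKAT or EREC (not implemented).
    """
    return [
        gene
        for gene, genotypes in gene_genotypes.items()
        if burden_pvalue(genotypes, is_case) <= threshold
    ]


# Toy data: 10 cases followed by 10 controls, 3 rare variants per gene.
status = [True] * 10 + [False] * 10
toy_genes = {
    "GENE_A": [[1, 0, 0]] * 6 + [[0, 0, 0]] * 4 + [[0, 1, 0]] * 1 + [[0, 0, 0]] * 9,
    "GENE_B": [[0, 0, 1]] * 3 + [[0, 0, 0]] * 7 + [[1, 0, 0]] * 3 + [[0, 0, 0]] * 7,
}
print(screen_genes(toy_genes, status))
```

Collapsing to carrier status treats every rare variant as if it acted in the same direction, which is the similar-effects assumption the abstract warns about. When risk and protective variants sit in the same gene their signals can cancel in a contrast like this one, which is the situation the heterogeneity-aware tests named above are meant to handle.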
