• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

稀有变异与二项性状关联性的统计检验比较。

Comparison of statistical tests for association between rare variants and binary traits.

机构信息

Quantitative Sciences, GlaxoSmithKline, Research Triangle Park, North Carolina, United States of America.

出版信息

PLoS One. 2012;7(8):e42530. doi: 10.1371/journal.pone.0042530. Epub 2012 Aug 9.

DOI:10.1371/journal.pone.0042530
PMID:22912707
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3415421/
Abstract

Genome-wide association studies have found thousands of common genetic variants associated with a wide variety of diseases and other complex traits. However, a large portion of the predicted genetic contribution to many traits remains unknown. One plausible explanation is that some of the missing variation is due to the effects of rare variants. Nonetheless, the statistical analysis of rare variants is challenging. A commonly used method is to contrast, within the same region (gene), the frequency of minor alleles at rare variants between cases and controls. However, this strategy is most useful under the assumption that the tested variants have similar effects. We previously proposed a method that can accommodate heterogeneous effects in the analysis of quantitative traits. Here we extend this method to include binary traits that can accommodate covariates. We use simulations for a variety of causal and covariate impact scenarios to compare the performance of the proposed method to standard logistic regression, C-alpha, SKAT, and EREC. We found that i) logistic regression methods perform well when the heterogeneity of the effects is not extreme and ii) SKAT and EREC have good performance under all tested scenarios but they can be computationally intensive. Consequently, it would be more computationally desirable to use a two-step strategy by (i) selecting promising genes by faster methods and ii) analyzing selected genes using SKAT/EREC. To select promising genes one can use (1) regression methods when effect heterogeneity is assumed to be low and the covariates explain a non-negligible part of trait variability, (2) C-alpha when heterogeneity is assumed to be large and covariates explain a small fraction of trait's variability and (3) the proposed trend and heterogeneity test when the heterogeneity is assumed to be non-trivial and the covariates explain a large fraction of trait variability.

摘要

全基因组关联研究发现了数千种与多种疾病和其他复杂特征相关的常见遗传变异。然而,许多特征的预测遗传贡献的很大一部分仍然未知。一个合理的解释是,一些缺失的变异是由于罕见变异的影响。尽管如此,罕见变异的统计分析仍然具有挑战性。一种常用的方法是在同一区域(基因)内,比较病例和对照之间罕见变异的次要等位基因频率。然而,这种策略在测试变体具有相似影响的假设下最有用。我们之前提出了一种可以在分析定量特征时容纳异质性效应的方法。在这里,我们将该方法扩展到包括可以容纳协变量的二元特征。我们使用各种因果和协变量影响场景的模拟来比较所提出的方法与标准逻辑回归、C-alpha、SKAT 和 EREC 的性能。我们发现:i)当效应异质性不是极端时,逻辑回归方法的性能良好;ii)SKAT 和 EREC 在所有测试场景下都具有良好的性能,但它们可能计算密集。因此,使用两步策略更为理想,即通过 i)使用更快的方法选择有前途的基因,ii)使用 SKAT/EREC 分析选定的基因。要选择有前途的基因,可以使用 1)当假设效应异质性较低且协变量解释了特征可变性的不可忽略部分时使用回归方法,2)当假设异质性较大且协变量解释了特征可变性的一小部分时使用 C-alpha,以及 3)当假设异质性不可忽视且协变量解释了特征可变性的很大一部分时使用拟议的趋势和异质性检验。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a870/3415421/42fc4dea4418/pone.0042530.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a870/3415421/187eeab2ad91/pone.0042530.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a870/3415421/4deab36b742b/pone.0042530.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a870/3415421/1bcd692fa5dd/pone.0042530.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a870/3415421/42fc4dea4418/pone.0042530.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a870/3415421/187eeab2ad91/pone.0042530.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a870/3415421/4deab36b742b/pone.0042530.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a870/3415421/1bcd692fa5dd/pone.0042530.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a870/3415421/42fc4dea4418/pone.0042530.g004.jpg

相似文献

1
Comparison of statistical tests for association between rare variants and binary traits.稀有变异与二项性状关联性的统计检验比较。
PLoS One. 2012;7(8):e42530. doi: 10.1371/journal.pone.0042530. Epub 2012 Aug 9.
2
A Comparison Study of Fixed and Mixed Effect Models for Gene Level Association Studies of Complex Traits.复杂性状基因水平关联研究中固定效应模型与混合效应模型的比较研究
Genet Epidemiol. 2016 Dec;40(8):702-721. doi: 10.1002/gepi.21984. Epub 2016 Jul 4.
3
Comparison of methods and sampling designs to test for association between rare variants and quantitative traits.用于检测罕见变异与数量性状之间关联的方法和抽样设计的比较。
Genet Epidemiol. 2011 May;35(4):226-35. doi: 10.1002/gepi.20570.
4
Generalized functional linear models for gene-based case-control association studies.用于基于基因的病例对照关联研究的广义功能线性模型。
Genet Epidemiol. 2014 Nov;38(7):622-637. doi: 10.1002/gepi.21840. Epub 2014 Sep 9.
5
A comparison study of multivariate fixed models and Gene Association with Multiple Traits (GAMuT) for next-generation sequencing.下一代测序中多变量固定模型与多性状基因关联分析(GAMuT)的比较研究
Genet Epidemiol. 2017 Jan;41(1):18-34. doi: 10.1002/gepi.22014. Epub 2016 Dec 5.
6
Evaluating the Calibration and Power of Three Gene-Based Association Tests of Rare Variants for the X Chromosome.评估X染色体上三种基于基因的罕见变异关联测试的校准度和效能。
Genet Epidemiol. 2015 Nov;39(7):499-508. doi: 10.1002/gepi.21935. Epub 2015 Oct 10.
7
Rare variant association test with multiple phenotypes.针对多种表型的罕见变异关联测试。
Genet Epidemiol. 2017 Apr;41(3):198-209. doi: 10.1002/gepi.22021. Epub 2016 Dec 31.
8
Gene-Based Association Analysis for Censored Traits Via Fixed Effect Functional Regressions.基于固定效应函数回归的删失性状基因关联分析
Genet Epidemiol. 2016 Feb;40(2):133-43. doi: 10.1002/gepi.21947. Epub 2016 Jan 18.
9
Methodology for the analysis of rare genetic variation in genome-wide association and re-sequencing studies of complex human traits.复杂人类性状的全基因组关联研究和重测序研究中罕见基因变异的分析方法学。
Brief Funct Genomics. 2014 Sep;13(5):362-70. doi: 10.1093/bfgp/elu012. Epub 2014 Jun 10.
10
Weighted selective collapsing strategy for detecting rare and common variants in genetic association study.加权选择性合并策略在遗传关联研究中检测罕见和常见变异。
BMC Genet. 2012 Feb 6;13:7. doi: 10.1186/1471-2156-13-7.

引用本文的文献

1
Real world scenarios in rare variant association analysis: the impact of imbalance and sample size on the power in silico.真实世界中的稀有变异关联分析场景:不平衡和样本量对计算机模拟功效的影响。
BMC Bioinformatics. 2019 Jan 22;20(1):46. doi: 10.1186/s12859-018-2591-6.
2
A gene-based test of association through an orthogonal decomposition of genotype scores.通过基因型得分的正交分解进行基于基因的关联测试。
Hum Genet. 2017 Oct;136(10):1385-1394. doi: 10.1007/s00439-017-1839-y. Epub 2017 Sep 1.
3
A biologically informed method for detecting rare variant associations.

本文引用的文献

1
An abundance of rare functional variants in 202 drug target genes sequenced in 14,002 people.在 14002 个人中对 202 个药物靶标基因进行测序,发现了大量罕见的功能变异。
Science. 2012 Jul 6;337(6090):100-4. doi: 10.1126/science.1217876. Epub 2012 May 17.
2
A general framework for detecting disease associations with rare variants in sequencing studies.一种用于在测序研究中检测罕见变异与疾病关联的通用框架。
Am J Hum Genet. 2011 Sep 9;89(3):354-67. doi: 10.1016/j.ajhg.2011.07.015. Epub 2011 Sep 1.
3
Studying gene and gene-environment effects of uncommon and common variants on continuous traits: a marker-set approach using gene-trait similarity regression.
一种用于检测罕见变异关联的基于生物学信息的方法。
BioData Min. 2016 Aug 30;9(1):27. doi: 10.1186/s13040-016-0107-3. eCollection 2016.
4
KNOWLEDGE DRIVEN BINNING AND PHEWAS ANALYSIS IN MARSHFIELD PERSONALIZED MEDICINE RESEARCH PROJECT USING BIOBIN.在马什菲尔德个性化医学研究项目中使用BioBin进行知识驱动的分箱和全表型组关联研究分析
Pac Symp Biocomput. 2016;21:249-60.
5
Next Generation Statistical Genetics: Modeling, Penalization, and Optimization in High-Dimensional Data.下一代统计遗传学:高维数据中的建模、惩罚与优化
Annu Rev Stat Appl. 2014 Jan 1;1(1):279-300. doi: 10.1146/annurev-statistics-022513-115638.
6
Rare variants detection with kernel machine learning based on likelihood ratio test.基于似然比检验的核机器学习稀有变异检测
PLoS One. 2014 Mar 27;9(3):e93355. doi: 10.1371/journal.pone.0093355. eCollection 2014.
7
Two-phase and family-based designs for next-generation sequencing studies.用于下一代测序研究的两阶段和基于家系的设计。
Front Genet. 2013 Dec 13;4:276. doi: 10.3389/fgene.2013.00276.
8
Assessing association between protein truncating variants and quantitative traits.评估蛋白截断变异与数量性状之间的关联。
Bioinformatics. 2013 Oct 1;29(19):2419-26. doi: 10.1093/bioinformatics/btt409. Epub 2013 Jul 16.
研究罕见和常见变异对连续性状的基因和基因-环境效应:一种使用基因-性状相似性回归的标记集方法。
Am J Hum Genet. 2011 Aug 12;89(2):277-88. doi: 10.1016/j.ajhg.2011.07.007.
4
Rare-variant association testing for sequencing data with the sequence kernel association test.基于序列核关联检验的测序数据罕见变异关联分析
Am J Hum Genet. 2011 Jul 15;89(1):82-93. doi: 10.1016/j.ajhg.2011.05.029. Epub 2011 Jul 7.
5
An increased burden of common and rare lipid-associated risk alleles contributes to the phenotypic spectrum of hypertriglyceridemia.常见和罕见的脂质相关风险等位基因负担增加导致高甘油三酯血症表型谱扩大。
Arterioscler Thromb Vasc Biol. 2011 Aug;31(8):1916-26. doi: 10.1161/ATVBAHA.111.226365. Epub 2011 May 19.
6
Testing for an unusual distribution of rare variants.检测罕见变异的异常分布。
PLoS Genet. 2011 Mar;7(3):e1001322. doi: 10.1371/journal.pgen.1001322. Epub 2011 Mar 3.
7
Comparison of methods and sampling designs to test for association between rare variants and quantitative traits.用于检测罕见变异与数量性状之间关联的方法和抽样设计的比较。
Genet Epidemiol. 2011 May;35(4):226-35. doi: 10.1002/gepi.20570.
8
A new testing strategy to identify rare variants with either risk or protective effect on disease.一种新的检测策略,用于识别对疾病具有风险或保护作用的罕见变异。
PLoS Genet. 2011 Feb 3;7(2):e1001289. doi: 10.1371/journal.pgen.1001289.
9
A novel adaptive method for the analysis of next-generation sequencing data to detect complex trait associations with rare variants due to gene main effects and interactions.一种用于分析下一代测序数据的新自适应方法,用于检测由于基因主效应和相互作用而导致的复杂性状关联的罕见变异体。
PLoS Genet. 2010 Oct 14;6(10):e1001156. doi: 10.1371/journal.pgen.1001156.
10
Hundreds of variants clustered in genomic loci and biological pathways affect human height.数以百计的变异体聚集在基因组位置和生物途径中,影响人类身高。
Nature. 2010 Oct 14;467(7317):832-8. doi: 10.1038/nature09410. Epub 2010 Sep 29.