• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全基因组关联研究中H1模拟的替代方法。

Alternative methods for H1 simulations in genome-wide association studies.

作者信息

Perduca V, Sinoquet C, Mourad R, Nuel G

机构信息

MAP5 - UMR CNRS 8145, Université Paris Descartes, Paris, France. vittorio.perduca @ parisdescartes.fr

出版信息

Hum Hered. 2012;73(2):95-104. doi: 10.1159/000336194. Epub 2012 Mar 28.

DOI:10.1159/000336194
PMID:22472690
Abstract

OBJECTIVE

Assessing the statistical power to detect susceptibility variants plays a critical role in genome-wide association (GWA) studies both from the prospective and retrospective point of view. Power is empirically estimated by simulating phenotypes under a disease model H1. For this purpose, the gold standard consists in simulating genotypes given the phenotypes (e.g. Hapgen). We introduce here an alternative approach for simulating phenotypes under H1 that does not require generating new genotypes for each simulation.

METHODS

In order to simulate phenotypes with a fixed total number of cases and under a given disease model, we suggest 3 algorithms: (1) a simple rejection algorithm; (2) a numerical Markov chain Monte-Carlo (MCMC) approach, and (3) an exact and efficient backward sampling algorithm. In our study, we validated the 3 algorithms both on a simulated dataset and by comparing them with Hapgen on a more realistic dataset. For an application, we then conducted a simulation study on a 1000 Genomes Project dataset consisting of 629 individuals (314 cases) and 8,048 SNPs from chromosome X. We arbitrarily defined an additive disease model with two susceptibility SNPs and an epistatic effect.

RESULTS

The 3 algorithms are consistent, but backward sampling is dramatically faster than the other two. Our approach also gives consistent results with Hapgen. Using our application data, we showed that our limited design requires a biological a priori to limit the investigated region. We also proved that epistatic effects can play a significant role even when simple marker statistics (e.g. trend) are used. We finally showed that the overall performance of a GWA study strongly depends on the prevalence of the disease: the larger the prevalence, the better the power.

CONCLUSIONS

Our approach is a valid alternative to Hapgen-type methods; it is not only dramatically faster but has 2 main advantages: (1) there is no need for sophisticated genotype models (e.g. haplotype frequencies, or recombination rates), and (2) the choice of the disease model is completely unconstrained (number of SNPs involved, gene-environment interactions, hybrid genetic models, etc.). Our 3 algorithms are available in an R package called 'waffect' ('double-u affect', for weighted affectations).

摘要

目的

从前瞻性和回顾性角度来看,评估检测易感性变异的统计效能在全基因组关联(GWA)研究中都起着关键作用。效能是通过在疾病模型H1下模拟表型进行经验性估计的。为此,金标准是根据表型模拟基因型(例如Hapgen)。我们在此介绍一种在H1下模拟表型的替代方法,该方法无需为每次模拟生成新的基因型。

方法

为了在固定病例总数且给定疾病模型下模拟表型,我们提出了3种算法:(1)一种简单的拒绝算法;(2)一种数值马尔可夫链蒙特卡罗(MCMC)方法,以及(3)一种精确且高效的反向抽样算法。在我们的研究中,我们在模拟数据集上以及通过在更现实的数据集上与Hapgen进行比较来验证这3种算法。作为应用,我们随后对一个由629个个体(314例病例)和来自X染色体的8048个单核苷酸多态性(SNP)组成的千人基因组计划数据集进行了模拟研究。我们任意定义了一个具有两个易感性SNP和上位效应的加性疾病模型。

结果

这3种算法是一致的,但反向抽样比其他两种算法快得多。我们的方法与Hapgen也给出了一致的结果。使用我们的应用数据,我们表明我们有限的设计需要一个生物学先验来限制研究区域。我们还证明即使使用简单的标记统计量(例如趋势),上位效应也可能起重要作用。我们最终表明GWA研究的总体效能强烈依赖于疾病的患病率:患病率越高,效能越好。

结论

我们的方法是Hapgen型方法的有效替代方法;它不仅显著更快,而且有两个主要优点:(1)无需复杂的基因型模型(例如单倍型频率或重组率),以及(2)疾病模型的选择完全不受限制(涉及的SNP数量、基因 - 环境相互作用、混合遗传模型等)。我们的3种算法可在一个名为“waffect”(“双u影响”,用于加权分配)的R包中获取。

相似文献

1
Alternative methods for H1 simulations in genome-wide association studies.全基因组关联研究中H1模拟的替代方法。
Hum Hered. 2012;73(2):95-104. doi: 10.1159/000336194. Epub 2012 Mar 28.
2
Direct analysis of unphased SNP genotype data in population-based association studies via Bayesian partition modelling of haplotypes.在基于人群的关联研究中,通过单倍型的贝叶斯分区建模对未分型的单核苷酸多态性(SNP)基因型数据进行直接分析。
Genet Epidemiol. 2005 Sep;29(2):91-107. doi: 10.1002/gepi.20080.
3
The impact of missing and erroneous genotypes on tagging SNP selection and power of subsequent association tests.缺失和错误基因型对标签单核苷酸多态性选择及后续关联检验效能的影响。
Hum Hered. 2006;61(1):31-44. doi: 10.1159/000092141. Epub 2006 Mar 23.
4
Evaluation of potential power gain with imputed genotypes in genome-wide association studies.在全基因组关联研究中使用推算基因型评估潜在的功效增益。
Hum Hered. 2009;68(1):23-34. doi: 10.1159/000210446. Epub 2009 Apr 1.
5
Bivariate association analysis for quantitative traits using generalized estimation equation.使用广义估计方程进行数量性状的双变量关联分析。
J Genet Genomics. 2009 Dec;36(12):733-43. doi: 10.1016/S1673-8527(08)60166-6.
6
SNPHarvester: a filtering-based approach for detecting epistatic interactions in genome-wide association studies.SNPHarvester:一种在全基因组关联研究中基于过滤的上位性相互作用检测方法。
Bioinformatics. 2009 Feb 15;25(4):504-11. doi: 10.1093/bioinformatics/btn652. Epub 2008 Dec 19.
7
A Bayesian model for detection of high-order interactions among genetic variants in genome-wide association studies.一种用于在全基因组关联研究中检测基因变异间高阶相互作用的贝叶斯模型。
BMC Genomics. 2015 Nov 25;16:1011. doi: 10.1186/s12864-015-2217-6.
8
Genome simulation approaches for synthesizing in silico datasets for human genomics.基因组模拟方法合成人类基因组学的计算机数据集。
Adv Genet. 2010;72:1-24. doi: 10.1016/B978-0-12-380862-2.00001-1.
9
Bayesian variable and model selection methods for genetic association studies.用于基因关联研究的贝叶斯变量与模型选择方法。
Genet Epidemiol. 2009 Jan;33(1):27-37. doi: 10.1002/gepi.20353.
10
Linear trend tests for case-control genetic association that incorporate random phenotype and genotype misclassification error.纳入随机表型和基因型错误分类误差的病例对照基因关联线性趋势检验。
Genet Epidemiol. 2007 Dec;31(8):853-70. doi: 10.1002/gepi.20246.

引用本文的文献

1
Polygenic background modifies penetrance of monogenic variants for tier 1 genomic conditions.多基因背景会影响一级基因组条件中单基因变异的外显率。
Nat Commun. 2020 Aug 20;11(1):3635. doi: 10.1038/s41467-020-17374-3.
2
Performance of epistasis detection methods in semi-simulated GWAS.连锁不平衡检测方法在半模拟 GWAS 中的性能。
BMC Bioinformatics. 2018 Jun 18;19(1):231. doi: 10.1186/s12859-018-2229-8.
3
An exome sequencing based approach for genome-wide association studies in the dog.基于外显子组测序的全基因组关联研究在犬中的应用。
Sci Rep. 2017 Nov 15;7(1):15680. doi: 10.1038/s41598-017-15947-9.