• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用公开可用的参考样本扩大对照组来优化全基因组关联研究的效力。

Optimizing the power of genome-wide association studies by using publicly available reference samples to expand the control group.

机构信息

Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford, UK.

出版信息

Genet Epidemiol. 2010 May;34(4):319-26. doi: 10.1002/gepi.20482.

DOI:10.1002/gepi.20482
PMID:20088020
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2962805/
Abstract

Genome-wide association (GWA) studies have proved extremely successful in identifying novel genetic loci contributing effects to complex human diseases. In doing so, they have highlighted the fact that many potential loci of modest effect remain undetected, partly due to the need for samples consisting of many thousands of individuals. Large-scale international initiatives, such as the Wellcome Trust Case Control Consortium, the Genetic Association Information Network, and the database of genetic and phenotypic information, aim to facilitate discovery of modest-effect genes by making genome-wide data publicly available, allowing information to be combined for the purpose of pooled analysis. In principle, disease or control samples from these studies could be used to increase the power of any GWA study via judicious use as "genetically matched controls" for other traits. Here, we present the biological motivation for the problem and the theoretical potential for expanding the control group with publicly available disease or reference samples. We demonstrate that a naïve application of this strategy can greatly inflate the false-positive error rate in the presence of population structure. As a remedy, we make use of genome-wide data and model selection techniques to identify "axes" of genetic variation which are associated with disease. These axes are then included as covariates in association analysis to correct for population structure, which can result in increases in power over standard analysis of genetic information from the samples in the original GWA study.

摘要

全基因组关联 (GWA) 研究已被证明在确定对复杂人类疾病有影响的新遗传基因座方面非常成功。通过这样做,它们突出了一个事实,即许多潜在的、效应适中的基因座仍然未被发现,部分原因是需要由数千个人组成的样本。大型国际倡议,如威康信托基金会病例对照联合会、遗传关联信息网络和基因与表型信息数据库,旨在通过公开提供全基因组数据,允许为合并分析而合并信息,来促进发现效应适中的基因。原则上,可以使用这些研究中的疾病或对照样本,通过明智地将其用作其他特征的“基因匹配对照”,来增加任何 GWA 研究的功效。在这里,我们提出了该问题的生物学动机和利用公开的疾病或参考样本扩展对照组的理论潜力。我们证明,在存在群体结构的情况下,这种策略的盲目应用会极大地增加假阳性错误率。作为补救措施,我们利用全基因组数据和模型选择技术来识别与疾病相关的遗传变异“轴”。然后,将这些轴作为协变量包含在关联分析中,以纠正群体结构,这可以提高对原始 GWA 研究中样本的遗传信息进行标准分析的功效。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/322c/2962805/decc99855a2f/gepi0034-0319-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/322c/2962805/f700fa7ef1f9/gepi0034-0319-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/322c/2962805/25d6d3003a1b/gepi0034-0319-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/322c/2962805/decc99855a2f/gepi0034-0319-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/322c/2962805/f700fa7ef1f9/gepi0034-0319-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/322c/2962805/25d6d3003a1b/gepi0034-0319-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/322c/2962805/decc99855a2f/gepi0034-0319-f3.jpg

相似文献

1
Optimizing the power of genome-wide association studies by using publicly available reference samples to expand the control group.利用公开可用的参考样本扩大对照组来优化全基因组关联研究的效力。
Genet Epidemiol. 2010 May;34(4):319-26. doi: 10.1002/gepi.20482.
2
Discovery properties of genome-wide association signals from cumulatively combined data sets.从累积合并数据集中发现全基因组关联信号的特性。
Am J Epidemiol. 2009 Nov 15;170(10):1197-206. doi: 10.1093/aje/kwp262. Epub 2009 Oct 6.
3
An evaluation of statistical approaches to rare variant analysis in genetic association studies.遗传关联研究中罕见变异分析的统计方法评估。
Genet Epidemiol. 2010 Feb;34(2):188-93. doi: 10.1002/gepi.20450.
4
Were genome-wide linkage studies a waste of time? Exploiting candidate regions within genome-wide association studies.全基因组连锁研究是否浪费时间?在全基因组关联研究中利用候选区域。
Genet Epidemiol. 2010 Feb;34(2):107-18. doi: 10.1002/gepi.20438.
5
A powerful approach to sub-phenotype analysis in population-based genetic association studies.一种基于人群的遗传关联研究中进行亚表型分析的有力方法。
Genet Epidemiol. 2010 May;34(4):335-43. doi: 10.1002/gepi.20486.
6
Genome-wide association analysis of imputed rare variants: application to seven common complex diseases.推算的罕见变异的全基因组关联分析:应用于七种常见复杂疾病
Genet Epidemiol. 2012 Dec;36(8):785-96. doi: 10.1002/gepi.21675. Epub 2012 Sep 5.
7
The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease.基于基因的罕见变异方法在检测疾病相关变异以及检验关于复杂疾病的假设方面的能力。
PLoS Genet. 2015 Apr 23;11(4):e1005165. doi: 10.1371/journal.pgen.1005165. eCollection 2015 Apr.
8
Improving power of genome-wide association studies with weighted false discovery rate control and prioritized subset analysis.利用加权假发现率控制和优先子集分析提高全基因组关联研究的效能。
PLoS One. 2012;7(4):e33716. doi: 10.1371/journal.pone.0033716. Epub 2012 Apr 9.
9
On the prospects of whole-genome association mapping in Saccharomyces cerevisiae.全基因组关联图谱在酿酒酵母中的前景。
Genetics. 2012 Aug;191(4):1345-53. doi: 10.1534/genetics.112.141168. Epub 2012 Jun 5.
10
Pleiotropy informed adaptive association test of multiple traits using genome-wide association study summary data.利用全基因组关联研究汇总数据进行多性状的多效性知情适应性关联测试。
Biometrics. 2019 Dec;75(4):1076-1085. doi: 10.1111/biom.13076. Epub 2019 Aug 2.

引用本文的文献

1
GAWMerge expands GWAS sample size and diversity by combining array-based genotyping and whole-genome sequencing.GAWMerge 通过结合基于阵列的基因分型和全基因组测序来扩大 GWAS 的样本量和多样性。
Commun Biol. 2022 Aug 11;5(1):806. doi: 10.1038/s42003-022-03738-6.
2
Best practices for analyzing imputed genotypes from low-pass sequencing in dogs.用于分析犬低深度测序中导入基因型的最佳实践。
Mamm Genome. 2022 Mar;33(1):213-229. doi: 10.1007/s00335-021-09914-z. Epub 2021 Sep 8.
3
KAT2B polymorphism identified for drug abuse in African Americans with regulatory links to drug abuse pathways in human prefrontal cortex.

本文引用的文献

1
SNP selection and multidimensional scaling to quantify population structure.单核苷酸多态性(SNP)选择与多维尺度分析以量化群体结构
Genet Epidemiol. 2009 Sep;33(6):488-96. doi: 10.1002/gepi.20401.
2
Genomewide association analysis of coronary artery disease.冠状动脉疾病的全基因组关联分析。
N Engl J Med. 2007 Aug 2;357(5):443-53. doi: 10.1056/NEJMoa072366. Epub 2007 Jul 18.
3
A new multipoint method for genome-wide association studies by imputation of genotypes.一种通过基因型插补进行全基因组关联研究的新的多点方法。
在非裔美国人中发现的KAT2B基因多态性与药物滥用有关,且与人类前额叶皮质中的药物滥用途径存在调控联系。
Addict Biol. 2016 Nov;21(6):1217-1232. doi: 10.1111/adb.12286. Epub 2015 Jul 23.
4
Imputation across genotyping arrays for genome-wide association studies: assessment of bias and a correction strategy.全基因组关联研究中基于基因分型阵列的插补:偏差评估和校正策略。
Hum Genet. 2013 May;132(5):509-22. doi: 10.1007/s00439-013-1266-7. Epub 2013 Jan 22.
5
Transethnic meta-analysis of genomewide association studies.全基因组关联研究的跨种族荟萃分析。
Genet Epidemiol. 2011 Dec;35(8):809-22. doi: 10.1002/gepi.20630.
6
Including additional controls from public databases improves the power of a genome-wide association study.纳入来自公共数据库的额外对照可提高全基因组关联研究的效能。
Hum Hered. 2011;72(1):21-34. doi: 10.1159/000330149. Epub 2011 Aug 17.
7
Genome-wide association study of Stevens-Johnson Syndrome and Toxic Epidermal Necrolysis in Europe.全基因组关联研究在欧洲史蒂文斯-约翰逊综合征和中毒性表皮坏死松解症。
Orphanet J Rare Dis. 2011 Jul 29;6:52. doi: 10.1186/1750-1172-6-52.
8
Artifact due to differential error when cases and controls are imputed from different platforms.由于病例和对照是从不同平台推断出来的,因此存在差异错误导致的伪影。
Hum Genet. 2012 Jan;131(1):111-9. doi: 10.1007/s00439-011-1054-1. Epub 2011 Jul 7.
9
Confounded by sequencing depth in association studies of rare alleles.在罕见等位基因关联研究中受测序深度的困扰。
Genet Epidemiol. 2011 May;35(4):261-8. doi: 10.1002/gepi.20574.
10
Clustering by genetic ancestry using genome-wide SNP data.基于全基因组 SNP 数据的遗传谱系聚类分析。
BMC Genet. 2010 Dec 9;11:108. doi: 10.1186/1471-2156-11-108.
Nat Genet. 2007 Jul;39(7):906-13. doi: 10.1038/ng2088. Epub 2007 Jun 17.
4
Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.对14000例七种常见疾病患者及3000例共享对照进行全基因组关联研究。
Nature. 2007 Jun 7;447(7145):661-78. doi: 10.1038/nature05911.
5
Sequence variants in the autophagy gene IRGM and multiple other replicating loci contribute to Crohn's disease susceptibility.自噬基因IRGM及多个其他复制位点的序列变异会导致克罗恩病易感性。
Nat Genet. 2007 Jul;39(7):830-2. doi: 10.1038/ng2061. Epub 2007 Jun 6.
6
Robust associations of four new chromosome regions from genome-wide analyses of type 1 diabetes.通过全基因组分析得出的1型糖尿病四个新染色体区域的强关联。
Nat Genet. 2007 Jul;39(7):857-64. doi: 10.1038/ng2068. Epub 2007 Jun 6.
7
Replication of genome-wide association signals in UK samples reveals risk loci for type 2 diabetes.在英国样本中对全基因组关联信号进行复制,揭示了2型糖尿病的风险位点。
Science. 2007 Jun 1;316(5829):1336-41. doi: 10.1126/science.1142364. Epub 2007 Apr 26.
8
A common variant in the FTO gene is associated with body mass index and predisposes to childhood and adult obesity.FTO基因中的一种常见变异与体重指数相关,并易导致儿童期和成年期肥胖。
Science. 2007 May 11;316(5826):889-94. doi: 10.1126/science.1141634. Epub 2007 Apr 12.
9
Population structure and eigenanalysis.群体结构与特征分析
PLoS Genet. 2006 Dec;2(12):e190. doi: 10.1371/journal.pgen.0020190.
10
Principal components analysis corrects for stratification in genome-wide association studies.主成分分析可校正全基因组关联研究中的分层现象。
Nat Genet. 2006 Aug;38(8):904-9. doi: 10.1038/ng1847. Epub 2006 Jul 23.