基于 P 值的全基因组关联研究中共享对照设计分析。

P-value based analysis for shared controls design in genome-wide association studies.

机构信息

Biostatistics Branch, National Institute of Environmental Health Sciences, National Institutes of Health, North Carolina, USA.

出版信息

Genet Epidemiol. 2010 Nov;34(7):725-38. doi: 10.1002/gepi.20536.

DOI:10.1002/gepi.20536

PMID:20976797

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3190645/

Abstract

An appealing genome-wide association study design compares one large control group against several disease samples. A pioneering study by the Wellcome Trust Case Control Consortium that employed such a design has identified multiple susceptibility regions, many of which have been independently replicated. While reusing a control sample provides effective utilization of data, it also creates correlation between association statistics across diseases. An observation of a large association statistic for one of the diseases may greatly increase chances of observing a spuriously large association for a different disease. Accounting for the correlation is also particularly important when screening for SNPs that might be involved in a set of diseases with overlapping etiology. We describe methods that correct association statistics for dependency due to shared controls, and we describe ways to obtain a measure of overall evidence and to combine association signals across multiple diseases. The methods we describe require no access to individual subject data, instead, they efficiently utilize information contained in P-values for association reported for individual diseases. P-value based combined tests for association are flexible and essentially as powerful as the approach based on aggregating the individual subject data.

摘要

一种有吸引力的全基因组关联研究设计是将一个大型对照组与几个疾病样本进行比较。由 Wellcome Trust Case Control Consortium 进行的一项开创性研究采用了这种设计，该研究已经确定了多个易感性区域，其中许多已经得到了独立的复制。虽然重复使用对照样本可以有效地利用数据，但它也会导致不同疾病之间的关联统计数据之间存在相关性。对于一种疾病的关联统计数据的一个大观察结果可能会大大增加对不同疾病中出现假性大关联的可能性。在筛选可能与重叠病因的一组疾病相关的 SNPs 时，考虑相关性也尤为重要。我们描述了校正因共享对照而导致的关联统计数据的方法，以及获取整体证据的度量和组合多个疾病的关联信号的方法。我们描述的方法不需要访问个人主体数据，而是有效地利用了针对个体疾病报告的关联 P 值中包含的信息。基于 P 值的联合关联检验非常灵活，并且与基于聚合个体数据的方法一样强大。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aef/3190645/19d694e67f04/nihms322355f1.jpg

相似文献

P-value based analysis for shared controls design in genome-wide association studies.基于 P 值的全基因组关联研究中共享对照设计分析。

Genet Epidemiol. 2010 Nov;34(7):725-38. doi: 10.1002/gepi.20536.

Design and analysis of multiple diseases genome-wide association studies without controls.无对照的多种疾病全基因组关联研究的设计与分析。

Gene. 2012 Nov 15;510(1):87-92. doi: 10.1016/j.gene.2012.07.089. Epub 2012 Aug 23.

A novel genome-information content-based statistic for genome-wide association analysis designed for next-generation sequencing data.一种基于基因组信息含量的新型统计方法，用于针对下一代测序数据的全基因组关联分析。

J Comput Biol. 2012 Jun;19(6):731-44. doi: 10.1089/cmb.2012.0035. Epub 2012 May 31.

Testing for association in case-control genome-wide association studies with shared controls.在具有共享对照的病例对照全基因组关联研究中进行关联测试。

Stat Methods Med Res. 2016 Apr;25(2):954-67. doi: 10.1177/0962280212474061. Epub 2013 Feb 1.

Pathway-based analysis for genome-wide association studies using supervised principal components.基于有监督主成分的全基因组关联研究的通路分析。

Genet Epidemiol. 2010 Nov;34(7):716-24. doi: 10.1002/gepi.20532.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

A robust genome-wide scan statistic of the Wellcome Trust Case-Control Consortium.威康信托病例对照研究联盟的一项强大的全基因组扫描统计数据。

Biometrics. 2009 Dec;65(4):1115-22. doi: 10.1111/j.1541-0420.2009.01185.x.

Meta-analysis of genetic association studies and adjustment for multiple testing of correlated SNPs and traits.遗传关联研究的荟萃分析及相关 SNP 和性状的多重检验校正。

Genet Epidemiol. 2010 Nov;34(7):739-46. doi: 10.1002/gepi.20538.

Genetic overlap analysis of endometriosis and asthma identifies shared loci implicating sex hormones and thyroid signalling pathways.子宫内膜异位症和哮喘的遗传重叠分析确定了与性激素和甲状腺信号通路相关的共同位点。

Hum Reprod. 2022 Jan 28;37(2):366-383. doi: 10.1093/humrep/deab254.

MegaSNPHunter: a learning approach to detect disease predisposition SNPs and high level interactions in genome wide association study.MegaSNPHunter：一种在全基因组关联研究中检测疾病易感性单核苷酸多态性和高阶相互作用的学习方法。

BMC Bioinformatics. 2009 Jan 9;10:13. doi: 10.1186/1471-2105-10-13.

引用本文的文献

PASTRY: achieving balanced power for detecting risk and protective minor alleles in meta-analysis of association studies with overlapping subjects.荟萃分析重叠研究对象的关联研究中检测风险和保护次要等位基因的平衡功效：Pastry 方法。

BMC Bioinformatics. 2024 Jan 12;25(1):24. doi: 10.1186/s12859-023-05627-z.

More practical differentially private publication of key statistics in GWAS.全基因组关联研究中关键统计量的更实用的差分隐私发布。

Bioinform Adv. 2021 May 18;1(1):vbab004. doi: 10.1093/bioadv/vbab004. eCollection 2021.

Meta-analysis of SNP-environment interaction with heterogeneity for overlapping data.基于重叠数据的 SNP-环境互作异质性的荟萃分析。

Sci Rep. 2021 Jan 28;11(1):2590. doi: 10.1038/s41598-021-82336-8.

Meta-Analysis of SNP-Environment Interaction With Overlapping Data.具有重叠数据的单核苷酸多态性-环境相互作用的Meta分析

Front Genet. 2020 Jan 30;10:1400. doi: 10.3389/fgene.2019.01400. eCollection 2019.

Combining CDKN1A gene expression and genome-wide SNPs in a twin cohort to gain insight into the heritability of individual radiosensitivity.结合双胞胎队列中的 CDKN1A 基因表达和全基因组 SNPs 深入了解个体放射敏感性的遗传性。

Funct Integr Genomics. 2019 Jul;19(4):575-585. doi: 10.1007/s10142-019-00658-3. Epub 2019 Jan 31.

Combining controls can improve power in two-stage association studies.合并对照可提高两阶段关联研究的检验效能。

BMC Genet. 2018 Oct 3;19(1):89. doi: 10.1186/s12863-018-0675-y.

Meta-analysis of genome-wide association studies of aggressive and chronic periodontitis identifies two novel risk loci.全基因组关联研究的荟萃分析鉴定出侵袭性和慢性牙周炎的两个新的风险位点。

Eur J Hum Genet. 2019 Jan;27(1):102-113. doi: 10.1038/s41431-018-0265-5. Epub 2018 Sep 14.

Genome-wide association meta-analysis of coronary artery disease and periodontitis reveals a novel shared risk locus.全基因组关联荟萃分析表明，冠心病和牙周炎存在一个新的共同风险位点。

Sci Rep. 2018 Sep 12;8(1):13678. doi: 10.1038/s41598-018-31980-8.

Statistical power and utility of meta-analysis methods for cross-phenotype genome-wide association studies.元分析方法在跨表型全基因组关联研究中的统计功效和实用性。

PLoS One. 2018 Mar 1;13(3):e0193256. doi: 10.1371/journal.pone.0193256. eCollection 2018.

An efficient Bayesian meta-analysis approach for studying cross-phenotype genetic associations.一种用于研究跨表型遗传关联的高效贝叶斯元分析方法。

PLoS Genet. 2018 Feb 12;14(2):e1007139. doi: 10.1371/journal.pgen.1007139. eCollection 2018 Feb.

本文引用的文献

Robust Tests in Genome-Wide Scans under Incomplete Linkage Disequilibrium.不完全连锁不平衡下全基因组扫描中的稳健检验

Stat Sci. 2009 Nov;24(4):503-516. doi: 10.1214/09-sts314. Epub 2010 Apr 20.

Meta-analysis of genome-wide association studies with overlapping subjects.基于重叠研究对象的全基因组关联研究的荟萃分析。

Am J Hum Genet. 2009 Dec;85(6):862-72. doi: 10.1016/j.ajhg.2009.11.001.

Meta-analysis of genome-wide association studies: no efficiency gain in using individual participant data.全基因组关联研究的荟萃分析：使用个体参与者数据没有效率增益。

Genet Epidemiol. 2010 Jan;34(1):60-6. doi: 10.1002/gepi.20435.

Multiple Loci within the major histocompatibility complex confer risk of psoriasis.主要组织相容性复合体中的多个基因座会增加患牛皮癣的风险。

PLoS Genet. 2009 Aug;5(8):e1000606. doi: 10.1371/journal.pgen.1000606. Epub 2009 Aug 14.

Genome-wide association study and meta-analysis find that over 40 loci affect risk of type 1 diabetes.全基因组关联研究和荟萃分析发现，40 多个位点影响 1 型糖尿病的风险。

Nat Genet. 2009 Jun;41(6):703-7. doi: 10.1038/ng.381. Epub 2009 May 10.

Replication of signals from recent studies of Crohn's disease identifies previously unknown disease loci for ulcerative colitis.克罗恩病近期研究信号的复制确定了溃疡性结肠炎先前未知的疾病位点。

Nat Genet. 2008 Jun;40(6):713-5. doi: 10.1038/ng.148. Epub 2008 Apr 27.

A note on allelic tests in case-control association studies.病例对照关联研究中基因座测试的一则注释

Ann Hum Genet. 2008 May;72(Pt 3):407-9. doi: 10.1111/j.1469-1809.2008.00438.x. Epub 2008 Mar 17.

Genetic flip-flop without an accompanying change in linkage disequilibrium.基因触发器，连锁不平衡无伴随变化。

Am J Hum Genet. 2008 Mar;82(3):794-6; author reply 796-7. doi: 10.1016/j.ajhg.2008.02.001.

Guilt beyond a reasonable doubt.毫无疑问的内疚。

Nat Genet. 2007 Jul;39(7):813-5. doi: 10.1038/ng0707-813.

Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.对14000例七种常见疾病患者及3000例共享对照进行全基因组关联研究。

Nature. 2007 Jun 7;447(7145):661-78. doi: 10.1038/nature05911.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验