Suppr
超能文献

重新审视常见变异 GWAS 的全基因组显著阈值。

Revisiting the genome-wide significance threshold for common variant GWAS.

机构信息

Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI 48109-2029, USA.

出版信息

G3 (Bethesda). 2021 Feb 9;11(2). doi: 10.1093/g3journal/jkaa056.

DOI:10.1093/g3journal/jkaa056

PMID:33585870

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8022962/

Abstract

Over the last decade, GWAS meta-analyses have used a strict P-value threshold of 5 × 10-8 to classify associations as significant. Here, we use our current understanding of frequently studied traits including lipid levels, height, and BMI to revisit this genome-wide significance threshold. We compare the performance of studies using the P = 5 × 10-8 threshold in terms of true and false positive rate to other multiple testing strategies: (1) less stringent P-value thresholds, (2) controlling the FDR with the Benjamini-Hochberg and Benjamini-Yekutieli procedure, and (3) controlling the Bayesian FDR with posterior probabilities. We applied these procedures to re-analyze results from the Global Lipids and GIANT GWAS meta-analysis consortia and supported them with extensive simulation that mimics the empirical data. We observe in simulated studies with sample sizes ∼20,000 and >120,000 that relaxing the P-value threshold to 5 × 10-7 increased discovery at the cost of 18% and 8% of additional loci being false positive results, respectively. FDR and Bayesian FDR are well controlled for both sample sizes with a few exceptions that disappear under a less stringent definition of true positives and the two approaches yield similar results. Our work quantifies the value of using a relaxed P-value threshold in large studies to increase their true positive discovery but also show the excess false positive rates due to such actions in modest-sized studies. These results may guide investigators considering different thresholds in replication studies and downstream work such as gene-set enrichment or pathway analysis. Finally, we demonstrate the viability of FDR-controlling procedures in GWAS.

摘要

在过去的十年中，GWAS 荟萃分析使用严格的 P 值阈值 5×10-8 将关联分类为显著。在这里，我们利用我们目前对经常研究的特征（包括血脂水平、身高和 BMI）的理解，重新审视这个全基因组显著性阈值。我们比较了使用 P=5×10-8 阈值的研究在真阳性率和假阳性率方面的表现，与其他多重检验策略相比：（1）更宽松的 P 值阈值；（2）使用 Benjamini-Hochberg 和 Benjamini-Yekutieli 程序控制 FDR；（3）使用后验概率控制贝叶斯 FDR。我们将这些程序应用于重新分析来自全球脂质和 GIANT GWAS 荟萃分析联盟的结果，并通过模拟经验数据的广泛模拟来支持它们。我们在模拟研究中观察到，在样本量约为 20000 和>120000 的情况下，放宽 P 值阈值至 5×10-7 会以 18%和 8%的额外假阳性结果为代价增加发现，但 FDR 和贝叶斯 FDR 都得到了很好的控制，只有少数例外情况在更宽松的真阳性定义下消失，两种方法的结果相似。我们的工作量化了在大型研究中使用放宽的 P 值阈值来增加其真阳性发现的价值，但也显示了在适度样本量的研究中由于这种方法而导致的额外假阳性率。这些结果可能会指导研究人员在复制研究和下游工作（如基因集富集或通路分析）中考虑不同的阈值。最后，我们展示了 FDR 控制程序在 GWAS 中的可行性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8890/8022962/5c5d614a98b8/jkaa056f1.jpg

相似文献

Revisiting the genome-wide significance threshold for common variant GWAS.

G3 (Bethesda). 2021 Feb 9;11(2). doi: 10.1093/g3journal/jkaa056.

Resampling-based empirical Bayes multiple testing procedures for controlling generalized tail probability and expected value error rates: focus on the false discovery rate and simulation study.

Biom J. 2008 Oct;50(5):716-44. doi: 10.1002/bimj.200710473.

Wavelet thresholding with bayesian false discovery rate control.

Biometrics. 2005 Mar;61(1):25-35. doi: 10.1111/j.0006-341X.2005.031102.x.

Modifying the false discovery rate procedure based on the information theory under arbitrary correlation structure and its performance in high-dimensional genomic data.

BMC Bioinformatics. 2024 Feb 5;25(1):57. doi: 10.1186/s12859-024-05678-w.

A pleiotropy-informed Bayesian false discovery rate adapted to a shared control design finds new disease associations from GWAS summary statistics.

PLoS Genet. 2015 Feb 6;11(2):e1004926. doi: 10.1371/journal.pgen.1004926. eCollection 2015 Feb.

Significance testing and genomic inflation factor using high-density genotypes or whole-genome sequence data.

J Anim Breed Genet. 2019 Nov;136(6):418-429. doi: 10.1111/jbg.12419. Epub 2019 Jun 19.

Local and Bayesian Survival FDR Estimations to Identify Reliable Associations in Whole Genome of Bread Wheat.

Int J Mol Sci. 2023 Sep 12;24(18):14011. doi: 10.3390/ijms241814011.

Improving power of genome-wide association studies with weighted false discovery rate control and prioritized subset analysis.

PLoS One. 2012;7(4):e33716. doi: 10.1371/journal.pone.0033716. Epub 2012 Apr 9.

Mapping multiple quantitative trait loci under Bayes error control.

Genet Res (Camb). 2009 Jun;91(3):147-59. doi: 10.1017/S001667230900010X.

Genome-wide genetic analyses highlight mitogen-activated protein kinase (MAPK) signaling in the pathogenesis of endometriosis.

Hum Reprod. 2017 Apr 1;32(4):780-793. doi: 10.1093/humrep/dex024.

引用本文的文献

Disentangling soybean GxE effects in an integrated genomic prediction and machine learning-GWAS workflow.

Plant Methods. 2025 Aug 25;21(1):119. doi: 10.1186/s13007-025-01434-0.

Pharmacogenomics of steroid-induced ocular hypertension: relationship to high-tension glaucomas and new pathophysiologic insight.

medRxiv. 2025 Aug 13:2025.08.11.25333245. doi: 10.1101/2025.08.11.25333245.

Effects of Antibiotic Residues on Fecal Microbiota Composition and Antimicrobial Resistance Gene Profiles in Cattle from Northwestern China.

Microorganisms. 2025 Jul 14;13(7):1658. doi: 10.3390/microorganisms13071658.

Genetic Insights Into the Link Between Restless Legs Syndrome and Diabetic Nephropathy Risk.

Brain Behav. 2025 Jul;15(7):e70696. doi: 10.1002/brb3.70696.

Causal relationships between inflammatory mediators, sleep traits, and cardiac magnetic resonance imaging-derived cardiac phenotypes.

Sci Rep. 2025 Jul 11;15(1):25118. doi: 10.1038/s41598-025-10929-8.

Barrier genes are associated with preterm birth.

Front Med (Lausanne). 2025 Jun 23;12:1580877. doi: 10.3389/fmed.2025.1580877. eCollection 2025.

Integrative bioinformatics frameworks for abdominal aortic aneurysm using GWAS meta-analysis, biological network construction, and structural modeling.

Sci Rep. 2025 Jul 1;15(1):22331. doi: 10.1038/s41598-025-07989-1.

A mathematical model that predicts human biological age from physiological traits identifies environmental and genetic factors that influence aging.

Elife. 2025 Jun 11;13:RP92092. doi: 10.7554/eLife.92092.

Clinical and Genetic Factors Associated with Intraoperative Minimum Alveolar Concentration Ratio: A Single-center Retrospective Cohort and Genome-wide Association Study.

Anesthesiology. 2025 Jul 21. doi: 10.1097/ALN.0000000000005602.

Antecedent Flu-Like Illness and Onset of Idiopathic Dilated Cardiomyopathy: The DCM Precision Medicine Study.

Circ Heart Fail. 2025 May;18(5):e012602. doi: 10.1161/CIRCHEARTFAILURE.124.012602. Epub 2025 Apr 14.

本文引用的文献

Exploring various polygenic risk scores for skin cancer in the phenomes of the Michigan genomics initiative and the UK Biobank with a visual catalog: PRSWeb.

PLoS Genet. 2019 Jun 13;15(6):e1008202. doi: 10.1371/journal.pgen.1008202. eCollection 2019 Jun.

Redefine statistical significance.

Nat Hum Behav. 2018 Jan;2(1):6-10. doi: 10.1038/s41562-017-0189-z.

Scientists rise up against statistical significance.

Nature. 2019 Mar;567(7748):305-307. doi: 10.1038/d41586-019-00857-9.

A simple and accurate method to determine genomewide significance for association tests in sequencing studies.

Genet Epidemiol. 2019 Jun;43(4):365-372. doi: 10.1002/gepi.22183. Epub 2019 Jan 8.

Meta-analysis of genome-wide association studies for height and body mass index in ∼700000 individuals of European ancestry.

Hum Mol Genet. 2018 Oct 15;27(20):3641-3649. doi: 10.1093/hmg/ddy271.

Methods for meta-analysis of multiple traits using GWAS summary statistics.

Genet Epidemiol. 2018 Mar;42(2):134-145. doi: 10.1002/gepi.22105. Epub 2017 Dec 10.

Resetting the bar: Statistical significance in whole-genome sequencing-based association studies of global populations.

Genet Epidemiol. 2017 Feb;41(2):145-151. doi: 10.1002/gepi.22032. Epub 2016 Dec 18.

Controlling the Rate of GWAS False Discoveries.

Genetics. 2017 Jan;205(1):61-75. doi: 10.1534/genetics.116.193987. Epub 2016 Oct 26.

A reference panel of 64,976 haplotypes for genotype imputation.

Nat Genet. 2016 Oct;48(10):1279-83. doi: 10.1038/ng.3643. Epub 2016 Aug 22.

Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets.

Nat Genet. 2016 May;48(5):481-7. doi: 10.1038/ng.3538. Epub 2016 Mar 28.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

重新审视常见变异 GWAS 的全基因组显著阈值。

Revisiting the genome-wide significance threshold for common variant GWAS.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译