• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GWAS 显著性阈值对于深度表型研究可以取决于次要等位基因频率和样本量。

GWAS significance thresholds for deep phenotyping studies can depend upon minor allele frequencies and sample size.

机构信息

Department of Psychiatry and Behavioral Neurosciences, University of Chicago, 924 East 57th Street Room. R016, Chicago, IL, 60637, USA.

Department of Psychiatry, University of Texas Southwestern Medical Center, Dallas, TX, USA.

出版信息

Mol Psychiatry. 2021 Jun;26(6):2048-2055. doi: 10.1038/s41380-020-0670-3. Epub 2020 Feb 17.

DOI:10.1038/s41380-020-0670-3
PMID:32066829
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7429341/
Abstract

An important issue affecting genome-wide association studies with deep phenotyping (multiple correlated phenotypes) is determining the suitable family-wise significance threshold. Straightforward family-wise correction (Bonferroni) of p < 0.05 for 4.3 million genotypes and 335 phenotypes would give a threshold of p < 3.46E-11. This would be too conservative because it assumes all tests are independent. The effective number of tests, both phenotypic and genotypic, must be adjusted for the correlations between them. Spectral decomposition of the phenotype matrix and LD-based correction of the number of tested SNPs are currently used to determine an effective number of tests. In this paper, we compare these calculated estimates with permutation-determined family-wise significance thresholds. Permutations are performed by shuffling individual IDs of the genotype vector for this dataset, to preserve correlation of phenotypes. Our results demonstrate that the permutation threshold is influenced by minor allele frequency (MAF) of the SNPs, and by the number of individuals tested. For the more common SNPs (MAF > 0.1), the permutation family-wise threshold was in close agreement with spectral decomposition methods. However, for less common SNPs (0.05 < MAF ≤ 0.1), the permutation threshold calculated over all SNPs was off by orders of magnitude. This applies to the number of individuals studied (here 777) but not to very much larger numbers. Based on these findings, we propose that the threshold to find a particular level of family-wise significance may need to be established using separate permutations of the actual data for several MAF bins.

摘要

影响深度表型(多个相关表型)全基因组关联研究的一个重要问题是确定合适的总体显著水平阈值。对于 430 万个基因型和 335 个表型,直接进行简单的总体校正(Bonferroni),p < 0.05 的阈值将为 p < 3.46E-11。这将过于保守,因为它假设所有检验都是独立的。必须根据它们之间的相关性调整检验的有效数量,包括表型和基因型检验。目前,使用谱分解表型矩阵和基于 LD 的校正测试的 SNP 数量来确定有效检验数量。在本文中,我们将这些计算估计与置换确定的总体显著水平阈值进行了比较。置换是通过打乱该数据集基因型向量的个体 ID 来进行的,以保留表型的相关性。我们的结果表明,置换阈值受 SNP 的次要等位基因频率(MAF)和测试个体数量的影响。对于更常见的 SNP(MAF > 0.1),置换总体显著阈值与谱分解方法非常吻合。然而,对于不太常见的 SNP(0.05 < MAF ≤ 0.1),计算所有 SNP 的置换总体阈值相差几个数量级。这适用于研究的个体数量(此处为 777),但不适用于更大的数量。基于这些发现,我们提出,为了找到特定水平的总体显著,可能需要使用实际数据的不同置换来为几个 MAF 间隔建立阈值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ad1/7429341/3caa2d853484/nihms-1554000-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ad1/7429341/a1d4dfe79c2e/nihms-1554000-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ad1/7429341/3caa2d853484/nihms-1554000-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ad1/7429341/a1d4dfe79c2e/nihms-1554000-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ad1/7429341/3caa2d853484/nihms-1554000-f0002.jpg

相似文献

1
GWAS significance thresholds for deep phenotyping studies can depend upon minor allele frequencies and sample size.GWAS 显著性阈值对于深度表型研究可以取决于次要等位基因频率和样本量。
Mol Psychiatry. 2021 Jun;26(6):2048-2055. doi: 10.1038/s41380-020-0670-3. Epub 2020 Feb 17.
2
Characterization of DSM-IV Opioid Dependence Among Individuals of European Ancestry.DSM-IV 型阿片类药物依赖患者的特征。
J Stud Alcohol Drugs. 2019 May;80(3):319-330. doi: 10.15288/jsad.2019.80.319.
3
Significance testing and genomic inflation factor using high-density genotypes or whole-genome sequence data.使用高密度基因型或全基因组序列数据进行显著性检验和基因组膨胀因子分析。
J Anim Breed Genet. 2019 Nov;136(6):418-429. doi: 10.1111/jbg.12419. Epub 2019 Jun 19.
4
Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.).从大西洋鲑鱼(Salmo salar L.)混合 DNA 中进行经济有效的全基因组等位基因频率估计。
BMC Genomics. 2013 Jan 16;14:12. doi: 10.1186/1471-2164-14-12.
5
Establishing an adjusted p-value threshold to control the family-wide type 1 error in genome wide association studies.在全基因组关联研究中建立调整后的p值阈值以控制全家族I型错误。
BMC Genomics. 2008 Oct 31;9:516. doi: 10.1186/1471-2164-9-516.
6
Systematic permutation testing in GWAS pathway analyses: identification of genetic networks in dilated cardiomyopathy and ulcerative colitis.全基因组关联研究通路分析中的系统排列检验:扩张型心肌病和溃疡性结肠炎遗传网络的识别
BMC Genomics. 2014 Jul 22;15:622. doi: 10.1186/1471-2164-15-622.
7
Uncovering networks from genome-wide association studies via circular genomic permutation.通过环状基因组置换从全基因组关联研究中揭示网络
G3 (Bethesda). 2012 Sep;2(9):1067-75. doi: 10.1534/g3.112.002618. Epub 2012 Sep 1.
8
A unified approach for allele frequency estimation, SNP detection and association studies based on pooled sequencing data using EM algorithms.基于 EM 算法的基于测序数据的等位基因频率估计、SNP 检测和关联研究的统一方法。
BMC Genomics. 2013;14 Suppl 1(Suppl 1):S1. doi: 10.1186/1471-2164-14-S1-S1. Epub 2013 Jan 21.
9
Genetic variants associated with idiopathic pulmonary fibrosis susceptibility and mortality: a genome-wide association study.与特发性肺纤维化易感性和死亡率相关的遗传变异:全基因组关联研究。
Lancet Respir Med. 2013 Jun;1(4):309-317. doi: 10.1016/S2213-2600(13)70045-6. Epub 2013 Apr 17.
10
Assessing statistical significance in multivariable genome wide association analysis.评估多变量全基因组关联分析中的统计学显著性。
Bioinformatics. 2016 Jul 1;32(13):1990-2000. doi: 10.1093/bioinformatics/btw128. Epub 2016 Mar 7.

引用本文的文献

1
Fine mapping genetic variants affecting birth weight in sheep: a GWAS of 3007 individuals using low-coverage whole genome sequencing.精细定位影响绵羊出生体重的遗传变异:利用低覆盖度全基因组测序对3007只个体进行全基因组关联研究
J Anim Sci Biotechnol. 2025 Aug 12;16(1):115. doi: 10.1186/s40104-025-01251-4.
2
GWAS significance thresholds in large cohorts of European ancestry.欧洲血统大型队列中的全基因组关联研究显著性阈值。
Genetics. 2025 May 8;230(1). doi: 10.1093/genetics/iyaf056.
3
K-mer-based Approaches to Bridging Pangenomics and Population Genetics.
基于K-mer的泛基因组学与群体遗传学关联方法。
Mol Biol Evol. 2025 Mar 5;42(3). doi: 10.1093/molbev/msaf047.
4
The Significant Effects of Threshold Selection for Advancing Nitrogen Use Efficiency in Whole Genome of Bread Wheat.阈值选择对提高面包小麦全基因组氮素利用效率的显著影响
Plant Direct. 2025 Jan 21;9(1):e70036. doi: 10.1002/pld3.70036. eCollection 2025 Jan.
5
Endophenotype 2.0: updated definitions and criteria for endophenotypes of psychiatric disorders, incorporating new technologies and findings.内表型2.0:精神障碍内表型的更新定义和标准,纳入新技术和研究结果。
Transl Psychiatry. 2024 Dec 24;14(1):502. doi: 10.1038/s41398-024-03195-1.
6
Mapping Epigenetic Gene Variant Dynamics: Comparative Analysis of Frequency, Functional Impact and Trait Associations in African and European Populations.绘制表观遗传基因变异动态图谱:非洲和欧洲人群中频率、功能影响及性状关联的比较分析
medRxiv. 2024 Aug 12:2024.08.11.24311816. doi: 10.1101/2024.08.11.24311816.
7
An SNP Marker Predicts Colorectal Cancer Outcomes with 5-Fluorouracil-Based Adjuvant Chemotherapy Post-Resection.SNP 标志物预测结直肠癌患者接受氟尿嘧啶为基础的辅助化疗后的生存结局。
Int J Mol Sci. 2024 Jun 17;25(12):6642. doi: 10.3390/ijms25126642.
8
Cross-continental environmental and genome-wide association study on children and adolescent anxiety and depression.关于儿童和青少年焦虑与抑郁的跨大陆环境与全基因组关联研究。
Front Psychiatry. 2024 May 17;15:1384298. doi: 10.3389/fpsyt.2024.1384298. eCollection 2024.
9
SABER: Statistical Identification of Loci of Interest in GWAS Summary Statistics using a Bayesian Gaussian Mixture Model.SABER:使用贝叶斯高斯混合模型对全基因组关联研究汇总统计数据中的感兴趣位点进行统计识别。
AMIA Jt Summits Transl Sci Proc. 2024 May 31;2024:575-583. eCollection 2024.
10
Molecular genetics of neuropsychiatric illness: some musings.神经精神疾病的分子遗传学:一些思考。
Front Genet. 2023 Nov 1;14:1203017. doi: 10.3389/fgene.2023.1203017. eCollection 2023.