• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全基因组关联研究中基于群体感知排列的显著性阈值

Population-aware permutation-based significance thresholds for genome-wide association studies.

作者信息

John Maura, Korte Arthur, Todesco Marco, Grimm Dominik G

机构信息

Technical University of Munich, TUM Campus Straubing for Biotechnology and Sustainability, Bioinformatics, 94315 Straubing, Germany.

Weihenstephan-Triesdorf University of Applied Sciences, Bioinformatics, 94315 Straubing, Germany.

出版信息

Bioinform Adv. 2024 Oct 28;4(1):vbae168. doi: 10.1093/bioadv/vbae168. eCollection 2024.

DOI:10.1093/bioadv/vbae168
PMID:39678204
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11639184/
Abstract

MOTIVATION

Permutation-based significance thresholds have been shown to be a robust alternative to classical Bonferroni significance thresholds in genome-wide association studies (GWAS) for skewed phenotype distributions. The recently published method permGWAS introduced a batch-wise approach to efficiently compute permutation-based GWAS. However, running multiple univariate tests in parallel leads to many repetitive computations and increased computational resources. More importantly, traditional permutation methods that permute only the phenotype break the underlying population structure.

RESULTS

We propose permGWAS2, an improved method that does not break the population structure during permutations and uses an elegant block matrix decomposition to optimize computations, thereby reducing redundancies. We show on synthetic data that this improved approach yields a lower false discovery rate for skewed phenotype distributions compared to the previous version and the commonly used Bonferroni correction. In addition, we re-analyze a dataset covering phenotypic variation in 86 traits in a population of 615 wild sunflowers ( L.). This led to the identification of dozens of novel associations with putatively adaptive traits, and removed several likely false-positive associations with limited biological support.

AVAILABILITY AND IMPLEMENTATION

permGWAS2 is open-source and publicly available on GitHub for download: https://github.com/grimmlab/permGWAS.

摘要

动机

在全基因组关联研究(GWAS)中,对于偏态表型分布,基于排列的显著性阈值已被证明是经典邦费罗尼显著性阈值的一种稳健替代方法。最近发表的permGWAS方法引入了一种分批方法来高效计算基于排列的GWAS。然而,并行运行多个单变量检验会导致许多重复计算并增加计算资源。更重要的是,仅对表型进行排列的传统排列方法会破坏潜在的群体结构。

结果

我们提出了permGWAS2,这是一种改进方法,在排列过程中不会破坏群体结构,并使用一种巧妙的块矩阵分解来优化计算,从而减少冗余。我们在合成数据上表明,与先前版本和常用的邦费罗尼校正相比,这种改进方法对于偏态表型分布产生的错误发现率更低。此外,我们重新分析了一个数据集,该数据集涵盖了615株野生向日葵(L.)群体中86个性状的表型变异。这导致识别出数十个与假定适应性性状的新关联,并消除了一些缺乏生物学支持的可能假阳性关联。

可用性和实现

permGWAS2是开源的,可在GitHub上公开下载:https://github.com/grimmlab/permGWAS。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c34/11639184/78e4bf88b3cb/vbae168f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c34/11639184/c634cad0ab2b/vbae168f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c34/11639184/b82a78c423e9/vbae168f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c34/11639184/36bdc16ee1c0/vbae168f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c34/11639184/78e4bf88b3cb/vbae168f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c34/11639184/c634cad0ab2b/vbae168f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c34/11639184/b82a78c423e9/vbae168f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c34/11639184/36bdc16ee1c0/vbae168f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c34/11639184/78e4bf88b3cb/vbae168f4.jpg

相似文献

1
Population-aware permutation-based significance thresholds for genome-wide association studies.全基因组关联研究中基于群体感知排列的显著性阈值
Bioinform Adv. 2024 Oct 28;4(1):vbae168. doi: 10.1093/bioadv/vbae168. eCollection 2024.
2
Efficient permutation-based genome-wide association studies for normal and skewed phenotypic distributions.高效的基于排列的全基因组关联研究,适用于正态和偏态表型分布。
Bioinformatics. 2022 Sep 16;38(Suppl_2):ii5-ii12. doi: 10.1093/bioinformatics/btac455.
3
The benefits of permutation-based genome-wide association studies.基于排列的全基因组关联研究的优势。
J Exp Bot. 2024 Sep 11;75(17):5377-5389. doi: 10.1093/jxb/erae280.
4
GWAS significance thresholds for deep phenotyping studies can depend upon minor allele frequencies and sample size.GWAS 显著性阈值对于深度表型研究可以取决于次要等位基因频率和样本量。
Mol Psychiatry. 2021 Jun;26(6):2048-2055. doi: 10.1038/s41380-020-0670-3. Epub 2020 Feb 17.
5
Systematic permutation testing in GWAS pathway analyses: identification of genetic networks in dilated cardiomyopathy and ulcerative colitis.全基因组关联研究通路分析中的系统排列检验:扩张型心肌病和溃疡性结肠炎遗传网络的识别
BMC Genomics. 2014 Jul 22;15:622. doi: 10.1186/1471-2164-15-622.
6
Uncovering networks from genome-wide association studies via circular genomic permutation.通过环状基因组置换从全基因组关联研究中揭示网络
G3 (Bethesda). 2012 Sep;2(9):1067-75. doi: 10.1534/g3.112.002618. Epub 2012 Sep 1.
7
Optimized permutation testing for information theoretic measures of multi-gene interactions.优化排列检验用于多基因相互作用的信息理论测度。
BMC Bioinformatics. 2021 Apr 7;22(1):180. doi: 10.1186/s12859-021-04107-6.
8
Prioritizing genetic variants in GWAS with lasso using permutation-assisted tuning.使用排列辅助调优的lasso 优先考虑 GWAS 中的遗传变异。
Bioinformatics. 2020 Jun 1;36(12):3811-3817. doi: 10.1093/bioinformatics/btaa229.
9
MPI-GWAS: a supercomputing-aided permutation approach for genomewide association studies.MPI-全基因组关联研究:一种用于全基因组关联研究的超级计算辅助置换方法。
Genomics Inform. 2022 Mar;20(1):e14. doi: 10.5808/gi.22001. Epub 2022 Mar 31.
10
Integrate multiple traits to detect novel trait-gene association using GWAS summary data with an adaptive test approach.利用 GWAS 汇总数据和自适应检验方法整合多种性状,以检测新的性状-基因关联。
Bioinformatics. 2019 Jul 1;35(13):2251-2257. doi: 10.1093/bioinformatics/bty961.

引用本文的文献

1
FlexLMM: a Nextflow linear mixed model framework for GWAS.FlexLMM:一种用于全基因组关联研究的Nextflow线性混合模型框架。
Bioinformatics. 2024 Dec 26;41(1). doi: 10.1093/bioinformatics/btaf021.

本文引用的文献

1
HeliantHOME, a public and centralized database of phenotypic sunflower data.HeliantHOME,一个公开且集中的向日葵表型数据库。
Sci Data. 2022 Nov 30;9(1):735. doi: 10.1038/s41597-022-01842-0.
2
Efficient permutation-based genome-wide association studies for normal and skewed phenotypic distributions.高效的基于排列的全基因组关联研究,适用于正态和偏态表型分布。
Bioinformatics. 2022 Sep 16;38(Suppl_2):ii5-ii12. doi: 10.1093/bioinformatics/btac455.
3
Massive haplotypes underlie ecotypic differentiation in sunflowers.大片段单倍型是向日葵生态型分化的基础。
Nature. 2020 Aug;584(7822):602-607. doi: 10.1038/s41586-020-2467-6. Epub 2020 Jul 8.
4
Imputation of 3 million SNPs in the Arabidopsis regional mapping population.在拟南芥区域作图群体中对 300 万个 SNPs 进行了插补。
Plant J. 2020 May;102(4):872-882. doi: 10.1111/tpj.14659. Epub 2020 Feb 11.
5
A resource-efficient tool for mixed model association analysis of large-scale data.一种资源高效的工具,用于大规模数据的混合模型关联分析。
Nat Genet. 2019 Dec;51(12):1749-1755. doi: 10.1038/s41588-019-0530-8. Epub 2019 Nov 25.
6
Methods and Tools in Genome-wide Association Studies.全基因组关联研究中的方法与工具
Methods Mol Biol. 2018;1819:93-136. doi: 10.1007/978-1-4939-8618-7_5.
7
The sunflower genome provides insights into oil metabolism, flowering and Asterid evolution.向日葵基因组为油脂代谢、开花和菊类植物进化提供了线索。
Nature. 2017 Jun 1;546(7656):148-152. doi: 10.1038/nature22380. Epub 2017 May 22.
8
easyGWAS: A Cloud-Based Platform for Comparing the Results of Genome-Wide Association Studies.easyGWAS:一个用于比较全基因组关联研究结果的基于云的平台。
Plant Cell. 2017 Jan;29(1):5-19. doi: 10.1105/tpc.16.00551. Epub 2016 Dec 16.
9
Genome-wide association of multiple complex traits in outbred mice by ultra-low-coverage sequencing.通过超低覆盖度测序对远交系小鼠多种复杂性状进行全基因组关联分析。
Nat Genet. 2016 Aug;48(8):912-8. doi: 10.1038/ng.3595. Epub 2016 Jul 4.
10
Genome-wide detection of intervals of genetic heterogeneity associated with complex traits.全基因组检测与复杂性状相关的遗传异质性区间
Bioinformatics. 2015 Jun 15;31(12):i240-9. doi: 10.1093/bioinformatics/btv263.