• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 GWAS 汇总数据对多种表型进行强大且高效的 SNP 集关联测试。

Powerful and efficient SNP-set association tests across multiple phenotypes using GWAS summary data.

机构信息

Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, USA.

出版信息

Bioinformatics. 2019 Apr 15;35(8):1366-1372. doi: 10.1093/bioinformatics/bty811.

DOI:10.1093/bioinformatics/bty811
PMID:30239606
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6477978/
Abstract

MOTIVATION

Many GWAS conducted in the past decade have identified tens of thousands of disease related variants, which in total explained only part of the heritability for most traits. There remain many more genetics variants with small effect sizes to be discovered. This has motivated the development of sequencing studies with larger sample sizes and increased resolution of genotyped variants, e.g., the ongoing NHLBI Trans-Omics for Precision Medicine (TOPMed) whole genome sequencing project. An alternative approach is the development of novel and more powerful statistical methods. The current dominating approach in the field of GWAS analysis is the "single trait single variant" association test, despite the fact that most GWAS are conducted in deeply-phenotyped cohorts with many correlated traits measured. In this paper, we aim to develop rigorous methods that integrate multiple correlated traits and multiple variants to improve the power to detect novel variants. In recognition of the difficulty of accessing raw genotype and phenotype data due to privacy and logistic concerns, we develop methods that are applicable to publicly available GWAS summary data.

RESULTS

We build rigorous statistical models for GWAS summary statistics to motivate novel multi-trait SNP-set association tests, including variance component test, burden test and their adaptive test, and develop efficient numerical algorithms to quickly compute their analytical P-values. We implement the proposed methods in an open source R package. We conduct thorough simulation studies to verify the proposed methods rigorously control type I errors at the genome-wide significance level, and further demonstrate their utility via comprehensive analysis of GWAS summary data for multiple lipids traits and glycemic traits. We identified many novel loci that were not detected by the individual trait based GWAS analysis.

AVAILABILITY AND IMPLEMENTATION

We have implemented the proposed methods in an R package freely available at http://www.github.com/baolinwu/MSKAT.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

过去十年进行的许多 GWAS 已经确定了数万种与疾病相关的变异,这些变异总共仅解释了大多数性状遗传力的一部分。还有更多遗传变异具有较小的效应大小有待发现。这促使人们开展了具有更大样本量和更高分辨率的基因分型变异的测序研究,例如,正在进行的 NHLBI 转化医学精准医学(TOPMed)全基因组测序项目。另一种方法是开发新的、更强大的统计方法。目前 GWAS 分析领域的主流方法是“单一性状单一变异”关联测试,尽管大多数 GWAS 都是在具有许多相关性状的深度表型队列中进行的。在本文中,我们旨在开发严格的方法,整合多个相关性状和多个变异,以提高检测新变异的能力。由于隐私和物流问题,我们认识到获取原始基因型和表型数据的困难,因此开发了适用于公开可用的 GWAS 汇总数据的方法。

结果

我们为 GWAS 汇总统计数据构建了严格的统计模型,以激发新的多性状 SNP 集关联测试,包括方差分量测试、负担测试及其自适应测试,并开发了高效的数值算法来快速计算其分析 P 值。我们在一个开源 R 包中实现了所提出的方法。我们进行了彻底的模拟研究,以严格控制基因组范围内显著水平的Ⅰ型错误,并通过对多个脂质性状和血糖性状的 GWAS 汇总数据进行综合分析进一步证明了其效用。我们确定了许多以前未被基于单个性状的 GWAS 分析检测到的新位点。

可用性和实现

我们已在一个免费的 R 包中实现了所提出的方法,可在 http://www.github.com/baolinwu/MSKAT 上获得。

补充信息

补充数据可在生物信息学在线获得。

相似文献

1
Powerful and efficient SNP-set association tests across multiple phenotypes using GWAS summary data.利用 GWAS 汇总数据对多种表型进行强大且高效的 SNP 集关联测试。
Bioinformatics. 2019 Apr 15;35(8):1366-1372. doi: 10.1093/bioinformatics/bty811.
2
Integrate multiple traits to detect novel trait-gene association using GWAS summary data with an adaptive test approach.利用 GWAS 汇总数据和自适应检验方法整合多种性状,以检测新的性状-基因关联。
Bioinformatics. 2019 Jul 1;35(13):2251-2257. doi: 10.1093/bioinformatics/bty961.
3
Pleiotropy informed adaptive association test of multiple traits using genome-wide association study summary data.利用全基因组关联研究汇总数据进行多性状的多效性知情适应性关联测试。
Biometrics. 2019 Dec;75(4):1076-1085. doi: 10.1111/biom.13076. Epub 2019 Aug 2.
4
CoMM-S2: a collaborative mixed model using summary statistics in transcriptome-wide association studies.CoMM-S2:一种基于转录组关联研究汇总统计信息的协作混合模型。
Bioinformatics. 2020 Apr 1;36(7):2009-2016. doi: 10.1093/bioinformatics/btz880.
5
Methods for meta-analysis of multiple traits using GWAS summary statistics.使用全基因组关联研究(GWAS)汇总统计量进行多性状荟萃分析的方法。
Genet Epidemiol. 2018 Mar;42(2):134-145. doi: 10.1002/gepi.22105. Epub 2017 Dec 10.
6
Powerful statistical method to detect disease-associated genes using publicly available genome-wide association studies summary data.利用公开的全基因组关联研究汇总数据,使用强大的统计方法来检测疾病相关基因。
Genet Epidemiol. 2019 Dec;43(8):941-951. doi: 10.1002/gepi.22251. Epub 2019 Aug 7.
7
Prioritizing genetic variants in GWAS with lasso using permutation-assisted tuning.使用排列辅助调优的lasso 优先考虑 GWAS 中的遗传变异。
Bioinformatics. 2020 Jun 1;36(12):3811-3817. doi: 10.1093/bioinformatics/btaa229.
8
Gene-based association tests using GWAS summary statistics.基于基因的关联测试,使用 GWAS 汇总统计数据。
Bioinformatics. 2019 Oct 1;35(19):3701-3708. doi: 10.1093/bioinformatics/btz172.
9
Gene- and pathway-based association tests for multiple traits with GWAS summary statistics.基于基因和通路的多性状关联测试与全基因组关联研究汇总统计。
Bioinformatics. 2017 Jan 1;33(1):64-71. doi: 10.1093/bioinformatics/btw577. Epub 2016 Sep 4.
10
RAISS: robust and accurate imputation from summary statistics.RAISS:从汇总统计数据中进行稳健且准确的推断。
Bioinformatics. 2019 Nov 1;35(22):4837-4839. doi: 10.1093/bioinformatics/btz466.

引用本文的文献

1
The goldmine of GWAS summary statistics: a systematic review of methods and tools.全基因组关联研究汇总统计数据的宝库:方法与工具的系统综述
BioData Min. 2024 Sep 5;17(1):31. doi: 10.1186/s13040-024-00385-x.
2
Inferring causal direction between two traits using R with application to transcriptome-wide association studies.使用 R 推断两个性状之间的因果关系及其在转录组关联研究中的应用。
Am J Hum Genet. 2024 Aug 8;111(8):1782-1795. doi: 10.1016/j.ajhg.2024.06.013. Epub 2024 Jul 24.
3
Summary statistics-based association test for identifying the pleiotropic effects with set of genetic variants.基于汇总统计的关联测试,用于识别具有一组遗传变异的多效性效应。
Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad182.
4
A Meta-Analysis of the Genome-Wide Association Studies on Two Genetically Correlated Phenotypes Suggests Four New Risk Loci for Headaches.一项针对两种遗传相关表型的全基因组关联研究的荟萃分析表明存在四个新的头痛风险基因座。
Phenomics. 2022 Nov 18;3(1):64-76. doi: 10.1007/s43657-022-00078-7. eCollection 2023 Feb.
5
A gene based combination test using GWAS summary data.基于 GWAS 汇总数据的基因组合测试。
BMC Bioinformatics. 2023 Jan 3;24(1):2. doi: 10.1186/s12859-022-05114-x.
6
Simultaneous detection of novel genes and SNPs by adaptive -value combination.通过适应性值组合同时检测新基因和单核苷酸多态性
Front Genet. 2022 Nov 17;13:1009428. doi: 10.3389/fgene.2022.1009428. eCollection 2022.
7
A comprehensive comparison of multilocus association methods with summary statistics in genome-wide association studies.全基因组关联研究中基于汇总统计量与多位点关联分析方法的全面比较。
BMC Bioinformatics. 2022 Aug 30;23(1):359. doi: 10.1186/s12859-022-04897-3.
8
A Bayesian hierarchically structured prior for gene-based association testing with multiple traits in genome-wide association studies.基于贝叶斯层次结构的先验方法在全基因组关联研究中用于多性状基因关联检验。
Genet Epidemiol. 2022 Feb;46(1):63-72. doi: 10.1002/gepi.22437. Epub 2021 Nov 17.
9
Coupled mixed model for joint genetic analysis of complex disorders with two independently collected data sets.用于对具有两个独立收集数据集的复杂疾病进行联合遗传分析的耦合混合模型。
BMC Bioinformatics. 2021 Feb 5;22(1):50. doi: 10.1186/s12859-021-03959-2.
10
TS: a powerful truncated test to detect novel disease associated genes using publicly available gWAS summary data.TS:一种强大的截断测试,用于使用公开可用的 gWAS 汇总数据检测新的疾病相关基因。
BMC Bioinformatics. 2020 May 4;21(1):172. doi: 10.1186/s12859-020-3511-0.

本文引用的文献

1
Statistical methods to detect novel genetic variants using publicly available GWAS summary data.利用公开的 GWAS 汇总数据检测新型遗传变异的统计方法。
Comput Biol Chem. 2018 Jun;74:76-79. doi: 10.1016/j.compbiolchem.2018.02.016. Epub 2018 Mar 1.
2
BAYESIAN LARGE-SCALE MULTIPLE REGRESSION WITH SUMMARY STATISTICS FROM GENOME-WIDE ASSOCIATION STUDIES.基于全基因组关联研究汇总统计量的贝叶斯大规模多元回归
Ann Appl Stat. 2017;11(3):1561-1592. doi: 10.1214/17-aoas1046. Epub 2017 Oct 5.
3
10 Years of GWAS Discovery: Biology, Function, and Translation.全基因组关联研究十年发现:生物学、功能与转化
Am J Hum Genet. 2017 Jul 6;101(1):5-22. doi: 10.1016/j.ajhg.2017.06.005.
4
Identification of Quantitative Trait Loci That Determine Plasma Total-Cholesterol and Triglyceride Concentrations in DDD/Sgn and C57BL/6J Inbred Mice.在DDD/Sgn和C57BL/6J近交系小鼠中确定决定血浆总胆固醇和甘油三酯浓度的数量性状基因座。
Cholesterol. 2017;2017:3178204. doi: 10.1155/2017/3178204. Epub 2017 May 31.
5
Discovery and fine-mapping of adiposity loci using high density imputation of genome-wide association studies in individuals of African ancestry: African Ancestry Anthropometry Genetics Consortium.利用非洲裔个体全基因组关联研究的高密度归因法发现肥胖位点并进行精细定位:非洲裔人体测量学遗传学联盟
PLoS Genet. 2017 Apr 21;13(4):e1006719. doi: 10.1371/journal.pgen.1006719. eCollection 2017 Apr.
6
Dissecting the genetics of complex traits using summary association statistics.利用汇总关联统计剖析复杂性状的遗传学。
Nat Rev Genet. 2017 Feb;18(2):117-127. doi: 10.1038/nrg.2016.142. Epub 2016 Nov 14.
7
Fast set-based association analysis using summary data from GWAS identifies novel gene loci for human complex traits.基于汇总 GWAS 数据的快速集合关联分析确定了人类复杂性状的新基因座。
Sci Rep. 2016 Sep 8;6:32894. doi: 10.1038/srep32894.
8
Gene- and pathway-based association tests for multiple traits with GWAS summary statistics.基于基因和通路的多性状关联测试与全基因组关联研究汇总统计。
Bioinformatics. 2017 Jan 1;33(1):64-71. doi: 10.1093/bioinformatics/btw577. Epub 2016 Sep 4.
9
metaCCA: summary statistics-based multivariate meta-analysis of genome-wide association studies using canonical correlation analysis.metaCCA:基于全基因组关联研究汇总统计量,运用典型相关分析的多变量荟萃分析。
Bioinformatics. 2016 Jul 1;32(13):1981-9. doi: 10.1093/bioinformatics/btw052. Epub 2016 Feb 19.
10
A Statistical Approach for Testing Cross-Phenotype Effects of Rare Variants.一种用于检验罕见变异的跨表型效应的统计方法。
Am J Hum Genet. 2016 Mar 3;98(3):525-540. doi: 10.1016/j.ajhg.2016.01.017.