• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

sumSTAAR:一种基于基因的关联研究的灵活框架,使用 GWAS 汇总统计数据。

sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics.

机构信息

Laboratory of Segregation and Recombination Analyses, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia.

Laboratory of Animal Genetics, Vavilov Institute of General Genetics, the Russian Academy of Sciences, Moscow, Russia.

出版信息

PLoS Comput Biol. 2022 Jun 2;18(6):e1010172. doi: 10.1371/journal.pcbi.1010172. eCollection 2022 Jun.

DOI:10.1371/journal.pcbi.1010172
PMID:35653402
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9197066/
Abstract

Gene-based association analysis is an effective gene-mapping tool. Many gene-based methods have been proposed recently. However, their power depends on the underlying genetic architecture, which is rarely known in complex traits, and so it is likely that a combination of such methods could serve as a universal approach. Several frameworks combining different gene-based methods have been developed. However, they all imply a fixed set of methods, weights and functional annotations. Moreover, most of them use individual phenotypes and genotypes as input data. Here, we introduce sumSTAAR, a framework for gene-based association analysis using summary statistics obtained from genome-wide association studies (GWAS). It is an extended and modified version of STAAR framework proposed by Li and colleagues in 2020. The sumSTAAR framework offers a wider range of gene-based methods to combine. It allows the user to arbitrarily define a set of these methods, weighting functions and probabilities of genetic variants being causal. The methods used in the framework were adapted to analyse genes with large number of SNPs to decrease the running time. The framework includes the polygene pruning procedure to guard against the influence of the strong GWAS signals outside the gene. We also present new improved matrices of correlations between the genotypes of variants within genes. These matrices estimated on a sample of 265,000 individuals are a state-of-the-art replacement of widely used matrices based on the 1000 Genomes Project data.

摘要

基于基因的关联分析是一种有效的基因映射工具。最近已经提出了许多基于基因的方法。然而,它们的功效取决于潜在的遗传结构,而在复杂性状中很少知道这种结构,因此,结合这些方法可能是一种通用的方法。已经开发了几种结合不同基于基因的方法的框架。然而,它们都暗示了一组固定的方法、权重和功能注释。此外,它们大多数都使用个体表型和基因型作为输入数据。在这里,我们介绍 sumSTAAR,这是一个使用从全基因组关联研究(GWAS)中获得的汇总统计数据进行基于基因的关联分析的框架。它是 Li 及其同事在 2020 年提出的 STAAR 框架的扩展和修改版本。sumSTAAR 框架提供了更广泛的基于基因的方法来组合。它允许用户任意定义一组这些方法、权重函数以及遗传变异是因果关系的概率。框架中使用的方法经过了改编,可以分析具有大量 SNP 的基因,以减少运行时间。该框架包括多基因修剪过程,以防止基因外强 GWAS 信号的影响。我们还提出了新的改进的基因内变异基因型之间相关性矩阵。这些在 265000 个人的样本上估计的矩阵是广泛使用的基于 1000 基因组计划数据的矩阵的最新替代方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f95e/9197066/b89d0285ab18/pcbi.1010172.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f95e/9197066/740d3af25299/pcbi.1010172.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f95e/9197066/9c84f18ecbb8/pcbi.1010172.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f95e/9197066/b89d0285ab18/pcbi.1010172.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f95e/9197066/740d3af25299/pcbi.1010172.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f95e/9197066/9c84f18ecbb8/pcbi.1010172.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f95e/9197066/b89d0285ab18/pcbi.1010172.g003.jpg

相似文献

1
sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics.sumSTAAR:一种基于基因的关联研究的灵活框架,使用 GWAS 汇总统计数据。
PLoS Comput Biol. 2022 Jun 2;18(6):e1010172. doi: 10.1371/journal.pcbi.1010172. eCollection 2022 Jun.
2
Investigation of multi-trait associations using pathway-based analysis of GWAS summary statistics.基于 GWAS 汇总统计数据的通路分析探究多性状关联。
BMC Genomics. 2019 Feb 4;20(Suppl 1):79. doi: 10.1186/s12864-018-5373-7.
3
How powerful are summary-based methods for identifying expression-trait associations under different genetic architectures?基于汇总数据的方法在不同遗传结构下识别表达性状关联的能力有多强?
Pac Symp Biocomput. 2018;23:228-239.
4
Comparison of two multi-trait association testing methods and sequence-based fine mapping of six additive QTL in Swiss Large White pigs.比较两种多性状关联测试方法和瑞士大白猪六个加性 QTL 的基于序列的精细定位。
BMC Genomics. 2023 Apr 10;24(1):192. doi: 10.1186/s12864-023-09295-4.
5
A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study.一种功能信息的基于集合的分析的通用框架:在大规模结直肠癌研究中的应用。
PLoS Genet. 2020 Aug 24;16(8):e1008947. doi: 10.1371/journal.pgen.1008947. eCollection 2020 Aug.
6
Integrating eQTL data with GWAS summary statistics in pathway-based analysis with application to schizophrenia.在基于通路的分析中将表达数量性状基因座(eQTL)数据与全基因组关联研究(GWAS)汇总统计数据相结合,并应用于精神分裂症研究。
Genet Epidemiol. 2018 Apr;42(3):303-316. doi: 10.1002/gepi.22110. Epub 2018 Feb 7.
7
Integrate multiple traits to detect novel trait-gene association using GWAS summary data with an adaptive test approach.利用 GWAS 汇总数据和自适应检验方法整合多种性状,以检测新的性状-基因关联。
Bioinformatics. 2019 Jul 1;35(13):2251-2257. doi: 10.1093/bioinformatics/bty961.
8
A gene based combination test using GWAS summary data.基于 GWAS 汇总数据的基因组合测试。
BMC Bioinformatics. 2023 Jan 3;24(1):2. doi: 10.1186/s12859-022-05114-x.
9
JEPEG: a summary statistics based tool for gene-level joint testing of functional variants.JEPEG:一种基于汇总统计量的功能变异基因水平联合检测工具。
Bioinformatics. 2015 Apr 15;31(8):1176-82. doi: 10.1093/bioinformatics/btu816. Epub 2014 Dec 12.
10
Endometrial vezatin and its association with endometriosis risk.子宫内膜 vezatin 及其与子宫内膜异位症风险的关联。
Hum Reprod. 2016 May;31(5):999-1013. doi: 10.1093/humrep/dew047. Epub 2016 Mar 22.

引用本文的文献

1
A New Method for Conditional Gene-Based Analysis Effectively Accounts for the Regional Polygenic Background.一种新的基于条件基因的分析方法能够有效地解释区域多基因背景。
Genes (Basel). 2024 Sep 7;15(9):1174. doi: 10.3390/genes15091174.
2
Multi-Trait Exome-Wide Association Study of Back Pain-Related Phenotypes.多特征外显子组关联研究与腰痛相关表型。
Genes (Basel). 2023 Oct 19;14(10):1962. doi: 10.3390/genes14101962.
3
A gene based combination test using GWAS summary data.基于 GWAS 汇总数据的基因组合测试。

本文引用的文献

1
A generalized linear mixed model association tool for biobank-scale data.一种用于生物样本库规模数据的广义线性混合模型关联工具。
Nat Genet. 2021 Nov;53(11):1616-1621. doi: 10.1038/s41588-021-00954-4. Epub 2021 Nov 4.
2
Gene-based association analysis identifies 190 genes affecting neuroticism.基于基因的关联分析确定了 190 个影响神经质的基因。
Sci Rep. 2021 Jan 28;11(1):2484. doi: 10.1038/s41598-021-82123-5.
3
Integrating comprehensive functional annotations to boost power and accuracy in gene-based association analysis.整合全面的功能注释以提高基于基因的关联分析的功效和准确性。
BMC Bioinformatics. 2023 Jan 3;24(1):2. doi: 10.1186/s12859-022-05114-x.
PLoS Genet. 2020 Dec 15;16(12):e1009060. doi: 10.1371/journal.pgen.1009060. eCollection 2020 Dec.
4
Dynamic incorporation of multiple in silico functional annotations empowers rare variant association analysis of large whole-genome sequencing studies at scale.大规模全基因组测序研究中通过多种计算功能注释的动态整合增强罕见变异关联分析。
Nat Genet. 2020 Sep;52(9):969-983. doi: 10.1038/s41588-020-0676-4. Epub 2020 Aug 24.
5
A rank-based normalization method with the fully adjusted full-stage procedure in genetic association studies.基于秩的标准化方法与全调整全阶段程序在遗传关联研究中的应用。
PLoS One. 2020 Jun 19;15(6):e0233847. doi: 10.1371/journal.pone.0233847. eCollection 2020.
6
Genome-Wide Gene-Based Multi-Trait Analysis.全基因组基于基因的多性状分析
Front Genet. 2020 May 19;11:437. doi: 10.3389/fgene.2020.00437. eCollection 2020.
7
Multi-trait analysis of rare-variant association summary statistics using MTAR.使用 MTAR 进行罕见变异关联汇总统计的多性状分析。
Nat Commun. 2020 Jun 5;11(1):2850. doi: 10.1038/s41467-020-16591-0.
8
Convex combination sequence kernel association test for rare-variant studies.凸组合序列核关联检验在罕见变异研究中的应用。
Genet Epidemiol. 2020 Jun;44(4):352-367. doi: 10.1002/gepi.22287. Epub 2020 Feb 26.
9
A generalized model for combining dependent SNP-level summary statistics and its extensions to statistics of other levels.一种用于合并依赖 SNP 水平汇总统计数据的广义模型及其在其他水平统计数据上的扩展。
Sci Rep. 2019 Apr 2;9(1):5461. doi: 10.1038/s41598-019-41827-5.
10
Gene-based association tests using GWAS summary statistics.基于基因的关联测试,使用 GWAS 汇总统计数据。
Bioinformatics. 2019 Oct 1;35(19):3701-3708. doi: 10.1093/bioinformatics/btz172.