• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于基因的测试方法,该方法考虑了附近基因之间测试的相关性。

An approach to gene-based testing accounting for dependence of tests among nearby genes.

机构信息

Department of Statistics & Data Science, Carnegie Mellon University, Pittsburgh, PA, USA.

Department of Computational Biology, Carnegie Mellon University, USA.

出版信息

Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab329.

DOI:10.1093/bib/bbab329
PMID:34459489
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8575037/
Abstract

In genome-wide association studies (GWAS), it has become commonplace to test millions of single-nucleotide polymorphisms (SNPs) for phenotypic association. Gene-based testing can improve power to detect weak signal by reducing multiple testing and pooling signal strength. While such tests account for linkage disequilibrium (LD) structure of SNP alleles within each gene, current approaches do not capture LD of SNPs falling in different nearby genes, which can induce correlation of gene-based test statistics. We introduce an algorithm to account for this correlation. When a gene's test statistic is independent of others, it is assessed separately; when test statistics for nearby genes are strongly correlated, their SNPs are agglomerated and tested as a locus. To provide insight into SNPs and genes driving association within loci, we develop an interactive visualization tool to explore localized signal. We demonstrate our approach in the context of weakly powered GWAS for autism spectrum disorder, which is contrasted to more highly powered GWAS for schizophrenia and educational attainment. To increase power for these analyses, especially those for autism, we use adaptive $P$-value thresholding, guided by high-dimensional metadata modeled with gradient boosted trees, highlighting when and how it can be most useful. Notably our workflow is based on summary statistics.

摘要

在全基因组关联研究(GWAS)中,对表型关联进行数百万个单核苷酸多态性(SNP)的基因检测已经很常见。基因检测可以通过减少多重检验和汇集信号强度来提高检测弱信号的能力。虽然这些测试考虑了每个基因中 SNP 等位基因的连锁不平衡(LD)结构,但目前的方法无法捕获落在不同附近基因中的 SNP 的 LD,这会导致基因检测统计数据的相关性。我们引入了一种算法来考虑这种相关性。当一个基因的检测统计量与其他基因独立时,将单独评估;当附近基因的检测统计量高度相关时,将它们的 SNP 聚集成一个位点进行检测。为了深入了解驱动基因座内关联的 SNP 和基因,我们开发了一个交互式可视化工具来探索局部信号。我们在自闭症谱系障碍的弱效 GWAS 背景下展示了我们的方法,并与精神分裂症和教育程度的高功效 GWAS 进行了对比。为了提高这些分析的功效,尤其是自闭症的分析功效,我们使用了基于梯度提升树建模的高维元数据指导的自适应 P 值阈值,突出显示何时以及如何最有用。值得注意的是,我们的工作流程基于汇总统计数据。

相似文献

1
An approach to gene-based testing accounting for dependence of tests among nearby genes.一种基于基因的测试方法,该方法考虑了附近基因之间测试的相关性。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab329.
2
A selective inference approach for false discovery rate control using multiomics covariates yields insights into disease risk.一种基于多组学协变量的假发现率控制的选择性推理方法为疾病风险提供了新的见解。
Proc Natl Acad Sci U S A. 2020 Jun 30;117(26):15028-15035. doi: 10.1073/pnas.1918862117. Epub 2020 Jun 10.
3
The genome-wide risk alleles for psychiatric disorders at 3p21.1 show convergent effects on mRNA expression, cognitive function, and mushroom dendritic spine.精神疾病的全基因组风险等位基因在 3p21.1 上显示出对 mRNA 表达、认知功能和蘑菇状树突棘的趋同效应。
Mol Psychiatry. 2020 Jan;25(1):48-66. doi: 10.1038/s41380-019-0592-0. Epub 2019 Nov 13.
4
A powerful and versatile colocalization test.一种强大且多功能的共定位测试。
PLoS Comput Biol. 2020 Apr 10;16(4):e1007778. doi: 10.1371/journal.pcbi.1007778. eCollection 2020 Apr.
5
Detection of common single nucleotide polymorphisms synthesizing quantitative trait association of rarer causal variants.检测常见的单核苷酸多态性,合成罕见因果变异的数量性状关联。
Genome Res. 2011 Jul;21(7):1122-30. doi: 10.1101/gr.115832.110. Epub 2011 Mar 25.
6
A method combining a random forest-based technique with the modeling of linkage disequilibrium through latent variables, to run multilocus genome-wide association studies.一种结合基于随机森林的技术和通过潜在变量进行连锁不平衡建模的方法,用于进行多基因座全基因组关联研究。
BMC Bioinformatics. 2018 Mar 27;19(1):106. doi: 10.1186/s12859-018-2054-0.
7
Exploiting Linkage Disequilibrium for Ultrahigh-Dimensional Genome-Wide Data with an Integrated Statistical Approach.利用连锁不平衡和综合统计方法处理超高维全基因组数据
Genetics. 2016 Feb;202(2):411-26. doi: 10.1534/genetics.115.179507. Epub 2015 Dec 12.
8
Gene, pathway and network frameworks to identify epistatic interactions of single nucleotide polymorphisms derived from GWAS data.用于识别源自全基因组关联研究(GWAS)数据的单核苷酸多态性上位性相互作用的基因、通路和网络框架。
BMC Syst Biol. 2012;6 Suppl 3(Suppl 3):S15. doi: 10.1186/1752-0509-6-S3-S15. Epub 2012 Dec 17.
9
Genetic control of gene expression at novel and established chronic obstructive pulmonary disease loci.新发和已确定的慢性阻塞性肺疾病基因座处基因表达的遗传控制。
Hum Mol Genet. 2015 Feb 15;24(4):1200-10. doi: 10.1093/hmg/ddu525. Epub 2014 Oct 14.
10
Colocalization of GWAS and eQTL signals at loci with multiple signals identifies additional candidate genes for body fat distribution.在具有多个信号的 GWAS 和 eQTL 信号的共定位鉴定了身体脂肪分布的其他候选基因。
Hum Mol Genet. 2019 Dec 15;28(24):4161-4172. doi: 10.1093/hmg/ddz263.

引用本文的文献

1
The goldmine of GWAS summary statistics: a systematic review of methods and tools.全基因组关联研究汇总统计数据的宝库:方法与工具的系统综述
BioData Min. 2024 Sep 5;17(1):31. doi: 10.1186/s13040-024-00385-x.
2
CWAS-Plus: estimating category-wide association of rare noncoding variation from whole-genome sequencing data with cell-type-specific functional data.CWAS-Plus:基于全基因组测序数据和细胞类型特异性功能数据估计全类别罕见非编码变异的关联。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae323.
3
CWAS-Plus: Estimating category-wide association of rare noncoding variation from whole-genome sequencing data with cell-type-specific functional data.CWAS-Plus:利用全基因组测序数据和细胞类型特异性功能数据估计罕见非编码变异的全类别关联。
medRxiv. 2024 Apr 15:2024.04.15.24305828. doi: 10.1101/2024.04.15.24305828.

本文引用的文献

1
H-MAGMA, inheriting a shaky statistical foundation, yields excess false positives.H-MAGMA继承了一个不可靠的统计基础,会产生过多的假阳性结果。
Ann Hum Genet. 2021 May;85(3-4):97-100. doi: 10.1111/ahg.12412. Epub 2020 Dec 29.
2
Tauopathies: Deciphering Disease Mechanisms to Develop Effective Therapies.tau 病:破解疾病机制以开发有效疗法。
Int J Mol Sci. 2020 Nov 25;21(23):8948. doi: 10.3390/ijms21238948.
3
A selective inference approach for false discovery rate control using multiomics covariates yields insights into disease risk.一种基于多组学协变量的假发现率控制的选择性推理方法为疾病风险提供了新的见解。
Proc Natl Acad Sci U S A. 2020 Jun 30;117(26):15028-15035. doi: 10.1073/pnas.1918862117. Epub 2020 Jun 10.
4
The mutational constraint spectrum quantified from variation in 141,456 humans.从 141456 名人类个体的变异中量化的突变约束谱。
Nature. 2020 May;581(7809):434-443. doi: 10.1038/s41586-020-2308-7. Epub 2020 May 27.
5
Whole-Genome and RNA Sequencing Reveal Variation and Transcriptomic Coordination in the Developing Human Prefrontal Cortex.全基因组和 RNA 测序揭示人类前额叶皮层发育过程中的变异和转录组协调。
Cell Rep. 2020 Apr 7;31(1):107489. doi: 10.1016/j.celrep.2020.03.053.
6
A computational tool (H-MAGMA) for improved prediction of brain-disorder risk genes by incorporating brain chromatin interaction profiles.一种计算工具 (H-MAGMA),通过整合大脑染色质互作谱,提高了大脑疾病风险基因的预测能力。
Nat Neurosci. 2020 Apr;23(4):583-593. doi: 10.1038/s41593-020-0603-0. Epub 2020 Mar 9.
7
Large-Scale Exome Sequencing Study Implicates Both Developmental and Functional Changes in the Neurobiology of Autism.大规模外显子组测序研究表明自闭症的神经生物学既有发育性变化也有功能性变化。
Cell. 2020 Feb 6;180(3):568-584.e23. doi: 10.1016/j.cell.2019.12.036. Epub 2020 Jan 23.
8
A gene co-expression network-based analysis of multiple brain tissues reveals novel genes and molecular pathways underlying major depression.基于基因共表达网络的多脑区分析揭示了重度抑郁症的新基因和分子途径。
PLoS Genet. 2019 Jul 15;15(7):e1008245. doi: 10.1371/journal.pgen.1008245. eCollection 2019 Jul.
9
Identification of common genetic risk variants for autism spectrum disorder.孤独症谱系障碍常见遗传风险变异的鉴定。
Nat Genet. 2019 Mar;51(3):431-444. doi: 10.1038/s41588-019-0344-8. Epub 2019 Feb 25.
10
Contribution of rare and common variants to intellectual disability in a sub-isolate of Northern Finland.芬兰北部一个亚群集人群中智力障碍的罕见和常见变异的贡献。
Nat Commun. 2019 Jan 24;10(1):410. doi: 10.1038/s41467-018-08262-y.