• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用关联贝叶斯概率进行 SNP 优先级排序。

SNP prioritization using a Bayesian probability of association.

机构信息

Department of Health Sciences, University of Leicester, Leicester, United Kingdom.

出版信息

Genet Epidemiol. 2013 Feb;37(2):214-21. doi: 10.1002/gepi.21704. Epub 2012 Dec 26.

DOI:10.1002/gepi.21704
PMID:23280596
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3725584/
Abstract

Prioritization is the process whereby a set of possible candidate genes or SNPs is ranked so that the most promising can be taken forward into further studies. In a genome-wide association study, prioritization is usually based on the P-values alone, but researchers sometimes take account of external annotation information about the SNPs such as whether the SNP lies close to a good candidate gene. Using external information in this way is inherently subjective and is often not formalized, making the analysis difficult to reproduce. Building on previous work that has identified 14 important types of external information, we present an approximate Bayesian analysis that produces an estimate of the probability of association. The calculation combines four sources of information: the genome-wide data, SNP information derived from bioinformatics databases, empirical SNP weights, and the researchers' subjective prior opinions. The calculation is fast enough that it can be applied to millions of SNPS and although it does rely on subjective judgments, those judgments are made explicit so that the final SNP selection can be reproduced. We show that the resulting probability of association is intuitively more appealing than the P-value because it is easier to interpret and it makes allowance for the power of the study. We illustrate the use of the probability of association for SNP prioritization by applying it to a meta-analysis of kidney function genome-wide association studies and demonstrate that SNP selection performs better using the probability of association compared with P-values alone.

摘要

优先级排序是指对一组可能的候选基因或单核苷酸多态性(SNP)进行排序,以便将最有前途的候选基因或 SNP 进一步推进到后续研究中。在全基因组关联研究中,优先级排序通常仅基于 P 值,但研究人员有时会考虑 SNP 的外部注释信息,例如 SNP 是否靠近一个好的候选基因。这种使用外部信息的方法本质上是主观的,并且通常没有形式化,使得分析难以重现。基于先前已经确定的 14 种重要类型的外部信息的工作,我们提出了一种近似贝叶斯分析,该分析可以对关联的概率进行估计。该计算将四个来源的信息结合在一起:全基因组数据、来自生物信息学数据库的 SNP 信息、经验性 SNP 权重以及研究人员的主观先验意见。该计算速度足够快,可以应用于数百万个 SNP,尽管它确实依赖于主观判断,但这些判断是明确的,因此可以重现最终的 SNP 选择。我们表明,与 P 值相比,关联的概率在直观上更具吸引力,因为它更容易解释并且考虑到了研究的效力。我们通过将其应用于肾功能全基因组关联研究的荟萃分析来说明关联概率在 SNP 优先级排序中的使用,并证明与仅使用 P 值相比,使用关联概率进行 SNP 选择的效果更好。

相似文献

1
SNP prioritization using a Bayesian probability of association.使用关联贝叶斯概率进行 SNP 优先级排序。
Genet Epidemiol. 2013 Feb;37(2):214-21. doi: 10.1002/gepi.21704. Epub 2012 Dec 26.
2
A latent model for prioritization of SNPs for functional studies.用于功能研究中 SNP 优先级排序的潜在模型。
PLoS One. 2011;6(6):e20764. doi: 10.1371/journal.pone.0020764. Epub 2011 Jun 8.
3
Incorporating Functional Genomic Information in Genetic Association Studies Using an Empirical Bayes Approach.使用经验贝叶斯方法将功能基因组信息纳入基因关联研究。
Genet Epidemiol. 2016 Apr;40(3):176-87. doi: 10.1002/gepi.21956. Epub 2016 Feb 1.
4
Importance of different types of prior knowledge in selecting genome-wide findings for follow-up.不同类型先验知识在选择全基因组发现进行随访中的重要性。
Genet Epidemiol. 2013 Feb;37(2):205-13. doi: 10.1002/gepi.21705.
5
A method combining a random forest-based technique with the modeling of linkage disequilibrium through latent variables, to run multilocus genome-wide association studies.一种结合基于随机森林的技术和通过潜在变量进行连锁不平衡建模的方法,用于进行多基因座全基因组关联研究。
BMC Bioinformatics. 2018 Mar 27;19(1):106. doi: 10.1186/s12859-018-2054-0.
6
Structured Genome-Wide Association Studies with Bayesian Hierarchical Variable Selection.基于贝叶斯分层变量选择的结构全基因组关联研究。
Genetics. 2019 Jun;212(2):397-415. doi: 10.1534/genetics.119.301906. Epub 2019 Apr 22.
7
Ranking of genome-wide association scan signals by different measures.按不同指标对全基因组关联扫描信号进行排名。
Int J Epidemiol. 2009 Oct;38(5):1364-73. doi: 10.1093/ije/dyp285. Epub 2009 Sep 4.
8
Multiple SNP Set Analysis for Genome-Wide Association Studies Through Bayesian Latent Variable Selection.通过贝叶斯潜在变量选择进行全基因组关联研究的多单核苷酸多态性集分析
Genet Epidemiol. 2015 Dec;39(8):664-77. doi: 10.1002/gepi.21932. Epub 2015 Oct 30.
9
SNPranker 2.0: a gene-centric data mining tool for diseases associated SNP prioritization in GWAS.SNPranker 2.0:一种针对 GWAS 中疾病相关 SNP 优先级排序的基于基因的数据分析工具。
BMC Bioinformatics. 2013;14 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2105-14-S1-S9. Epub 2013 Jan 14.
10
The utility of the Laplace effect size prior distribution in Bayesian fine-mapping studies.拉普拉斯效应大小先验分布在贝叶斯精细映射研究中的效用。
Genet Epidemiol. 2021 Jun;45(4):386-401. doi: 10.1002/gepi.22375. Epub 2021 Jan 6.

引用本文的文献

1
Biologically Enhanced Genome-Wide Association Study Provides Further Evidence for Candidate Loci and Discovers Novel Loci That Influence Risk of Anterior Cruciate Ligament Rupture in a Dog Model.生物增强全基因组关联研究为候选基因座提供了进一步证据,并发现了影响犬模型中前交叉韧带断裂风险的新基因座。
Front Genet. 2021 Mar 5;12:593515. doi: 10.3389/fgene.2021.593515. eCollection 2021.
2
The Evolving Field of Genetic Epidemiology: From Familial Aggregation to Genomic Sequencing.遗传流行病学的发展领域:从家族聚集到基因组测序。
Am J Epidemiol. 2019 Dec 31;188(12):2069-2077. doi: 10.1093/aje/kwz193.
3
iFunMed: Integrative functional mediation analysis of GWAS and eQTL studies.iFunMed:GWAS 和 eQTL 研究的综合功能中介分析。
Genet Epidemiol. 2019 Oct;43(7):742-760. doi: 10.1002/gepi.22217. Epub 2019 Jul 22.
4
Exploring the underlying biology of intrinsic cardiorespiratory fitness through integrative analysis of genomic variants and muscle gene expression profiling.通过整合基因组变异和肌肉基因表达谱分析探索内在心肺功能适应性的潜在生物学机制。
J Appl Physiol (1985). 2019 May 1;126(5):1292-1314. doi: 10.1152/japplphysiol.00035.2018. Epub 2019 Jan 3.
5
Annotation Regression for Genome-Wide Association Studies with an Application to Psychiatric Genomic Consortium Data.用于全基因组关联研究的注释回归及其在精神疾病基因组学联盟数据中的应用
Stat Biosci. 2017 Jun;9(1):50-72. doi: 10.1007/s12561-016-9154-z. Epub 2016 Aug 12.
6
Inclusion of biological knowledge in a Bayesian shrinkage model for joint estimation of SNP effects.将生物学知识纳入用于SNP效应联合估计的贝叶斯收缩模型。
Genet Epidemiol. 2017 May;41(4):320-331. doi: 10.1002/gepi.22038. Epub 2017 Apr 10.
7
Using local multiplicity to improve effect estimation from a hypothesis-generating pharmacogenetics study.利用局部多重性来改善假设生成性药物遗传学研究中的效应估计。
Pharmacogenomics J. 2016 Feb;16(1):107-12. doi: 10.1038/tpj.2015.19. Epub 2015 Mar 24.
8
Two-phase and family-based designs for next-generation sequencing studies.用于下一代测序研究的两阶段和基于家系的设计。
Front Genet. 2013 Dec 13;4:276. doi: 10.3389/fgene.2013.00276.
9
A review of post-GWAS prioritization approaches.GWAS 后优先级排序方法综述。
Front Genet. 2013 Dec 9;4:280. doi: 10.3389/fgene.2013.00280.

本文引用的文献

1
Importance of different types of prior knowledge in selecting genome-wide findings for follow-up.不同类型先验知识在选择全基因组发现进行随访中的重要性。
Genet Epidemiol. 2013 Feb;37(2):205-13. doi: 10.1002/gepi.21705.
2
Computational tools for prioritizing candidate genes: boosting disease gene discovery.计算工具在候选基因优先级排序中的应用:提高疾病基因发现的效率。
Nat Rev Genet. 2012 Jul 3;13(8):523-36. doi: 10.1038/nrg3253.
3
Methods for meta-analyses of genome-wide association studies: critical assessment of empirical evidence.用于全基因组关联研究荟萃分析的方法:对经验证据的批判性评估。
Am J Epidemiol. 2012 Apr 15;175(8):739-49. doi: 10.1093/aje/kwr385. Epub 2012 Mar 16.
4
A latent model for prioritization of SNPs for functional studies.用于功能研究中 SNP 优先级排序的潜在模型。
PLoS One. 2011;6(6):e20764. doi: 10.1371/journal.pone.0020764. Epub 2011 Jun 8.
5
SPOT: a web-based tool for using biological databases to prioritize SNPs after a genome-wide association study.SPOT:一种基于网络的工具,用于在全基因组关联研究后使用生物数据库对 SNPs 进行优先级排序。
Nucleic Acids Res. 2010 Jul;38(Web Server issue):W201-9. doi: 10.1093/nar/gkq513. Epub 2010 Jun 6.
6
New loci associated with kidney function and chronic kidney disease.与肾功能和慢性肾脏病相关的新基因座。
Nat Genet. 2010 May;42(5):376-84. doi: 10.1038/ng.568. Epub 2010 Apr 11.
7
Prioritizing GWAS results: A review of statistical methods and recommendations for their application.优先考虑 GWAS 结果:统计方法综述及其应用建议。
Am J Hum Genet. 2010 Jan;86(1):6-22. doi: 10.1016/j.ajhg.2009.11.017.
8
Finding the missing heritability of complex diseases.寻找复杂疾病中缺失的遗传力。
Nature. 2009 Oct 8;461(7265):747-53. doi: 10.1038/nature08494.
9
Bayesian statistical methods for genetic association studies.用于基因关联研究的贝叶斯统计方法。
Nat Rev Genet. 2009 Oct;10(10):681-90. doi: 10.1038/nrg2615.
10
Multiple loci associated with indices of renal function and chronic kidney disease.多个与肾功能和慢性肾脏病指标相关的基因座。
Nat Genet. 2009 Jun;41(6):712-7. doi: 10.1038/ng.377. Epub 2009 May 10.