• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

贝叶斯方法评估和发现疾病相关基因座。

A bayesian method for evaluating and discovering disease loci associations.

机构信息

Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America.

出版信息

PLoS One. 2011;6(8):e22075. doi: 10.1371/journal.pone.0022075. Epub 2011 Aug 10.

DOI:10.1371/journal.pone.0022075
PMID:21853025
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3154195/
Abstract

BACKGROUND

A genome-wide association study (GWAS) typically involves examining representative SNPs in individuals from some population. A GWAS data set can concern a million SNPs and may soon concern billions. Researchers investigate the association of each SNP individually with a disease, and it is becoming increasingly commonplace to also analyze multi-SNP associations. Techniques for handling so many hypotheses include the Bonferroni correction and recently developed bayesian methods. These methods can encounter problems. Most importantly, they are not applicable to a complex multi-locus hypothesis which has several competing hypotheses rather than only a null hypothesis. A method that computes the posterior probability of complex hypotheses is a pressing need.

METHODOLOGY/FINDINGS: We introduce the bayesian network posterior probability (BNPP) method which addresses the difficulties. The method represents the relationship between a disease and SNPs using a directed acyclic graph (DAG) model, and computes the likelihood of such models using a bayesian network scoring criterion. The posterior probability of a hypothesis is computed based on the likelihoods of all competing hypotheses. The BNPP can not only be used to evaluate a hypothesis that has previously been discovered or suspected, but also to discover new disease loci associations. The results of experiments using simulated and real data sets are presented. Our results concerning simulated data sets indicate that the BNPP exhibits both better evaluation and discovery performance than does a p-value based method. For the real data sets, previous findings in the literature are confirmed and additional findings are found.

CONCLUSIONS/SIGNIFICANCE: We conclude that the BNPP resolves a pressing problem by providing a way to compute the posterior probability of complex multi-locus hypotheses. A researcher can use the BNPP to determine the expected utility of investigating a hypothesis further. Furthermore, we conclude that the BNPP is a promising method for discovering disease loci associations.

摘要

背景

全基因组关联研究(GWAS)通常涉及检查来自某些人群的个体中的代表性 SNP。GWAS 数据集可能涉及一百万 SNP,并且很快可能涉及数十亿 SNP。研究人员单独研究每个 SNP 与疾病的关联,并且分析多 SNP 关联也越来越普遍。处理如此多假设的技术包括 Bonferroni 校正和最近开发的贝叶斯方法。这些方法可能会遇到问题。最重要的是,它们不适用于具有多个竞争假设而不是仅一个零假设的复杂多基因座假设。计算复杂假设后验概率的方法是迫切需要的。

方法/发现:我们引入了贝叶斯网络后验概率(BNPP)方法来解决这些困难。该方法使用有向无环图(DAG)模型表示疾病与 SNPs 之间的关系,并使用贝叶斯网络评分标准计算此类模型的可能性。根据所有竞争假设的可能性来计算假设的后验概率。BNPP 不仅可用于评估先前已发现或怀疑的假设,还可用于发现新的疾病相关位点。使用模拟和真实数据集的实验结果。关于模拟数据集的结果表明,BNPP 不仅在评估性能方面表现更好,而且在发现性能方面也表现更好。对于真实数据集,确认了文献中的先前发现,并发现了其他发现。

结论/意义:我们得出结论,BNPP 通过提供一种计算复杂多基因座假设后验概率的方法解决了一个紧迫的问题。研究人员可以使用 BNPP 进一步确定调查假设的预期效用。此外,我们得出结论,BNPP 是发现疾病相关位点关联的一种很有前途的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/60fe9dcc1fd5/pone.0022075.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/3f5605f6708e/pone.0022075.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/766a7afda028/pone.0022075.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/8a0b182442c9/pone.0022075.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/384673e613b6/pone.0022075.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/351796728960/pone.0022075.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/c897eacbd1b1/pone.0022075.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/0850ebb40f6a/pone.0022075.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/c2c709d9f627/pone.0022075.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/60fe9dcc1fd5/pone.0022075.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/3f5605f6708e/pone.0022075.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/766a7afda028/pone.0022075.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/8a0b182442c9/pone.0022075.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/384673e613b6/pone.0022075.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/351796728960/pone.0022075.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/c897eacbd1b1/pone.0022075.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/0850ebb40f6a/pone.0022075.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/c2c709d9f627/pone.0022075.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae4b/3154195/60fe9dcc1fd5/pone.0022075.g009.jpg

相似文献

1
A bayesian method for evaluating and discovering disease loci associations.贝叶斯方法评估和发现疾病相关基因座。
PLoS One. 2011;6(8):e22075. doi: 10.1371/journal.pone.0022075. Epub 2011 Aug 10.
2
Evaluating de novo locus-disease discoveries in GWAS using the signal-to-noise ratio.使用信噪比评估全基因组关联研究中从头发现的基因座与疾病的关联。
AMIA Annu Symp Proc. 2011;2011:617-24. Epub 2011 Oct 22.
3
LEAP: biomarker inference through learning and evaluating association patterns.LEAP:通过学习和评估关联模式进行生物标志物推断。
Genet Epidemiol. 2015 Mar;39(3):173-84. doi: 10.1002/gepi.21889. Epub 2015 Feb 12.
4
A method combining a random forest-based technique with the modeling of linkage disequilibrium through latent variables, to run multilocus genome-wide association studies.一种结合基于随机森林的技术和通过潜在变量进行连锁不平衡建模的方法,用于进行多基因座全基因组关联研究。
BMC Bioinformatics. 2018 Mar 27;19(1):106. doi: 10.1186/s12859-018-2054-0.
5
Mining pure, strict epistatic interactions from high-dimensional datasets: ameliorating the curse of dimensionality.从高维数据集挖掘纯净、严格的上位性相互作用:缓解维度灾难。
PLoS One. 2012;7(10):e46771. doi: 10.1371/journal.pone.0046771. Epub 2012 Oct 12.
6
Iterative sure independence screening EM-Bayesian LASSO algorithm for multi-locus genome-wide association studies.用于多位点全基因组关联研究的迭代确定独立筛选EM-贝叶斯套索算法
PLoS Comput Biol. 2017 Jan 31;13(1):e1005357. doi: 10.1371/journal.pcbi.1005357. eCollection 2017 Jan.
7
Probability theory-based SNP association study method for identifying susceptibility loci and genetic disease models in human case-control data.基于概率论的 SNP 关联研究方法,用于鉴定人类病例对照数据中的易感基因座和遗传疾病模型。
IEEE Trans Nanobioscience. 2010 Dec;9(4):232-41. doi: 10.1109/TNB.2010.2070805. Epub 2010 Sep 13.
8
Comparative analysis of methods for detecting interacting loci.检测互作基因座方法的比较分析。
BMC Genomics. 2011 Jul 5;12:344. doi: 10.1186/1471-2164-12-344.
9
Dementia revealed: novel chromosome 6 locus for late-onset Alzheimer disease provides genetic evidence for folate-pathway abnormalities.痴呆症的揭示:新型 6 号染色体晚发性阿尔茨海默病位点为叶酸代谢途径异常提供遗传证据。
PLoS Genet. 2010 Sep 23;6(9):e1001130. doi: 10.1371/journal.pgen.1001130.
10
Discovery and fine-mapping of kidney function loci in first genome-wide association study in Africans.首个在非裔人群中进行的全基因组关联研究发现并精细定位了肾功能相关位点。
Hum Mol Genet. 2021 Jul 28;30(16):1559-1568. doi: 10.1093/hmg/ddab088.

引用本文的文献

1
Verification of Three-Phase Dependency Analysis Bayesian Network Learning Method for Maize Carotenoid Gene Mining.用于玉米类胡萝卜素基因挖掘的三相依赖分析贝叶斯网络学习方法的验证
Biomed Res Int. 2017;2017:1813494. doi: 10.1155/2017/1813494. Epub 2017 Jul 30.
2
Modeling miRNA-mRNA interactions that cause phenotypic abnormality in breast cancer patients.对导致乳腺癌患者表型异常的微小RNA-信使核糖核酸相互作用进行建模。
PLoS One. 2017 Aug 9;12(8):e0182666. doi: 10.1371/journal.pone.0182666. eCollection 2017.
3
An algorithm for direct causal learning of influences on patient outcomes.

本文引用的文献

1
Analysis of Genome-Wide Association Study (GWAS) data looking for replicating signals in Alzheimer's disease (AD).对全基因组关联研究(GWAS)数据进行分析,以寻找阿尔茨海默病(AD)中的重复信号。
Int J Mol Epidemiol Genet. 2010;1(1):53-66. Epub 2009 Nov 15.
2
Learning genetic epistasis using Bayesian network scoring criteria.利用贝叶斯网络评分标准学习遗传上位性。
BMC Bioinformatics. 2011 Mar 31;12:89. doi: 10.1186/1471-2105-12-89.
3
A fast algorithm for learning epistatic genomic relationships.一种用于学习上位性基因组关系的快速算法。
一种用于直接因果学习对患者预后影响的算法。
Artif Intell Med. 2017 Jan;75:1-15. doi: 10.1016/j.artmed.2016.10.003. Epub 2016 Nov 5.
4
Discovering causal interactions using Bayesian network scoring and information gain.使用贝叶斯网络评分和信息增益发现因果相互作用。
BMC Bioinformatics. 2016 May 26;17(1):221. doi: 10.1186/s12859-016-1084-8.
5
Computational methods for ubiquitination site prediction using physicochemical properties of protein sequences.利用蛋白质序列的物理化学性质进行泛素化位点预测的计算方法。
BMC Bioinformatics. 2016 Mar 3;17:116. doi: 10.1186/s12859-016-0959-z.
6
Learning Predictive Interactions Using Information Gain and Bayesian Network Scoring.使用信息增益和贝叶斯网络评分学习预测性相互作用。
PLoS One. 2015 Dec 1;10(12):e0143247. doi: 10.1371/journal.pone.0143247. eCollection 2015.
7
Evaluation of a two-stage framework for prediction using big genomic data.使用大型基因组数据评估用于预测的两阶段框架。
Brief Bioinform. 2015 Nov;16(6):912-21. doi: 10.1093/bib/bbv010. Epub 2015 Mar 18.
8
LEAP: biomarker inference through learning and evaluating association patterns.LEAP:通过学习和评估关联模式进行生物标志物推断。
Genet Epidemiol. 2015 Mar;39(3):173-84. doi: 10.1002/gepi.21889. Epub 2015 Feb 12.
9
Evaluation of an ensemble of genetic models for prediction of a quantitative trait.用于预测数量性状的遗传模型集成评估。
Front Genet. 2015 Jan 13;5:474. doi: 10.3389/fgene.2014.00474. eCollection 2014.
10
Inferring Aberrant Signal Transduction Pathways in Ovarian Cancer from TCGA Data.从TCGA数据推断卵巢癌中的异常信号转导通路
Cancer Inform. 2014 Oct 13;13(Suppl 1):29-36. doi: 10.4137/CIN.S13881. eCollection 2014.
AMIA Annu Symp Proc. 2010 Nov 13;2010:341-5.
4
Genetic signatures of exceptional longevity in humans.人类超长寿命的遗传特征。
Science. 2010 Jul 1;2010. doi: 10.1126/science.1190532.
5
Identifying genetic interactions in genome-wide data using Bayesian networks.利用贝叶斯网络鉴定全基因组数据中的遗传交互作用。
Genet Epidemiol. 2010 Sep;34(6):575-81. doi: 10.1002/gepi.20514.
6
FGFR2 and other loci identified in genome-wide association studies are associated with breast cancer in African-American and younger women.全基因组关联研究中鉴定的 FGFR2 及其他基因座与非裔美国人和年轻女性的乳腺癌相关。
Carcinogenesis. 2010 Aug;31(8):1417-23. doi: 10.1093/carcin/bgq128. Epub 2010 Jun 16.
7
International network of cancer genome projects.国际癌症基因组计划网络。
Nature. 2010 Apr 15;464(7291):993-8. doi: 10.1038/nature08987.
8
A Bayesian method for identifying genetic interactions.一种用于识别基因相互作用的贝叶斯方法。
AMIA Annu Symp Proc. 2009 Nov 14;2009:673-7.
9
Fine mapping of the chromosome 10q11-q21 linkage region in Alzheimer's disease cases and controls.阿尔茨海默病病例和对照中 10q11-q21 连锁区域的精细定位。
Neurogenetics. 2010 Jul;11(3):335-48. doi: 10.1007/s10048-010-0234-9. Epub 2010 Feb 25.
10
Beyond genome-wide association studies: genetic heterogeneity and individual predisposition to cancer.超越全基因组关联研究:癌症的遗传异质性和个体易感性。
Trends Genet. 2010 Mar;26(3):132-41. doi: 10.1016/j.tig.2009.12.008. Epub 2010 Jan 26.