Learning genetic epistasis using Bayesian network scoring criteria.

Affiliation

Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA.

Publication information

BMC Bioinformatics. 2011 Mar 31;12:89. doi: 10.1186/1471-2105-12-89.

DOI:10.1186/1471-2105-12-89
PMID:21453508
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3080825/
Abstract

BACKGROUND

Gene-gene epistatic interactions likely play an important role in the genetic basis of many common diseases. Recently, machine-learning and data mining methods have been developed for learning epistatic relationships from data. A well-known combinatorial method that has been successfully applied for detecting epistasis is Multifactor Dimensionality Reduction (MDR). Jiang et al. created a combinatorial epistasis learning method called BNMBL to learn Bayesian network (BN) epistatic models. They compared BNMBL to MDR using simulated data sets. Each of these data sets was generated from a model that associates two SNPs with a disease and includes 18 unrelated SNPs. For each data set, BNMBL and MDR were used to score all 2-SNP models, and BNMBL learned significantly more correct models. In real data sets, we ordinarily do not know the number of SNPs that influence phenotype. BNMBL may not perform as well if we also scored models containing more than two SNPs. Furthermore, a number of other BN scoring criteria have been developed. They may detect epistatic interactions even better than BNMBL.

Although BNs are a promising tool for learning epistatic relationships from data, we cannot confidently use them in this domain until we determine which scoring criteria work best or even well when we try learning the correct model without knowledge of the number of SNPs in that model.
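To make the search problem in the background concrete, here is a minimal sketch of exhaustive model scoring when the number of SNPs in the model is not fixed. The function names (`candidate_models`, `rank_models`) and the `score(parent_rows, disease)` signature are illustrative assumptions, not the authors' code; any scoring criterion where higher is better could be plugged in.

```python
from itertools import combinations

def candidate_models(n_snps, max_size):
    """Yield every SNP subset of size 1..max_size as a tuple of column indices."""
    for size in range(1, max_size + 1):
        yield from combinations(range(n_snps), size)

def rank_models(genotypes, disease, score, max_size):
    """Score every candidate subset and return (score, subset) pairs, best first.

    `genotypes` is a list of rows (one genotype per SNP); `score` is a
    user-supplied criterion with the hypothetical signature
    score(parent_rows, disease) -> float, higher meaning better.
    """
    n_snps = len(genotypes[0])
    ranked = []
    for subset in candidate_models(n_snps, max_size):
        # Project each sample onto the SNPs in this candidate model.
        parent_rows = [tuple(row[i] for i in subset) for row in genotypes]
        ranked.append((score(parent_rows, disease), subset))
    ranked.sort(reverse=True)
    return ranked

# 20 SNPs as in the simulated data sets: 2 associated with disease + 18 unrelated.
n_two_snp = sum(1 for m in candidate_models(20, 2) if len(m) == 2)
n_up_to_four = sum(1 for _ in candidate_models(20, 4))
print(n_two_snp, n_up_to_four)
```

With 20 SNPs there are 190 two-SNP models but 6195 models of size one to four, which is why a criterion that works when only 2-SNP models are scored may behave differently once larger models compete.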

RESULTS

We evaluated the performance of 22 BN scoring criteria using 28,000 simulated data sets and a real Alzheimer's GWAS data set. Our results were surprising in that the Bayesian scoring criterion with large values of a hyperparameter called α performed best. This score performed better than other BN scoring criteria and MDR at recall using simulated data sets, at detecting the hardest-to-detect models using simulated data sets, and at substantiating previous results using the real Alzheimer's data set.
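The abstract does not give the formula for the Bayesian scoring criterion. As a hedged sketch, the code below assumes it takes the standard BDeu (Bayesian Dirichlet equivalent uniform) form, in which α is the prior equivalent sample size spread evenly over genotype configurations and disease states. The function name `log_bdeu` and the toy data are illustrative assumptions, not taken from the paper.

```python
from collections import Counter
from itertools import product
from math import lgamma

def log_bdeu(parent_rows, disease, alpha, n_genotypes=3, n_states=2):
    """Log BDeu score of a disease node whose parents are the given SNPs.

    parent_rows: one tuple of genotypes per sample (empty tuples = no parents).
    disease:     one disease state per sample.
    alpha:       prior equivalent sample size (the hyperparameter).
    """
    n_parents = len(parent_rows[0]) if parent_rows else 0
    q = n_genotypes ** n_parents      # number of joint genotype configurations
    a_j = alpha / q                   # prior mass per configuration
    a_jk = alpha / (q * n_states)     # prior mass per (configuration, state)
    n_j = Counter(parent_rows)
    n_jk = Counter(zip(parent_rows, disease))
    # Unobserved configurations and states contribute exactly zero, so
    # summing over observed counts only is sufficient.
    score = sum(lgamma(a_j) - lgamma(a_j + n) for n in n_j.values())
    score += sum(lgamma(a_jk + n) - lgamma(a_jk) for n in n_jk.values())
    return score

# Toy data (hypothetical): 9 genotype pairs x 10 samples each, with disease
# status determined jointly by both SNPs.
pairs = [pair for pair in product(range(3), repeat=2) for _ in range(10)]
disease = [(a + b) % 2 for a, b in pairs]
null_rows = [() for _ in pairs]          # model with no SNP parents
snp1_rows = [(a,) for a, _ in pairs]     # model with the first SNP only

for alpha in (1, 9, 54):
    print(alpha,
          round(log_bdeu(null_rows, disease, alpha), 2),
          round(log_bdeu(snp1_rows, disease, alpha), 2),
          round(log_bdeu(pairs, disease, alpha), 2))
```

On this toy data the two-SNP model scores above both the empty model and the single-SNP model for each α shown. It says nothing about which α is best in practice; that is what the evaluation over 28,000 simulated data sets addresses.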

CONCLUSIONS

We conclude that representing epistatic interactions using BN models and scoring them using a BN scoring criterion holds promise for identifying epistatic genetic variants in data. In particular, the Bayesian scoring criterion with large values of a hyperparameter α appears more promising than a number of alternatives.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc29/3080825/79c9b1b91fe8/1471-2105-12-89-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc29/3080825/10273285e616/1471-2105-12-89-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc29/3080825/6b360b7e9d81/1471-2105-12-89-2.jpg

Cited by

1
A causal learning framework for the analysis and interpretation of COVID-19 clinical data.
PLoS One. 2022 May 19;17(5):e0268327. doi: 10.1371/journal.pone.0268327. eCollection 2022.
2
Automated Cyber and Privacy Risk Management Toolkit.
Sensors (Basel). 2021 Aug 15;21(16):5493. doi: 10.3390/s21165493.
3
The Application of Artificial Intelligence in the Genetic Study of Alzheimer's Disease.
Aging Dis. 2020 Dec 1;11(6):1567-1584. doi: 10.14336/AD.2020.0312. eCollection 2020 Dec.
4
Epi-GTBN: an approach of epistasis mining based on genetic Tabu algorithm and Bayesian network.
BMC Bioinformatics. 2019 Aug 28;20(1):444. doi: 10.1186/s12859-019-3022-z.
5
Self-Adjusting Ant Colony Optimization Based on Information Entropy for Detecting Epistatic Interactions.
Genes (Basel). 2019 Feb 1;10(2):114. doi: 10.3390/genes10020114.
6
Bayesian Network Construction and Genotype-Phenotype Inference Using GWAS Statistics.
IEEE/ACM Trans Comput Biol Bioinform. 2019 Mar-Apr;16(2):475-489. doi: 10.1109/TCBB.2017.2779498. Epub 2017 Dec 4.
7
Heterogeneity Analysis and Diagnosis of Complex Diseases Based on Deep Learning Method.
Sci Rep. 2018 Apr 18;8(1):6155. doi: 10.1038/s41598-018-24588-5.
8
epiACO - a method for identifying epistasis based on ant Colony optimization algorithm.
BioData Min. 2017 Jul 6;10:23. doi: 10.1186/s13040-017-0143-7. eCollection 2017.
9
Discovering causal interactions using Bayesian network scoring and information gain.
BMC Bioinformatics. 2016 May 26;17(1):221. doi: 10.1186/s12859-016-1084-8.
10
FHSA-SED: Two-Locus Model Detection for Genome-Wide Association Study with Harmony Search Algorithm.
PLoS One. 2016 Mar 25;11(3):e0150669. doi: 10.1371/journal.pone.0150669. eCollection 2016.

References

1
Identifying genetic interactions in genome-wide data using Bayesian networks.
Genet Epidemiol. 2010 Sep;34(6):575-81. doi: 10.1002/gepi.20514.
2
A Markov blanket-based method for detecting causal SNPs in GWAS.
BMC Bioinformatics. 2010 Apr 29;11 Suppl 3(Suppl 3):S5. doi: 10.1186/1471-2105-11-S3-S5.
3
COE: a general approach for efficient genome-wide two-locus epistasis test in disease association study.
J Comput Biol. 2010 Mar;17(3):401-15. doi: 10.1089/cmb.2009.0155.
4
A Bayesian method for identifying genetic interactions.
AMIA Annu Symp Proc. 2009 Nov 14;2009:673-7.
5
A variational Bayes algorithm for fast and accurate multiple locus genome-wide association analysis.
BMC Bioinformatics. 2010 Jan 27;11:58. doi: 10.1186/1471-2105-11-58.
6
Screen and clean: a tool for identifying interactions in genome-wide association studies.
Genet Epidemiol. 2010 Apr;34(3):275-85. doi: 10.1002/gepi.20459.
7
Methods for investigating gene-environment interactions in candidate pathway and genome-wide association studies.
Annu Rev Public Health. 2010;31:21-36. doi: 10.1146/annurev.publhealth.012809.103619.
8
Predictive rule inference for epistatic interaction detection in genome-wide association studies.
Bioinformatics. 2010 Jan 1;26(1):30-7. doi: 10.1093/bioinformatics/btp622. Epub 2009 Oct 30.
9
Detecting purely epistatic multi-locus interactions by an omnibus permutation test on ensembles of two-locus analyses.
BMC Bioinformatics. 2009 Sep 17;10:294. doi: 10.1186/1471-2105-10-294.
10
Genome-wide association study identifies variants at CLU and CR1 associated with Alzheimer's disease.
Nat Genet. 2009 Oct;41(10):1094-9. doi: 10.1038/ng.439. Epub 2009 Sep 6.