• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于马尔可夫毯的 GWAS 中因果 SNP 检测方法。

A Markov blanket-based method for detecting causal SNPs in GWAS.

机构信息

Department of Electrical Engineering and Computer Science, The University of Kansas, Lawrence, 66045, USA.

出版信息

BMC Bioinformatics. 2010 Apr 29;11 Suppl 3(Suppl 3):S5. doi: 10.1186/1471-2105-11-S3-S5.

DOI:10.1186/1471-2105-11-S3-S5
PMID:20438652
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2863064/
Abstract

BACKGROUND

Detecting epistatic interactions associated with complex and common diseases can help to improve prevention, diagnosis and treatment of these diseases. With the development of genome-wide association studies (GWAS), designing powerful and robust computational method for identifying epistatic interactions associated with common diseases becomes a great challenge to bioinformatics society, because the study of epistatic interactions often deals with the large size of the genotyped data and the huge amount of combinations of all the possible genetic factors. Most existing computational detection methods are based on the classification capacity of SNP sets, which may fail to identify SNP sets that are strongly associated with the diseases and introduce a lot of false positives. In addition, most methods are not suitable for genome-wide scale studies due to their computational complexity.

RESULTS

We propose a new Markov Blanket-based method, DASSO-MB (Detection of ASSOciations using Markov Blanket) to detect epistatic interactions in case-control GWAS. Markov blanket of a target variable T can completely shield T from all other variables. Thus, we can guarantee that the SNP set detected by DASSO-MB has a strong association with diseases and contains fewest false positives. Furthermore, DASSO-MB uses a heuristic search strategy by calculating the association between variables to avoid the time-consuming training process as in other machine-learning methods. We apply our algorithm to simulated datasets and a real case-control dataset. We compare DASSO-MB to other commonly-used methods and show that our method significantly outperforms other methods and is capable of finding SNPs strongly associated with diseases.

CONCLUSIONS

Our study shows that DASSO-MB can identify a minimal set of causal SNPs associated with diseases, which contains less false positives compared to other existing methods. Given the huge size of genomic dataset produced by GWAS, this is critical in saving the potential costs of biological experiments and being an efficient guideline for pathogenesis research.

摘要

背景

检测与复杂和常见疾病相关的上位相互作用可以帮助改善这些疾病的预防、诊断和治疗。随着全基因组关联研究(GWAS)的发展,设计用于识别与常见疾病相关的上位相互作用的强大而稳健的计算方法成为生物信息学领域的一大挑战,因为上位相互作用的研究通常涉及到基因分型数据的大规模和所有可能遗传因素的组合的大量。大多数现有的计算检测方法都是基于 SNP 集的分类能力,这可能无法识别与疾病强烈相关的 SNP 集,并引入大量的假阳性。此外,由于计算复杂性,大多数方法不适合全基因组规模的研究。

结果

我们提出了一种新的基于马尔可夫 blankets 的方法,DASSO-MB(使用马尔可夫 blankets 检测关联),用于检测病例对照 GWAS 中的上位相互作用。目标变量 T 的马尔可夫 blankets 可以完全屏蔽 T 与所有其他变量的联系。因此,我们可以保证 DASSO-MB 检测到的 SNP 集与疾病有很强的关联,并且包含最少的假阳性。此外,DASSO-MB 通过计算变量之间的关联来使用启发式搜索策略,避免了像其他机器学习方法那样耗时的训练过程。我们将我们的算法应用于模拟数据集和真实的病例对照数据集。我们将 DASSO-MB 与其他常用方法进行比较,结果表明我们的方法显著优于其他方法,并且能够找到与疾病强烈相关的 SNPs。

结论

我们的研究表明,DASSO-MB 可以识别与疾病相关的最小一组因果 SNPs,与其他现有方法相比,它包含的假阳性更少。鉴于 GWAS 产生的基因组数据集的巨大规模,这对于节省潜在的生物学实验成本和作为发病机制研究的有效指导方针至关重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30e0/2863064/7c1adbabe124/1471-2105-11-S3-S5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30e0/2863064/ed26c1e7b573/1471-2105-11-S3-S5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30e0/2863064/ef89123a96f8/1471-2105-11-S3-S5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30e0/2863064/7c1adbabe124/1471-2105-11-S3-S5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30e0/2863064/ed26c1e7b573/1471-2105-11-S3-S5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30e0/2863064/ef89123a96f8/1471-2105-11-S3-S5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30e0/2863064/7c1adbabe124/1471-2105-11-S3-S5-3.jpg

相似文献

1
A Markov blanket-based method for detecting causal SNPs in GWAS.基于马尔可夫毯的 GWAS 中因果 SNP 检测方法。
BMC Bioinformatics. 2010 Apr 29;11 Suppl 3(Suppl 3):S5. doi: 10.1186/1471-2105-11-S3-S5.
2
FEPI-MB: identifying SNPs-disease association using a Markov Blanket-based approach.FEPI-MB:基于马尔可夫毯的方法识别 SNP 与疾病的关联。
BMC Bioinformatics. 2011 Nov 24;12 Suppl 12(Suppl 12):S3. doi: 10.1186/1471-2105-12-S12-S3.
3
Genetic studies of complex human diseases: characterizing SNP-disease associations using Bayesian networks.复杂人类疾病的遗传学研究:使用贝叶斯网络表征单核苷酸多态性与疾病的关联
BMC Syst Biol. 2012;6 Suppl 3(Suppl 3):S14. doi: 10.1186/1752-0509-6-S3-S14. Epub 2012 Dec 17.
4
bNEAT: a Bayesian network method for detecting epistatic interactions in genome-wide association studies.bNEAT:一种用于检测全基因组关联研究中上位性相互作用的贝叶斯网络方法。
BMC Genomics. 2011;12 Suppl 2(Suppl 2):S9. doi: 10.1186/1471-2164-12-S2-S9. Epub 2011 Jul 27.
5
SMMB: a stochastic Markov blanket framework strategy for epistasis detection in GWAS.SMMB:GWAS 中检测上位性的随机马尔可夫毯框架策略。
Bioinformatics. 2018 Aug 15;34(16):2773-2780. doi: 10.1093/bioinformatics/bty154.
6
Utilizing Deep Learning and Genome Wide Association Studies for Epistatic-Driven Preterm Birth Classification in African-American Women.利用深度学习和全基因组关联研究对非裔美国妇女的由上位效应驱动的早产进行分类。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Mar-Apr;17(2):668-678. doi: 10.1109/TCBB.2018.2868667. Epub 2018 Sep 3.
7
Cloud computing for detecting high-order genome-wide epistatic interaction via dynamic clustering.通过动态聚类检测高阶全基因组上位性相互作用的云计算。
BMC Bioinformatics. 2014 Apr 10;15:102. doi: 10.1186/1471-2105-15-102.
8
GWIS--model-free, fast and exhaustive search for epistatic interactions in case-control GWAS.GWIS--无模型、快速且全面搜索病例对照 GWAS 中的上位相互作用。
BMC Genomics. 2013;14 Suppl 3(Suppl 3):S10. doi: 10.1186/1471-2164-14-S3-S10. Epub 2013 May 28.
9
CMDR based differential evolution identifies the epistatic interaction in genome-wide association studies.基于CMDR的差分进化算法在全基因组关联研究中识别上位性相互作用。
Bioinformatics. 2017 Aug 1;33(15):2354-2362. doi: 10.1093/bioinformatics/btx163.
10
A novel Markov Blanket-based repeated-fishing strategy for capturing phenotype-related biomarkers in big omics data.一种基于马尔可夫毯的新型重复捕捞策略,用于在大型组学数据中捕获与表型相关的生物标志物。
BMC Genet. 2016 Mar 9;17:51. doi: 10.1186/s12863-016-0358-5.

引用本文的文献

1
Learning Causes of Functional Dynamic Targets: Screening and Local Methods.功能性动态目标的学习原因:筛选与局部方法。
Entropy (Basel). 2024 Jun 24;26(7):541. doi: 10.3390/e26070541.
2
Twitter data from the 2019-20 Australian bushfires reveals participatory and temporal variations in social media use for disaster recovery.2019-20 年澳大利亚丛林大火期间的推特数据揭示了社交媒体在灾难恢复中的参与性和时间性变化。
Sci Rep. 2022 Oct 8;12(1):16914. doi: 10.1038/s41598-022-21265-6.
3
STS-BN: An efficient Bayesian network method for detecting causal SNPs.

本文引用的文献

1
A random forest approach to the detection of epistatic interactions in case-control studies.一种用于病例对照研究中检测上位性相互作用的随机森林方法。
BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S65. doi: 10.1186/1471-2105-10-S1-S65.
2
Generating samples for association studies based on HapMap data.基于HapMap数据生成用于关联研究的样本。
BMC Bioinformatics. 2008 Jan 24;9:44. doi: 10.1186/1471-2105-9-44.
3
A support vector machine approach for detecting gene-gene interaction.一种用于检测基因-基因相互作用的支持向量机方法。
STS-BN:一种用于检测因果单核苷酸多态性的高效贝叶斯网络方法。
Front Genet. 2022 Sep 15;13:942464. doi: 10.3389/fgene.2022.942464. eCollection 2022.
4
A BAYESIAN GRAPHICAL MODEL FOR GENOME-WIDE ASSOCIATION STUDIES (GWAS).一种用于全基因组关联研究(GWAS)的贝叶斯图形模型。
Ann Appl Stat. 2016 Jun;10(2):786-811. doi: 10.1214/16-aoas909. Epub 2016 Jul 22.
5
Tumor somatic mutations also existing as germline polymorphisms may help to identify functional SNPs from genome-wide association studies.肿瘤体细胞突变也可能作为种系多态性存在,这有助于从全基因组关联研究中识别功能性单核苷酸多态性。
Carcinogenesis. 2020 Oct 15;41(10):1353-1362. doi: 10.1093/carcin/bgaa077.
6
Identification of genetic interaction networks via an evolutionary algorithm evolved Bayesian network.通过进化算法进化的贝叶斯网络识别基因相互作用网络。
BioData Min. 2016 May 10;9:18. doi: 10.1186/s13040-016-0094-4. eCollection 2016.
7
A novel Markov Blanket-based repeated-fishing strategy for capturing phenotype-related biomarkers in big omics data.一种基于马尔可夫毯的新型重复捕捞策略,用于在大型组学数据中捕获与表型相关的生物标志物。
BMC Genet. 2016 Mar 9;17:51. doi: 10.1186/s12863-016-0358-5.
8
A survey about methods dedicated to epistasis detection.一项关于用于上位性检测方法的调查。
Front Genet. 2015 Sep 10;6:285. doi: 10.3389/fgene.2015.00285. eCollection 2015.
9
An Efficient Stepwise Statistical Test to Identify Multiple Linked Human Genetic Variants Associated with Specific Phenotypic Traits.一种用于识别与特定表型特征相关的多个连锁人类遗传变异的高效逐步统计检验。
PLoS One. 2015 Sep 25;10(9):e0138700. doi: 10.1371/journal.pone.0138700. eCollection 2015.
10
An Improved Opposition-Based Learning Particle Swarm Optimization for the Detection of SNP-SNP Interactions.一种用于检测单核苷酸多态性-单核苷酸多态性相互作用的改进的基于对立学习的粒子群优化算法。
Biomed Res Int. 2015;2015:524821. doi: 10.1155/2015/524821. Epub 2015 Jul 5.
Genet Epidemiol. 2008 Feb;32(2):152-67. doi: 10.1002/gepi.20272.
4
Bayesian inference of epistatic interactions in case-control studies.病例对照研究中上位性相互作用的贝叶斯推断。
Nat Genet. 2007 Sep;39(9):1167-73. doi: 10.1038/ng2110. Epub 2007 Aug 26.
5
A whole-genome association study of major determinants for host control of HIV-1.一项关于宿主对HIV-1控制的主要决定因素的全基因组关联研究。
Science. 2007 Aug 17;317(5840):944-7. doi: 10.1126/science.1143767. Epub 2007 Jul 19.
6
Genome-wide association study identifies novel breast cancer susceptibility loci.全基因组关联研究确定了新的乳腺癌易感基因座。
Nature. 2007 Jun 28;447(7148):1087-93. doi: 10.1038/nature05887.
7
Penalized logistic regression for detecting gene interactions.用于检测基因相互作用的惩罚逻辑回归
Biostatistics. 2008 Jan;9(1):30-50. doi: 10.1093/biostatistics/kxm010. Epub 2007 Apr 11.
8
Detection of gene x gene interactions in genome-wide association studies of human population data.在人类群体数据的全基因组关联研究中检测基因与基因的相互作用
Hum Hered. 2007;63(2):67-84. doi: 10.1159/000099179. Epub 2007 Feb 2.
9
Machine learning for detecting gene-gene interactions: a review.用于检测基因-基因相互作用的机器学习:综述
Appl Bioinformatics. 2006;5(2):77-88. doi: 10.2165/00822942-200605020-00002.
10
Mendelian disorders deserve more attention.孟德尔疾病值得更多关注。
Nat Rev Genet. 2006 Apr;7(4):277-82. doi: 10.1038/nrg1826.