• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过动态聚类检测高阶全基因组上位性相互作用的云计算。

Cloud computing for detecting high-order genome-wide epistatic interaction via dynamic clustering.

机构信息

Department of Computer Science, Georgia State University, 34 Peachtree Street, Atlanta, USA.

出版信息

BMC Bioinformatics. 2014 Apr 10;15:102. doi: 10.1186/1471-2105-15-102.

DOI:10.1186/1471-2105-15-102
PMID:24717145
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4021249/
Abstract

BACKGROUND

Taking the advantage of high-throughput single nucleotide polymorphism (SNP) genotyping technology, large genome-wide association studies (GWASs) have been considered to hold promise for unravelling complex relationships between genotype and phenotype. At present, traditional single-locus-based methods are insufficient to detect interactions consisting of multiple-locus, which are broadly existing in complex traits. In addition, statistic tests for high order epistatic interactions with more than 2 SNPs propose computational and analytical challenges because the computation increases exponentially as the cardinality of SNPs combinations gets larger.

RESULTS

In this paper, we provide a simple, fast and powerful method using dynamic clustering and cloud computing to detect genome-wide multi-locus epistatic interactions. We have constructed systematic experiments to compare powers performance against some recently proposed algorithms, including TEAM, SNPRuler, EDCF and BOOST. Furthermore, we have applied our method on two real GWAS datasets, Age-related macular degeneration (AMD) and Rheumatoid arthritis (RA) datasets, where we find some novel potential disease-related genetic factors which are not shown up in detections of 2-loci epistatic interactions.

CONCLUSIONS

Experimental results on simulated data demonstrate that our method is more powerful than some recently proposed methods on both two- and three-locus disease models. Our method has discovered many novel high-order associations that are significantly enriched in cases from two real GWAS datasets. Moreover, the running time of the cloud implementation for our method on AMD dataset and RA dataset are roughly 2 hours and 50 hours on a cluster with forty small virtual machines for detecting two-locus interactions, respectively. Therefore, we believe that our method is suitable and effective for the full-scale analysis of multiple-locus epistatic interactions in GWAS.

摘要

背景

利用高通量单核苷酸多态性(SNP)基因分型技术,全基因组关联研究(GWAS)被认为有望揭示基因型和表型之间的复杂关系。目前,传统的基于单基因座的方法不足以检测由多个基因座组成的相互作用,这些相互作用广泛存在于复杂特征中。此外,对于包含 2 个以上 SNP 的高阶上位性相互作用的统计检验提出了计算和分析方面的挑战,因为随着 SNP 组合的基数增大,计算量呈指数级增长。

结果

在本文中,我们提供了一种简单、快速和强大的方法,使用动态聚类和云计算来检测全基因组多基因座上位性相互作用。我们已经构建了系统实验来比较与一些最近提出的算法的功效性能,包括 TEAM、SNPRuler、EDCF 和 BOOST。此外,我们将我们的方法应用于两个真实的 GWAS 数据集,即年龄相关性黄斑变性(AMD)和类风湿性关节炎(RA)数据集,在那里我们发现了一些新的潜在与疾病相关的遗传因素,这些因素在 2 个基因座上位性相互作用的检测中没有显示出来。

结论

模拟数据的实验结果表明,我们的方法在两种和三种疾病模型上都比一些最近提出的方法更有效。我们的方法发现了许多新的高阶关联,这些关联在来自两个真实 GWAS 数据集的病例中显著富集。此外,我们的方法在 AMD 数据集和 RA 数据集上的云实现的运行时间大约分别为 2 小时和 50 小时,用于检测两个基因座的相互作用,在一个包含四十个小虚拟机的集群上。因此,我们相信我们的方法适用于全规模分析 GWAS 中的多基因座上位性相互作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca09/4021249/b60fb531a650/1471-2105-15-102-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca09/4021249/2493f627e0e7/1471-2105-15-102-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca09/4021249/1d69ca87d3ea/1471-2105-15-102-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca09/4021249/37a8dfa1a4b1/1471-2105-15-102-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca09/4021249/0e33d41786de/1471-2105-15-102-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca09/4021249/b353ea52e86a/1471-2105-15-102-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca09/4021249/b60fb531a650/1471-2105-15-102-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca09/4021249/2493f627e0e7/1471-2105-15-102-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca09/4021249/1d69ca87d3ea/1471-2105-15-102-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca09/4021249/37a8dfa1a4b1/1471-2105-15-102-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca09/4021249/0e33d41786de/1471-2105-15-102-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca09/4021249/b353ea52e86a/1471-2105-15-102-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca09/4021249/b60fb531a650/1471-2105-15-102-6.jpg

相似文献

1
Cloud computing for detecting high-order genome-wide epistatic interaction via dynamic clustering.通过动态聚类检测高阶全基因组上位性相互作用的云计算。
BMC Bioinformatics. 2014 Apr 10;15:102. doi: 10.1186/1471-2105-15-102.
2
Searching Genome-Wide Multi-Locus Associations for Multiple Diseases Based on Bayesian Inference.基于贝叶斯推理搜索多种疾病的全基因组多位点关联
IEEE/ACM Trans Comput Biol Bioinform. 2017 May-Jun;14(3):600-610. doi: 10.1109/TCBB.2016.2527648. Epub 2016 Feb 11.
3
Detecting genome-wide epistases based on the clustering of relatively frequent items.基于相对频繁项聚类检测全基因组上位性。
Bioinformatics. 2012 Jan 1;28(1):5-12. doi: 10.1093/bioinformatics/btr603. Epub 2011 Nov 3.
4
Fast detection of high-order epistatic interactions in genome-wide association studies using information theoretic measure.利用信息论方法在全基因组关联研究中快速检测高阶上位性相互作用
Comput Biol Chem. 2014 Jun;50:19-28. doi: 10.1016/j.compbiolchem.2014.01.005. Epub 2014 Jan 27.
5
Evaluating the detection ability of a range of epistasis detection methods on simulated data for pure and impure epistatic models.评估一系列上位性检测方法在纯上位性模型和不纯上位性模型的模拟数据中的检测能力。
PLoS One. 2022 Feb 18;17(2):e0263390. doi: 10.1371/journal.pone.0263390. eCollection 2022.
6
Detecting purely epistatic multi-locus interactions by an omnibus permutation test on ensembles of two-locus analyses.通过对两基因座分析集合进行整体置换检验来检测纯上位多基因座相互作用。
BMC Bioinformatics. 2009 Sep 17;10:294. doi: 10.1186/1471-2105-10-294.
7
GWIS--model-free, fast and exhaustive search for epistatic interactions in case-control GWAS.GWIS--无模型、快速且全面搜索病例对照 GWAS 中的上位相互作用。
BMC Genomics. 2013;14 Suppl 3(Suppl 3):S10. doi: 10.1186/1471-2164-14-S3-S10. Epub 2013 May 28.
8
A Markov blanket-based method for detecting causal SNPs in GWAS.基于马尔可夫毯的 GWAS 中因果 SNP 检测方法。
BMC Bioinformatics. 2010 Apr 29;11 Suppl 3(Suppl 3):S5. doi: 10.1186/1471-2105-11-S3-S5.
9
Gene, pathway and network frameworks to identify epistatic interactions of single nucleotide polymorphisms derived from GWAS data.用于识别源自全基因组关联研究(GWAS)数据的单核苷酸多态性上位性相互作用的基因、通路和网络框架。
BMC Syst Biol. 2012;6 Suppl 3(Suppl 3):S15. doi: 10.1186/1752-0509-6-S3-S15. Epub 2012 Dec 17.
10
Genetic studies of complex human diseases: characterizing SNP-disease associations using Bayesian networks.复杂人类疾病的遗传学研究:使用贝叶斯网络表征单核苷酸多态性与疾病的关联
BMC Syst Biol. 2012;6 Suppl 3(Suppl 3):S14. doi: 10.1186/1752-0509-6-S3-S14. Epub 2012 Dec 17.

引用本文的文献

1
GWASTool: A web pipeline for detecting SNP-phenotype associations.GWASTool:一种用于检测单核苷酸多态性与表型关联的网络流程。
Fundam Res. 2024 Mar 22;4(4):761-769. doi: 10.1016/j.fmre.2024.03.005. eCollection 2024 Jul.
2
Discovering SNP-disease relationships in genome-wide SNP data using an improved harmony search based on SNP locus and genetic inheritance patterns.利用基于 SNP 位置和遗传遗传模式的改进和声搜索在全基因组 SNP 数据中发现 SNP 疾病关系。
PLoS One. 2023 Oct 13;18(10):e0292266. doi: 10.1371/journal.pone.0292266. eCollection 2023.
3
MDSN: A Module Detection Method for Identifying High-Order Epistatic Interactions.

本文引用的文献

1
Whole genome identity-by-descent determination.全基因组同源性确定。
J Bioinform Comput Biol. 2013 Apr;11(2):1350002. doi: 10.1142/S0219720013500029. Epub 2013 Jan 16.
2
iLOCi: a SNP interaction prioritization technique for detecting epistasis in genome-wide association studies.iLOCi:一种 SNP 相互作用优先级技术,用于检测全基因组关联研究中的上位性。
BMC Genomics. 2012;13 Suppl 7(Suppl 7):S2. doi: 10.1186/1471-2164-13-S7-S2. Epub 2012 Dec 13.
3
A novel method to identify high order gene-gene interactions in genome-wide association studies: gene-based MDR.
MDSN:一种用于识别高阶上位性相互作用的模块检测方法。
Genes (Basel). 2022 Dec 18;13(12):2403. doi: 10.3390/genes13122403.
4
A Review of Feature Selection Methods for Machine Learning-Based Disease Risk Prediction.基于机器学习的疾病风险预测的特征选择方法综述
Front Bioinform. 2022 Jun 27;2:927312. doi: 10.3389/fbinf.2022.927312. eCollection 2022.
5
Multi-Objective Artificial Bee Colony Algorithm Based on Scale-Free Network for Epistasis Detection.基于无标度网络的多目标人工蜂群算法在连锁检测中的应用。
Genes (Basel). 2022 May 12;13(5):871. doi: 10.3390/genes13050871.
6
JS-MA: A Jensen-Shannon Divergence Based Method for Mapping Genome-Wide Associations on Multiple Diseases.JS-MA:一种基于詹森-香农散度的方法,用于绘制多种疾病的全基因组关联图谱。
Front Genet. 2020 Oct 30;11:507038. doi: 10.3389/fgene.2020.507038. eCollection 2020.
7
HiSSI: high-order SNP-SNP interactions detection based on efficient significant pattern and differential evolution.HiSSI:基于高效显著模式和差分进化的高阶 SNP-SNP 交互检测。
BMC Med Genomics. 2019 Dec 30;12(Suppl 7):139. doi: 10.1186/s12920-019-0584-6.
8
Locating potentially lethal genes using the abnormal distributions of genotypes.利用基因型的异常分布定位潜在致死基因。
Sci Rep. 2019 Jul 22;9(1):10543. doi: 10.1038/s41598-019-47076-w.
9
FDHE-IW: A Fast Approach for Detecting High-Order Epistasis in Genome-Wide Case-Control Studies.FDHE-IW:一种在全基因组病例对照研究中检测高阶上位性的快速方法。
Genes (Basel). 2018 Aug 29;9(9):435. doi: 10.3390/genes9090435.
10
ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information.聚类多变量互信息(ClusterMI):基于聚类和互信息的高阶 SNP 相互作用检测。
Int J Mol Sci. 2018 Aug 2;19(8):2267. doi: 10.3390/ijms19082267.
一种在全基因组关联研究中识别高阶基因-基因相互作用的新方法:基于基因的多变量数据分析。
BMC Bioinformatics. 2012 Jun 11;13 Suppl 9(Suppl 9):S5. doi: 10.1186/1471-2105-13-S9-S5.
4
Fast accurate missing SNP genotype local imputation.快速准确的缺失单核苷酸多态性(SNP)基因型局部填充
BMC Res Notes. 2012 Aug 3;5:404. doi: 10.1186/1756-0500-5-404.
5
High-order SNP combinations associated with complex diseases: efficient discovery, statistical power and functional interactions.与复杂疾病相关的高阶 SNP 组合:高效发现、统计能力和功能相互作用。
PLoS One. 2012;7(4):e33531. doi: 10.1371/journal.pone.0033531. Epub 2012 Apr 19.
6
Detecting genome-wide epistases based on the clustering of relatively frequent items.基于相对频繁项聚类检测全基因组上位性。
Bioinformatics. 2012 Jan 1;28(1):5-12. doi: 10.1093/bioinformatics/btr603. Epub 2011 Nov 3.
7
An empirical comparison of several recent epistatic interaction detection methods.几种近期上位性互作检测方法的实证比较。
Bioinformatics. 2011 Nov 1;27(21):2936-43. doi: 10.1093/bioinformatics/btr512. Epub 2011 Sep 7.
8
Genome-wide interaction-based association analysis identified multiple new susceptibility Loci for common diseases.基于全基因组相互作用的关联分析鉴定出常见疾病的多个新易感位点。
PLoS Genet. 2011 Mar;7(3):e1001338. doi: 10.1371/journal.pgen.1001338. Epub 2011 Mar 17.
9
GBOOST: a GPU-based tool for detecting gene-gene interactions in genome-wide case control studies.GBOOST:一种基于 GPU 的工具,用于在全基因组病例对照研究中检测基因-基因相互作用。
Bioinformatics. 2011 May 1;27(9):1309-10. doi: 10.1093/bioinformatics/btr114. Epub 2011 Mar 3.
10
Model-based multifactor dimensionality reduction for detecting epistasis in case-control data in the presence of noise.基于模型的多因素降维方法,用于在存在噪声的病例对照数据中检测上位性。
Ann Hum Genet. 2011 Jan;75(1):78-89. doi: 10.1111/j.1469-1809.2010.00604.x. Epub 2010 Sep 8.