Review: High-performance computing to detect epistasis in genome scale data sets.

Author information

Upton Alex, Trelles Oswaldo, Cornejo-García José Antonio, Perkins James Richard

Publication information

Brief Bioinform. 2016 May;17(3):368-79. doi: 10.1093/bib/bbv058. Epub 2015 Aug 13.

Abstract

It is becoming clear that most human diseases have a complex etiology that cannot be explained by single nucleotide polymorphisms (SNPs) or simple additive combinations; the general consensus is that they are caused by combinations of multiple genetic variations. The limited success of some genome-wide association studies is partly a result of this focus on single genetic markers. A more promising approach is to take into account epistasis, by considering the association of multiple SNP interactions with disease. However, as genomic data continues to grow in resolution, and genome and exome sequencing become more established, the number of combinations of variants to consider increases rapidly. Two potential solutions should be considered: the use of high-performance computing, which allows us to consider a larger number of variables, and heuristics to make the solution more tractable, essential in the case of genome sequencing. In this review, we look at different computational methods to analyse epistatic interactions within disease-related genetic data sets created by microarray technology. We also review efforts to use epistatic analysis results to produce biomarkers for diagnostic tests and give our views on future directions in this field in light of advances in sequencing technology and variants in non-coding regions.
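
The abstract's point that the number of variant combinations "increases rapidly" can be made concrete with a small counting sketch. This is an illustration only, not a method from the review: the function name `interaction_count` and the SNP panel sizes below are hypothetical values chosen to show the trend, and the count assumes every unordered combination of distinct SNPs is tested exhaustively.

```python
from math import comb


def interaction_count(n_snps: int, order: int) -> int:
    """Number of unordered combinations of `order` distinct SNPs out of `n_snps`.

    Assumes an exhaustive search that tests every combination exactly once.
    """
    return comb(n_snps, order)


if __name__ == "__main__":
    # Hypothetical panel sizes, chosen only to illustrate the growth.
    for n_snps in (1_000, 500_000, 5_000_000):
        for order in (2, 3):
            total = interaction_count(n_snps, order)
            print(f"{n_snps:>9,} SNPs, order {order}: {total:,} combinations")
```

Under these assumptions, pairwise testing grows roughly with the square of the number of SNPs and third-order testing with the cube, which is why the abstract points to high-performance computing and heuristics as the two ways to keep the search tractable.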

Similar articles

Cloud computing for detecting high-order genome-wide epistatic interaction via dynamic clustering.

BMC Bioinformatics. 2014 Apr 10;15:102. doi: 10.1186/1471-2105-15-102.

SNPHarvester: a filtering-based approach for detecting epistatic interactions in genome-wide association studies.

Bioinformatics. 2009 Feb 15;25(4):504-11. doi: 10.1093/bioinformatics/btn652. Epub 2008 Dec 19.

Parallel and serial computing tools for testing single-locus and epistatic SNP effects of quantitative traits in genome-wide association studies.

BMC Bioinformatics. 2008 Jul 21;9:315. doi: 10.1186/1471-2105-9-315.

High-throughput analysis of epistasis in genome-wide association studies with BiForce.

Bioinformatics. 2012 Aug 1;28(15):1957-64. doi: 10.1093/bioinformatics/bts304. Epub 2012 May 21.

Additive and epistatic genome-wide association for growth and ultrasound scan measures of carcass-related traits in Brahman cattle.

J Anim Breed Genet. 2015 Apr;132(2):187-97. doi: 10.1111/jbg.12147. Epub 2015 Mar 6.

A Tool for Detecting Complementary Single Nucleotide Polymorphism Pairs in Genome-Wide Association Studies for Epistasis Testing.

J Comput Biol. 2021 Apr;28(4):378-380. doi: 10.1089/cmb.2020.0430. Epub 2020 Dec 15.

eCEO: an efficient Cloud Epistasis cOmputing model in genome-wide association study.

Bioinformatics. 2011 Apr 15;27(8):1045-51. doi: 10.1093/bioinformatics/btr091. Epub 2011 Mar 2.

Distributed transformer for high order epistasis detection in large-scale datasets.

Sci Rep. 2024 Jun 25;14(1):14579. doi: 10.1038/s41598-024-65317-5.

Random Forests approach for identifying additive and epistatic single nucleotide polymorphisms associated with residual feed intake in dairy cattle.

J Dairy Sci. 2013 Oct;96(10):6716-29. doi: 10.3168/jds.2012-6237. Epub 2013 Aug 9.

Cited by

ACOCMPMI: An Ant Colony Optimization Algorithm Based on Composite Multiscale Part Mutual Information for Detecting Epistatic Interactions.

Hum Mutat. 2025 Jun 13;2025:7656300. doi: 10.1155/humu/7656300. eCollection 2025.

GRAMMAR-Lambda Delivers Efficient Understanding of the Genetic Basis for Head Size in Catfish.

Biology (Basel). 2025 Jan 13;14(1):63. doi: 10.3390/biology14010063.

Compressed variance component mixed model reveals epistasis associated with flowering in .

Front Plant Sci. 2024 Jan 8;14:1283642. doi: 10.3389/fpls.2023.1283642. eCollection 2023.

Discovering SNP-disease relationships in genome-wide SNP data using an improved harmony search based on SNP locus and genetic inheritance patterns.

PLoS One. 2023 Oct 13;18(10):e0292266. doi: 10.1371/journal.pone.0292266. eCollection 2023.

Open problems in human trait genetics.

Genome Biol. 2022 Jun 20;23(1):131. doi: 10.1186/s13059-022-02697-9.

Gene-Interaction-Sensitive enrichment analysis in congenital heart disease.

BioData Min. 2022 Feb 12;15(1):4. doi: 10.1186/s13040-022-00287-w.

Gene-gene interaction analysis incorporating network information via a structured Bayesian approach.

Stat Med. 2021 Dec 20;40(29):6619-6633. doi: 10.1002/sim.9202. Epub 2021 Sep 20.

Long-range linkage disequilibrium in French beef cattle breeds.

Genet Sel Evol. 2021 Jul 23;53(1):63. doi: 10.1186/s12711-021-00657-8.

Towards the Interpretability of Machine Learning Predictions for Medical Applications Targeting Personalised Therapies: A Cancer Case Survey.

Int J Mol Sci. 2021 Apr 22;22(9):4394. doi: 10.3390/ijms22094394.

Detecting fitness epistasis in recently admixed populations with genome-wide data.

BMC Genomics. 2020 Jul 11;21(1):476. doi: 10.1186/s12864-020-06874-7.
