Methods for identifying SNP interactions: a review on variations of Logic Regression, Random Forest and Bayesian logistic regression.

Affiliation

Discipline of Mathematical Sciences, Queensland University of Technology, Gardens Point, Brisbane, Queensland 4001, Australia.

Publication information

IEEE/ACM Trans Comput Biol Bioinform. 2011 Nov-Dec;8(6):1580-91. doi: 10.1109/TCBB.2011.46.

DOI:10.1109/TCBB.2011.46

PMID:21383421

Abstract

Due to advances in computing power, improved technology, and the falling price of genotyping, more data are being generated for understanding genetic associations with diseases and disorders. However, the availability of large data sets brings the inherent challenge of developing new methods of statistical analysis and modeling. Because a complex phenotype may be the effect of a combination of multiple loci, various statistical methods have been developed for identifying epistatic effects. Among these methods, logic regression (LR) is an intriguing approach incorporating tree-like structures. Various methods have built on the original LR to improve different aspects of the model. In this study, we review four variations of LR, namely Logic Feature Selection, Monte Carlo Logic Regression, Genetic Programming for Association Studies, and Modified Logic Regression-Gene Expression Programming, and investigate the performance of each method using simulated and real genotype data. We contrast these with another tree-like approach, namely Random Forests, and with a Bayesian logistic regression with stochastic search variable selection.

Similar articles

Evaluating the ability of tree-based methods and logistic regression for the detection of SNP-SNP interaction.

Ann Hum Genet. 2009 May;73(Pt 3):360-9. doi: 10.1111/j.1469-1809.2009.00511.x. Epub 2009 Mar 8.

Identifying interacting SNPs using Monte Carlo logic regression.

Genet Epidemiol. 2005 Feb;28(2):157-70. doi: 10.1002/gepi.20042.

Bayesian variable and model selection methods for genetic association studies.

Genet Epidemiol. 2009 Jan;33(1):27-37. doi: 10.1002/gepi.20353.

Comparative analysis of methods for detecting interacting loci.

BMC Genomics. 2011 Jul 5;12:344. doi: 10.1186/1471-2164-12-344.

Direct analysis of unphased SNP genotype data in population-based association studies via Bayesian partition modelling of haplotypes.

Genet Epidemiol. 2005 Sep;29(2):91-107. doi: 10.1002/gepi.20080.

Mapping the genetic architecture of complex traits in experimental populations.

Bioinformatics. 2007 Jun 15;23(12):1527-36. doi: 10.1093/bioinformatics/btm143. Epub 2007 Apr 25.

Gathering the gold dust: methods for assessing the aggregate impact of small effect genes in genomic scans.

Pac Symp Biocomput. 2008:190-200.

Bayesian phylogeny analysis via stochastic approximation Monte Carlo.

Mol Phylogenet Evol. 2009 Nov;53(2):394-403. doi: 10.1016/j.ympev.2009.06.019. Epub 2009 Jul 7.

Logic regression and its extensions.

Adv Genet. 2010;72:25-45. doi: 10.1016/B978-0-12-380862-2.00002-3.

Cited by

Models based on dietary nutrients predicting all-cause and cardiovascular mortality in people with diabetes.

Sci Rep. 2025 Feb 7;15(1):4600. doi: 10.1038/s41598-025-88480-9.

Relationships between multivitamins, blood biochemistry markers, and BMC and BMD based on RF: A cross-sectional and population-based study of NHANES, 2017-2018.

PLoS One. 2025 Jan 29;20(1):e0309524. doi: 10.1371/journal.pone.0309524. eCollection 2025.

An early warning approach for the rapid identification of extreme weather disasters based on phased array dual polarization radar cooperative network data.

PLoS One. 2024 Jan 3;19(1):e0296044. doi: 10.1371/journal.pone.0296044. eCollection 2024.

Association of SNPs in the FK-506 binding protein (FKBP5) gene among Han Chinese women with polycystic ovary syndrome.

BMC Med Genomics. 2022 Jul 4;15(1):149. doi: 10.1186/s12920-022-01301-0.

Diagnosis of Amnesic Mild Cognitive Impairment Using MGS-WBC and VGBN-LM Algorithms.

Front Aging Neurosci. 2022 May 30;14:893250. doi: 10.3389/fnagi.2022.893250. eCollection 2022.

Analytical and numerical comparisons of two methods of estimation of additive × additive × additive interaction of QTL effects.

J Appl Genet. 2022 May;63(2):213-221. doi: 10.1007/s13353-021-00676-7. Epub 2021 Dec 23.

Machine Learning Protocols in Early Cancer Detection Based on Liquid Biopsy: A Survey.

Life (Basel). 2021 Jun 30;11(7):638. doi: 10.3390/life11070638.

An evolution-based high-fidelity method of epistasis measurement: Theory and application to influenza.

PLoS Pathog. 2021 Jun 21;17(6):e1009669. doi: 10.1371/journal.ppat.1009669. eCollection 2021 Jun.

Effective Analysis of Inpatient Satisfaction: The Random Forest Algorithm.

Patient Prefer Adherence. 2021 Apr 7;15:691-703. doi: 10.2147/PPA.S294402. eCollection 2021.

The use of Logic regression in epidemiologic studies to investigate multiple binary exposures: an example of occupation history and amyotrophic lateral sclerosis.

Epidemiol Methods. 2020 Jan;9(1). doi: 10.1515/em-2019-0032. Epub 2020 Feb 25.
