Suppr超能文献

逻辑回归、逻辑斯蒂回归、分类树和随机森林用于识别有效基因-基因及基因-环境相互作用的比较

A Comparison of Logistic Regression, Logic Regression, Classification Tree, and Random Forests to Identify Effective Gene-Gene and Gene-Environmental Interactions.

作者信息

Yoo Wonsuk, Ference Brian A, Cote Michele L, Schwartz Ann

机构信息

Biostatistics and Epidemiology Division, University of Tennessee Health Science Center, 66 N. Pauline St, Suite 633, Memphis, TN 38163, USA.

出版信息

Int J Appl Sci Technol. 2012 Aug;2(7):268.

Abstract

Genome wide association studies (GWAS) have identified numerous single nucleotide polymorphisms (SNPs) that are associated with a variety of common human diseases. Due to the weak marginal effect of most disease-associated SNPs, attention has recently turned to evaluating the combined effect of multiple disease-associated SNPs on the risk of disease. Several recent multigenic studies show potential evidence of applying multigenic approaches in association studies of various diseases including lung cancer. But the question remains as to the best methodology to analyze single nucleotide polymorphisms in multiple genes. In this work, we consider four methods-logistic regression, logic regression, classification tree, and random forests-to compare results for identifying important genes or gene-gene and gene-environmental interactions. To evaluate the performance of four methods, the cross-validation misclassification error and areas under the curves are provided. We performed a simulation study and applied them to the data from a large-scale, population-based, case-control study.

摘要

全基因组关联研究(GWAS)已经鉴定出许多与多种常见人类疾病相关的单核苷酸多态性(SNP)。由于大多数疾病相关SNP的边际效应较弱,最近人们的注意力转向评估多个疾病相关SNP对疾病风险的综合影响。最近的几项多基因研究显示了在包括肺癌在内的各种疾病的关联研究中应用多基因方法的潜在证据。但对于分析多个基因中的单核苷酸多态性的最佳方法仍然存在疑问。在这项工作中,我们考虑了四种方法——逻辑回归、逻辑回归、分类树和随机森林——来比较识别重要基因或基因-基因以及基因-环境相互作用的结果。为了评估这四种方法的性能,提供了交叉验证误分类误差和曲线下面积。我们进行了一项模拟研究,并将它们应用于一项大规模、基于人群的病例对照研究的数据。

相似文献

7
Identification of SNP interactions using logic regression.使用逻辑回归识别单核苷酸多态性(SNP)相互作用。
Biostatistics. 2008 Jan;9(1):187-98. doi: 10.1093/biostatistics/kxm024. Epub 2007 Jun 19.

引用本文的文献

2
Identifying Factors Associated with Periodontal Disease Using Machine Learning.利用机器学习识别与牙周病相关的因素。
J Int Soc Prev Community Dent. 2022 Dec 30;12(6):612-622. doi: 10.4103/jispcd.JISPCD_188_22. eCollection 2022 Nov-Dec.

本文引用的文献

1
7
Classification across gene expression microarray studies.基因表达微阵列研究中的分类。
BMC Bioinformatics. 2009 Dec 30;10:453. doi: 10.1186/1471-2105-10-453.
8
Optimum lymphadenectomy for esophageal cancer.食管癌的最佳淋巴结清扫术。
Ann Surg. 2010 Jan;251(1):46-50. doi: 10.1097/SLA.0b013e3181b2f6ee.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验