• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
A Comparison of Logistic Regression, Logic Regression, Classification Tree, and Random Forests to Identify Effective Gene-Gene and Gene-Environmental Interactions.逻辑回归、逻辑斯蒂回归、分类树和随机森林用于识别有效基因-基因及基因-环境相互作用的比较
Int J Appl Sci Technol. 2012 Aug;2(7):268.
2
Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.使用基于质量的两阶段随机森林进行全基因组关联数据分类和单核苷酸多态性选择。
BMC Genomics. 2015;16 Suppl 2(Suppl 2):S5. doi: 10.1186/1471-2164-16-S2-S5. Epub 2015 Jan 21.
3
Evaluation of tree-based statistical learning methods for constructing genetic risk scores.基于树的统计学习方法构建遗传风险评分的评估。
BMC Bioinformatics. 2022 Mar 21;23(1):97. doi: 10.1186/s12859-022-04634-w.
4
Classification of rheumatoid arthritis status with candidate gene and genome-wide single-nucleotide polymorphisms using random forests.利用随机森林法通过候选基因和全基因组单核苷酸多态性对类风湿性关节炎状态进行分类
BMC Proc. 2007;1 Suppl 1(Suppl 1):S62. doi: 10.1186/1753-6561-1-s1-s62. Epub 2007 Dec 18.
5
Evaluating the ability of tree-based methods and logistic regression for the detection of SNP-SNP interaction.评估基于树的方法和逻辑回归检测单核苷酸多态性(SNP)-SNP相互作用的能力。
Ann Hum Genet. 2009 May;73(Pt 3):360-9. doi: 10.1111/j.1469-1809.2009.00511.x. Epub 2009 Mar 8.
6
Evaluation of potential novel variations and their interactions related to bipolar disorders: analysis of genome-wide association study data.双相情感障碍相关潜在新变异及其相互作用的评估:全基因组关联研究数据分析
Neuropsychiatr Dis Treat. 2016 Nov 24;12:2997-3004. doi: 10.2147/NDT.S112558. eCollection 2016.
7
Identification of SNP interactions using logic regression.使用逻辑回归识别单核苷酸多态性(SNP)相互作用。
Biostatistics. 2008 Jan;9(1):187-98. doi: 10.1093/biostatistics/kxm024. Epub 2007 Jun 19.
8
A comparative study on the unified model based multifactor dimensionality reduction methods for identifying gene-gene interactions associated with the survival phenotype.基于统一模型的多因素降维方法识别与生存表型相关的基因-基因相互作用的比较研究。
BioData Min. 2021 Mar 1;14(1):17. doi: 10.1186/s13040-021-00248-9.
9
Assessing SNP-SNP interactions among DNA repair, modification and metabolism related pathway genes in breast cancer susceptibility.评估DNA修复、修饰和代谢相关通路基因之间的单核苷酸多态性(SNP)-SNP相互作用与乳腺癌易感性的关系。
PLoS One. 2013 Jun 3;8(6):e64896. doi: 10.1371/journal.pone.0064896. Print 2014.
10
Screening large-scale association study data: exploiting interactions using random forests.筛选大规模关联研究数据:利用随机森林探索相互作用
BMC Genet. 2004 Dec 10;5:32. doi: 10.1186/1471-2156-5-32.

引用本文的文献

1
Fire susceptibility assessment in the Carpathians using an interpretable framework.利用可解释框架对喀尔巴阡山脉的火灾易感性进行评估。
Sci Rep. 2025 Aug 18;15(1):30207. doi: 10.1038/s41598-025-10296-4.
2
Identifying Factors Associated with Periodontal Disease Using Machine Learning.利用机器学习识别与牙周病相关的因素。
J Int Soc Prev Community Dent. 2022 Dec 30;12(6):612-622. doi: 10.4103/jispcd.JISPCD_188_22. eCollection 2022 Nov-Dec.
3
Combinations of chronic conditions, functional limitations and geriatric syndromes associated with periodontal disease.慢性疾病、功能限制和与牙周病相关的老年综合病症的组合。
Fam Med Community Health. 2022 Aug;10(3). doi: 10.1136/fmch-2022-001733.
4
An Applicable Machine Learning Model Based on Preoperative Examinations Predicts Histology, Stage, and Grade for Endometrial Cancer.基于术前检查的适用机器学习模型可预测子宫内膜癌的组织学、分期和分级。
Front Oncol. 2022 May 30;12:904597. doi: 10.3389/fonc.2022.904597. eCollection 2022.
5
Evaluation of tree-based statistical learning methods for constructing genetic risk scores.基于树的统计学习方法构建遗传风险评分的评估。
BMC Bioinformatics. 2022 Mar 21;23(1):97. doi: 10.1186/s12859-022-04634-w.
6
Using logic regression to characterize extreme heat exposures and their health associations: a time-series study of emergency department visits in Atlanta.运用逻辑回归方法描述极端高温暴露及其与健康的关联:亚特兰大急诊就诊的时间序列研究。
BMC Med Res Methodol. 2021 Apr 26;21(1):87. doi: 10.1186/s12874-021-01278-x.
7
The use of Logic regression in epidemiologic studies to investigate multiple binary exposures: an example of occupation history and amyotrophic lateral sclerosis.在流行病学研究中使用逻辑回归来调查多种二元暴露因素:以职业史与肌萎缩侧索硬化症为例。
Epidemiol Methods. 2020 Jan;9(1). doi: 10.1515/em-2019-0032. Epub 2020 Feb 25.
8
Joint and interactive effects between health comorbidities and environmental exposures in predicting amyotrophic lateral sclerosis.健康共病与环境暴露在预测肌萎缩侧索硬化症中的联合和交互作用。
Int J Hyg Environ Health. 2021 Jan;231:113655. doi: 10.1016/j.ijheh.2020.113655. Epub 2020 Oct 30.
9
Predicting hypertension using machine learning: Findings from Qatar Biobank Study.使用机器学习预测高血压:来自卡塔尔生物银行研究的结果。
PLoS One. 2020 Oct 16;15(10):e0240370. doi: 10.1371/journal.pone.0240370. eCollection 2020.
10
Applying Supervised Machine Learning to Identify Which Patient Characteristics Identify the Highest Rates of Mortality Post-Interhospital Transfer.应用监督式机器学习来确定哪些患者特征表明院际转运后的死亡率最高。
Biomed Inform Insights. 2019 Mar 18;11:1178222619835548. doi: 10.1177/1178222619835548. eCollection 2019.

本文引用的文献

1
Smad2: a candidate gene for the murine autoimmune diabetes locus Idd21.1.Smad2:鼠自身免疫性糖尿病位点 Idd21.1 的候选基因。
J Clin Endocrinol Metab. 2011 Dec;96(12):E2072-7. doi: 10.1210/jc.2011-0463. Epub 2011 Oct 5.
2
Candidate gene association study for diabetic retinopathy in persons with type 2 diabetes: the Candidate gene Association Resource (CARe).2 型糖尿病患者糖尿病视网膜病变的候选基因关联研究:候选基因关联资源 (CARe)。
Invest Ophthalmol Vis Sci. 2011 Sep 29;52(10):7593-602. doi: 10.1167/iovs.11-7510.
3
Risk factors for autism: translating genomic discoveries into diagnostics.自闭症的风险因素:将基因组发现转化为诊断。
Hum Genet. 2011 Jul;130(1):123-48. doi: 10.1007/s00439-011-1037-2. Epub 2011 Jun 24.
4
Association between polymorphisms in cancer-related genes and early onset of esophageal adenocarcinoma.癌症相关基因多态性与食管腺癌早发的相关性研究。
Neoplasia. 2011 Apr;13(4):386-92. doi: 10.1593/neo.101722.
5
Genetic variation within the anticoagulant, procoagulant, fibrinolytic and innate immunity pathways as risk factors for venous thromboembolism.抗凝、促凝、纤溶和固有免疫途径中的遗传变异作为静脉血栓栓塞的危险因素。
J Thromb Haemost. 2011 Jun;9(6):1133-42. doi: 10.1111/j.1538-7836.2011.04272.x.
6
A three-stage genome-wide association study of general cognitive ability: hunting the small effects.一项关于一般认知能力的三阶段全基因组关联研究:寻找小效应。
Behav Genet. 2010 Nov;40(6):759-67. doi: 10.1007/s10519-010-9350-4. Epub 2010 Mar 21.
7
Classification across gene expression microarray studies.基因表达微阵列研究中的分类。
BMC Bioinformatics. 2009 Dec 30;10:453. doi: 10.1186/1471-2105-10-453.
8
Optimum lymphadenectomy for esophageal cancer.食管癌的最佳淋巴结清扫术。
Ann Surg. 2010 Jan;251(1):46-50. doi: 10.1097/SLA.0b013e3181b2f6ee.
9
An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests.递归分区介绍:分类和回归树、装袋和随机森林的原理、应用和特点。
Psychol Methods. 2009 Dec;14(4):323-48. doi: 10.1037/a0016973.
10
Genome-wide association of early-onset myocardial infarction with single nucleotide polymorphisms and copy number variants.早发性心肌梗死与单核苷酸多态性和拷贝数变异的全基因组关联研究
Nat Genet. 2009 Mar;41(3):334-41. doi: 10.1038/ng.327. Epub 2009 Feb 8.

逻辑回归、逻辑斯蒂回归、分类树和随机森林用于识别有效基因-基因及基因-环境相互作用的比较

A Comparison of Logistic Regression, Logic Regression, Classification Tree, and Random Forests to Identify Effective Gene-Gene and Gene-Environmental Interactions.

作者信息

Yoo Wonsuk, Ference Brian A, Cote Michele L, Schwartz Ann

机构信息

Biostatistics and Epidemiology Division, University of Tennessee Health Science Center, 66 N. Pauline St, Suite 633, Memphis, TN 38163, USA.

出版信息

Int J Appl Sci Technol. 2012 Aug;2(7):268.

PMID:23795347
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3686280/
Abstract

Genome wide association studies (GWAS) have identified numerous single nucleotide polymorphisms (SNPs) that are associated with a variety of common human diseases. Due to the weak marginal effect of most disease-associated SNPs, attention has recently turned to evaluating the combined effect of multiple disease-associated SNPs on the risk of disease. Several recent multigenic studies show potential evidence of applying multigenic approaches in association studies of various diseases including lung cancer. But the question remains as to the best methodology to analyze single nucleotide polymorphisms in multiple genes. In this work, we consider four methods-logistic regression, logic regression, classification tree, and random forests-to compare results for identifying important genes or gene-gene and gene-environmental interactions. To evaluate the performance of four methods, the cross-validation misclassification error and areas under the curves are provided. We performed a simulation study and applied them to the data from a large-scale, population-based, case-control study.

摘要

全基因组关联研究(GWAS)已经鉴定出许多与多种常见人类疾病相关的单核苷酸多态性(SNP)。由于大多数疾病相关SNP的边际效应较弱,最近人们的注意力转向评估多个疾病相关SNP对疾病风险的综合影响。最近的几项多基因研究显示了在包括肺癌在内的各种疾病的关联研究中应用多基因方法的潜在证据。但对于分析多个基因中的单核苷酸多态性的最佳方法仍然存在疑问。在这项工作中,我们考虑了四种方法——逻辑回归、逻辑回归、分类树和随机森林——来比较识别重要基因或基因-基因以及基因-环境相互作用的结果。为了评估这四种方法的性能,提供了交叉验证误分类误差和曲线下面积。我们进行了一项模拟研究,并将它们应用于一项大规模、基于人群的病例对照研究的数据。