• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用随机森林识别预测类风湿性关节炎的基因和单倍型。

Identification of genes and haplotypes that predict rheumatoid arthritis using random forests.

作者信息

Tang Rui, Sinnwell Jason P, Li Jia, Rider David N, de Andrade Mariza, Biernacka Joanna M

机构信息

Department of Health Sciences Research, 200 First Street Southwest, Mayo Clinic, Rochester, Minnesota 55905, USA.

出版信息

BMC Proc. 2009 Dec 15;3 Suppl 7(Suppl 7):S68. doi: 10.1186/1753-6561-3-s7-s68.

DOI:10.1186/1753-6561-3-s7-s68
PMID:20018062
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2795969/
Abstract

Random forest (RF) analysis of genetic data does not require specification of the mode of inheritance, and provides measures of variable importance that incorporate interaction effects. In this paper we describe RF-based approaches for assessment of gene and haplotype importance, and apply these approaches to a subset of the North American Rheumatoid Arthritis Consortium case-control data provided by Genetic Analysis Workshop 16. The RF analyses of 37 genes identified many of the same genes as logistic regression, but also suggested importance of certain single-nucleotide polymorphism and genes that were not ranked highly by logistic regression. A new permutation method did not reveal strong evidence of gene-gene interaction effects in these data. Although RFs are a promising approach for genetic data analysis, extensions beyond simple single-nucleotide polymorphism analyses and modifications to improve computational feasibility are needed.

摘要

对基因数据进行随机森林(RF)分析不需要指定遗传模式,并且能提供包含交互效应的变量重要性度量。在本文中,我们描述了基于RF的评估基因和单倍型重要性的方法,并将这些方法应用于遗传分析研讨会16提供的北美类风湿关节炎联盟病例对照数据的一个子集。对37个基因进行的RF分析识别出了许多与逻辑回归相同的基因,但也表明了某些单核苷酸多态性和未被逻辑回归高度排名的基因的重要性。一种新的置换方法并未在这些数据中揭示出基因-基因交互效应的有力证据。尽管RF是遗传数据分析的一种有前景的方法,但需要超越简单的单核苷酸多态性分析进行扩展,并进行修改以提高计算可行性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/846e/2795969/2c17e78f33fc/1753-6561-3-S7-S68-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/846e/2795969/2c17e78f33fc/1753-6561-3-S7-S68-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/846e/2795969/2c17e78f33fc/1753-6561-3-S7-S68-1.jpg

相似文献

1
Identification of genes and haplotypes that predict rheumatoid arthritis using random forests.使用随机森林识别预测类风湿性关节炎的基因和单倍型。
BMC Proc. 2009 Dec 15;3 Suppl 7(Suppl 7):S68. doi: 10.1186/1753-6561-3-s7-s68.
2
Analyses of single marker and pairwise effects of candidate loci for rheumatoid arthritis using logistic regression and random forests.使用逻辑回归和随机森林对类风湿性关节炎候选基因座的单标记和成对效应进行分析。
BMC Proc. 2007;1 Suppl 1(Suppl 1):S54. doi: 10.1186/1753-6561-1-s1-s54. Epub 2007 Dec 18.
3
Classification of rheumatoid arthritis status with candidate gene and genome-wide single-nucleotide polymorphisms using random forests.利用随机森林法通过候选基因和全基因组单核苷酸多态性对类风湿性关节炎状态进行分类
BMC Proc. 2007;1 Suppl 1(Suppl 1):S62. doi: 10.1186/1753-6561-1-s1-s62. Epub 2007 Dec 18.
4
Genome-wide analysis of haplotype interaction for the data from the North American Rheumatoid Arthritis Consortium.对北美类风湿关节炎联盟数据进行单倍型相互作用的全基因组分析。
BMC Proc. 2009 Dec 15;3 Suppl 7(Suppl 7):S34. doi: 10.1186/1753-6561-3-s7-s34.
5
Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.使用基于质量的两阶段随机森林进行全基因组关联数据分类和单核苷酸多态性选择。
BMC Genomics. 2015;16 Suppl 2(Suppl 2):S5. doi: 10.1186/1471-2164-16-S2-S5. Epub 2015 Jan 21.
6
SNP interaction detection with Random Forests in high-dimensional genetic data.利用随机森林在高维遗传数据中检测 SNP 相互作用。
BMC Bioinformatics. 2012 Jul 15;13:164. doi: 10.1186/1471-2105-13-164.
7
Modeling X Chromosome Data Using Random Forests: Conquering Sex Bias.使用随机森林对X染色体数据进行建模:克服性别偏差。
Genet Epidemiol. 2016 Feb;40(2):123-32. doi: 10.1002/gepi.21946. Epub 2015 Dec 7.
8
Picking single-nucleotide polymorphisms in forests.在森林中挑选单核苷酸多态性。
BMC Proc. 2007;1 Suppl 1(Suppl 1):S59. doi: 10.1186/1753-6561-1-s1-s59. Epub 2007 Dec 18.
9
Two-stage approach for identifying single-nucleotide polymorphisms associated with rheumatoid arthritis using random forests and Bayesian networks.使用随机森林和贝叶斯网络识别类风湿关节炎相关单核苷酸多态性的两阶段方法。
BMC Proc. 2007;1 Suppl 1(Suppl 1):S56. doi: 10.1186/1753-6561-1-s1-s56. Epub 2007 Dec 18.
10
Comparative study for haplotype block partitioning methods - Evidence from chromosome 6 of the North American Rheumatoid Arthritis Consortium (NARAC) dataset.单体型块划分方法的比较研究——来自北美类风湿关节炎联盟(NARAC)数据集 6 号染色体的证据。
PLoS One. 2018 Dec 31;13(12):e0209603. doi: 10.1371/journal.pone.0209603. eCollection 2018.

引用本文的文献

1
Unravelling the Genetic Landscape of Hemiplegic Migraine: Exploring Innovative Strategies and Emerging Approaches.解析偏瘫性偏头痛的遗传图谱:探索创新策略与新兴方法。
Genes (Basel). 2024 Mar 31;15(4):443. doi: 10.3390/genes15040443.
2
Machine Learning-Assisted Approaches in Modernized Plant Breeding Programs.机器学习辅助方法在现代化植物育种计划中的应用。
Genes (Basel). 2023 Mar 23;14(4):777. doi: 10.3390/genes14040777.
3
Tumor Necrosis Factor-α Gene Polymorphism is Associated with Short- and Long-Term Kidney Allograft Outcomes.

本文引用的文献

1
Multiple imputation and random forests (MIRF) for unobservable, high-dimensional data.
Int J Biostat. 2007;3(1):Article 12. doi: 10.2202/1557-4679.1049.
2
A forest-based approach to identifying gene and gene gene interactions.一种基于森林模型的基因及基因-基因相互作用识别方法。
Proc Natl Acad Sci U S A. 2007 Dec 4;104(49):19199-203. doi: 10.1073/pnas.0709868104. Epub 2007 Nov 28.
3
Data mining, neural nets, trees--problems 2 and 3 of Genetic Analysis Workshop 15.数据挖掘、神经网络、决策树——遗传分析研讨会15的问题2和问题3。
肿瘤坏死因子-α基因多态性与肾移植短期和长期预后相关。
J Inflamm Res. 2022 Apr 5;15:2243-2254. doi: 10.2147/JIR.S334738. eCollection 2022.
4
A random forest method with feature selection for developing medical prediction models with clustered and longitudinal data.基于聚类和纵向数据的医学预测模型的特征选择随机森林方法。
J Biomed Inform. 2021 May;117:103763. doi: 10.1016/j.jbi.2021.103763. Epub 2021 Mar 26.
5
Studying the effects of haplotype partitioning methods on the RA-associated genomic results from the North American Rheumatoid Arthritis Consortium (NARAC) dataset.研究单倍型划分方法对来自北美类风湿关节炎协会(NARAC)数据集的类风湿关节炎相关基因组结果的影响。
J Adv Res. 2019 Jan 18;18:113-126. doi: 10.1016/j.jare.2019.01.006. eCollection 2019 Jul.
6
Ensemble learning for detecting gene-gene interactions in colorectal cancer.用于检测结直肠癌中基因-基因相互作用的集成学习
PeerJ. 2018 Oct 29;6:e5854. doi: 10.7717/peerj.5854. eCollection 2018.
7
A Phenome-Wide Association Study Uncovers a Role for Autoimmunity in the Development of Chronic Obstructive Pulmonary Disease.一项全表型关联研究揭示自身免疫在慢性阻塞性肺疾病发生中的作用。
Am J Respir Cell Mol Biol. 2018 Jun;58(6):777-779. doi: 10.1165/rcmb.2017-0409LE.
8
Classification Tree Analysis as a Method for Uncovering Relations Between CHRNA5A3B4 and CHRNB3A6 in Predicting Smoking Progression in Adolescent Smokers.分类树分析作为一种揭示青少年吸烟者中CHRNA5A3B4和CHRNB3A6在预测吸烟进展方面关系的方法。
Nicotine Tob Res. 2017 Apr 1;19(4):410-416. doi: 10.1093/ntr/ntw197.
9
Identification of a potential fibromyalgia diagnosis using random forest modeling applied to electronic medical records.运用随机森林模型对电子病历进行分析以识别潜在的纤维肌痛诊断。
J Pain Res. 2015 Jun 10;8:277-88. doi: 10.2147/jpr.s8256. eCollection 2015.
10
RAD-QTL Mapping Reveals Both Genome-Level Parallelism and Different Genetic Architecture Underlying the Evolution of Body Shape in Lake Whitefish (Coregonus clupeaformis) Species Pairs.RAD-QTL定位揭示了湖白鲑(Coregonus clupeaformis)物种对体型进化背后的基因组水平平行性和不同的遗传结构。
G3 (Bethesda). 2015 May 21;5(7):1481-91. doi: 10.1534/g3.115.019067.
Genet Epidemiol. 2007;31 Suppl 1:S51-60. doi: 10.1002/gepi.20280.
4
Genome-wide strategies for detecting multiple loci that influence complex diseases.用于检测影响复杂疾病的多个基因座的全基因组策略。
Nat Genet. 2005 Apr;37(4):413-7. doi: 10.1038/ng1537. Epub 2005 Mar 27.
5
Identifying SNPs predictive of phenotype using random forests.使用随机森林识别预测表型的单核苷酸多态性
Genet Epidemiol. 2005 Feb;28(2):171-82. doi: 10.1002/gepi.20041.
6
Screening large-scale association study data: exploiting interactions using random forests.筛选大规模关联研究数据:利用随机森林探索相互作用
BMC Genet. 2004 Dec 10;5:32. doi: 10.1186/1471-2156-5-32.
7
Mapping complex traits using Random Forests.使用随机森林映射复杂性状。
BMC Genet. 2003 Dec 31;4 Suppl 1(Suppl 1):S64. doi: 10.1186/1471-2156-4-S1-S64.