Suppr超能文献

采用机器学习和贝叶斯阈值最小绝对收缩和选择算子(LASSO)模型的两步法,通过类风湿关节炎中的单核苷酸多态性相互作用检测单核苷酸多态性。

Detecting single-nucleotide polymorphism by single-nucleotide polymorphism interactions in rheumatoid arthritis using a two-step approach with machine learning and a Bayesian threshold least absolute shrinkage and selection operator (LASSO) model.

作者信息

González-Recio Oscar, de Maturana Evangelina López, Vega Andrés T, Engelman Corinne D, Broman Karl W

机构信息

Department of Dairy Science, University of Wisconsin-Madison, 266 Animal Science Building, 1675 Observatory Drive, Madison, Wisconsin 53706, USA.

出版信息

BMC Proc. 2009 Dec 15;3 Suppl 7(Suppl 7):S63. doi: 10.1186/1753-6561-3-s7-s63.

Abstract

The objective of this study was to detect interactions between relevant single-nucleotide polymorphisms (SNPs) associated with rheumatoid arthritis (RA). Data from Problem 1 of the Genetic Analysis Workshop 16 were used. These data consisted of 868 cases and 1,194 controls genotyped with the 500 k Illumina chip. First, machine learning methods were applied for preselecting SNPs. One hundred SNPs outside the HLA region and 1,500 SNPs in the HLA region were preselected using information-gain theory. The software weka was used to reduce colinearity and redundancy in the HLA region, resulting in a subset of 6 SNPs out of 1,500. In a second step, a parametric approach to account for interactions between SNPs in the HLA region, as well as HLA-nonHLA interactions was conducted using a Bayesian threshold least absolute shrinkage and selection operator (LASSO) model incorporating 2,560 covariates. This approach detected some main and interaction effects for SNPs in genes that have previously been associated with RA (e.g., rs2395175, rs660895, rs10484560, and rs2476601). Further, some other SNPs detected in this study may be considered in candidate gene studies.

摘要

本研究的目的是检测与类风湿性关节炎(RA)相关的单核苷酸多态性(SNP)之间的相互作用。使用了遗传分析研讨会16问题1的数据。这些数据包括用Illumina 500 k芯片进行基因分型的868例病例和1194例对照。首先,应用机器学习方法对SNP进行预选。利用信息增益理论预选了HLA区域外的100个SNP和HLA区域内的1500个SNP。使用软件weka来减少HLA区域内的共线性和冗余,从1500个SNP中得到了一个包含6个SNP的子集。第二步,使用纳入2560个协变量的贝叶斯阈值最小绝对收缩和选择算子(LASSO)模型,采用参数化方法来分析HLA区域内SNP之间的相互作用以及HLA与非HLA之间的相互作用。该方法检测到了一些先前与RA相关基因中SNP的主要效应和相互作用效应(如rs2395175、rs660895、rs10484560和rs2476601)。此外,本研究中检测到的其他一些SNP可在候选基因研究中予以考虑。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee7/2795964/2c5c8ae71def/1753-6561-3-S7-S63-1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验