Department of Biomedical Science and Environmental Biology, Graduate Institute of Natural Products, College of Pharmacy, Cancer Center, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung 807, Taiwan.
Int J Endocrinol. 2013;2013:850735. doi: 10.1155/2013/850735. Epub 2013 Jan 14.
An essential task in a genomic analysis of a human disease is limiting the number of strongly associated genes when studying susceptibility to the disease. The goal of this study was to compare computational tools with and without feature selection for predicting osteoporosis outcome in Taiwanese women based on genetic factors such as single nucleotide polymorphisms (SNPs). To elucidate relationships between osteoporosis and SNPs in this population, three classification algorithms were applied: multilayer feedforward neural network (MFNN), naive Bayes, and logistic regression. A wrapper-based feature selection method was also used to identify a subset of major SNPs. Experimental results showed that the MFNN model with the wrapper-based approach was the best predictive model for inferring disease susceptibility based on the complex relationship between osteoporosis and SNPs in Taiwanese women. The findings suggest that patients and doctors can use the proposed tool to enhance decision making based on clinical factors such as SNP genotyping data.
在人类疾病的基因组分析中,当研究疾病易感性时,限制强关联基因的数量是一项基本任务。本研究的目的是比较基于遗传因素(如单核苷酸多态性 (SNP))预测台湾女性骨质疏松症结果的具有和不具有特征选择的计算工具。为了阐明骨质疏松症与该人群中 SNP 之间的关系,应用了三种分类算法:多层前馈神经网络 (MFNN)、朴素贝叶斯和逻辑回归。还应用了基于包装器的特征选择方法来识别主要 SNP 的子集。实验结果表明,基于包装器的方法的 MFNN 模型是根据台湾女性骨质疏松症与 SNP 之间的复杂关系推断疾病易感性的最佳预测模型。研究结果表明,患者和医生可以使用所提出的工具,根据 SNP 基因分型数据等临床因素增强决策制定。