Department of Biomedical Sciences, University of Padua, Padua, Italy.
Department of Information Engineering, University of Padua, Padua, Italy.
PLoS One. 2020 Feb 10;15(2):e0227191. doi: 10.1371/journal.pone.0227191. eCollection 2020.
In this work we present a framework for blood cholesterol levels prediction from genotype data. The predictor is based on an algorithm for cholesterol metabolism simulation available in literature, implemented and optimized by our group in the R language. The main weakness of the former simulation algorithm was the need of experimental data to simulate mutations in genes altering the cholesterol metabolism. This caveat strongly limited the application of the model in the clinical practice. In this work we present how this limitation could be bypassed thanks to an optimization of model parameters based on patient cholesterol levels retrieved from literature. Prediction performance has been assessed taking into consideration several scoring indices currently used for performance evaluation of machine learning methods. Our assessment shows how the optimization phase improved model performance, compared to the original version available in literature.
在这项工作中,我们提出了一个从基因型数据预测血液胆固醇水平的框架。预测器基于文献中可用的胆固醇代谢模拟算法,由我们小组在 R 语言中实现和优化。以前的模拟算法的主要弱点是需要实验数据来模拟改变胆固醇代谢的基因中的突变。这一缺陷极大地限制了该模型在临床实践中的应用。在这项工作中,我们展示了如何通过基于文献中从患者胆固醇水平中检索到的模型参数优化来绕过这一限制。预测性能的评估考虑了当前用于机器学习方法性能评估的几种评分指标。我们的评估表明,与文献中可用的原始版本相比,优化阶段如何提高了模型性能。