Konieczny Andrzej, Stojanowski Jakub, Krajewska Magdalena, Kusztal Mariusz
Department of Nephrology and Transplantation Medicine, Wroclaw Medical University, 50-556 Wroclaw, Poland.
J Pers Med. 2021 Apr 17;11(4):312. doi: 10.3390/jpm11040312.
We are overwhelmed by a deluge of data and, although its interpretation is challenging, fortunately, information technology comes to the rescue. One of the tools is artificial intelligence, allowing the identification of relationships between variables and their arbitrary classification. We focused on the assessment of both the remission of proteinuria and the deterioration of kidney function in patients with IgA nephropathy, comparing several methods of machine learning. It is of utmost importance to respond to subtle changes in kidney function, which will lead to a deceleration of the disease. This goal has been achieved by analyzing regression techniques, predicting the difference in serum creatinine concentration. We obtained the performance of the tested models which classified patients with high accuracy (Random Forest Classifier showed an accuracy of 0.8-1.0, Multi-Layer Perceptron an Area Under Curve of 0.8842-0.9035 and an accuracy of 0.7527-1.0) and regressors with a low estimation error (Decision Tree Regressor showed MAE 0.2059, RMSE 0.2645). We have demonstrated the impact of both model selection and input features on performance. Application of machine learning methods requires careful selection of models and assessed parameters. The computing power of modern computers allows searching for the models most effective in terms of accuracy.
我们被大量的数据淹没,尽管对其进行解读具有挑战性,但幸运的是,信息技术前来救援。其中一个工具是人工智能,它能识别变量之间的关系并进行任意分类。我们专注于评估IgA肾病患者蛋白尿的缓解情况和肾功能的恶化情况,比较了几种机器学习方法。对肾功能的细微变化做出反应至关重要,这将减缓疾病的发展。通过分析回归技术,预测血清肌酐浓度的差异,我们实现了这一目标。我们获得了测试模型的性能,这些模型对患者进行了高精度分类(随机森林分类器的准确率为0.8 - 1.0,多层感知器的曲线下面积为0.8842 - 0.9035,准确率为0.7527 - 1.0),并且回归器的估计误差较低(决策树回归器的平均绝对误差为0.2059,均方根误差为0.2645)。我们已经证明了模型选择和输入特征对性能的影响。应用机器学习方法需要仔细选择模型和评估参数。现代计算机的计算能力允许寻找在准确性方面最有效的模型。