Statistics Discipline, Khulna University, Khulna, Bangladesh.
Statistics Discipline, Khulna University, Khulna, Bangladesh.
Nutrition. 2020 Oct;78:110861. doi: 10.1016/j.nut.2020.110861. Epub 2020 May 15.
The aim of this study was is to predict malnutrition status in under-five children in Bangladesh by using various machine learning (ML) algorithms.
For analysis purposes, the nationally representative secondary records from the 2014 Bangladesh Demographic and Health Survey (BDHS) were used. Five well-known ML algorithms such as linear discriminant analysis (LDA), k-nearest neighbors (k-NN), support vector machines (SVM), random forest (RF), and logistic regression (LR) have been considered to accurately predict malnutrition status among children. Additionally, a systematic assessment of the algorithms was performed by using accuracy, sensitivity, specificity, and Cohen's κ statistic.
Based on various performance parameters, the best results were accomplished with the RF algorithm, which demonstrated an accuracy of 68.51%, a sensitivity of 94.66%, and a specificity of 69.76%. Additionally, a most extreme discriminative ability appeared by RF classification (Cohen's κ = 0.2434).
On the basis of the findings, we can presume that the RF algorithm was moderately superior to any other ML algorithms used in this study to predict malnutrition status among under-five children in Bangladesh. Finally, the present research recommends applying RF classification with RF feature selection when the prediction of malnutrition is the core interest.
本研究旨在使用各种机器学习(ML)算法预测孟加拉国五岁以下儿童的营养不良状况。
为分析目的,使用了来自 2014 年孟加拉国人口与健康调查(BDHS)的全国代表性二级记录。考虑了五种知名的 ML 算法,包括线性判别分析(LDA)、k-最近邻(k-NN)、支持向量机(SVM)、随机森林(RF)和逻辑回归(LR),以准确预测儿童的营养不良状况。此外,还通过准确性、敏感性、特异性和 Cohen's κ 统计对算法进行了系统评估。
基于各种性能参数,RF 算法取得了最佳结果,其准确性为 68.51%,敏感性为 94.66%,特异性为 69.76%。此外,RF 分类表现出最强的判别能力(Cohen's κ=0.2434)。
根据研究结果,我们可以推测 RF 算法在预测孟加拉国五岁以下儿童营养不良状况方面略优于本研究中使用的任何其他 ML 算法。最后,本研究建议在关注营养不良预测时应用 RF 分类和 RF 特征选择。