Tutar Halit, Celik Senol, Er Hasan, Gönülal Erdal
Faculty of Agriculture, Department of Field Crops, Bingöl University, Bingöl, Türkiye.
Faculty of Agriculture, Department of Animal Science, Bingöl University, Bingöl, Türkiye.
PLoS One. 2025 Feb 5;20(2):e0318230. doi: 10.1371/journal.pone.0318230. eCollection 2025.
In this study, the effect of morphological traits on fresh herbage yield of sorghum x sudangrass hybrid plant grown in Konya province, which is the largest cereal production area in Turkey, was analyzed with some data mining methods. For this purpose, Artificial Neural Networks (ANN), Automatic Linear Model (ALM), Random Forest (RF) Algorithm and Multivariate Adaptive Regression Spline (MARS) Algorithm were used, and the prediction performances of these methods were compared. Plant height of 251.22 cm, stem diameter of 7.03 mm, fresh herbage yield of 8010.69 kg da-1, crude protein ratio of 9.09%, acid detergent fiber 33.23%, neutral detergent fiber 57.44%, acid detergent lignin 7.43%, dry matter digestibility of 63.01%, dry matter intake 2.11%, and relative feed value of 103.02 were the descriptive statistical values that were computed. Model fit statistics, including coefficient of determination (R2), adjusted R2, root of mean square error (RMSE), mean absolute percentage error (MAPE), standard deviation ratio (SD ratio), Mean Absolution Error (MAE) and Relative Absolution Error (RAE), were used to evaluate the prediction abilities of the fitted models. The MARS method was shown to be the best model for describing fresh herbage yield, with the lowest values of RMSE, MAPE, SD ratio, MAE and RAE (137.7, 1.488, 0.072, 109.718 and 0.017, respectively), as well as the highest R2 value (0.995) and adjusted R2 value (0.991). The experimental results show that the MARS algorithm is the most suitable model for predicting fresh herbage yield in sorghum x sudangrass hybrid, providing a good alternative to other data mining algorithms.
在本研究中,运用一些数据挖掘方法,分析了形态特征对种植于土耳其最大谷物产区科尼亚省的高粱苏丹草杂交植物鲜草产量的影响。为此,使用了人工神经网络(ANN)、自动线性模型(ALM)、随机森林(RF)算法和多元自适应回归样条(MARS)算法,并比较了这些方法的预测性能。计算得出的描述性统计值为:株高251.22厘米、茎直径7.03毫米、鲜草产量8010.69千克/公顷、粗蛋白比率9.09%、酸性洗涤纤维33.23%、中性洗涤纤维57.44%、酸性洗涤木质素7.43%、干物质消化率63.01%、干物质摄入量2.11%以及相对饲用价值103.02。用于评估拟合模型预测能力的模型拟合统计量包括决定系数(R2)、调整后的R2、均方根误差(RMSE)、平均绝对百分比误差(MAPE)、标准差比率(SD比率)、平均绝对误差(MAE)和相对绝对误差(RAE)。结果表明,MARS方法是描述鲜草产量的最佳模型,其RMSE、MAPE、SD比率、MAE和RAE值最低(分别为137.7、1.488、0.072、109.718和0.017),同时R2值最高(0.995),调整后的R2值也最高(0.991)。实验结果表明,MARS算法是预测高粱苏丹草杂交植物鲜草产量最合适的模型,为其他数据挖掘算法提供了一个很好的替代方案。