Zaborski Daniel, Grzesiak Wilhelm, Fatih Abdul, Faraz Asim, Tariq Mohammad Masood, Sheikh Irfan Shahzad, Waheed Abdul, Ullah Asad, Marghazani Illahi Bakhsh, Mustafa Muhammad Zahid, Tırınk Cem, Celik Senol, Stadnytska Olha, Klym Oleh
Laboratory of Biostatistics, Bioinformatics and Animal Research, West Pomeranian University of Technology, 71-270 Szczecin, Poland.
Centre for Advanced Studies in Vaccinology and Biotechnology (CASVAB), University of Balochistan, Quetta 87300, Pakistan.
Animals (Basel). 2025 Jul 11;15(14):2051. doi: 10.3390/ani15142051.
The determination of the live body weight of camels (required for their successful breeding) is a rather difficult task due to the problems with handling and restraining these animals. Therefore, the main aim of this study was to predict the ABW of eight indigenous camel () breeds of Pakistan (Bravhi, Kachi, Kharani, Kohi, Lassi, Makrani, Pishin, and Rodbari). Selected productive (hair production, milk yield per lactation, and lactation length) and reproductive (age of puberty, age at first breeding, gestation period, dry period, and calving interval) traits served as the predictors. Six data mining methods [classification and regression trees (CARTs), chi-square automatic interaction detector (CHAID), exhaustive CHAID (EXCHAID), multivariate adaptive regression splines (MARSs), MLP, and RBF] were applied for ABW prediction. Additionally, hierarchical cluster analysis with Euclidean distance was performed for the phenotypic characterization of the camel breeds. The highest Pearson correlation coefficient between the observed and predicted values (0.84, < 0.05) was obtained for MLP, which was also characterized by the lowest root-mean-square error (RMSE) (20.86 kg), standard deviation ratio (SD) (0.54), mean absolute percentage error (MAPE) (2.44%), and mean absolute deviation (MAD) (16.45 kg). The most influential predictor for all the models was the camel breed. The applied methods allowed for the moderately accurate prediction of ABW (average R equal to 65.0%) and the identification of the most important productive and reproductive traits affecting its value. However, one important limitation of the present study is its relatively small dataset, especially for training the ANN (MLP and RBF). Hence, the obtained preliminary results should be validated on larger datasets in the future.
确定骆驼的活体体重(这对其成功繁殖至关重要)是一项相当困难的任务,因为在处理和控制这些动物时存在问题。因此,本研究的主要目的是预测巴基斯坦八个本土骆驼品种(布拉维、卡奇、卡拉尼、科希、拉西、马克拉尼、皮申和罗德巴里)的成年体重。选定的生产性状(产毛量、每次泌乳期的产奶量和泌乳期长度)和繁殖性状(初情期年龄、首次配种年龄、妊娠期、干奶期和产犊间隔)作为预测因子。应用六种数据挖掘方法[分类回归树(CART)、卡方自动交互检测器(CHAID)、详尽CHAID(EXCHAID)、多元自适应回归样条(MARS)、多层感知器(MLP)和径向基函数(RBF)]进行成年体重预测。此外,采用欧氏距离进行层次聚类分析,以对骆驼品种进行表型特征描述。MLP获得的观测值与预测值之间的皮尔逊相关系数最高(0.84,P<0.05),其均方根误差(RMSE)(20.86千克)、标准差比(SD)(0.54)、平均绝对百分比误差(MAPE)(2.44%)和平均绝对偏差(MAD)(16.45千克)也最低。对所有模型影响最大的预测因子是骆驼品种。所应用的方法能够对成年体重进行中等准确的预测(平均R等于百分之65.0),并识别出影响其数值的最重要的生产和繁殖性状。然而,本研究的一个重要局限性是其数据集相对较小,尤其是用于训练人工神经网络(MLP和RBF)的数据。因此,未来应在更大的数据集中对获得的初步结果进行验证。