Freitas Luara A, Savegnago Rodrigo P, Alves Anderson A C, Costa Ricardo L D, Munari Danisio P, Stafuzza Nedenia B, Rosa Guilherme J M, Paz Claudia C P
Department of Genetics, University of Sao Paulo, Ribeirão Preto 14049-900, SP, Brazil.
Department of Animal and Dairy Sciences, University of Wisconsin, Madison, WI 53706, USA.
Animals (Basel). 2023 Jan 21;13(3):374. doi: 10.3390/ani13030374.
This study investigated the feasibility of using easy-to-measure phenotypic traits to predict sheep resistant, resilient, and susceptible to gastrointestinal nematodes, compared the classification performance of multinomial logistic regression (MLR), linear discriminant analysis (LDA), random forest (RF), and artificial neural network (ANN) methods, and evaluated the applicability of the best classification model on each farm. The database comprised 3654 records of 1250 Santa Inês sheep from 6 farms. The animals were classified into resistant (2605 records), resilient (939 records), and susceptible (110 records) according to fecal egg count and packed cell volume. A random oversampling method was performed to balance the dataset. The classification methods were fitted using the information of age class, the month of record, farm, sex, Famacha© degree, body weight, and body condition score as predictors, and the resistance, resilience, and susceptibility to gastrointestinal nematodes as the target classes to be predicted considering data from all farms randomly. An additional leave-one-farm-out cross-validation technique was used to assess prediction quality across farms. The MLR and LDA models presented good performances in predicting susceptible and resistant animals. The results suggest that the use of readily available records and easily measurable traits may provide useful information for supporting management decisions at the farm level.
本研究调查了使用易于测量的表型特征来预测绵羊对胃肠道线虫的抗性、弹性和易感性的可行性,比较了多项逻辑回归(MLR)、线性判别分析(LDA)、随机森林(RF)和人工神经网络(ANN)方法的分类性能,并评估了最佳分类模型在每个农场的适用性。该数据库包含来自6个农场的1250只圣伊内斯羊的3654条记录。根据粪便虫卵计数和红细胞压积,将这些动物分为抗性(2605条记录)、弹性(939条记录)和易感(110条记录)三类。采用随机过采样方法来平衡数据集。使用年龄组、记录月份、农场、性别、Famacha©等级、体重和体况评分等信息作为预测因子,将对胃肠道线虫的抗性、弹性和易感性作为要预测的目标类别,对所有农场的数据进行随机拟合分类方法。另外采用留一农场交叉验证技术来评估各农场间的预测质量。MLR和LDA模型在预测易感和抗性动物方面表现良好。结果表明,使用现成的记录和易于测量的特征可为支持农场层面的管理决策提供有用信息。