Suppr超能文献

用于预测撒哈拉以南非洲小农户奶牛场不同动物育种服务使用情况的机器学习模型。

Machine learning models for predicting the use of different animal breeding services in smallholder dairy farms in Sub-Saharan Africa.

作者信息

Mwanga G, Lockwood S, Mujibi D F N, Yonah Z, Chagunda M G G

机构信息

The Nelson Mandela African Institution of Science and Technology, Tengeru, Arusha, Tanzania.

Washington State University, Pullman, WA, USA.

出版信息

Trop Anim Health Prod. 2020 May;52(3):1081-1091. doi: 10.1007/s11250-019-02097-5. Epub 2019 Nov 15.

Abstract

This study is concerned with developing predictive models using machine learning techniques to be used in identifying factors that influence farmers' decisions, predict farmers' decisions, and forecast farmers' demands relating to breeding service. The data used to develop the models comes from a survey of small-scale dairy farmers from Tanzania (n = 3500 farmers), Kenya (n = 6190 farmers), Ethiopia (n = 4920 farmers), and Uganda (n = 5390 farmers) and more than 120 variables were identified to influence breeding decisions. Feature engineering process was used to reduce the number of variables to a practical level and to identify the most influential ones. Three algorithms were used for feature selection, namely: logistic regression, random forest, and Boruta. Subsequently, six predictive models, using features selected by feature selection method, were tested for each country-neural network, logistic regression, K-nearest neighbor, decision tree, random forest, and Gaussian mixture model. A combination of decision tree and random forest algorithms was used to develop the final models. Each country model showed high predictive power (up to 93%) and are ready for practical use. The use of ML techniques assisted in identifying the key factors that influence the adoption of breeding method that can be taken and prioritized to improve the dairy sector among countries. Moreover, it provided various alternatives for policymakers to compare the consequences of different courses of action which can assist in determining which alternative at any particular choice point had a high probability to succeed, given the information and alternatives pertinent to the breeding decision. Also, through the use of ML, results to the identification of different clusters of farmers, who were classified based on their farm, and farmers' characteristics, i.e., farm location, feeding system, animal husbandry practices, etc. This information had significant value to decision-makers in finding the appropriate intervention for a particular cluster of farmers. In the future, such predictive models will assist decision-makers in planning and managing resources by allocating breeding services and capabilities where they would be most in demand.

摘要

本研究关注利用机器学习技术开发预测模型,以识别影响奶农决策的因素、预测奶农的决策,并预测奶农对育种服务的需求。用于开发模型的数据来自对坦桑尼亚(3500名奶农)、肯尼亚(6190名奶农)、埃塞俄比亚(4920名奶农)和乌干达(5390名奶农)小规模奶农的调查,共识别出120多个变量会影响育种决策。采用特征工程方法将变量数量减少到实际可行的水平,并识别出最具影响力的变量。使用了三种算法进行特征选择,即逻辑回归、随机森林和博鲁塔算法。随后,针对每个国家,使用通过特征选择方法选择的特征对六种预测模型——神经网络、逻辑回归、K近邻、决策树、随机森林和高斯混合模型进行了测试。使用决策树和随机森林算法的组合来开发最终模型。每个国家的模型都显示出较高的预测能力(高达93%),并已准备好投入实际使用。机器学习技术的应用有助于识别影响育种方法采用的关键因素,这些因素可被采纳并确定优先级,以改善各国的乳业部门。此外,它为政策制定者提供了各种选择,以便比较不同行动方案的后果,这有助于在给定与育种决策相关的信息和选择的情况下,确定在任何特定选择点哪种选择成功的可能性较高。此外,通过使用机器学习,结果识别出了不同的奶农群体,这些群体是根据他们的农场以及奶农的特征(即农场位置、饲养系统、畜牧实践等)进行分类的。这些信息对于决策者为特定奶农群体找到合适的干预措施具有重要价值。未来,此类预测模型将通过在最需要的地方分配育种服务和能力,协助决策者进行资源规划和管理。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验