Application of Multiple Unsupervised Models to Validate Clusters Robustness in Characterizing Smallholder Dairy Farmers.

Authors

Nyambo Devotha G, Luhanga Edith T, Yonah Zaipuna O, Mujibi Fidalis D N

Affiliations

Nelson Mandela African Institution of Science and Technology, P.O. Box 447, Arusha, Tanzania.

USOMI Limited, P.O. Box 105086-00101, Nairobi, Kenya.

Publication

ScientificWorldJournal. 2019 Jan 2;2019:1020521. doi: 10.1155/2019/1020521. eCollection 2019.

DOI:10.1155/2019/1020521
PMID:30718979
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6334318/
Abstract

The heterogeneity of smallholder dairy production systems complicates service provision, information sharing, and dissemination of new technologies, especially those needed to maximize productivity and profitability. To obtain homogeneous groups within which interventions can be made, it is necessary to define clusters of farmers who undertake similar management activities. This paper explores the robustness of production cluster definitions, using several unsupervised learning algorithms to assess the best approach to defining clusters. Data were collected from 8179 smallholder dairy farms in Ethiopia and Tanzania. From a total of 500 variables, the 35 variables used to define production clusters and household membership in these clusters were selected by Principal Component Analysis and domain expert knowledge. Three clustering algorithms, K-means, fuzzy, and Self-Organizing Maps (SOM), were compared in terms of their grouping consistency and prediction accuracy. The model with the least household reallocation between clusters for training and testing data was deemed the most robust. Prediction accuracy was obtained by fitting a fixed effects model that included production clusters to milk yield, sales, and choice of breeding method. Results indicated that, for the Ethiopian dataset, clusters derived from the fuzzy algorithm had the highest predictive power (77% for milk yield and 48% for milk sales), while for the Tanzanian data, clusters derived from Self-Organizing Maps performed best. The average cluster membership reallocation was 15%, 12%, and 34% for K-means, SOM, and fuzzy, respectively, for households in Ethiopia. Given the divergent performance of the algorithms evaluated, it is evident that, despite similar information being available for the study populations, the uniqueness of the data from each country had an overriding influence on cluster robustness and prediction accuracy. The results demonstrate the difficulty of generalizing model application and use across countries and production systems, despite seemingly similar information being collected.
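
The robustness criterion in the abstract (least household reallocation between clusters for training and testing data) can be illustrated with a small sketch. The abstract does not spell out the exact procedure, so the following is only one plausible reading, stated as an assumption: cluster the training and testing households separately with K-means, label every test household under both sets of centroids, and report the share whose cluster changes after the best relabeling. The synthetic two-variable data and the helper names `nearest`, `kmeans`, `make_blobs`, and `reallocation_rate` are hypothetical and not taken from the paper.

```python
import itertools
import random


def nearest(point, centroids):
    """Index of the centroid closest to point (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda j: sum((p - c) ** 2 for p, c in zip(point, centroids[j])),
    )


def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's K-means with random initial centroids; returns k centroids."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centroids)].append(p)
        # Move each centroid to the mean of its group; keep it if the group is empty.
        new = [
            [sum(col) / len(g) for col in zip(*g)] if g else centroids[j]
            for j, g in enumerate(groups)
        ]
        if new == centroids:
            break
        centroids = new
    return centroids


def reallocation_rate(labels_a, labels_b, k):
    """Smallest share of disagreeing labels over all relabelings of labels_b.

    Cluster ids are arbitrary, so every permutation of the k ids is tried.
    This brute force is only practical for small k.
    """
    best = len(labels_a)
    for perm in itertools.permutations(range(k)):
        mismatches = sum(a != perm[b] for a, b in zip(labels_a, labels_b))
        best = min(best, mismatches)
    return best / len(labels_a)


def make_blobs(rng, centers, n_per, spread):
    """Synthetic stand-in for household data: Gaussian blobs in two variables."""
    return [
        [rng.gauss(cx, spread), rng.gauss(cy, spread)]
        for cx, cy in centers
        for _ in range(n_per)
    ]


rng = random.Random(42)
centers = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]  # assumed, illustrative only
train = make_blobs(rng, centers, 60, 1.0)
test = make_blobs(rng, centers, 30, 1.0)

k = 3
c_train = kmeans(train, k, seed=1)
c_test = kmeans(test, k, seed=2)

# Label the same test households under both sets of centroids.
labels_train_model = [nearest(p, c_train) for p in test]
labels_test_model = [nearest(p, c_test) for p in test]

rate = reallocation_rate(labels_train_model, labels_test_model, k)
print(f"share of test households reallocated: {rate:.1%}")
```

A lower rate means the grouping is more stable across the two samples. For larger numbers of clusters, the permutation search would normally be replaced by an assignment solver, and the same comparison could be repeated for fuzzy clustering and SOM to rank the algorithms.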

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/101d/6334318/14b0302632eb/TSWJ2019-1020521.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/101d/6334318/30826f2270f3/TSWJ2019-1020521.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/101d/6334318/15edec4f948f/TSWJ2019-1020521.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/101d/6334318/a85cf976620b/TSWJ2019-1020521.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/101d/6334318/342f0cd8954e/TSWJ2019-1020521.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/101d/6334318/71a68e9ef104/TSWJ2019-1020521.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/101d/6334318/b6ec4561352c/TSWJ2019-1020521.007.jpg

Similar articles

1
Application of Multiple Unsupervised Models to Validate Clusters Robustness in Characterizing Smallholder Dairy Farmers.
ScientificWorldJournal. 2019 Jan 2;2019:1020521. doi: 10.1155/2019/1020521. eCollection 2019.
2
Seasonal variations in the availability of fodder resources and practices of dairy cattle feeding among the smallholder farmers in Western Usambara Highlands, Tanzania.
Trop Anim Health Prod. 2018 Oct;50(7):1653-1664. doi: 10.1007/s11250-018-1609-4. Epub 2018 May 8.
3
Machine learning models for predicting the use of different animal breeding services in smallholder dairy farms in Sub-Saharan Africa.
Trop Anim Health Prod. 2020 May;52(3):1081-1091. doi: 10.1007/s11250-019-02097-5. Epub 2019 Nov 15.
4
Typification and differentiation of smallholder dairy production systems in smallholder mixed farming in the highlands of southern Ethiopia.
PLoS One. 2024 Aug 29;19(8):e0307685. doi: 10.1371/journal.pone.0307685. eCollection 2024.
5
Study on categorization of factors affecting smallholder dairy production in Siltie Zone, Southern Ethiopia, applying multivariate analysis approaches.
Trop Anim Health Prod. 2022 Oct 17;54(6):347. doi: 10.1007/s11250-022-03336-y.
6
A Review of Characterization Approaches for Smallholder Farmers: Towards Predictive Farm Typologies.
ScientificWorldJournal. 2019 May 22;2019:6121467. doi: 10.1155/2019/6121467. eCollection 2019.
7
Rule-Based Engine for Automatic Allocation of Smallholder Dairy Producers in Preidentified Production Clusters.
ScientificWorldJournal. 2022 Jun 30;2022:6944151. doi: 10.1155/2022/6944151. eCollection 2022.
8
Organic dairy farmers put more emphasis on production traits than conventional farmers.
J Dairy Sci. 2016 Dec;99(12):9845-9856. doi: 10.3168/jds.2016-11346. Epub 2016 Sep 28.
9
Farmer-preferred traits in smallholder dairy farming systems in Tanzania.
Trop Anim Health Prod. 2019 Jul;51(6):1337-1344. doi: 10.1007/s11250-018-01796-9. Epub 2019 Feb 4.
10
Performance Evaluation of Highly Admixed Tanzanian Smallholder Dairy Cattle Using SNP Derived Kinship Matrix.
Front Genet. 2019 Apr 26;10:375. doi: 10.3389/fgene.2019.00375. eCollection 2019.

Cited by

1
Disentangling clustering configuration intricacies for divergently selected chicken breeds.
Sci Rep. 2023 Feb 27;13(1):3319. doi: 10.1038/s41598-023-28651-8.
2
Rule-Based Engine for Automatic Allocation of Smallholder Dairy Producers in Preidentified Production Clusters.
ScientificWorldJournal. 2022 Jun 30;2022:6944151. doi: 10.1155/2022/6944151. eCollection 2022.
3
A Review of Characterization Approaches for Smallholder Farmers: Towards Predictive Farm Typologies.
ScientificWorldJournal. 2019 May 22;2019:6121467. doi: 10.1155/2019/6121467. eCollection 2019.
