Predicting cow milk quality traits from routinely available milk spectra using statistical machine learning methods.

Affiliations

School of Mathematics and Statistics, University College Dublin, Belfield, Dublin 4, Ireland; Teagasc, Animal & Grassland Research and Innovation Centre, Moorepark, Fermoy, Co. Cork, P61 P302 Ireland.

School of Mathematics and Statistics, University College Dublin, Belfield, Dublin 4, Ireland.

Publication information

J Dairy Sci. 2021 Jul;104(7):7438-7447. doi: 10.3168/jds.2020-19576. Epub 2021 Apr 15.

DOI:10.3168/jds.2020-19576
PMID:33865578
Abstract

Numerous statistical machine learning methods suitable for highly correlated features, such as those found in spectral data, could potentially improve prediction performance over the commonly used partial least squares approach. Milk samples from 622 individual cows with known detailed protein composition and technological trait data, accompanied by mid-infrared spectra, were available to assess the predictive ability of different regression and classification algorithms. The regression-based approaches were partial least squares regression (PLSR), ridge regression (RR), least absolute shrinkage and selection operator (LASSO), elastic net, principal component regression, projection pursuit regression, spike and slab regression, random forests, boosting decision trees, neural networks (NN), and a post-hoc approach of model averaging (MA). Several classification methods, namely partial least squares discriminant analysis (PLSDA), random forests, boosting decision trees, and support vector machines (SVM), were also used after stratifying the traits of interest into categories. In the regression analyses, MA was the best prediction method for 6 of the 14 traits investigated [curd firmness at 60 min, αS1-casein (CN), αS2-CN, κ-CN, α-lactalbumin, and β-lactoglobulin B], whereas NN and RR were each the best algorithm for 3 traits (NN: rennet coagulation time, curd-firming time, and heat stability; RR: curd firmness at 30 min, β-CN, and β-lactoglobulin A). PLSR was best for pH, and LASSO was best for CN micelle size. When traits were divided into 2 classes, SVM had the greatest accuracy for the majority of the traits investigated. Although the well-established PLSR-based method performed competitively, applying statistical machine learning methods in the regression analyses reduced the root mean square error relative to PLSR by between 0.18% (κ-CN) and 3.67% (heat stability). The use of modern statistical machine learning methods for trait prediction from mid-infrared spectroscopy may improve the prediction accuracy for some traits.
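
The comparison described in the abstract rests on two simple quantities: the root mean square error (RMSE) of each model's predictions, and the percentage by which a candidate model lowers the RMSE relative to PLSR. The sketch below illustrates both, together with a post-hoc model average. It is only an illustration under stated assumptions: the numbers are invented toy values, the function names are hypothetical, and the averaging uses equal weights, which may differ from the weighting scheme used in the paper.

```python
import math
from statistics import mean


def rmse(y_true, y_pred):
    """Root mean square error between observed and predicted values."""
    return math.sqrt(mean((t - p) ** 2 for t, p in zip(y_true, y_pred)))


def average_predictions(pred_lists):
    """Equal-weight model averaging: mean prediction per sample.

    Assumption: equal weights. The paper's MA approach may weight models differently.
    """
    return [mean(vals) for vals in zip(*pred_lists)]


def rmse_reduction_pct(rmse_baseline, rmse_candidate):
    """Percentage reduction in RMSE of a candidate relative to a baseline."""
    return 100.0 * (rmse_baseline - rmse_candidate) / rmse_baseline


# Toy values only (not data from the study).
y_true = [10.0, 12.0, 14.0, 16.0]
pred_plsr = [11.0, 13.0, 15.0, 17.0]  # hypothetical baseline predictions
pred_rr = [9.5, 11.5, 13.5, 15.5]     # hypothetical second model

pred_ma = average_predictions([pred_plsr, pred_rr])
rmse_plsr = rmse(y_true, pred_plsr)
rmse_ma = rmse(y_true, pred_ma)
reduction = rmse_reduction_pct(rmse_plsr, rmse_ma)

print(f"RMSE PLSR: {rmse_plsr:.3f}")
print(f"RMSE MA:   {rmse_ma:.3f}")
print(f"Reduction: {reduction:.2f}%")
```

In this toy case the two models err in opposite directions, so averaging cancels much of the error. Real gains, such as the 0.18% to 3.67% reported in the abstract, are far smaller because the errors of competing models on the same spectra tend to be correlated.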

Similar articles

1
Predicting cow milk quality traits from routinely available milk spectra using statistical machine learning methods.
J Dairy Sci. 2021 Jul;104(7):7438-7447. doi: 10.3168/jds.2020-19576. Epub 2021 Apr 15.
2
Milk protein fractions strongly affect the patterns of coagulation, curd firming, and syneresis.
J Dairy Sci. 2019 Apr;102(4):2903-2917. doi: 10.3168/jds.2018-15524. Epub 2019 Feb 14.
3
Prediction of bovine milk technological traits from mid-infrared spectroscopy analysis in dairy cows.
J Dairy Sci. 2015 Sep;98(9):6620-9. doi: 10.3168/jds.2015-9323. Epub 2015 Jul 15.
4
Evaluating the performance of machine learning methods and variable selection methods for predicting difficult-to-measure traits in Holstein dairy cattle using milk infrared spectral data.
J Dairy Sci. 2021 Jul;104(7):8107-8121. doi: 10.3168/jds.2020-19861. Epub 2021 Apr 15.
5
Prediction of individual milk proteins including free amino acids in bovine milk using mid-infrared spectroscopy and their correlations with milk processing characteristics.
J Dairy Sci. 2016 Apr;99(4):3171-3182. doi: 10.3168/jds.2015-9747. Epub 2016 Jan 29.
6
Processing characteristics of dairy cow milk are moderately heritable.
J Dairy Sci. 2017 Aug;100(8):6343-6355. doi: 10.3168/jds.2017-12642. Epub 2017 May 30.
7
Predicting milk protein fractions using infrared spectroscopy and a gradient boosting machine for breeding purposes in Holstein cattle.
J Dairy Sci. 2023 Mar;106(3):1853-1873. doi: 10.3168/jds.2022-22119. Epub 2023 Jan 27.
8
Comparison of milk protein composition and rennet coagulation properties in native Swedish dairy cow breeds and high-yielding Swedish Red cows.
J Dairy Sci. 2017 Nov;100(11):8722-8734. doi: 10.3168/jds.2017-12920. Epub 2017 Sep 13.
9
Factors influencing degree of glycosylation and phosphorylation of caseins in individual cow milk samples.
J Dairy Sci. 2016 May;99(5):3325-3333. doi: 10.3168/jds.2015-10226. Epub 2016 Mar 16.
10
Real-time milk analysis integrated with stacking ensemble learning as a tool for the daily prediction of cheese-making traits in Holstein cattle.
J Dairy Sci. 2022 May;105(5):4237-4255. doi: 10.3168/jds.2021-21426. Epub 2022 Mar 10.

Cited by

1
Comparison of machine learning and validation methods for high-dimensional accelerometer data to detect foot lesions in dairy cattle.
PLoS One. 2025 Jun 27;20(6):e0325927. doi: 10.1371/journal.pone.0325927. eCollection 2025.
2
The Use of Explainable Machine Learning for the Prediction of the Quality of Bulk-Tank Milk in Sheep and Goat Farms.
Foods. 2024 Dec 12;13(24):4015. doi: 10.3390/foods13244015.
3
The Genetic Characteristics of FT-MIRS-Predicted Milk Fatty Acids in Chinese Holstein Cows.
Animals (Basel). 2024 Oct 8;14(19):2901. doi: 10.3390/ani14192901.
4
Beyond the hype: using AI, big data, wearable devices, and the internet of things for high-throughput livestock phenotyping.
Brief Funct Genomics. 2025 Jan 15;24. doi: 10.1093/bfgp/elae032.
5
Heat Stability Assessment of Milk: A Review of Traditional and Innovative Methods.
Foods. 2024 Jul 16;13(14):2236. doi: 10.3390/foods13142236.
6
Establishment and risk factor assessment of the abnormal body temperature probability prediction model (ABTP) for dairy cattle.
Sci Rep. 2024 Jun 24;14(1):14557. doi: 10.1038/s41598-024-65419-0.
7
Possible Alternatives: Identifying and Quantifying Adulteration in Buffalo, Goat, and Camel Milk Using Mid-Infrared Spectroscopy Combined with Modern Statistical Machine Learning Methods.
Foods. 2023 Oct 21;12(20):3856. doi: 10.3390/foods12203856.
8
Evaluating the use of statistical and machine learning methods for estimating breed composition of purebred and crossbred animals in thirteen cattle breeds using genomic information.
Front Genet. 2023 May 15;14:1120312. doi: 10.3389/fgene.2023.1120312. eCollection 2023.
9
Integrating on-farm and genomic information improves the predictive ability of milk infrared prediction of blood indicators of metabolic disorders in dairy cows.
Genet Sel Evol. 2023 Apr 3;55(1):23. doi: 10.1186/s12711-023-00795-1.
10
Predicting starch content in cassava fresh roots using near-infrared spectroscopy.
Front Plant Sci. 2022 Nov 8;13:990250. doi: 10.3389/fpls.2022.990250. eCollection 2022.