• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习预测并深入了解乳球菌的牛奶酸化速率。

Machine learning predicts and provides insights into milk acidification rates of Lactococcus lactis.

机构信息

Chr. Hansen A/S, Hoersholm, Denmark.

National Food Institute, Technical University of Denmark, Lyngby, Denmark.

出版信息

PLoS One. 2021 Mar 15;16(3):e0246287. doi: 10.1371/journal.pone.0246287. eCollection 2021.

DOI:10.1371/journal.pone.0246287
PMID:33720959
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7959382/
Abstract

Lactococcus lactis strains are important components in industrial starter cultures for cheese manufacturing. They have many strain-dependent properties, which affect the final product. Here, we explored the use of machine learning to create systematic, high-throughput screening methods for these properties. Fast acidification of milk is such a strain-dependent property. To predict the maximum hourly acidification rate (Vmax), we trained Random Forest (RF) models on four different genomic representations: Presence/absence of gene families, counts of Pfam domains, the 8 nucleotide long subsequences of their DNA (8-mers), and the 9 nucleotide long subsequences of their DNA (9-mers). Vmax was measured at different temperatures, volumes, and in the presence or absence of yeast extract. These conditions were added as features in each RF model. The four models were trained on 257 strains, and the correlation between the measured Vmax and the predicted Vmax was evaluated with Pearson Correlation Coefficients (PC) on a separate dataset of 85 strains. The models all had high PC scores: 0.83 (gene presence/absence model), 0.84 (Pfam domain model), 0.76 (8-mer model), and 0.85 (9-mer model). The models all based their predictions on relevant genetic features and showed consensus on systems for lactose metabolism, degradation of casein, and pH stress response. Each model also predicted a set of features not found by the other models.

摘要

乳球菌菌株是奶酪制造工业起始培养物中的重要组成部分。它们具有许多菌株依赖性特性,这些特性会影响最终产品。在这里,我们探索了使用机器学习来创建针对这些特性的系统的高通量筛选方法。快速酸化牛奶就是这样一种菌株依赖性特性。为了预测最大每小时酸化率(Vmax),我们在四个不同的基因组表示形式上训练了随机森林(RF)模型:基因家族的存在/不存在、Pfam 结构域的计数、其 DNA 的 8 个核苷酸长的子序列(8-mers)以及其 DNA 的 9 个核苷酸长的子序列(9-mers)。在不同的温度、体积以及是否存在酵母提取物的情况下测量 Vmax。这些条件被添加为每个 RF 模型的特征。四个模型在 257 株菌株上进行了训练,并在另一个 85 株菌株的数据集上使用 Pearson 相关系数(PC)评估了实测 Vmax 与预测 Vmax 之间的相关性。所有模型的 PC 得分都很高:0.83(基因存在/不存在模型)、0.84(Pfam 结构域模型)、0.76(8-mer 模型)和 0.85(9-mer 模型)。所有模型的预测均基于相关的遗传特征,并在乳糖代谢、酪蛋白降解和 pH 应激反应系统上达成共识。每个模型还预测了一组其他模型未发现的特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a114/7959382/dae3f9c77cc3/pone.0246287.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a114/7959382/1dd8e590c3f6/pone.0246287.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a114/7959382/befbab9610b4/pone.0246287.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a114/7959382/bd9ee245f999/pone.0246287.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a114/7959382/c008f2c9fac4/pone.0246287.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a114/7959382/9b3374e28ac9/pone.0246287.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a114/7959382/dae3f9c77cc3/pone.0246287.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a114/7959382/1dd8e590c3f6/pone.0246287.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a114/7959382/befbab9610b4/pone.0246287.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a114/7959382/bd9ee245f999/pone.0246287.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a114/7959382/c008f2c9fac4/pone.0246287.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a114/7959382/9b3374e28ac9/pone.0246287.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a114/7959382/dae3f9c77cc3/pone.0246287.g006.jpg

相似文献

1
Machine learning predicts and provides insights into milk acidification rates of Lactococcus lactis.机器学习预测并深入了解乳球菌的牛奶酸化速率。
PLoS One. 2021 Mar 15;16(3):e0246287. doi: 10.1371/journal.pone.0246287. eCollection 2021.
2
Effect of dissolved oxygen on redox potential and milk acidification by lactic acid bacteria isolated from a DL-starter culture.溶解氧对从DL发酵剂培养物中分离出的乳酸菌的氧化还原电位和乳酸酸化的影响。
J Dairy Sci. 2015 Mar;98(3):1640-51. doi: 10.3168/jds.2014-8971. Epub 2015 Jan 15.
3
In vivo application and dynamics of lactic acid bacteria for the four-season production of Vastedda-like cheese.用于 Vastedda 样奶酪四季生产的乳酸菌的体内应用和动力学。
Int J Food Microbiol. 2014 May 2;177:37-48. doi: 10.1016/j.ijfoodmicro.2014.02.007. Epub 2014 Feb 20.
4
Resistance to bacteriocin Lcn972 improves oxygen tolerance of Lactococcus lactis IPLA947 without compromising its performance as a dairy starter.对细菌素 Lcn972 的抗性提高了乳酸乳球菌 IPLA947 的耐氧性,而不影响其作为乳制品发酵剂的性能。
BMC Microbiol. 2018 Jul 20;18(1):76. doi: 10.1186/s12866-018-1222-8.
5
Comparison of the acidifying activity of Lactococcus lactis subsp. lactis strains isolated from goat's milk and Valdeteja cheese.从山羊奶和瓦尔代特亚奶酪中分离出的乳酸乳球菌乳酸亚种菌株的酸化活性比较。
Lett Appl Microbiol. 2002;34(2):134-8. doi: 10.1046/j.1472-765x.2002.01061.x.
6
Phenotypic and molecular characterization of Lactococcus lactis from milk and plants.来自牛奶和植物的乳酸乳球菌的表型和分子特征
J Appl Microbiol. 2006 Aug;101(2):396-405. doi: 10.1111/j.1365-2672.2006.02949.x.
7
Effect of Lactococcus garvieae, Lactococcus lactis and Enterococcus faecalis on the behaviour of Staphylococcus aureus in microfiltered milk.加氏乳球菌、乳酸乳球菌和粪肠球菌对微滤乳中金黄色葡萄球菌行为的影响。
Food Microbiol. 2008 May;25(3):502-8. doi: 10.1016/j.fm.2008.01.005. Epub 2008 Jan 29.
8
Transcriptional responses in Lactococcus lactis subsp. cremoris to the changes in oxygen and redox potential during milk acidification.乳酸乳球菌乳脂亚种在牛奶酸化过程中对氧气和氧化还原电位变化的转录反应。
Lett Appl Microbiol. 2016 Aug;63(2):117-23. doi: 10.1111/lam.12596.
9
Bacteriocins produced by wild Lactococcus lactis strains isolated from traditional, starter-free cheeses made of raw milk.从生牛乳传统、无发酵剂奶酪中分离得到的野生乳球菌所产细菌素。
Int J Food Microbiol. 2010 Sep 30;143(1-2):61-6. doi: 10.1016/j.ijfoodmicro.2010.07.029. Epub 2010 Jul 27.
10
Transcriptome analysis of Lactococcus lactis subsp. lactis during milk acidification as affected by dissolved oxygen and the redox potential.乳球菌乳亚种在牛奶酸化过程中转录组分析受溶解氧和氧化还原电位的影响。
Int J Food Microbiol. 2016 Jun 2;226:5-12. doi: 10.1016/j.ijfoodmicro.2016.03.002. Epub 2016 Mar 5.

引用本文的文献

1
Screening approaches and potential of isolated lactic acid bacteria for improving fermentation of Saccharina latissima.用于改善海带发酵的分离乳酸菌的筛选方法及潜力
BMC Biotechnol. 2025 Jan 5;25(1):2. doi: 10.1186/s12896-024-00926-6.
2
Modeling control and transduction of electrochemical gradients in acid-stressed bacteria.酸性胁迫细菌中电化学梯度的建模控制与转导
iScience. 2023 Jun 17;26(7):107140. doi: 10.1016/j.isci.2023.107140. eCollection 2023 Jul 21.
3
A machine learning approach to explore individual risk factors for tuberculosis treatment non-adherence in Mukono district.

本文引用的文献

1
Beyond multidrug resistance: Leveraging rare variants with machine and statistical learning models in Mycobacterium tuberculosis resistance prediction.超越多药耐药性:利用机器和统计学习模型在结核分枝杆菌耐药性预测中的罕见变异。
EBioMedicine. 2019 May;43:356-369. doi: 10.1016/j.ebiom.2019.04.016. Epub 2019 Apr 29.
2
Interpretable genotype-to-phenotype classifiers with performance guarantees.具有性能保证的可解释基因型到表型分类器。
Sci Rep. 2019 Mar 11;9(1):4071. doi: 10.1038/s41598-019-40561-2.
3
Comparative Genome Analysis of Indicates Niche Adaptation and Resolves Genotype/Phenotype Disparity.
一种探索穆科诺区结核病治疗不依从个体风险因素的机器学习方法。
PLOS Glob Public Health. 2023 Jul 3;3(7):e0001466. doi: 10.1371/journal.pgph.0001466. eCollection 2023.
4
From genotype to phenotype: computational approaches for inferring microbial traits relevant to the food industry.从基因型到表型:推断与食品工业相关的微生物特性的计算方法。
FEMS Microbiol Rev. 2023 Jul 5;47(4). doi: 10.1093/femsre/fuad030.
[具体研究对象]的比较基因组分析表明生态位适应并解决了基因型/表型差异。
Front Microbiol. 2019 Jan 31;10:4. doi: 10.3389/fmicb.2019.00004. eCollection 2019.
4
High-throughput screening for texturing Lactococcus strains.高通量筛选纹理乳球菌株。
FEMS Microbiol Lett. 2019 Jan 1;366(2). doi: 10.1093/femsle/fnz001.
5
Prediction of antibiotic resistance in Escherichia coli from large-scale pan-genome data.从大规模泛基因组数据预测大肠杆菌的抗生素耐药性。
PLoS Comput Biol. 2018 Dec 14;14(12):e1006258. doi: 10.1371/journal.pcbi.1006258. eCollection 2018 Dec.
6
The Pfam protein families database in 2019.2019 年 Pfam 蛋白质家族数据库。
Nucleic Acids Res. 2019 Jan 8;47(D1):D427-D432. doi: 10.1093/nar/gky995.
7
A k-mer-based method for the identification of phenotype-associated genomic biomarkers and predicting phenotypes of sequenced bacteria.基于 k-mer 的方法鉴定与表型相关的基因组生物标志物并预测测序细菌的表型。
PLoS Comput Biol. 2018 Oct 22;14(10):e1006434. doi: 10.1371/journal.pcbi.1006434. eCollection 2018 Oct.
8
Predicting bacterial resistance from whole-genome sequences using k-mers and stability selection.基于 k- -mer 和稳定性选择预测全基因组序列中的细菌耐药性。
BMC Bioinformatics. 2018 Oct 17;19(1):383. doi: 10.1186/s12859-018-2403-z.
9
Rapid and precise alignment of raw reads against redundant databases with KMA.使用 KMA 实现原始读取与冗余数据库的快速精确比对。
BMC Bioinformatics. 2018 Aug 29;19(1):307. doi: 10.1186/s12859-018-2336-6.
10
Developing an in silico minimum inhibitory concentration panel test for Klebsiella pneumoniae.开发一种用于肺炎克雷伯菌的计算机模拟最小抑菌浓度检测板试验。
Sci Rep. 2018 Jan 11;8(1):421. doi: 10.1038/s41598-017-18972-w.