Department of Nutrition and Food Science, University of Maryland, College Park, MD 20742, USA.
Department of Agriculture, Food and Resource Sciences, University of Maryland Eastern Shore, Princess Anne, MD 21853, USA.
Food Res Int. 2024 Jul;188:114464. doi: 10.1016/j.foodres.2024.114464. Epub 2024 May 10.
Vibrio parahaemolyticus and Vibrio vulnificus are bacteria with a significant public health impact. Identifying factors impacting their presence and concentrations in food sources could enable the identification of significant risk factors and prevent incidences of foodborne illness. In recent years, machine learning has shown promise in modeling microbial presence based on prevalent external and internal variables, such as environmental variables and gene presence/absence, respectively, particularly with the generation and availability of large amounts and diverse sources of data. Such analyses can prove useful in predicting microbial behavior in food systems, particularly under the influence of the constant changes in environmental variables. In this study, we tested the efficacy of six machine learning regression models (random forest, support vector machine, elastic net, neural network, k-nearest neighbors, and extreme gradient boosting) in predicting the relationship between environmental variables and total and pathogenic V. parahaemolyticus and V. vulnificus concentrations in seawater and oysters. In general, environmental variables were found to be reliable predictors of total and pathogenic V. parahaemolyticus and V. vulnificus concentrations in seawater, and pathogenic V. parahaemolyticus in oysters (Acceptable Prediction Zone >70 %) when analyzed using our machine learning models. SHapley Additive exPlanations, which was used to identify variables influencing Vibrio concentrations, identified chlorophyll a content, seawater salinity, seawater temperature, and turbidity as influential variables. It is important to note that different strains were differentially impacted by the same environmental variable, indicating the need for further research to study the causes and potential mechanisms of these variations. In conclusion, environmental variables could be important predictors of Vibrio growth and behavior in seafood. Moreover, the models developed in this study could prove invaluable in assessing and managing the risks associated with V. parahaemolyticus and V. vulnificus, particularly in the face of a changing environment.
副溶血性弧菌和创伤弧菌是对公共健康有重大影响的细菌。确定影响其在食物来源中存在和浓度的因素,可以识别出重要的风险因素,防止食源性疾病的发生。近年来,机器学习在基于常见的外部和内部变量(分别为环境变量和基因存在/缺失)对微生物存在进行建模方面显示出了前景,尤其是在大量和多样化的数据生成和可用性方面。这些分析可用于预测食品系统中的微生物行为,特别是在环境变量不断变化的情况下。在这项研究中,我们测试了六种机器学习回归模型(随机森林、支持向量机、弹性网络、神经网络、k-最近邻和极端梯度提升)预测环境变量与海水中总致病性副溶血性弧菌和创伤弧菌浓度以及牡蛎中致病性副溶血性弧菌浓度之间关系的功效。总的来说,当使用我们的机器学习模型进行分析时,环境变量被发现是海水中总致病性副溶血性弧菌和创伤弧菌浓度以及牡蛎中致病性副溶血性弧菌浓度的可靠预测因子(可接受的预测区>70%)。Shapley Additive exPlanations 用于识别影响弧菌浓度的变量,确定叶绿素 a 含量、海水盐度、海水温度和浊度为有影响的变量。值得注意的是,同一环境变量对不同菌株的影响不同,这表明需要进一步研究以研究这些变化的原因和潜在机制。总之,环境变量可能是海产品中弧菌生长和行为的重要预测因子。此外,本研究中开发的模型在评估和管理副溶血性弧菌和创伤弧菌相关风险方面可能具有重要价值,特别是在面对不断变化的环境时。