Exploring the predictive capability of advanced machine learning in identifying severe disease phenotype in Salmonella enterica.

Affiliations

Department of Nutrition and Food Science, University of Maryland, College Park, MD 20742, USA.

Department of Nutrition and Food Science, University of Maryland, College Park, MD 20742, USA; Center for Food Safety and Security Systems, University of Maryland, College Park, MD 20742, USA.

Publication information

Food Res Int. 2022 Jan;151:110817. doi: 10.1016/j.foodres.2021.110817. Epub 2021 Nov 22.

DOI: 10.1016/j.foodres.2021.110817
PMID: 34980422

Abstract

The past few years have seen a significant increase in availability of whole genome sequencing information, allowing for its incorporation in predictive modeling for foodborne pathogens to account for inter- and intra-species differences in their virulence. However, this is hindered by the inability of traditional statistical methods to analyze such large amounts of data compared to the number of observations/isolates. In this study, we have explored the applicability of machine learning (ML) models to predict the disease outcome, while identifying features that exert a significant effect on the prediction. This study was conducted on Salmonella enterica, a major foodborne pathogen with considerable inter- and intra-serovar variation. WGS of isolates obtained from various sources (i.e., human, chicken, and swine) were used as input in four machine learning models (logistic regression with ridge, random forest, support vector machine, and AdaBoost) to classify isolates based on disease severity (extraintestinal vs. gastrointestinal) in the host. The predictive performances of all models were tested with and without Elastic Net regularization to combat dimensionality issues. Elastic Net-regularized logistic regression model showed the best area under the receiver operating characteristic curve (AUC-ROC; 0.86) and outcome prediction accuracy (0.76). Additionally, genes coding for transcriptional regulation, acidic, oxidative, and anaerobic stress response, and antibiotic resistance were found to be significant predictors of disease severity. These genes, which were significantly associated with each outcome, could possibly be input in amended, gene-expression-specific predictive models to estimate virulence pattern-specific effect of Salmonella and other foodborne pathogens on human health.
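The modeling idea in the abstract (a logistic regression with an Elastic Net penalty, scored by AUC-ROC and accuracy) can be illustrated with a minimal sketch. This is not the authors' pipeline: the data are synthetic binary gene presence/absence vectors, and every function name, hyperparameter (`lam`, `l1_ratio`, `lr`, `epochs`), and the train/test split is a hypothetical choice made for illustration. The fit uses plain proximal gradient descent in pure Python so that the block stays self-contained; in practice a library implementation such as scikit-learn's `LogisticRegression(penalty="elasticnet", solver="saga")` would be the more usual route.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def soft_threshold(w, t):
    # Proximal operator of the L1 penalty: shrinks w toward zero by t.
    if w > t:
        return w - t
    if w < -t:
        return w + t
    return 0.0


def fit_elastic_net_logreg(X, y, lam=0.05, l1_ratio=0.5, lr=0.2, epochs=300):
    """Proximal gradient descent for elastic-net-penalized logistic regression."""
    n, p = len(X), len(X[0])
    w, b = [0.0] * p, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * p, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(b + sum(wj * xj for wj, xj in zip(w, xi))) - yi
            grad_b += err / n
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj / n
        b -= lr * grad_b  # the intercept is not penalized
        for j in range(p):
            # Gradient step on log-loss plus the L2 part, then soft-threshold for L1.
            step = w[j] - lr * (grad_w[j] + lam * (1.0 - l1_ratio) * w[j])
            w[j] = soft_threshold(step, lr * lam * l1_ratio)
    return w, b


def predict_proba(X, w, b):
    return [sigmoid(b + sum(wj * xj for wj, xj in zip(w, xi))) for xi in X]


def auc_roc(y_true, scores):
    # Probability that a random positive outranks a random negative (ties count 0.5).
    pos = [s for s, t in zip(scores, y_true) if t == 1]
    neg = [s for s, t in zip(scores, y_true) if t == 0]
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def accuracy(y_true, scores, threshold=0.5):
    hits = sum(int(s >= threshold) == t for s, t in zip(scores, y_true))
    return hits / len(y_true)


def make_synthetic_isolates(n, p, n_informative, rng):
    # Hypothetical data: binary gene presence/absence per isolate; only the
    # first n_informative genes influence the (noisy) outcome label.
    X, y = [], []
    for _ in range(n):
        xi = [rng.randint(0, 1) for _ in range(p)]
        logit = 2.0 * sum(xi[:n_informative]) - n_informative + rng.gauss(0, 0.5)
        X.append(xi)
        y.append(1 if logit > 0 else 0)
    return X, y


rng = random.Random(0)
X, y = make_synthetic_isolates(n=120, p=40, n_informative=4, rng=rng)
X_train, y_train, X_test, y_test = X[:90], y[:90], X[90:], y[90:]

w, b = fit_elastic_net_logreg(X_train, y_train)
test_scores = predict_proba(X_test, w, b)
test_auc = auc_roc(y_test, test_scores)
test_acc = accuracy(y_test, test_scores)
print("AUC-ROC:", round(test_auc, 2))
print("Accuracy:", round(test_acc, 2))
print("Genes with nonzero weight:", [j for j, wj in enumerate(w) if wj != 0.0])
```

The L1 part of the penalty sets some gene weights exactly to zero, which is what makes the surviving genes readable as candidate predictors, while the L2 part stabilizes the fit when genes far outnumber isolates. The numbers printed here describe only the synthetic data and say nothing about the AUC-ROC of 0.86 or accuracy of 0.76 reported in the abstract.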


Similar articles

1. Exploring the predictive capability of advanced machine learning in identifying severe disease phenotype in Salmonella enterica.
   Food Res Int. 2022 Jan;151:110817. doi: 10.1016/j.foodres.2021.110817. Epub 2021 Nov 22.
2. A machine learning approach to identifying Salmonella stress response genes in isolates from poultry processing.
   Food Res Int. 2024 Jan;175:113635. doi: 10.1016/j.foodres.2023.113635. Epub 2023 Nov 2.
3. The advantage of intergenic regions as genomic features for machine-learning-based host attribution of Typhimurium from the USA.
   Microb Genom. 2023 Oct;9(10). doi: 10.1099/mgen.0.001116.
4. Combining Whole-Genome Sequencing and Multimodel Phenotyping To Identify Genetic Predictors of Virulence.
   mSphere. 2020 Jun 10;5(3):e00293-20. doi: 10.1128/mSphere.00293-20.
5. Whole genome sequencing analysis of multiple Salmonella serovars provides insights into phylogenetic relatedness, antimicrobial resistance, and virulence markers across humans, food animals and agriculture environmental sources.
   BMC Genomics. 2018 Nov 6;19(1):801. doi: 10.1186/s12864-018-5137-4.
6. Genomic Approaches for Understanding the Characteristics of subsp. Serovar Typhimurium ST1120, Isolated from Swine Feces in Korea.
   J Microbiol Biotechnol. 2017 Nov 28;27(11):1983-1993. doi: 10.4014/jmb.1708.08027.
7. Development of a novel machine learning-based weighted modeling approach to incorporate Salmonella enterica heterogeneity on a genetic scale in a dose-response modeling framework.
   Risk Anal. 2023 Mar;43(3):440-450. doi: 10.1111/risa.13924. Epub 2022 Apr 12.
8. Core genome sequence analysis to characterize Salmonella enterica serovar Rissen ST469 from a swine production chain.
   Int J Food Microbiol. 2019 Sep 2;304:68-74. doi: 10.1016/j.ijfoodmicro.2019.05.022. Epub 2019 May 28.
9. Whole-Genome Sequencing Analysis of Nontyphoidal of Chicken Meat and Human Origin Under Surveillance in Sri Lanka.
   Foodborne Pathog Dis. 2019 Jul;16(7):531-537. doi: 10.1089/fpd.2018.2604. Epub 2019 May 21.
10. Machine learning identifies signatures of host adaptation in the bacterial pathogen Salmonella enterica.
    PLoS Genet. 2018 May 8;14(5):e1007333. doi: 10.1371/journal.pgen.1007333. eCollection 2018 May.

Cited by

1. Whole-genome phenotype prediction with machine learning: open problems in bacterial genomics.
   Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf206.
2. Bioinformatics combined with machine learning unravels differences among environmental, seafood, and clinical isolates of .
   Front Microbiol. 2025 Mar 19;16:1549260. doi: 10.3389/fmicb.2025.1549260. eCollection 2025.
3. Using GWAS and Machine Learning to Identify and Predict Genetic Variants Associated with Foodborne Bacteria Phenotypic Traits.
   Methods Mol Biol. 2025;2852:223-253. doi: 10.1007/978-1-0716-4100-2_16.
4. Advancements in Predictive Microbiology: Integrating New Technologies for Efficient Food Safety Models.
   Int J Microbiol. 2024 May 17;2024:6612162. doi: 10.1155/2024/6612162. eCollection 2024.
5. Research gaps and priorities for quantitative microbial risk assessment (QMRA).
   Risk Anal. 2024 Nov;44(11):2521-2536. doi: 10.1111/risa.14318. Epub 2024 May 21.
6. Development and validation of a random forest algorithm for source attribution of animal and human Typhimurium and monophasic variants of Typhimurium isolates in England and Wales utilising whole genome sequencing data.
   Front Microbiol. 2024 Mar 12;14:1254860. doi: 10.3389/fmicb.2023.1254860. eCollection 2023.
7. and Salmonellosis: An Update on Public Health Implications and Control Strategies.
   Animals (Basel). 2023 Nov 27;13(23):3666. doi: 10.3390/ani13233666.
8. The genomic and epidemiological virulence patterns of Salmonella enterica serovars in the United States.
   PLoS One. 2023 Dec 5;18(12):e0294624. doi: 10.1371/journal.pone.0294624. eCollection 2023.
9. Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends.
   Curr Res Food Sci. 2023 May 28;6:100525. doi: 10.1016/j.crfs.2023.100525. eCollection 2023.
10. A Machine Learning Model for Food Source Attribution of .
    Pathogens. 2022 Jun 16;11(6):691. doi: 10.3390/pathogens11060691.