• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全基因组测序与监督式机器学习相结合可明确鉴定产志贺毒素阳性菌。

Combination of whole genome sequencing and supervised machine learning provides unambiguous identification of -positive Shiga toxin-producing .

作者信息

Vorimore Fabien, Jaudou Sandra, Tran Mai-Lan, Richard Hugues, Fach Patrick, Delannoy Sabine

机构信息

ANSES, Laboratory for Food Safety, Genomics Platform IdentyPath, Maisons-Alfort, France.

ANSES, Laboratory for Food Safety, COLiPATH Unit, Maisons-Alfort, France.

出版信息

Front Microbiol. 2023 May 12;14:1118158. doi: 10.3389/fmicb.2023.1118158. eCollection 2023.

DOI:10.3389/fmicb.2023.1118158
PMID:37250024
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10213463/
Abstract

INTRODUCTION

The objective of this study was to develop, using a genome wide machine learning approach, an unambiguous model to predict the presence of highly pathogenic STEC in reads assemblies derived from complex samples containing potentially multiple strains. Our approach has taken into account the high genomic plasticity of and utilized the stratification of STEC and pathogroups classification based on the serotype and virulence factors to identify specific combinations of biomarkers for improved characterization of -positive STEC (also named EHEC for enterohemorrhagic ) which are associated with bloody diarrhea and hemolytic uremic syndrome (HUS) in human.

METHODS

The Machine Learning (ML) approach was used in this study on a large curated dataset composed of 1,493 genome sequences and 1,178 Coding Sequences (CDS). Feature selection has been performed using eight classification algorithms, resulting in a reduction of the number of CDS to six. From this reduced dataset, the eight ML models were trained with hyper-parameter tuning and cross-validation steps.

RESULTS AND DISCUSSION

It is remarkable that only using these six genes, EHEC can be clearly identified from read assemblies obtained from in silico mixtures and complex samples such as milk metagenomes. These various combinations of discriminative biomarkers can be implemented as novel marker genes for the unambiguous EHEC characterization from different strains mixtures as well as from raw milk metagenomes.

摘要

引言

本研究的目的是使用全基因组机器学习方法开发一种明确的模型,以预测在源自可能包含多种菌株的复杂样本的读段组装中高致病性肠出血性大肠杆菌(STEC)的存在。我们的方法考虑到了STEC的高基因组可塑性,并利用基于血清型和毒力因子的STEC及致病菌群分类分层,来识别生物标志物的特定组合,以更好地表征与人的血性腹泻和溶血尿毒综合征(HUS)相关的产志贺毒素大肠杆菌阳性STEC(也称为肠出血性大肠杆菌EHEC)。

方法

本研究对一个由1493个大肠杆菌基因组序列和1178个编码序列(CDS)组成的大型精选数据集使用了机器学习(ML)方法。使用八种分类算法进行了特征选择,将CDS数量减少到六个。从这个精简的数据集中,通过超参数调整和交叉验证步骤训练了八个ML模型。

结果与讨论

值得注意的是,仅使用这六个基因,就可以从计算机模拟混合物和复杂样本(如牛奶宏基因组)获得的读段组装中清晰地鉴定出EHEC。这些具有区分性的生物标志物的各种组合可以作为新的标记基因,用于从不同的大肠杆菌菌株混合物以及生牛奶宏基因组中明确鉴定EHEC。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/998b/10213463/f768bca4c34d/fmicb-14-1118158-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/998b/10213463/83ca39199822/fmicb-14-1118158-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/998b/10213463/1c147e06d154/fmicb-14-1118158-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/998b/10213463/08fcd751e33d/fmicb-14-1118158-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/998b/10213463/f768bca4c34d/fmicb-14-1118158-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/998b/10213463/83ca39199822/fmicb-14-1118158-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/998b/10213463/1c147e06d154/fmicb-14-1118158-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/998b/10213463/08fcd751e33d/fmicb-14-1118158-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/998b/10213463/f768bca4c34d/fmicb-14-1118158-g0004.jpg

相似文献

1
Combination of whole genome sequencing and supervised machine learning provides unambiguous identification of -positive Shiga toxin-producing .全基因组测序与监督式机器学习相结合可明确鉴定产志贺毒素阳性菌。
Front Microbiol. 2023 May 12;14:1118158. doi: 10.3389/fmicb.2023.1118158. eCollection 2023.
2
Identification of genetic markers for differentiation of Shiga toxin-producing, enteropathogenic, and avirulent strains of Escherichia coli O26.鉴定产志贺毒素、肠致病性和无毒大肠杆菌 O26 菌株的遗传标记。
Appl Environ Microbiol. 2011 Apr;77(7):2275-81. doi: 10.1128/AEM.02832-10. Epub 2011 Feb 11.
3
Comparative Genomics of Shiga Toxin-Producing Escherichia coli Strains Isolated from Pediatric Patients with and without Hemolytic Uremic Syndrome from 2000 to 2016 in Finland.2000 年至 2016 年芬兰儿童患者与无溶血尿毒症综合征患者中产志贺毒素大肠埃希菌的比较基因组学研究。
Microbiol Spectr. 2022 Aug 31;10(4):e0066022. doi: 10.1128/spectrum.00660-22. Epub 2022 Jun 22.
4
Genome-wide association study of hemolytic uremic syndrome causing Shiga toxin-producing Escherichia coli from Sweden, 1994-2018.1994 年至 2018 年瑞典产志贺毒素大肠杆菌引起溶血性尿毒综合征的全基因组关联研究。
Eur J Clin Microbiol Infect Dis. 2023 Jun;42(6):771-779. doi: 10.1007/s10096-023-04600-1. Epub 2023 Apr 27.
5
High prevalence and pathogenic potential of Shiga toxin-producing strains in raw mutton and beef in Shandong, China.中国山东生羊肉和牛肉中产志贺毒素菌株的高流行率和致病潜力。
Curr Res Food Sci. 2022 Sep 15;5:1596-1602. doi: 10.1016/j.crfs.2022.08.021. eCollection 2022.
6
Detection of Shiga toxin-producing Escherichia coli serotypes O26:H11, O103:H2, O111:H8, O145:H28, and O157:H7 in raw-milk cheeses by using multiplex real-time PCR.应用多重实时 PCR 检测生乳奶酪中的产志贺毒素大肠杆菌血清型 O26:H11、O103:H2、O111:H8、O145:H28 和 O157:H7。
Appl Environ Microbiol. 2011 Mar;77(6):2035-41. doi: 10.1128/AEM.02089-10. Epub 2011 Jan 14.
7
Improving hazard characterization in microbial risk assessment using next generation sequencing data and machine learning: Predicting clinical outcomes in shigatoxigenic Escherichia coli.利用下一代测序数据和机器学习改进微生物风险评估中的危害特征描述:预测志贺毒素产生性大肠杆菌的临床结果。
Int J Food Microbiol. 2019 Mar 2;292:72-82. doi: 10.1016/j.ijfoodmicro.2018.11.016. Epub 2018 Dec 4.
8
Detection of Shiga toxin (Stx)-producing Escherichia coli (STEC) in bovine dairy herds in Northern Italy.意大利北部奶牛场中产志贺毒素(Stx)大肠杆菌(STEC)的检测。
Int J Food Microbiol. 2014 Aug 1;184:45-9. doi: 10.1016/j.ijfoodmicro.2013.12.033. Epub 2014 Jan 10.
9
Characterization of Shiga toxin-producing Escherichia coli strains isolated from human patients in Germany over a 3-year period.对德国三年内从人类患者中分离出的产志贺毒素大肠杆菌菌株的特性分析。
J Clin Microbiol. 2004 Mar;42(3):1099-108. doi: 10.1128/JCM.42.3.1099-1108.2004.
10
Characterization of Shiga toxin producing E. coli and O157 serotype E. coli isolated in France from healthy domestic cattle.法国健康家养牛源产志贺毒素大肠杆菌和O157血清型大肠杆菌的特性分析
Int J Food Microbiol. 2001 Feb 15;63(3):217-23. doi: 10.1016/s0168-1605(00)00422-0.

引用本文的文献

1
Using GWAS and Machine Learning to Identify and Predict Genetic Variants Associated with Foodborne Bacteria Phenotypic Traits.利用 GWAS 和机器学习识别和预测与食源性病原体表型特征相关的遗传变异。
Methods Mol Biol. 2025;2852:223-253. doi: 10.1007/978-1-0716-4100-2_16.
2
Harmonization of supervised machine learning practices for efficient source attribution of Listeria monocytogenes based on genomic data.基于基因组数据的李斯特菌高效溯源的监督机器学习实践的协调。
BMC Genomics. 2023 Sep 22;24(1):560. doi: 10.1186/s12864-023-09667-w.
3
Precision metagenomics sequencing for food safety: hybrid assembly of Shiga toxin-producing in enriched agricultural water.

本文引用的文献

1
A step forward for Shiga toxin-producing identification and characterization in raw milk using long-read metagenomics.长读宏基因组学在生牛乳中产志贺毒素鉴定和特征分析方面的研究进展。
Microb Genom. 2022 Nov;8(11). doi: 10.1099/mgen.0.000911.
2
Hybrid Assembly from 75 E. coli Genomes Isolated from French Bovine Food Products between 1995 and 2016.1995年至2016年间从法国牛肉制品中分离出的75个大肠杆菌基因组的混合组装。
Microbiol Resour Announc. 2023 Mar 16;12(3):e0109522. doi: 10.1128/mra.01095-22. Epub 2023 Feb 1.
3
: Investigation of Fitness in Soil Does Not Support the Relevance of Ecotypes.
用于食品安全的精准宏基因组学测序:富集农业用水中产志贺毒素菌的混合组装
Front Microbiol. 2023 Aug 31;14:1221668. doi: 10.3389/fmicb.2023.1221668. eCollection 2023.
土壤适应性研究不支持生态型的相关性。
Front Microbiol. 2022 Jun 13;13:917588. doi: 10.3389/fmicb.2022.917588. eCollection 2022.
4
Combination of MALDI-TOF Mass Spectrometry and Machine Learning for Rapid Antimicrobial Resistance Screening: The Case of spp.基质辅助激光解吸电离飞行时间质谱与机器学习相结合用于快速抗菌药物耐药性筛查:以 菌属为例
Front Microbiol. 2022 Feb 18;12:804484. doi: 10.3389/fmicb.2021.804484. eCollection 2021.
5
Characterization of Atypical Shiga Toxin Gene Sequences and Description of Stx2j, a New Subtype.非典型志贺毒素基因序列的特征描述及 Stx2j 新亚型的发现。
J Clin Microbiol. 2022 Mar 16;60(3):e0222921. doi: 10.1128/jcm.02229-21.
6
Genome Informatics and Machine Learning-Based Identification of Antimicrobial Resistance-Encoding Features and Virulence Attributes in Escherichia coli Genomes Representing Globally Prevalent Lineages, Including High-Risk Clonal Complexes.基于基因组信息学和机器学习的方法鉴定全球流行谱系(包括高风险克隆群)中大肠杆菌基因组中的抗药性编码特征和毒力属性。
mBio. 2021 Feb 22;13(1):e0379621. doi: 10.1128/mbio.03796-21. Epub 2022 Feb 15.
7
Insights into the assessment of highly pathogenic Shiga toxin-producing Escherichia coli in raw milk and raw milk cheeses by High Throughput Real-time PCR.高通量实时 PCR 技术在生奶和生奶奶酪中高致病性志贺毒素产生大肠杆菌评估方面的研究进展。
Int J Food Microbiol. 2022 Apr 2;366:109564. doi: 10.1016/j.ijfoodmicro.2022.109564. Epub 2022 Feb 7.
8
The global population structure and evolutionary history of the acquisition of major virulence factor-encoding genetic elements in Shiga toxin-producing O121:H19.产志贺毒素大肠埃希氏菌 O121:H19 中主要毒力因子编码遗传元件获得的全球种群结构和进化历史。
Microb Genom. 2021 Dec;7(12). doi: 10.1099/mgen.0.000716.
9
Pathogenic potential assessment of the Shiga toxin-producing by a source attribution-considered machine learning model.基于溯源归因的机器学习模型评估产志贺毒素大肠埃希氏菌的致病潜能
Proc Natl Acad Sci U S A. 2021 May 18;118(20). doi: 10.1073/pnas.2018877118.
10
The European Union One Health 2019 Zoonoses Report.《欧盟2019年人畜共患病“同一健康”报告》
EFSA J. 2021 Feb 27;19(2):e06406. doi: 10.2903/j.efsa.2021.6406. eCollection 2021 Feb.