Identification of host-microbe interaction factors in the genomes of soft rot-associated pathogens Dickeya dadantii 3937 and Pectobacterium carotovorum WPP14 with supervised machine learning.

Authors

Ma Bing, Charkowski Amy O, Glasner Jeremy D, Perna Nicole T

Affiliation

Genome Center of Wisconsin, University of Wisconsin-Madison, Madison, WI 53706, USA.

Publication information

BMC Genomics. 2014 Jun 21;15:508. doi: 10.1186/1471-2164-15-508.

DOI:10.1186/1471-2164-15-508

PMID:24952641

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC4079955/

Abstract

BACKGROUND

A wealth of genome sequences has provided thousands of genes of unknown function, but identification of functions for the large numbers of hypothetical genes in phytopathogens remains a challenge that impacts all research on plant-microbe interactions. Decades of research on the molecular basis of pathogenesis focused on a limited number of factors associated with long-known host-microbe interaction systems, providing limited direction into this challenge. Computational approaches to identify virulence genes often rely on two strategies: searching for sequence similarity to known host-microbe interaction factors from other organisms, and identifying islands of genes that discriminate between pathogens of one type and closely related non-pathogens or pathogens of a different type. The former is limited to known genes, excluding vast collections of genes of unknown function found in every genome. The latter lacks specificity, since many genes in genomic islands have little to do with host-interaction.

RESULTS

In this study, we developed a supervised machine learning approach that was designed to recognize patterns from large and disparate data types, in order to identify candidate host-microbe interaction factors. The soft rot Enterobacteriaceae strains Dickeya dadantii 3937 and Pectobacterium carotovorum WPP14 were used for development of this tool, because these pathogens are important on multiple high-value crops in agriculture worldwide and more genomic and functional data are available for the Enterobacteriaceae than for any other microbial family. Our approach achieved greater than 90% precision and a recall rate over 80% in 10-fold cross-validation tests.
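To make the reported metrics concrete, here is a minimal Python sketch of how precision and recall can be estimated with 10-fold cross-validation. It is not the authors' pipeline. The single-feature threshold classifier, the function names (`k_fold_indices`, `precision_recall`, `cross_validate`, `fit_threshold`), and the toy scores are all hypothetical, chosen only to keep the example self-contained.

```python
from typing import Callable, List, Sequence, Tuple

Model = Callable[[float], int]


def k_fold_indices(n: int, k: int) -> List[Tuple[List[int], List[int]]]:
    """Split indices 0..n-1 into k contiguous folds; return (train, test) pairs."""
    folds = []
    base, extra = divmod(n, k)
    start = 0
    for i in range(k):
        size = base + (1 if i < extra else 0)
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n))
        folds.append((train, test))
        start += size
    return folds


def precision_recall(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float]:
    """Precision = TP / (TP + FP); recall = TP / (TP + FN)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


def fit_threshold(x_train: Sequence[float], y_train: Sequence[int]) -> Model:
    """Hypothetical stand-in classifier: cut halfway between the class means.

    Assumes the training data contains both classes.
    """
    pos = [x for x, t in zip(x_train, y_train) if t == 1]
    neg = [x for x, t in zip(x_train, y_train) if t == 0]
    cut = (sum(pos) / len(pos) + sum(neg) / len(neg)) / 2
    return lambda x: 1 if x > cut else 0


def cross_validate(
    xs: Sequence[float],
    ys: Sequence[int],
    fit: Callable[[Sequence[float], Sequence[int]], Model],
    k: int = 10,
) -> Tuple[float, float]:
    """Train on k-1 folds, predict the held-out fold, then pool all predictions."""
    y_pred = [0] * len(ys)
    for train, test in k_fold_indices(len(ys), k):
        model = fit([xs[i] for i in train], [ys[i] for i in train])
        for i in test:
            y_pred[i] = model(xs[i])
    return precision_recall(ys, y_pred)


if __name__ == "__main__":
    # Toy data (hypothetical): one score per gene, label 1 = known interaction factor.
    scores = [0.9, 0.1] * 10
    labels = [1, 0] * 10
    precision, recall = cross_validate(scores, labels, fit_threshold, k=10)
    print(f"precision={precision:.2f} recall={recall:.2f}")
```

Each gene is predicted only by a model that never saw it during training, which is why cross-validated precision and recall are a fairer estimate than scores computed on the training set. A real analysis would replace `fit_threshold` with an actual learner, use many features per gene, and typically shuffle or stratify the folds.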

CONCLUSIONS

Application of the learning scheme to the complete genome of these two organisms generated a list of roughly 200 candidates, many of which were previously not implicated in plant-microbe interaction and many of which are of completely unknown function. These lists provide new targets for experimental validation and further characterization, and our approach presents a promising pattern-learning scheme that can be generalized to create a resource to study host-microbe interactions in other bacterial phytopathogens.

Figure 1 (image from the original article): https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f061/4079955/469f2549075e/12864_2013_6186_Fig1_HTML.jpg
