NLP-like deep learning aided in identification and validation of thiosulfinate tolerance clusters in diverse bacteria.

Authors

Myers Brendon K, Lamichhane Anuj, Kvitko Brian H, Dutta Bhabesh

Affiliations

Department of Plant Pathology, The University of Georgia, Tifton, Georgia, USA.

Department of Plant Pathology, The University of Georgia, Athens, Georgia, USA.

Publication information

mSphere. 2025 Jul 29;10(7):e0002325. doi: 10.1128/msphere.00023-25. Epub 2025 Jun 17.

DOI: 10.1128/msphere.00023-25
PMID: 40525872
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12306174/

Abstract

Allicin tolerance () clusters in phytopathogenic bacteria, which provide resistance to thiosulfinates like allicin, are challenging to find using conventional approaches because of their varied architecture and the paradox that they are vertically maintained within genera despite likely being horizontally transferred. The result is substantial sequence diversity that further complicates their identification. Natural language processing (NLP)-like techniques, such as those used in DeepBGC, offer a promising solution: by treating gene clusters like a language, they allow gene clusters to be identified and collected based on patterns and relationships within the sequences. We curated and validated -like clusters in 97-1R, pv. FDAARGOS 389, and Pseudomonas syringae pv. tomato DC3000. Leveraging sequences from the RefSeq bacterial database, we conducted comparative analyses of gene synteny, gene/protein sequences, protein structures, and predicted protein interactions. This approach enabled the discovery of several novel -like clusters previously undetectable by other methods, which were further validated experimentally. Our work highlights the effectiveness of NLP-like techniques for identifying underrepresented gene clusters and expands our understanding of the diversity and utility of -like clusters in diverse bacterial genera. This work demonstrates the potential of these techniques to simplify the identification process and enhance the applicability of biological data in real-world scenarios.

IMPORTANCE

Thiosulfinates, like allicin, are potent antifeedants and antimicrobials produced by Allium species and pose a challenge for phytopathogenic bacteria. Phytopathogenic bacteria have been shown to use an allicin tolerance () gene cluster to circumvent this host response, leading to economically significant yield losses. Because these clusters are complex to mine, we applied techniques akin to natural language processing to analyze Pfam domains and gene proximity. This approach led to the identification of novel -like gene clusters, showcasing the potential of artificial intelligence to reveal elusive and underrepresented gene clusters and to enhance our understanding of their diversity and role across various bacterial genera.
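
The idea of treating a gene cluster "like a language" can be illustrated with a small sketch. It is not the authors' pipeline and not DeepBGC itself, which relies on learned Pfam embeddings and a recurrent neural network. Here, as a deliberately simplified stand-in, each gene is written as a tuple of Pfam-style domain tokens, a window of neighboring genes is treated as a "sentence", and windows are scored by bag-of-tokens cosine similarity to a reference cluster. The domain IDs, `REFERENCE_CLUSTER`, `scan_genome`, and the 0.6 threshold are all hypothetical choices made for illustration.

```python
from collections import Counter
from math import sqrt

# Hypothetical reference cluster: each gene is a tuple of Pfam-style domain IDs.
# The IDs are placeholders, not the domain content of any real cluster.
REFERENCE_CLUSTER = [("PF_A",), ("PF_B", "PF_C"), ("PF_D",), ("PF_E",)]


def domain_counts(genes):
    """Flatten an ordered list of genes into a bag of domain tokens."""
    return Counter(domain for gene in genes for domain in gene)


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def scan_genome(genome, reference, window=None, threshold=0.6):
    """Slide a window of neighboring genes along the genome and return
    (start_index, score) for windows whose domain content resembles the reference."""
    window = window or len(reference)
    ref_vec = domain_counts(reference)
    hits = []
    for start in range(max(len(genome) - window + 1, 1)):
        score = cosine(domain_counts(genome[start:start + window]), ref_vec)
        if score >= threshold:
            hits.append((start, round(score, 3)))
    return hits
```

Because the score ignores gene order inside the window, a candidate whose genes are rearranged relative to the reference still scores highly, which is the property that matters when cluster architecture varies. A learned model would replace the raw token counts with embeddings and the fixed threshold with a trained classifier.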

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/874e/12306174/9eb0539fbc13/msphere.00023-25.f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/874e/12306174/a35a55c08862/msphere.00023-25.f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/874e/12306174/af3eed7617f6/msphere.00023-25.f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/874e/12306174/e8b127117bf8/msphere.00023-25.f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/874e/12306174/d3c9884d772a/msphere.00023-25.f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/874e/12306174/e19e5d6dd30f/msphere.00023-25.f006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/874e/12306174/45b5ad125e39/msphere.00023-25.f007.jpg
