• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DepoScope:使用大型语言模型进行准确的噬菌体解聚酶注释和结构域划定。

DepoScope: Accurate phage depolymerase annotation and domain delineation using large language models.

机构信息

Institute for Integrative Systems Biology (I2SysBio), Universitat de Valencia-CSIC, Paterna, Spain.

KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium.

出版信息

PLoS Comput Biol. 2024 Aug 5;20(8):e1011831. doi: 10.1371/journal.pcbi.1011831. eCollection 2024 Aug.

DOI:10.1371/journal.pcbi.1011831
PMID:39102416
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11326577/
Abstract

Bacteriophages (phages) are viruses that infect bacteria. Many of them produce specific enzymes called depolymerases to break down external polysaccharide structures. Accurate annotation and domain identification of these depolymerases are challenging due to their inherent sequence diversity. Hence, we present DepoScope, a machine learning tool that combines a fine-tuned ESM-2 model with a convolutional neural network to identify depolymerase sequences and their enzymatic domains precisely. To accomplish this, we curated a dataset from the INPHARED phage genome database, created a polysaccharide-degrading domain database, and applied sequential filters to construct a high-quality dataset, which is subsequently used to train DepoScope. Our work is the first approach that combines sequence-level predictions with amino-acid-level predictions for accurate depolymerase detection and functional domain identification. In that way, we believe that DepoScope can greatly enhance our understanding of phage-host interactions at the level of depolymerases.

摘要

噬菌体(phages)是感染细菌的病毒。它们中的许多会产生特定的酶,称为解聚酶,以分解外部多糖结构。由于这些解聚酶具有固有的序列多样性,因此对它们进行准确的注释和结构域鉴定具有挑战性。因此,我们提出了 DepoScope,这是一种机器学习工具,它将经过微调的 ESM-2 模型与卷积神经网络相结合,以准确识别解聚酶序列及其酶结构域。为了实现这一目标,我们从 INPHARED 噬菌体基因组数据库中整理了一个数据集,创建了一个多糖降解结构域数据库,并应用顺序过滤器来构建一个高质量的数据集,然后用该数据集来训练 DepoScope。我们的工作是首次将序列级预测与氨基酸级预测相结合,以实现准确的解聚酶检测和功能结构域鉴定。通过这种方式,我们相信 DepoScope 可以极大地增强我们对噬菌体-宿主相互作用在解聚酶水平上的理解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40af/11326577/04b886bca6c5/pcbi.1011831.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40af/11326577/3141e2105da3/pcbi.1011831.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40af/11326577/5249aba0f4d5/pcbi.1011831.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40af/11326577/beabe6ce26fd/pcbi.1011831.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40af/11326577/04b886bca6c5/pcbi.1011831.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40af/11326577/3141e2105da3/pcbi.1011831.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40af/11326577/5249aba0f4d5/pcbi.1011831.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40af/11326577/beabe6ce26fd/pcbi.1011831.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40af/11326577/04b886bca6c5/pcbi.1011831.g004.jpg

相似文献

1
DepoScope: Accurate phage depolymerase annotation and domain delineation using large language models.DepoScope:使用大型语言模型进行准确的噬菌体解聚酶注释和结构域划定。
PLoS Comput Biol. 2024 Aug 5;20(8):e1011831. doi: 10.1371/journal.pcbi.1011831. eCollection 2024 Aug.
2
Isolation and Characterization of Two Phages Encoding Divergent Depolymerases.分离并鉴定两株编码不同解聚酶的噬菌体。
Int J Mol Sci. 2020 Apr 30;21(9):3160. doi: 10.3390/ijms21093160.
3
Two T7-like Bacteriophages, K5-2 and K5-4, Each Encodes Two Capsule Depolymerases: Isolation and Functional Characterization.两种 T7 样噬菌体 K5-2 和 K5-4,各自编码两种荚膜降解酶:分离与功能特性分析。
Sci Rep. 2017 Jul 4;7(1):4624. doi: 10.1038/s41598-017-04644-2.
4
Genomic and Biochemical Characterization of Acinetobacter Podophage Petty Reveals a Novel Lysis Mechanism and Tail-Associated Depolymerase Activity.短小杆菌噬菌体 Petty 的基因组和生化特性分析揭示了一种新的溶菌机制和尾部相关的解聚酶活性。
J Virol. 2018 Feb 26;92(6). doi: 10.1128/JVI.01064-17. Print 2018 Mar 15.
5
Mechanistic Insights into the Capsule-Targeting Depolymerase from a Klebsiella pneumoniae Bacteriophage.对一株肺炎克雷伯氏菌噬菌体的囊靶向解聚酶的机制见解。
Microbiol Spectr. 2021 Sep 3;9(1):e0102321. doi: 10.1128/Spectrum.01023-21. Epub 2021 Aug 25.
6
Identification of three podoviruses infecting Klebsiella encoding capsule depolymerases that digest specific capsular types.鉴定出三种感染产酸克雷伯菌的短尾噬菌体,编码能消化特定荚膜类型的荚膜降解酶。
Microb Biotechnol. 2019 May;12(3):472-486. doi: 10.1111/1751-7915.13370. Epub 2019 Jan 31.
7
Identification of a phage-derived depolymerase specific for KL64 capsule of Klebsiella pneumoniae and its anti-biofilm effect.鉴定一种针对肺炎克雷伯菌 KL64 荚膜的噬菌体来源的解聚酶及其抗生物膜作用。
Virus Genes. 2021 Oct;57(5):434-442. doi: 10.1007/s11262-021-01847-8. Epub 2021 Jun 22.
8
Biochemical and molecular characterization of the Pseudomonas lemoignei polyhydroxyalkanoate depolymerase system.勒莫因氏假单胞菌聚羟基脂肪酸酯解聚酶系统的生化与分子特征
J Bacteriol. 1995 Feb;177(3):596-607. doi: 10.1128/jb.177.3.596-607.1995.
9
Mechanisms of Acinetobacter baumannii Capsular Polysaccharide Cleavage by Phage Depolymerases.噬菌体解聚酶对鲍曼不动杆菌荚膜多糖的水解机制。
Biochemistry (Mosc). 2020 May;85(5):567-574. doi: 10.1134/S0006297920050053.
10
Poly(3-hydroxyvalerate) depolymerase of Pseudomonas lemoignei.勒莫因氏假单胞菌的聚(3-羟基戊酸酯)解聚酶
Appl Environ Microbiol. 2000 Apr;66(4):1385-92. doi: 10.1128/AEM.66.4.1385-1392.2000.

引用本文的文献

1
Advanced Strategies in Phage Research: Innovations, Applications, and Challenges.噬菌体研究的先进策略:创新、应用与挑战
Microorganisms. 2025 Aug 21;13(8):1960. doi: 10.3390/microorganisms13081960.
2
Phage Host Range Expansion Through Directed Evolution on Highly Phage-Resistant Strains of .通过在高度抗噬菌体菌株上进行定向进化实现噬菌体宿主范围扩展 。 (原文句子不完整,此处补充完整句子结构以便理解翻译内容)
Int J Mol Sci. 2025 Aug 6;26(15):7597. doi: 10.3390/ijms26157597.
3
Optimizing phage therapy with artificial intelligence: a perspective.利用人工智能优化噬菌体疗法:一种观点。

本文引用的文献

1
Large language models improve annotation of prokaryotic viral proteins.大语言模型提高原核病毒蛋白的注释效果。
Nat Microbiol. 2024 Feb;9(2):537-549. doi: 10.1038/s41564-023-01584-8. Epub 2024 Jan 29.
2
DePolymerase Predictor (DePP): a machine learning tool for the targeted identification of phage depolymerases.去聚合酶预测器(DePP):一种用于靶向鉴定噬菌体去聚合酶的机器学习工具。
BMC Bioinformatics. 2023 May 19;24(1):208. doi: 10.1186/s12859-023-05341-w.
3
Fast and accurate protein structure search with Foldseek.使用 Foldseek 进行快速准确的蛋白质结构搜索。
Front Cell Infect Microbiol. 2025 May 27;15:1611857. doi: 10.3389/fcimb.2025.1611857. eCollection 2025.
4
Renewed insights into Ackermannviridae phage biology and applications.对艾克曼病毒科噬菌体生物学及应用的新见解。
Npj Viruses. 2024 Aug 21;2(1):37. doi: 10.1038/s44298-024-00046-0.
5
Phage-derived proteins: Advancing food safety through biocontrol and detection of foodborne pathogens.噬菌体衍生蛋白:通过生物防治和食源性病原体检测推进食品安全。
Compr Rev Food Sci Food Saf. 2025 Mar;24(2):e70124. doi: 10.1111/1541-4337.70124.
6
Isolation and Characterization of Lytic Phages Infecting Clinical from Tunisia.来自突尼斯的感染临床菌株的裂解性噬菌体的分离与鉴定
Antibiotics (Basel). 2024 Dec 2;13(12):1154. doi: 10.3390/antibiotics13121154.
7
Characterization of four novel bacteriophages targeting multi-drug resistant strains of sequence type 147 and 307.鉴定针对序列型 147 和 307 多重耐药株的四种新型噬菌体。
Front Cell Infect Microbiol. 2024 Oct 4;14:1473668. doi: 10.3389/fcimb.2024.1473668. eCollection 2024.
8
Targeted phage hunting to specific clinical isolates is an efficient antibiotic resistance and infection control strategy.靶向噬菌体猎捕特定临床分离株是一种有效的抗生素耐药性和感染控制策略。
Microbiol Spectr. 2024 Oct 3;12(10):e0025424. doi: 10.1128/spectrum.00254-24. Epub 2024 Aug 28.
Nat Biotechnol. 2024 Feb;42(2):243-246. doi: 10.1038/s41587-023-01773-0. Epub 2023 May 8.
4
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
5
InterPro in 2022.InterPro 在 2022 年。
Nucleic Acids Res. 2023 Jan 6;51(D1):D418-D427. doi: 10.1093/nar/gkac993.
6
INfrastructure for a PHAge REference Database: Identification of Large-Scale Biases in the Current Collection of Cultured Phage Genomes.噬菌体参考数据库的基础设施:识别当前培养噬菌体基因组集合中的大规模偏差
Phage (New Rochelle). 2021 Dec 1;2(4):214-223. doi: 10.1089/phage.2021.0007. Epub 2021 Dec 16.
7
Exploiting phage-derived carbohydrate depolymerases for combating infectious diseases.利用噬菌体衍生的碳水化合物解聚酶对抗传染病。
Trends Microbiol. 2022 Aug;30(8):707-709. doi: 10.1016/j.tim.2022.05.002. Epub 2022 Jun 9.
8
SWORD2: hierarchical analysis of protein 3D structures.SWORD2:蛋白质 3D 结构的层次分析。
Nucleic Acids Res. 2022 Jul 5;50(W1):W732-W738. doi: 10.1093/nar/gkac370.
9
Learning meaningful representations of protein sequences.学习蛋白质序列有意义的表示方法。
Nat Commun. 2022 Apr 8;13(1):1914. doi: 10.1038/s41467-022-29443-w.
10
Using deep learning to annotate the protein universe.利用深度学习标注蛋白质宇宙。
Nat Biotechnol. 2022 Jun;40(6):932-937. doi: 10.1038/s41587-021-01179-w. Epub 2022 Feb 21.