DeepSEA：一种用于注释抗微生物蛋白的无序列比对可解释方法。

DeepSEA: an alignment-free explainable approach to annotate antimicrobial resistance proteins.

作者信息

Borelli Tiago Cabral, Paschoal Alexandre Rossi, da Silva Ricardo Roberto

机构信息

Computational Chemical Biology Laboratory, Department of BioMolecular Sciences, School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, 14040-900, Brazil.

NPPNS, Department of BioMolecular Sciences, School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, 14040-900, Brazil.

出版信息

BMC Bioinformatics. 2025 Sep 1;26(1):224. doi: 10.1186/s12859-025-06256-4.

DOI:10.1186/s12859-025-06256-4

PMID:40890570

Abstract

Antimicrobial resistance (AMR) is one of the most concerning modern threats as it places a greater burden on health systems than HIV and malaria combined. Current surveillance strategies for tracking antimicrobial resistance (AMR) rely on genomic comparisons and depend on sequence alignment with strict similarity cutoffs of greater than 95%. Therefore, these methods have high false-negative error rates due to a lack of reference sequences with a representative coverage of AMR protein diversity. Deep learning has been used as an alternative to sequence alignment, as artificial neural networks can extract abstract features from data, thereby limiting the need for sequence comparisons. Here, a convolutional neural network (CNN) was trained to differentiate between antimicrobial resistance proteins and non-resistance proteins, and to annotate them in nine resistance classes. Our model demonstrated higher recall values (> 0.9) than the alignment-based approach for all protein classes tested. Additionally, our CNN architecture allowed us to investigate internal states and explain the model classification regarding protein domain feature importance related to antimicrobial molecule inactivation. Finally, we built an open-source bioinformatic tool ( https://github.com/computational-chemical-biology/DeepSEA-project ) that can be used to annotate antimicrobial resistance proteins and provide information on protein domains without sequence alignment.

摘要

抗菌药物耐药性（AMR）是现代最令人担忧的威胁之一，因为它给卫生系统带来的负担比艾滋病和疟疾加起来还要大。目前用于追踪抗菌药物耐药性（AMR）的监测策略依赖于基因组比较，并取决于与大于95%的严格相似性阈值进行序列比对。因此，由于缺乏具有AMR蛋白多样性代表性覆盖范围的参考序列，这些方法具有较高的假阴性错误率。深度学习已被用作序列比对的替代方法，因为人工神经网络可以从数据中提取抽象特征，从而减少对序列比较的需求。在这里，训练了一个卷积神经网络（CNN）来区分抗菌耐药蛋白和非耐药蛋白，并将它们注释为九个耐药类别。对于所有测试的蛋白质类别，我们的模型显示出比基于比对的方法更高的召回值（>0.9）。此外，我们的CNN架构使我们能够研究内部状态，并解释与抗菌分子失活相关的蛋白质结构域特征重要性的模型分类。最后，我们构建了一个开源生物信息工具（https://github.com/computational-chemical-biology/DeepSEA-project），可用于注释抗菌耐药蛋白，并在无需序列比对的情况下提供蛋白质结构域信息。

相似文献

DeepSEA: an alignment-free explainable approach to annotate antimicrobial resistance proteins.DeepSEA：一种用于注释抗微生物蛋白的无序列比对可解释方法。

BMC Bioinformatics. 2025 Sep 1;26(1):224. doi: 10.1186/s12859-025-06256-4.

Prescription of Controlled Substances: Benefits and Risks管制药品的处方：益处与风险

Short-Term Memory Impairment短期记忆障碍

Anterior Approach Total Ankle Arthroplasty with Patient-Specific Cut Guides.使用患者特异性截骨导向器的前路全踝关节置换术。

JBJS Essent Surg Tech. 2025 Aug 15;15(3). doi: 10.2106/JBJS.ST.23.00027. eCollection 2025 Jul-Sep.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究

Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病：网络荟萃分析。

Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

Ophthalmia Neonatorum新生儿眼炎

ToxinPred 3.0: An improved method for predicting the toxicity of peptides.ToxinPred 3.0：一种改进的多肽毒性预测方法。

Comput Biol Med. 2024 Sep;179:108926. doi: 10.1016/j.compbiomed.2024.108926. Epub 2024 Jul 21.

Deep Learning for the Early Detection of Invasive Ductal Carcinoma in Histopathological Images: Convolutional Neural Network Approach With Transfer Learning.基于深度学习的组织病理学图像中浸润性导管癌早期检测：采用迁移学习的卷积神经网络方法

JMIR Form Res. 2025 Aug 21;9:e62996. doi: 10.2196/62996.

本文引用的文献

Guiding questions to avoid data leakage in biological machine learning applications.指导问题以避免生物机器学习应用中的数据泄露。

Nat Methods. 2024 Aug;21(8):1444-1453. doi: 10.1038/s41592-024-02362-y. Epub 2024 Aug 9.

SeqKit2: A Swiss army knife for sequence and alignment processing.SeqKit2：一款用于序列和比对处理的瑞士军刀式工具。

Imeta. 2024 Apr 5;3(3):e191. doi: 10.1002/imt2.191. eCollection 2024 Jun.

PLM-ARG: antibiotic resistance gene identification using a pretrained protein language model.PLM-ARG：使用预先训练的蛋白质语言模型进行抗生素耐药基因识别。

Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad690.

NCRD: A non-redundant comprehensive database for detecting antibiotic resistance genes.NCRD：一个用于检测抗生素耐药基因的非冗余综合数据库。

iScience. 2023 Oct 5;26(11):108141. doi: 10.1016/j.isci.2023.108141. eCollection 2023 Nov 17.

Genomic surveillance for antimicrobial resistance - a One Health perspective.抗菌药物耐药性的基因组监测——一种从“同一健康”角度出发的方法。

Nat Rev Genet. 2024 Feb;25(2):142-157. doi: 10.1038/s41576-023-00649-y. Epub 2023 Sep 25.

Sequence-structure-function relationships in the microbial protein universe.微生物蛋白质宇宙中的序列-结构-功能关系。

Nat Commun. 2023 Apr 26;14(1):2351. doi: 10.1038/s41467-023-37896-w.

Antimicrobial Peptides Designed against the Ω-Loop of Class A β-Lactamases to Potentiate the Efficacy of β-Lactam Antibiotics.针对A类β-内酰胺酶Ω环设计的抗菌肽，以增强β-内酰胺类抗生素的疗效。

Antibiotics (Basel). 2023 Mar 10;12(3):553. doi: 10.3390/antibiotics12030553.

Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。

Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.

Mapping the determinants of catalysis and substrate specificity of the antibiotic resistance enzyme CTX-M β-lactamase.解析抗生素耐药酶 CTX-M β-内酰胺酶的催化和底物特异性的决定因素。

Commun Biol. 2023 Jan 12;6(1):35. doi: 10.1038/s42003-023-04422-z.

Using deep learning to annotate the protein universe.利用深度学习标注蛋白质宇宙。

Nat Biotechnol. 2022 Jun;40(6):932-937. doi: 10.1038/s41587-021-01179-w. Epub 2022 Feb 21.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

DeepSEA：一种用于注释抗微生物蛋白的无序列比对可解释方法。

DeepSEA: an alignment-free explainable approach to annotate antimicrobial resistance proteins.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献