DeepT3: deep convolutional neural networks accurately identify Gram-negative bacterial type III secreted effectors using the N-terminal sequence.

Affiliations

School of Public Health, Southwest Medical University, Luzhou, Sichuan, PR China.

Basic Medical College of Southwest Medical University, Luzhou, Sichuan, PR China.

Publication information

Bioinformatics. 2019 Jun 1;35(12):2051-2057. doi: 10.1093/bioinformatics/bty931.

DOI:10.1093/bioinformatics/bty931

PMID:30407530

Abstract

MOTIVATION

Various bacterial pathogens can deliver their secreted substrates, also called effectors, into host cells through Type III secretion systems (T3SSs) and cause disease. Since T3SS secreted effectors (T3SEs) play important roles in pathogen-host interactions, identifying them is crucial to our understanding of the pathogenic mechanisms of T3SSs. However, effectors display a high level of sequence diversity, which makes identification difficult. There is a need for a novel and effective method to screen bacterial genomes for putative novel effectors that can then be validated with a small number of key experiments.

RESULTS

We develop a deep convolutional neural network that directly classifies any protein sequence as a T3SE or non-T3SE, which is useful for both effector prediction and the study of sequence-function relationships. Unlike traditional machine learning-based methods, our method automatically extracts T3SE-related features from the 100-residue N-terminal sequence of a protein and maps it to the T3SE space. We train and test our method on datasets curated from 16 species, yielding an average classification accuracy of 83.7% in 5-fold cross-validation and an accuracy of 92.6% on the test set. Moreover, compared with known state-of-the-art prediction methods, our method is 6.31-20.73% more accurate on a common independent dataset. In addition, we visualize the convolutional kernels and identify key features of T3SEs that carry important signal information for secretion. Finally, some effectors reported in the literature are used to further demonstrate the application of DeepT3.
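The input pipeline described above (take the first 100 N-terminal residues, extract features with convolutional kernels, map to a T3SE score) can be illustrated with a minimal NumPy sketch. This is not the DeepT3 implementation: the one-hot encoding, kernel width, number of kernels, single convolution layer and logistic output are all assumptions made for illustration, the function names are hypothetical, and the weights are random rather than trained. Only the 100-residue N-terminal window comes from the abstract.

```python
import numpy as np

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
AA_INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}
WINDOW = 100  # N-terminal length stated in the abstract


def encode_n_terminus(sequence, window=WINDOW):
    """One-hot encode the first `window` residues as a (window, 20) array.

    Shorter sequences are zero-padded; unknown residues (e.g. X) stay all-zero.
    The one-hot scheme is an assumption, not taken from the paper.
    """
    encoded = np.zeros((window, len(AMINO_ACIDS)), dtype=np.float32)
    for pos, residue in enumerate(sequence.upper()[:window]):
        idx = AA_INDEX.get(residue)
        if idx is not None:
            encoded[pos, idx] = 1.0
    return encoded


def conv1d_max(encoded, kernels):
    """Valid 1D convolution along the sequence, ReLU, then global max pooling.

    kernels has shape (n_kernels, width, 20); the result has shape (n_kernels,).
    """
    _, width, _ = kernels.shape
    n_pos = encoded.shape[0] - width + 1
    # Stack every sliding window: shape (n_pos, width, 20).
    windows = np.stack([encoded[i:i + width] for i in range(n_pos)])
    # Dot each window with each kernel: shape (n_pos, n_kernels).
    activations = np.einsum("pwc,kwc->pk", windows, kernels)
    return np.maximum(activations, 0.0).max(axis=0)


def predict_proba(sequence, kernels, weights, bias):
    """Map pooled kernel activations to a score in [0, 1] via a logistic unit."""
    features = conv1d_max(encode_n_terminus(sequence), kernels)
    logit = float(features @ weights + bias)
    return 1.0 / (1.0 + np.exp(-logit))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    demo_kernels = rng.normal(size=(8, 9, 20)).astype(np.float32)  # untrained
    demo_weights = rng.normal(size=8)
    # Hypothetical toy sequence, not a real effector.
    print(predict_proba("MKISSLNTPQAGSEVR" * 8, demo_kernels, demo_weights, 0.0))
```

With random weights the printed score is meaningless; the sketch only shows how a fixed-length N-terminal window becomes a tensor that convolutional kernels can scan. Global max pooling also means each kernel reports its single strongest match along the sequence, which is one reason individual kernels can be inspected as motif-like detectors.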

AVAILABILITY AND IMPLEMENTATION

DeepT3 is freely available at: https://github.com/lje00006/DeepT3.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Similar articles

DeepT3 2.0: improving type III secreted effector predictions by an integrative deep learning framework.

NAR Genom Bioinform. 2021 Oct 4;3(4):lqab086. doi: 10.1093/nargab/lqab086. eCollection 2021 Dec.

DeepT3_4: A Hybrid Deep Neural Network Model for the Distinction Between Bacterial Type III and IV Secreted Effectors.

Front Microbiol. 2021 Jan 21;12:605782. doi: 10.3389/fmicb.2021.605782. eCollection 2021.

ACNNT3: Attention-CNN Framework for Prediction of Sequence-Based Bacterial Type III Secreted Effectors.

Comput Math Methods Med. 2020 Apr 3;2020:3974598. doi: 10.1155/2020/3974598. eCollection 2020.

T3SEpp: an Integrated Prediction Pipeline for Bacterial Type III Secreted Effectors.

mSystems. 2020 Aug 4;5(4):e00288-20. doi: 10.1128/mSystems.00288-20.

Bastion3: a two-layer ensemble predictor of type III secreted effectors.

Bioinformatics. 2019 Jun 1;35(12):2017-2028. doi: 10.1093/bioinformatics/bty914.

Computational prediction of type III secreted proteins from gram-negative bacteria.

BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S47. doi: 10.1186/1471-2105-11-S1-S47.

Effective identification of Gram-negative bacterial type III secreted effectors using position-specific residue conservation profiles.

PLoS One. 2013 Dec 31;8(12):e84439. doi: 10.1371/journal.pone.0084439. eCollection 2013.

T3SEdb: data warehousing of virulence effectors secreted by the bacterial Type III Secretion System.

BMC Bioinformatics. 2010 Oct 15;11 Suppl 7(Suppl 7):S4. doi: 10.1186/1471-2105-11-S7-S4.

BEAN 2.0: an integrated web resource for the identification and functional analysis of type III secreted effectors.

Database (Oxford). 2015 Jun 27;2015:bav064. doi: 10.1093/database/bav064. Print 2015.

Cited by

T4Seeker: a hybrid model for type IV secretion effectors identification.

BMC Biol. 2024 Nov 14;22(1):259. doi: 10.1186/s12915-024-02064-z.

A predictive approach for host-pathogen interactions using deep learning and protein sequences.

Virusdisease. 2024 Sep;35(3):434-445. doi: 10.1007/s13337-024-00882-x. Epub 2024 Jul 16.

POOE: predicting oomycete effectors based on a pre-trained large protein language model.

mSystems. 2024 Jan 23;9(1):e0100423. doi: 10.1128/msystems.01004-23. Epub 2023 Dec 11.

Protein Sorting Prediction.

Methods Mol Biol. 2024;2715:27-63. doi: 10.1007/978-1-0716-3445-5_2.

Natural language processing approach to model the secretion signal of type III effectors.

Front Plant Sci. 2022 Oct 31;13:1024405. doi: 10.3389/fpls.2022.1024405. eCollection 2022.

Microbial Effectors: Key Determinants in Plant Health and Disease.

Microorganisms. 2022 Oct 6;10(10):1980. doi: 10.3390/microorganisms10101980.

Protein Science Meets Artificial Intelligence: A Systematic Review and a Biochemical Meta-Analysis of an Inter-Field.

Front Bioeng Biotechnol. 2022 Jul 7;10:788300. doi: 10.3389/fbioe.2022.788300. eCollection 2022.

Computational Systems Biology of Alfalfa - Bacterial Blight Host-Pathogen Interactions: Uncovering the Complex Molecular Networks for Developing Durable Disease Resistant Crop.

Front Plant Sci. 2022 Feb 17;12:807354. doi: 10.3389/fpls.2021.807354. eCollection 2021.

T1SEstacker: A Tri-Layer Stacking Model Effectively Predicts Bacterial Type 1 Secreted Proteins Based on C-Terminal Non-repeats-in-Toxin-Motif Sequence Features.

Front Microbiol. 2022 Feb 8;12:813094. doi: 10.3389/fmicb.2021.813094. eCollection 2021.

ProtPlat: an efficient pre-training platform for protein classification based on FastText.

BMC Bioinformatics. 2022 Feb 11;23(1):66. doi: 10.1186/s12859-022-04604-2.
