• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

T4Seeker:一种用于 IV 型分泌效应器识别的混合模型。

T4Seeker: a hybrid model for type IV secretion effectors identification.

机构信息

Department of Microbiology, University of Hong Kong, Hong Kong, China.

Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, 1 Chengdian Road, Quzhou, Zhejiang, China.

出版信息

BMC Biol. 2024 Nov 14;22(1):259. doi: 10.1186/s12915-024-02064-z.

DOI:10.1186/s12915-024-02064-z
PMID:39543674
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11566746/
Abstract

BACKGROUND

The type IV secretion system is widely present in various bacteria, such as Salmonella, Escherichia coli, and Helicobacter pylori. These bacteria use the type IV secretion system to secrete type IV secretion effectors, infect host cells, and disrupt or modulate the communication pathways. In this study, type III and type VI secretion effectors were used as negative samples to train a robust model.

RESULTS

The area under the curve of T4Seeker on the validation and independent test sets were 0.947 and 0.970, respectively, demonstrating the strong predictive capacity and robustness of T4Seeker. After comparing with the classic and state-of-the-art T4SE identification models, we found that T4Seeker, which is based on traditional features and large language model features, had a higher predictive ability.

CONCLUSION

The T4Seeker proposed in this study demonstrates superior performance in the field of T4SEs prediction. By integrating features at multiple levels, it achieves higher predictive accuracy and strong generalization capability, providing an effective tool for future T4SE research.

摘要

背景

IV 型分泌系统广泛存在于各种细菌中,如沙门氏菌、大肠杆菌和幽门螺杆菌。这些细菌利用 IV 型分泌系统分泌 IV 型分泌效应物,感染宿主细胞,并破坏或调节通讯途径。在这项研究中,III 型和 VI 型分泌效应物被用作阴性样本来训练一个强大的模型。

结果

T4Seeker 在验证集和独立测试集上的曲线下面积分别为 0.947 和 0.970,表明 T4Seeker 具有很强的预测能力和稳健性。与经典和最先进的 T4SE 识别模型进行比较后,我们发现基于传统特征和大型语言模型特征的 T4Seeker 具有更高的预测能力。

结论

本研究提出的 T4Seeker 在 T4SEs 预测领域表现出优异的性能。通过整合多个层次的特征,它实现了更高的预测准确性和强大的泛化能力,为未来的 T4SE 研究提供了有效的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dea/11566746/f9c1483f4b83/12915_2024_2064_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dea/11566746/b5a902cd1fa9/12915_2024_2064_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dea/11566746/7892e0a93f81/12915_2024_2064_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dea/11566746/10f943220ebb/12915_2024_2064_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dea/11566746/6cc7ccb0c546/12915_2024_2064_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dea/11566746/f9c1483f4b83/12915_2024_2064_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dea/11566746/b5a902cd1fa9/12915_2024_2064_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dea/11566746/7892e0a93f81/12915_2024_2064_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dea/11566746/10f943220ebb/12915_2024_2064_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dea/11566746/6cc7ccb0c546/12915_2024_2064_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9dea/11566746/f9c1483f4b83/12915_2024_2064_Fig5_HTML.jpg

相似文献

1
T4Seeker: a hybrid model for type IV secretion effectors identification.T4Seeker:一种用于 IV 型分泌效应器识别的混合模型。
BMC Biol. 2024 Nov 14;22(1):259. doi: 10.1186/s12915-024-02064-z.
2
T4SEfinder: a bioinformatics tool for genome-scale prediction of bacterial type IV secreted effectors using pre-trained protein language model.T4SEfinder:一种使用预先训练的蛋白质语言模型进行基于基因组规模预测细菌 IV 型分泌效应子的生物信息学工具。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab420.
3
iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles.iT4SE-EP:通过探索来自两个PSI-BLAST图谱的进化特征准确鉴定细菌IV型分泌效应蛋白
Molecules. 2021 Apr 24;26(9):2487. doi: 10.3390/molecules26092487.
4
T4SEpp: A pipeline integrating protein language models to predict bacterial type IV secreted effectors.T4SEpp:一种整合蛋白质语言模型以预测细菌IV型分泌效应蛋白的流程。
Comput Struct Biotechnol J. 2024 Jan 23;23:801-812. doi: 10.1016/j.csbj.2024.01.015. eCollection 2024 Dec.
5
Convolutional neural network-based annotation of bacterial type IV secretion system effectors with enhanced accuracy and reduced false discovery.基于卷积神经网络的细菌 IV 型分泌系统效应物注释,具有更高的准确性和更低的假阳性率。
Brief Bioinform. 2020 Sep 25;21(5):1825-1836. doi: 10.1093/bib/bbz120.
6
Systematic analysis and prediction of type IV secreted effector proteins by machine learning approaches.基于机器学习方法的 IV 型分泌效应蛋白的系统分析和预测。
Brief Bioinform. 2019 May 21;20(3):931-951. doi: 10.1093/bib/bbx164.
7
Effective prediction of bacterial type IV secreted effectors by combined features of both C-termini and N-termini.通过C端和N端的联合特征对细菌IV型分泌效应蛋白进行有效预测。
J Comput Aided Mol Des. 2017 Nov;31(11):1029-1038. doi: 10.1007/s10822-017-0080-z. Epub 2017 Nov 10.
8
T4SE-XGB: Interpretable Sequence-Based Prediction of Type IV Secreted Effectors Using eXtreme Gradient Boosting Algorithm.T4SE-XGB:使用极端梯度提升算法对IV型分泌效应蛋白进行基于序列的可解释预测。
Front Microbiol. 2020 Sep 24;11:580382. doi: 10.3389/fmicb.2020.580382. eCollection 2020.
9
Phylogenetic profiling, an untapped resource for the prediction of secreted proteins and its complementation with sequence-based classifiers in bacterial type III, IV and VI secretion systems.系统发育轮廓分析:一种预测细菌 III、IV 和 VI 型分泌系统中分泌蛋白的未开发资源,及其与基于序列分类器的互补分析。
Brief Bioinform. 2019 Jul 19;20(4):1395-1402. doi: 10.1093/bib/bby009.
10
DeepT3_4: A Hybrid Deep Neural Network Model for the Distinction Between Bacterial Type III and IV Secreted Effectors.DeepT3_4:一种用于区分细菌III型和IV型分泌效应蛋白的混合深度神经网络模型
Front Microbiol. 2021 Jan 21;12:605782. doi: 10.3389/fmicb.2021.605782. eCollection 2021.

本文引用的文献

1
T4SEpp: A pipeline integrating protein language models to predict bacterial type IV secreted effectors.T4SEpp:一种整合蛋白质语言模型以预测细菌IV型分泌效应蛋白的流程。
Comput Struct Biotechnol J. 2024 Jan 23;23:801-812. doi: 10.1016/j.csbj.2024.01.015. eCollection 2024 Dec.
2
Deep-STP: a deep learning-based approach to predict snake toxin proteins by using word embeddings.深度序列到蛋白预测(Deep-STP):一种基于深度学习的方法,通过词嵌入来预测蛇毒蛋白。
Front Med (Lausanne). 2024 Jan 17;10:1291352. doi: 10.3389/fmed.2023.1291352. eCollection 2023.
3
Identifying disease-related microbes based on multi-scale variational graph autoencoder embedding Wasserstein distance.
基于多尺度变分图自动编码器嵌入 Wasserstein 距离的疾病相关微生物识别。
BMC Biol. 2023 Dec 20;21(1):294. doi: 10.1186/s12915-023-01796-8.
4
Highly Accurate Estimation of Cell Type Abundance in Bulk Tissues Based on Single-Cell Reference and Domain Adaptive Matching.基于单细胞参考和领域自适应匹配的批量组织中细胞类型丰度的高精度估计。
Adv Sci (Weinh). 2024 Feb;11(7):e2306329. doi: 10.1002/advs.202306329. Epub 2023 Dec 10.
5
TSE-ARF: An adaptive prediction method of effectors across secretion system types.TSE-ARF:一种跨分泌系统类型的效应子自适应预测方法。
Anal Biochem. 2024 Mar;686:115407. doi: 10.1016/j.ab.2023.115407. Epub 2023 Nov 28.
6
Accurately identifying hemagglutinin using sequence information and machine learning methods.使用序列信息和机器学习方法准确识别血凝素。
Front Med (Lausanne). 2023 Oct 31;10:1281880. doi: 10.3389/fmed.2023.1281880. eCollection 2023.
7
DeepSecE: A Deep-Learning-Based Framework for Multiclass Prediction of Secreted Proteins in Gram-Negative Bacteria.DeepSecE:一种基于深度学习的革兰氏阴性菌分泌蛋白多类预测框架。
Research (Wash D C). 2023 Oct 25;6:0258. doi: 10.34133/research.0258. eCollection 2023.
8
Pmf-cpi: assessing drug selectivity with a pretrained multi-functional model for compound-protein interactions.Pmf-cpi:使用预训练的多功能化合物-蛋白质相互作用模型评估药物选择性。
J Cheminform. 2023 Oct 14;15(1):97. doi: 10.1186/s13321-023-00767-z.
9
A First Computational Frame for Recognizing Heparin-Binding Protein.一种用于识别肝素结合蛋白的首个计算框架。
Diagnostics (Basel). 2023 Jul 24;13(14):2465. doi: 10.3390/diagnostics13142465.
10
BioSeq-Diabolo: Biological sequence similarity analysis using Diabolo.BioSeq-Diabolo:使用 Diabolo 进行生物序列相似性分析。
PLoS Comput Biol. 2023 Jun 20;19(6):e1011214. doi: 10.1371/journal.pcbi.1011214. eCollection 2023 Jun.