• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BaPreS:一种使用最佳特征集来预测细菌素的软件工具。

BaPreS: a software tool for predicting bacteriocins using an optimal set of features.

机构信息

School of Electrical Engineering and Computer Science, Washington State University, Pullman, WA, USA.

School of Engineering and Applied Sciences, Washington State University Tri-Cities, Richland, WA, USA.

出版信息

BMC Bioinformatics. 2023 Aug 17;24(1):313. doi: 10.1186/s12859-023-05330-z.

DOI:10.1186/s12859-023-05330-z
PMID:37592230
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10433575/
Abstract

BACKGROUND

Antibiotic resistance is a major public health concern around the globe. As a result, researchers always look for new compounds to develop new antibiotic drugs for combating antibiotic-resistant bacteria. Bacteriocin becomes a promising antimicrobial agent to fight against antibiotic resistance, due to cases of both broad and narrow killing spectra. Sequence matching methods are widely used to identify bacteriocins by comparing them with the known bacteriocin sequences; however, these methods often fail to detect new bacteriocin sequences due to their high diversity. The ability to use a machine learning approach can help find new highly dissimilar bacteriocins for developing highly effective antibiotic drugs. The aim of this work is to develop a machine learning-based software tool called BaPreS (Bacteriocin Prediction Software) using an optimal set of features for detecting bacteriocin protein sequences with high accuracy. We extracted potential features from known bacteriocin and non-bacteriocin sequences by considering the physicochemical and structural properties of the protein sequences. Then we reduced the feature set using statistical justifications and recursive feature elimination technique. Finally, we built support vector machine (SVM) and random forest (RF) models using the selected features and utilized the best machine learning model to implement the software tool.

RESULTS

We applied BaPreS to an established dataset and evaluated its prediction performance. Acquired results show that the software tool can achieve a prediction accuracy of 95.54% for testing protein sequences. This tool allows users to add new bacteriocin or non-bacteriocin sequences in the training dataset to further enhance the predictive power of the tool. We compared the prediction performance of the BaPreS with a popular sequence matching-based tool and a deep learning-based method, and our software tool outperformed both.

CONCLUSIONS

BaPreS is a bacteriocin prediction tool that can be used to discover new highly dissimilar bacteriocins for developing highly effective antibiotic drugs. This software tool can be used with Windows, Linux and macOS operating systems. The open-source software package and its user manual are available at https://github.com/suraiya14/BaPreS .

摘要

背景

抗生素耐药性是全球主要的公共卫生问题。因此,研究人员一直在寻找新的化合物来开发新的抗生素药物以对抗抗药性细菌。由于具有广谱和窄谱杀菌谱,细菌素成为一种有前途的抗菌剂来对抗抗生素耐药性。通过比较已知的细菌素序列来识别细菌素的序列匹配方法被广泛应用;然而,由于其高度多样性,这些方法常常无法检测到新的细菌素序列。使用机器学习方法的能力可以帮助发现新的高度不同的细菌素来开发高效的抗生素药物。这项工作的目的是开发一种基于机器学习的软件工具,称为 BaPreS(细菌素预测软件),该软件使用一组最佳的特征来准确检测细菌素蛋白序列。我们通过考虑蛋白质序列的理化和结构特性,从已知的细菌素和非细菌素序列中提取潜在的特征。然后,我们使用统计理由和递归特征消除技术来减少特征集。最后,我们使用所选特征构建支持向量机(SVM)和随机森林(RF)模型,并利用最佳的机器学习模型来实现软件工具。

结果

我们将 BaPreS 应用于已建立的数据集,并评估了其预测性能。获得的结果表明,该软件工具可以达到 95.54%的测试蛋白质序列的预测准确性。该工具允许用户在训练数据集中添加新的细菌素或非细菌素序列,以进一步提高工具的预测能力。我们比较了 BaPreS 的预测性能与流行的序列匹配工具和基于深度学习的方法,我们的软件工具优于这两者。

结论

BaPreS 是一种细菌素预测工具,可用于发现新的高度不同的细菌素,以开发高效的抗生素药物。该软件工具可用于 Windows、Linux 和 macOS 操作系统。该开源软件包及其用户手册可在 https://github.com/suraiya14/BaPreS 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816d/10433575/800135fc337f/12859_2023_5330_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816d/10433575/06f44faed675/12859_2023_5330_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816d/10433575/3660f15214c3/12859_2023_5330_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816d/10433575/9197ce4d1635/12859_2023_5330_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816d/10433575/9664bc4cd3d7/12859_2023_5330_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816d/10433575/e2f305f2aed7/12859_2023_5330_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816d/10433575/800135fc337f/12859_2023_5330_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816d/10433575/06f44faed675/12859_2023_5330_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816d/10433575/3660f15214c3/12859_2023_5330_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816d/10433575/9197ce4d1635/12859_2023_5330_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816d/10433575/9664bc4cd3d7/12859_2023_5330_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816d/10433575/e2f305f2aed7/12859_2023_5330_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/816d/10433575/800135fc337f/12859_2023_5330_Fig6_HTML.jpg

相似文献

1
BaPreS: a software tool for predicting bacteriocins using an optimal set of features.BaPreS:一种使用最佳特征集来预测细菌素的软件工具。
BMC Bioinformatics. 2023 Aug 17;24(1):313. doi: 10.1186/s12859-023-05330-z.
2
BPAGS: a web application for bacteriocin prediction via feature evaluation using alternating decision tree, genetic algorithm, and linear support vector classifier.BPAGS:一种通过使用交替决策树、遗传算法和线性支持向量分类器进行特征评估来预测细菌素的网络应用程序。
Front Bioinform. 2024 Jan 10;3:1284705. doi: 10.3389/fbinf.2023.1284705. eCollection 2023.
3
A large scale prediction of bacteriocin gene blocks suggests a wide functional spectrum for bacteriocins.对细菌素基因簇的大规模预测表明,细菌素具有广泛的功能谱。
BMC Bioinformatics. 2015 Nov 11;16:381. doi: 10.1186/s12859-015-0792-9.
4
Novel antimicrobial peptide discovery using machine learning and biophysical selection of minimal bacteriocin domains.利用机器学习和最小细菌素结构域的生物物理选择发现新型抗菌肽。
Drug Dev Res. 2020 Feb;81(1):43-51. doi: 10.1002/ddr.21601. Epub 2019 Sep 4.
5
BADASS: BActeriocin-Diversity ASsessment Software.BADASS:细菌素多样性评估软件。
BMC Bioinformatics. 2023 Jan 20;24(1):24. doi: 10.1186/s12859-022-05106-x.
6
Discovery of Novel Type II Bacteriocins Using a New High-Dimensional Bioinformatic Algorithm.利用新型高维生物信息算法发现新型 II 型细菌素。
Front Immunol. 2020 Sep 3;11:1873. doi: 10.3389/fimmu.2020.01873. eCollection 2020.
7
Novel Group of Leaderless Multipeptide Bacteriocins from Gram-Positive Bacteria.革兰氏阳性菌中新型无 leader 多肽细菌素组
Appl Environ Microbiol. 2016 Aug 15;82(17):5216-24. doi: 10.1128/AEM.01094-16. Print 2016 Sep 1.
8
Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.机器学习算法在(放化疗)治疗结果预测中的应用:分类器的实证比较。
Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13.
9
Ubericin K, a New Pore-Forming Bacteriocin Targeting mannose-PTS.乌贝丁 K,一种针对甘露糖-PTS 的新型孔形成细菌素。
Microbiol Spectr. 2021 Oct 31;9(2):e0029921. doi: 10.1128/Spectrum.00299-21. Epub 2021 Oct 13.
10
Capreomycin resistance prediction in two species of Mycobacterium using a stacked ensemble method.利用堆叠集成方法预测两种分枝杆菌中的卷曲霉素耐药性。
J Appl Microbiol. 2019 Dec;127(6):1656-1664. doi: 10.1111/jam.14413. Epub 2019 Sep 8.

引用本文的文献

1
Deep learning neural network development for the classification of bacteriocin sequences produced by lactic acid bacteria.用于乳酸菌产生的细菌素序列分类的深度学习神经网络开发
F1000Res. 2025 Jun 20;13:981. doi: 10.12688/f1000research.154432.2. eCollection 2024.
2
Exo-Tox: Identifying Exotoxins from secreted bacterial proteins.外毒素:从分泌的细菌蛋白中鉴定外毒素
BioData Min. 2025 Aug 8;18(1):52. doi: 10.1186/s13040-025-00469-2.
3
Classification and Multi-Functional Use of Bacteriocins in Health, Biotechnology, and Food Industry.

本文引用的文献

1
Bacteriocins: Properties and potential use as antimicrobials.细菌素:特性及作为抗菌剂的潜在用途。
J Clin Lab Anal. 2022 Jan;36(1):e24093. doi: 10.1002/jcla.24093. Epub 2021 Dec 1.
2
RMSCNN: A Random Multi-Scale Convolutional Neural Network for Marine Microbial Bacteriocins Identification.RMSCNN:一种用于海洋微生物细菌素识别的随机多尺度卷积神经网络。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Nov-Dec;19(6):3663-3672. doi: 10.1109/TCBB.2021.3122183. Epub 2022 Dec 8.
3
The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation.
细菌素在健康、生物技术和食品工业中的分类及多功能应用
Antibiotics (Basel). 2024 Jul 18;13(7):666. doi: 10.3390/antibiotics13070666.
4
Tackling the Antimicrobial Resistance "Pandemic" with Machine Learning Tools: A Summary of Available Evidence.使用机器学习工具应对抗微生物药物耐药性“大流行”:现有证据综述
Microorganisms. 2024 Apr 23;12(5):842. doi: 10.3390/microorganisms12050842.
5
In Silico Screening of Bacteriocin Gene Clusters within a Set of Marine Genomes.一组海洋基因组中细菌素基因簇的计算机筛选
Int J Mol Sci. 2024 Feb 22;25(5):2566. doi: 10.3390/ijms25052566.
6
Revisiting the Multifaceted Roles of Bacteriocins : The Multifaceted Roles of Bacteriocins.重新审视细菌素的多方面作用:细菌素的多方面作用。
Microb Ecol. 2024 Feb 14;87(1):41. doi: 10.1007/s00248-024-02357-4.
7
BPAGS: a web application for bacteriocin prediction via feature evaluation using alternating decision tree, genetic algorithm, and linear support vector classifier.BPAGS:一种通过使用交替决策树、遗传算法和线性支持向量分类器进行特征评估来预测细菌素的网络应用程序。
Front Bioinform. 2024 Jan 10;3:1284705. doi: 10.3389/fbinf.2023.1284705. eCollection 2023.
在二分类混淆矩阵评估中,马修斯相关系数(MCC)比平衡准确率、庄家知情度和标记度更可靠。
BioData Min. 2021 Feb 4;14(1):13. doi: 10.1186/s13040-021-00244-z.
4
Ensemble-AMPPred: Robust AMP Prediction and Recognition Using the Ensemble Learning Method with a New Hybrid Feature for Differentiating AMPs.基于集成学习方法的 AMP 预测与识别:利用新型混合特征增强 AMP 区分能力
Genes (Basel). 2021 Jan 21;12(2):137. doi: 10.3390/genes12020137.
5
Better understanding and prediction of antiviral peptides through primary and secondary structure feature importance.通过一级和二级结构特征重要性,更好地理解和预测抗病毒肽。
Sci Rep. 2020 Nov 6;10(1):19260. doi: 10.1038/s41598-020-76161-8.
6
The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation.马修斯相关系数(MCC)在二分类评估中优于 F1 得分和准确率的优势。
BMC Genomics. 2020 Jan 2;21(1):6. doi: 10.1186/s12864-019-6413-7.
7
Novel antimicrobial peptide discovery using machine learning and biophysical selection of minimal bacteriocin domains.利用机器学习和最小细菌素结构域的生物物理选择发现新型抗菌肽。
Drug Dev Res. 2020 Feb;81(1):43-51. doi: 10.1002/ddr.21601. Epub 2019 Sep 4.
8
Capreomycin resistance prediction in two species of Mycobacterium using a stacked ensemble method.利用堆叠集成方法预测两种分枝杆菌中的卷曲霉素耐药性。
J Appl Microbiol. 2019 Dec;127(6):1656-1664. doi: 10.1111/jam.14413. Epub 2019 Sep 8.
9
Phage tail-like particles are versatile bacterial nanomachines - A mini-review.噬菌体尾状颗粒——一种多功能细菌纳米机器的综述
J Adv Res. 2019 Apr 23;19:75-84. doi: 10.1016/j.jare.2019.04.003. eCollection 2019 Sep.
10
Identifying antimicrobial peptides using word embedding with deep recurrent neural networks.使用深度递归神经网络的词嵌入来识别抗菌肽。
Bioinformatics. 2019 Jun 1;35(12):2009-2016. doi: 10.1093/bioinformatics/bty937.