• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CNN-BLPred:一种基于卷积神经网络的β-内酰胺酶(BL)及其分类预测器。

CNN-BLPred: a Convolutional neural network based predictor for β-Lactamases (BL) and their classes.

机构信息

Department of Computational Science and Engineering, North Carolina A&T State University, Greensboro, NC, 27411, USA.

Faculty of Information Science and Electrical Engineering, Kyushu University, 744 Motooka, Nishi-ku, Fukuoka, 819-0395, Japan.

出版信息

BMC Bioinformatics. 2017 Dec 28;18(Suppl 16):577. doi: 10.1186/s12859-017-1972-6.

DOI:10.1186/s12859-017-1972-6
PMID:29297322
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5751796/
Abstract

BACKGROUND

The β-Lactamase (BL) enzyme family is an important class of enzymes that plays a key role in bacterial resistance to antibiotics. As the newly identified number of BL enzymes is increasing daily, it is imperative to develop a computational tool to classify the newly identified BL enzymes into one of its classes. There are two types of classification of BL enzymes: Molecular Classification and Functional Classification. Existing computational methods only address Molecular Classification and the performance of these existing methods is unsatisfactory.

RESULTS

We addressed the unsatisfactory performance of the existing methods by implementing a Deep Learning approach called Convolutional Neural Network (CNN). We developed CNN-BLPred, an approach for the classification of BL proteins. The CNN-BLPred uses Gradient Boosted Feature Selection (GBFS) in order to select the ideal feature set for each BL classification. Based on the rigorous benchmarking of CCN-BLPred using both leave-one-out cross-validation and independent test sets, CCN-BLPred performed better than the other existing algorithms. Compared with other architectures of CNN, Recurrent Neural Network, and Random Forest, the simple CNN architecture with only one convolutional layer performs the best. After feature extraction, we were able to remove ~95% of the 10,912 features using Gradient Boosted Trees. During 10-fold cross validation, we increased the accuracy of the classic BL predictions by 7%. We also increased the accuracy of Class A, Class B, Class C, and Class D performance by an average of 25.64%. The independent test results followed a similar trend.

CONCLUSIONS

We implemented a deep learning algorithm known as Convolutional Neural Network (CNN) to develop a classifier for BL classification. Combined with feature selection on an exhaustive feature set and using balancing method such as Random Oversampling (ROS), Random Undersampling (RUS) and Synthetic Minority Oversampling Technique (SMOTE), CNN-BLPred performs significantly better than existing algorithms for BL classification.

摘要

背景

β-内酰胺酶(BL)酶家族是一类重要的酶,在细菌对抗生素的耐药性中起着关键作用。随着新发现的 BL 酶数量的日益增加,开发一种计算工具将新发现的 BL 酶分类为其类别之一是当务之急。BL 酶有两种分类:分子分类和功能分类。现有的计算方法仅解决分子分类,这些现有方法的性能并不令人满意。

结果

我们通过实施一种称为卷积神经网络(CNN)的深度学习方法来解决现有方法的性能不佳问题。我们开发了一种用于 BL 蛋白分类的方法 CNN-BLPred。CNN-BLPred 使用梯度提升特征选择(GBFS)来为每个 BL 分类选择理想的特征集。基于使用留一法交叉验证和独立测试集对 CCN-BLPred 的严格基准测试,CCN-BLPred 的性能优于其他现有算法。与其他 CNN 架构、递归神经网络和随机森林相比,只有一层卷积的简单 CNN 架构表现最佳。在特征提取后,我们能够使用梯度提升树去除约 95%的 10912 个特征。在 10 折交叉验证中,我们将经典 BL 预测的准确率提高了 7%。我们还将 Class A、Class B、Class C 和 Class D 的性能平均提高了 25.64%。独立测试结果也呈现出类似的趋势。

结论

我们实施了一种称为卷积神经网络(CNN)的深度学习算法,以开发一种 BL 分类分类器。结合对详尽特征集的特征选择以及使用随机过采样(ROS)、随机欠采样(RUS)和合成少数过采样技术(SMOTE)等平衡方法,CNN-BLPred 在 BL 分类方面的性能明显优于现有算法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcb/5751796/6687da2e16b1/12859_2017_1972_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcb/5751796/ba12c6a99534/12859_2017_1972_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcb/5751796/1eb760e9acad/12859_2017_1972_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcb/5751796/a2ca66a8655b/12859_2017_1972_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcb/5751796/db7d9b0bb34b/12859_2017_1972_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcb/5751796/5ef1ab0e4d9d/12859_2017_1972_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcb/5751796/3695824e9c03/12859_2017_1972_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcb/5751796/6687da2e16b1/12859_2017_1972_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcb/5751796/ba12c6a99534/12859_2017_1972_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcb/5751796/1eb760e9acad/12859_2017_1972_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcb/5751796/a2ca66a8655b/12859_2017_1972_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcb/5751796/db7d9b0bb34b/12859_2017_1972_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcb/5751796/5ef1ab0e4d9d/12859_2017_1972_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcb/5751796/3695824e9c03/12859_2017_1972_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbcb/5751796/6687da2e16b1/12859_2017_1972_Fig7_HTML.jpg

相似文献

1
CNN-BLPred: a Convolutional neural network based predictor for β-Lactamases (BL) and their classes.CNN-BLPred:一种基于卷积神经网络的β-内酰胺酶(BL)及其分类预测器。
BMC Bioinformatics. 2017 Dec 28;18(Suppl 16):577. doi: 10.1186/s12859-017-1972-6.
2
Enhancing the prediction of IDC breast cancer staging from gene expression profiles using hybrid feature selection methods and deep learning architecture.使用混合特征选择方法和深度学习架构增强从基因表达谱预测浸润性导管癌乳腺癌分期的能力。
Med Biol Eng Comput. 2023 Nov;61(11):2895-2919. doi: 10.1007/s11517-023-02892-1. Epub 2023 Aug 2.
3
deepNEC: a novel alignment-free tool for the identification and classification of nitrogen biochemical network-related enzymes using deep learning.深度 NEC:一种新颖的无对齐工具,用于使用深度学习识别和分类与氮生化网络相关的酶。
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac071.
4
White blood cells detection and classification based on regional convolutional neural networks.基于区域卷积神经网络的白细胞检测与分类。
Med Hypotheses. 2020 Feb;135:109472. doi: 10.1016/j.mehy.2019.109472. Epub 2019 Nov 4.
5
CNN-Siam: multimodal siamese CNN-based deep learning approach for drug‒drug interaction prediction.CNN-Siam:基于双通道 CNN 的深度学习方法用于药物-药物相互作用预测。
BMC Bioinformatics. 2023 Mar 23;24(1):110. doi: 10.1186/s12859-023-05242-y.
6
A deep dive into understanding tumor foci classification using multiparametric MRI based on convolutional neural network.基于卷积神经网络,深入探究利用多参数磁共振成像进行肿瘤病灶分类。
Med Phys. 2020 Sep;47(9):4077-4086. doi: 10.1002/mp.14255. Epub 2020 Jun 12.
7
fMRI volume classification using a 3D convolutional neural network robust to shifted and scaled neuronal activations.使用对移位和缩放神经元激活具有鲁棒性的 3D 卷积神经网络进行 fMRI 体积分类。
Neuroimage. 2020 Dec;223:117328. doi: 10.1016/j.neuroimage.2020.117328. Epub 2020 Sep 5.
8
DeepSSPred: A Deep Learning Based Sulfenylation Site Predictor Via a Novel nSegmented Optimize Federated Feature Encoder.DeepSSPred:一种基于深度学习的新型 nSegmented Optimize 联邦特征编码器的硫化位点预测器。
Protein Pept Lett. 2021;28(6):708-721. doi: 10.2174/0929866527666201202103411.
9
A novel multiscale and multipath convolutional neural network based age-related macular degeneration detection using OCT images.基于 OCT 图像的新型多尺度多路径卷积神经网络年龄相关性黄斑变性检测。
Comput Methods Programs Biomed. 2021 Sep;209:106294. doi: 10.1016/j.cmpb.2021.106294. Epub 2021 Jul 27.
10
The prediction of cardiac abnormality and enhancement in minority class accuracy from imbalanced ECG signals using modified deep neural network models.使用改进的深度神经网络模型从不平衡心电图信号中预测心脏异常及少数类准确率的增强。
Comput Biol Med. 2022 Nov;150:106142. doi: 10.1016/j.compbiomed.2022.106142. Epub 2022 Sep 22.

引用本文的文献

1
Prediction and validation of nanowire proteins in G20 using machine learning and feature engineering.使用机器学习和特征工程对G20中的纳米线蛋白进行预测与验证。
Comput Struct Biotechnol J. 2025 Apr 19;27:1706-1718. doi: 10.1016/j.csbj.2025.04.022. eCollection 2025.
2
An endoscopic ultrasound-based interpretable deep learning model and nomogram for distinguishing pancreatic neuroendocrine tumors from pancreatic cancer.一种基于内镜超声的可解释深度学习模型和列线图,用于区分胰腺神经内分泌肿瘤和胰腺癌。
Sci Rep. 2025 Jan 27;15(1):3383. doi: 10.1038/s41598-024-84749-7.
3
Predictors of the rate of cognitive decline in older adults using machine learning.

本文引用的文献

1
RF-NR: Random Forest Based Approach for Improved Classification of Nuclear Receptors.RF-NR:基于随机森林的核受体分类改进方法。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Nov-Dec;15(6):1844-1852. doi: 10.1109/TCBB.2017.2773063. Epub 2017 Nov 14.
2
Characterisation of fosfomycin resistance mechanisms and molecular epidemiology in extended-spectrum β-lactamase-producing Klebsiella pneumoniae isolates.产超广谱β-内酰胺酶肺炎克雷伯菌中磷霉素耐药机制的特征及分子流行病学研究。
Int J Antimicrob Agents. 2016 Nov;48(5):564-568. doi: 10.1016/j.ijantimicag.2016.08.013. Epub 2016 Sep 30.
3
Deep learning in bioinformatics.
利用机器学习预测老年人认知能力下降的速度。
PLoS One. 2023 Mar 3;18(3):e0280029. doi: 10.1371/journal.pone.0280029. eCollection 2023.
4
Factors related to steroid treatment responsiveness in thyroid eye disease patients and application of SHAP for feature analysis with XGBoost.甲状腺眼病患者类固醇治疗反应性的相关因素及 SHAP 与 XGBoost 特征分析的应用。
Front Endocrinol (Lausanne). 2023 Jan 31;14:1079628. doi: 10.3389/fendo.2023.1079628. eCollection 2023.
5
β-LacFamPred: An online tool for prediction and classification of β-lactamase class, subclass, and family.β-内酰胺酶家族预测工具:一种用于预测和分类β-内酰胺酶类别、亚类和家族的在线工具。
Front Microbiol. 2023 Jan 12;13:1039687. doi: 10.3389/fmicb.2022.1039687. eCollection 2022.
6
Identification of the ubiquitin-proteasome pathway domain by hyperparameter optimization based on a 2D convolutional neural network.基于二维卷积神经网络的超参数优化识别泛素-蛋白酶体途径结构域
Front Genet. 2022 Jul 22;13:851688. doi: 10.3389/fgene.2022.851688. eCollection 2022.
7
Diverse Aquatic Animal Matrices Play a Key Role in Survival and Potential Virulence of Non-O1/O139 Isolates.多种水生动物基质在非O1/O139型分离株的生存及潜在毒力方面发挥关键作用。
Front Microbiol. 2022 Jun 21;13:896767. doi: 10.3389/fmicb.2022.896767. eCollection 2022.
8
Bioinformatic Analyses of Peroxiredoxins and RF-Prx: A Random Forest-Based Predictor and Classifier for Prxs.基于随机森林的过氧化物酶和 RF-Prx 生物信息学分析:Prxs 的预测器和分类器
Methods Mol Biol. 2022;2499:155-176. doi: 10.1007/978-1-0716-2317-6_8.
9
Lact-Pred: A Predictor Developed for Identification of Beta-Lactamases Using Statistical Moments and PseAAC via 5-Step Rule.Lact-Pred:一种使用统计矩和 PseAAC 通过 5 步规则开发的用于识别β-内酰胺酶的预测器。
Comput Intell Neurosci. 2021 Dec 17;2021:8974265. doi: 10.1155/2021/8974265. eCollection 2021.
10
General deep learning model for detecting diabetic retinopathy.用于检测糖尿病性视网膜病变的通用深度学习模型。
BMC Bioinformatics. 2021 Nov 8;22(Suppl 5):84. doi: 10.1186/s12859-021-04005-x.
生物信息学中的深度学习。
Brief Bioinform. 2017 Sep 1;18(5):851-869. doi: 10.1093/bib/bbw068.
4
RF-Hydroxysite: a random forest based predictor for hydroxylation sites.RF-羟基位点:一种基于随机森林的羟基化位点预测器。
Mol Biosyst. 2016 Jul 19;12(8):2427-35. doi: 10.1039/c6mb00179c.
5
RF-Phos: A Novel General Phosphorylation Site Prediction Tool Based on Random Forest.RF-Phos:一种基于随机森林的新型通用磷酸化位点预测工具。
Biomed Res Int. 2016;2016:3281590. doi: 10.1155/2016/3281590. Epub 2016 Mar 15.
6
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.
7
Prediction of β-lactamase and its class by Chou's pseudo-amino acid composition and support vector machine.基于周式伪氨基酸组成和支持向量机预测β-内酰胺酶及其类别
J Theor Biol. 2015 Jan 21;365:96-103. doi: 10.1016/j.jtbi.2014.10.008. Epub 2014 Oct 22.
8
Identification of family specific fingerprints in β-lactamase families.β-内酰胺酶家族中家族特异性指纹的鉴定。
ScientificWorldJournal. 2014 Feb 11;2014:980572. doi: 10.1155/2014/980572. eCollection 2014.
9
BLAD: a comprehensive database of widely circulated beta-lactamases.BLAD:一个广泛传播的β-内酰胺酶的综合数据库。
Bioinformatics. 2013 Oct 1;29(19):2515-6. doi: 10.1093/bioinformatics/btt417. Epub 2013 Aug 13.
10
The role of balanced training and testing data sets for binary classifiers in bioinformatics.生物信息学中用于二分类器的平衡训练集和测试集的作用。
PLoS One. 2013 Jul 9;8(7):e67863. doi: 10.1371/journal.pone.0067863. Print 2013.