• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用两步特征选择方法鉴定噬菌体病毒蛋白。

Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods.

机构信息

Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China.

Hebei Province Key Laboratory of Occupational Health and Safety for Coal Industry, School of Public Health, North China University of Science and Technology, Tangshan 063000, China.

出版信息

Molecules. 2018 Aug 10;23(8):2000. doi: 10.3390/molecules23082000.

DOI:10.3390/molecules23082000
PMID:30103458
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6222849/
Abstract

Accurate identification of phage virion protein is not only a key step for understanding the function of the phage virion protein but also helpful for further understanding the lysis mechanism of the bacterial cell. Since traditional experimental methods are time-consuming and costly for identifying phage virion proteins, it is extremely urgent to apply machine learning methods to accurately and efficiently identify phage virion proteins. In this work, a support vector machine (SVM) based method was proposed by mixing multiple sets of optimal g-gap dipeptide compositions. The analysis of variance (ANOVA) and the minimal-redundancy-maximal-relevance (mRMR) with an increment feature selection (IFS) were applied to single out the optimal feature set. In the five-fold cross-validation test, the proposed method achieved an overall accuracy of 87.95%. We believe that the proposed method will become an efficient and powerful method for scientists concerning phage virion proteins.

摘要

准确识别噬菌体病毒蛋白不仅是了解噬菌体病毒蛋白功能的关键步骤,还有助于进一步了解细菌细胞的裂解机制。由于传统的实验方法在识别噬菌体病毒蛋白时既耗时又昂贵,因此极有必要应用机器学习方法来准确、高效地识别噬菌体病毒蛋白。在这项工作中,提出了一种基于支持向量机(SVM)的方法,通过混合多组最佳的 g-gap 二肽组成来实现。方差分析(ANOVA)和最小冗余最大相关性(mRMR)与增量特征选择(IFS)相结合,用于挑选出最佳特征集。在五重交叉验证测试中,所提出的方法实现了 87.95%的整体准确率。我们相信,所提出的方法将成为噬菌体病毒蛋白领域科学家的一种高效、强大的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6651/6222849/4bda0459e988/molecules-23-02000-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6651/6222849/e006e6f52c99/molecules-23-02000-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6651/6222849/f6e6fce014dc/molecules-23-02000-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6651/6222849/d083d5563622/molecules-23-02000-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6651/6222849/e03ae48deb66/molecules-23-02000-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6651/6222849/4bda0459e988/molecules-23-02000-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6651/6222849/e006e6f52c99/molecules-23-02000-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6651/6222849/f6e6fce014dc/molecules-23-02000-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6651/6222849/d083d5563622/molecules-23-02000-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6651/6222849/e03ae48deb66/molecules-23-02000-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6651/6222849/4bda0459e988/molecules-23-02000-g005.jpg

相似文献

1
Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods.利用两步特征选择方法鉴定噬菌体病毒蛋白。
Molecules. 2018 Aug 10;23(8):2000. doi: 10.3390/molecules23082000.
2
Identification of bacteriophage virion proteins by the ANOVA feature selection and analysis.通过方差分析特征选择和分析鉴定噬菌体病毒粒子蛋白。
Mol Biosyst. 2014 Aug;10(8):2229-35. doi: 10.1039/c4mb00316k.
3
Review and comparative analysis of machine learning-based phage virion protein identification methods.基于机器学习的噬菌体病毒蛋白鉴定方法的回顾与比较分析。
Biochim Biophys Acta Proteins Proteom. 2020 Jun;1868(6):140406. doi: 10.1016/j.bbapap.2020.140406. Epub 2020 Mar 2.
4
Meta-iPVP: a sequence-based meta-predictor for improving the prediction of phage virion proteins using effective feature representation.Meta-iPVP:一种基于序列的元预测器,用于使用有效的特征表示来改进噬菌体衣壳蛋白的预测。
J Comput Aided Mol Des. 2020 Oct;34(10):1105-1116. doi: 10.1007/s10822-020-00323-z. Epub 2020 Jun 16.
5
Naïve Bayes classifier with feature selection to identify phage virion proteins.基于特征选择的朴素贝叶斯分类器,用于鉴定噬菌体病毒蛋白。
Comput Math Methods Med. 2013;2013:530696. doi: 10.1155/2013/530696. Epub 2013 May 15.
6
Recent Advances of Computational Methods for Identifying Bacteriophage Virion Proteins.用于鉴定噬菌体病毒粒子蛋白的计算方法的最新进展
Protein Pept Lett. 2020;27(4):259-264. doi: 10.2174/0929866526666190410124642.
7
PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method.PVPred-SCM:利用评分卡方法改进噬菌体衣壳蛋白的预测和分析。
Cells. 2020 Feb 3;9(2):353. doi: 10.3390/cells9020353.
8
PVP-SVM: Sequence-Based Prediction of Phage Virion Proteins Using a Support Vector Machine.PVP-SVM:使用支持向量机基于序列预测噬菌体病毒粒子蛋白
Front Microbiol. 2018 Mar 16;9:476. doi: 10.3389/fmicb.2018.00476. eCollection 2018.
9
Identifying Antioxidant Proteins by Using Optimal Dipeptide Compositions.利用最优二肽组成鉴定抗氧化蛋白。
Interdiscip Sci. 2016 Jun;8(2):186-191. doi: 10.1007/s12539-015-0124-9. Epub 2015 Sep 7.
10
SCORPION is a stacking-based ensemble learning framework for accurate prediction of phage virion proteins.SCORPION 是一个基于堆叠的集成学习框架,用于准确预测噬菌体病毒蛋白。
Sci Rep. 2022 Mar 8;12(1):4106. doi: 10.1038/s41598-022-08173-5.

引用本文的文献

1
PhaVIP: Phage VIrion Protein classification based on chaos game representation and Vision Transformer.基于混沌游戏表示和 Vision Transformer 的噬菌体衣壳蛋白分类
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i30-i39. doi: 10.1093/bioinformatics/btad229.
2
Prediction of Phage Virion Proteins Using Machine Learning Methods.利用机器学习方法预测噬菌体病毒粒子蛋白。
Molecules. 2023 Feb 28;28(5):2238. doi: 10.3390/molecules28052238.
3
DeePVP: Identification and classification of phage virion proteins using deep learning.基于深度学习的噬菌体病毒蛋白鉴定与分类

本文引用的文献

1
iRNA-2OM: A Sequence-Based Predictor for Identifying 2'-O-Methylation Sites in Homo sapiens.iRNA-2OM:一种用于识别人类2'-O-甲基化位点的基于序列的预测工具。
J Comput Biol. 2018 Nov;25(11):1266-1277. doi: 10.1089/cmb.2018.0004. Epub 2018 Aug 16.
2
HBPred: a tool to identify growth hormone-binding proteins.HBPred:一种识别生长激素结合蛋白的工具。
Int J Biol Sci. 2018 May 22;14(8):957-964. doi: 10.7150/ijbs.24174. eCollection 2018.
3
iRSpot-Pse6NC: Identifying recombination spots in by incorporating hexamer composition into general PseKNC.
Gigascience. 2022 Aug 11;11. doi: 10.1093/gigascience/giac076.
4
Screening of Gene Expression Markers for Corona Virus Disease 2019 Through Boruta_MCFS Feature Selection.通过 Boruta_MCFS 特征选择筛选 2019 年冠状病毒病的基因表达标志物。
Front Public Health. 2022 Jun 22;10:901602. doi: 10.3389/fpubh.2022.901602. eCollection 2022.
5
DeephageTP: a convolutional neural network framework for identifying phage-specific proteins from metagenomic sequencing data.DeephageTP:一种从宏基因组测序数据中识别噬菌体特异性蛋白的卷积神经网络框架。
PeerJ. 2022 Jun 8;10:e13404. doi: 10.7717/peerj.13404. eCollection 2022.
6
CAF-Associated Paracrine Signaling Worsens Outcome and Potentially Contributes to Chemoresistance in Epithelial Ovarian Cancer.癌症相关成纤维细胞(CAF)介导的旁分泌信号传导会恶化上皮性卵巢癌的预后,并可能导致化疗耐药。
Front Oncol. 2022 Mar 3;12:798680. doi: 10.3389/fonc.2022.798680. eCollection 2022.
7
SCORPION is a stacking-based ensemble learning framework for accurate prediction of phage virion proteins.SCORPION 是一个基于堆叠的集成学习框架,用于准确预测噬菌体病毒蛋白。
Sci Rep. 2022 Mar 8;12(1):4106. doi: 10.1038/s41598-022-08173-5.
8
Large-scale comparative review and assessment of computational methods for phage virion proteins identification.噬菌体病毒粒子蛋白质鉴定计算方法的大规模比较综述与评估
EXCLI J. 2022 Jan 3;21:11-29. doi: 10.17179/excli2021-4411. eCollection 2022.
9
Identification of Novel COVID-19 Biomarkers by Multiple Feature Selection Strategies.通过多种特征选择策略鉴定新型 COVID-19 生物标志物。
Comput Math Methods Med. 2021 Sep 27;2021:2203636. doi: 10.1155/2021/2203636. eCollection 2021.
10
Phages and Enzybiotics in Food Biopreservation.噬菌体和食源生物抗菌肽在食品生物保鲜中的应用
Molecules. 2021 Aug 25;26(17):5138. doi: 10.3390/molecules26175138.
iRSpot-Pse6NC:通过将六聚体组成纳入通用 PseKNC 来识别 中的重组热点。
Int J Biol Sci. 2018 May 22;14(8):883-891. doi: 10.7150/ijbs.24616. eCollection 2018.
4
iLoc-lncRNA: predict the subcellular location of lncRNAs by incorporating octamer composition into general PseKNC.iLoc-lncRNA:通过将八聚体组成纳入广义 PseKNC 来预测 lncRNA 的亚细胞位置。
Bioinformatics. 2018 Dec 15;34(24):4196-4204. doi: 10.1093/bioinformatics/bty508.
5
Identification of Bacteriophage Virion Proteins Using Multinomial Naïve Bayes with g-Gap Feature Tree.使用具有 g-gap 特征树的多项朴素贝叶斯识别噬菌体病毒蛋白。
Int J Mol Sci. 2018 Jun 15;19(6):1779. doi: 10.3390/ijms19061779.
6
Machine-Learning-Based Prediction of Cell-Penetrating Peptides and Their Uptake Efficiency with Improved Accuracy.基于机器学习的细胞穿透肽预测及其摄取效率的改进准确性。
J Proteome Res. 2018 Aug 3;17(8):2715-2726. doi: 10.1021/acs.jproteome.8b00148. Epub 2018 Jul 2.
7
AIPpred: Sequence-Based Prediction of Anti-inflammatory Peptides Using Random Forest.AIPpred:利用随机森林基于序列预测抗炎肽
Front Pharmacol. 2018 Mar 27;9:276. doi: 10.3389/fphar.2018.00276. eCollection 2018.
8
PVP-SVM: Sequence-Based Prediction of Phage Virion Proteins Using a Support Vector Machine.PVP-SVM:使用支持向量机基于序列预测噬菌体病毒粒子蛋白
Front Microbiol. 2018 Mar 16;9:476. doi: 10.3389/fmicb.2018.00476. eCollection 2018.
9
NeuroPP: A Tool for the Prediction of Neuropeptide Precursors Based on Optimal Sequence Composition.神经肽原预测工具(NeuroPP):基于最优序列组成的神经肽前体预测工具。
Interdiscip Sci. 2019 Mar;11(1):108-114. doi: 10.1007/s12539-018-0287-2. Epub 2018 Mar 10.
10
PSBinder: A Web Service for Predicting Polystyrene Surface-Binding Peptides.PSBinder:用于预测聚苯乙烯表面结合肽的 Web 服务。
Biomed Res Int. 2017;2017:5761517. doi: 10.1155/2017/5761517. Epub 2017 Dec 27.