• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于二项式贪婪遗传算法的新有效表示和智能核线性判别分析的蛋白质亚核定位。

Protein subnuclear localization based on a new effective representation and intelligent kernel linear discriminant analysis by dichotomous greedy genetic algorithm.

机构信息

School of Information Science and Engineering, Yunnan University, Kunming, PR China.

出版信息

PLoS One. 2018 Apr 12;13(4):e0195636. doi: 10.1371/journal.pone.0195636. eCollection 2018.

DOI:10.1371/journal.pone.0195636
PMID:29649330
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5896989/
Abstract

A wide variety of methods have been proposed in protein subnuclear localization to improve the prediction accuracy. However, one important trend of these means is to treat fusion representation by fusing multiple feature representations, of which, the fusion process takes a lot of time. In view of this, this paper novelly proposed a method by combining a new single feature representation and a new algorithm to obtain good recognition rate. Specifically, based on the position-specific scoring matrix (PSSM), we proposed a new expression, correlation position-specific scoring matrix (CoPSSM) as the protein feature representation. Based on the classic nonlinear dimension reduction algorithm, kernel linear discriminant analysis (KLDA), we added a new discriminant criterion and proposed a dichotomous greedy genetic algorithm (DGGA) to intelligently select its kernel bandwidth parameter. Two public datasets with Jackknife test and KNN classifier were used for the numerical experiments. The results showed that the overall success rate (OSR) with single representation CoPSSM is larger than that with many relevant representations. The OSR of the proposed method can reach as high as 87.444% and 90.3361% for these two datasets, respectively, outperforming many current methods. To show the generalization of the proposed algorithm, two extra standard datasets of protein subcellular were chosen to conduct the expending experiment, and the prediction accuracy by Jackknife test and Independent test is still considerable.

摘要

为了提高预测精度,人们提出了多种蛋白质亚核定位方法。然而,这些方法的一个重要趋势是通过融合多种特征表示来处理融合表示,其中融合过程需要大量时间。针对这一问题,本文创新性地提出了一种方法,该方法结合了一种新的单一特征表示和一种新的算法,以获得良好的识别率。具体来说,基于位置特异性评分矩阵(PSSM),我们提出了一种新的表达式,即相关性位置特异性评分矩阵(CoPSSM)作为蛋白质的特征表示。基于经典的非线性降维算法核线性判别分析(KLDA),我们增加了一个新的判别准则,并提出了一种二项式贪婪遗传算法(DGGA)来智能选择其核带宽参数。使用 Jackknife 测试和 KNN 分类器对两个公共数据集进行了数值实验。结果表明,使用单一表示 CoPSSM 的整体成功率(OSR)大于使用许多相关表示的 OSR。对于这两个数据集,所提出方法的 OSR 分别高达 87.444%和 90.3361%,优于许多现有方法。为了展示所提出算法的泛化性,选择了两个额外的蛋白质亚细胞标准数据集进行扩展实验,Jackknife 测试和独立测试的预测准确性仍然相当可观。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5518/5896989/1993718168cc/pone.0195636.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5518/5896989/04949d1a8e5a/pone.0195636.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5518/5896989/0a1b2b859cf5/pone.0195636.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5518/5896989/d60eefd05d84/pone.0195636.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5518/5896989/1993718168cc/pone.0195636.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5518/5896989/04949d1a8e5a/pone.0195636.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5518/5896989/0a1b2b859cf5/pone.0195636.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5518/5896989/d60eefd05d84/pone.0195636.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5518/5896989/1993718168cc/pone.0195636.g004.jpg

相似文献

1
Protein subnuclear localization based on a new effective representation and intelligent kernel linear discriminant analysis by dichotomous greedy genetic algorithm.基于二项式贪婪遗传算法的新有效表示和智能核线性判别分析的蛋白质亚核定位。
PLoS One. 2018 Apr 12;13(4):e0195636. doi: 10.1371/journal.pone.0195636. eCollection 2018.
2
Protein Sub-Nuclear Localization Based on Effective Fusion Representations and Dimension Reduction Algorithm LDA.基于有效融合表示和降维算法LDA的蛋白质亚核定位
Int J Mol Sci. 2015 Dec 19;16(12):30343-61. doi: 10.3390/ijms161226237.
3
Protein Subcellular Localization with Gaussian Kernel Discriminant Analysis and Its Kernel Parameter Selection.基于高斯核判别分析的蛋白质亚细胞定位及其核参数选择。
Int J Mol Sci. 2017 Dec 15;18(12):2718. doi: 10.3390/ijms18122718.
4
Subcellular location prediction of apoptosis proteins using two novel feature extraction methods based on evolutionary information and LDA.基于进化信息和 LDA 的两种新特征提取方法对凋亡蛋白的亚细胞定位预测
BMC Bioinformatics. 2020 May 24;21(1):212. doi: 10.1186/s12859-020-3539-1.
5
Nuc-PLoc: a new web-server for predicting protein subnuclear localization by fusing PseAA composition and PsePSSM.Nuc-PLoc:一种通过融合伪氨基酸组成和伪位置特异性得分矩阵来预测蛋白质亚核定位的新型网络服务器。
Protein Eng Des Sel. 2007 Nov;20(11):561-7. doi: 10.1093/protein/gzm057. Epub 2007 Nov 10.
6
Prediction of apoptosis protein subcellular location based on position-specific scoring matrix and isometric mapping algorithm.基于位置特异性评分矩阵和等距映射算法预测凋亡蛋白亚细胞定位。
Med Biol Eng Comput. 2019 Dec;57(12):2553-2565. doi: 10.1007/s11517-019-02045-3. Epub 2019 Oct 16.
7
Using pseudo amino acid composition to predict protein subnuclear location with improved hybrid approach.使用伪氨基酸组成通过改进的混合方法预测蛋白质亚核定位。
Amino Acids. 2008 Jan;34(1):119-25. doi: 10.1007/s00726-007-0545-9. Epub 2007 May 21.
8
Robust prediction of protein subcellular localization combining PCA and WSVMs.基于 PCA 和 WSVMs 的蛋白质亚细胞定位的稳健预测。
Comput Biol Med. 2011 Aug;41(8):648-52. doi: 10.1016/j.compbiomed.2011.05.016. Epub 2011 Jun 30.
9
Protein subcellular localization prediction using multiple kernel learning based support vector machine.基于多核学习支持向量机的蛋白质亚细胞定位预测
Mol Biosyst. 2017 Mar 28;13(4):785-795. doi: 10.1039/c6mb00860g.
10
A kernel-induced space selection approach to model selection in KLDA.一种用于KLDA中模型选择的核诱导空间选择方法。
IEEE Trans Neural Netw. 2008 Dec;19(12):2116-31. doi: 10.1109/TNN.2008.2005140.

引用本文的文献

1
Predicting antifreeze proteins with weighted generalized dipeptide composition and multi-regression feature selection ensemble.基于加权广义二肽组成和多元回归特征选择集成预测抗冻蛋白。
BMC Bioinformatics. 2021 Jun 23;22(Suppl 3):340. doi: 10.1186/s12859-021-04251-z.
2
Prediction of protein structural classes by different feature expressions based on 2-D wavelet denoising and fusion.基于二维小波去噪和融合的不同特征表达预测蛋白质结构类别。
BMC Bioinformatics. 2019 Dec 24;20(Suppl 25):701. doi: 10.1186/s12859-019-3276-5.
3
Accurate classification of membrane protein types based on sequence and evolutionary information using deep learning.

本文引用的文献

1
Protein Subcellular Localization with Gaussian Kernel Discriminant Analysis and Its Kernel Parameter Selection.基于高斯核判别分析的蛋白质亚细胞定位及其核参数选择。
Int J Mol Sci. 2017 Dec 15;18(12):2718. doi: 10.3390/ijms18122718.
2
Subcellular localization prediction of apoptosis proteins based on evolutionary information and support vector machine.基于进化信息和支持向量机的凋亡蛋白亚细胞定位预测
Artif Intell Med. 2017 May;78:41-46. doi: 10.1016/j.artmed.2017.05.007. Epub 2017 May 24.
3
Protein subcellular localization prediction using multiple kernel learning based support vector machine.
基于序列和进化信息的膜蛋白类型的深度学习精确分类。
BMC Bioinformatics. 2019 Dec 24;20(Suppl 25):700. doi: 10.1186/s12859-019-3275-6.
4
Prediction of Apoptosis Protein Subcellular Localization with Multilayer Sparse Coding and Oversampling Approach.基于多层稀疏编码和过采样方法预测细胞凋亡蛋白亚细胞定位。
Biomed Res Int. 2019 Jan 30;2019:2436924. doi: 10.1155/2019/2436924. eCollection 2019.
5
Protein-protein interaction sites prediction by ensemble random forests with synthetic minority oversampling technique.基于集成随机森林和合成少数过采样技术的蛋白质-蛋白质相互作用位点预测。
Bioinformatics. 2019 Jul 15;35(14):2395-2402. doi: 10.1093/bioinformatics/bty995.
6
The subcellular organisation of Saccharomyces cerevisiae.酿酒酵母的亚细胞结构组织。
Curr Opin Chem Biol. 2019 Feb;48:86-95. doi: 10.1016/j.cbpa.2018.10.026. Epub 2018 Nov 29.
基于多核学习支持向量机的蛋白质亚细胞定位预测
Mol Biosyst. 2017 Mar 28;13(4):785-795. doi: 10.1039/c6mb00860g.
4
MSLVP: prediction of multiple subcellular localization of viral proteins using a support vector machine.MSLVP:使用支持向量机预测病毒蛋白的多个亚细胞定位
Mol Biosyst. 2016 Jul 19;12(8):2572-86. doi: 10.1039/c6mb00241b.
5
Incorporating organelle correlations into semi-supervised learning for protein subcellular localization prediction.将细胞器相关性纳入半监督学习以进行蛋白质亚细胞定位预测。
Bioinformatics. 2016 Jul 15;32(14):2184-92. doi: 10.1093/bioinformatics/btw219. Epub 2016 Apr 23.
6
A multiple information fusion method for predicting subcellular locations of two different types of bacterial protein simultaneously.一种同时预测两种不同类型细菌蛋白质亚细胞定位的多信息融合方法。
Biosystems. 2016 Jan;139:37-45. doi: 10.1016/j.biosystems.2015.12.002. Epub 2015 Dec 24.
7
Protein Sub-Nuclear Localization Based on Effective Fusion Representations and Dimension Reduction Algorithm LDA.基于有效融合表示和降维算法LDA的蛋白质亚核定位
Int J Mol Sci. 2015 Dec 19;16(12):30343-61. doi: 10.3390/ijms161226237.
8
Prediction of bacterial protein subcellular localization by incorporating various features into Chou's PseAAC and a backward feature selection approach.通过将各种特征纳入周氏伪氨基酸组成并采用反向特征选择方法预测细菌蛋白质亚细胞定位
Biochimie. 2014 Sep;104:100-7. doi: 10.1016/j.biochi.2014.06.001. Epub 2014 Jun 11.
9
Protein sub-nuclear localization prediction using SVM and Pfam domain information.利用支持向量机和Pfam结构域信息进行蛋白质亚核定位预测。
PLoS One. 2014 Jun 4;9(6):e98345. doi: 10.1371/journal.pone.0098345. eCollection 2014.
10
iNR-Drug: predicting the interaction of drugs with nuclear receptors in cellular networking.iNR-Drug:预测细胞网络中药物与核受体的相互作用。
Int J Mol Sci. 2014 Mar 19;15(3):4915-37. doi: 10.3390/ijms15034915.