• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过将进化和物理化学信息整合到 Chou 的通用 PseAAC 中,鉴定蛋白质亚细胞定位。

Identification of protein subcellular localization via integrating evolutionary and physicochemical information into Chou's general PseAAC.

机构信息

School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Yaguan Road, Jinnan District, Tianjin, PR China.

School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Yaguan Road, Jinnan District, Tianjin, PR China; School of Computational Science and Engineering, University of South Carolina, Columbia, USA.

出版信息

J Theor Biol. 2019 Feb 7;462:230-239. doi: 10.1016/j.jtbi.2018.11.012. Epub 2018 Nov 16.

DOI:10.1016/j.jtbi.2018.11.012
PMID:30452958
Abstract

Identifying the location of proteins in a cell plays an important role in understanding their functions, such as drug design, therapeutic target discovery and biological research. However, the traditional subcellular localization experiments are time-consuming, laborious and small scale. With the development of next-generation sequencing technology, the number of proteins has grown exponentially, which lays the foundation of the computational method for identifying protein subcellular localization. Although many methods for predicting subcellular localization of proteins have been proposed, most of them are limited to single-location. In this paper, we propose a multi-kernel SVM to predict subcellular localization of both multi-location and single-location proteins. First, we make use of the evolutionary information extracted from position specific scoring matrix (PSSM) and physicochemical properties of proteins, by Chou's general PseAAC and other efficient functions. Then, we propose a multi-kernel support vector machine (SVM) model to identify multi-label protein subcellular localization. As a result, our method has a good performance on predicting subcellular localization of proteins. It achieves an average precision of 0.7065 and 0.6889 on two human datasets, respectively. All results are higher than those achieved by other existing methods. Therefore, we provide an efficient system via a novel perspective to study the protein subcellular localization.

摘要

确定蛋白质在细胞中的位置在理解其功能方面起着重要作用,例如药物设计、治疗靶点发现和生物研究。然而,传统的亚细胞定位实验既耗时又费力,而且规模较小。随着下一代测序技术的发展,蛋白质的数量呈指数级增长,这为蛋白质亚细胞定位的计算方法奠定了基础。尽管已经提出了许多预测蛋白质亚细胞定位的方法,但大多数方法仅限于单定位。在本文中,我们提出了一种多核支持向量机(Multi-kernel SVM),用于预测多定位和单定位蛋白质的亚细胞定位。首先,我们利用从位置特异性评分矩阵(PSSM)和蛋白质理化性质中提取的进化信息,通过 Chou 的通用 PseAAC 和其他高效功能。然后,我们提出了一种多核支持向量机(SVM)模型来识别多标签蛋白质亚细胞定位。结果表明,我们的方法在预测蛋白质亚细胞定位方面具有良好的性能。在两个人类数据集上,分别实现了 0.7065 和 0.6889 的平均精度。所有结果均高于其他现有方法的结果。因此,我们通过一种新的视角提供了一个有效的系统来研究蛋白质亚细胞定位。

相似文献

1
Identification of protein subcellular localization via integrating evolutionary and physicochemical information into Chou's general PseAAC.通过将进化和物理化学信息整合到 Chou 的通用 PseAAC 中,鉴定蛋白质亚细胞定位。
J Theor Biol. 2019 Feb 7;462:230-239. doi: 10.1016/j.jtbi.2018.11.012. Epub 2018 Nov 16.
2
Prediction of protein subcellular localization with oversampling approach and Chou's general PseAAC.基于过采样方法和周式广义伪氨基酸组成预测蛋白质亚细胞定位
J Theor Biol. 2018 Jan 21;437:239-250. doi: 10.1016/j.jtbi.2017.10.030. Epub 2017 Oct 31.
3
Protein subcellular localization prediction using multiple kernel learning based support vector machine.基于多核学习支持向量机的蛋白质亚细胞定位预测
Mol Biosyst. 2017 Mar 28;13(4):785-795. doi: 10.1039/c6mb00860g.
4
pLoc_bal-mVirus: Predict Subcellular Localization of Multi-Label Virus Proteins by Chou's General PseAAC and IHTS Treatment to Balance Training Dataset.pLoc_bal-mVirus:基于周式广义伪氨基酸组成和用于平衡训练数据集的迭代启发式阈值选择处理预测多标签病毒蛋白的亚细胞定位
Med Chem. 2019;15(5):496-509. doi: 10.2174/1573406415666181217114710.
5
Predicting apoptosis protein subcellular localization by integrating auto-cross correlation and PSSM into Chou's PseAAC.通过将自相关和 PSSM 整合到 Chou 的 PseAAC 中,预测细胞凋亡蛋白的亚细胞定位。
J Theor Biol. 2018 Nov 14;457:163-169. doi: 10.1016/j.jtbi.2018.08.042. Epub 2018 Sep 1.
6
Predict Gram-Positive and Gram-Negative Subcellular Localization via Incorporating Evolutionary Information and Physicochemical Features Into Chou's General PseAAC.通过将进化信息和理化特征纳入周氏通用伪氨基酸组成预测革兰氏阳性和革兰氏阴性亚细胞定位
IEEE Trans Nanobioscience. 2015 Dec;14(8):915-26. doi: 10.1109/TNB.2015.2500186. Epub 2015 Nov 12.
7
Prediction of Protein Subcellular Localization Based on Fusion of Multi-view Features.基于多视图特征融合的蛋白质亚细胞定位预测。
Molecules. 2019 Mar 6;24(5):919. doi: 10.3390/molecules24050919.
8
Predicting plant protein subcellular multi-localization by Chou's PseAAC formulation based multi-label homolog knowledge transfer learning.基于 Chou 的 PseAAC 构象的多标签同源知识转移学习预测植物蛋白质亚细胞多定位。
J Theor Biol. 2012 Oct 7;310:80-7. doi: 10.1016/j.jtbi.2012.06.028. Epub 2012 Jun 27.
9
Predict subcellular locations of singleplex and multiplex proteins by semi-supervised learning and dimension-reducing general mode of Chou's PseAAC.通过半监督学习和 Chou 的 PseAAC 通用模式的降维方法预测单plex 和 multiplex 蛋白质的亚细胞定位。
IEEE Trans Nanobioscience. 2013 Dec;12(4):311-20. doi: 10.1109/TNB.2013.2272014.
10
Accurate prediction of subcellular location of apoptosis proteins combining Chou's PseAAC and PsePSSM based on wavelet denoising.基于小波去噪结合周氏伪氨基酸组成和伪位置特异性得分矩阵对凋亡蛋白亚细胞定位的准确预测
Oncotarget. 2017 Nov 21;8(64):107640-107665. doi: 10.18632/oncotarget.22585. eCollection 2017 Dec 8.

引用本文的文献

1
DeepAIPs-SFLA: Deep Convolutional Model for Prediction of Anti-Inflammatory Peptides Using Binary Pattern Decomposition of Novel Multiview Descriptors with an SFLA Approach.深度人工智能粒子群优化算法:基于新型多视图描述符的二元模式分解与粒子群优化算法的深度卷积模型用于抗炎肽预测
ACS Omega. 2025 Aug 5;10(32):35747-35762. doi: 10.1021/acsomega.5c02422. eCollection 2025 Aug 19.
2
RecGOBD: accurate recognition of gene ontology related brain development protein functions through multi-feature fusion and attention mechanisms.RecGOBD:通过多特征融合和注意力机制准确识别与基因本体相关的脑发育蛋白质功能。
Bioinform Adv. 2024 Nov 4;4(1):vbae163. doi: 10.1093/bioadv/vbae163. eCollection 2024.
3
A Review for Artificial Intelligence Based Protein Subcellular Localization.
基于人工智能的蛋白质亚细胞定位研究综述
Biomolecules. 2024 Mar 27;14(4):409. doi: 10.3390/biom14040409.
4
A GHKNN model based on the physicochemical property extraction method to identify SNARE proteins.一种基于物理化学性质提取方法的GHKNN模型,用于识别SNARE蛋白。
Front Genet. 2022 Nov 23;13:935717. doi: 10.3389/fgene.2022.935717. eCollection 2022.
5
Functional Analysis of Conserved Hypothetical Proteins from the Antarctic Bacterium, Strain BG5 Reveals Protein Cold Adaptation and Thermal Tolerance Strategies.来自南极细菌BG5菌株的保守假设蛋白的功能分析揭示了蛋白质的冷适应和耐热策略。
Microorganisms. 2022 Aug 16;10(8):1654. doi: 10.3390/microorganisms10081654.
6
Identification of Vesicle Transport Proteins Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model.囊泡转运蛋白的鉴定 超图正则化K局部超平面距离最近邻模型
Front Genet. 2022 Jul 13;13:960388. doi: 10.3389/fgene.2022.960388. eCollection 2022.
7
Gm-PLoc: A Subcellular Localization Model of Multi-Label Protein Based on GAN and DeepFM.Gm-PLoc:一种基于生成对抗网络(GAN)和深度因子分解机(DeepFM)的多标签蛋白质亚细胞定位模型
Front Genet. 2022 Jun 15;13:912614. doi: 10.3389/fgene.2022.912614. eCollection 2022.
8
Evaluating hierarchical machine learning approaches to classify biological databases.评估分层机器学习方法对生物数据库进行分类。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac216.
9
Identifying Membrane Protein Types Based on Lifelong Learning With Dynamically Scalable Networks.基于动态可扩展网络的终身学习识别膜蛋白类型
Front Genet. 2022 Mar 14;12:834488. doi: 10.3389/fgene.2021.834488. eCollection 2021.
10
ATGPred-FL: sequence-based prediction of autophagy proteins with feature representation learning.ATGPred-FL:基于特征表示学习的自噬蛋白序列预测。
Amino Acids. 2022 May;54(5):799-809. doi: 10.1007/s00726-022-03145-5. Epub 2022 Mar 14.