• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于蛋白质重叠属性矩阵和三进制编码预测细胞凋亡蛋白亚细胞定位。

Predicting Apoptosis Protein Subcellular Locations based on the Protein Overlapping Property Matrix and Tri-Gram Encoding.

机构信息

AIEN Institute, Shanghai Ocean University, Shanghai 201306, China.

College of Sciences & Engineering, University of Tasmania, 7001 Tasmania, Australia.

出版信息

Int J Mol Sci. 2019 May 11;20(9):2344. doi: 10.3390/ijms20092344.

DOI:10.3390/ijms20092344
PMID:31083553
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6539631/
Abstract

To reveal the working pattern of programmed cell death, knowledge of the subcellular location of apoptosis proteins is essential. Besides the costly and time-consuming method of experimental determination, research into computational locating schemes, focusing mainly on the innovation of representation techniques on protein sequences and the selection of classification algorithms, has become popular in recent decades. In this study, a novel tri-gram encoding model is proposed, which is based on using the protein overlapping property matrix (POPM) for predicting apoptosis protein subcellular location. Next, a 1000-dimensional feature vector is built to represent a protein. Finally, with the help of support vector machine-recursive feature elimination (SVM-RFE), we select the optimal features and put them into a support vector machine (SVM) classifier for predictions. The results of jackknife tests on two benchmark datasets demonstrate that our proposed method can achieve satisfactory prediction performance level with less computing capacity required and could work as a promising tool to predict the subcellular locations of apoptosis proteins.

摘要

为了揭示细胞程序性死亡的工作模式,了解凋亡蛋白的亚细胞定位是必不可少的。除了实验测定这种昂贵且耗时的方法外,近年来,主要集中在蛋白质序列表示技术创新和分类算法选择上的计算定位方案的研究也变得很流行。在这项研究中,提出了一种新的三字符编码模型,该模型基于使用蛋白质重叠特性矩阵(POPM)来预测凋亡蛋白的亚细胞定位。接下来,构建一个 1000 维的特征向量来表示蛋白质。最后,借助支持向量机递归特征消除(SVM-RFE),选择最优特征并将其放入支持向量机(SVM)分类器中进行预测。在两个基准数据集的 Jackknife 测试结果表明,我们提出的方法可以在需要较少计算能力的情况下达到令人满意的预测性能水平,可以作为一种很有前途的工具来预测凋亡蛋白的亚细胞定位。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f71/6539631/4dac2df735d3/ijms-20-02344-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f71/6539631/4dac2df735d3/ijms-20-02344-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f71/6539631/4dac2df735d3/ijms-20-02344-g001.jpg

相似文献

1
Predicting Apoptosis Protein Subcellular Locations based on the Protein Overlapping Property Matrix and Tri-Gram Encoding.基于蛋白质重叠属性矩阵和三进制编码预测细胞凋亡蛋白亚细胞定位。
Int J Mol Sci. 2019 May 11;20(9):2344. doi: 10.3390/ijms20092344.
2
Prediction of subcellular location of apoptosis proteins combining tri-gram encoding based on PSSM and recursive feature elimination.基于位置特异性得分矩阵(PSSM)的三元组编码与递归特征消除相结合预测凋亡蛋白的亚细胞定位
J Theor Biol. 2015 Feb 7;366:8-12. doi: 10.1016/j.jtbi.2014.11.010. Epub 2014 Nov 20.
3
iAPSL-IF: Identification of Apoptosis Protein Subcellular Location Using Integrative Features Captured from Amino Acid Sequences.iAPSL-IF:利用从氨基酸序列中提取的综合特征识别细胞凋亡蛋白亚细胞定位。
Int J Mol Sci. 2018 Apr 13;19(4):1190. doi: 10.3390/ijms19041190.
4
Subcellular location prediction of apoptosis proteins using two novel feature extraction methods based on evolutionary information and LDA.基于进化信息和 LDA 的两种新特征提取方法对凋亡蛋白的亚细胞定位预测
BMC Bioinformatics. 2020 May 24;21(1):212. doi: 10.1186/s12859-020-3539-1.
5
Predicting apoptosis protein subcellular localization by integrating auto-cross correlation and PSSM into Chou's PseAAC.通过将自相关和 PSSM 整合到 Chou 的 PseAAC 中,预测细胞凋亡蛋白的亚细胞定位。
J Theor Biol. 2018 Nov 14;457:163-169. doi: 10.1016/j.jtbi.2018.08.042. Epub 2018 Sep 1.
6
Prediction of protein subcellular localization with oversampling approach and Chou's general PseAAC.基于过采样方法和周式广义伪氨基酸组成预测蛋白质亚细胞定位
J Theor Biol. 2018 Jan 21;437:239-250. doi: 10.1016/j.jtbi.2017.10.030. Epub 2017 Oct 31.
7
Predicting subcellular location of apoptosis proteins with pseudo amino acid composition: approach from amino acid substitution matrix and auto covariance transformation.基于伪氨基酸组成预测细胞凋亡蛋白的亚细胞定位:从氨基酸替换矩阵和自协方差变换角度的方法。
Amino Acids. 2012 May;42(5):1619-25. doi: 10.1007/s00726-011-0848-8. Epub 2011 Feb 23.
8
Subcellular localization prediction of apoptosis proteins based on evolutionary information and support vector machine.基于进化信息和支持向量机的凋亡蛋白亚细胞定位预测
Artif Intell Med. 2017 May;78:41-46. doi: 10.1016/j.artmed.2017.05.007. Epub 2017 May 24.
9
Prediction of Apoptosis Protein Subcellular Localization with Multilayer Sparse Coding and Oversampling Approach.基于多层稀疏编码和过采样方法预测细胞凋亡蛋白亚细胞定位。
Biomed Res Int. 2019 Jan 30;2019:2436924. doi: 10.1155/2019/2436924. eCollection 2019.
10
Prediction of Protein Subcellular Localization Based on Fusion of Multi-view Features.基于多视图特征融合的蛋白质亚细胞定位预测。
Molecules. 2019 Mar 6;24(5):919. doi: 10.3390/molecules24050919.

引用本文的文献

1
A personalized mRNA signature for predicting hypertrophic cardiomyopathy applying machine learning methods.应用机器学习方法的预测肥厚型心肌病的个性化 mRNA 特征。
Sci Rep. 2024 Jul 24;14(1):17023. doi: 10.1038/s41598-024-67201-8.
2
Identification of intelligence-related proteins through a robust two-layer predictor.通过强大的双层预测器鉴定与智力相关的蛋白质。
Commun Integr Biol. 2022 Nov 15;15(1):253-264. doi: 10.1080/19420889.2022.2143101. eCollection 2022.
3
Identification of diagnostic mRNA biomarkers in whole blood for ankylosing spondylitis using WGCNA and machine learning feature selection.

本文引用的文献

1
Prediction of Protein Subcellular Localization Based on Fusion of Multi-view Features.基于多视图特征融合的蛋白质亚细胞定位预测。
Molecules. 2019 Mar 6;24(5):919. doi: 10.3390/molecules24050919.
2
Prediction of apoptosis protein subcellular localization via heterogeneous features and hierarchical extreme learning machine.通过异质特征和分层极限学习机预测细胞凋亡蛋白亚细胞定位。
SAR QSAR Environ Res. 2019 Mar;30(3):209-228. doi: 10.1080/1062936X.2019.1576222. Epub 2019 Feb 26.
3
Empirical comparison and analysis of web-based cell-penetrating peptide prediction tools.
使用 WGCNA 和机器学习特征选择技术在全血中鉴定强直性脊柱炎的诊断性 mRNA 生物标志物。
Front Immunol. 2022 Sep 12;13:956027. doi: 10.3389/fimmu.2022.956027. eCollection 2022.
基于网络的细胞穿透肽预测工具的实证比较和分析。
Brief Bioinform. 2020 Mar 23;21(2):408-420. doi: 10.1093/bib/bby124.
4
mAHTPred: a sequence-based meta-predictor for improving the prediction of anti-hypertensive peptides using effective feature representation.mAHTPred:一种基于序列的元预测器,用于使用有效的特征表示来提高抗高血压肽的预测。
Bioinformatics. 2019 Aug 15;35(16):2757-2765. doi: 10.1093/bioinformatics/bty1047.
5
iGHBP: Computational identification of growth hormone binding proteins from sequences using extremely randomised tree.iGHBP:使用极端随机树从序列中对生长激素结合蛋白进行计算识别。
Comput Struct Biotechnol J. 2018 Oct 24;16:412-420. doi: 10.1016/j.csbj.2018.10.007. eCollection 2018.
6
Exploring sequence-based features for the improved prediction of DNA N4-methylcytosine sites in multiple species.探索基于序列的特征,以提高在多个物种中预测 DNA N4-甲基胞嘧啶位点的能力。
Bioinformatics. 2019 Apr 15;35(8):1326-1333. doi: 10.1093/bioinformatics/bty824.
7
Predicting apoptosis protein subcellular localization by integrating auto-cross correlation and PSSM into Chou's PseAAC.通过将自相关和 PSSM 整合到 Chou 的 PseAAC 中,预测细胞凋亡蛋白的亚细胞定位。
J Theor Biol. 2018 Nov 14;457:163-169. doi: 10.1016/j.jtbi.2018.08.042. Epub 2018 Sep 1.
8
M6APred-EL: A Sequence-Based Predictor for Identifying N6-methyladenosine Sites Using Ensemble Learning.M6APred-EL:一种基于序列的使用集成学习识别N6-甲基腺苷位点的预测器。
Mol Ther Nucleic Acids. 2018 Sep 7;12:635-644. doi: 10.1016/j.omtn.2018.07.004. Epub 2018 Jul 9.
9
Prediction of subcellular location of apoptosis proteins by incorporating PsePSSM and DCCA coefficient based on LFDA dimensionality reduction.基于 LFDA 降维的 PsePSSM 和 DCCA 系数融合预测细胞凋亡蛋白的亚细胞定位。
BMC Genomics. 2018 Jun 19;19(1):478. doi: 10.1186/s12864-018-4849-9.
10
ACPred-FL: a sequence-based predictor using effective feature representation to improve the prediction of anti-cancer peptides.ACPred-FL:一种基于序列的预测器,使用有效的特征表示来提高抗癌肽的预测能力。
Bioinformatics. 2018 Dec 1;34(23):4007-4016. doi: 10.1093/bioinformatics/bty451.