• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于改进的伪位置特异性评分矩阵的整合不同特征预测细胞凋亡蛋白

Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix.

机构信息

School of Information Science and Engineering, Yunnan University, Kunming 650504, China.

出版信息

Biomed Res Int. 2020 Jan 14;2020:4071508. doi: 10.1155/2020/4071508. eCollection 2020.

DOI:10.1155/2020/4071508
PMID:32420339
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7201498/
Abstract

Apoptosis proteins are strongly related to many diseases and play an indispensable role in maintaining the dynamic balance between cell death and division . Obtaining localization information on apoptosis proteins is necessary in understanding their function. To date, few researchers have focused on the problem of apoptosis data imbalance before classification, while this data imbalance is prone to misclassification. Therefore, in this work, we introduce a method to resolve this problem and to enhance prediction accuracy. Firstly, the features of the protein sequence are captured by combining Improving Pseudo-Position-Specific Scoring Matrix (IM-Psepssm) with the Bidirectional Correlation Coefficient (Bid-CC) algorithm from position-specific scoring matrix. Secondly, different features of fusion and resampling strategies are used to reduce the impact of imbalance on apoptosis protein datasets. Finally, the eigenvector adopts the Support Vector Machine (SVM) to the training classification model, and the prediction accuracy is evaluated by jackknife cross-validation tests. The experimental results indicate that, under the same feature vector, adopting resampling methods remarkably boosts many significant indicators in the unsampling method for predicting the localization of apoptosis proteins in the ZD98, ZW225, and CL317 databases. Additionally, we also present new user-friendly local software for readers to apply; the codes and software can be freely accessed at https://github.com/ruanxiaoli/Im-Psepssm.

摘要

凋亡蛋白与许多疾病密切相关,在维持细胞死亡和分裂的动态平衡中起着不可或缺的作用。获得凋亡蛋白的定位信息对于理解其功能是必要的。迄今为止,很少有研究人员关注分类前凋亡数据不平衡的问题,而这种数据不平衡容易导致分类错误。因此,在这项工作中,我们引入了一种解决这个问题并提高预测准确性的方法。首先,通过将改进的伪位置特异性评分矩阵(IM-Psepssm)与来自位置特异性评分矩阵的双向相关系数(Bid-CC)算法相结合,捕获蛋白质序列的特征。其次,采用不同的融合和重采样策略的特征来减少不平衡对凋亡蛋白数据集的影响。最后,特征向量采用支持向量机(SVM)对训练分类模型进行训练,并通过Jackknife 交叉验证测试评估预测精度。实验结果表明,在相同的特征向量下,采用重采样方法可以显著提高在 ZD98、ZW225 和 CL317 数据库中预测凋亡蛋白定位的无采样方法中的许多重要指标。此外,我们还为读者提供了新的用户友好的本地软件;代码和软件可在 https://github.com/ruanxiaoli/Im-Psepssm 上免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5e2/7201498/c44000bbd816/BMRI2020-4071508.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5e2/7201498/8492edc9bd2b/BMRI2020-4071508.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5e2/7201498/3e8a526b1c3f/BMRI2020-4071508.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5e2/7201498/1535a0dd5e9c/BMRI2020-4071508.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5e2/7201498/a25632b9fc56/BMRI2020-4071508.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5e2/7201498/c44000bbd816/BMRI2020-4071508.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5e2/7201498/8492edc9bd2b/BMRI2020-4071508.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5e2/7201498/3e8a526b1c3f/BMRI2020-4071508.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5e2/7201498/1535a0dd5e9c/BMRI2020-4071508.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5e2/7201498/a25632b9fc56/BMRI2020-4071508.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b5e2/7201498/c44000bbd816/BMRI2020-4071508.005.jpg

相似文献

1
Predictions of Apoptosis Proteins by Integrating Different Features Based on Improving Pseudo-Position-Specific Scoring Matrix.基于改进的伪位置特异性评分矩阵的整合不同特征预测细胞凋亡蛋白
Biomed Res Int. 2020 Jan 14;2020:4071508. doi: 10.1155/2020/4071508. eCollection 2020.
2
iAPSL-IF: Identification of Apoptosis Protein Subcellular Location Using Integrative Features Captured from Amino Acid Sequences.iAPSL-IF:利用从氨基酸序列中提取的综合特征识别细胞凋亡蛋白亚细胞定位。
Int J Mol Sci. 2018 Apr 13;19(4):1190. doi: 10.3390/ijms19041190.
3
Prediction of apoptosis protein subcellular location based on position-specific scoring matrix and isometric mapping algorithm.基于位置特异性评分矩阵和等距映射算法预测凋亡蛋白亚细胞定位。
Med Biol Eng Comput. 2019 Dec;57(12):2553-2565. doi: 10.1007/s11517-019-02045-3. Epub 2019 Oct 16.
4
Prediction of protein subcellular localization with oversampling approach and Chou's general PseAAC.基于过采样方法和周式广义伪氨基酸组成预测蛋白质亚细胞定位
J Theor Biol. 2018 Jan 21;437:239-250. doi: 10.1016/j.jtbi.2017.10.030. Epub 2017 Oct 31.
5
Prediction of Apoptosis Protein's Subcellular Localization by Fusing Two Different Descriptors Based on Evolutionary Information.基于进化信息融合两种不同描述符预测凋亡蛋白的亚细胞定位
Acta Biotheor. 2018 Mar;66(1):61-78. doi: 10.1007/s10441-018-9319-x. Epub 2018 Mar 12.
6
Prediction of subcellular location of apoptosis proteins by incorporating PsePSSM and DCCA coefficient based on LFDA dimensionality reduction.基于 LFDA 降维的 PsePSSM 和 DCCA 系数融合预测细胞凋亡蛋白的亚细胞定位。
BMC Genomics. 2018 Jun 19;19(1):478. doi: 10.1186/s12864-018-4849-9.
7
Subcellular location prediction of apoptosis proteins using two novel feature extraction methods based on evolutionary information and LDA.基于进化信息和 LDA 的两种新特征提取方法对凋亡蛋白的亚细胞定位预测
BMC Bioinformatics. 2020 May 24;21(1):212. doi: 10.1186/s12859-020-3539-1.
8
Predicting protein submitochondrial locations by incorporating the pseudo-position specific scoring matrix into the general Chou's pseudo-amino acid composition.通过将伪位置特异性评分矩阵纳入广义 Chou 的伪氨基酸组成来预测蛋白质亚线粒体位置。
J Theor Biol. 2018 Aug 7;450:86-103. doi: 10.1016/j.jtbi.2018.04.026. Epub 2018 Apr 18.
9
Detrended cross-correlation coefficient: Application to predict apoptosis protein subcellular localization.去趋势互相关系数:在预测凋亡蛋白亚细胞定位中的应用。
Math Biosci. 2016 Dec;282:61-67. doi: 10.1016/j.mbs.2016.09.019. Epub 2016 Oct 6.
10
CrystalM: A Multi-View Fusion Approach for Protein Crystallization Prediction.CrystalM:一种用于蛋白质结晶预测的多视图融合方法。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):325-335. doi: 10.1109/TCBB.2019.2912173. Epub 2021 Feb 3.

引用本文的文献

1
Protein subcellular localization prediction tools.蛋白质亚细胞定位预测工具。
Comput Struct Biotechnol J. 2024 Apr 15;23:1796-1807. doi: 10.1016/j.csbj.2024.04.032. eCollection 2024 Dec.

本文引用的文献

1
Prediction of apoptosis protein subcellular location based on position-specific scoring matrix and isometric mapping algorithm.基于位置特异性评分矩阵和等距映射算法预测凋亡蛋白亚细胞定位。
Med Biol Eng Comput. 2019 Dec;57(12):2553-2565. doi: 10.1007/s11517-019-02045-3. Epub 2019 Oct 16.
2
Prediction of Apoptosis Protein Subcellular Localization with Multilayer Sparse Coding and Oversampling Approach.基于多层稀疏编码和过采样方法预测细胞凋亡蛋白亚细胞定位。
Biomed Res Int. 2019 Jan 30;2019:2436924. doi: 10.1155/2019/2436924. eCollection 2019.
3
An ensemble method for multi-type Gram-negative bacterial secreted protein prediction by integrating different PSSM-based features.
基于不同 PSSM 特征整合的多重革兰氏阴性菌分泌蛋白预测的集成方法
SAR QSAR Environ Res. 2019 Mar;30(3):181-194. doi: 10.1080/1062936X.2019.1573438. Epub 2019 Feb 11.
4
Statistical principle-based approach for gene and protein related object recognition.基于统计原理的基因和蛋白质相关对象识别方法。
J Cheminform. 2018 Dec 17;10(1):64. doi: 10.1186/s13321-018-0314-7.
5
Predicting apoptosis protein subcellular localization by integrating auto-cross correlation and PSSM into Chou's PseAAC.通过将自相关和 PSSM 整合到 Chou 的 PseAAC 中,预测细胞凋亡蛋白的亚细胞定位。
J Theor Biol. 2018 Nov 14;457:163-169. doi: 10.1016/j.jtbi.2018.08.042. Epub 2018 Sep 1.
6
XGBFEMF: An XGBoost-Based Framework for Essential Protein Prediction.XGBFEMF:基于 XGBoost 的必需蛋白预测框架。
IEEE Trans Nanobioscience. 2018 Jul;17(3):243-250. doi: 10.1109/TNB.2018.2842219. Epub 2018 May 31.
7
Prediction of subcellular location of apoptosis proteins by incorporating PsePSSM and DCCA coefficient based on LFDA dimensionality reduction.基于 LFDA 降维的 PsePSSM 和 DCCA 系数融合预测细胞凋亡蛋白的亚细胞定位。
BMC Genomics. 2018 Jun 19;19(1):478. doi: 10.1186/s12864-018-4849-9.
8
Amino acid compositions contribute to the proteins' evolution under the influence of their abundances and genomic GC content.氨基酸组成受其丰度和基因组 GC 含量的影响,对蛋白质的进化有贡献。
Sci Rep. 2018 May 9;8(1):7382. doi: 10.1038/s41598-018-25364-1.
9
Prediction of citrullination sites by incorporating k-spaced amino acid pairs into Chou's general pseudo amino acid composition.通过将 k 间隔氨基酸对纳入周元的通用伪氨基酸组成来预测瓜氨酸化位点。
Gene. 2018 Jul 20;664:78-83. doi: 10.1016/j.gene.2018.04.055. Epub 2018 Apr 23.
10
Cell-free prediction of protein expression costs for growing cells.无细胞预测生长细胞的蛋白质表达成本。
Nat Commun. 2018 Apr 13;9(1):1457. doi: 10.1038/s41467-018-03970-x.