Recognition of Protein Pupylation Sites by Adopting Resampling Approach.

Affiliations

School of Transportation Management, Dalian Maritime University, Dalian 116026, China.

China Waterborne Transport Research Institute, Beijing 100088, China.

Publication information

Molecules. 2018 Nov 27;23(12):3097. doi: 10.3390/molecules23123097.

DOI:10.3390/molecules23123097

PMID:30486421

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6321382/

Abstract

With the in-depth study of posttranslational modification sites, protein ubiquitination has become a key problem in studying the molecular mechanism of posttranslational modification. Pupylation is a widely used process in which a prokaryotic ubiquitin-like protein (Pup) is attached to a substrate through a series of biochemical reactions. However, experimental methods for identifying pupylation sites are often time-consuming and laborious. This study proposes an improved approach for predicting pupylation sites. First, the Pearson correlation coefficient was used to reflect the correlation among different amino acid pairs, calculated from the frequency of each amino acid. Second, the multiple types of features were ranked in descending order and filtered separately by their Pearson correlation coefficient values. Third, to obtain a balanced dataset, the K-means principal component analysis (KPCA) oversampling technique was employed to synthesize new positive samples, and a fuzzy undersampling method was employed to reduce the number of negative samples. Finally, the performance of the method was verified with jackknife and 10-fold cross-validation tests. Averaged over the 10-fold cross-validation, the sensitivity (Sn) was 90.53%, the specificity (Sp) was 99.8%, the accuracy (Acc) was 95.09%, and the Matthews correlation coefficient (MCC) was 0.91. Moreover, on an independent test dataset the method achieved an Acc of 83.75% and an MCC of 0.49, which was superior to previous predictors. The better performance and stability of the proposed method show that it is an effective way to predict pupylation sites.
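
The abstract reports Sn, Sp, Acc, and MCC but does not spell out their formulas. The sketch below uses the standard confusion-matrix definitions of these four measures; it is an illustration only, not the authors' code. The function name `classification_metrics` and the example counts are hypothetical and are not taken from the paper.

```python
import math


def classification_metrics(tp, tn, fp, fn):
    """Compute Sn, Sp, Acc and MCC from confusion-matrix counts.

    tp/fn: positive (pupylated) sites predicted correctly/incorrectly.
    tn/fp: negative sites predicted correctly/incorrectly.
    Standard textbook definitions; assumed, not quoted from the paper.
    """
    total = tp + tn + fp + fn
    # Sensitivity: fraction of true positive sites that are recovered.
    sn = tp / (tp + fn) if (tp + fn) else 0.0
    # Specificity: fraction of true negative sites that are rejected.
    sp = tn / (tn + fp) if (tn + fp) else 0.0
    # Accuracy: fraction of all sites classified correctly.
    acc = (tp + tn) / total if total else 0.0
    # Matthews correlation coefficient; defined as 0 when any marginal is empty.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"Sn": sn, "Sp": sp, "Acc": acc, "MCC": mcc}


# Hypothetical counts chosen only to exercise the function.
metrics = classification_metrics(tp=90, tn=95, fp=5, fn=10)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

MCC is worth reporting next to Acc because it accounts for all four cells of the confusion matrix, which matters when negative sites greatly outnumber positive ones, the imbalance that the resampling step in the abstract is meant to address.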

Similar articles

Recognition of Protein Pupylation Sites by Adopting Resampling Approach.

Molecules. 2018 Nov 27;23(12):3097. doi: 10.3390/molecules23123097.

Computational Identification of Protein Pupylation Sites by Using Profile-Based Composition of k-Spaced Amino Acid Pairs.

PLoS One. 2015 Jun 16;10(6):e0129635. doi: 10.1371/journal.pone.0129635. eCollection 2015.

Prediction of pupylation sites using the composition of k-spaced amino acid pairs.

J Theor Biol. 2013 Nov 7;336:11-7. doi: 10.1016/j.jtbi.2013.07.009. Epub 2013 Jul 18.

Pupylation as a signal for proteasomal degradation in bacteria.

Biochim Biophys Acta. 2014 Jan;1843(1):103-13. doi: 10.1016/j.bbamcr.2013.03.022. Epub 2013 Apr 2.

GPS-PUP: computational prediction of pupylation sites in prokaryotic proteins.

Mol Biosyst. 2011 Oct;7(10):2737-40. doi: 10.1039/c1mb05217a. Epub 2011 Aug 18.

Systematic analysis and prediction of pupylation sites in prokaryotic proteins.

PLoS One. 2013 Sep 3;8(9):e74002. doi: 10.1371/journal.pone.0074002. eCollection 2013.

Positive-Unlabeled Learning for Pupylation Sites Prediction.

Biomed Res Int. 2016;2016:4525786. doi: 10.1155/2016/4525786. Epub 2016 Aug 7.

Predicting pupylation sites in prokaryotic proteins using semi-supervised self-training support vector machine algorithm.

Anal Biochem. 2016 Aug 15;507:1-6. doi: 10.1016/j.ab.2016.05.005. Epub 2016 May 16.

Genetic and Proteomic Analyses of Pupylation in Streptomyces coelicolor.

J Bacteriol. 2015 Sep;197(17):2747-53. doi: 10.1128/JB.00302-15. Epub 2015 Jun 1.

Prokaryotic ubiquitin-like protein remains intrinsically disordered when covalently attached to proteasomal target proteins.

BMC Struct Biol. 2017 Feb 1;17(1):1. doi: 10.1186/s12900-017-0072-1.

Cited by

Identifying Pupylation Proteins and Sites by Incorporating Multiple Methods.

Front Endocrinol (Lausanne). 2022 Apr 26;13:849549. doi: 10.3389/fendo.2022.849549. eCollection 2022.

PUP-Fuse: Prediction of Protein Pupylation Sites by Integrating Multiple Sequence Representations.

Int J Mol Sci. 2021 Feb 20;22(4):2120. doi: 10.3390/ijms22042120.
