一种表征和鉴定蛋白质泛素化位点的新方案。

A New Scheme to Characterize and Identify Protein Ubiquitination Sites.

作者信息

Nguyen Van-Nui, Huang Kai-Yao, Huang Chien-Hsun, Lai K Robert, Lee Tzong-Yi

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2017 Mar-Apr;14(2):393-403. doi: 10.1109/TCBB.2016.2520939. Epub 2016 Feb 8.

DOI:10.1109/TCBB.2016.2520939

Abstract

Protein ubiquitination, involving the conjugation of ubiquitin on lysine residue, serves as an important modulator of many cellular functions in eukaryotes. Recent advancements in proteomic technology have stimulated increasing interest in identifying ubiquitination sites. However, most computational tools for predicting ubiquitination sites are focused on small-scale data. With an increasing number of experimentally verified ubiquitination sites, we were motivated to design a predictive model for identifying lysine ubiquitination sites for large-scale proteome dataset. This work assessed not only single features, such as amino acid composition (AAC), amino acid pair composition (AAPC) and evolutionary information, but also the effectiveness of incorporating two or more features into a hybrid approach to model construction. The support vector machine (SVM) was applied to generate the prediction models for ubiquitination site identification. Evaluation by five-fold cross-validation showed that the SVM models learned from the combination of hybrid features delivered a better prediction performance. Additionally, a motif discovery tool, MDDLogo, was adopted to characterize the potential substrate motifs of ubiquitination sites. The SVM models integrating the MDDLogo-identified substrate motifs could yield an average accuracy of 68.70 percent. Furthermore, the independent testing result showed that the MDDLogo-clustered SVM models could provide a promising accuracy (78.50 percent) and perform better than other prediction tools. Two cases have demonstrated the effective prediction of ubiquitination sites with corresponding substrate motifs.

摘要

蛋白质泛素化涉及泛素与赖氨酸残基的结合，是真核生物中许多细胞功能的重要调节因子。蛋白质组学技术的最新进展激发了人们对识别泛素化位点的越来越浓厚的兴趣。然而，大多数预测泛素化位点的计算工具都集中在小规模数据上。随着越来越多的泛素化位点通过实验得到验证，我们有动力设计一种预测模型，用于识别大规模蛋白质组数据集中的赖氨酸泛素化位点。这项工作不仅评估了单个特征，如氨基酸组成（AAC）、氨基酸对组成（AAPC）和进化信息，还评估了将两个或更多特征纳入混合方法进行模型构建的有效性。支持向量机（SVM）被用于生成泛素化位点识别的预测模型。通过五折交叉验证进行的评估表明，从混合特征组合中学习到的SVM模型具有更好的预测性能。此外，采用了一种基序发现工具MDDLogo来表征泛素化位点的潜在底物基序。整合了MDDLogo识别的底物基序的SVM模型平均准确率可达68.70%。此外，独立测试结果表明，MDDLogo聚类的SVM模型可以提供可观的准确率（78.50%），并且比其他预测工具表现更好。两个案例已经证明了对具有相应底物基序的泛素化位点的有效预测。

相似文献

A New Scheme to Characterize and Identify Protein Ubiquitination Sites.

IEEE/ACM Trans Comput Biol Bioinform. 2017 Mar-Apr;14(2):393-403. doi: 10.1109/TCBB.2016.2520939. Epub 2016 Feb 8.

UbiSite: incorporating two-layered machine learning method with substrate motifs to predict ubiquitin-conjugation site on lysines.

BMC Syst Biol. 2016 Jan 11;10 Suppl 1(Suppl 1):6. doi: 10.1186/s12918-015-0246-z.

DeepUbi: a deep learning framework for prediction of ubiquitination sites in proteins.

BMC Bioinformatics. 2019 Feb 18;20(1):86. doi: 10.1186/s12859-019-2677-9.

Characterization and identification of lysine glutarylation based on intrinsic interdependence between positions in the substrate sites.

BMC Bioinformatics. 2019 Feb 4;19(Suppl 13):384. doi: 10.1186/s12859-018-2394-9.

SOHSite: incorporating evolutionary information and physicochemical properties to identify protein S-sulfenylation sites.

BMC Genomics. 2016 Jan 11;17 Suppl 1(Suppl 1):9. doi: 10.1186/s12864-015-2299-1.

Incorporating key position and amino acid residue features to identify general and species-specific Ubiquitin conjugation sites.

Bioinformatics. 2013 Jul 1;29(13):1614-22. doi: 10.1093/bioinformatics/btt196. Epub 2013 Apr 26.

Characterization and identification of ubiquitin conjugation sites with E3 ligase recognition specificities.

BMC Bioinformatics. 2015;16 Suppl 1(Suppl 1):S1. doi: 10.1186/1471-2105-16-S1-S1. Epub 2015 Jan 21.

MDD-Palm: Identification of protein S-palmitoylation sites with substrate motifs based on maximal dependence decomposition.

PLoS One. 2017 Jun 29;12(6):e0179529. doi: 10.1371/journal.pone.0179529. eCollection 2017.

Characterization and identification of protein O-GlcNAcylation sites with substrate specificity.

BMC Bioinformatics. 2014;15 Suppl 16(Suppl 16):S1. doi: 10.1186/1471-2105-15-S16-S1. Epub 2014 Dec 8.

MDD-SOH: exploiting maximal dependence decomposition to identify S-sulfenylation sites with substrate motifs.

Bioinformatics. 2016 Jan 15;32(2):165-72. doi: 10.1093/bioinformatics/btv558. Epub 2015 Sep 26.

引用本文的文献

Predictive modeling for ubiquitin proteins through advanced machine learning technique.

Heliyon. 2024 Jun 6;10(12):e32517. doi: 10.1016/j.heliyon.2024.e32517. eCollection 2024 Jun 30.

Lysine 222 in PPAR γ1 functions as the key site of MuRF2-mediated ubiquitination modification.

Sci Rep. 2023 Feb 3;13(1):1999. doi: 10.1038/s41598-023-28905-5.

A Caps-Ubi Model for Protein Ubiquitination Site Prediction.

Front Plant Sci. 2022 May 25;13:884903. doi: 10.3389/fpls.2022.884903. eCollection 2022.

UbiComb: A Hybrid Deep Learning Model for Predicting Plant-Specific Protein Ubiquitylation Sites.

Genes (Basel). 2021 May 11;12(5):717. doi: 10.3390/genes12050717.

Incorporating Deep Learning With Word Embedding to Identify Plant Ubiquitylation Sites.

Front Cell Dev Biol. 2020 Sep 30;8:572195. doi: 10.3389/fcell.2020.572195. eCollection 2020.

DeepUbi: a deep learning framework for prediction of ubiquitination sites in proteins.

BMC Bioinformatics. 2019 Feb 18;20(1):86. doi: 10.1186/s12859-019-2677-9.

Large-scale prediction of protein ubiquitination sites using a multimodal deep architecture.

BMC Syst Biol. 2018 Nov 22;12(Suppl 6):109. doi: 10.1186/s12918-018-0628-0.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种表征和鉴定蛋白质泛素化位点的新方案。

A New Scheme to Characterize and Identify Protein Ubiquitination Sites.

作者信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献