PseAraUbi：通过整合物理化学和结构特征预测拟南芥泛素化位点。

PseAraUbi: predicting arabidopsis ubiquitination sites by incorporating the physico-chemical and structural features.

机构信息

College of Computer and Information Engineering, Henan Normal University, Xinxiang, 453000, China.

Key Laboratory of Artificial Intelligence and Personalized Learning in Education of Henan Province, Xinxiang, China.

出版信息

Plant Mol Biol. 2022 Sep;110(1-2):81-92. doi: 10.1007/s11103-022-01288-3. Epub 2022 Jul 1.

DOI:10.1007/s11103-022-01288-3

PMID:35773617

Abstract

We makes three kinds of important features from Arabidopsis thaliana: protein secondary structure based on the Chou-Fasman parameter, amino acids hydrophobicity and polarity information, and analyze their properties. Ubiquitination modification is an important post-translational modification of proteins, which participates in the regulation of many important life activities in cells. At present, ubiquitination proteomics research is mostly concentrated in animals and yeasts, while relatively few studies have been carried out in plants. It can be said that the calculation and prediction of Arabidopsis thaliana ubiquitination sites is still in its infancy. Based on this, we describe a calculation method, PseAraUbi (Prediction of Arabidopsis thaliana ubiquitination sites using pseudo amino acid composition), that can effectively detect ubiquitination sites on Arabidopsis thaliana using support vector machine learning classifiers. Based on protein sequence information, extract features from the Chou-Fasman parameter, amino acids hydrophobicity features, polarity information and selected for classification with the Boruta algorithm. PseAraUbi achieves promising performances with an AUC score of 0.953 with fivefold cross-validation on the training dataset, which are significantly better than that of the pioneer Arabidopsis thaliana ubiquitination sites method. We also proved the ability of our proposed method on independent test sets, thus gaining a competitive advantage. In addition, we also in-depth analyzed the physicochemical properties of amino acids in the region adjacent to the ubiquitination site. To facilitate the community, the source code, optimal feature subset, ubiquitination sites dataset in the Arbidopsis proteome are available at GitHub ( https://github.com/HNUBioinformatics/PseAraUbi.git ) for interest users.

摘要

我们从拟南芥中提取了三种重要的特征

基于 Chou-Fasman 参数的蛋白质二级结构、氨基酸疏水性和极性信息，并分析了它们的性质。泛素化修饰是蛋白质的一种重要的翻译后修饰，参与细胞中许多重要生命活动的调节。目前，泛素化蛋白质组学研究主要集中在动物和酵母中，而在植物中相对较少。可以说，拟南芥泛素化位点的计算和预测仍处于起步阶段。基于此，我们描述了一种计算方法 PseAraUbi（使用伪氨基酸组成预测拟南芥泛素化位点），该方法可以有效地使用支持向量机学习分类器检测拟南芥中的泛素化位点。基于蛋白质序列信息，从 Chou-Fasman 参数、氨基酸疏水性特征、极性信息中提取特征，并使用 Boruta 算法进行分类。PseAraUbi 在训练数据集上的五重交叉验证中获得了 0.953 的 AUC 评分，这明显优于先驱的拟南芥泛素化位点方法。我们还在独立测试集上证明了我们提出的方法的能力，从而获得了竞争优势。此外，我们还深入分析了泛素化位点附近区域氨基酸的理化性质。为了方便社区，源代码、最优特征子集、拟南芥蛋白质组中的泛素化位点数据集可在 GitHub（https://github.com/HNUBioinformatics/PseAraUbi.git）上获得，供有兴趣的用户使用。

相似文献

PseAraUbi: predicting arabidopsis ubiquitination sites by incorporating the physico-chemical and structural features.

Plant Mol Biol. 2022 Sep;110(1-2):81-92. doi: 10.1007/s11103-022-01288-3. Epub 2022 Jul 1.

Computational identification of ubiquitination sites in Arabidopsis thaliana using convolutional neural networks.

Plant Mol Biol. 2021 Apr;105(6):601-610. doi: 10.1007/s11103-020-01112-w. Epub 2021 Feb 1.

UbNiRF: A Hybrid Framework Based on Null Importances and Random Forest that Combines Multiple Features to Predict Ubiquitination Sites in and .

Front Biosci (Landmark Ed). 2024 May 21;29(5):197. doi: 10.31083/j.fbl2905197.

PrUb-EL: A hybrid framework based on deep learning for identifying ubiquitination sites in Arabidopsis thaliana using ensemble learning strategy.

Anal Biochem. 2022 Dec 1;658:114935. doi: 10.1016/j.ab.2022.114935. Epub 2022 Oct 4.

Computational methods for ubiquitination site prediction using physicochemical properties of protein sequences.

BMC Bioinformatics. 2016 Mar 3;17:116. doi: 10.1186/s12859-016-0959-z.

RFAthM6A: a new tool for predicting mA sites in Arabidopsis thaliana.

Plant Mol Biol. 2018 Feb;96(3):327-337. doi: 10.1007/s11103-018-0698-9. Epub 2018 Jan 16.

Computational prediction of protein ubiquitination sites mapping on Arabidopsis thaliana.

Comput Biol Chem. 2020 Apr;85:107238. doi: 10.1016/j.compbiolchem.2020.107238. Epub 2020 Feb 19.

Computational identification of ubiquitylation sites from protein sequences.

BMC Bioinformatics. 2008 Jul 15;9:310. doi: 10.1186/1471-2105-9-310.

Incorporating secondary features into the general form of Chou's PseAAC for predicting protein structural class.

Protein Pept Lett. 2012 Nov;19(11):1133-8. doi: 10.2174/092986612803217051.

Large-scale prediction of protein ubiquitination sites using a multimodal deep architecture.

BMC Syst Biol. 2018 Nov 22;12(Suppl 6):109. doi: 10.1186/s12918-018-0628-0.

引用本文的文献

EUP: Enhanced cross-species prediction of ubiquitination sites via a conditional variational autoencoder network based on ESM2.

PLoS Comput Biol. 2025 Jul 16;21(7):e1013268. doi: 10.1371/journal.pcbi.1013268. eCollection 2025 Jul.

本文引用的文献

E3 Ubiquitin Ligase SPL2 Is a Lanthanide-Binding Protein.

Int J Mol Sci. 2021 May 27;22(11):5712. doi: 10.3390/ijms22115712.

Predicting gene phenotype by multi-label multi-class model based on essential functional features.

Mol Genet Genomics. 2021 Jul;296(4):905-918. doi: 10.1007/s00438-021-01789-8. Epub 2021 Apr 29.

Computational identification of ubiquitination sites in Arabidopsis thaliana using convolutional neural networks.

Plant Mol Biol. 2021 Apr;105(6):601-610. doi: 10.1007/s11103-020-01112-w. Epub 2021 Feb 1.

DeepTL-Ubi: A novel deep transfer learning method for effectively predicting ubiquitination sites of multiple species.

Methods. 2021 Aug;192:103-111. doi: 10.1016/j.ymeth.2020.08.003. Epub 2020 Aug 10.

The Positively Charged Active Site of the Bacterial Toxin RelE Causes a Large Shift in the General Base p.

Biochemistry. 2020 May 5;59(17):1665-1671. doi: 10.1021/acs.biochem.9b01047. Epub 2020 Apr 24.

Strategy for Development of Site-Specific Ubiquitin Antibodies.

Front Chem. 2020 Feb 21;8:111. doi: 10.3389/fchem.2020.00111. eCollection 2020.

Computational prediction of protein ubiquitination sites mapping on Arabidopsis thaliana.

Comput Biol Chem. 2020 Apr;85:107238. doi: 10.1016/j.compbiolchem.2020.107238. Epub 2020 Feb 19.

The Role of Atypical Ubiquitin Chains in the Regulation of the Antiviral Innate Immune Response.

Front Cell Dev Biol. 2020 Jan 22;7:392. doi: 10.3389/fcell.2019.00392. eCollection 2019.

Cracking the Ubiquitin Code: The Ubiquitin Toolbox.

Curr Issues Mol Biol. 2020;37:1-20. doi: 10.21775/cimb.037.001. Epub 2019 Nov 1.

Regulation of Ubiquitination Is Central to the Phosphate Starvation Response.

Trends Plant Sci. 2019 Aug;24(8):755-769. doi: 10.1016/j.tplants.2019.05.002. Epub 2019 Jun 5.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

PseAraUbi：通过整合物理化学和结构特征预测拟南芥泛素化位点。

PseAraUbi: predicting arabidopsis ubiquitination sites by incorporating the physico-chemical and structural features.

机构信息

出版信息

我们从拟南芥中提取了三种重要的特征

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献