• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

计算预测拟南芥蛋白质泛素化位点。

Computational prediction of protein ubiquitination sites mapping on Arabidopsis thaliana.

机构信息

Bioinformatics Lab., Department of Statistics, Rajshahi University, Rajshahi, 6205, Bangladesh.

Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan.

出版信息

Comput Biol Chem. 2020 Apr;85:107238. doi: 10.1016/j.compbiolchem.2020.107238. Epub 2020 Feb 19.

DOI:10.1016/j.compbiolchem.2020.107238
PMID:32114285
Abstract

Among the protein post-translational modifications (PTMs), ubiquitination is considered as one of the most significant processes which can regulate the cellular functions and various diseases. Identification of ubiquitination sites becomes important for understanding the mechanisms of ubiquitination-related biological processes. Both experimental and computational approaches are available for identifying ubiquitination sites based on protein sequences of different species. The experimental approaches are time-consuming, laborious and costly. In silico prediction is an alternative time saving, easier and cost-effective approach for identifying ubiquitination sites. Moreover, the sequence patterns in the different species around the ubiquitination sites are not similar which demands species-specific predictors. Therefore, in this study, we have proposed a novel computational method for identifying ubiquitination sites based on protein sequences of A. thaliana species which will be robust against outlying observations also. Through the comparative study of two encoding schemes and three classifiers, the random forest (RF) based predictor was selected as the best predictor under the CKSAAP encoding scheme with 1:1 ratio of positive and negative samples (i.e. ubiquitinated and non-ubiquitinated) in training dataset. The proposed predictor produced the area under the ROC curve (AUC score) as 0.91 and 0.86 for 5-fold cross-validation test with the training dataset and the independent test dataset of A. thaliana respectively. The proposed RF based predictor also performed much better than the other existing ubiquitination sites predictors for A. thaliana.

摘要

在蛋白质翻译后修饰(PTMs)中,泛素化被认为是可以调节细胞功能和各种疾病的最重要过程之一。鉴定泛素化位点对于理解泛素化相关生物过程的机制变得非常重要。基于不同物种的蛋白质序列,有实验和计算两种方法可用于鉴定泛素化位点。实验方法既耗时、费力又昂贵。基于序列的计算预测是一种替代方法,可以节省时间、更容易且具有成本效益,可用于鉴定泛素化位点。此外,泛素化位点周围不同物种的序列模式并不相似,这需要特定于物种的预测器。因此,在这项研究中,我们提出了一种基于拟南芥物种蛋白质序列鉴定泛素化位点的新的计算方法,该方法还具有稳健性,可抵抗异常值。通过对两种编码方案和三种分类器的比较研究,基于 CKSAAP 编码方案的随机森林(RF)预测器被选为阳性和阴性样本(即泛素化和非泛素化)比例为 1:1 的训练数据集的最佳预测器。该预测器在 5 折交叉验证测试中,对训练数据集和拟南芥的独立测试数据集的 AUC 分数分别为 0.91 和 0.86。该 RF 预测器也比其他现有的拟南芥泛素化位点预测器表现更好。

相似文献

1
Computational prediction of protein ubiquitination sites mapping on Arabidopsis thaliana.计算预测拟南芥蛋白质泛素化位点。
Comput Biol Chem. 2020 Apr;85:107238. doi: 10.1016/j.compbiolchem.2020.107238. Epub 2020 Feb 19.
2
UbNiRF: A Hybrid Framework Based on Null Importances and Random Forest that Combines Multiple Features to Predict Ubiquitination Sites in and .UbNiRF:一种基于空重要性和随机森林的混合框架,它结合多种特征来预测[具体内容缺失]中的泛素化位点。
Front Biosci (Landmark Ed). 2024 May 21;29(5):197. doi: 10.31083/j.fbl2905197.
3
hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties.hCKSAAP_UbSite:通过利用氨基酸模式和特性改进对人泛素化位点的预测。
Biochim Biophys Acta. 2013 Aug;1834(8):1461-7. doi: 10.1016/j.bbapap.2013.04.006. Epub 2013 Apr 19.
4
Computational identification of ubiquitination sites in Arabidopsis thaliana using convolutional neural networks.利用卷积神经网络对拟南芥泛素化位点进行计算识别。
Plant Mol Biol. 2021 Apr;105(6):601-610. doi: 10.1007/s11103-020-01112-w. Epub 2021 Feb 1.
5
Prediction of serine phosphorylation sites mapping on Schizosaccharomyces Pombe by fusing three encoding schemes with the random forest classifier.通过将三种编码方案与随机森林分类器融合,预测丝氨酸磷酸化位点在裂殖酵母中的映射。
Sci Rep. 2022 Feb 16;12(1):2632. doi: 10.1038/s41598-022-06529-5.
6
PseAraUbi: predicting arabidopsis ubiquitination sites by incorporating the physico-chemical and structural features.PseAraUbi:通过整合物理化学和结构特征预测拟南芥泛素化位点。
Plant Mol Biol. 2022 Sep;110(1-2):81-92. doi: 10.1007/s11103-022-01288-3. Epub 2022 Jul 1.
7
Prediction of ubiquitination sites by using the composition of k-spaced amino acid pairs.基于 k 间隔氨基酸对组成的泛素化位点预测。
PLoS One. 2011;6(7):e22930. doi: 10.1371/journal.pone.0022930. Epub 2011 Jul 29.
8
Prediction of mucin-type O-glycosylation sites in mammalian proteins using the composition of k-spaced amino acid pairs.利用k间隔氨基酸对的组成预测哺乳动物蛋白质中的粘蛋白型O-糖基化位点
BMC Bioinformatics. 2008 Feb 18;9:101. doi: 10.1186/1471-2105-9-101.
9
Prediction of phosphorylation sites based on Krawtchouk image moments.基于克劳特楚克图像矩的磷酸化位点预测。
Proteins. 2017 Dec;85(12):2231-2238. doi: 10.1002/prot.25388. Epub 2017 Sep 29.
10
PrUb-EL: A hybrid framework based on deep learning for identifying ubiquitination sites in Arabidopsis thaliana using ensemble learning strategy.PrUb-EL:一种基于深度学习的混合框架,使用集成学习策略识别拟南芥中的泛素化位点。
Anal Biochem. 2022 Dec 1;658:114935. doi: 10.1016/j.ab.2022.114935. Epub 2022 Oct 4.

引用本文的文献

1
Comprehensive bioinformatics and machine learning analyses for breast cancer staging using TCGA dataset.使用TCGA数据集进行乳腺癌分期的综合生物信息学和机器学习分析。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae628.
2
Novel immune cross-talk between inflammatory bowel disease and IgA nephropathy.炎症性肠病与 IgA 肾病之间新型免疫交叉对话。
Ren Fail. 2024 Dec;46(1):2337288. doi: 10.1080/0886022X.2024.2337288. Epub 2024 Apr 17.
3
HNetGO: protein function prediction via heterogeneous network transformer.HNetGO:基于异质网络转换器的蛋白质功能预测。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbab556.
4
Network-Based Data Analysis Reveals Ion Channel-Related Gene Features in COVID-19: A Bioinformatic Approach.基于网络的数据分析揭示 COVID-19 中与离子通道相关的基因特征:一种生物信息学方法。
Biochem Genet. 2023 Apr;61(2):471-505. doi: 10.1007/s10528-022-10280-x. Epub 2022 Sep 14.
5
PseAraUbi: predicting arabidopsis ubiquitination sites by incorporating the physico-chemical and structural features.PseAraUbi:通过整合物理化学和结构特征预测拟南芥泛素化位点。
Plant Mol Biol. 2022 Sep;110(1-2):81-92. doi: 10.1007/s11103-022-01288-3. Epub 2022 Jul 1.
6
Prediction of serine phosphorylation sites mapping on Schizosaccharomyces Pombe by fusing three encoding schemes with the random forest classifier.通过将三种编码方案与随机森林分类器融合,预测丝氨酸磷酸化位点在裂殖酵母中的映射。
Sci Rep. 2022 Feb 16;12(1):2632. doi: 10.1038/s41598-022-06529-5.
7
AoP-LSE: Antioxidant Proteins Classification Using Deep Latent Space Encoding of Sequence Features.AoP-LSE:基于序列特征的深度潜在空间编码的抗氧化蛋白分类。
Curr Issues Mol Biol. 2021 Oct 9;43(3):1489-1501. doi: 10.3390/cimb43030105.
8
Understanding and Exploiting Post-Translational Modifications for Plant Disease Resistance.理解和利用翻译后修饰来增强植物的抗病性。
Biomolecules. 2021 Jul 30;11(8):1122. doi: 10.3390/biom11081122.
9
An Improved Computational Prediction Model for Lysine Succinylation Sites Mapping on Fusing Three Sequence Encoding Schemes with the Random Forest Classifier.一种改进的计算预测模型,用于通过融合三种序列编码方案与随机森林分类器来映射赖氨酸琥珀酰化位点
Curr Genomics. 2021 Feb;22(2):122-136. doi: 10.2174/1389202922666210219114211.
10
UbiComb: A Hybrid Deep Learning Model for Predicting Plant-Specific Protein Ubiquitylation Sites.UbiComb:一种用于预测植物特异性蛋白质泛素化位点的混合深度学习模型。
Genes (Basel). 2021 May 11;12(5):717. doi: 10.3390/genes12050717.