PrUb-EL: A hybrid framework based on deep learning for identifying ubiquitination sites in Arabidopsis thaliana using ensemble learning strategy.

Authors

Wang Houqiang, Li Hong, Gao Weifeng, Xie Jin

Affiliation

School of Mathematics and Statistics, Xidian University, Xi'an, 710071, PR China.

Publication information

Anal Biochem. 2022 Dec 1;658:114935. doi: 10.1016/j.ab.2022.114935. Epub 2022 Oct 4.

DOI:10.1016/j.ab.2022.114935
PMID:36206844
Abstract

Identification of ubiquitination sites is central to many biological experiments. Ubiquitination is a post-translational modification (PTM) of proteins; it is a key mechanism for increasing protein diversity and plays a vital role in regulating cell function. In recent years, many models have been developed to predict ubiquitination sites in humans, mice and yeast, but few studies have predicted ubiquitination sites in Arabidopsis thaliana. To address this, a deep network model named PrUb-EL is proposed to predict ubiquitination sites in Arabidopsis thaliana. First, six features based on the protein sequence are extracted using the amino acid index database (AAindex), dipeptide deviation from expected mean (DDE), dipeptide composition (DPC), the blocks substitution matrix (BLOSUM62), enhanced amino acid composition (EAAC) and binary encoding. Second, the synthetic minority over-sampling technique (SMOTE) is used to process the imbalanced data set. Then a new classifier named DG is presented, which comprises a Dense block, a Residual block and a gated recurrent unit (GRU) block. Finally, each of the six feature extraction methods is integrated into the DG model, and an ensemble learning strategy is used to obtain the final prediction. Experimental results show that PrUb-EL has good predictive ability, with an accuracy (ACC) of 91.00% and an area under the ROC curve (auROC) of 97.70% under 5-fold cross-validation. On the independent test, ACC and auROC are 88.58% and 96.09%, respectively. Compared with previous studies, the model shows significantly improved performance, making it an excellent method for identifying ubiquitination sites in Arabidopsis thaliana. The datasets and code used for the article are available at https://github.com/Tom-Wangy/PreUb-EL.git.
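
To make the pipeline in the abstract more concrete, the sketch below illustrates two of the six encodings (binary encoding and DPC) and one possible way to combine per-feature model outputs. It is not the authors' implementation: the function names, the treatment of non-standard residues such as a padding `X`, the idea of one model per feature type, and the use of simple probability averaging as the ensemble rule are all assumptions, since the abstract does not specify them. The repository linked above holds the actual code.

```python
from itertools import product

# The 20 standard amino acids; any other character (e.g. a padding "X")
# is treated as unknown. This handling is an assumption for illustration.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]


def binary_encode(window):
    """Binary (one-hot) encoding: 20 values per residue, all zeros if unknown."""
    vector = []
    for residue in window:
        vector.extend(1.0 if residue == aa else 0.0 for aa in AMINO_ACIDS)
    return vector


def dpc_encode(window):
    """Dipeptide composition: relative frequency of each of the 400 ordered pairs.

    Pairs containing an unknown residue are skipped, and frequencies are
    normalised by the number of valid pairs (an assumption of this sketch).
    """
    counts = dict.fromkeys(DIPEPTIDES, 0)
    total = 0
    for first, second in zip(window, window[1:]):
        pair = first + second
        if pair in counts:
            counts[pair] += 1
            total += 1
    return [counts[d] / total if total else 0.0 for d in DIPEPTIDES]


def soft_vote(probabilities, threshold=0.5):
    """Hypothetical ensemble rule: average the positive-class probabilities
    produced by the per-feature models, then apply a decision threshold."""
    if not probabilities:
        raise ValueError("need at least one model probability")
    mean = sum(probabilities) / len(probabilities)
    return mean, mean >= threshold


if __name__ == "__main__":
    fragment = "MKTAYIAKQRQ"  # made-up sequence window, not from the dataset
    print(len(binary_encode(fragment)), len(dpc_encode(fragment)))
    # Made-up probabilities standing in for the outputs of three models.
    print(soft_vote([0.9, 0.6, 0.3]))
```

In a full pipeline each encoding would feed a trained classifier, and oversampling such as SMOTE would be applied to the training split only so that synthetic samples do not leak into evaluation.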

Cited by

1
KD_MultiSucc: incorporating multi-teacher knowledge distillation and word embeddings for cross-species prediction of protein succinylation sites.
Biol Methods Protoc. 2025 May 28;10(1):bpaf041. doi: 10.1093/biomethods/bpaf041. eCollection 2025.
2
OnmiMHC: a machine learning solution for UCEC tumor vaccine development through enhanced peptide-MHC binding prediction.
Front Immunol. 2025 Feb 28;16:1550252. doi: 10.3389/fimmu.2025.1550252. eCollection 2025.