• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用可解释的深度表格学习网络预测大豆中的蛋白质磷酸化位点。

Predicting protein phosphorylation sites in soybean using interpretable deep tabular learning network.

机构信息

Department of Plant Science, Faculty of Science, Tarbiat Modarres University, Tehran, Iran.

Department of Biophysics, Faculty of Biological Science, Tarbiat Modares University, Tehran, Iran.

出版信息

Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac015.

DOI:10.1093/bib/bbac015
PMID:35152280
Abstract

Phosphorylation of proteins is one of the most significant post-translational modifications (PTMs) and plays a crucial role in plant functionality due to its impact on signaling, gene expression, enzyme kinetics, protein stability and interactions. Accurate prediction of plant phosphorylation sites (p-sites) is vital as abnormal regulation of phosphorylation usually leads to plant diseases. However, current experimental methods for PTM prediction suffers from high-computational cost and are error-prone. The present study develops machine learning-based prediction techniques, including a high-performance interpretable deep tabular learning network (TabNet) to improve the prediction of protein p-sites in soybean. Moreover, we use a hybrid feature set of sequential-based features, physicochemical properties and position-specific scoring matrices to predict serine (Ser/S), threonine (Thr/T) and tyrosine (Tyr/Y) p-sites in soybean for the first time. The experimentally verified p-sites data of soybean proteins are collected from the eukaryotic phosphorylation sites database and database post-translational modification. We then remove the redundant set of positive and negative samples by dropping protein sequences with >40% similarity. It is found that the developed techniques perform >70% in terms of accuracy. The results demonstrate that the TabNet model is the best performing classifier using hybrid features and with window size of 13, resulted in 78.96 and 77.24% sensitivity and specificity, respectively. The results indicate that the TabNet method has advantages in terms of high-performance and interpretability. The proposed technique can automatically analyze the data without any measurement errors and any human intervention. Furthermore, it can be used to predict putative protein p-sites in plants effectively. The collected dataset and source code are publicly deposited at https://github.com/Elham-khalili/Soybean-P-sites-Prediction.

摘要

蛋白质磷酸化是最重要的翻译后修饰(PTMs)之一,由于其对信号转导、基因表达、酶动力学、蛋白质稳定性和相互作用的影响,在植物功能中起着至关重要的作用。准确预测植物磷酸化位点(p-sites)至关重要,因为磷酸化的异常调节通常会导致植物疾病。然而,目前用于 PTM 预测的实验方法存在计算成本高和易错的问题。本研究开发了基于机器学习的预测技术,包括高性能可解释的深度表格学习网络(TabNet),以提高对大豆中蛋白质 p-sites 的预测。此外,我们首次使用基于序列的特征、理化性质和位置特异性评分矩阵的混合特征集来预测大豆中的丝氨酸(Ser/S)、苏氨酸(Thr/T)和酪氨酸(Tyr/Y)p-sites。从真核磷酸化位点数据库和数据库翻译后修饰中收集大豆蛋白质的实验验证的 p-sites 数据。然后,通过删除具有>40%相似性的蛋白质序列来去除正、负样本的冗余集。结果表明,所开发的技术在准确性方面的性能超过 70%。结果表明,TabNet 模型是使用混合特征和窗口大小为 13 的最佳分类器,分别得到 78.96%和 77.24%的灵敏度和特异性。结果表明,TabNet 方法在高性能和可解释性方面具有优势。该技术可以自动分析数据,而不会产生任何测量误差或任何人为干预。此外,它可以有效地用于预测植物中的假定蛋白质 p-sites。收集的数据集和源代码在 https://github.com/Elham-khalili/Soybean-P-sites-Prediction 上公开存储。

相似文献

1
Predicting protein phosphorylation sites in soybean using interpretable deep tabular learning network.利用可解释的深度表格学习网络预测大豆中的蛋白质磷酸化位点。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac015.
2
Predicting phosphorylation sites using machine learning by integrating the sequence, structure, and functional information of proteins.利用机器学习整合蛋白质的序列、结构和功能信息来预测磷酸化位点。
J Transl Med. 2021 May 24;19(1):218. doi: 10.1186/s12967-021-02851-0.
3
DeepPPSite: A deep learning-based model for analysis and prediction of phosphorylation sites using efficient sequence information.DeepPPSite:一种基于深度学习的模型,用于利用有效的序列信息分析和预测磷酸化位点。
Anal Biochem. 2021 Jan 1;612:113955. doi: 10.1016/j.ab.2020.113955. Epub 2020 Sep 16.
4
DF-Phos: Prediction of Protein Phosphorylation Sites by Deep Forest.DF-Phos:基于深度森林的蛋白质磷酸化位点预测。
J Biochem. 2024 Mar 25;175(4):447-456. doi: 10.1093/jb/mvad116.
5
Machine Learning Techniques for Soybean Charcoal Rot Disease Prediction.用于大豆炭腐病预测的机器学习技术
Front Plant Sci. 2020 Dec 14;11:590529. doi: 10.3389/fpls.2020.590529. eCollection 2020.
6
Accurately predicting microbial phosphorylation sites using evolutionary and structural features.利用进化和结构特征准确预测微生物磷酸化位点。
Gene. 2023 Jan 30;851:146993. doi: 10.1016/j.gene.2022.146993. Epub 2022 Oct 20.
7
Boosting phosphorylation site prediction with sequence feature-based machine learning.基于序列特征的机器学习提高磷酸化位点预测。
Proteins. 2020 Feb;88(2):284-291. doi: 10.1002/prot.25801. Epub 2019 Aug 22.
8
A Review of Machine Learning and Algorithmic Methods for Protein Phosphorylation Site Prediction.机器学习和算法方法在蛋白质磷酸化位点预测中的研究进展综述
Genomics Proteomics Bioinformatics. 2023 Dec;21(6):1266-1285. doi: 10.1016/j.gpb.2023.03.007. Epub 2023 Oct 19.
9
PSSM-Sumo: deep learning based intelligent model for prediction of sumoylation sites using discriminative features.PSSM-Sumo:基于深度学习的智能模型,用于使用判别特征预测类泛素化位点。
BMC Bioinformatics. 2024 Aug 30;25(1):284. doi: 10.1186/s12859-024-05917-0.
10
Large-scale comparative assessment of computational predictors for lysine post-translational modification sites.大规模比较评估赖氨酸翻译后修饰位点的计算预测因子。
Brief Bioinform. 2019 Nov 27;20(6):2267-2290. doi: 10.1093/bib/bby089.

引用本文的文献

1
Protein modification in neurodegenerative diseases.神经退行性疾病中的蛋白质修饰
MedComm (2020). 2024 Aug 4;5(8):e674. doi: 10.1002/mco2.674. eCollection 2024 Aug.
2
Strigolactone insensitivity affects differential shoot and root transcriptome in barley.独脚金内酯不敏感影响大麦地上部和根部的差异转录组。
J Appl Genet. 2025 Feb;66(1):15-28. doi: 10.1007/s13353-024-00885-w. Epub 2024 Jun 14.
3
Analysis and review of techniques and tools based on machine learning and deep learning for prediction of lysine malonylation sites in protein sequences.
基于机器学习和深度学习的赖氨酸丙二酰化位点预测的技术和工具的分析与综述。
Database (Oxford). 2024 Jan 19;2024. doi: 10.1093/database/baad094.
4
Machine learning-based approaches for ubiquitination site prediction in human proteins.基于机器学习的人类蛋白质泛素化位点预测方法。
BMC Bioinformatics. 2023 Nov 28;24(1):449. doi: 10.1186/s12859-023-05581-w.
5
A Review of Machine Learning and Algorithmic Methods for Protein Phosphorylation Site Prediction.机器学习和算法方法在蛋白质磷酸化位点预测中的研究进展综述
Genomics Proteomics Bioinformatics. 2023 Dec;21(6):1266-1285. doi: 10.1016/j.gpb.2023.03.007. Epub 2023 Oct 19.
6
Bimodal artificial intelligence using TabNet for differentiating spinal cord tumors-Integration of patient background information and images.使用TabNet的双峰人工智能用于鉴别脊髓肿瘤——患者背景信息与图像的整合
iScience. 2023 Sep 14;26(10):107900. doi: 10.1016/j.isci.2023.107900. eCollection 2023 Oct 20.
7
Prediction and Visual Analysis of Food Safety Risk Based on TabNet-GRA.基于TabNet-GRA的食品安全风险预测与可视化分析
Foods. 2023 Aug 18;12(16):3113. doi: 10.3390/foods12163113.
8
Transformer-based deep learning for predicting protein properties in the life sciences.基于 Transformer 的深度学习在生命科学中预测蛋白质性质。
Elife. 2023 Jan 18;12:e82819. doi: 10.7554/eLife.82819.
9
Small Tweaks, Major Changes: Post-Translational Modifications That Occur within M2 Macrophages in the Tumor Microenvironment.微小调整,重大改变:肿瘤微环境中M2巨噬细胞内发生的翻译后修饰
Cancers (Basel). 2022 Nov 10;14(22):5532. doi: 10.3390/cancers14225532.
10
Identification of phosphorylation site using S-padding strategy based convolutional neural network.基于S填充策略的卷积神经网络对磷酸化位点的识别
Health Inf Sci Syst. 2022 Sep 17;10(1):29. doi: 10.1007/s13755-022-00196-6. eCollection 2022 Dec.