Suppr
超能文献

当前用于预测蛋白质赖氨酸酰化位点的计算工具。

Current computational tools for protein lysine acylation site prediction.

机构信息

Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China.

State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences (CAAS), Anyang 455000, China.

出版信息

Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae469.

DOI:10.1093/bib/bbae469

PMID:39316944

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11421846/

Abstract

As a main subtype of post-translational modification (PTM), protein lysine acylations (PLAs) play crucial roles in regulating diverse functions of proteins. With recent advancements in proteomics technology, the identification of PTM is becoming a data-rich field. A large amount of experimentally verified data is urgently required to be translated into valuable biological insights. With computational approaches, PLA can be accurately detected across the whole proteome, even for organisms with small-scale datasets. Herein, a comprehensive summary of 166 in silico PLA prediction methods is presented, including a single type of PLA site and multiple types of PLA sites. This recapitulation covers important aspects that are critical for the development of a robust predictor, including data collection and preparation, sample selection, feature representation, classification algorithm design, model evaluation, and method availability. Notably, we discuss the application of protein language models and transfer learning to solve the small-sample learning issue. We also highlight the prediction methods developed for functionally relevant PLA sites and species/substrate/cell-type-specific PLA sites. In conclusion, this systematic review could potentially facilitate the development of novel PLA predictors and offer useful insights to researchers from various disciplines.

摘要

作为翻译后修饰（PTM）的主要亚型之一，蛋白质赖氨酸酰化（PLA）在调节蛋白质的多种功能方面发挥着关键作用。随着蛋白质组学技术的最新进展，PTM 的鉴定正成为一个数据丰富的领域。迫切需要将大量经过实验验证的数据转化为有价值的生物学见解。通过计算方法，可以在整个蛋白质组中准确检测 PLA，即使对于数据集规模较小的生物体也是如此。本文全面总结了 166 种基于计算的 PLA 预测方法，包括单一类型的 PLA 位点和多种类型的 PLA 位点。这一综述涵盖了开发稳健预测器的关键方面，包括数据收集和准备、样本选择、特征表示、分类算法设计、模型评估和方法可用性。值得注意的是，我们讨论了蛋白质语言模型和迁移学习在解决小样本学习问题中的应用。我们还强调了针对功能相关 PLA 位点和物种/底物/细胞类型特异性 PLA 位点开发的预测方法。总之，本系统综述可能有助于开发新的 PLA 预测器，并为来自不同学科的研究人员提供有用的见解。

相似文献

Current computational tools for protein lysine acylation site prediction.

Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae469.

Large-scale comparative assessment of computational predictors for lysine post-translational modification sites.

Brief Bioinform. 2019 Nov 27;20(6):2267-2290. doi: 10.1093/bib/bby089.

Prediction of Protein Lysine Acylation by Integrating Primary Sequence Information with Multiple Functional Features.

J Proteome Res. 2016 Dec 2;15(12):4234-4244. doi: 10.1021/acs.jproteome.6b00240. Epub 2016 Nov 2.

A systematic identification of species-specific protein succinylation sites using joint element features information.

Int J Nanomedicine. 2017 Aug 28;12:6303-6315. doi: 10.2147/IJN.S140875. eCollection 2017.

FLAMS: Find Lysine Acylations and other Modification Sites.

Bioinformatics. 2024 Jan 2;40(1). doi: 10.1093/bioinformatics/btae005.

ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning.

Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad063.

Computational Prediction of Protein Epsilon Lysine Acetylation Sites Based on a Feature Selection Method.

Comb Chem High Throughput Screen. 2017;20(7):629-637. doi: 10.2174/1386207320666170314093216.

KbhbXG: A Machine learning architecture based on XGBoost for prediction of lysine β-Hydroxybutyrylation (Kbhb) modification sites.

Methods. 2024 Jul;227:27-34. doi: 10.1016/j.ymeth.2024.04.016. Epub 2024 Apr 27.

Glypred: Lysine Glycation Site Prediction via CCU-LightGBM-BiLSTM Framework with Multi-Head Attention Mechanism.

J Chem Inf Model. 2024 Aug 26;64(16):6699-6711. doi: 10.1021/acs.jcim.4c01034. Epub 2024 Aug 9.

A Review of Machine Learning and Algorithmic Methods for Protein Phosphorylation Site Prediction.

Genomics Proteomics Bioinformatics. 2023 Dec;21(6):1266-1285. doi: 10.1016/j.gpb.2023.03.007. Epub 2023 Oct 19.

引用本文的文献

An efficient machine-learning framework for predicting protein post-translational modification sites.

Sci Rep. 2025 Aug 25;15(1):31179. doi: 10.1038/s41598-025-13178-x.

本文引用的文献

DeepKla: An attention mechanism-based deep neural network for protein lysine lactylation site prediction.

Imeta. 2022 Mar 15;1(1):e11. doi: 10.1002/imt2.11. eCollection 2022 Mar.

TransPTM: a transformer-based model for non-histone acetylation site prediction.

Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae219.

An Integrated Analytical Approach for Screening Functional Post-Translational Modification Sites in Metabolic Enzymes.

ACS Omega. 2024 Apr 18;9(17):19003-19008. doi: 10.1021/acsomega.3c09514. eCollection 2024 Apr 30.

FuncPhos-STR: An integrated deep neural network for functional phosphosite prediction based on AlphaFold protein structure and dynamics.

Int J Biol Macromol. 2024 May;266(Pt 1):131180. doi: 10.1016/j.ijbiomac.2024.131180. Epub 2024 Mar 27.

Combining machine learning with structure-based protein design to predict and engineer post-translational modifications of proteins.

PLoS Comput Biol. 2024 Mar 14;20(3):e1011939. doi: 10.1371/journal.pcbi.1011939. eCollection 2024 Mar.

Analysis and review of techniques and tools based on machine learning and deep learning for prediction of lysine malonylation sites in protein sequences.

Database (Oxford). 2024 Jan 19;2024. doi: 10.1093/database/baad094.

MVNN-HNHC:A multi-view neural network for identification of human non-histone crotonylation sites.

Anal Biochem. 2024 Apr;687:115426. doi: 10.1016/j.ab.2023.115426. Epub 2023 Dec 22.

MSTL-Kace: Prediction of Prokaryotic Lysine Acetylation Sites Based on Multistage Transfer Learning Strategy.

ACS Omega. 2023 Oct 25;8(44):41930-41942. doi: 10.1021/acsomega.3c07086. eCollection 2023 Nov 7.

A Review of Machine Learning and Algorithmic Methods for Protein Phosphorylation Site Prediction.

Genomics Proteomics Bioinformatics. 2023 Dec;21(6):1266-1285. doi: 10.1016/j.gpb.2023.03.007. Epub 2023 Oct 19.

BioAutoMATED: An end-to-end automated machine learning tool for explanation and design of biological sequences.

Cell Syst. 2023 Jun 21;14(6):525-542.e9. doi: 10.1016/j.cels.2023.05.007.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

当前用于预测蛋白质赖氨酸酰化位点的计算工具。

Current computational tools for protein lysine acylation site prediction.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译