• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Computational analysis and prediction of lysine malonylation sites by exploiting informative features in an integrative machine-learning framework.利用综合机器学习框架中的信息特征对赖氨酸丙二酰化位点进行计算分析和预测。
Brief Bioinform. 2019 Nov 27;20(6):2185-2199. doi: 10.1093/bib/bby079.
2
Computational prediction of species-specific malonylation sites via enhanced characteristic strategy.通过增强特征策略对物种特异性丙二酰化位点进行计算预测。
Bioinformatics. 2017 May 15;33(10):1457-1463. doi: 10.1093/bioinformatics/btw755.
3
Mal-Prec: computational prediction of protein Malonylation sites via machine learning based feature integration : Malonylation site prediction.Mal-Prec:基于机器学习的特征整合的蛋白质丙二酰化位点计算预测:丙二酰化位点预测。
BMC Genomics. 2020 Nov 23;21(1):812. doi: 10.1186/s12864-020-07166-w.
4
Predicting lysine-malonylation sites of proteins using sequence and predicted structural features.基于序列和预测结构特征预测蛋白质赖氨酸丙二酰化位点。
J Comput Chem. 2018 Aug 15;39(22):1757-1763. doi: 10.1002/jcc.25353. Epub 2018 May 14.
5
Mal-Light: Enhancing Lysine Malonylation Sites Prediction Problem Using Evolutionary-based Features.Mal-Light:利用基于进化的特征增强赖氨酸丙二酰化位点预测问题
IEEE Access. 2020;8:77888-77902. doi: 10.1109/access.2020.2989713. Epub 2020 Apr 22.
6
Large-scale comparative assessment of computational predictors for lysine post-translational modification sites.大规模比较评估赖氨酸翻译后修饰位点的计算预测因子。
Brief Bioinform. 2019 Nov 27;20(6):2267-2290. doi: 10.1093/bib/bby089.
7
Analysis and review of techniques and tools based on machine learning and deep learning for prediction of lysine malonylation sites in protein sequences.基于机器学习和深度学习的赖氨酸丙二酰化位点预测的技术和工具的分析与综述。
Database (Oxford). 2024 Jan 19;2024. doi: 10.1093/database/baad094.
8
Global Profiling of Protein Lysine Malonylation in Escherichia coli Reveals Its Role in Energy Metabolism.大肠杆菌中蛋白质赖氨酸丙二酰化的全局分析揭示了其在能量代谢中的作用。
J Proteome Res. 2016 Jun 3;15(6):2060-71. doi: 10.1021/acs.jproteome.6b00264. Epub 2016 May 23.
9
PeNGaRoo, a combined gradient boosting and ensemble learning framework for predicting non-classical secreted proteins.PeNGaRoo,一种组合梯度提升和集成学习框架,用于预测非经典分泌蛋白。
Bioinformatics. 2020 Feb 1;36(3):704-712. doi: 10.1093/bioinformatics/btz629.
10
Systematic analysis of the lysine malonylome in common wheat.系统分析普通小麦赖氨酸丙二酰化组。
BMC Genomics. 2018 Mar 20;19(1):209. doi: 10.1186/s12864-018-4535-y.

引用本文的文献

1
Systematic qualitative proteome-wide analysis of lysine malonylation profiling in Platycodon grandiflorus.桔梗赖氨酸丙二酰化谱的全蛋白质组系统定性分析
Amino Acids. 2025 Jan 15;57(1):9. doi: 10.1007/s00726-024-03432-3.
2
Current computational tools for protein lysine acylation site prediction.当前用于预测蛋白质赖氨酸酰化位点的计算工具。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae469.
3
Analysis and review of techniques and tools based on machine learning and deep learning for prediction of lysine malonylation sites in protein sequences.基于机器学习和深度学习的赖氨酸丙二酰化位点预测的技术和工具的分析与综述。
Database (Oxford). 2024 Jan 19;2024. doi: 10.1093/database/baad094.
4
Advancing the accuracy of SARS-CoV-2 phosphorylation site detection via meta-learning approach.通过元学习方法提高 SARS-CoV-2 磷酸化位点检测的准确性。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad433.
5
A Review of Machine Learning and Algorithmic Methods for Protein Phosphorylation Site Prediction.机器学习和算法方法在蛋白质磷酸化位点预测中的研究进展综述
Genomics Proteomics Bioinformatics. 2023 Dec;21(6):1266-1285. doi: 10.1016/j.gpb.2023.03.007. Epub 2023 Oct 19.
6
Drug-target interaction prediction based on protein features, using wrapper feature selection.基于蛋白质特征的药物-靶标相互作用预测,使用包装器特征选择。
Sci Rep. 2023 Mar 3;13(1):3594. doi: 10.1038/s41598-023-30026-y.
7
Beyond metabolic waste: lysine lactylation and its potential roles in cancer progression and cell fate determination.超越代谢废物:赖氨酸酰化及其在癌症进展和细胞命运决定中的潜在作用。
Cell Oncol (Dordr). 2023 Jun;46(3):465-480. doi: 10.1007/s13402-023-00775-z. Epub 2023 Jan 19.
8
BERT-Kgly: A Bidirectional Encoder Representations From Transformers (BERT)-Based Model for Predicting Lysine Glycation Site for .BERT-Kgly:一种基于双向编码器表征变换器(BERT)的赖氨酸糖基化位点预测模型
Front Bioinform. 2022 Feb 18;2:834153. doi: 10.3389/fbinf.2022.834153. eCollection 2022.
9
Deep Neural Network Framework Based on Word Embedding for Protein Glutarylation Sites Prediction.基于词嵌入的深度神经网络框架用于蛋白质戊二酰化位点预测
Life (Basel). 2022 Aug 10;12(8):1213. doi: 10.3390/life12081213.
10
Development of Machine-Learning Model to Predict COVID-19 Mortality: Application of Ensemble Model and Regarding Feature Impacts.用于预测新冠肺炎死亡率的机器学习模型的开发:集成模型的应用及特征影响分析
Diagnostics (Basel). 2022 Jun 14;12(6):1464. doi: 10.3390/diagnostics12061464.

本文引用的文献

1
Predicting lysine-malonylation sites of proteins using sequence and predicted structural features.基于序列和预测结构特征预测蛋白质赖氨酸丙二酰化位点。
J Comput Chem. 2018 Aug 15;39(22):1757-1763. doi: 10.1002/jcc.25353. Epub 2018 May 14.
2
Feature selection with interactions in logistic regression models using multivariate synergies for a GWAS application.使用多元协同作用在逻辑回归模型中进行具有交互作用的特征选择,用于 GWAS 应用。
BMC Genomics. 2018 Mar 21;19(Suppl 4):170. doi: 10.1186/s12864-018-4552-x.
3
Bastion6: a bioinformatics approach for accurate prediction of type VI secreted effectors.Bastion6:一种用于准确预测 VI 型分泌效应器的生物信息学方法。
Bioinformatics. 2018 Aug 1;34(15):2546-2555. doi: 10.1093/bioinformatics/bty155.
4
Features and regulation of non-enzymatic post-translational modifications.非酶促翻译后修饰的特点和调控。
Nat Chem Biol. 2018 Feb 14;14(3):244-252. doi: 10.1038/nchembio.2575.
5
PREvaIL, an integrative approach for inferring catalytic residues using sequence, structural, and network features in a machine-learning framework.PREvaIL,一种基于机器学习框架,使用序列、结构和网络特征推断催化残基的综合方法。
J Theor Biol. 2018 Apr 14;443:125-137. doi: 10.1016/j.jtbi.2018.01.023. Epub 2018 Feb 1.
6
Systematic analysis and prediction of type IV secreted effector proteins by machine learning approaches.基于机器学习方法的 IV 型分泌效应蛋白的系统分析和预测。
Brief Bioinform. 2019 May 21;20(3):931-951. doi: 10.1093/bib/bbx164.
7
PaRSnIP: sequence-based protein solubility prediction using gradient boosting machine.PaRSnIP:基于梯度提升机的序列基蛋白质溶解性预测。
Bioinformatics. 2018 Apr 1;34(7):1092-1098. doi: 10.1093/bioinformatics/btx662.
8
PROSPERous: high-throughput prediction of substrate cleavage sites for 90 proteases with improved accuracy.PROSPERous:提高准确性的 90 种蛋白酶底物切割位点的高通量预测。
Bioinformatics. 2018 Feb 15;34(4):684-687. doi: 10.1093/bioinformatics/btx670.
9
POSSUM: a bioinformatics toolkit for generating numerical sequence feature descriptors based on PSSM profiles.POSSUM:一种基于位置特异性得分矩阵(PSSM)谱生成数字序列特征描述符的生物信息学工具包。
Bioinformatics. 2017 Sep 1;33(17):2756-2758. doi: 10.1093/bioinformatics/btx302.
10
Highly accurate prediction of protein self-interactions by incorporating the average block and PSSM information into the general PseAAC.通过将平均块和位置特异性得分矩阵(PSSM)信息纳入通用伪氨基酸组成(PseAAC)来实现对蛋白质自相互作用的高度准确预测。
J Theor Biol. 2017 Nov 7;432:80-86. doi: 10.1016/j.jtbi.2017.08.009. Epub 2017 Aug 9.

利用综合机器学习框架中的信息特征对赖氨酸丙二酰化位点进行计算分析和预测。

Computational analysis and prediction of lysine malonylation sites by exploiting informative features in an integrative machine-learning framework.

机构信息

School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin 541004, China.

Infection and Immunity Program, Biomedicine Discovery Institute and Department of Microbiology, Monash University, VIC 3800, Australia.

出版信息

Brief Bioinform. 2019 Nov 27;20(6):2185-2199. doi: 10.1093/bib/bby079.

DOI:10.1093/bib/bby079
PMID:30351377
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6954445/
Abstract

As a newly discovered post-translational modification (PTM), lysine malonylation (Kmal) regulates a myriad of cellular processes from prokaryotes to eukaryotes and has important implications in human diseases. Despite its functional significance, computational methods to accurately identify malonylation sites are still lacking and urgently needed. In particular, there is currently no comprehensive analysis and assessment of different features and machine learning (ML) methods that are required for constructing the necessary prediction models. Here, we review, analyze and compare 11 different feature encoding methods, with the goal of extracting key patterns and characteristics from residue sequences of Kmal sites. We identify optimized feature sets, with which four commonly used ML methods (random forest, support vector machines, K-nearest neighbor and logistic regression) and one recently proposed [Light Gradient Boosting Machine (LightGBM)] are trained on data from three species, namely, Escherichia coli, Mus musculus and Homo sapiens, and compared using randomized 10-fold cross-validation tests. We show that integration of the single method-based models through ensemble learning further improves the prediction performance and model robustness on the independent test. When compared to the existing state-of-the-art predictor, MaloPred, the optimal ensemble models were more accurate for all three species (AUC: 0.930, 0.923 and 0.944 for E. coli, M. musculus and H. sapiens, respectively). Using the ensemble models, we developed an accessible online predictor, kmal-sp, available at http://kmalsp.erc.monash.edu/. We hope that this comprehensive survey and the proposed strategy for building more accurate models can serve as a useful guide for inspiring future developments of computational methods for PTM site prediction, expedite the discovery of new malonylation and other PTM types and facilitate hypothesis-driven experimental validation of novel malonylated substrates and malonylation sites.

摘要

赖氨酸丙二酰化(Kmal)作为一种新发现的翻译后修饰(PTM),调节着从原核生物到真核生物的无数细胞过程,在人类疾病中具有重要意义。尽管其功能意义重大,但准确识别丙二酰化位点的计算方法仍然缺乏,且迫切需要。特别是,目前还没有全面分析和评估构建必要预测模型所需的不同特征和机器学习(ML)方法。在这里,我们回顾、分析和比较了 11 种不同的特征编码方法,旨在从 Kmal 位点的残基序列中提取关键模式和特征。我们确定了优化的特征集,并用其训练来自三种生物的四种常用 ML 方法(随机森林、支持向量机、K 最近邻和逻辑回归)和一种新提出的[Light Gradient Boosting Machine (LightGBM)],并使用随机 10 折交叉验证测试进行比较。我们表明,通过集成学习将单一方法模型集成可以进一步提高独立测试的预测性能和模型稳健性。与现有的最先进的预测器 MaloPred 相比,最优集成模型在所有三种生物(E. coli、M. musculus 和 H. sapiens 的 AUC:0.930、0.923 和 0.944)上的预测性能和模型稳健性都更高。我们使用集成模型开发了一个易于访问的在线预测器,kmal-sp,可在 http://kmalsp.erc.monash.edu/ 获得。我们希望,这项全面的调查和构建更准确模型的建议策略可以为启发未来的 PTM 位点预测计算方法的发展提供有用的指导,加速新的丙二酰化和其他 PTM 类型的发现,并促进针对新型丙二酰化底物和丙二酰化位点的假设驱动的实验验证。