• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过分析实例硬度和特征重要性来计算识别多个赖氨酸 PTM 位点。

Computational identification of multiple lysine PTM sites by analyzing the instance hardness and feature importance.

机构信息

Computer Science and Engineering, Rajshahi University of Engineering and Technology, Rajshahi, 6204, Bangladesh.

Computer Science and Engineering, University of Rajshahi, Rajshahi, 6205, Bangladesh.

出版信息

Sci Rep. 2021 Sep 23;11(1):18882. doi: 10.1038/s41598-021-98458-y.

DOI:10.1038/s41598-021-98458-y
PMID:34556767
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8460736/
Abstract

Identification of post-translational modifications (PTM) is significant in the study of computational proteomics, cell biology, pathogenesis, and drug development due to its role in many bio-molecular mechanisms. Though there are several computational tools to identify individual PTMs, only three predictors have been established to predict multiple PTMs at the same lysine residue. Furthermore, detailed analysis and assessment on dataset balancing and the significance of different feature encoding techniques for a suitable multi-PTM prediction model are still lacking. This study introduces a computational method named 'iMul-kSite' for predicting acetylation, crotonylation, methylation, succinylation, and glutarylation, from an unrecognized peptide sample with one, multiple, or no modifications. After successfully eliminating the redundant data samples from the majority class by analyzing the hardness of the sequence-coupling information, feature representation has been optimized by adopting the combination of ANOVA F-Test and incremental feature selection approach. The proposed predictor predicts multi-label PTM sites with 92.83% accuracy using the top 100 features. It has also achieved a 93.36% aiming rate and 96.23% coverage rate, which are much better than the existing state-of-the-art predictors on the validation test. This performance indicates that 'iMul-kSite' can be used as a supportive tool for further K-PTM study. For the convenience of the experimental scientists, 'iMul-kSite' has been deployed as a user-friendly web-server at http://103.99.176.239/iMul-kSite .

摘要

鉴定翻译后修饰(PTM)在计算蛋白质组学、细胞生物学、发病机制和药物开发的研究中具有重要意义,因为它在许多生物分子机制中发挥作用。尽管有几种计算工具可以识别单个 PTM,但仅建立了三个预测器来预测同一赖氨酸残基上的多个 PTM。此外,对于合适的多 PTM 预测模型,在数据集平衡和不同特征编码技术的重要性方面,仍然缺乏详细的分析和评估。本研究介绍了一种名为'iMul-kSite'的计算方法,用于从一个未识别的肽样本中预测乙酰化、巴豆酰化、甲基化、琥珀酰化和戊二酰化,该样本具有一个、多个或没有修饰。通过分析序列耦合信息的硬度,成功地从多数类中消除了冗余数据样本后,采用方差分析 F 检验和增量特征选择方法的组合优化了特征表示。该预测器使用前 100 个特征以 92.83%的准确率预测多标签 PTM 位点。它还实现了 93.36%的靶向率和 96.23%的覆盖率,这明显优于验证测试中现有的最先进的预测器。这种性能表明'iMul-kSite'可以作为进一步 K-PTM 研究的辅助工具。为了方便实验科学家,'iMul-kSite'已作为一个用户友好的网络服务器部署在 http://103.99.176.239/iMul-kSite 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7fb/8460736/1e26f71d4072/41598_2021_98458_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7fb/8460736/ae8ee07792d9/41598_2021_98458_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7fb/8460736/e7c619d4b453/41598_2021_98458_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7fb/8460736/ac06ef6fdc1b/41598_2021_98458_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7fb/8460736/1e26f71d4072/41598_2021_98458_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7fb/8460736/ae8ee07792d9/41598_2021_98458_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7fb/8460736/e7c619d4b453/41598_2021_98458_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7fb/8460736/ac06ef6fdc1b/41598_2021_98458_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7fb/8460736/1e26f71d4072/41598_2021_98458_Fig4_HTML.jpg

相似文献

1
Computational identification of multiple lysine PTM sites by analyzing the instance hardness and feature importance.通过分析实例硬度和特征重要性来计算识别多个赖氨酸 PTM 位点。
Sci Rep. 2021 Sep 23;11(1):18882. doi: 10.1038/s41598-021-98458-y.
2
predML-Site: Predicting Multiple Lysine PTM Sites With Optimal Feature Representation and Data Imbalance Minimization.predML-Site:通过优化特征表示和最小化数据不平衡来预测多个赖氨酸翻译后修饰位点
IEEE/ACM Trans Comput Biol Bioinform. 2022 Nov-Dec;19(6):3624-3634. doi: 10.1109/TCBB.2021.3114349. Epub 2022 Dec 8.
3
MLysPRED: graph-based multi-view clustering and multi-dimensional normal distribution resampling techniques to predict multiple lysine sites.MLysPRED:基于图的多视图聚类和多维正态分布重采样技术,用于预测多个赖氨酸位点。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac277.
4
Large-scale comparative assessment of computational predictors for lysine post-translational modification sites.大规模比较评估赖氨酸翻译后修饰位点的计算预测因子。
Brief Bioinform. 2019 Nov 27;20(6):2267-2290. doi: 10.1093/bib/bby089.
5
iPTM-mLys: identifying multiple lysine PTM sites and their different types.iPTM-mLys:鉴定多个赖氨酸 PTM 位点及其不同类型。
Bioinformatics. 2016 Oct 15;32(20):3116-3123. doi: 10.1093/bioinformatics/btw380. Epub 2016 Jun 22.
6
RMTLysPTM: recognizing multiple types of lysine PTM sites by deep analysis on sequences.RMTLysPTM:通过对序列进行深度分析来识别多种类型的赖氨酸翻译后修饰位点
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad450.
7
PTM-ssMP: A Web Server for Predicting Different Types of Post-translational Modification Sites Using Novel Site-specific Modification Profile.PTM-ssMP:一个使用新型的特异性修饰谱预测不同类型的翻译后修饰位点的网络服务器。
Int J Biol Sci. 2018 May 22;14(8):946-956. doi: 10.7150/ijbs.24121. eCollection 2018.
8
Prediction and functional analysis of prokaryote lysine acetylation site by incorporating six types of features into Chou's general PseAAC.通过将六种特征纳入到 Chou 的通用 PseAAC 中,预测和功能分析原核赖氨酸乙酰化位点。
J Theor Biol. 2019 Jan 14;461:92-101. doi: 10.1016/j.jtbi.2018.10.047. Epub 2018 Oct 23.
9
PSSM-Suc: Accurately predicting succinylation using position specific scoring matrix into bigram for feature extraction.PSSM-Suc:利用位置特异性评分矩阵将双字母组用于特征提取,准确预测琥珀酰化。
J Theor Biol. 2017 Jul 21;425:97-102. doi: 10.1016/j.jtbi.2017.05.005. Epub 2017 May 5.
10
pSuc-EDBAM: Predicting lysine succinylation sites in proteins based on ensemble dense blocks and an attention module.pSuc-EDBAM:基于集成密集块和注意力模块预测蛋白质中的赖氨酸琥珀酰化位点。
BMC Bioinformatics. 2022 Oct 31;23(1):450. doi: 10.1186/s12859-022-05001-5.

引用本文的文献

1
MlyPredCSED: based on extreme point deviation compensated clustering combined with cross-scale convolutional neural networks to predict multiple lysine sites in human.MlyPredCSED:基于极点偏差补偿聚类与跨尺度卷积神经网络相结合来预测人类中的多个赖氨酸位点。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf189.
2
PreMLS: The undersampling technique based on ClusterCentroids to predict multiple lysine sites.基于 ClusterCentroids 的欠采样技术预测多个赖氨酸位点。
PLoS Comput Biol. 2024 Oct 22;20(10):e1012544. doi: 10.1371/journal.pcbi.1012544. eCollection 2024 Oct.
3
Current computational tools for protein lysine acylation site prediction.

本文引用的文献

1
predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance.通过将概率序列耦合信息纳入 PseAAC 并解决数据不平衡问题来预测磷酸化糖基化位点。
PLoS One. 2021 Apr 1;16(4):e0249396. doi: 10.1371/journal.pone.0249396. eCollection 2021.
2
Computational Identification of Lysine Glutarylation Sites Using Positive-Unlabeled Learning.利用正无标记学习法对赖氨酸戊二酰化位点进行计算识别
Curr Genomics. 2020 Apr;21(3):204-211. doi: 10.2174/1389202921666200511072327.
3
Prediction of lysine glutarylation sites by maximum relevance minimum redundancy feature selection.
当前用于预测蛋白质赖氨酸酰化位点的计算工具。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae469.
4
Impact of Lysine Succinylation on the Biology of Fungi.赖氨酸琥珀酰化对真菌生物学的影响
Curr Issues Mol Biol. 2024 Jan 23;46(2):1020-1046. doi: 10.3390/cimb46020065.
5
RMTLysPTM: recognizing multiple types of lysine PTM sites by deep analysis on sequences.RMTLysPTM:通过对序列进行深度分析来识别多种类型的赖氨酸翻译后修饰位点
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad450.
6
Protein post-translational modification by lysine succinylation: Biochemistry, biological implications, and therapeutic opportunities.赖氨酸琥珀酰化介导的蛋白质翻译后修饰:生物化学、生物学意义及治疗潜力
Genes Dis. 2022 Apr 7;10(4):1242-1262. doi: 10.1016/j.gendis.2022.03.009. eCollection 2023 Jul.
基于最大相关最小冗余特征选择的赖氨酸戊二酰化位点预测
Anal Biochem. 2018 Jun 1;550:1-7. doi: 10.1016/j.ab.2018.04.005. Epub 2018 Apr 8.
4
Prediction of protein N-formylation using the composition of k-spaced amino acid pairs.利用k间隔氨基酸对的组成预测蛋白质N-甲酰化
Anal Biochem. 2017 Oct 1;534:40-45. doi: 10.1016/j.ab.2017.07.011. Epub 2017 Jul 11.
5
iMulti-HumPhos: a multi-label classifier for identifying human phosphorylated proteins using multiple kernel learning based support vector machines.iMulti-HumPhos:一种基于多核学习支持向量机的用于识别人类磷酸化蛋白质的多标签分类器。
Mol Biosyst. 2017 Jul 25;13(8):1608-1618. doi: 10.1039/c7mb00180k.
6
Protein subcellular localization prediction using multiple kernel learning based support vector machine.基于多核学习支持向量机的蛋白质亚细胞定位预测
Mol Biosyst. 2017 Mar 28;13(4):785-795. doi: 10.1039/c6mb00860g.
7
iPTM-mLys: identifying multiple lysine PTM sites and their different types.iPTM-mLys:鉴定多个赖氨酸 PTM 位点及其不同类型。
Bioinformatics. 2016 Oct 15;32(20):3116-3123. doi: 10.1093/bioinformatics/btw380. Epub 2016 Jun 22.
8
iSuc-PseOpt: Identifying lysine succinylation sites in proteins by incorporating sequence-coupling effects into pseudo components and optimizing imbalanced training dataset.iSuc-PseOpt:通过将序列耦合效应纳入伪组件并优化不平衡训练数据集来识别蛋白质中的赖氨酸琥珀酰化位点。
Anal Biochem. 2016 Mar 15;497:48-56. doi: 10.1016/j.ab.2015.12.009. Epub 2015 Dec 23.
9
Recent Progress in Predicting Posttranslational Modification Sites in Proteins.蛋白质翻译后修饰位点预测的最新进展
Curr Top Med Chem. 2016;16(6):591-603. doi: 10.2174/1568026615666150819110421.
10
Impacts of bioinformatics to medicinal chemistry.生物信息学对药物化学的影响。
Med Chem. 2015;11(3):218-34. doi: 10.2174/1573406411666141229162834.