• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Mal-Prec:基于机器学习的特征整合的蛋白质丙二酰化位点计算预测:丙二酰化位点预测。

Mal-Prec: computational prediction of protein Malonylation sites via machine learning based feature integration : Malonylation site prediction.

机构信息

Department of Bioinformatics, School of Medical Informatics and Engineering, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.

Jiangsu Key Laboratory of New Drug Research and Clinical Pharmacy, School of Pharmacy, Xuzhou Medical University, Xuzhou, 221000, Jiangsu, China.

出版信息

BMC Genomics. 2020 Nov 23;21(1):812. doi: 10.1186/s12864-020-07166-w.

DOI:10.1186/s12864-020-07166-w
PMID:33225896
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7682087/
Abstract

BACKGROUND

Malonylation is a recently discovered post-translational modification that is associated with a variety of diseases such as Type 2 Diabetes Mellitus and different types of cancers. Compared with experimental identification of malonylation sites, computational method is a time-effective process with comparatively low costs.

RESULTS

In this study, we proposed a novel computational model called Mal-Prec (Malonylation Prediction) for malonylation site prediction through the combination of Principal Component Analysis and Support Vector Machine. One-hot encoding, physio-chemical properties, and composition of k-spaced acid pairs were initially performed to extract sequence features. PCA was then applied to select optimal feature subsets while SVM was adopted to predict malonylation sites. Five-fold cross-validation results showed that Mal-Prec can achieve better prediction performance compared with other approaches. AUC (area under the receiver operating characteristic curves) analysis achieved 96.47 and 90.72% on 5-fold cross-validation of independent data sets, respectively.

CONCLUSION

Mal-Prec is a computationally reliable method for identifying malonylation sites in protein sequences. It outperforms existing prediction tools and can serve as a useful tool for identifying and discovering novel malonylation sites in human proteins. Mal-Prec is coded in MATLAB and is publicly available at https://github.com/flyinsky6/Mal-Prec , together with the data sets used in this study.

摘要

背景

丙二酰化是一种新发现的翻译后修饰,与 2 型糖尿病和各种癌症等多种疾病有关。与丙二酰化位点的实验鉴定相比,计算方法是一种具有时间效益且成本相对较低的过程。

结果

在这项研究中,我们通过主成分分析和支持向量机的结合,提出了一种名为 Mal-Prec(丙二酰化预测)的新型计算模型,用于丙二酰化位点预测。首先进行了独热编码、理化性质和 k 间隔酸对组成,以提取序列特征。然后应用 PCA 选择最佳特征子集,同时使用 SVM 预测丙二酰化位点。五重交叉验证结果表明,与其他方法相比,Mal-Prec 可以实现更好的预测性能。在独立数据集的五重交叉验证中,AUC(接受者操作特征曲线下的面积)分析分别达到 96.47%和 90.72%。

结论

Mal-Prec 是一种用于识别蛋白质序列中丙二酰化位点的计算可靠方法。它优于现有的预测工具,可作为识别和发现人类蛋白质中新型丙二酰化位点的有用工具。Mal-Prec 是用 MATLAB 编写的,可在 https://github.com/flyinsky6/Mal-Prec 上公开获取,同时提供了本研究中使用的数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2e0/7682087/f51a225d5832/12864_2020_7166_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2e0/7682087/fea22b68ae2c/12864_2020_7166_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2e0/7682087/6a3f5dc04f75/12864_2020_7166_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2e0/7682087/c5bdc8794a3d/12864_2020_7166_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2e0/7682087/67400a82f134/12864_2020_7166_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2e0/7682087/f77d8c007b3c/12864_2020_7166_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2e0/7682087/f51a225d5832/12864_2020_7166_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2e0/7682087/fea22b68ae2c/12864_2020_7166_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2e0/7682087/6a3f5dc04f75/12864_2020_7166_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2e0/7682087/c5bdc8794a3d/12864_2020_7166_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2e0/7682087/67400a82f134/12864_2020_7166_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2e0/7682087/f77d8c007b3c/12864_2020_7166_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2e0/7682087/f51a225d5832/12864_2020_7166_Fig6_HTML.jpg

相似文献

1
Mal-Prec: computational prediction of protein Malonylation sites via machine learning based feature integration : Malonylation site prediction.Mal-Prec:基于机器学习的特征整合的蛋白质丙二酰化位点计算预测:丙二酰化位点预测。
BMC Genomics. 2020 Nov 23;21(1):812. doi: 10.1186/s12864-020-07166-w.
2
Computational analysis and prediction of lysine malonylation sites by exploiting informative features in an integrative machine-learning framework.利用综合机器学习框架中的信息特征对赖氨酸丙二酰化位点进行计算分析和预测。
Brief Bioinform. 2019 Nov 27;20(6):2185-2199. doi: 10.1093/bib/bby079.
3
Analysis and review of techniques and tools based on machine learning and deep learning for prediction of lysine malonylation sites in protein sequences.基于机器学习和深度学习的赖氨酸丙二酰化位点预测的技术和工具的分析与综述。
Database (Oxford). 2024 Jan 19;2024. doi: 10.1093/database/baad094.
4
Predicting lysine-malonylation sites of proteins using sequence and predicted structural features.基于序列和预测结构特征预测蛋白质赖氨酸丙二酰化位点。
J Comput Chem. 2018 Aug 15;39(22):1757-1763. doi: 10.1002/jcc.25353. Epub 2018 May 14.
5
Incorporating hybrid models into lysine malonylation sites prediction on mammalian and plant proteins.将混合模型纳入哺乳动物和植物蛋白质赖氨酸丙二酰化位点预测中。
Sci Rep. 2020 Jun 29;10(1):10541. doi: 10.1038/s41598-020-67384-w.
6
A hybrid feature extraction scheme for efficient malonylation site prediction.一种用于高效预测琥珀酰化位点的混合特征提取方案。
Sci Rep. 2022 Apr 6;12(1):5756. doi: 10.1038/s41598-022-08555-9.
7
Prediction of Lysine Malonylation Sites Based on Pseudo Amino Acid.基于伪氨基酸的赖氨酸丙二酰化位点预测
Comb Chem High Throughput Screen. 2017;20(7):622-628. doi: 10.2174/1386207320666170314102647.
8
Mal-Lys: prediction of lysine malonylation sites in proteins integrated sequence-based features with mRMR feature selection.Mal-Lys:一种基于序列的整合特征与 mRMR 特征选择的蛋白质赖氨酸丙二酰化位点预测方法。
Sci Rep. 2016 Dec 2;6:38318. doi: 10.1038/srep38318.
9
Mal-Light: Enhancing Lysine Malonylation Sites Prediction Problem Using Evolutionary-based Features.Mal-Light:利用基于进化的特征增强赖氨酸丙二酰化位点预测问题
IEEE Access. 2020;8:77888-77902. doi: 10.1109/access.2020.2989713. Epub 2020 Apr 22.
10
Computational Method for Identifying Malonylation Sites by Using Random Forest Algorithm.基于随机森林算法的丙二酰化修饰位点鉴定的计算方法
Comb Chem High Throughput Screen. 2020;23(4):304-312. doi: 10.2174/1386207322666181227144318.

引用本文的文献

1
Systematic qualitative proteome-wide analysis of lysine malonylation profiling in Platycodon grandiflorus.桔梗赖氨酸丙二酰化谱的全蛋白质组系统定性分析
Amino Acids. 2025 Jan 15;57(1):9. doi: 10.1007/s00726-024-03432-3.
2
PMTPred: machine-learning-based prediction of protein methyltransferases using the composition of k-spaced amino acid pairs.PMTPred:基于k间隔氨基酸对组成的蛋白质甲基转移酶的机器学习预测
Mol Divers. 2024 Aug;28(4):2301-2315. doi: 10.1007/s11030-024-10937-2. Epub 2024 Jul 21.
3
Analysis and review of techniques and tools based on machine learning and deep learning for prediction of lysine malonylation sites in protein sequences.

本文引用的文献

1
Characterization and identification of lysine glutarylation based on intrinsic interdependence between positions in the substrate sites.基于底物结合位点中位置的内在相关性对赖氨酸瓜氨酸化的表征和鉴定。
BMC Bioinformatics. 2019 Feb 4;19(Suppl 13):384. doi: 10.1186/s12859-018-2394-9.
2
Integration of A Deep Learning Classifier with A Random Forest Approach for Predicting Malonylation Sites.深度学习分类器与随机森林方法相结合,用于预测丙二酰化位点。
Genomics Proteomics Bioinformatics. 2018 Dec;16(6):451-459. doi: 10.1016/j.gpb.2018.08.004. Epub 2019 Jan 11.
3
Prediction of Protein Acetylation Sites using Kernel Naive Bayes Classifier Based on Protein Sequences Profiling.
基于机器学习和深度学习的赖氨酸丙二酰化位点预测的技术和工具的分析与综述。
Database (Oxford). 2024 Jan 19;2024. doi: 10.1093/database/baad094.
4
GBDT_KgluSite: An improved computational prediction model for lysine glutarylation sites based on feature fusion and GBDT classifier.GBDT_KgluSite:一种基于特征融合和 GBDT 分类器的赖氨酸谷氨酰化位点改进计算预测模型。
BMC Genomics. 2023 Dec 11;24(1):765. doi: 10.1186/s12864-023-09834-z.
5
A hybrid feature extraction scheme for efficient malonylation site prediction.一种用于高效预测琥珀酰化位点的混合特征提取方案。
Sci Rep. 2022 Apr 6;12(1):5756. doi: 10.1038/s41598-022-08555-9.
基于蛋白质序列特征的核朴素贝叶斯分类器预测蛋白质乙酰化位点
Bioinformation. 2018 May 31;14(5):213-218. doi: 10.6026/97320630014213. eCollection 2018.
4
Predicting lysine-malonylation sites of proteins using sequence and predicted structural features.基于序列和预测结构特征预测蛋白质赖氨酸丙二酰化位点。
J Comput Chem. 2018 Aug 15;39(22):1757-1763. doi: 10.1002/jcc.25353. Epub 2018 May 14.
5
Prediction of citrullination sites by incorporating k-spaced amino acid pairs into Chou's general pseudo amino acid composition.通过将 k 间隔氨基酸对纳入周元的通用伪氨基酸组成来预测瓜氨酸化位点。
Gene. 2018 Jul 20;664:78-83. doi: 10.1016/j.gene.2018.04.055. Epub 2018 Apr 23.
6
Systematic analysis of the lysine malonylome in common wheat.系统分析普通小麦赖氨酸丙二酰化组。
BMC Genomics. 2018 Mar 20;19(1):209. doi: 10.1186/s12864-018-4535-y.
7
Applications of Support Vector Machine (SVM) Learning in Cancer Genomics.支持向量机(SVM)学习在癌症基因组学中的应用。
Cancer Genomics Proteomics. 2018 Jan-Feb;15(1):41-51. doi: 10.21873/cgp.20063.
8
Computational prediction of species-specific malonylation sites via enhanced characteristic strategy.通过增强特征策略对物种特异性丙二酰化位点进行计算预测。
Bioinformatics. 2017 May 15;33(10):1457-1463. doi: 10.1093/bioinformatics/btw755.
9
Mal-Lys: prediction of lysine malonylation sites in proteins integrated sequence-based features with mRMR feature selection.Mal-Lys:一种基于序列的整合特征与 mRMR 特征选择的蛋白质赖氨酸丙二酰化位点预测方法。
Sci Rep. 2016 Dec 2;6:38318. doi: 10.1038/srep38318.
10
DephosSite: a machine learning approach for discovering phosphotase-specific dephosphorylation sites.DephosSite:一种用于发现磷酸酶特异性去磷酸化位点的机器学习方法。
Sci Rep. 2016 Mar 22;6:23510. doi: 10.1038/srep23510.