• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度学习分类器与随机森林方法相结合,用于预测丙二酰化位点。

Integration of A Deep Learning Classifier with A Random Forest Approach for Predicting Malonylation Sites.

机构信息

School of Basic Medicine, Qingdao University, Qingdao 266021, China.

School of Data Science and Software Engineering, Qingdao University, Qingdao 266021, China.

出版信息

Genomics Proteomics Bioinformatics. 2018 Dec;16(6):451-459. doi: 10.1016/j.gpb.2018.08.004. Epub 2019 Jan 11.

DOI:10.1016/j.gpb.2018.08.004
PMID:30639696
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6411950/
Abstract

As a newly-identified protein post-translational modification, malonylation is involved in a variety of biological functions. Recognizing malonylation sites in substrates represents an initial but crucial step in elucidating the molecular mechanisms underlying protein malonylation. In this study, we constructed a deep learning (DL) network classifier based on long short-term memory (LSTM) with word embedding (LSTM) for the prediction of mammalian malonylation sites. LSTM performs better than traditional classifiers developed with common pre-defined feature encodings or a DL classifier based on LSTM with a one-hot vector. The performance of LSTM is sensitive to the size of the training set, but this limitation can be overcome by integration with a traditional machine learning (ML) classifier. Accordingly, an integrated approach called LEMP was developed, which includes LSTM and the random forest classifier with a novel encoding of enhanced amino acid content. LEMP performs not only better than the individual classifiers but also superior to the currently-available malonylation predictors. Additionally, it demonstrates a promising performance with a low false positive rate, which is highly useful in the prediction application. Overall, LEMP is a useful tool for easily identifying malonylation sites with high confidence. LEMP is available at http://www.bioinfogo.org/lemp.

摘要

作为一种新鉴定的蛋白质翻译后修饰,丙二酰化参与多种生物学功能。鉴定底物中的丙二酰化位点是阐明蛋白质丙二酰化分子机制的初始但关键的步骤。在这项研究中,我们构建了一个基于长短期记忆(LSTM)和词嵌入(LSTM)的深度学习(DL)网络分类器,用于预测哺乳动物丙二酰化位点。LSTM 的性能优于使用常见预定义特征编码或基于 LSTM 的 DL 分类器的传统分类器。LSTM 的性能对训练集的大小敏感,但可以通过与传统机器学习(ML)分类器集成来克服这一限制。因此,开发了一种名为 LEMP 的集成方法,它包括 LSTM 和随机森林分类器,以及一种新的增强氨基酸含量编码。LEMP 的性能不仅优于单个分类器,而且优于现有的丙二酰化预测器。此外,它还具有较低的假阳性率,在预测应用中非常有用。总的来说,LEMP 是一种用于轻松识别丙二酰化位点的有用工具,具有较高的置信度。LEMP 可在 http://www.bioinfogo.org/lemp 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/6411950/66e41b58c535/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/6411950/53f20fc60a14/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/6411950/d37eb77e1a0b/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/6411950/1e602830cad7/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/6411950/2f2f7765763e/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/6411950/d99755fa9247/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/6411950/5433a4606ac2/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/6411950/66e41b58c535/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/6411950/53f20fc60a14/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/6411950/d37eb77e1a0b/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/6411950/1e602830cad7/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/6411950/2f2f7765763e/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/6411950/d99755fa9247/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/6411950/5433a4606ac2/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d005/6411950/66e41b58c535/gr7.jpg

相似文献

1
Integration of A Deep Learning Classifier with A Random Forest Approach for Predicting Malonylation Sites.深度学习分类器与随机森林方法相结合,用于预测丙二酰化位点。
Genomics Proteomics Bioinformatics. 2018 Dec;16(6):451-459. doi: 10.1016/j.gpb.2018.08.004. Epub 2019 Jan 11.
2
Analysis and review of techniques and tools based on machine learning and deep learning for prediction of lysine malonylation sites in protein sequences.基于机器学习和深度学习的赖氨酸丙二酰化位点预测的技术和工具的分析与综述。
Database (Oxford). 2024 Jan 19;2024. doi: 10.1093/database/baad094.
3
Predicting lysine-malonylation sites of proteins using sequence and predicted structural features.基于序列和预测结构特征预测蛋白质赖氨酸丙二酰化位点。
J Comput Chem. 2018 Aug 15;39(22):1757-1763. doi: 10.1002/jcc.25353. Epub 2018 May 14.
4
Computational analysis and prediction of lysine malonylation sites by exploiting informative features in an integrative machine-learning framework.利用综合机器学习框架中的信息特征对赖氨酸丙二酰化位点进行计算分析和预测。
Brief Bioinform. 2019 Nov 27;20(6):2185-2199. doi: 10.1093/bib/bby079.
5
Computational Method for Identifying Malonylation Sites by Using Random Forest Algorithm.基于随机森林算法的丙二酰化修饰位点鉴定的计算方法
Comb Chem High Throughput Screen. 2020;23(4):304-312. doi: 10.2174/1386207322666181227144318.
6
Mal-Prec: computational prediction of protein Malonylation sites via machine learning based feature integration : Malonylation site prediction.Mal-Prec:基于机器学习的特征整合的蛋白质丙二酰化位点计算预测:丙二酰化位点预测。
BMC Genomics. 2020 Nov 23;21(1):812. doi: 10.1186/s12864-020-07166-w.
7
Prediction of Lysine Malonylation Sites Based on Pseudo Amino Acid.基于伪氨基酸的赖氨酸丙二酰化位点预测
Comb Chem High Throughput Screen. 2017;20(7):622-628. doi: 10.2174/1386207320666170314102647.
8
RF-MaloSite and DL-Malosite: Methods based on random forest and deep learning to identify malonylation sites.RF-MaloSite和DL-Malosite:基于随机森林和深度学习识别丙二酰化位点的方法。
Comput Struct Biotechnol J. 2020 Mar 4;18:852-860. doi: 10.1016/j.csbj.2020.02.012. eCollection 2020.
9
BERMP: a cross-species classifier for predicting mA sites by integrating a deep learning algorithm and a random forest approach.BERMP:一种跨物种的 mA 位点预测分类器,它集成了深度学习算法和随机森林方法。
Int J Biol Sci. 2018 Sep 7;14(12):1669-1677. doi: 10.7150/ijbs.27819. eCollection 2018.
10
Mini-review: Recent advances in post-translational modification site prediction based on deep learning.小型综述:基于深度学习的翻译后修饰位点预测的最新进展
Comput Struct Biotechnol J. 2022 Jun 30;20:3522-3532. doi: 10.1016/j.csbj.2022.06.045. eCollection 2022.

引用本文的文献

1
CT Radiomics-based machine learning approach for the invasiveness of pulmonary ground-glass nodules prediction.基于CT影像组学的机器学习方法用于预测肺磨玻璃结节的侵袭性
Eur J Radiol Open. 2025 Aug 23;15:100680. doi: 10.1016/j.ejro.2025.100680. eCollection 2025 Dec.
2
CT-based radiomics integrated model for brain metastases in stage III/IV ALK-positive lung adenocarcinoma patients.基于CT的放射组学综合模型用于Ⅲ/Ⅳ期ALK阳性肺腺癌患者脑转移的研究
Front Oncol. 2025 Jun 18;15:1585930. doi: 10.3389/fonc.2025.1585930. eCollection 2025.
3
A Machine Learning Model for Predicting Intensive Care Unit Admission in Inpatients with COVID-19 Using Clinical Data and Laboratory Biomarkers.

本文引用的文献

1
DeepNitro: Prediction of Protein Nitration and Nitrosylation Sites by Deep Learning.DeepNitro:基于深度学习的蛋白质硝化和亚硝化位点预测。
Genomics Proteomics Bioinformatics. 2018 Aug;16(4):294-306. doi: 10.1016/j.gpb.2018.04.007. Epub 2018 Sep 27.
2
PTM-ssMP: A Web Server for Predicting Different Types of Post-translational Modification Sites Using Novel Site-specific Modification Profile.PTM-ssMP:一个使用新型的特异性修饰谱预测不同类型的翻译后修饰位点的网络服务器。
Int J Biol Sci. 2018 May 22;14(8):946-956. doi: 10.7150/ijbs.24121. eCollection 2018.
3
Deep Learning and Its Applications in Biomedicine.
一种利用临床数据和实验室生物标志物预测COVID-19住院患者重症监护病房入院情况的机器学习模型。
Biomedicines. 2025 Apr 24;13(5):1025. doi: 10.3390/biomedicines13051025.
4
ACP-DPE: A Dual-Channel Deep Learning Model for Anticancer Peptide Prediction.ACP-DPE:一种用于抗癌肽预测的双通道深度学习模型。
IET Syst Biol. 2025 Jan-Dec;19(1):e70010. doi: 10.1049/syb2.70010.
5
Systematic qualitative proteome-wide analysis of lysine malonylation profiling in Platycodon grandiflorus.桔梗赖氨酸丙二酰化谱的全蛋白质组系统定性分析
Amino Acids. 2025 Jan 15;57(1):9. doi: 10.1007/s00726-024-03432-3.
6
Current computational tools for protein lysine acylation site prediction.当前用于预测蛋白质赖氨酸酰化位点的计算工具。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae469.
7
Compositional features analysis by machine learning in genome represents linear adaptation of monkeypox virus.通过机器学习对基因组进行的组成特征分析代表了猴痘病毒的线性适应。
Front Genet. 2024 Mar 1;15:1361952. doi: 10.3389/fgene.2024.1361952. eCollection 2024.
8
Analysis and review of techniques and tools based on machine learning and deep learning for prediction of lysine malonylation sites in protein sequences.基于机器学习和深度学习的赖氨酸丙二酰化位点预测的技术和工具的分析与综述。
Database (Oxford). 2024 Jan 19;2024. doi: 10.1093/database/baad094.
9
Machine learning-based models for the prediction of breast cancer recurrence risk.基于机器学习的乳腺癌复发风险预测模型。
BMC Med Inform Decis Mak. 2023 Nov 29;23(1):276. doi: 10.1186/s12911-023-02377-z.
10
Urban nexus and transformative pathways towards resilient cities: A case of the Gauteng City-Region, South Africa.城市关联与迈向韧性城市的转型路径:以南非豪登省城市区域为例
Cities. 2021 Sep;116:103266. doi: 10.1016/j.cities.2021.103266. Epub 2021 May 23.
深度学习及其在生物医学中的应用。
Genomics Proteomics Bioinformatics. 2018 Feb;16(1):17-32. doi: 10.1016/j.gpb.2017.07.003. Epub 2018 Mar 6.
4
MusiteDeep: a deep-learning framework for general and kinase-specific phosphorylation site prediction.MusiteDeep:一个用于通用和激酶特异性磷酸化位点预测的深度学习框架。
Bioinformatics. 2017 Dec 15;33(24):3909-3916. doi: 10.1093/bioinformatics/btx496.
5
PLMD: An updated data resource of protein lysine modifications.PLMD:蛋白质赖氨酸修饰的更新数据资源。
J Genet Genomics. 2017 May 20;44(5):243-250. doi: 10.1016/j.jgg.2017.03.007. Epub 2017 May 3.
6
Prediction of post-translational modification sites using multiple kernel support vector machine.使用多核支持向量机预测翻译后修饰位点
PeerJ. 2017 Apr 27;5:e3261. doi: 10.7717/peerj.3261. eCollection 2017.
7
Capturing non-local interactions by long short-term memory bidirectional recurrent neural networks for improving prediction of protein secondary structure, backbone angles, contact numbers and solvent accessibility.利用长短期记忆双向递归神经网络捕捉非局部相互作用,提高蛋白质二级结构、主链角度、接触数和溶剂可及性的预测能力。
Bioinformatics. 2017 Sep 15;33(18):2842-2849. doi: 10.1093/bioinformatics/btx218.
8
Computational prediction of species-specific malonylation sites via enhanced characteristic strategy.通过增强特征策略对物种特异性丙二酰化位点进行计算预测。
Bioinformatics. 2017 May 15;33(10):1457-1463. doi: 10.1093/bioinformatics/btw755.
9
Improving protein disorder prediction by deep bidirectional long short-term memory recurrent neural networks.通过深度双向长短期记忆循环神经网络改进蛋白质无序预测。
Bioinformatics. 2017 Mar 1;33(5):685-692. doi: 10.1093/bioinformatics/btw678.
10
Mal-Lys: prediction of lysine malonylation sites in proteins integrated sequence-based features with mRMR feature selection.Mal-Lys:一种基于序列的整合特征与 mRMR 特征选择的蛋白质赖氨酸丙二酰化位点预测方法。
Sci Rep. 2016 Dec 2;6:38318. doi: 10.1038/srep38318.