• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于支持向量机的级联分类器分析和预测人类乙酰化作用。

Analysis and prediction of human acetylation using a cascade classifier based on support vector machine.

机构信息

School of Information Science and Technology, Northeast Normal University, Changchun, 130117, China.

出版信息

BMC Bioinformatics. 2019 Jun 17;20(1):346. doi: 10.1186/s12859-019-2938-7.

DOI:10.1186/s12859-019-2938-7
PMID:31208321
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6580503/
Abstract

BACKGROUND

Acetylation on lysine is a widespread post-translational modification which is reversible and plays a crucial role in some biological activities. To better understand the mechanism, it is necessary to identify acetylation sites in proteins accurately. Computational methods are popular because they are more convenient and faster than experimental methods. In this study, we proposed a new computational method to predict acetylation sites in human by combining sequence features and structural features including physicochemical property (PCP), position specific score matrix (PSSM), auto covariation (AC), residue composition (RC), secondary structure (SS) and accessible surface area (ASA), which can well characterize the information of acetylated lysine sites. Besides, a two-step feature selection was applied, which combined mRMR and IFS. It finally trained a cascade classifier based on SVM, which successfully solved the imbalance between positive samples and negative samples and covered all negative sample information.

RESULTS

The performance of this method is measured with a specificity of 72.19% and a sensibility of 76.71% on independent dataset which shows that a cascade SVM classifier outperforms single SVM classifier.

CONCLUSIONS

In addition to the analysis of experimental results, we also made a systematic and comprehensive analysis of the acetylation data.

摘要

背景

赖氨酸乙酰化是一种广泛存在的翻译后修饰,具有可逆性,在某些生物活性中起着至关重要的作用。为了更好地理解其机制,有必要准确识别蛋白质中的乙酰化位点。由于计算方法比实验方法更方便、更快,因此很受欢迎。在这项研究中,我们提出了一种新的计算方法,通过结合序列特征和结构特征,包括理化性质(PCP)、位置特异性评分矩阵(PSSM)、自协变(AC)、残基组成(RC)、二级结构(SS)和可及表面积(ASA),来预测人类蛋白质中的乙酰化位点。此外,应用了两步特征选择,结合了 mRMR 和 IFS。最后,基于 SVM 训练了级联分类器,成功解决了正负样本之间的不平衡问题,并涵盖了所有负样本信息。

结果

在独立数据集上,该方法的特异性为 72.19%,敏感性为 76.71%,这表明级联 SVM 分类器优于单个 SVM 分类器。

结论

除了对实验结果进行分析外,我们还对乙酰化数据进行了系统和全面的分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/b8cf84823dbf/12859_2019_2938_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/c7f3396f1670/12859_2019_2938_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/beef76b8ac80/12859_2019_2938_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/e88892f5c3f3/12859_2019_2938_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/668688b5be62/12859_2019_2938_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/627c4d6a5112/12859_2019_2938_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/d310a9341daf/12859_2019_2938_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/9753d9f31fa7/12859_2019_2938_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/b7713a7f1e63/12859_2019_2938_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/588923e91eae/12859_2019_2938_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/fd0bc367c3f0/12859_2019_2938_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/5b41268d917f/12859_2019_2938_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/795bd382de20/12859_2019_2938_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/b8cf84823dbf/12859_2019_2938_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/c7f3396f1670/12859_2019_2938_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/beef76b8ac80/12859_2019_2938_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/e88892f5c3f3/12859_2019_2938_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/668688b5be62/12859_2019_2938_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/627c4d6a5112/12859_2019_2938_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/d310a9341daf/12859_2019_2938_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/9753d9f31fa7/12859_2019_2938_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/b7713a7f1e63/12859_2019_2938_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/588923e91eae/12859_2019_2938_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/fd0bc367c3f0/12859_2019_2938_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/5b41268d917f/12859_2019_2938_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/795bd382de20/12859_2019_2938_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4288/6580503/b8cf84823dbf/12859_2019_2938_Fig13_HTML.jpg

相似文献

1
Analysis and prediction of human acetylation using a cascade classifier based on support vector machine.基于支持向量机的级联分类器分析和预测人类乙酰化作用。
BMC Bioinformatics. 2019 Jun 17;20(1):346. doi: 10.1186/s12859-019-2938-7.
2
Improved Species-Specific Lysine Acetylation Site Prediction Based on a Large Variety of Features Set.基于多种特征集改进物种特异性赖氨酸乙酰化位点预测
PLoS One. 2016 May 16;11(5):e0155370. doi: 10.1371/journal.pone.0155370. eCollection 2016.
3
Prediction of lysine propionylation sites using biased SVM and incorporating four different sequence features into Chou's PseAAC.利用有偏支持向量机并将四种不同序列特征纳入周氏伪氨基酸组成对赖氨酸丙酰化位点进行预测。
J Mol Graph Model. 2017 Sep;76:356-363. doi: 10.1016/j.jmgm.2017.07.022. Epub 2017 Jul 25.
4
A method to distinguish between lysine acetylation and lysine ubiquitination with feature selection and analysis.一种通过特征选择和分析来区分赖氨酸乙酰化和赖氨酸泛素化的方法。
J Biomol Struct Dyn. 2015;33(11):2479-90. doi: 10.1080/07391102.2014.1001793. Epub 2015 Jan 23.
5
Computational Prediction of Protein Epsilon Lysine Acetylation Sites Based on a Feature Selection Method.基于特征选择方法的蛋白质ε-赖氨酸乙酰化位点的计算预测
Comb Chem High Throughput Screen. 2017;20(7):629-637. doi: 10.2174/1386207320666170314093216.
6
Lysine acetylation sites prediction using an ensemble of support vector machine classifiers.赖氨酸乙酰化位点预测的支持向量机集成分类器方法。
J Theor Biol. 2010 May 7;264(1):130-5. doi: 10.1016/j.jtbi.2010.01.013. Epub 2010 Jan 18.
7
iDPGK: characterization and identification of lysine phosphoglycerylation sites based on sequence-based features.iDPGK:基于序列特征的赖氨酸磷酸甘油化位点的表征和鉴定。
BMC Bioinformatics. 2020 Dec 9;21(1):568. doi: 10.1186/s12859-020-03916-5.
8
Position-specific analysis and prediction for protein lysine acetylation based on multiple features.基于多种特征的蛋白质赖氨酸乙酰化的位置特异性分析和预测。
PLoS One. 2012;7(11):e49108. doi: 10.1371/journal.pone.0049108. Epub 2012 Nov 16.
9
Succinylation Site Prediction Based on Protein Sequences Using the IFS-LightGBM (BO) Model.基于序列信息的蛋白质琥珀酰化修饰位点预测的 IFS-LightGBM(BO)模型
Comput Math Methods Med. 2020 Nov 10;2020:8858489. doi: 10.1155/2020/8858489. eCollection 2020.
10
Bigram-PGK: phosphoglycerylation prediction using the technique of bigram probabilities of position specific scoring matrix.双元模型-PGK:基于位置特异得分矩阵双元概率技术的磷酸甘油酰化预测。
BMC Mol Cell Biol. 2019 Dec 20;20(Suppl 2):57. doi: 10.1186/s12860-019-0240-1.

引用本文的文献

1
Current computational tools for protein lysine acylation site prediction.当前用于预测蛋白质赖氨酸酰化位点的计算工具。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae469.
2
RMTLysPTM: recognizing multiple types of lysine PTM sites by deep analysis on sequences.RMTLysPTM:通过对序列进行深度分析来识别多种类型的赖氨酸翻译后修饰位点
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad450.
3
Mini-review: Recent advances in post-translational modification site prediction based on deep learning.小型综述:基于深度学习的翻译后修饰位点预测的最新进展

本文引用的文献

1
iRNA(m6A)-PseDNC: Identifying N-methyladenosine sites using pseudo dinucleotide composition.基于伪二核苷酸组成的 m6A-psiDNC:识别 N6-甲基腺苷位点。
Anal Biochem. 2018 Nov 15;561-562:59-65. doi: 10.1016/j.ab.2018.09.002. Epub 2018 Sep 8.
2
iMethyl-STTNC: Identification of N-methyladenosine sites by extending the idea of SAAC into Chou's PseAAC to formulate RNA sequences.iMethyl-STTNC:通过将 SAAC 的思想扩展到 Chou 的 PseAAC 中,以制定 RNA 序列来识别 N6-甲基腺苷位点。
J Theor Biol. 2018 Oct 14;455:205-211. doi: 10.1016/j.jtbi.2018.07.018. Epub 2018 Jul 19.
3
iRNA-3typeA: Identifying Three Types of Modification at RNA's Adenosine Sites.
Comput Struct Biotechnol J. 2022 Jun 30;20:3522-3532. doi: 10.1016/j.csbj.2022.06.045. eCollection 2022.
4
STALLION: a stacking-based ensemble learning framework for prokaryotic lysine acetylation site prediction.STALLION:一种基于堆叠的集成学习框架,用于预测细菌赖氨酸乙酰化位点。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab376.
5
A Transfer Learning-Based Approach for Lysine Propionylation Prediction.一种基于迁移学习的赖氨酸丙酰化预测方法。
Front Physiol. 2021 Apr 21;12:658633. doi: 10.3389/fphys.2021.658633. eCollection 2021.
6
PredNTS: Improved and Robust Prediction of Nitrotyrosine Sites by Integrating Multiple Sequence Features.PredNTS:通过整合多种序列特征提高和增强对硝化酪氨酸位点的预测。
Int J Mol Sci. 2021 Mar 8;22(5):2704. doi: 10.3390/ijms22052704.
7
PUP-Fuse: Prediction of Protein Pupylation Sites by Integrating Multiple Sequence Representations.PUP-Fuse:通过整合多种序列表示来预测蛋白泛素化位点。
Int J Mol Sci. 2021 Feb 20;22(4):2120. doi: 10.3390/ijms22042120.
iRNA-3型A:鉴定RNA腺苷位点的三种修饰类型。
Mol Ther Nucleic Acids. 2018 Jun 1;11:468-474. doi: 10.1016/j.omtn.2018.03.012. Epub 2018 Mar 30.
4
iDNA6mA-PseKNC: Identifying DNA N-methyladenosine sites by incorporating nucleotide physicochemical properties into PseKNC.iDNA6mA-PseKNC:通过将核苷酸理化性质纳入 PseKNC 来鉴定 DNA N6-甲基腺苷位点。
Genomics. 2019 Jan;111(1):96-102. doi: 10.1016/j.ygeno.2018.01.005. Epub 2018 Jan 31.
5
iRNA-2methyl: Identify RNA 2'-O-methylation Sites by Incorporating Sequence-Coupled Effects into General PseKNC and Ensemble Classifier.iRNA-2甲基化:通过将序列耦合效应纳入通用伪核苷酸组成和集成分类器来识别RNA 2'-O-甲基化位点。
Med Chem. 2017;13(8):734-743. doi: 10.2174/1573406413666170623082245.
6
PLMD: An updated data resource of protein lysine modifications.PLMD:蛋白质赖氨酸修饰的更新数据资源。
J Genet Genomics. 2017 May 20;44(5):243-250. doi: 10.1016/j.jgg.2017.03.007. Epub 2017 May 3.
7
iRNAm5C-PseDNC: identifying RNA 5-methylcytosine sites by incorporating physical-chemical properties into pseudo dinucleotide composition.iRNAm5C-PseDNC:通过将理化性质融入伪二核苷酸组成来识别RNA 5-甲基胞嘧啶位点
Oncotarget. 2017 Jun 20;8(25):41178-41188. doi: 10.18632/oncotarget.17104.
8
GPS-PAIL: prediction of lysine acetyltransferase-specific modification sites from protein sequences.GPS-PAIL:从蛋白质序列预测赖氨酸乙酰转移酶特异性修饰位点。
Sci Rep. 2016 Dec 22;6:39787. doi: 10.1038/srep39787.
9
iPTM-mLys: identifying multiple lysine PTM sites and their different types.iPTM-mLys:鉴定多个赖氨酸 PTM 位点及其不同类型。
Bioinformatics. 2016 Oct 15;32(20):3116-3123. doi: 10.1093/bioinformatics/btw380. Epub 2016 Jun 22.
10
Improved Species-Specific Lysine Acetylation Site Prediction Based on a Large Variety of Features Set.基于多种特征集改进物种特异性赖氨酸乙酰化位点预测
PLoS One. 2016 May 16;11(5):e0155370. doi: 10.1371/journal.pone.0155370. eCollection 2016.