• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于位置的统计特征和多分类识别 4-羧基谷氨酸残基位点

Identification of 4-carboxyglutamate residue sites based on position based statistical feature and multiple classification.

机构信息

Department of Computer Sciences, Bahria University Lahore Campus, Lahore, 25000, Pakistan.

University of Management and Technology, Lahore, 25000, Pakistan.

出版信息

Sci Rep. 2020 Oct 9;10(1):16913. doi: 10.1038/s41598-020-73107-y.

DOI:10.1038/s41598-020-73107-y
PMID:33037248
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7547663/
Abstract

Glutamic acid is an alpha-amino acid used by all living beings in protein biosynthesis. One of the important glutamic acid modifications is post-translationally modified 4-carboxyglutamate. It has a significant role in blood coagulation. 4-carboxyglumates are required for the binding of calcium ions. On the contrary, this modification can also cause different diseases such as bone resorption, osteoporosis, papilloma, and plaque atherosclerosis. Considering its importance, it is necessary to predict the occurrence of glutamic acid carboxylation in amino acid stretches. As there is no computational based prediction model available to identify 4-carboxyglutamate modification, this study is, therefore, designed to predict 4-carboxyglutamate sites with a less computational cost. A machine learning model is devised with a Multilayered Perceptron (MLP) classifier using Chou's 5-step rule. It may help in learning statistical moments and based on this learning, the prediction is to be made accurately either it is 4-carboxyglutamate residue site or detected residue site having no 4-carboxyglutamate. Prediction accuracy of the proposed model is 94% using an independent set test, while obtained prediction accuracy is 99% by self-consistency tests.

摘要

谷氨酸是一种在蛋白质生物合成中被所有生物使用的α-氨基酸。谷氨酸的一个重要修饰是翻译后修饰的 4-羧基谷氨酸。它在血液凝固中起着重要作用。4-羧基谷氨酸是结合钙离子所必需的。相反,这种修饰也会导致不同的疾病,如骨吸收、骨质疏松症、乳头瘤和斑块动脉粥样硬化。考虑到它的重要性,有必要预测氨基酸序列中谷氨酸的羧化发生情况。由于目前还没有基于计算的预测模型来识别 4-羧基谷氨酸的修饰,因此本研究旨在设计一种计算成本较低的预测 4-羧基谷氨酸位点的方法。该研究使用多层感知器(MLP)分类器设计了一个机器学习模型,该模型使用了 Chou 的 5 步规则。它可以帮助学习统计矩,并在此基础上进行准确的预测,无论是 4-羧基谷氨酸残基位点还是检测到的没有 4-羧基谷氨酸的残基位点。使用独立数据集测试,所提出的模型的预测准确性为 94%,而通过自一致性测试获得的预测准确性为 99%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/14be6298bf0a/41598_2020_73107_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/a1ac0df5056b/41598_2020_73107_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/39afe69e57c5/41598_2020_73107_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/44b180c317f0/41598_2020_73107_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/fd0c45224c62/41598_2020_73107_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/ce0d62d3b45e/41598_2020_73107_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/8d00c50e8695/41598_2020_73107_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/2840e8731239/41598_2020_73107_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/0b52ac937da5/41598_2020_73107_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/14f1ed41edab/41598_2020_73107_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/58b66a83aeaa/41598_2020_73107_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/540972470050/41598_2020_73107_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/14be6298bf0a/41598_2020_73107_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/a1ac0df5056b/41598_2020_73107_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/39afe69e57c5/41598_2020_73107_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/44b180c317f0/41598_2020_73107_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/fd0c45224c62/41598_2020_73107_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/ce0d62d3b45e/41598_2020_73107_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/8d00c50e8695/41598_2020_73107_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/2840e8731239/41598_2020_73107_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/0b52ac937da5/41598_2020_73107_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/14f1ed41edab/41598_2020_73107_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/58b66a83aeaa/41598_2020_73107_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/540972470050/41598_2020_73107_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d280/7547663/14be6298bf0a/41598_2020_73107_Fig12_HTML.jpg

相似文献

1
Identification of 4-carboxyglutamate residue sites based on position based statistical feature and multiple classification.基于位置的统计特征和多分类识别 4-羧基谷氨酸残基位点
Sci Rep. 2020 Oct 9;10(1):16913. doi: 10.1038/s41598-020-73107-y.
2
Computational identification of 4-carboxyglutamate sites to supplement physiological studies using deep learning.利用深度学习计算鉴定 4-羧基谷氨酸位点,以补充生理研究。
Sci Rep. 2022 Jan 7;12(1):128. doi: 10.1038/s41598-021-03895-4.
3
iSulfoTyr-PseAAC: Identify Tyrosine Sulfation Sites by Incorporating Statistical Moments Chou's 5-steps Rule and Pseudo Components.iSulfoTyr-PseAAC:通过结合统计矩、周氏五步法则和伪组分来识别酪氨酸硫酸化位点
Curr Genomics. 2019 May;20(4):306-320. doi: 10.2174/1389202920666190819091609.
4
Direct identification of the calcium-binding amino acid, gamma-carboxyglutamate, in mineralized tissue.在矿化组织中直接鉴定钙结合氨基酸γ-羧基谷氨酸。
Proc Natl Acad Sci U S A. 1975 Oct;72(10):3925-9. doi: 10.1073/pnas.72.10.3925.
5
SPrenylC-PseAAC: A sequence-based model developed via Chou's 5-steps rule and general PseAAC for identifying S-prenylation sites in proteins.SPrenylC-PseAAC:一种基于序列的模型,通过 Chou 的 5 步规则和广义 PseAAC 开发,用于识别蛋白质中的 S- prenylation 位点。
J Theor Biol. 2019 May 7;468:1-11. doi: 10.1016/j.jtbi.2019.02.007. Epub 2019 Feb 12.
6
iMethylK_pseAAC: Improving Accuracy of Lysine Methylation Sites Identification by Incorporating Statistical Moments and Position Relative Features into General PseAAC Chou's 5-steps Rule.iMethylK_pseAAC:通过将统计矩和位置相关特征纳入通用伪氨基酸组成的周氏五步法则来提高赖氨酸甲基化位点识别的准确性
Curr Genomics. 2019 May;20(4):275-292. doi: 10.2174/1389202920666190809095206.
7
Using CHOU'S 5-Steps Rule to Predict O-Linked Serine Glycosylation Sites by Blending Position Relative Features and Statistical Moment.使用 CHOU'S 5 步规则,通过混合位置相对特征和统计矩来预测 O-链接丝氨酸糖基化位点。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Sep-Oct;18(5):2045-2056. doi: 10.1109/TCBB.2020.2968441. Epub 2021 Oct 11.
8
iPhosH-PseAAC: Identify Phosphohistidine Sites in Proteins by Blending Statistical Moments and Position Relative Features According to the Chou's 5-Step Rule and General Pseudo Amino Acid Composition.iPhosH-PseAAC:根据周的五步法则和广义伪氨基酸组成,通过融合统计矩和位置相对特征来识别蛋白质中的磷酸组氨酸位点。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Mar-Apr;18(2):596-610. doi: 10.1109/TCBB.2019.2919025. Epub 2021 Apr 6.
9
Carboxylator: incorporating solvent-accessible surface area for identifying protein carboxylation sites.Carboxylator:一种用于鉴定蛋白质羧化位点的方法,它整合了溶剂可及表面积的概念。
J Comput Aided Mol Des. 2011 Oct;25(10):987-95. doi: 10.1007/s10822-011-9477-2. Epub 2011 Oct 22.
10
iSUMOK-PseAAC: prediction of lysine sumoylation sites using statistical moments and Chou's PseAAC.iSUMOK-PseAAC:利用统计矩和周氏伪氨基酸组成预测赖氨酸的类泛素化位点
PeerJ. 2021 Aug 4;9:e11581. doi: 10.7717/peerj.11581. eCollection 2021.

引用本文的文献

1
Harnessing multi-output machine learning approach and dynamical observables from network structure to optimize COVID-19 intervention strategies.利用多输出机器学习方法和网络结构中的动态可观测量来优化新冠肺炎干预策略。
Biol Methods Protoc. 2025 Jun 5;10(1):bpaf039. doi: 10.1093/biomethods/bpaf039. eCollection 2025.
2
2OM-Pred: prediction of 2-O-methylation sites in ribonucleic acid using diverse classifiers.2OM-Pred:使用多种分类器预测核糖核酸中的2-O-甲基化位点。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf282.
3
m5c-iEnsem: 5-methylcytosine sites identification through ensemble models.

本文引用的文献

1
iPhosY-PseAAC: identify phosphotyrosine sites by incorporating sequence statistical moments into PseAAC.iPhosY-PseAAC:通过将序列统计矩纳入伪氨基酸组成来识别磷酸酪氨酸位点。
Mol Biol Rep. 2018 Dec;45(6):2501-2509. doi: 10.1007/s11033-018-4417-z. Epub 2018 Oct 11.
2
Predicting subcellular localization of multi-label proteins by incorporating the sequence features into Chou's PseAAC.通过将序列特征纳入 Chou 的 PseAAC 来预测多标签蛋白质的亚细胞定位。
Genomics. 2019 Dec;111(6):1325-1332. doi: 10.1016/j.ygeno.2018.09.004. Epub 2018 Sep 7.
3
Predicting membrane protein types by incorporating a novel feature set into Chou's general PseAAC.
m5c-iEnsem:通过集成模型进行5-甲基胞嘧啶位点识别。
Bioinformatics. 2022 Jan 1;41(1). doi: 10.1093/bioinformatics/btae722.
4
Multiclass classification of Autism Spectrum Disorder, attention deficit hyperactivity disorder, and typically developed individuals using fMRI functional connectivity analysis.使用 fMRI 功能连接分析对自闭症谱系障碍、注意缺陷多动障碍和典型发育个体进行多类分类。
PLoS One. 2024 Oct 17;19(10):e0305630. doi: 10.1371/journal.pone.0305630. eCollection 2024.
5
DEL-Thyroid: deep ensemble learning framework for detection of thyroid cancer progression through genomic mutation.DEL-Thyroid:通过基因组突变检测甲状腺癌进展的深度集成学习框架。
BMC Med Inform Decis Mak. 2024 Jul 22;24(1):198. doi: 10.1186/s12911-024-02604-1.
6
A Framework for Prediction of Oncogenomic Progression Aiding Personalized Treatment of Gastric Cancer.一种辅助胃癌个性化治疗的肿瘤基因组进展预测框架。
Diagnostics (Basel). 2023 Jul 6;13(13):2291. doi: 10.3390/diagnostics13132291.
7
Hemolytic-Pred: A machine learning-based predictor for hemolytic proteins using position and composition-based features.溶血预测器:一种基于机器学习的溶血蛋白预测工具,使用基于位置和组成的特征。
Digit Health. 2023 Jul 5;9:20552076231180739. doi: 10.1177/20552076231180739. eCollection 2023 Jan-Dec.
8
Ensemble Learning for Hormone Binding Protein Prediction: A Promising Approach for Early Diagnosis of Thyroid Hormone Disorders in Serum.用于激素结合蛋白预测的集成学习:血清甲状腺激素紊乱早期诊断的一种有前景的方法。
Diagnostics (Basel). 2023 Jun 1;13(11):1940. doi: 10.3390/diagnostics13111940.
9
EDLM: Ensemble Deep Learning Model to Detect Mutation for the Early Detection of Cholangiocarcinoma.EDLM:用于胆管癌早期检测的突变检测集成深度学习模型。
Genes (Basel). 2023 May 18;14(5):1104. doi: 10.3390/genes14051104.
10
Diagnosis of autism spectrum disorder based on functional brain networks and machine learning.基于功能脑网络和机器学习的自闭症谱系障碍诊断。
Sci Rep. 2023 May 18;13(1):8072. doi: 10.1038/s41598-023-34650-6.
通过将一种新的特征集纳入到周的通用 PseAAC 中,来预测膜蛋白类型。
J Theor Biol. 2018 Oct 14;455:319-328. doi: 10.1016/j.jtbi.2018.07.032. Epub 2018 Jul 27.
4
Predicting structural classes of proteins by incorporating their global and local physicochemical and conformational properties into general Chou's PseAAC.通过将蛋白质的全局和局部物理化学和构象特性纳入通用的 Chou 的 PseAAC 中,来预测蛋白质的结构类别。
J Theor Biol. 2018 Oct 7;454:139-145. doi: 10.1016/j.jtbi.2018.05.033. Epub 2018 Jun 2.
5
iPhosT-PseAAC: Identify phosphothreonine sites by incorporating sequence statistical moments into PseAAC.iPhosT-PseAAC:通过将序列统计矩纳入伪氨基酸组成来识别磷酸苏氨酸位点。
Anal Biochem. 2018 Jun 1;550:109-116. doi: 10.1016/j.ab.2018.04.021. Epub 2018 Apr 25.
6
Using Chou's general PseAAC to analyze the evolutionary relationship of receptor associated proteins (RAP) with various folding patterns of protein domains.使用周的通用伪氨基酸组成方法来分析受体相关蛋白(RAP)与蛋白质结构域各种折叠模式之间的进化关系。
J Theor Biol. 2018 May 14;445:62-74. doi: 10.1016/j.jtbi.2018.02.008. Epub 2018 Feb 22.
7
iMem-2LSAAC: A two-level model for discrimination of membrane proteins and their types by extending the notion of SAAC into chou's pseudo amino acid composition.iMem-2LSAAC:一种通过将SAAC概念扩展到周氏伪氨基酸组成来区分膜蛋白及其类型的两级模型。
J Theor Biol. 2018 Apr 7;442:11-21. doi: 10.1016/j.jtbi.2018.01.008. Epub 2018 Jan 11.
8
Prediction of N-linked glycosylation sites using position relative features and statistical moments.利用位置相关特征和统计矩预测N-糖基化位点
PLoS One. 2017 Aug 10;12(8):e0181966. doi: 10.1371/journal.pone.0181966. eCollection 2017.
9
Recent advances in hemophilia B therapy.乙型血友病治疗的最新进展。
Drug Deliv Transl Res. 2017 Jun;7(3):359-371. doi: 10.1007/s13346-017-0365-8.
10
A Treatise to Computational Approaches Towards Prediction of Membrane Protein and Its Subtypes.关于膜蛋白及其亚型预测的计算方法的论文
J Membr Biol. 2017 Feb;250(1):55-76. doi: 10.1007/s00232-016-9937-7. Epub 2016 Nov 19.