• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用深度学习计算鉴定 4-羧基谷氨酸位点,以补充生理研究。

Computational identification of 4-carboxyglutamate sites to supplement physiological studies using deep learning.

机构信息

Department of Computer Science, University of Management and Technology, Lahore, 54770, Pakistan.

Computer and Information Sciences Department, Universiti Teknologi PETRONAS, 32610, Seri Iskandar, Malaysia.

出版信息

Sci Rep. 2022 Jan 7;12(1):128. doi: 10.1038/s41598-021-03895-4.

DOI:10.1038/s41598-021-03895-4
PMID:34996975
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8741832/
Abstract

In biological systems, Glutamic acid is a crucial amino acid which is used in protein biosynthesis. Carboxylation of glutamic acid is a significant post-translational modification which plays important role in blood coagulation by activating prothrombin to thrombin. Contrariwise, 4-carboxy-glutamate is also found to be involved in diseases including plaque atherosclerosis, osteoporosis, mineralized heart valves, bone resorption and serves as biomarker for onset of these diseases. Owing to the pathophysiological significance of 4-carboxyglutamate, its identification is important to better understand pathophysiological systems. The wet lab identification of prospective 4-carboxyglutamate sites is costly, laborious and time consuming due to inherent difficulties of in-vivo, ex-vivo and in vitro experiments. To supplement these experiments, we proposed, implemented, and evaluated a different approach to develop 4-carboxyglutamate site predictors using pseudo amino acid compositions (PseAAC) and deep neural networks (DNNs). Our approach does not require any feature extraction and employs deep neural networks to learn feature representation of peptide sequences and performing classification thereof. Proposed approach is validated using standard performance evaluation metrics. Among different deep neural networks, convolutional neural network-based predictor achieved best scores on independent dataset with accuracy of 94.7%, AuC score of 0.91 and F1-score of 0.874 which shows the promise of proposed approach. The iCarboxE-Deep server is deployed at https://share.streamlit.io/sheraz-n/carboxyglutamate/app.py .

摘要

在生物系统中,谷氨酸是一种重要的氨基酸,用于蛋白质的生物合成。谷氨酸的羧化作用是一种重要的翻译后修饰,通过激活凝血酶原转化为凝血酶,在血液凝固中发挥重要作用。相反,4-羧基谷氨酸也被发现与包括斑块动脉粥样硬化、骨质疏松症、矿化心脏瓣膜、骨吸收在内的疾病有关,并作为这些疾病发病的生物标志物。由于 4-羧基谷氨酸的病理生理意义,其鉴定对于更好地理解病理生理系统非常重要。由于体内、体外和体外实验固有的困难,对潜在 4-羧基谷氨酸位点的湿实验室鉴定既昂贵又费力且耗时。为了补充这些实验,我们提出、实施和评估了一种使用伪氨基酸组成(PseAAC)和深度神经网络(DNN)开发 4-羧基谷氨酸位点预测器的不同方法。我们的方法不需要任何特征提取,并使用深度神经网络学习肽序列的特征表示,并对其进行分类。所提出的方法使用标准性能评估指标进行验证。在不同的深度神经网络中,基于卷积神经网络的预测器在独立数据集上取得了最佳成绩,准确率为 94.7%,AUC 得分为 0.91,F1 得分为 0.874,这表明了所提出方法的前景。iCarboxE-Deep 服务器部署在 https://share.streamlit.io/sheraz-n/carboxyglutamate/app.py 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/eb664614f2ec/41598_2021_3895_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/c139e32ed506/41598_2021_3895_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/86e7939596a0/41598_2021_3895_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/4856f13adaa4/41598_2021_3895_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/301d3e5e192c/41598_2021_3895_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/7f71aeae5f38/41598_2021_3895_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/f53afbf7b8fb/41598_2021_3895_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/a70204d09dad/41598_2021_3895_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/f694ab489ece/41598_2021_3895_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/b4ab72b61289/41598_2021_3895_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/ecc796a6696e/41598_2021_3895_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/c53558b694fa/41598_2021_3895_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/eb664614f2ec/41598_2021_3895_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/c139e32ed506/41598_2021_3895_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/86e7939596a0/41598_2021_3895_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/4856f13adaa4/41598_2021_3895_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/301d3e5e192c/41598_2021_3895_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/7f71aeae5f38/41598_2021_3895_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/f53afbf7b8fb/41598_2021_3895_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/a70204d09dad/41598_2021_3895_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/f694ab489ece/41598_2021_3895_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/b4ab72b61289/41598_2021_3895_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/ecc796a6696e/41598_2021_3895_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/c53558b694fa/41598_2021_3895_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6774/8741832/eb664614f2ec/41598_2021_3895_Fig12_HTML.jpg

相似文献

1
Computational identification of 4-carboxyglutamate sites to supplement physiological studies using deep learning.利用深度学习计算鉴定 4-羧基谷氨酸位点,以补充生理研究。
Sci Rep. 2022 Jan 7;12(1):128. doi: 10.1038/s41598-021-03895-4.
2
iGluK-Deep: computational identification of lysine glutarylation sites using deep neural networks with general pseudo amino acid compositions.iGluK-Deep:利用具有通用伪氨基酸组成的深度神经网络对赖氨酸戊二酰化位点进行计算识别。
J Biomol Struct Dyn. 2022;40(22):11691-11704. doi: 10.1080/07391102.2021.1962738. Epub 2021 Aug 16.
3
iPhosS(Deep)-PseAAC: Identification of Phosphoserine Sites in Proteins Using Deep Learning on General Pseudo Amino Acid Compositions.iPhosS(Deep)-PseAAC:基于广义伪氨基酸组成的深度学习算法鉴定蛋白质磷酸丝氨酸位点
IEEE/ACM Trans Comput Biol Bioinform. 2022 May-Jun;19(3):1703-1714. doi: 10.1109/TCBB.2020.3040747. Epub 2022 Jun 3.
4
Identification of 4-carboxyglutamate residue sites based on position based statistical feature and multiple classification.基于位置的统计特征和多分类识别 4-羧基谷氨酸残基位点
Sci Rep. 2020 Oct 9;10(1):16913. doi: 10.1038/s41598-020-73107-y.
5
Optimization of serine phosphorylation prediction in proteins by comparing human engineered features and deep representations.通过比较人类工程特征和深度表示来优化蛋白质丝氨酸磷酸化预测。
Anal Biochem. 2021 Feb 15;615:114069. doi: 10.1016/j.ab.2020.114069. Epub 2020 Dec 16.
6
DeepSSPred: A Deep Learning Based Sulfenylation Site Predictor Via a Novel nSegmented Optimize Federated Feature Encoder.DeepSSPred:一种基于深度学习的新型 nSegmented Optimize 联邦特征编码器的硫化位点预测器。
Protein Pept Lett. 2021;28(6):708-721. doi: 10.2174/0929866527666201202103411.
7
DeepPPSite: A deep learning-based model for analysis and prediction of phosphorylation sites using efficient sequence information.DeepPPSite:一种基于深度学习的模型,用于利用有效的序列信息分析和预测磷酸化位点。
Anal Biochem. 2021 Jan 1;612:113955. doi: 10.1016/j.ab.2020.113955. Epub 2020 Sep 16.
8
Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences.全面综述和评估基于 RNA 序列预测 RNA 转录后修饰位点的计算方法。
Brief Bioinform. 2020 Sep 25;21(5):1676-1696. doi: 10.1093/bib/bbz112.
9
DeepPRMS: advanced deep learning model to predict protein arginine methylation sites.DeepPRMS:用于预测蛋白质精氨酸甲基化位点的高级深度学习模型。
Brief Funct Genomics. 2024 Jul 19;23(4):452-463. doi: 10.1093/bfgp/elae001.
10
Direct identification of the calcium-binding amino acid, gamma-carboxyglutamate, in mineralized tissue.在矿化组织中直接鉴定钙结合氨基酸γ-羧基谷氨酸。
Proc Natl Acad Sci U S A. 1975 Oct;72(10):3925-9. doi: 10.1073/pnas.72.10.3925.

引用本文的文献

1
TNFR-LSTM: A Deep Intelligent Model for Identification of Tumour Necroses Factor Receptor (TNFR) Activity.TNFR-LSTM:一种用于识别肿瘤坏死因子受体(TNFR)活性的深度智能模型。
IET Syst Biol. 2025 Jan-Dec;19(1):e70007. doi: 10.1049/syb2.70007.
2
ESM-BBB-Pred: a fine-tuned ESM 2.0 and deep neural networks for the identification of blood-brain barrier peptides.ESM-BBB-Pred:一种用于识别血脑屏障肽的微调ESM 2.0和深度神经网络。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf066.
3
DeepO-GlcNAc: a web server for prediction of protein O-GlcNAcylation sites using deep learning combined with attention mechanism.

本文引用的文献

1
Optimization of serine phosphorylation prediction in proteins by comparing human engineered features and deep representations.通过比较人类工程特征和深度表示来优化蛋白质丝氨酸磷酸化预测。
Anal Biochem. 2021 Feb 15;615:114069. doi: 10.1016/j.ab.2020.114069. Epub 2020 Dec 16.
2
iPhosS(Deep)-PseAAC: Identification of Phosphoserine Sites in Proteins Using Deep Learning on General Pseudo Amino Acid Compositions.iPhosS(Deep)-PseAAC:基于广义伪氨基酸组成的深度学习算法鉴定蛋白质磷酸丝氨酸位点
IEEE/ACM Trans Comput Biol Bioinform. 2022 May-Jun;19(3):1703-1714. doi: 10.1109/TCBB.2020.3040747. Epub 2022 Jun 3.
3
DeepO-GlcNAc:一种利用深度学习结合注意力机制预测蛋白质O-连接N-乙酰葡糖胺化位点的网络服务器。
Front Cell Dev Biol. 2024 Oct 10;12:1456728. doi: 10.3389/fcell.2024.1456728. eCollection 2024.
4
Protein feature engineering framework for AMPylation site prediction.蛋白质修饰位点预测的特征工程框架。
Sci Rep. 2024 Apr 15;14(1):8695. doi: 10.1038/s41598-024-58450-8.
5
Stmol: A component for building interactive molecular visualizations within streamlit web-applications.Stmol:一个用于在Streamlit网络应用程序中构建交互式分子可视化的组件。
Front Mol Biosci. 2022 Sep 23;9:990846. doi: 10.3389/fmolb.2022.990846. eCollection 2022.
Deep-Kcr: accurate detection of lysine crotonylation sites using deep learning method.
Deep-Kcr:一种使用深度学习方法准确检测赖氨酸巴豆酰化位点的技术。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa255.
4
Identification of 4-carboxyglutamate residue sites based on position based statistical feature and multiple classification.基于位置的统计特征和多分类识别 4-羧基谷氨酸残基位点
Sci Rep. 2020 Oct 9;10(1):16913. doi: 10.1038/s41598-020-73107-y.
5
iCarPS: a computational tool for identifying protein carbonylation sites by novel encoded features.iCarPS:一种通过新型编码特征识别蛋白质羰基化位点的计算工具。
Bioinformatics. 2021 Apr 19;37(2):171-177. doi: 10.1093/bioinformatics/btaa702.
6
The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation.马修斯相关系数(MCC)在二分类评估中优于 F1 得分和准确率的优势。
BMC Genomics. 2020 Jan 2;21(1):6. doi: 10.1186/s12864-019-6413-7.
7
Prediction of lysine formylation sites using the composition of k-spaced amino acid pairs via Chou's 5-steps rule and general pseudo components.利用 Chou 的 5 步规则和广义伪氨基酸组成预测赖氨酸酰化位点。
Genomics. 2020 Jan;112(1):859-866. doi: 10.1016/j.ygeno.2019.05.027. Epub 2019 Jun 6.
8
iPhosH-PseAAC: Identify Phosphohistidine Sites in Proteins by Blending Statistical Moments and Position Relative Features According to the Chou's 5-Step Rule and General Pseudo Amino Acid Composition.iPhosH-PseAAC:根据周的五步法则和广义伪氨基酸组成,通过融合统计矩和位置相对特征来识别蛋白质中的磷酸组氨酸位点。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Mar-Apr;18(2):596-610. doi: 10.1109/TCBB.2019.2919025. Epub 2021 Apr 6.
9
SPrenylC-PseAAC: A sequence-based model developed via Chou's 5-steps rule and general PseAAC for identifying S-prenylation sites in proteins.SPrenylC-PseAAC:一种基于序列的模型,通过 Chou 的 5 步规则和广义 PseAAC 开发,用于识别蛋白质中的 S- prenylation 位点。
J Theor Biol. 2019 May 7;468:1-11. doi: 10.1016/j.jtbi.2019.02.007. Epub 2019 Feb 12.
10
SPalmitoylC-PseAAC: A sequence-based model developed via Chou's 5-steps rule and general PseAAC for identifying S-palmitoylation sites in proteins.SPalmitoylC-PseAAC:一种基于序列的模型,通过 Chou 的 5 步规则和通用 PseAAC 开发,用于识别蛋白质中的 S-棕榈酰化位点。
Anal Biochem. 2019 Mar 1;568:14-23. doi: 10.1016/j.ab.2018.12.019. Epub 2018 Dec 26.