• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iHyd-LysSite(EPSV):通过提取增强位置和序列变异特征技术,使用统计公式识别蛋白质中的羟赖氨酸位点。

iHyd-LysSite (EPSV): Identifying Hydroxylysine Sites in Protein Using Statistical Formulation by Extracting Enhanced Position and Sequence Variant Feature Technique.

作者信息

Mahmood Muhammad Khalid, Ehsan Asma, Khan Yaser Daanial, Chou Kuo-Chen

机构信息

1Department of Mathematics, University of the Punjab, Lahore, Pakistan; 2Faculty of Information Technology, University of Management and Tecnology, Lahore, Pakistan; 3Gordon Life Science Institute, Boston, MA02478, USA.

出版信息

Curr Genomics. 2020 Nov;21(7):536-545. doi: 10.2174/1389202921999200831142629.

DOI:10.2174/1389202921999200831142629
PMID:33214770
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7604750/
Abstract

INTRODUCTION

Hydroxylation is one of the most important post-translational modifications (PTM) in cellular functions and is linked to various diseases. The addition of one of the hydroxyl groups (OH) to the lysine sites produces hydroxylysine when undergoes chemical modification.

METHODS

The method which is used in this study for identifying hydroxylysine sites based on powerful mathematical and statistical methodology incorporating the sequence-order effect and composition of each object within protein sequences. This predictor is called "iHyd-LysSite (EPSV)" (identifying hydroxylysine sites by extracting enhanced position and sequence variant technique). The prediction of hydroxylysine sites by experimental methods is difficult, laborious and highly expensive. technique is an alternative approach to identify hydroxylysine sites in proteins.

RESULTS

The experimental results require that the predictive model should have high sensitivity and specificity values and must be more accurate. The self-consistency, independent, 10-fold cross-validation and jackknife tests are performed for validation purposes. These tests are resulted by using three renowned classifiers, Neural Networks (NN), Random Forest (RF) and Support Vector Machine (SVM) with the demanding prediction rate. The overall predictive outcomes are extraordinarily superior to the results obtained by previous predictors. The proposed model contributed an excellent prediction rate in the system for NN, RF, and SVM classifiers. The sensitivity and specificity results using all these classifiers for jackknife test are 96.08%, 94.99%, 98.16% and 97.52%, 98.52%, 80.95%.

CONCLUSION

The results obtained by the proposed tool show that this method may meet the future demand of hydroxylysine sites with a better prediction rate over the existing methods.

摘要

引言

羟基化是细胞功能中最重要的翻译后修饰(PTM)之一,与多种疾病相关。赖氨酸位点经化学修饰添加一个羟基(OH)后会产生羟赖氨酸。

方法

本研究中用于识别羟赖氨酸位点的方法基于强大的数学和统计方法,该方法纳入了蛋白质序列中每个对象的序列顺序效应和组成。这个预测器被称为“iHyd-LysSite (EPSV)”(通过提取增强位置和序列变异技术识别羟赖氨酸位点)。通过实验方法预测羟赖氨酸位点既困难、费力又成本高昂。该技术是识别蛋白质中羟赖氨酸位点的一种替代方法。

结果

实验结果要求预测模型应具有高灵敏度和特异性值,且必须更准确。为了验证目的,进行了自一致性、独立、10折交叉验证和留一法测试。这些测试使用三种著名的分类器——神经网络(NN)、随机森林(RF)和支持向量机(SVM),并具有苛刻的预测率。总体预测结果比以前的预测器获得的结果异常优越。所提出的模型在NN、RF和SVM分类器系统中贡献了出色的预测率。使用所有这些分类器进行留一法测试的灵敏度和特异性结果分别为96.08%、94.99%、98.16%和97.52%、98.52%、80.95%。

结论

所提出工具获得的结果表明,该方法可能以比现有方法更好的预测率满足未来对羟赖氨酸位点的需求。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e992/7604750/e29e24fd53da/CG-21-536_F6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e992/7604750/c4129c063c58/CG-21-536_F1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e992/7604750/ee607cebfd57/CG-21-536_F2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e992/7604750/3da6fcd361c4/CG-21-536_F3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e992/7604750/00b20c9c90f5/CG-21-536_F4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e992/7604750/a5079df07e08/CG-21-536_F5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e992/7604750/e29e24fd53da/CG-21-536_F6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e992/7604750/c4129c063c58/CG-21-536_F1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e992/7604750/ee607cebfd57/CG-21-536_F2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e992/7604750/3da6fcd361c4/CG-21-536_F3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e992/7604750/00b20c9c90f5/CG-21-536_F4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e992/7604750/a5079df07e08/CG-21-536_F5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e992/7604750/e29e24fd53da/CG-21-536_F6.jpg

相似文献

1
iHyd-LysSite (EPSV): Identifying Hydroxylysine Sites in Protein Using Statistical Formulation by Extracting Enhanced Position and Sequence Variant Feature Technique.iHyd-LysSite(EPSV):通过提取增强位置和序列变异特征技术,使用统计公式识别蛋白质中的羟赖氨酸位点。
Curr Genomics. 2020 Nov;21(7):536-545. doi: 10.2174/1389202921999200831142629.
2
iHyd-PseAAC (EPSV): Identifying Hydroxylation Sites in Proteins by Extracting Enhanced Position and Sequence Variant Feature Chou's 5-Step Rule and General Pseudo Amino Acid Composition.iHyd-PseAAC(EPSV):通过提取增强的位置和序列变异特征、周氏五步法则和广义伪氨基酸组成来识别蛋白质中的羟基化位点。
Curr Genomics. 2019 Feb;20(2):124-133. doi: 10.2174/1389202920666190325162307.
3
iHyd-PseCp: Identify hydroxyproline and hydroxylysine in proteins by incorporating sequence-coupled effects into general PseAAC.iHyd-PseCp:通过将序列耦合效应纳入通用伪氨基酸组成来鉴定蛋白质中的羟脯氨酸和羟赖氨酸。
Oncotarget. 2016 Jul 12;7(28):44310-44321. doi: 10.18632/oncotarget.10027.
4
iHyd-PseAAC: predicting hydroxyproline and hydroxylysine in proteins by incorporating dipeptide position-specific propensity into pseudo amino acid composition.iHyd-PseAAC:通过将二肽位置特异性倾向纳入伪氨基酸组成来预测蛋白质中的羟脯氨酸和羟赖氨酸
Int J Mol Sci. 2014 May 5;15(5):7594-610. doi: 10.3390/ijms15057594.
5
A Hybrid Deep Learning Model for Predicting Protein Hydroxylation Sites.一种用于预测蛋白质羟基化位点的混合深度学习模型。
Int J Mol Sci. 2018 Sep 18;19(9):2817. doi: 10.3390/ijms19092817.
6
Prediction and analysis of protein hydroxyproline and hydroxylysine.预测和分析蛋白质羟脯氨酸和羟赖氨酸。
PLoS One. 2010 Dec 31;5(12):e15917. doi: 10.1371/journal.pone.0015917.
7
OH-PRED: prediction of protein hydroxylation sites by incorporating adapted normal distribution bi-profile Bayes feature extraction and physicochemical properties of amino acids.OH-PRED:通过结合适应性正态分布双轮廓贝叶斯特征提取和氨基酸的物理化学性质预测蛋白质羟基化位点
J Biomol Struct Dyn. 2017 Mar;35(4):829-835. doi: 10.1080/07391102.2016.1163294. Epub 2016 May 4.
8
HydPred: a novel method for the identification of protein hydroxylation sites that reveals new insights into human inherited disease.HydPred:一种用于识别蛋白质羟基化位点的新方法,为人类遗传疾病提供了新见解。
Mol Biosyst. 2016 Feb;12(2):490-8. doi: 10.1039/c5mb00681c.
9
iPhosT-PseAAC: Identify phosphothreonine sites by incorporating sequence statistical moments into PseAAC.iPhosT-PseAAC:通过将序列统计矩纳入伪氨基酸组成来识别磷酸苏氨酸位点。
Anal Biochem. 2018 Jun 1;550:109-116. doi: 10.1016/j.ab.2018.04.021. Epub 2018 Apr 25.
10
iDPGK: characterization and identification of lysine phosphoglycerylation sites based on sequence-based features.iDPGK:基于序列特征的赖氨酸磷酸甘油化位点的表征和鉴定。
BMC Bioinformatics. 2020 Dec 9;21(1):568. doi: 10.1186/s12859-020-03916-5.

引用本文的文献

1
iDLB-Pred: identification of disordered lipid binding residues in protein sequences using convolutional neural network.iDLB-Pred:使用卷积神经网络鉴定蛋白质序列中紊乱脂质结合残基
Sci Rep. 2024 Oct 21;14(1):24724. doi: 10.1038/s41598-024-75700-x.
2
m1A-Ensem: accurate identification of 1-methyladenosine sites through ensemble models.m1A-Ensem:通过集成模型准确识别1-甲基腺苷位点。
BioData Min. 2024 Feb 15;17(1):4. doi: 10.1186/s13040-023-00353-x.
3
A Framework for Prediction of Oncogenomic Progression Aiding Personalized Treatment of Gastric Cancer.

本文引用的文献

1
MusiteDeep: a deep-learning based webserver for protein post-translational modification site prediction and visualization.MusiteDeep:一个基于深度学习的蛋白质翻译后修饰位点预测和可视化的网络服务器。
Nucleic Acids Res. 2020 Jul 2;48(W1):W140-W146. doi: 10.1093/nar/gkaa275.
2
iCrotoK-PseAAC: Identify lysine crotonylation sites by blending position relative statistical features according to the Chou's 5-step rule.iCrotoK-PseAAC:根据周的五步规则,通过混合位置相对统计特征来识别赖氨酸巴豆酰化位点。
PLoS One. 2019 Nov 21;14(11):e0223993. doi: 10.1371/journal.pone.0223993. eCollection 2019.
3
iHyd-PseAAC (EPSV): Identifying Hydroxylation Sites in Proteins by Extracting Enhanced Position and Sequence Variant Feature Chou's 5-Step Rule and General Pseudo Amino Acid Composition.
一种辅助胃癌个性化治疗的肿瘤基因组进展预测框架。
Diagnostics (Basel). 2023 Jul 6;13(13):2291. doi: 10.3390/diagnostics13132291.
4
Hemolytic-Pred: A machine learning-based predictor for hemolytic proteins using position and composition-based features.溶血预测器:一种基于机器学习的溶血蛋白预测工具,使用基于位置和组成的特征。
Digit Health. 2023 Jul 5;9:20552076231180739. doi: 10.1177/20552076231180739. eCollection 2023 Jan-Dec.
5
Ensemble Learning for Hormone Binding Protein Prediction: A Promising Approach for Early Diagnosis of Thyroid Hormone Disorders in Serum.用于激素结合蛋白预测的集成学习:血清甲状腺激素紊乱早期诊断的一种有前景的方法。
Diagnostics (Basel). 2023 Jun 1;13(11):1940. doi: 10.3390/diagnostics13111940.
6
EDLM: Ensemble Deep Learning Model to Detect Mutation for the Early Detection of Cholangiocarcinoma.EDLM:用于胆管癌早期检测的突变检测集成深度学习模型。
Genes (Basel). 2023 May 18;14(5):1104. doi: 10.3390/genes14051104.
7
iDHU-Ensem: Identification of dihydrouridine sites through ensemble learning models.iDHU-Ensem:通过集成学习模型识别二氢尿苷位点。
Digit Health. 2023 Mar 29;9:20552076231165963. doi: 10.1177/20552076231165963. eCollection 2023 Jan-Dec.
8
IGPred-HDnet: Prediction of Immunoglobulin Proteins Using Graphical Features and the Hierarchal Deep Learning-Based Approach.IGPred-HDnet:基于图特征和层次深度学习的免疫球蛋白蛋白预测方法。
Comput Intell Neurosci. 2023 Jan 25;2023:2465414. doi: 10.1155/2023/2465414. eCollection 2023.
9
DHU-Pred: accurate prediction of dihydrouridine sites using position and composition variant features on diverse classifiers.DHU-Pred:使用多种分类器上的位置和组成变体特征准确预测二氢尿嘧啶位点。
PeerJ. 2022 Oct 27;10:e14104. doi: 10.7717/peerj.14104. eCollection 2022.
10
An analytical study on the identification of N-linked glycosylation sites using machine learning model.基于机器学习模型的N-糖基化位点识别分析研究
PeerJ Comput Sci. 2022 Sep 21;8:e1069. doi: 10.7717/peerj-cs.1069. eCollection 2022.
iHyd-PseAAC(EPSV):通过提取增强的位置和序列变异特征、周氏五步法则和广义伪氨基酸组成来识别蛋白质中的羟基化位点。
Curr Genomics. 2019 Feb;20(2):124-133. doi: 10.2174/1389202920666190325162307.
4
Prediction of lysine formylation sites using the composition of k-spaced amino acid pairs via Chou's 5-steps rule and general pseudo components.利用 Chou 的 5 步规则和广义伪氨基酸组成预测赖氨酸酰化位点。
Genomics. 2020 Jan;112(1):859-866. doi: 10.1016/j.ygeno.2019.05.027. Epub 2019 Jun 6.
5
Computational identification of microbial phosphorylation sites by the enhanced characteristics of sequence information.通过增强序列信息的特征来计算识别微生物磷酸化位点。
Sci Rep. 2019 Jun 4;9(1):8258. doi: 10.1038/s41598-019-44548-x.
6
HawkDock: a web server to predict and analyze the protein-protein complex based on computational docking and MM/GBSA.HawkDock:一个基于计算对接和 MM/GBSA 预测和分析蛋白质-蛋白质复合物的网络服务器。
Nucleic Acids Res. 2019 Jul 2;47(W1):W322-W330. doi: 10.1093/nar/gkz397.
7
Large-Scale Assessment of Bioinformatics Tools for Lysine Succinylation Sites.赖氨酸琥珀酰化位点的生物信息学工具的大规模评估。
Cells. 2019 Jan 28;8(2):95. doi: 10.3390/cells8020095.
8
PTM-ssMP: A Web Server for Predicting Different Types of Post-translational Modification Sites Using Novel Site-specific Modification Profile.PTM-ssMP:一个使用新型的特异性修饰谱预测不同类型的翻译后修饰位点的网络服务器。
Int J Biol Sci. 2018 May 22;14(8):946-956. doi: 10.7150/ijbs.24121. eCollection 2018.
9
A Novel Modeling in Mathematical Biology for Classification of Signal Peptides.一种用于信号肽分类的数学生物学新模型。
Sci Rep. 2018 Jan 18;8(1):1039. doi: 10.1038/s41598-018-19491-y.
10
Computational identification of protein S-sulfenylation sites by incorporating the multiple sequence features information.通过整合多序列特征信息对蛋白质S-亚磺酰化位点进行计算识别。
Mol Biosyst. 2017 Nov 21;13(12):2545-2550. doi: 10.1039/c7mb00491e.