School of Engineering & Physics, University of the South Pacific, Suva, Fiji.
Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Tokyo, Japan.
BMC Bioinformatics. 2019 Feb 4;19(Suppl 13):547. doi: 10.1186/s12859-018-2547-x.
Glycation is a one of the post-translational modifications (PTM) where sugar molecules and residues in protein sequences are covalently bonded. It has become one of the clinically important PTM in recent times attributed to many chronic and age related complications. Being a non-enzymatic reaction, it is a great challenge when it comes to its prediction due to the lack of significant bias in the sequence motifs.
We developed a classifier, GlyStruct based on support vector machine, to predict glycated and non-glycated lysine residues using structural properties of amino acid residues. The features used were secondary structure, accessible surface area and the local backbone torsion angles. For this work, a benchmark dataset was extracted containing 235 glycated and 303 non-glycated lysine residues. GlyStruct demonstrated improved performance of approximately 10% in comparison to benchmark method of Gly-PseAAC. The performance for GlyStruct on the metrics, sensitivity, specificity, accuracy and Mathew's correlation coefficient were 0.7013, 0.7989, 0.7562, and 0.5065, respectively for 10-fold cross-validation.
Glycation has emerged to be one of the clinically important PTM of proteins in recent times. Therefore, the development of computational tools become necessary to predict glycation, which could help medical professionals administer drugs and manage patients more effectively. The proposed predictor manages to classify glycated and non-glycated lysine residues with promising results consistently on various cross-validation schemes and outperforms other state of the art methods.
糖基化是一种翻译后修饰(PTM)之一,其中糖分子和蛋白质序列中的残基通过共价键结合。由于许多慢性和与年龄相关的并发症,它已成为最近临床上重要的 PTM 之一。由于序列基序中没有明显的偏差,因此作为一种非酶反应,对其进行预测是一个巨大的挑战。
我们开发了一种基于支持向量机的分类器 GlyStruct,用于使用氨基酸残基的结构特性预测糖化和非糖化赖氨酸残基。使用的特征包括二级结构、可及表面积和局部骨架扭转角。为此工作,提取了一个包含 235 个糖化和 303 个非糖化赖氨酸残基的基准数据集。与 Gly-PseAAC 的基准方法相比,GlyStruct 的性能提高了约 10%。在 10 倍交叉验证中,GlyStruct 在灵敏度、特异性、准确性和 Matthew 相关系数等指标上的性能分别为 0.7013、0.7989、0.7562 和 0.5065。
糖基化已成为近年来蛋白质中临床重要的 PTM 之一。因此,开发计算工具来预测糖化成为必要,这可以帮助医疗专业人员更有效地管理药物和管理患者。所提出的预测器能够一致地对各种交叉验证方案进行分类,对糖化和非糖化赖氨酸残基进行分类,并取得了有希望的结果,优于其他最先进的方法。