Senger Ryan S, Karim M Nazmul
Delaware Biotechnology Institute, University of Delaware, Newark, DE 19711, USA.
Math Biosci. 2008 Jan;211(1):89-104. doi: 10.1016/j.mbs.2007.10.005. Epub 2007 Oct 30.
A model was developed for novel prediction of N-linked glycan branching pattern classification for CHO-derived N-linked glycoproteins. The model consists of 30 independent recurrent neural networks and uses predicted quantities of secondary structure elements and residue solvent accessibility as an input vector. The model was designed to predict the major component of a heterogeneous mixture of CHO-derived glycoforms of a recombinant protein under normal growth conditions. Resulting glycosylation prediction is classified as either complex-type or high mannose. The incorporation of predicted quantities in the input vector allowed for theoretical mutant N-linked glycan branching predictions without initial experimental analysis of protein structures. Primary amino acid sequence data were effectively eliminated from the input vector space based on neural network prediction analyses. This provided further evidence that localized protein secondary structure elements and conformational structure may play more important roles in determining glycan branching patterns than does the primary sequence of a polypeptide. A confidence interval parameter was incorporated into the model to enable identification of false predictions. The model was further tested using published experimental results for mutants of the tissue-type plasminogen activator protein [J. Wilhelm, S.G. Lee, N.K. Kalyan, S.M. Cheng, F. Wiener, W. Pierzchala, P.P. Hung, Alterations in the domain structure of tissue-type plasminogen activator change the nature of asparagine glycosylation. Biotechnology (N.Y.) 8 (1990) 321-325].
开发了一种用于预测CHO来源的N-连接糖蛋白N-连接聚糖分支模式分类的新模型。该模型由30个独立的递归神经网络组成,并使用预测的二级结构元件数量和残基溶剂可及性作为输入向量。该模型旨在预测重组蛋白在正常生长条件下CHO来源糖型的异质混合物的主要成分。所得的糖基化预测分为复合型或高甘露糖型。在输入向量中纳入预测数量,使得无需对蛋白质结构进行初始实验分析就能进行理论突变型N-连接聚糖分支预测。基于神经网络预测分析,从输入向量空间中有效消除了一级氨基酸序列数据。这进一步证明,与多肽的一级序列相比,局部蛋白质二级结构元件和构象结构在决定聚糖分支模式中可能发挥更重要的作用。在模型中纳入了置信区间参数,以识别错误预测。使用已发表的组织型纤溶酶原激活蛋白突变体的实验结果对该模型进行了进一步测试[J. 威廉,S.G. 李,N.K. 卡利安,S.M. 程,F. 维纳,W. 皮尔兹查拉,P.P. 洪,组织型纤溶酶原激活剂结构域结构的改变改变了天冬酰胺糖基化的性质。生物技术(纽约)8(1990)321 - 325]。