Suppr超能文献

利用多视图循环神经网络从人类糖蛋白的蛋白质结构中解码糖基化潜力。

Decoding glycosylation potential from protein structure across human glycoproteins with a multi-view recurrent neural network.

作者信息

Kellman Benjamin P, Mariethoz Julien, Zhang Yujie, Shaul Sigal, Alteri Mia, Sandoval Daniel, Jeffris Mia, Armingol Erick, Bao Bokan, Lisacek Frederique, Bojar Daniel, Lewis Nathan E

机构信息

Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA.

Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA.

出版信息

bioRxiv. 2024 May 23:2024.05.15.594334. doi: 10.1101/2024.05.15.594334.

Abstract

Glycosylation is described as a non-templated biosynthesis. Yet, the template-free premise is antithetical to the observation that different N-glycans are consistently placed at specific sites. It has been proposed that glycosite-proximal protein structures could constrain glycosylation and explain the observed microheterogeneity. Using site-specific glycosylation data, we trained a hybrid neural network to parse glycosites (recurrent neural network) and match them to feasible N-glycosylation events (graph neural network). From glycosite-flanking sequences, the algorithm predicts most human N-glycosylation events documented in the GlyConnect database and proposed structures corresponding to observed monosaccharide composition of the glycans at these sites. The algorithm also recapitulated glycosylation in Enhanced Aromatic Sequons, SARS-CoV-2 spike, and IgG3 variants, thus demonstrating the ability of the algorithm to predict both glycan structure and abundance. Thus, protein structure constrains glycosylation, and the neural network enables predictive glycosylation of uncharacterized or novel protein sequences and genetic variants.

摘要

糖基化被描述为一种非模板化生物合成。然而,无模板这一前提与不同N-聚糖始终位于特定位点的观察结果相悖。有人提出,糖基化位点附近的蛋白质结构可能会限制糖基化,并解释所观察到的微观异质性。利用位点特异性糖基化数据,我们训练了一个混合神经网络来解析糖基化位点(循环神经网络),并将它们与可行的N-糖基化事件(图神经网络)进行匹配。该算法从糖基化位点侧翼序列预测了GlyConnect数据库中记录的大多数人类N-糖基化事件,并提出了与这些位点观察到的聚糖单糖组成相对应的结构。该算法还概括了增强型芳香序列、SARS-CoV-2刺突蛋白和IgG3变体中的糖基化,从而证明了该算法预测聚糖结构和丰度的能力。因此,蛋白质结构限制了糖基化,而神经网络能够对未表征或新的蛋白质序列及基因变体进行预测性糖基化。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20b6/11118808/d63aa7d8b3fd/nihpp-2024.05.15.594334v2-f0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验