Liu Gang, Neelamegham Sriram
Department of Chemical and Biological Engineering, State University of New York, Buffalo, NY, USA.
Wiley Interdiscip Rev Syst Biol Med. 2015 Jul-Aug;7(4):163-81. doi: 10.1002/wsbm.1296. Epub 2015 Apr 13.
The glycome constitutes the entire complement of free carbohydrates and glycoconjugates expressed on whole cells or tissues. 'Systems Glycobiology' is an emerging discipline that aims to quantitatively describe and analyse the glycome. Here, instead of developing a detailed understanding of single biochemical processes, a combination of computational and experimental tools are used to seek an integrated or 'systems-level' view. This can explain how multiple biochemical reactions and transport processes interact with each other to control glycome biosynthesis and function. Computational methods in this field commonly build in silico reaction network models to describe experimental data derived from structural studies that measure cell-surface glycan distribution. While considerable progress has been made, several challenges remain due to the complex and heterogeneous nature of this post-translational modification. First, for the in silico models to be standardized and shared among laboratories, it is necessary to integrate glycan structure information and glycosylation-related enzyme definitions into the mathematical models. Second, as glycoinformatics resources grow, it would be attractive to utilize 'Big Data' stored in these repositories for model construction and validation. Third, while the technology for profiling the glycome at the whole-cell level has been standardized, there is a need to integrate mass spectrometry derived site-specific glycosylation data into the models. The current review discusses progress that is being made to resolve the above bottlenecks. The focus is on how computational models can bridge the gap between 'data' generated in wet-laboratory studies with 'knowledge' that can enhance our understanding of the glycome.
糖组构成了在全细胞或组织上表达的游离碳水化合物和糖缀合物的全部集合。“系统糖生物学”是一门新兴学科,旨在对糖组进行定量描述和分析。在这里,不是详细了解单个生化过程,而是使用计算和实验工具的组合来寻求综合的或“系统水平”的观点。这可以解释多个生化反应和转运过程如何相互作用以控制糖组的生物合成和功能。该领域的计算方法通常构建计算机反应网络模型,以描述从测量细胞表面聚糖分布的结构研究中获得的实验数据。虽然已经取得了相当大的进展,但由于这种翻译后修饰的复杂性和异质性,仍然存在一些挑战。首先,为了使计算机模型在各实验室之间标准化和共享,有必要将聚糖结构信息和糖基化相关酶的定义整合到数学模型中。其次,随着糖组学资源的增加,利用存储在这些数据库中的“大数据”进行模型构建和验证将很有吸引力。第三,虽然全细胞水平上糖组分析技术已经标准化,但仍需要将质谱衍生的位点特异性糖基化数据整合到模型中。本综述讨论了为解决上述瓶颈所取得的进展。重点是计算模型如何能够弥合湿实验室研究中产生的“数据”与能够增强我们对糖组理解的“知识”之间的差距。