Matsubara Masaaki, Bolton Evan E, Aoki-Kinoshita Kiyoko F, Yamada Issaku
The Noguchi Institute, Itabashi, Tokyo, 173-0003, Japan.
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA.
Anal Bioanal Chem. 2025 Feb;417(5):945-956. doi: 10.1007/s00216-024-05508-1. Epub 2024 Aug 30.
Integration of glycan-related databases between different research fields is essential in glycoscience. It requires knowledge across the breadth of science because most glycans exist as glycoconjugates. On the other hand, especially between chemistry and biology, glycan data has not been easy to integrate due to the huge variety of glycan structure representations. We have developed WURCS (Web 3.0 Unique Representation of Carbohydrate Structures) as a notation for representing all glycan structures uniquely for the purpose of integrating data across scientific data resources. While the integration of glycan data in the field of biology has been greatly advanced, in the field of chemistry, progress has been hampered due to the lack of appropriate rules to extract sugars from chemical structures. Thus, we developed a unique algorithm to determine the range of structures allowed to be considered as sugars from the structural formulae of compounds, and we developed software to extract sugars in WURCS format according to this algorithm. In this manuscript, we show that our algorithm can extract sugars from glycoconjugate molecules represented at the molecular level and can distinguish them from other biomolecules, such as amino acids, nucleic acids, and lipids. Available as software, MolWURCS is freely available and downloadable ( https://gitlab.com/glycoinfo/molwurcs ).
在糖科学领域,整合不同研究领域中与聚糖相关的数据库至关重要。这需要广泛的科学知识,因为大多数聚糖以糖缀合物的形式存在。另一方面,尤其是在化学和生物学之间,由于聚糖结构表示形式的巨大多样性,聚糖数据一直难以整合。我们开发了WURCS(碳水化合物结构的Web 3.0唯一表示法)作为一种表示法,用于唯一地表示所有聚糖结构,以便跨科学数据资源整合数据。虽然生物学领域中聚糖数据的整合取得了很大进展,但在化学领域,由于缺乏从化学结构中提取糖类的适当规则,进展受到了阻碍。因此,我们开发了一种独特的算法,用于从化合物的结构式中确定可被视为糖类的结构范围,并开发了软件,根据该算法提取WURCS格式的糖类。在本手稿中,我们表明我们的算法可以从分子水平表示的糖缀合物分子中提取糖类,并能将它们与其他生物分子,如氨基酸、核酸和脂质区分开来。MolWURCS作为软件可免费获取和下载(https://gitlab.com/glycoinfo/molwurcs )。