N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninsky prospect 47, Moscow, 119991, Russia.
Sci Data. 2022 Mar 30;9(1):131. doi: 10.1038/s41597-022-01186-9.
The Carbohydrate Structure Database (CSDB, http://csdb.glycoscience.ru/ ) is a free curated repository storing various data on glycans of bacterial, fungal and plant origins. Currently, it maintains a close-to-full coverage on bacterial and fungal carbohydrates up to the year 2020. The CSDB web-interface provides free access to the database content and dedicated tools. Still, the number of these tools and the types of the corresponding analyses is limited, whereas the database itself contains data that can be used in a broader scope of analytical studies. In this paper, we present CSDB source data files and a self-contained SQL dump, and exemplify their possible application in glycan-related studies. By using CSDB in an SQL format, the user can gain access to the chain length distribution or charge distribution (as an example) in a given set of glycans defined according to specific structural, taxonomic, or other parameters, whereas the source text dump files can be imported to any dedicated database with a specific internal architecture differing from that of CSDB.
碳水化合物结构数据库(CSDB,http://csdb.glycoscience.ru/)是一个免费的经过精心整理的存储库,用于存储细菌、真菌和植物来源的聚糖的各种数据。目前,它对细菌和真菌的碳水化合物的涵盖范围接近 2020 年的最新数据。CSDB 的 Web 界面提供了对数据库内容和专用工具的免费访问。尽管这些工具的数量和相应分析的类型有限,但数据库本身包含的数据可以在更广泛的分析研究中使用。在本文中,我们展示了 CSDB 的源数据文件和一个自包含的 SQL 转储文件,并举例说明了它们在聚糖相关研究中的可能应用。通过在 SQL 格式中使用 CSDB,用户可以访问根据特定结构、分类或其他参数定义的给定聚糖集中的链长分布或电荷分布(例如),而源文本转储文件可以导入到任何具有与 CSDB 不同的特定内部架构的专用数据库中。