Department of Chemical Physiology, The Scripps Research Institute , 10550 North Torrey Pines Road, La Jolla, CA 92037, USA.
ACS Chem Biol. 2012 May 18;7(5):829-34. doi: 10.1021/cb300003z. Epub 2012 Mar 9.
Carbohydrate libraries printed in glycan micorarray format have had a great impact on the high-throughput analysis of the specificity of a wide range of mammalian, plant, and bacterial lectins. Chemical and chemo-enzymatic synthesis allows the construction of diverse glycan libraries but requires substantial effort and resources. To leverage the synthetic effort, the ideal library would be a minimal subset of all structures that provides optimal diversity. Therefore, a measure of library diversity is needed. To this end, we developed a linear representation of glycans using standard chemoinformatic tools. This representation was applied to measure pairwise similarity and consequently diversity of glycan libraries in a single value. The diversities of four existing sialoside glycan arrays were compared. More diverse arrays are proposed reducing the number of glycans. This algorithm can be applied to diverse aspects of library design from target structure selection to the choice of building blocks for their synthesis.
糖库以聚糖微阵列的形式被打印,这对高通量分析各种哺乳动物、植物和细菌凝集素的特异性产生了巨大的影响。化学和化学酶合成允许构建各种聚糖文库,但需要大量的努力和资源。为了利用合成的努力,理想的文库应该是所有结构的最小子集,提供最佳的多样性。因此,需要一种文库多样性的度量方法。为此,我们使用标准的化学信息学工具开发了聚糖的线性表示。该表示方法用于测量糖库的成对相似性,并因此用单个值表示糖库的多样性。比较了四个现有的唾液酸聚糖阵列的多样性。通过减少糖的数量,提出了更具多样性的阵列。该算法可应用于文库设计的各个方面,从目标结构的选择到合成用构建块的选择。