Klein Jennifer J, Baker Nancy C, Foil Daniel H, Zorn Kimberley M, Urbina Fabio, Puhl Ana C, Ekins Sean
Collaborations Pharmaceuticals, Inc., 840 Main Campus Drive, Lab 3510, Raleigh, North Carolina 27606, United States.
ParlezChem, 123 W Union Street, Hillsborough, North Carolina 27278, United States.
ACS Omega. 2021 Jan 20;6(4):3186-3193. doi: 10.1021/acsomega.0c05591. eCollection 2021 Feb 2.
Rare diseases impact hundreds of millions of individuals worldwide. However, few therapies exist to treat the rare disease population because financial resources are limited, the number of patients affected is low, bioactivity data is often nonexistent, and very few animal models exist to support preclinical development efforts. Sialidosis is an ultrarare lysosomal storage disorder in which mutations in the NEU1 gene result in the deficiency of the lysosomal enzyme sialidase-1. This enzyme catalyzes the removal of sialic acid moieties from glycoproteins and glycolipids. Therefore, the defective or deficient protein leads to the buildup of sialylated glycoproteins as well as several characteristic symptoms of sialidosis including visual impairment, ataxia, hepatomegaly, dysostosis multiplex, and developmental delay. In this study, we used a bibliometric tool to generate links between lysosomal storage disease (LSD) targets and existing bioactivity data that could be curated in order to build machine learning models and screen compounds . We focused on sialidase as an example, and we used the data curated from the literature to build a Bayesian model which was then used to score compound libraries and rank these molecules for testing. Two compounds were identified from testing using microscale thermophoresis, namely sulfameter ( 2.15 ± 1.02 μM) and mexenone ( 8.88 ± 4.02 μM), which validated our approach to identifying new molecules binding to this protein, which could represent possible drug candidates that can be evaluated further as potential chaperones for this ultrarare lysosomal disease for which there is currently no treatment. Combining bibliometric and machine learning approaches has the ability to assist in curating small molecule data and model building, respectively, for rare disease drug discovery. This approach also has the capability to identify new compounds that are potential drug candidates.
罕见病影响着全球数亿人。然而,用于治疗罕见病患者的疗法却很少,这是因为财政资源有限、受影响患者数量少、生物活性数据往往不存在,而且几乎没有动物模型来支持临床前开发工作。唾液酸沉积症是一种极为罕见的溶酶体贮积症,其中NEU1基因突变导致溶酶体酶唾液酸酶-1缺乏。这种酶催化从糖蛋白和糖脂中去除唾液酸部分。因此,有缺陷或缺乏的蛋白质会导致唾液酸化糖蛋白的积累以及唾液酸沉积症的几种特征性症状,包括视力障碍、共济失调、肝肿大、多发性骨发育异常和发育迟缓。在本研究中,我们使用文献计量工具在溶酶体贮积症(LSD)靶点与现有的可整理生物活性数据之间建立联系,以便构建机器学习模型并筛选化合物。我们以唾液酸酶为例,利用从文献中整理的数据构建了一个贝叶斯模型,然后用该模型对化合物库进行评分并对这些分子进行排名以进行测试。通过微量热泳法测试鉴定出了两种化合物,即磺胺米隆(2.15±1.02μM)和美昔酮(8.88±4.02μM),这验证了我们识别与该蛋白结合的新分子的方法,这些新分子可能代表潜在的药物候选物,可作为这种极为罕见的溶酶体疾病的潜在伴侣分子进一步评估,目前该疾病尚无治疗方法。结合文献计量和机器学习方法能够分别协助整理小分子数据和进行模型构建,以用于罕见病药物研发。这种方法还能够识别出潜在的药物候选新化合物。