Gawriljuk Victor O, Foil Daniel H, Puhl Ana C, Zorn Kimberley M, Lane Thomas R, Riabova Olga, Makarov Vadim, Godoy Andre S, Oliva Glaucius, Ekins Sean
São Carlos Institute of Physics, University of São Paulo, Av. João Dagnone, 1100 - Santa Angelina, São Carlos, São Paulo 13563-120, Brazil.
Collaborations Pharmaceuticals, Inc., 840 Main Campus Drive, Lab 3510, Raleigh, North Carolina 27606, United States.
J Chem Inf Model. 2021 Aug 23;61(8):3804-3813. doi: 10.1021/acs.jcim.1c00460. Epub 2021 Jul 21.
Yellow fever (YF) is an acute viral hemorrhagic disease transmitted by infected mosquitoes. Large epidemics of YF occur when the virus is introduced into heavily populated areas with high mosquito density and low vaccination coverage. The lack of a specific small molecule drug treatment against YF as well as for homologous infections, such as zika and dengue, highlights the importance of these flaviviruses as a public health concern. With the advancement in computer hardware and bioactivity data availability, new tools based on machine learning methods have been introduced into drug discovery, as a means to utilize the growing high throughput screening (HTS) data generated to reduce costs and increase the speed of drug development. The use of predictive machine learning models using previously published data from HTS campaigns or data available in public databases, can enable the selection of compounds with desirable bioactivity and absorption, distribution, metabolism, and excretion profiles. In this study, we have collated cell-based assay data for yellow fever virus from the literature and public databases. The data were used to build predictive models with several machine learning methods that could prioritize compounds for in vitro testing. Five molecules were prioritized and tested in vitro from which we have identified a new pyrazolesulfonamide derivative with EC 3.2 μM and CC 24 μM, which represents a new scaffold suitable for hit-to-lead optimization that can expand the available drug discovery candidates for YF.
黄热病(YF)是一种由受感染蚊子传播的急性病毒性出血热疾病。当病毒传入蚊虫密度高且疫苗接种覆盖率低的人口密集地区时,就会发生大规模黄热病疫情。缺乏针对黄热病以及寨卡病毒和登革热等同源感染的特异性小分子药物治疗方法,凸显了这些黄病毒作为公共卫生问题的重要性。随着计算机硬件的进步和生物活性数据的可得性,基于机器学习方法的新工具已被引入药物研发领域,作为一种利用不断增长的高通量筛选(HTS)数据来降低成本并提高药物开发速度的手段。利用之前高通量筛选活动中发表的数据或公共数据库中可用的数据构建预测性机器学习模型,可以筛选出具有理想生物活性以及吸收、分布、代谢和排泄特征的化合物。在本研究中,我们从文献和公共数据库中整理了黄热病病毒基于细胞检测的数据。这些数据被用于通过多种机器学习方法构建预测模型,从而对用于体外测试的化合物进行优先级排序。我们对五个分子进行了优先级排序并进行了体外测试,从中鉴定出一种新的吡唑磺酰胺衍生物,其半数有效浓度(EC)为3.2 μM,半数细胞毒性浓度(CC)为24 μM,这代表了一种适合从活性分子到先导化合物优化的新骨架,可扩大黄热病可用的药物研发候选物范围。