Center for Computational and Integrative Biology, Rutgers University, Camden, NJ, USA.
Department of Chemistry, Rutgers University, Camden, NJ, USA.
Methods Mol Biol. 2022;2474:169-187. doi: 10.1007/978-1-0716-2213-1_16.
Advances in high-throughput screening (HTS) revolutionized the environmental and health sciences data landscape. However, new compounds still need to be experimentally synthesized and tested to obtain HTS data, which will still be costly and time-consuming when a large set of new compounds need to be studied against many tests. Quantitative structure-activity relationship (QSAR) modeling is a standard method to fill data gaps for new compounds. The major challenge for many toxicologists, especially those with limited computational backgrounds, is efficiently developing optimized QSAR models for each assay with missing data for certain test compounds. This chapter aims to introduce a freely available and user-friendly QSAR modeling workflow, which trains and optimizes models using five algorithms without the need for a programming background.
高通量筛选 (HTS) 的进展彻底改变了环境和健康科学数据领域。然而,为了获得 HTS 数据,仍然需要对新化合物进行实验合成和测试,当需要对大量新化合物进行许多测试时,这仍然是昂贵和耗时的。定量构效关系 (QSAR) 建模是填补新化合物数据空白的标准方法。对于许多毒理学家来说,尤其是那些计算背景有限的毒理学家来说,主要的挑战是有效地为每个具有某些测试化合物缺失数据的测定法开发经过优化的 QSAR 模型。本章旨在介绍一个免费且易于使用的 QSAR 建模工作流程,该流程使用五种算法进行训练和优化模型,而无需编程背景。