Department of Environmental Sciences and Engineering, University of North Carolina, Chapel Hill, North Carolina 27599, USA.
Toxicol Sci. 2012 May;127(1):1-9. doi: 10.1093/toxsci/kfs095. Epub 2012 Mar 2.
Quantitative structure-activity relationship (QSAR) models are widely used for in silico prediction of in vivo toxicity of drug candidates or environmental chemicals, adding value to candidate selection in drug development or in a search for less hazardous and more sustainable alternatives for chemicals in commerce. The development of traditional QSAR models is enabled by numerical descriptors representing the inherent chemical properties that can be easily defined for any number of molecules; however, traditional QSAR models often have limited predictive power due to the lack of data and complexity of in vivo endpoints. Although it has been indeed difficult to obtain experimentally derived toxicity data on a large number of chemicals in the past, the results of quantitative in vitro screening of thousands of environmental chemicals in hundreds of experimental systems are now available and continue to accumulate. In addition, publicly accessible toxicogenomics data collected on hundreds of chemicals provide another dimension of molecular information that is potentially useful for predictive toxicity modeling. These new characteristics of molecular bioactivity arising from short-term biological assays, i.e., in vitro screening and/or in vivo toxicogenomics data can now be exploited in combination with chemical structural information to generate hybrid QSAR-like quantitative models to predict human toxicity and carcinogenicity. Using several case studies, we illustrate the benefits of a hybrid modeling approach, namely improvements in the accuracy of models, enhanced interpretation of the most predictive features, and expanded applicability domain for wider chemical space coverage.
定量构效关系(QSAR)模型广泛用于药物候选物或环境化学物质体内毒性的计算预测,为药物开发中的候选物选择或寻找商业用化学品更具危害性和更可持续的替代品增加了价值。传统 QSAR 模型的发展得益于代表固有化学性质的数值描述符,这些描述符可以很容易地为任意数量的分子定义;然而,由于缺乏数据和体内终点的复杂性,传统的 QSAR 模型通常具有有限的预测能力。尽管过去确实难以获得大量化学物质的实验衍生毒性数据,但现在已经可以获得数百个实验系统中数千种环境化学物质的定量体外筛选结果,并且这些结果还在不断积累。此外,数百种化学物质的公开可获取毒理学基因组学数据提供了分子信息的另一个维度,对于预测毒性建模可能是有用的。这些源自短期生物测定(即体外筛选和/或体内毒理学基因组学数据)的分子生物活性的新特征现在可以与化学结构信息结合使用,以生成混合 QSAR 样定量模型来预测人类毒性和致癌性。我们通过几个案例研究说明了混合建模方法的好处,即提高了模型的准确性、增强了对最具预测性特征的解释以及扩大了更广泛化学空间覆盖范围的适用性域。