Richard A M
MD-68, Environmental Carcinogenesis Division, National Health and Environmental Effects Research Laboratory, US Environmental Protection Agency, Research Triangle Park, NC 27711, USA.
Mutat Res. 1998 May 25;400(1-2):493-507. doi: 10.1016/s0027-5107(98)00068-2.
There is a great deal of current interest in the use of commercial, automated programs for the prediction of mutagenicity and carcinogenicity based on chemical structure. However, the goal of accurate and reliable toxicity prediction for any chemical, based solely on structural information remains elusive. The toxicity prediction challenge is global in its objective, but limited in its solution, to within local domains of chemicals acting according to similar mechanisms of action in the biological system; to predict, we must be able to generalize based on chemical structure, but the biology fundamentally limits our ability to do so. Available commercial systems for mutagenicity and/or carcinogenicity prediction differ in their specifics, yet most fall in two major categories: (1) automated approaches that rely on the use of statistics for extracting correlations between structure and activity; and (2) knowledge-based expert systems that rely on a set of programmed rules distilled from available knowledge and human expert judgement. These two categories of approaches differ in the ways that they represent, process, and generalize chemical-biological activity information. An application of four commercial systems (TOPKAT, CASE/MULTI-CASE, DEREK, and OncoLogic) to mutagenicity and carcinogenicity prediction for a particular class of chemicals-the haloacetic acids (HAs)-is presented to highlight these differences. Some discussion is devoted to the issue of gauging the relative performance of commercial prediction systems, as well as to the role of prospective prediction exercises in this effort. And finally, an alternative approach that stops short of delivering a prediction to a user, involving structure-searching and data base exploration, is briefly considered.
当前,人们对使用基于化学结构的商业自动化程序来预测诱变性和致癌性有着浓厚的兴趣。然而,仅基于结构信息对任何化学品进行准确可靠的毒性预测这一目标仍然难以实现。毒性预测的挑战目标是全球性的,但在解决方案上却受到限制,仅限于在生物系统中按照相似作用机制起作用的化学品的局部领域内;为了进行预测,我们必须能够基于化学结构进行归纳,但生物学从根本上限制了我们这样做的能力。现有的用于诱变性和/或致癌性预测的商业系统在具体细节上有所不同,但大多数可分为两大类:(1)依靠统计学方法提取结构与活性之间相关性的自动化方法;(2)基于知识的专家系统,它依赖于从现有知识和人类专家判断中提炼出的一组编程规则。这两类方法在表示、处理和归纳化学生物活性信息的方式上有所不同。本文介绍了四种商业系统(TOPKAT、CASE/MULTI-CASE、DEREK和OncoLogic)在对一类特定化学品——卤乙酸(HAs)进行诱变性和致癌性预测方面的应用,以突出这些差异。还专门讨论了评估商业预测系统相对性能的问题,以及前瞻性预测练习在这项工作中的作用。最后,简要考虑了一种不直接向用户提供预测结果的替代方法,该方法涉及结构搜索和数据库探索。