Kirkland David, Aardema Marilyn, Henderson Leigh, Müller Lutz
Covance Laboratories Limited, Otley Road, Harrogate HG3 1PY, UK.
Mutat Res. 2005 Jul 4;584(1-2):1-256. doi: 10.1016/j.mrgentox.2005.02.004.
The performance of a battery of three of the most commonly used in vitro genotoxicity tests--Ames+mouse lymphoma assay (MLA)+in vitro micronucleus (MN) or chromosomal aberrations (CA) test--has been evaluated for its ability to discriminate rodent carcinogens and non-carcinogens, from a large database of over 700 chemicals compiled from the CPDB ("Gold"), NTP, IARC and other publications. We re-evaluated many (113 MLA and 30 CA) previously published genotoxicity results in order to categorise the performance of these assays using the response categories we established. The sensitivity of the three-test battery was high. Of the 553 carcinogens for which there were valid genotoxicity data, 93% of the rodent carcinogens evaluated in at least one assay gave positive results in at least one of the three tests. Combinations of two and three test systems had greater sensitivity than individual tests resulting in sensitivities of around 90% or more, depending on test combination. Only 19 carcinogens (out of 206 tested in all three tests, considering CA and MN as alternatives) gave consistently negative results in a full three-test battery. Most were either carcinogenic via a non-genotoxic mechanism (liver enzyme inducers, peroxisome proliferators, hormonal carcinogens) considered not necessarily relevant for humans, or were extremely weak (presumed) genotoxic carcinogens (e.g. N-nitrosodiphenylamine). Two carcinogens (5-chloro-o-toluidine, 1,1,2,2-tetrachloroethane) may have a genotoxic element to their carcinogenicity and may have been expected to produce positive results somewhere in the battery. We identified 183 chemicals that were non-carcinogenic after testing in both male and female rats and mice. There were genotoxicity data on 177 of these. The specificity of the Ames test was reasonable (73.9%), but all mammalian cell tests had very low specificity (i.e. below 45%), and this declined to extremely low levels in combinations of two and three test systems. When all three tests were performed, 75-95% of non-carcinogens gave positive (i.e. false positive) results in at least one test in the battery. The extremely low specificity highlights the importance of understanding the mechanism by which genotoxicity may be induced (whether it is relevant for the whole animal or human) and using weight of evidence approaches to assess the carcinogenic risk from a positive genotoxicity signal. It also highlights deficiencies in the current prediction from and understanding of such in vitro results for the in vivo situation. It may even signal the need for either a reassessment of the conditions and criteria for positive results (cytotoxicity, solubility, etc.) or the development and use of a completely new set of in vitro tests (e.g. mutation in transgenic cell lines, systems with inherent metabolic activity avoiding the use of S9, measurement of genetic changes in more cancer-relevant genes or hotspots of genes, etc.). It was very difficult to assess the performance of the in vitro MN test, particularly in combination with other assays, because the published database for this assay is relatively small at this time. The specificity values for the in vitro MN assay may improve if data from a larger proportion of the known non-carcinogens becomes available, and a larger published database of results with the MN assay is urgently needed if this test is to be appreciated for regulatory use. However, specificity levels of <50% will still be unacceptable. Despite these issues, by adopting a relative predictivity (RP) measure (ratio of real:false results), it was possible to establish that positive results in all three tests indicate the chemical is greater than three times more likely to be a rodent carcinogen than a non-carcinogen. Likewise, negative results in all three tests indicate the chemical is greater than two times more likely to be a rodent non-carcinogen than a carcinogen. This RP measure is considered a useful tool for industry to assess the likelihood of a chemical possessing carcinogenic potential from batteries of positive or negative results.
我们从CPDB(“黄金标准”)、NTP、IARC及其他出版物汇编的700多种化学品的大型数据库中,评估了一组三种最常用的体外遗传毒性试验——艾姆斯试验+小鼠淋巴瘤试验(MLA)+体外微核(MN)或染色体畸变(CA)试验——区分啮齿类致癌物和非致癌物的能力。我们重新评估了许多(113项MLA和30项CA)先前发表的遗传毒性结果,以便使用我们建立的反应类别对这些试验的性能进行分类。三项试验组合的敏感性很高。在有有效遗传毒性数据的553种致癌物中,至少在一种试验中评估的啮齿类致癌物中有93%在三项试验中的至少一项中给出了阳性结果。两种和三种试验系统的组合比单个试验具有更高的敏感性,根据试验组合的不同,敏感性可达90%左右或更高。在三项试验(将CA和MN视为替代方法)中全部测试的206种致癌物中,只有19种在完整的三项试验组合中始终给出阴性结果。大多数要么是通过非遗传毒性机制致癌(肝酶诱导剂、过氧化物酶体增殖剂、激素致癌物),认为对人类不一定相关,要么是极其微弱的(假定的)遗传毒性致癌物(如N-亚硝基二苯胺)。两种致癌物(5-氯邻甲苯胺、1,1,2,2-四氯乙烷)的致癌性可能具有遗传毒性因素,可能预期在试验组合中的某处产生阳性结果。我们鉴定出183种在雄性和雌性大鼠及小鼠试验后为非致癌物的化学品。其中177种有遗传毒性数据。艾姆斯试验的特异性合理(73.9%),但所有哺乳动物细胞试验的特异性都非常低(即低于45%),在两种和三种试验系统的组合中更是降至极低水平。当进行所有三项试验时,75-95%的非致癌物在试验组合中的至少一项试验中给出阳性(即假阳性)结果。极低的特异性突出了理解遗传毒性可能被诱导的机制(无论其对整个动物或人类是否相关)以及使用证据权重方法评估阳性遗传毒性信号致癌风险的重要性。它还突出了当前对这种体外结果用于体内情况的预测和理解方面的不足。这甚至可能表明需要重新评估阳性结果的条件和标准(细胞毒性、溶解度等),或者开发和使用一套全新的体外试验(如转基因细胞系中的突变、具有固有代谢活性的系统以避免使用S9、测量更多与癌症相关基因或基因热点中的遗传变化等)。评估体外MN试验的性能非常困难,特别是与其他试验组合时,因为此时该试验的已发表数据库相对较小。如果能获得已知非致癌物中更大比例的数据,体外MN试验的特异性值可能会提高,如果要使该试验在监管中得到认可,迫切需要更大的MN试验结果已发表数据库。然而,低于50%的特异性水平仍然不可接受。尽管存在这些问题,但通过采用相对预测性(RP)度量(真实结果与错误结果的比率),可以确定在所有三项试验中得到阳性结果表明该化学品是啮齿类致癌物的可能性比非致癌物大3倍以上。同样,在所有三项试验中得到阴性结果表明该化学品是啮齿类非致癌物的可能性比致癌物大2倍以上。这种RP度量被认为是行业评估化学品从阳性或阴性结果组合中具有致癌潜力可能性的有用工具。