Kirkland David, Pfuhler Stefan, Tweats David, Aardema Marilyn, Corvi Raffaella, Darroudi Firouz, Elhajouji Azeddine, Glatt Hansruedi, Hastwell Paul, Hayashi Makoto, Kasper Peter, Kirchner Stephan, Lynch Anthony, Marzin Daniel, Maurici Daniela, Meunier Jean-Roc, Müller Lutz, Nohynek Gerhard, Parry James, Parry Elizabeth, Thybaud Veronique, Tice Ray, van Benthem Jan, Vanparys Philippe, White Paul
Covance Laboratories Ltd., Otley Road, Harrogate HG3 1PY, UK.
Mutat Res. 2007 Mar 30;628(1):31-55. doi: 10.1016/j.mrgentox.2006.11.008. Epub 2007 Jan 13.
Workshop participants agreed that genotoxicity tests in mammalian cells in vitro produce a remarkably high and unacceptable occurrence of irrelevant positive results (e.g. when compared with rodent carcinogenicity). As reported in several recent reviews, the rate of irrelevant positives (i.e. low specificity) for some studies using in vitro methods (when compared to this "gold standard") means that an increased number of test articles are subjected to additional in vivo genotoxicity testing, in many cases before, e.g. the efficacy (in the case of pharmaceuticals) of the compound has been evaluated. If in vitro tests were more predictive for in vivo genotoxicity and carcinogenicity (i.e. fewer false positives) then there would be a significant reduction in the number of animals used. Beyond animal (or human) carcinogenicity as the "gold standard", it is acknowledged that genotoxicity tests provide much information about cellular behaviour, cell division processes and cellular fate to a (geno)toxic insult. Since the disease impact of these effects is seldom known, and a verification of relevant toxicity is normally also the subject of (sub)chronic animal studies, the prediction of in vivo relevant results from in vitro genotoxicity tests is also important for aspects that may not have a direct impact on carcinogenesis as the ultimate endpoint of concern. In order to address the high rate of in vitro false positive results, a 2-day workshop was held at the European Centre for the Validation of Alternative Methods (ECVAM), Ispra, Italy in April 2006. More than 20 genotoxicity experts from academia, government and industry were invited to review data from the currently available cell systems, to discuss whether there exist cells and test systems that have a reduced tendency to false positive results, to review potential modifications to existing protocols and cell systems that might result in improved specificity, and to review the performance of some new test systems that show promise of improved specificity without sacrificing sensitivity. It was concluded that better guidance on the likely mechanisms resulting in positive results that are not biologically relevant for human health, and how to obtain evidence for those mechanisms, is needed both for practitioners and regulatory reviewers. Participants discussed the fact that cell lines commonly used for genotoxicity testing have a number of deficiencies that may contribute to the high false positive rate. These include, amongst others, lack of normal metabolism leading to reliance on exogenous metabolic activation systems (e.g. Aroclor-induced rat S9), impaired p53 function and altered DNA repair capability. The high concentrations of test chemicals (i.e. 10 mM or 5000 microg/ml, unless precluded by solubility or excessive toxicity) and the high levels of cytotoxicity currently required in mammalian cell genotoxicity tests were discussed as further potential sources of false positive results. Even if the goal is to detect carcinogens with short in vitro tests under more or less acute conditions, it does not seem logical to exceed the capabilities of cellular metabolic turnover, activation and defence processes. The concept of "promiscuous activation" was discussed. For numerous mutagens, the decisive in vivo enzymes are missing in vitro. However, if the substrate concentration is increased sufficiently, some other enzymes (that are unimportant in vivo) may take over the activation-leading to the same or a different active metabolite. Since we often do not use the right enzyme systems for positive controls in vitro, we have to rely on their promiscuous activation, i.e. to use excessive concentrations to get an empirical correlation between genotoxicity and carcinogenicity. A thorough review of published and industry data is urgently needed to determine whether the currently required limit concentration of 10mM or 5000 microg/ml, and high levels of cytotoxicity, are necessary for the detection of in vivo genotoxins and DNA-reactive, mutagenic carcinogens. In addition, various measures of cytotoxicity are currently allowable under OECD test guidelines, but there are few comparative data on whether different measures would result in different maximum concentrations for testing. A detailed comparison of cytotoxicity assessment strategies is needed. An assessment of whether test endpoints can be selected that are not intrinsically associated with cytotoxicity, and therefore are less susceptible to artefacts produced by cytotoxicity, should also be undertaken. There was agreement amongst the workshop participants that cell systems which are p53 and DNA-repair proficient, and have defined Phase 1 and Phase 2 metabolism, covering a broad set of enzyme forms, and used within the context of appropriately set limits of concentration and cytotoxicity, offer the best hope for reduced false positives. Whilst there is some evidence that human lymphocytes are less susceptible to false positives than the current rodent cell lines, other cell systems based on HepG2, TK6 and MCL-5 cells, as well as 3D skin models based on primary human keratinocytes also show some promise. Other human cell lines such as HepaRG, and human stem cells (the target for carcinogenicity) have not been used for genotoxicity investigations and should be considered for evaluation. Genetic engineering is also a valuable tool to incorporate missing enzyme systems into target cells. A collaborative research programme is needed to identify, further develop and evaluate new cell systems with appropriate sensitivity but improved specificity. In order to review current data for selection of appropriate top concentrations, measures and levels of cytotoxicity, metabolism, and to be able to improve existing or validate new assay systems, the participants called for the establishment of an expert group to identify the in vivo genotoxins and DNA-reactive, mutagenic carcinogens that we expect our in vitro genotoxicity assays to detect as well as the non-genotoxins and non-carcinogens we expect them not to detect.
研讨会参与者一致认为,体外哺乳动物细胞遗传毒性试验产生的不相关阳性结果发生率极高且令人无法接受(例如,与啮齿动物致癌性试验相比)。正如最近几篇综述所报道的,一些使用体外方法的研究(与这个“金标准”相比)的不相关阳性率(即低特异性)意味着越来越多的受试物要接受额外的体内遗传毒性试验,在许多情况下,例如在评估化合物的疗效(对于药物而言)之前就进行了此类试验。如果体外试验对体内遗传毒性和致癌性更具预测性(即假阳性更少),那么使用的动物数量将显著减少。除了将动物(或人类)致癌性作为“金标准”外,人们认识到遗传毒性试验能提供许多关于细胞行为、细胞分裂过程以及细胞对(基因)毒性损伤的命运的信息。由于这些效应的疾病影响很少为人所知,而且相关毒性的验证通常也是(亚)慢性动物研究的主题,因此从体外遗传毒性试验预测体内相关结果对于那些可能对作为最终关注终点的致癌作用没有直接影响的方面也很重要。为了解决体外假阳性结果率高的问题,2006年4月在意大利伊斯普拉的欧洲替代方法验证中心(ECVAM)举办了一次为期两天的研讨会。来自学术界、政府和工业界的20多位遗传毒性专家受邀审查现有细胞系统的数据,讨论是否存在假阳性结果倾向较低的细胞和试验系统,审查对现有方案和细胞系统可能的改进措施,这些改进可能会提高特异性,并审查一些新试验系统的性能,这些新系统有望在不牺牲敏感性的情况下提高特异性。得出的结论是,对于从业者和监管评审人员来说,都需要更好地指导如何解释导致对人类健康无生物学相关性的阳性结果的可能机制,以及如何获取这些机制的证据。参与者讨论了这样一个事实,即常用于遗传毒性试验的细胞系存在一些缺陷,这些缺陷可能导致高假阳性率。其中包括缺乏正常代谢,导致依赖外源性代谢激活系统(例如艾氏剂诱导的大鼠S9)、p53功能受损以及DNA修复能力改变。哺乳动物细胞遗传毒性试验中目前要求的受试化学物高浓度(即10 mM或5000 μg/ml,除非因溶解度或毒性过大而排除)和高细胞毒性水平被讨论为假阳性结果的进一步潜在来源。即使目标是在或多或少急性的条件下通过短期体外试验检测致癌物,但超过细胞代谢周转、激活和防御过程的能力似乎也不合理。讨论了“混杂激活”的概念。对于许多诱变剂来说,体外缺乏决定性的体内酶。然而,如果底物浓度充分增加,一些其他酶(在体内不重要)可能会接管激活作用,导致产生相同或不同的活性代谢物。由于我们在体外通常没有使用正确的酶系统作为阳性对照,我们不得不依赖它们的混杂激活,即使用过高浓度来获得遗传毒性和致癌性之间的经验性关联。迫切需要对已发表的数据和行业数据进行全面审查,以确定目前要求的10 mM或5000 μg/ml的极限浓度以及高细胞毒性水平对于检测体内基因毒素和DNA反应性诱变致癌物是否必要。此外,目前经合组织试验指南允许采用各种细胞毒性测量方法,但关于不同测量方法是否会导致不同的测试最大浓度,几乎没有比较数据。需要对细胞毒性评估策略进行详细比较。还应评估是否可以选择与细胞毒性没有内在关联、因此不易受细胞毒性产生的假象影响的试验终点。研讨会参与者一致认为,p53和DNA修复功能正常、具有明确的I相和II相代谢、涵盖广泛酶形式且在适当设定的浓度和细胞毒性限度范围内使用的细胞系统,最有希望减少假阳性。虽然有一些证据表明人类淋巴细胞比目前的啮齿动物细胞系更不易出现假阳性,但基于HepG2、TK6和MCL - 5细胞的其他细胞系统,以及基于原代人角质形成细胞构建的3D皮肤模型也显示出一些前景。其他人类细胞系,如HepaRG以及人类干细胞(致癌作用的靶细胞)尚未用于遗传毒性研究,应考虑进行评估。基因工程也是将缺失的酶系统引入靶细胞的一种有价值的工具。需要开展一项合作研究计划,以识别、进一步开发和评估具有适当敏感性但特异性更高的新细胞系统。为了审查当前用于选择合适的最高浓度、细胞毒性测量方法和水平、代谢等方面的数据,以便能够改进现有试验系统或验证新的试验系统,参与者呼吁成立一个专家组,以确定我们期望体外遗传毒性试验检测到的体内基因毒素和DNA反应性诱变致癌物,以及我们期望它们不检测到的非基因毒素和非致癌物。