Silvanovich Andre, Nemeth Margaret A, Song Ping, Herman Rod, Tagliani Laura, Bannon Gary A
Monsanto Company, Product Characterization Center, Global Regulatory Affairs, St. Louis, Missouri 63167, USA.
Toxicol Sci. 2006 Mar;90(1):252-8. doi: 10.1093/toxsci/kfj068. Epub 2005 Dec 7.
Typically, genetically engineered crops contain traits encoded by one or a few newly expressed proteins. The allergenicity assessment of newly expressed proteins is an important component in the safety evaluation of genetically engineered plants. One aspect of this assessment involves sequence searches that compare the amino acid sequence of the protein to all known allergens. Analyses are performed to determine the potential for immunologically based cross-reactivity where IgE directed against a known allergen could bind to the protein and elicit a clinical reaction in sensitized individuals. Bioinformatic searches are designed to detect global sequence similarity and short contiguous amino acid sequence identity. It has been suggested that potential allergen cross-reactivity may be predicted by identifying matches as short as six to eight contiguous amino acids between the protein of interest and a known allergen. A series of analyses were performed, and match probabilities were calculated for different size peptides to determine if there was a scientifically justified search window size that identified allergen sequence characteristics. Four probability modeling methods were tested: (1) a mock protein and a mock allergen database, (2) a mock protein and genuine allergen database, (3) a genuine allergen and genuine protein database, and (4) a genuine allergen and genuine protein database combined with a correction for repeating peptides. These analyses indicated that searches for short amino acid sequence matches of eight amino acids or fewer to identify proteins as potential cross-reactive allergens is a product of chance and adds little value to allergy assessments for newly expressed proteins.
通常,转基因作物含有由一种或几种新表达的蛋白质编码的性状。新表达蛋白质的致敏性评估是转基因植物安全性评估的重要组成部分。该评估的一个方面涉及序列搜索,即将蛋白质的氨基酸序列与所有已知过敏原进行比较。进行分析以确定基于免疫的交叉反应可能性,即针对已知过敏原的IgE可能与该蛋白质结合并在致敏个体中引发临床反应。生物信息学搜索旨在检测整体序列相似性和短相邻氨基酸序列同一性。有人提出,通过识别目标蛋白质与已知过敏原之间短至六到八个相邻氨基酸的匹配,可以预测潜在的过敏原交叉反应。进行了一系列分析,并计算了不同大小肽段的匹配概率,以确定是否存在一个科学合理的搜索窗口大小来识别过敏原序列特征。测试了四种概率建模方法:(1)模拟蛋白质和模拟过敏原数据库,(2)模拟蛋白质和真实过敏原数据库,(3)真实过敏原和真实蛋白质数据库,以及(4)真实过敏原和真实蛋白质数据库结合对重复肽段的校正。这些分析表明,搜索八个或更少氨基酸的短氨基酸序列匹配以将蛋白质鉴定为潜在的交叉反应性过敏原是偶然的结果,对新表达蛋白质的过敏评估几乎没有价值。