Song Ping, Herman Rod A, Kumpatla Siva
Dow AgroSciences, 9330 Zionsville Road, Indianapolis, IN 46268, United States.
Dow AgroSciences, 9330 Zionsville Road, Indianapolis, IN 46268, United States.
Food Chem Toxicol. 2014 Sep;71:142-8. doi: 10.1016/j.fct.2014.06.008. Epub 2014 Jun 19.
To address the high false positive rate using >35% identity over 80 amino acids in the regulatory assessment of transgenic proteins for potential allergenicity and the change of E-value with database size, the Needleman-Wunsch global sequence alignment and a one-to-one (1:1) local FASTA search (one protein in the target database at a time) using FASTA were evaluated by comparing proteins randomly selected from Arabidopsis, rice, corn, and soybean with known allergens in a peer-reviewed allergen database (http://www.allergenonline.org/). Compared with the approach of searching >35%/80aa+, the false positive rate measured by specificity rate for identification of true allergens was reduced by a 1:1 global sequence alignment with a cut-off threshold of ≧30% identity and a 1:1 FASTA local alignment with a cut-off E-value of ≦1.0E-09 while maintaining the same sensitivity. Hence, a 1:1 sequence comparison, especially using the FASTA local alignment tool with a biological relevant E-value of 1.0E-09 as a threshold, is recommended for the regulatory assessment of sequence identities between transgenic proteins in food crops and known allergens.
为了解决在转基因蛋白潜在致敏性的监管评估中使用超过80个氨基酸且一致性>35%时出现的高假阳性率问题,以及E值随数据库大小的变化,通过将从拟南芥、水稻、玉米和大豆中随机选择的蛋白质与同行评审的过敏原数据库(http://www.allergenonline.org/)中的已知过敏原进行比较,对Needleman-Wunsch全局序列比对和使用FASTA进行的一对一(1:1)局部FASTA搜索(一次在目标数据库中搜索一种蛋白质)进行了评估。与搜索>35%/80aa+的方法相比,在保持相同灵敏度的同时,通过使用一致性≧30%的截止阈值进行1:1全局序列比对和使用截止E值≦1.0E-09进行1:1 FASTA局部比对,以特异性率衡量的鉴定真正过敏原的假阳性率降低了。因此,建议采用1:1序列比较,特别是使用具有生物学相关E值1.0E-09作为阈值的FASTA局部比对工具,用于粮食作物中转基因蛋白与已知过敏原之间序列一致性的监管评估。