Lebeda F J, Olson M A
Department of Cell Biology and Biochemistry, U.S. Army Medical Research Institute of Infectious Diseases, Frederick, Maryland 21702-5011, USA.
J Protein Chem. 1997 Aug;16(6):607-18. doi: 10.1023/a:1026370807586.
A novel computational approach was examined for predicting epitopes from primary structures of the seven immunologically distinct botulinum neurotoxins (BoNT/A-G) and tetanus toxin (TeTX). An artificial neural network [Rost and Sander (1994), Proteins 20, 216] was used to estimate residue solvent accessibilities in multiple aligned sequences. A similar network trained to predict secondary structures was also used to examine this protein family, whose tertiary fold is presently unknown. The algorithm was validated by showing that it was 80% accurate in determining the secondary structure of avian egg-white lysozyme and that it correctly identified highly solvent-exposed residues that correspond to the major contact regions of lysozyme-antibody cocrystals. When sequences of the heavy (H) chains of TeTX and BoNT/A-G were analyzed, this algorithm predicted that the most highly exposed regions were clustered at the sequentially nonconserved N- and C-termini [Lebeda and Olson (1994), Proteins 20, 293]. The secondary structures and the remaining highly solvent-accessible regions were, in contrast, predicted to be conserved. In experiments reported by others, H-chain fragments that induced immunological protection against BoNT/A overlap with these predicted most highly exposed regions. It is also known that the C-terminal halves of the TeTX and BoNT/A H-chains interfere with holotoxin binding to ectoacceptors on nerve endings. Thus, the present results provide a theoretical framework for predicting the sites that could assist in the development of genetically engineered vaccines and that could interact with neurally located toxin ectoacceptors. Finally, because the most highly solvent-exposed regions were not well conserved, it is hypothesized that nonconserved, potential contact sites partially account for the existence of different dominant binding regions for type-specific neutralizing antibodies.
研究了一种新的计算方法,用于从七种免疫上不同的肉毒杆菌神经毒素(BoNT/A - G)和破伤风毒素(TeTX)的一级结构预测表位。使用人工神经网络[罗斯特和桑德(1994年),《蛋白质》20卷,216页]来估计多序列比对中残基的溶剂可及性。一个经过训练用于预测二级结构的类似网络也被用来研究这个三级结构目前未知的蛋白质家族。该算法通过以下方式得到验证:在确定鸡卵清溶菌酶的二级结构方面准确率达80%,并且能正确识别与溶菌酶 - 抗体共晶体主要接触区域相对应的高度溶剂暴露残基。当分析TeTX和BoNT/A - G重链(H链)的序列时,该算法预测,最高度暴露的区域聚集在序列上不保守的N端和C端[莱贝达和奥尔森(1994年),《蛋白质》20卷,293页]。相比之下,二级结构和其余高度溶剂可及区域预计是保守的。在其他人报告的实验中,诱导针对BoNT/A产生免疫保护作用的H链片段与这些预测的最高度暴露区域重叠。还已知TeTX和BoNT/A H链的C端一半会干扰全毒素与神经末梢上外受体的结合。因此,目前的结果为预测有助于开发基因工程疫苗以及可能与神经定位的毒素外受体相互作用的位点提供了理论框架。最后,由于最高度溶剂暴露区域保守性不佳,推测非保守的潜在接触位点部分解释了针对型特异性中和抗体不同优势结合区域的存在。