Sealy Center for Structural Biology and Molecular Biophysics, Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX 77555-0304, United States.
Southern Regional Research Center, U.S. Department of Agriculture, Agriculture Research Service, 1100 Robert E. Lee Boulevard, New Orleans, LA 70124, United States.
Mol Immunol. 2018 Jul;99:1-8. doi: 10.1016/j.molimm.2018.03.022. Epub 2018 Apr 6.
Quantitative guidelines to distinguish allergenic proteins from related, but non-allergenic ones are urgently needed for regulatory agencies, biotech companies and physicians. In a previous study, we found that allergenic proteins populate a relatively small number of protein families, as characterized by the Pfam database. However, these families also contain non-allergenic proteins, meaning that allergenic determinants must lie within more discrete regions of the sequence. Thus, new methods are needed to discriminate allergenic proteins within those families. Physical-Chemical Properties (PCP)-motifs specific for allergens within a Pfam class were determined for 17 highly populated protein domains. A novel scoring method based on PCP-motifs that characterize known allergenic proteins within these families was developed, and validated for those domains. The motif scores distinguished sequences of allergens from a large selection of 80,000 randomly selected non-allergenic sequences. The motif scores for the birch pollen allergen (Bet v 1) family, which also contains related fruit and nut allergens, correlated better than global sequence similarities with clinically observed cross-reactivities among those allergens. Further, we demonstrated that the average scores of allergen specific motifs for allergenic profilins are significantly different from the scores of non-allergenic profilins. Several of the selective motifs coincide with experimentally determined IgE epitopes of allergenic profilins. The motifs also discriminated allergenic pectate lyases, including Jun a 1 from mountain cedar pollen, from similar proteins in the human microbiome, which can be assumed to be non-allergens. The latter lacked key motifs characteristic of the known allergens, some of which correlate with known IgE binding sites.
定量指南对于监管机构、生物技术公司和医生来说,都是急需将过敏原蛋白与相关但非过敏原蛋白区分开来的。在之前的一项研究中,我们发现过敏原蛋白存在于相对较少的蛋白质家族中,这些家族由 Pfam 数据库来进行特征描述。然而,这些家族中也包含非过敏原蛋白,这意味着过敏原决定簇必须位于序列中更离散的区域内。因此,需要新的方法来区分这些家族中的过敏原蛋白。针对 Pfam 类中的过敏原,我们确定了 17 个高度流行的蛋白质结构域的物理化学特性(PCP)-基序。开发了一种基于 PCP 基序的新评分方法,用于对这些家族中的已知过敏原蛋白进行特征描述,并对这些结构域进行了验证。基序评分能够将过敏原序列与 80,000 个随机选择的非过敏原序列的大型选择区分开来。对于桦树花粉过敏原(Bet v 1)家族,该家族还包含相关的水果和坚果过敏原,其基序评分与临床观察到的这些过敏原之间的交叉反应性相关性优于全局序列相似性。此外,我们还证明了过敏原特异性丝状蛋白的基序评分与非过敏原丝状蛋白的评分显著不同。几个选择性基序与过敏原丝状蛋白的实验确定的 IgE 表位相吻合。这些基序还区分了过敏原果胶裂解酶,包括来自山雪松花粉的 Jun a 1,与人类微生物组中的类似蛋白质,这些蛋白质可以被假定为非过敏原。后者缺乏已知过敏原的特征性关键基序,其中一些与已知的 IgE 结合位点相关。