Li Yuchen, Sackett Peter Wad, Nielsen Morten, Barra Carolina
Department of Health Technology, Technical University of Denmark, Kgs. Lyngby, Copenhagen 2800, Denmark.
Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, San Martin 1650, Argentina.
Bioinform Adv. 2023 Oct 16;3(1):vbad151. doi: 10.1093/bioadv/vbad151. eCollection 2023.
Allergy is a pathological immune reaction towards innocuous protein antigens. Although only a narrow fraction of plant or animal proteins induce allergy, atopic disorders affect millions of children and adults and cost healthcare systems worldwide billions. In silico predictors can aid in the development of more innocuous food sources. Previous allergenicity predictors used sequence similarity, common structural domains, and amino acid physicochemical features. However, these predictors rely strongly on sequence similarity to known allergens and fail to predict protein allergenicity accurately when similarity diminishes.
To overcome these limitations, we collected allergens from AllergenOnline, a curated database of IgE-inducing allergens, carefully removed allergen redundancy with a novel protein partitioning pipeline, and developed a new allergen prediction method, NetAllergen, which introduces MHC presentation propensity as a novel feature. NetAllergen outperformed both a sequence similarity-based BLAST baseline and the previous allergenicity predictor AlgPred 2 when similarity to known allergens was limited.
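The abstract does not describe how the partitioning pipeline works, so the following is only a generic sketch of the idea of redundancy-aware partitioning and not the authors' method. It assumes that similar sequences are grouped first and that whole groups are then assigned to data partitions, so that near-duplicates never sit on both sides of a train/test split. The k-mer Jaccard score is a simple stand-in for an alignment-based similarity measure, and the names `kmer_set`, `jaccard`, and `partition_by_similarity`, the threshold, and the toy sequences are all hypothetical.

```python
from itertools import combinations


def kmer_set(seq, k=3):
    """Return the set of overlapping k-mers in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two k-mer sets (assumed stand-in for alignment identity)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def partition_by_similarity(seqs, n_parts=2, threshold=0.5, k=3):
    """Group similar sequences, then assign whole groups to partitions.

    seqs maps a sequence name to its amino acid string.
    Returns a list of n_parts lists of sequence names.
    """
    kmers = {name: kmer_set(s, k) for name, s in seqs.items()}
    parent = {name: name for name in seqs}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Link every pair whose similarity reaches the threshold (single linkage).
    for a, b in combinations(seqs, 2):
        if jaccard(kmers[a], kmers[b]) >= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for name in seqs:
        clusters.setdefault(find(name), []).append(name)

    # Greedy balancing: largest cluster goes to the currently smallest partition.
    parts = [[] for _ in range(n_parts)]
    for members in sorted(clusters.values(), key=len, reverse=True):
        min(parts, key=len).extend(members)
    return parts


# Toy, made-up sequences: two near-duplicate pairs and one singleton.
toy = {
    "a1": "MKTLLVAGAVL",
    "a2": "MKTLLVAGAVI",
    "b1": "QQPFPQQPYPQ",
    "b2": "QQPFPQQPYPE",
    "c1": "GSWDETRNHCY",
}
toy_parts = partition_by_similarity(toy, n_parts=2)
```

With this kind of split, a predictor evaluated on one partition cannot score well simply by recognizing close homologs seen during training, which is the situation the comparison against a BLAST baseline is meant to probe.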
The web service NetAllergen and the datasets are available at https://services.healthtech.dtu.dk/services/NetAllergen-1.0/.