NetAllergen, a random forest model integrating MHC-II presentation propensity for improved allergenicity prediction.

Authors

Li Yuchen, Sackett Peter Wad, Nielsen Morten, Barra Carolina

Affiliations

Department of Health Technology, Technical University of Denmark, Kgs. Lyngby, Copenhagen 2800, Denmark.

Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, San Martin 1650, Argentina.

Publication

Bioinform Adv. 2023 Oct 16;3(1):vbad151. doi: 10.1093/bioadv/vbad151. eCollection 2023.

DOI: 10.1093/bioadv/vbad151
PMID: 37901344
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10603389/
Abstract

MOTIVATION

Allergy is a pathological immune reaction towards innocuous protein antigens. Although only a narrow fraction of plant or animal proteins induce allergy, atopic disorders affect millions of children and adults and cost healthcare systems worldwide billions. In silico predictors can aid in the development of more innocuous food sources. Previous allergenicity predictors used sequence similarity, common structural domains, and amino acid physicochemical features. However, these predictors rely strongly on sequence similarity to known allergens and fail to predict protein allergenicity accurately when similarity diminishes.

RESULTS

To overcome these limitations, we collected allergens from AllergenOnline, a curated database of IgE-inducing allergens, carefully removed allergen redundancy with a novel protein partitioning pipeline, and developed a new allergen prediction method that introduces MHC presentation propensity as a novel feature. NetAllergen outperformed both a sequence similarity-based BLAST baseline and the previous allergenicity predictor AlgPred 2 when similarity to known allergens is limited.
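
The abstract does not spell out the partitioning pipeline, so the following is only a minimal sketch of the general idea behind redundancy-aware partitioning: similar sequences are grouped first, and whole groups are then assigned to partitions so that near-duplicates never end up on both sides of a train/test split. Everything here is an assumption made for illustration and not the authors' method: the 3-mer Jaccard score stands in for an alignment-based similarity, and the function names, the threshold, and the toy sequences are hypothetical.

```python
from itertools import combinations


def kmer_set(seq, k=3):
    """Return the set of overlapping k-mers in a protein sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two k-mer sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_by_similarity(seqs, threshold=0.3, k=3):
    """Single-linkage clustering: link any pair with similarity >= threshold."""
    parent = list(range(len(seqs)))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    kmers = [kmer_set(s, k) for s in seqs]
    for i, j in combinations(range(len(seqs)), 2):
        if jaccard(kmers[i], kmers[j]) >= threshold:
            parent[find(i)] = find(j)

    clusters = {}
    for i in range(len(seqs)):
        clusters.setdefault(find(i), []).append(i)
    return list(clusters.values())


def partition(seqs, n_parts=2, threshold=0.3, k=3):
    """Assign whole clusters to partitions, largest first, keeping sizes balanced."""
    parts = [[] for _ in range(n_parts)]
    clusters = cluster_by_similarity(seqs, threshold, k)
    for cluster in sorted(clusters, key=len, reverse=True):
        # The currently smallest partition receives the next cluster.
        min(parts, key=len).extend(cluster)
    return parts


# Hypothetical toy sequences: two near-duplicate pairs and one singleton.
toy = ["MKTAYIAKQR", "MKTAYIAKQK", "GSHMLEDPVV", "GSHMLEDPVA", "WWPPCCNNDD"]
parts = partition(toy, n_parts=2)
```

In this toy run each near-duplicate pair stays together in a single partition, which is the property that keeps a held-out evaluation from being inflated by sequences that closely resemble the training data.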

AVAILABILITY AND IMPLEMENTATION

The web service NetAllergen and the datasets are available at https://services.healthtech.dtu.dk/services/NetAllergen-1.0/.
