通过计算机模拟可变长度肽段提取和机器学习，变应原蛋白的计算检测达到了新的准确性水平。

Computational detection of allergenic proteins attains a new level of accuracy with in silico variable-length peptide extraction and machine learning.

作者信息

Soeria-Atmadja D, Lundell T, Gustafsson M G, Hammerling U

机构信息

Division of Toxicology, National Food Administration, Uppsala, Sweden

出版信息

Nucleic Acids Res. 2006;34(13):3779-93. doi: 10.1093/nar/gkl467. Epub 2006 Aug 23.

DOI:10.1093/nar/gkl467

PMID:16977698

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1540723/

Abstract

The placing of novel or new-in-the-context proteins on the market, appearing in genetically modified foods, certain bio-pharmaceuticals and some household products leads to human exposure to proteins that may elicit allergic responses. Accurate methods to detect allergens are therefore necessary to ensure consumer/patient safety. We demonstrate that it is possible to reach a new level of accuracy in computational detection of allergenic proteins by presenting a novel detector, Detection based on Filtered Length-adjusted Allergen Peptides (DFLAP). The DFLAP algorithm extracts variable length allergen sequence fragments and employs modern machine learning techniques in the form of a support vector machine. In particular, this new detector shows hitherto unmatched specificity when challenged to the Swiss-Prot repository without appreciable loss of sensitivity. DFLAP is also the first reported detector that successfully discriminates between allergens and non-allergens occurring in protein families known to hold both categories. Allergenicity assessment for specific protein sequences of interest using DFLAP is possible via ulfh@slv.se.

摘要

新型蛋白质或在特定背景下出现的新蛋白质投放市场，这些蛋白质存在于转基因食品、某些生物制药和一些家用产品中，会导致人类接触可能引发过敏反应的蛋白质。因此，需要准确的方法来检测过敏原，以确保消费者/患者的安全。我们展示了通过提出一种新型检测器——基于过滤长度调整过敏原肽的检测法（DFLAP），在计算检测过敏原蛋白方面可以达到新的准确性水平。DFLAP算法提取可变长度的过敏原序列片段，并采用支持向量机形式的现代机器学习技术。特别是，当在瑞士蛋白质数据库中进行测试时，这种新型检测器显示出迄今为止无与伦比的特异性，且灵敏度没有明显损失。DFLAP也是第一个成功区分已知同时包含过敏原和非过敏原的蛋白质家族中过敏原和非过敏原的报道检测器。通过ulfh@slv.se可以使用DFLAP对特定感兴趣的蛋白质序列进行致敏性评估。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bfe/1540723/486140e8495f/gkl467f1.jpg

相似文献

Computational detection of allergenic proteins attains a new level of accuracy with in silico variable-length peptide extraction and machine learning.通过计算机模拟可变长度肽段提取和机器学习，变应原蛋白的计算检测达到了新的准确性水平。

Nucleic Acids Res. 2006;34(13):3779-93. doi: 10.1093/nar/gkl467. Epub 2006 Aug 23.

Supervised identification of allergen-representative peptides for in silico detection of potentially allergenic proteins.用于计算机检测潜在过敏原性蛋白质的过敏原代表性肽段的监督识别。

Bioinformatics. 2005 Jan 1;21(1):39-50. doi: 10.1093/bioinformatics/bth477. Epub 2004 Aug 19.

EVALLER: a web server for in silico assessment of potential protein allergenicity.EVALLER：用于潜在蛋白质致敏性计算机模拟评估的网络服务器。

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W694-700. doi: 10.1093/nar/gkm370. Epub 2007 May 30.

In silico prediction of allergenic proteins.致敏蛋白的计算机模拟预测

Methods Mol Biol. 2014;1184:375-88. doi: 10.1007/978-1-4939-1115-8_21.

AllerTOP--a server for in silico prediction of allergens.AllerTOP——一种用于过敏原体内预测的服务器。

BMC Bioinformatics. 2013;14 Suppl 6(Suppl 6):S4. doi: 10.1186/1471-2105-14-S6-S4. Epub 2013 Apr 17.

SORTALLER: predicting allergens using substantially optimized algorithm on allergen family featured peptides.SORTALLER：使用基于过敏原家族特征肽的优化算法预测过敏原。

Bioinformatics. 2012 Aug 15;28(16):2178-9. doi: 10.1093/bioinformatics/bts326. Epub 2012 Jun 12.

AllerTool: a web server for predicting allergenicity and allergic cross-reactivity in proteins.AllerTool：一个用于预测蛋白质致敏性和过敏交叉反应性的网络服务器。

Bioinformatics. 2007 Feb 15;23(4):504-6. doi: 10.1093/bioinformatics/btl621. Epub 2006 Dec 6.

Identification of continuous, allergenic regions of the major shrimp allergen Pen a 1 (tropomyosin).主要虾过敏原Pen a 1（原肌球蛋白）连续致敏区域的鉴定。

Int Arch Allergy Immunol. 2002 Jan;127(1):27-37. doi: 10.1159/000048166.

WebAllergen: a web server for predicting allergenic proteins.网络变应原：一个用于预测变应原性蛋白质的网络服务器。

Bioinformatics. 2005 May 15;21(10):2570-1. doi: 10.1093/bioinformatics/bti356. Epub 2005 Mar 3.

AlgPred: prediction of allergenic proteins and mapping of IgE epitopes.AlgPred：变应原蛋白预测及IgE表位图谱绘制

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W202-9. doi: 10.1093/nar/gkl343.

引用本文的文献

Multimodal deep learning for allergenic proteins prediction.用于变应原蛋白预测的多模态深度学习

BMC Biol. 2025 Jul 31;23(1):232. doi: 10.1186/s12915-025-02347-z.

Computational prediction of allergenic proteins based on multi-feature fusion.基于多特征融合的变应原蛋白的计算预测

Front Genet. 2023 Oct 19;14:1294159. doi: 10.3389/fgene.2023.1294159. eCollection 2023.

Sequence-Based Prediction of Plant Allergenic Proteins: Machine Learning Classification Approach.基于序列的植物变应原蛋白预测：机器学习分类方法

ACS Omega. 2023 Jan 20;8(4):3698-3704. doi: 10.1021/acsomega.2c02842. eCollection 2023 Jan 31.

A Comparative Analysis of Novel Deep Learning and Ensemble Learning Models to Predict the Allergenicity of Food Proteins.用于预测食物蛋白变应原性的新型深度学习与集成学习模型的比较分析

Foods. 2021 Apr 9;10(4):809. doi: 10.3390/foods10040809.

Origin and Functional Prediction of Pollen Allergens in Plants.植物花粉过敏原的起源与功能预测

Plant Physiol. 2016 Sep;172(1):341-57. doi: 10.1104/pp.16.00625. Epub 2016 Jul 19.

PREAL: prediction of allergenic protein by maximum Relevance Minimum Redundancy (mRMR) feature selection.PREAL：通过最大相关最小冗余（mRMR）特征选择预测变应原蛋白。

BMC Syst Biol. 2013;7 Suppl 5(Suppl 5):S9. doi: 10.1186/1752-0509-7-S5-S9. Epub 2013 Dec 9.

Allerdictor: fast allergen prediction using text classification techniques.Allerdictor：使用文本分类技术的快速过敏原预测

Bioinformatics. 2014 Apr 15;30(8):1120-1128. doi: 10.1093/bioinformatics/btu004. Epub 2014 Jan 7.

Evaluation and integration of existing methods for computational prediction of allergens.评价和整合现有用于过敏原计算预测的方法。

BMC Bioinformatics. 2013;14 Suppl 4(Suppl 4):S1. doi: 10.1186/1471-2105-14-S4-S1. Epub 2013 Mar 8.

Immunoinformatics: an integrated scenario.免疫信息学：一个综合的场景。

Immunology. 2010 Oct;131(2):153-68. doi: 10.1111/j.1365-2567.2010.03330.x. Epub 2010 Aug 16.

EVALLER: a web server for in silico assessment of potential protein allergenicity.EVALLER：用于潜在蛋白质致敏性计算机模拟评估的网络服务器。

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W694-700. doi: 10.1093/nar/gkm370. Epub 2007 May 30.

本文引用的文献

[Development of Allergen Database for Food Safety (ADFS): an integrated database to search allergens and predict allergenicity].[食品安全过敏原数据库（ADFS）的开发：一个用于搜索过敏原和预测致敏性的综合数据库]

Kokuritsu Iyakuhin Shokuhin Eisei Kenkyusho Hokoku. 2005(123):32-6.

Pollen allergens are restricted to few protein families and show distinct patterns of species distribution.花粉过敏原局限于少数蛋白质家族，并呈现出独特的物种分布模式。

J Allergy Clin Immunol. 2006 Jan;117(1):141-7. doi: 10.1016/j.jaci.2005.09.010. Epub 2005 Nov 28.

Reduced allergenic potency of VR9-1, a mutant of the major shrimp allergen Pen a 1 (tropomyosin).VR9-1（主要虾过敏原Pen a 1（原肌球蛋白）的一种突变体）的变应原性降低

J Immunol. 2005 Dec 15;175(12):8354-64. doi: 10.4049/jimmunol.175.12.8354.

The value of short amino acid sequence matches for prediction of protein allergenicity.用于预测蛋白质致敏性的短氨基酸序列匹配的价值。

Toxicol Sci. 2006 Mar;90(1):252-8. doi: 10.1093/toxsci/kfj068. Epub 2005 Dec 7.

External cross-validation for unbiased evaluation of protein family detectors: application to allergens.用于蛋白质家族检测器无偏评估的外部交叉验证：在过敏原中的应用。

Proteins. 2005 Dec 1;61(4):918-25. doi: 10.1002/prot.20656.

Assessment of sequence homology and cross-reactivity.序列同源性和交叉反应性评估。

Toxicol Appl Pharmacol. 2005 Sep 1;207(2 Suppl):149-51. doi: 10.1016/j.taap.2005.01.021.

Assessing genetically modified crops to minimize the risk of increased food allergy: a review.评估转基因作物以将食物过敏风险增加降至最低：综述

Int Arch Allergy Immunol. 2005 Jun;137(2):153-66. doi: 10.1159/000086314. Epub 2005 Jun 8.

J Allergy Clin Immunol. 2005 Jan;115(1):163-70. doi: 10.1016/j.jaci.2004.10.026.

Structural, biological, and evolutionary relationships of plant food allergens sensitizing via the gastrointestinal tract.通过胃肠道致敏的植物性食物过敏原的结构、生物学及进化关系。

Crit Rev Food Sci Nutr. 2004;44(5):379-407. doi: 10.1080/10408690490489224.

Allermatch, a webtool for the prediction of potential allergenicity according to current FAO/WHO Codex alimentarius guidelines.Allermatch，一种根据当前粮农组织/世界卫生组织食品法典委员会指南预测潜在致敏性的网络工具。

BMC Bioinformatics. 2004 Sep 16;5:133. doi: 10.1186/1471-2105-5-133.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过计算机模拟可变长度肽段提取和机器学习，变应原蛋白的计算检测达到了新的准确性水平。

Computational detection of allergenic proteins attains a new level of accuracy with in silico variable-length peptide extraction and machine learning.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献