• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于蛋白质序列特征和 BERT 嵌入的线性 B 细胞表位预测。

Prediction of linear B-cell epitopes based on protein sequence features and BERT embeddings.

机构信息

School of Humanistic Medicine, Anhui Medical University, Hefei, 230032, Anhui, China.

School of Biomedical Engineering, Anhui Medical University, Hefei, 230030, Anhui, China.

出版信息

Sci Rep. 2024 Jan 30;14(1):2464. doi: 10.1038/s41598-024-53028-w.

DOI:10.1038/s41598-024-53028-w
PMID:38291341
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10828400/
Abstract

Linear B-cell epitopes (BCEs) play a key role in the development of peptide vaccines and immunodiagnostic reagents. Therefore, the accurate identification of linear BCEs is of great importance in the prevention of infectious diseases and the diagnosis of related diseases. The experimental methods used to identify BCEs are both expensive and time-consuming and they do not meet the demand for identification of large-scale protein sequence data. As a result, there is a need to develop an efficient and accurate computational method to rapidly identify linear BCE sequences. In this work, we developed the new linear BCE prediction method LBCE-BERT. This method is based on peptide chain sequence information and natural language model BERT embedding information, using an XGBoost classifier. The models were trained on three benchmark datasets. The model was training on three benchmark datasets for hyperparameter selection and was subsequently evaluated on several test datasets. The result indicate that our proposed method outperforms others in terms of AUROC and accuracy. The LBCE-BERT model is publicly available at: https://github.com/Lfang111/LBCE-BERT .

摘要

线性 B 细胞表位(BCEs)在肽疫苗和免疫诊断试剂的开发中起着关键作用。因此,准确识别线性 BCE 对于传染病的预防和相关疾病的诊断具有重要意义。用于识别 BCE 的实验方法既昂贵又耗时,无法满足大规模蛋白质序列数据识别的需求。因此,需要开发一种高效、准确的计算方法来快速识别线性 BCE 序列。在这项工作中,我们开发了新的线性 BCE 预测方法 LBCE-BERT。该方法基于肽链序列信息和自然语言模型 BERT 嵌入信息,使用 XGBoost 分类器。模型在三个基准数据集上进行训练。通过超参数选择在三个基准数据集上进行模型训练,然后在几个测试数据集上进行评估。结果表明,我们提出的方法在 AUROC 和准确性方面优于其他方法。LBCE-BERT 模型可在:https://github.com/Lfang111/LBCE-BERT 上获得。

相似文献

1
Prediction of linear B-cell epitopes based on protein sequence features and BERT embeddings.基于蛋白质序列特征和 BERT 嵌入的线性 B 细胞表位预测。
Sci Rep. 2024 Jan 30;14(1):2464. doi: 10.1038/s41598-024-53028-w.
2
LBCE-XGB: A XGBoost Model for Predicting Linear B-Cell Epitopes Based on BERT Embeddings.LBCE-XGB:一种基于BERT嵌入的用于预测线性B细胞表位的XGBoost模型。
Interdiscip Sci. 2023 Jun;15(2):293-305. doi: 10.1007/s12539-023-00549-z. Epub 2023 Jan 16.
3
EpitopeVec: linear epitope prediction using deep protein sequence embeddings.EpitopeVec:使用深度蛋白质序列嵌入进行线性表位预测。
Bioinformatics. 2021 Dec 7;37(23):4517-4525. doi: 10.1093/bioinformatics/btab467.
4
iBCE-EL: A New Ensemble Learning Framework for Improved Linear B-Cell Epitope Prediction.iBCE-EL:一种用于改进线性 B 细胞表位预测的新集成学习框架。
Front Immunol. 2018 Jul 27;9:1695. doi: 10.3389/fimmu.2018.01695. eCollection 2018.
5
NetBCE: An Interpretable Deep Neural Network for Accurate Prediction of Linear B-cell Epitopes.NetBCE:一种用于线性 B 细胞表位准确预测的可解释深度神经网络。
Genomics Proteomics Bioinformatics. 2022 Oct;20(5):1002-1012. doi: 10.1016/j.gpb.2022.11.009. Epub 2022 Dec 13.
6
Inferring linear-B cell epitopes using 2-step metaheuristic variant-feature selection using genetic algorithm.使用两步元启发式变特征选择与遗传算法推断线性 B 细胞表位。
Sci Rep. 2023 Sep 5;13(1):14593. doi: 10.1038/s41598-023-41179-1.
7
A Structure-Based B-cell Epitope Prediction Model Through Combing Local and Global Features.基于结构的 B 细胞表位预测模型:结合局部和全局特征。
Front Immunol. 2022 Jul 1;13:890943. doi: 10.3389/fimmu.2022.890943. eCollection 2022.
8
BERT-Kcr: prediction of lysine crotonylation sites by a transfer learning method with pre-trained BERT models.BERT-Kcr:基于预训练BERT模型的迁移学习方法预测赖氨酸巴豆酰化位点
Bioinformatics. 2022 Jan 12;38(3):648-654. doi: 10.1093/bioinformatics/btab712.
9
BERT-Kgly: A Bidirectional Encoder Representations From Transformers (BERT)-Based Model for Predicting Lysine Glycation Site for .BERT-Kgly:一种基于双向编码器表征变换器(BERT)的赖氨酸糖基化位点预测模型
Front Bioinform. 2022 Feb 18;2:834153. doi: 10.3389/fbinf.2022.834153. eCollection 2022.
10
Multi-perspectives and challenges in identifying B-cell epitopes.鉴定 B 细胞表位的多视角和挑战。
Protein Sci. 2023 Nov;32(11):e4785. doi: 10.1002/pro.4785.

引用本文的文献

1
annATAC: automatic cell type annotation for scATAC-seq data based on language model.annATAC:基于语言模型的单细胞染色质可及性测序数据自动细胞类型注释
BMC Biol. 2025 May 28;23(1):145. doi: 10.1186/s12915-025-02244-5.
2
Serum antibodies against mimotopes of Merkel cell polyomavirus oncoproteins detected by a novel immunoassay in healthy individuals and Merkel cell carcinoma patients.通过新型免疫测定法在健康个体和 Merkel 细胞癌患者中检测到针对 Merkel 细胞多瘤病毒癌蛋白模拟表位的血清抗体。
Microb Biotechnol. 2024 Oct;17(10):e14536. doi: 10.1111/1751-7915.14536.
3
Integrating machine learning to advance epitope mapping.

本文引用的文献

1
BERT-Kgly: A Bidirectional Encoder Representations From Transformers (BERT)-Based Model for Predicting Lysine Glycation Site for .BERT-Kgly:一种基于双向编码器表征变换器(BERT)的赖氨酸糖基化位点预测模型
Front Bioinform. 2022 Feb 18;2:834153. doi: 10.3389/fbinf.2022.834153. eCollection 2022.
2
A comparative analysis of amino acid encoding schemes for the prediction of flexible length linear B-cell epitopes.氨基酸编码方案在预测柔性长度线性 B 细胞表位中的比较分析。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac356.
3
Machine Learning Techniques for the Prediction of B-Cell and T-Cell Epitopes as Potential Vaccine Targets with a Specific Focus on SARS-CoV-2 Pathogen: A Review.
整合机器学习以推进表位作图。
Front Immunol. 2024 Sep 30;15:1463931. doi: 10.3389/fimmu.2024.1463931. eCollection 2024.
用于预测作为潜在疫苗靶点的B细胞和T细胞表位的机器学习技术,特别关注SARS-CoV-2病原体:综述
Pathogens. 2022 Jan 24;11(2):146. doi: 10.3390/pathogens11020146.
4
BERT-Kcr: prediction of lysine crotonylation sites by a transfer learning method with pre-trained BERT models.BERT-Kcr:基于预训练BERT模型的迁移学习方法预测赖氨酸巴豆酰化位点
Bioinformatics. 2022 Jan 12;38(3):648-654. doi: 10.1093/bioinformatics/btab712.
5
EpitopeVec: linear epitope prediction using deep protein sequence embeddings.EpitopeVec:使用深度蛋白质序列嵌入进行线性表位预测。
Bioinformatics. 2021 Dec 7;37(23):4517-4525. doi: 10.1093/bioinformatics/btab467.
6
EpiDope: a deep neural network for linear B-cell epitope prediction.EpiDope:用于线性B细胞表位预测的深度神经网络。
Bioinformatics. 2021 Jul 19;37(12):1784. doi: 10.1093/bioinformatics/btab390.
7
A novel antibacterial peptide recognition algorithm based on BERT.基于 BERT 的新型抗菌肽识别算法。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab200.
8
Linear B-Cell Epitope Prediction for In Silico Vaccine Design: A Performance Review of Methods Available via Command-Line Interface.线性 B 细胞表位预测在计算机疫苗设计中的应用:通过命令行界面提供的方法的性能评估。
Int J Mol Sci. 2021 Mar 22;22(6):3210. doi: 10.3390/ijms22063210.
9
iLBE for Computational Identification of Linear B-cell Epitopes by Integrating Sequence and Evolutionary Features.iLBE:通过整合序列和进化特征计算识别线性 B 细胞表位的方法。
Genomics Proteomics Bioinformatics. 2020 Oct;18(5):593-600. doi: 10.1016/j.gpb.2019.04.004. Epub 2020 Oct 22.
10
EpiDope: a deep neural network for linear B-cell epitope prediction.EpiDope:用于线性 B 细胞表位预测的深度神经网络。
Bioinformatics. 2021 May 1;37(4):448-455. doi: 10.1093/bioinformatics/btaa773.