• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PredMHC:一种使用混合特征的主要组织相容性复合体有效预测器。

PredMHC: An Effective Predictor of Major Histocompatibility Complex Using Mixed Features.

作者信息

Chen Dong, Li Yanjuan

机构信息

College of Electrical and Information Engineering, Quzhou University, Quzhou, China.

出版信息

Front Genet. 2022 Apr 25;13:875112. doi: 10.3389/fgene.2022.875112. eCollection 2022.

DOI:10.3389/fgene.2022.875112
PMID:35547252
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9081368/
Abstract

The major histocompatibility complex (MHC) is a large locus on vertebrate DNA that contains a tightly linked set of polymorphic genes encoding cell surface proteins essential for the adaptive immune system. The groups of proteins encoded in the MHC play an important role in the adaptive immune system. Therefore, the accurate identification of the MHC is necessary to understand its role in the adaptive immune system. An effective predictor called PredMHC is established in this study to identify the MHC from protein sequences. Firstly, PredMHC encoded a protein sequence with mixed features including 188D, APAAC, KSCTriad, CKSAAGP, and PAAC. Secondly, three classifiers including SGD, SMO, and random forest were trained on the mixed features of the protein sequence. Finally, the prediction result was obtained by the voting of the three classifiers. The experimental results of the 10-fold cross-validation test in the training dataset showed that PredMHC can obtain 91.69% accuracy. Experimental results on comparison with other features, classifiers, and existing methods showed the effectiveness of PredMHC in predicting the MHC.

摘要

主要组织相容性复合体(MHC)是脊椎动物DNA上的一个大位点,它包含一组紧密连锁的多态基因,这些基因编码适应性免疫系统所必需的细胞表面蛋白。MHC中编码的蛋白质组在适应性免疫系统中发挥着重要作用。因此,准确识别MHC对于理解其在适应性免疫系统中的作用至关重要。本研究建立了一种名为PredMHC的有效预测器,用于从蛋白质序列中识别MHC。首先,PredMHC对具有混合特征的蛋白质序列进行编码,这些特征包括188D、APAAC、KSCTriad、CKSAAGP和PAAC。其次,使用蛋白质序列的混合特征对包括SGD、SMO和随机森林在内的三种分类器进行训练。最后,通过三种分类器的投票获得预测结果。训练数据集中10倍交叉验证测试的实验结果表明,PredMHC可以获得91.69%的准确率。与其他特征、分类器和现有方法比较的实验结果表明,PredMHC在预测MHC方面是有效的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b255/9081368/86f4f6383c65/fgene-13-875112-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b255/9081368/86f4f6383c65/fgene-13-875112-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b255/9081368/86f4f6383c65/fgene-13-875112-g001.jpg

相似文献

1
PredMHC: An Effective Predictor of Major Histocompatibility Complex Using Mixed Features.PredMHC:一种使用混合特征的主要组织相容性复合体有效预测器。
Front Genet. 2022 Apr 25;13:875112. doi: 10.3389/fgene.2022.875112. eCollection 2022.
2
Prediction of diabetic protein markers based on an ensemble method.基于集成方法的糖尿病蛋白质标志物预测
Front Biosci (Landmark Ed). 2021 Jul 30;26(7):207-221. doi: 10.52586/4935.
3
ELM-MHC: An Improved MHC Identification Method with Extreme Learning Machine Algorithm.ELM-MHC:一种基于极端学习机算法的 MHC 鉴定方法的改进。
J Proteome Res. 2019 Mar 1;18(3):1392-1401. doi: 10.1021/acs.jproteome.9b00012. Epub 2019 Feb 18.
4
Improved Prediction of Protein-Protein Interaction Mapping on by Using Amino Acid Sequence Features in a Supervised Learning Framework.利用监督学习框架中的氨基酸序列特征改进蛋白质相互作用预测映射。
Protein Pept Lett. 2021;28(1):74-83. doi: 10.2174/0929866527666200610141258.
5
Protein-RNA interface residue prediction using machine learning: an assessment of the state of the art.基于机器学习的蛋白质-RNA 界面残基预测:现状评估。
BMC Bioinformatics. 2012 May 10;13:89. doi: 10.1186/1471-2105-13-89.
6
Prediction of human major histocompatibility complex class II binding peptides by continuous kernel discrimination method.连续核判别方法预测人类主要组织相容性复合体 II 结合肽。
Artif Intell Med. 2012 Jun;55(2):107-15. doi: 10.1016/j.artmed.2011.10.005. Epub 2011 Nov 30.
7
An Effective Antifreeze Protein Predictor with Ensemble Classifiers and Comprehensive Sequence Descriptors.一种使用集成分类器和综合序列描述符的有效抗冻蛋白预测器。
Int J Mol Sci. 2015 Sep 7;16(9):21191-214. doi: 10.3390/ijms160921191.
8
RFAmyloid: A Web Server for Predicting Amyloid Proteins.RFAmyloid:用于预测淀粉样蛋白的网络服务器。
Int J Mol Sci. 2018 Jul 16;19(7):2071. doi: 10.3390/ijms19072071.
9
i6mA-VC: A Multi-Classifier Voting Method for the Computational Identification of DNA N6-methyladenine Sites.i6mA-VC:一种用于计算鉴定 DNA N6-甲基腺嘌呤位点的多分类器投票方法。
Interdiscip Sci. 2021 Sep;13(3):413-425. doi: 10.1007/s12539-021-00429-4. Epub 2021 Apr 8.
10
A Model Stacking Framework for Identifying DNA Binding Proteins by Orchestrating Multi-View Features and Classifiers.一种通过协调多视图特征和分类器来识别DNA结合蛋白的模型堆叠框架。
Genes (Basel). 2018 Aug 1;9(8):394. doi: 10.3390/genes9080394.

引用本文的文献

1
Sensitive, High-Throughput HLA-I and HLA-II Immunopeptidomics Using Parallel Accumulation-Serial Fragmentation Mass Spectrometry.采用平行累积-串联碎裂质谱技术进行敏感、高通量的 HLA-I 和 HLA-II 免疫肽组学分析。
Mol Cell Proteomics. 2023 Jun;22(6):100563. doi: 10.1016/j.mcpro.2023.100563. Epub 2023 May 3.

本文引用的文献

1
Understanding the constitutive presentation of MHC class I immunopeptidomes in primary tissues.了解主要组织中MHC I类免疫肽组的组成性呈现。
iScience. 2022 Jan 18;25(2):103768. doi: 10.1016/j.isci.2022.103768. eCollection 2022 Feb 18.
2
A CNN-based multi-target fast classification method for AR-SSVEP.一种基于卷积神经网络的用于自相关稳态视觉诱发电位的多目标快速分类方法。
Comput Biol Med. 2022 Feb;141:105042. doi: 10.1016/j.compbiomed.2021.105042. Epub 2021 Nov 18.
3
AFP-CMBPred: Computational identification of antifreeze proteins by extending consensus sequences into multi-blocks evolutionary information.
AFP-CMBPred:通过将共识序列扩展到多块进化信息来计算识别抗冻蛋白。
Comput Biol Med. 2021 Dec;139:105006. doi: 10.1016/j.compbiomed.2021.105006. Epub 2021 Nov 2.
4
Accurate and efficient mosquito genus classification algorithm using candidate-elimination and nearest centroid on extracted features of wingbeat acoustic properties.利用翅膀拍动声学特性提取特征的候选消除和最近质心进行准确高效的蚊子属分类算法。
Comput Biol Med. 2021 Dec;139:104973. doi: 10.1016/j.compbiomed.2021.104973. Epub 2021 Oct 25.
5
Classification of alcoholic EEG signals using wavelet scattering transform-based features.基于小波散射变换特征的酒精性脑电图信号分类。
Comput Biol Med. 2021 Dec;139:104969. doi: 10.1016/j.compbiomed.2021.104969. Epub 2021 Oct 22.
6
Automatic tracking of healthy joint kinematics from stereo-radiography sequences.从立体射线照相序列中自动跟踪健康关节运动学。
Comput Biol Med. 2021 Dec;139:104945. doi: 10.1016/j.compbiomed.2021.104945. Epub 2021 Oct 14.
7
Self-supervised CT super-resolution with hybrid model.基于混合模型的 CT 超分辨率自监督学习。
Comput Biol Med. 2021 Nov;138:104775. doi: 10.1016/j.compbiomed.2021.104775. Epub 2021 Aug 21.
8
Editorial: Feature Representation and Learning Methods With Applications in Protein Secondary Structure.社论:蛋白质二级结构中的特征表示与学习方法及其应用
Front Bioeng Biotechnol. 2021 Sep 9;9:748722. doi: 10.3389/fbioe.2021.748722. eCollection 2021.
9
Deep learning-based Hounsfield unit value measurement method for bolus tracking images in cerebral computed tomography angiography.基于深度学习的脑 CT 血管造影中团注追踪图像的 Hounsfield 单位值测量方法。
Comput Biol Med. 2021 Oct;137:104824. doi: 10.1016/j.compbiomed.2021.104824. Epub 2021 Sep 2.
10
iAtbP-Hyb-EnC: Prediction of antitubercular peptides via heterogeneous feature representation and genetic algorithm based ensemble learning model.iAtbP-Hyb-EnC:基于异质特征表示和遗传算法的集合学习模型对抗结核肽的预测。
Comput Biol Med. 2021 Oct;137:104778. doi: 10.1016/j.compbiomed.2021.104778. Epub 2021 Aug 25.