IEEE/ACM Trans Comput Biol Bioinform. 2021 Sep-Oct;18(5):1831-1840. doi: 10.1109/TCBB.2020.2968419. Epub 2021 Oct 7.
Anti-inflammatory peptides (AIEs) have recently emerged as promising therapeutic agent for treatment of various inflammatory diseases, such as rheumatoid arthritis and Alzheimer's disease. Therefore, detecting the correlation between amino acid sequence and its anti-inflammatory property is of great importance for the discovery of new AIEs. To address this issue, we propose a novel prediction tool for accurate identification of peptides as anti-inflammatory epitopes or non anti-inflammatory epitopes. Most of all, we encode the original peptide sequence for better mining and exploring the information and patterns, based on the three feature representations as amino acid contact, position specific scoring matrix, physicochemical property. At the same time, we exploit several feature extraction models and utilize one feature selection model, in order to construct many base classifiers from various feature representations. More specifically, we develop an effective classification model, with which we can extract and learn a set of informative features from the ensemble classifier chain model with different group of base classifiers. Furthermore, in order to test the predictive power of our model, we conduct the comparative experiments on the leave-one-out cross-validation and the independent test. It shows that our novel predictor performs great accurate for identification of AIEs as well as existing outstanding prediction tools. Source codes are available at https://github.com/guofei-tju/Ensemble-classifier-chain-model.
抗炎肽 (AIEs) 最近作为治疗各种炎症性疾病(如类风湿性关节炎和阿尔茨海默病)的有前途的治疗剂而出现。因此,检测氨基酸序列与其抗炎特性之间的相关性对于发现新的 AIEs 非常重要。为了解决这个问题,我们提出了一种新的预测工具,用于准确识别抗炎表位或非抗炎表位的肽。最重要的是,我们基于氨基酸接触、位置特异性评分矩阵和物理化学性质这三种特征表示来对原始肽序列进行编码,以便更好地挖掘和探索信息和模式。同时,我们利用了多种特征提取模型并利用了一种特征选择模型,以便从各种特征表示中构建许多基本分类器。更具体地说,我们开发了一种有效的分类模型,通过该模型,我们可以从具有不同基分类器组的集成分类器链模型中提取和学习一组信息丰富的特征。此外,为了测试我们模型的预测能力,我们在留一交叉验证和独立测试上进行了对比实验。结果表明,我们的新型预测器在识别 AIEs 方面以及现有的优秀预测工具方面都具有出色的准确性。源代码可在 https://github.com/guofei-tju/Ensemble-classifier-chain-model 上获得。