Department of Software Engineering, Daffodil International University, Daffodil Smart City (DSC), Savar, Dhaka, Bangladesh.
Universidad Europea del Atlántico, Santander, Spain.
PLoS One. 2024 Nov 14;19(11):e0313835. doi: 10.1371/journal.pone.0313835. eCollection 2024.
Interleukin-10, a highly effective cytokine recognized for its anti-inflammatory properties, plays a critical role in the immune system. In addition to its well-documented capacity to mitigate inflammation, IL-10 can unexpectedly demonstrate pro-inflammatory characteristics under specific circumstances. The presence of both aspects emphasizes the vital need to identify the IL-10-induced peptide. To mitigate the drawbacks of manual identification, which include its high cost, this study introduces StackIL10, an ensemble learning model based on stacking, to identify IL-10-inducing peptides in a precise and efficient manner. Ten Amino-acid-composition-based Feature Extraction approaches are considered. The StackIL10, stacking ensemble, the model with five optimized Machine Learning Algorithm (specifically LGBM, RF, SVM, Decision Tree, KNN) as the base learners and a Logistic Regression as the meta learner was constructed, and the identification rate reached 91.7%, MCC of 0.833 with 0.9078 Specificity. Experiments were conducted to examine the impact of various enhancement techniques on the correctness of IL-10 Prediction. These experiments included comparisons between single models and various combinations of stacking-based ensemble models. It was demonstrated that the model proposed in this study was more effective than singular models and produced satisfactory results, thereby improving the identification of peptides that induce IL-10.
白细胞介素-10(IL-10)是一种具有高效抗炎特性的细胞因子,在免疫系统中起着关键作用。除了其减轻炎症的能力已得到充分证明外,IL-10 在特定情况下还可能表现出促炎特性。这两个方面都强调了确定 IL-10 诱导肽的重要性。为了减轻手动识别的缺点,包括其高成本,本研究引入了基于堆叠的集成学习模型 StackIL10,以精确高效地识别 IL-10 诱导肽。考虑了十种基于氨基酸组成的特征提取方法。构建了 StackIL10、堆叠集成、具有五个优化机器学习算法(具体为 LGBM、RF、SVM、决策树、KNN)作为基础学习者和逻辑回归作为元学习者的模型,识别率达到 91.7%,MCC 为 0.833,特异性为 0.9078。进行了实验来检查各种增强技术对 IL-10 预测正确性的影响。这些实验包括对单个模型和基于堆叠的集成模型的各种组合进行比较。结果表明,本研究提出的模型比单一模型更有效,产生了令人满意的结果,从而提高了对诱导 IL-10 的肽的识别能力。