Liu Mingyou, Wu Tao, Li Xue, Zhu Yingxue, Chen Sen, Huang Jian, Zhou Fengfeng, Liu Hongmei
School of Biology and Engineering (School of Health Medicine Modern Industry), Guizhou Medical University, Guiyang, China.
Engineering Research Center of Health Medicine Biotechnology of Guizhou Province, Guizhou Medical University, Guiyang, China.
Front Genet. 2024 Feb 29;15:1352504. doi: 10.3389/fgene.2024.1352504. eCollection 2024.
: Cancer is a significant global health problem that continues to cause a high number of deaths worldwide. Traditional cancer treatments often come with risks that can compromise the functionality of vital organs. As a potential alternative to these conventional therapies, Anticancer peptides (ACPs) have garnered attention for their small size, high specificity, and reduced toxicity, making them as a promising option for cancer treatments. : However, the process of identifying effective ACPs through wet-lab screening experiments is time-consuming and requires a lot of labor. To overcome this challenge, a deep ensemble learning method is constructed to predict anticancer peptides (ACPs) in this study. To evaluate the reliability of the framework, four different datasets are used in this study for training and testing. During the training process of the model, integration of feature selection methods, feature dimensionality reduction measures, and optimization of the deep ensemble model are carried out. Finally, we explored the interpretability of features that affected the final prediction results and built a web server platform to facilitate anticancer peptides prediction, which can be used by all researchers for further studies. This web server can be accessed at http://lmylab.online:5001/. : The result of this study achieves an accuracy rate of 98.53% and an AUC (Area under Curve) value of 0.9972 on the ACPfel dataset, it has improvements on other datasets as well.
癌症是一个重大的全球健康问题,在全球范围内持续导致大量死亡。传统的癌症治疗方法往往伴随着风险,可能会损害重要器官的功能。作为这些传统疗法的潜在替代方案,抗癌肽(ACPs)因其体积小、特异性高和毒性低而受到关注,使其成为癌症治疗的一个有前景的选择。然而,通过湿实验室筛选实验鉴定有效的抗癌肽的过程既耗时又需要大量人力。为了克服这一挑战,本研究构建了一种深度集成学习方法来预测抗癌肽(ACPs)。为了评估该框架的可靠性,本研究使用了四个不同的数据集进行训练和测试。在模型的训练过程中,进行了特征选择方法的集成、特征降维措施以及深度集成模型的优化。最后,我们探索了影响最终预测结果的特征的可解释性,并构建了一个网络服务器平台以促进抗癌肽的预测,所有研究人员均可使用该平台进行进一步研究。该网络服务器可通过http://lmylab.online:5001/访问。本研究结果在ACPfel数据集上达到了98.53%的准确率和0.9972的AUC(曲线下面积)值,在其他数据集上也有改进。