Suppr超能文献

有效鉴定与差异分析抗癌肽。

Effective identification and differential analysis of anticancer peptides.

机构信息

School of Mathematics and Statistics, Northeastern University at Qinhuangdao, Qinhuangdao, PR China; Hebei Innovation Center for Smart Perception and Applied Technology of Agricultural Data, Qinhuangdao, PR China.

School of Mathematics and Statistics, Northeastern University at Qinhuangdao, Qinhuangdao, PR China.

出版信息

Biosystems. 2024 Jul;241:105246. doi: 10.1016/j.biosystems.2024.105246. Epub 2024 Jun 5.

Abstract

Anticancer peptides (ACPs) have recently emerged as promising cancer therapeutics due to their selectivity and lower toxicity. However, the number of experimentally validated ACPs is limited, and identifying ACPs from large-scale sequence data is time-consuming and expensive. Therefore, it is critical to develop and improve upon existing computational models for identifying ACPs. In this study, a computational method named ACP_DA was proposed based on peptide residue composition and physiochemical properties information. To curtail overfitting and reduce computational costs, a sequential forward selection method was utilized to construct the optimal feature groups. Subsequently, the feature vectors were fed into light gradient boosting machine classifier for model construction. It was observed by an independent set test that ACP_DA achieved the highest Matthew's correlation coefficient of 0.63 and accuracy of 0.8129, displaying at least a 2% enhancement compared to state-of-the-art methods. The satisfactory results demonstrate the effectiveness of ACP_DA as a powerful tool for identifying ACPs, with the potential to significantly contribute to the development and optimization of promising therapies. The data and resource codes are available at https://github.com/Zlclab/ACP_DA.

摘要

抗癌肽(ACPs)由于其选择性和较低的毒性,最近成为有前途的癌症治疗方法。然而,经过实验验证的 ACP 数量有限,并且从大规模序列数据中识别 ACP 既耗时又昂贵。因此,开发和改进现有的用于识别 ACP 的计算模型至关重要。在这项研究中,提出了一种名为 ACP_DA 的计算方法,该方法基于肽残基组成和物理化学性质信息。为了减少过拟合和降低计算成本,使用顺序前向选择方法来构建最优特征组。然后,将特征向量输入到轻梯度提升机分类器中进行模型构建。通过独立集测试观察到,ACP_DA 实现了最高的马修相关系数 0.63 和 0.8129 的准确率,与最先进的方法相比至少提高了 2%。令人满意的结果表明,ACP_DA 是一种识别 ACP 的有效工具,它有可能为有前途的治疗方法的开发和优化做出重大贡献。数据和资源代码可在 https://github.com/Zlclab/ACP_DA 上获得。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验