State Key Laboratory of Natural Medicines, School of Life Science and Technology, China Pharmaceutical University, Nanjing 210009, China.
Int J Mol Sci. 2021 May 26;22(11):5630. doi: 10.3390/ijms22115630.
Recently, anticancer peptides (ACPs) have emerged as unique and promising therapeutic agents for cancer treatment compared with antibody and small molecule drugs. In addition to experimental methods of ACPs discovery, it is also necessary to develop accurate machine learning models for ACP prediction. In this study, features were extracted from the three-dimensional (3D) structure of peptides to develop the model, compared to most of the previous computational models, which are based on sequence information. In order to develop ACPs with more potency, more selectivity and less toxicity, the model for predicting ACPs, hemolytic peptides and toxic peptides were established by peptides 3D structure separately. Multiple datasets were collected according to whether the peptide sequence was chemically modified. After feature extraction and screening, diverse algorithms were used to build the model. Twelve models with excellent performance (Acc > 90%) in the ACPs mixed datasets were used to form a hybrid model to predict the candidate ACPs, and then the optimal model of hemolytic peptides (Acc = 73.68%) and toxic peptides (Acc = 85.5%) was used for safety prediction. Novel ACPs were found by using those models, and five peptides were randomly selected to determine their anticancer activity and toxic side effects in vitro experiments.
最近,与抗体和小分子药物相比,抗癌肽 (ACPs) 作为癌症治疗的独特且有前途的治疗剂而备受关注。除了发现 ACPs 的实验方法外,还需要开发用于 ACP 预测的准确机器学习模型。在这项研究中,从肽的三维 (3D) 结构中提取特征来开发模型,与大多数之前基于序列信息的计算模型相比。为了开发具有更高效力、更高选择性和更低毒性的 ACPs,分别通过肽 3D 结构建立了用于预测 ACPs、溶血肽和毒性肽的模型。根据肽序列是否经过化学修饰,收集了多个数据集。经过特征提取和筛选后,使用多种算法构建模型。在 ACPs 混合数据集上表现出色的 12 个模型(Acc>90%)用于形成混合模型来预测候选 ACPs,然后使用最佳的溶血肽模型(Acc=73.68%)和毒性肽模型(Acc=85.5%)进行安全预测。使用这些模型发现了新的 ACPs,并随机选择了 5 种肽在体外实验中测定其抗癌活性和毒性副作用。