Wang Fanjin, Sangfuang Nannapat, McCoubrey Laura E, Yadav Vipul, Elbadawi Moe, Orlu Mine, Gaisford Simon, Basit Abdul W
Intract Pharma Ltd. London Bioscience Innovation Centre, 2 Royal College St, London NW1 0NH, UK.
UCL School of Pharmacy, 29-39 Brunswick Square, London WC1N 1AX, UK.
Int J Pharm. 2023 Mar 5;634:122643. doi: 10.1016/j.ijpharm.2023.122643. Epub 2023 Jan 25.
The oral delivery of peptide therapeutics could facilitate precision treatment of numerous gastrointestinal (GI) and systemic diseases with simple administration for patients. However, the vast majority of licensed peptide drugs are currently administered parenterally due to prohibitive peptide instability in the GI tract. As such, the development of GI-stable peptides is receiving considerable investment. This study provides researchers with the first tool to predict the GI stability of peptide therapeutics based solely on the amino acid sequence. Both unsupervised and supervised machine learning techniques were trained on literature-extracted data describing peptide stability in simulated gastric and small intestinal fluid (SGF and SIF). Based on 109 peptide incubations, classification models for SGF and SIF were developed. The best models utilized k-Nearest Neighbor (for SGF) and XGBoost (for SIF) algorithms, with accuracies of 75.1% (SGF) and 69.3% (SIF), and f1 scores of 84.5% (SGF) and 73.4% (SIF) under 5-fold cross-validation. Feature importance analysis demonstrated that peptides' lipophilicity, rigidity, and size were key determinants of stability. These models are now available to those working on the development of oral peptide therapeutics.
肽类疗法的口服给药可以通过为患者提供简单的给药方式,促进对多种胃肠道(GI)和全身性疾病的精准治疗。然而,由于肽在胃肠道中极不稳定,目前绝大多数已获许可的肽类药物都是通过肠胃外给药。因此,胃肠道稳定肽的开发正获得大量投资。本研究为研究人员提供了首个仅基于氨基酸序列预测肽类疗法胃肠道稳定性的工具。无监督和有监督的机器学习技术都在从文献中提取的、描述肽在模拟胃液和小肠液(SGF和SIF)中稳定性的数据上进行了训练。基于109次肽孵育实验,开发了SGF和SIF的分类模型。最佳模型使用了k近邻算法(用于SGF)和XGBoost算法(用于SIF),在5折交叉验证下,准确率分别为75.1%(SGF)和69.3%(SIF),f1分数分别为84.5%(SGF)和73.4%(SIF)。特征重要性分析表明,肽的亲脂性、刚性和大小是稳定性的关键决定因素。这些模型现在可供从事口服肽类疗法开发的人员使用。