Department of Physiology, Ajou University School of Medicine, Suwon, 16499, Republic of Korea.
Department of Integrative Biotechnology, College of Biotechnology and Bioengineering, Sungkyunkwan University, Suwon, 16419, Republic of Korea.
Comput Biol Med. 2023 Oct;165:107386. doi: 10.1016/j.compbiomed.2023.107386. Epub 2023 Aug 14.
Diabetes mellitus has become a major public health concern associated with high mortality and reduced life expectancy and can cause blindness, heart attacks, kidney failure, lower limb amputations, and strokes. A new generation of antidiabetic peptides (ADPs) that act on β-cells or T-cells to regulate insulin production is being developed to alleviate the effects of diabetes. However, the lack of effective peptide-mining tools has hampered the discovery of these promising drugs. Hence, novel computational tools need to be developed urgently. In this study, we present ADP-Fuse, a novel two-layer prediction framework capable of accurately identifying ADPs or non-ADPs and categorizing them into type 1 and type 2 ADPs. First, we comprehensively evaluated 22 peptide sequence-derived features coupled with eight notable machine learning algorithms. Subsequently, the most suitable feature descriptors and classifiers for both layers were identified. The output of these single-feature models, embedded with multiview information, was trained with an appropriate classifier to provide the final prediction. Comprehensive cross-validation and independent tests substantiate that ADP-Fuse surpasses single-feature models and the feature fusion approach for the prediction of ADPs and their types. In addition, the SHapley Additive exPlanation method was used to elucidate the contributions of individual features to the prediction of ADPs and their types. Finally, a user-friendly web server for ADP-Fuse was developed and made publicly accessible (https://balalab-skku.org/ADP-Fuse), enabling the swift screening and identification of novel ADPs and their types. This framework is expected to contribute significantly to antidiabetic peptide identification.
糖尿病已成为一个主要的公共卫生关注点,与高死亡率和预期寿命降低有关,可导致失明、心脏病发作、肾衰竭、下肢截肢和中风。为了缓解糖尿病的影响,正在开发新一代作用于β细胞或 T 细胞以调节胰岛素产生的抗糖尿病肽 (ADPs)。然而,缺乏有效的肽挖掘工具阻碍了这些有前途药物的发现。因此,急需开发新的计算工具。在这项研究中,我们提出了 ADP-Fuse,这是一种新的两层预测框架,能够准确识别 ADP 或非 ADP,并将它们分类为 1 型和 2 型 ADP。首先,我们全面评估了 22 个肽序列衍生特征,结合了八种著名的机器学习算法。随后,确定了两层最适合的特征描述符和分类器。这些单特征模型的输出,嵌入多视图信息,用合适的分类器进行训练,提供最终预测。全面的交叉验证和独立测试证实,ADP-Fuse 优于单特征模型和特征融合方法,可用于预测 ADP 及其类型。此外,还使用 Shapley Additive exPlanation 方法阐明了各个特征对 ADP 及其类型预测的贡献。最后,开发了一个用于 ADP-Fuse 的用户友好型网络服务器,并公开发布(https://balalab-skku.org/ADP-Fuse),能够快速筛选和识别新型 ADP 及其类型。该框架有望为抗糖尿病肽的鉴定做出重大贡献。