Charoenkwan Phasit, Chumnanpuen Pramote, Schaduangrat Nalini, Shoombuatong Watshara
Modern Management and Information Technology, College of Arts, Media and Technology, Chiang Mai University, Chiang Mai 50200, Thailand.
Department of Zoology, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; Kasetsart University International College (KUIC), Kasetsart University, Bangkok 10900, Thailand.
Methods. 2025 Feb;234:131-140. doi: 10.1016/j.ymeth.2024.12.005. Epub 2024 Dec 19.
Identifying angiotensin-I-converting enzyme (ACE) inhibitory peptides accurately is crucial for understanding the primary factor that regulates the renin-angiotensin system and for providing guidance in developing new potential drugs. Given the inherent experimental complexities, using computational methods for in silico peptide identification could be indispensable for facilitating the high-throughput characterization of ACE inhibitory peptides. In this paper, we propose a novel deep stacking-based ensemble learning framework, termed Deepstack-ACE, to precisely identify ACE inhibitory peptides. In Deepstack-ACE, the input peptide sequences are fed into the word2vec embedding technique to generate sequence representations. Then, these representations were employed to train five powerful deep learning methods, including long short-term memory, convolutional neural network, multi-layer perceptron, gated recurrent unit network, and recurrent neural network, for the construction of base-classifiers. Finally, the optimized stacked model was constructed based on the best combination of selected base-classifiers. Benchmarking experiments showed that Deepstack-ACE attained a more accurate and robust identification of ACE inhibitory peptides compared to its base-classifiers and several conventional machine learning classifiers. Remarkably, in the independent test, our proposed model significantly outperformed the current state-of-the-art methods, with a balanced accuracy of 0.916, sensitivity of 0.911, and Matthews correlation coefficient scores of 0.826. Moreover, we developed a user-friendly web server for Deepstack-ACE, which is freely available at https://pmlabqsar.pythonanywhere.com/Deepstack-ACE. We anticipate that our proposed Deepstack-ACE model can provide a faster and reasonably accurate identification of ACE inhibitory peptides.
准确识别血管紧张素转换酶(ACE)抑制肽对于理解调节肾素-血管紧张素系统的主要因素以及为开发新的潜在药物提供指导至关重要。鉴于固有的实验复杂性,使用计算方法进行计算机辅助肽识别对于促进ACE抑制肽的高通量表征可能是不可或缺的。在本文中,我们提出了一种基于深度堆叠的新型集成学习框架,称为Deepstack-ACE,以精确识别ACE抑制肽。在Deepstack-ACE中,将输入的肽序列输入到word2vec嵌入技术中以生成序列表示。然后,利用这些表示来训练五种强大的深度学习方法,包括长短期记忆网络、卷积神经网络、多层感知器、门控循环单元网络和递归神经网络,以构建基础分类器。最后,基于所选基础分类器的最佳组合构建优化的堆叠模型。基准实验表明,与基础分类器和几种传统机器学习分类器相比,Deepstack-ACE对ACE抑制肽的识别更准确、更稳健。值得注意的是,在独立测试中,我们提出的模型显著优于当前的最先进方法,平衡准确率为0.916,灵敏度为0.911,马修斯相关系数得分为0.826。此外,我们为Deepstack-ACE开发了一个用户友好的网络服务器,可在https://pmlabqsar.pythonanywhere.com/Deepstack-ACE上免费获得。我们预计,我们提出的Deepstack-ACE模型可以更快、更合理准确地识别ACE抑制肽。