Ahmed Saeed, Schaduangrat Nalini, Chumnanpuen Pramote, Shoombuatong Watshara
Center for Research Innovation and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand.
Department of Computer Science, University of Swabi, Swabis, Pakistan.
Protein Sci. 2025 Jun;34(6):e70026. doi: 10.1002/pro.70026.
Accurate identification of angiotensin-I-converting enzyme (ACE) inhibitory peptides is essential for understanding the primary factor regulating the renin-angiotensin system and guiding the development of new drug candidates. Given the inherent challenges in experimental processes, computational methods for in silico peptide identification can be invaluable for enabling high-throughput characterization of ACE inhibitory peptides. This study introduces GRU4ACE, an innovative deep learning framework based on multi-view information for identifying ACE inhibitory peptides. First, GRU4ACE utilizes multi-source feature encoding methods to capture the information embedded in ACE inhibitory peptides, including sequential information, graphical information, semantic information, and contextual information. Specifically, the feature representations used herein are derived from conventional feature descriptors, natural language processing (NLP)-based embeddings, and pre-trained protein language model (PLM)-based embeddings. Next, multiple feature embeddings were fused, and the elastic net was employed for feature optimization. Finally, the optimal feature subset with strong feature representation was input into a gated recurrent unit (GRU). The proposed GRU4ACE approach demonstrated superior performance over existing methods in terms of the independent test. To be specific, the balanced accuracy, sensitivity, and MCC scores of GRU4ACE reached 0.948, 0.934, and 0.895, which were 6.46%, 8.92%, and 12.51% higher than those of the compared methods, respectively. In addition, when comparing well-regarded feature descriptors, we found that the proposed multi-view features effectively captured crucial information, leading to improved ACE inhibitory peptide prediction performance. These comprehensive results highlight that GRU4ACE enhances prediction accuracy and significantly narrows down the search for new potential antihypertensive drugs.
准确识别血管紧张素转换酶(ACE)抑制肽对于理解调节肾素-血管紧张素系统的主要因素以及指导新候选药物的开发至关重要。鉴于实验过程中存在的固有挑战,用于计算机肽识别的计算方法对于实现ACE抑制肽的高通量表征可能具有重要价值。本研究介绍了GRU4ACE,这是一种基于多视图信息的创新深度学习框架,用于识别ACE抑制肽。首先,GRU4ACE利用多源特征编码方法来捕获ACE抑制肽中嵌入的信息,包括序列信息、图形信息、语义信息和上下文信息。具体而言,本文使用的特征表示来自传统特征描述符、基于自然语言处理(NLP)的嵌入以及基于预训练蛋白质语言模型(PLM)的嵌入。接下来,融合多个特征嵌入,并采用弹性网络进行特征优化。最后,将具有强特征表示的最优特征子集输入门控循环单元(GRU)。在独立测试方面,所提出的GRU4ACE方法表现优于现有方法。具体来说,GRU4ACE的平衡准确率、灵敏度和马修斯相关系数(MCC)得分分别达到0.948、0.934和0.895,分别比比较方法高6.46%、8.92%和12.51%。此外,在比较广受认可的特征描述符时,我们发现所提出的多视图特征有效地捕获了关键信息,从而提高了ACE抑制肽的预测性能。这些综合结果表明,GRU4ACE提高了预测准确性,并显著缩小了寻找新的潜在抗高血压药物的范围。