Zhiguo Yu, Zixuan Li, Peng Li
School of Informatics, Hunan University of Chinese Medicine, Changsha, 410208, Hunan, China.
BMC Plant Biol. 2025 Jul 19;25(1):933. doi: 10.1186/s12870-025-06878-z.
Plant peptide-protein interactions (PepPI) play a crucial role in plant growth, development, immune regulation, and environmental adaptation. However, existing computational methods still face several challenges in PepPI prediction. First, most methods fail to adequately integrate multimodal information such as sequence, structure, and disorder properties, leading to inadequate characterization of complex interaction patterns. Second, existing models have difficulty in capturing cross-dependent features between peptides and proteins, limiting the prediction performance. Finally, the lack of a unified framework capable of integrated modeling from both global and local perspectives leads to a still imperfect understanding and prediction of PepPI. To this end, we propose a multiple characterization framework based on cross-modal feature fusion-MultiRepPI-for efficient prediction of plant PepPI. In this framework, we innovatively introduce several key modules aimed at comprehensively improving the representation of peptide and protein features. First, a cross-modal encoding module (CME) is designed by fusing convolutional neural networks, recurrent neural networks, and feature enhancement mechanisms, which is capable of extracting multi-scale deep features from peptide and protein sequences, and thus better capturing their interactions at different levels. Secondly, bi-directional attention and PepPI gating mechanisms were used to design the cross-modal attention module (CMA), which deeply mines the key interaction patterns between peptides and proteins, and enhances the ability of the model to focus on important binding sites. Finally, the disordered feature extraction (DFE) module was designed to specifically identify disordered regions of plant proteins and extract dynamic features to further enhance the accuracy of plant PepPI prediction. A systematic experimental evaluation of MultiRepPI was conducted on a benchmark dataset. The experimental results show that MultiRepPI provides a significant improvement in both prediction performance and binding residue recognition compared to existing state-of-the-art methods. This framework provides a reliable tool for efficient prediction of plant PepPIs, while laying a solid foundation for plant biology research and peptide drug development.
植物肽 - 蛋白质相互作用(PepPI)在植物生长、发育、免疫调节和环境适应中起着至关重要的作用。然而,现有的计算方法在PepPI预测中仍面临若干挑战。首先,大多数方法未能充分整合序列、结构和无序特性等多模态信息,导致对复杂相互作用模式的表征不足。其次,现有模型难以捕捉肽和蛋白质之间的交叉依赖特征,限制了预测性能。最后,缺乏一个能够从全局和局部角度进行综合建模的统一框架,导致对PepPI的理解和预测仍不完美。为此,我们提出了一种基于跨模态特征融合的多重表征框架——MultiRepPI,用于高效预测植物PepPI。在这个框架中,我们创新性地引入了几个关键模块,旨在全面提升肽和蛋白质特征的表征能力。首先,通过融合卷积神经网络、循环神经网络和特征增强机制设计了一个跨模态编码模块(CME),它能够从肽和蛋白质序列中提取多尺度深度特征,从而更好地捕捉它们在不同层次上的相互作用。其次,使用双向注意力和PepPI门控机制设计了跨模态注意力模块(CMA),它深入挖掘肽和蛋白质之间的关键相互作用模式,并增强模型聚焦于重要结合位点的能力。最后,设计了无序特征提取(DFE)模块,专门识别植物蛋白质的无序区域并提取动态特征,以进一步提高植物PepPI预测的准确性。在一个基准数据集上对MultiRepPI进行了系统的实验评估。实验结果表明,与现有的最先进方法相比,MultiRepPI在预测性能和结合残基识别方面都有显著提升。该框架为高效预测植物PepPI提供了一个可靠的工具,同时为植物生物学研究和肽药物开发奠定了坚实的基础。