Centre for Multidisciplinary Research and Engineering, Vinca Institute of Nuclear Sciences, University of Belgrade, Belgrade, Serbia.
School of Cellular and Molecular Medicine, University of Bristol, Bristol, UK.
Sci Rep. 2018 Jul 12;8(1):10563. doi: 10.1038/s41598-018-28815-x.
Intrinsically disordered proteins (IDPs) are characterized by the lack of a fixed tertiary structure and are involved in the regulation of key biological processes via binding to multiple protein partners. IDPs are malleable, adapting to structurally different partners, and this flexibility stems from features encoded in the primary structure. The assumption that universal sequence information will facilitate coverage of the sparse zones of the human interactome motivated us to explore the possibility of predicting protein-protein interactions (PPIs) that involve IDPs based on sequence characteristics. We developed a method that relies on features of the interacting and non-interacting protein pairs and utilizes machine learning to classify and predict IDP PPIs. Consideration of both sequence determinants specific for conformational organizations and the multiplicity of IDP interactions in the training phase ensured a reliable approach that is superior to current state-of-the-art methods. By applying a strict evaluation procedure, we confirm that our method predicts interactions of the IDP of interest even on the proteome-scale. This service is provided as a web tool to expedite the discovery of new interactions and IDP functions with enhanced efficiency.
无定形蛋白质(IDPs)的特征是缺乏固定的三级结构,并通过与多个蛋白质伴侣结合参与关键的生物过程调节。IDPs 具有可塑 性,能够适应结构不同的伴侣,这种灵活性源于一级结构中编码的特征。我们假设普遍的序列信息将有助于覆盖人类相互作用组的稀疏区域,这促使我们探索基于序列特征预测涉及 IDP 的蛋白质-蛋白质相互作用(PPIs)的可能性。我们开发了一种方法,该方法依赖于相互作用和非相互作用蛋白质对的特征,并利用机器学习对 IDP PPI 进行分类和预测。在训练阶段考虑构象组织的特定序列决定因素和 IDP 相互作用的多样性,确保了一种可靠的方法,优于当前最先进的方法。通过应用严格的评估程序,我们证实我们的方法甚至可以在蛋白质组范围内预测感兴趣的 IDP 的相互作用。此服务作为网络工具提供,可加快新相互作用和 IDP 功能的发现,提高效率。