Benavides Tiburon L, Montelione Gaetano T
Department of Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180 USA.
Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180 USA.
bioRxiv. 2024 Sep 22:2024.09.19.613999. doi: 10.1101/2024.09.19.613999.
Protein-polypeptide interactions, including those involving intrinsically-disordered peptides and intrinsically-disordered regions of protein binding partners, are crucial for many biological functions. However, experimental structure determination of protein-peptide complexes can be challenging. Computational methods, while promising, generally require experimental data for validation and refinement. Here we present , an integrated modeling approach to determine the structures of protein-peptide complexes. This method combines AlphaFold2 (AF2) enhanced sampling methods with a Bayesian conformational selection process based on experimental Nuclear Magnetic Resonance (NMR) Chemical Shift Perturbation (CSP) data and AF2 confidence metrics. Using a curated dataset of 108 protein-peptide complexes from the Biological Magnetic Resonance Data Bank (BMRB), we observe that while AF2 typically yields models with excellent consistency with experimental CSP data, applying enhanced sampling followed by data-guided conformational selection routinely results in ensembles of structures with improved agreement with NMR observables. For two systems, we cross-validate the CSP-selected models using independently acquired nuclear Overhauser effect (NOE) NMR data and demonstrate how CSP and NMR can be combined using our Bayesian framework for model selection. is a novel method for integrative modeling of protein-peptide complexes and has broad implications for studies of protein-peptide interactions and aiding in understanding their biological functions.
蛋白质与多肽的相互作用,包括那些涉及内在无序肽和蛋白质结合伴侣的内在无序区域的相互作用,对许多生物学功能至关重要。然而,蛋白质 - 肽复合物的实验结构测定可能具有挑战性。计算方法虽然很有前景,但通常需要实验数据进行验证和优化。在这里,我们提出了一种用于确定蛋白质 - 肽复合物结构的综合建模方法。该方法将AlphaFold2(AF2)增强采样方法与基于实验核磁共振(NMR)化学位移扰动(CSP)数据和AF2置信度指标的贝叶斯构象选择过程相结合。使用来自生物磁共振数据库(BMRB)的108个蛋白质 - 肽复合物的精选数据集,我们观察到,虽然AF2通常产生与实验CSP数据具有极好一致性的模型,但应用增强采样然后进行数据引导的构象选择通常会产生与NMR观测值具有更好一致性的结构集合。对于两个系统,我们使用独立获取的核Overhauser效应(NOE)NMR数据对CSP选择的模型进行交叉验证,并展示如何使用我们的贝叶斯框架进行模型选择将CSP和NMR结合起来。这是一种用于蛋白质 - 肽复合物综合建模的新方法,对蛋白质 - 肽相互作用的研究以及帮助理解其生物学功能具有广泛的意义。