Institute of Physics, Polish Academy of Sciences, Al. Lotników 32/46, 02-668 Warsaw, Poland.
Molecules. 2024 Jun 11;29(12):2768. doi: 10.3390/molecules29122768.
Galectin-3 is a protein involved in many intra- and extra-cellular processes. It has been identified as a diagnostic or prognostic biomarker for certain types of heart disease, kidney disease and cancer. Galectin-3 comprises a carbohydrate recognition domain (CRD) and an N-terminal domain (NTD), which is unstructured and contains eight collagen-like Pro-Gly-rich tandem repeats. While the structure of the CRD has been solved using protein crystallography, current knowledge about conformations of full-length galectin-3 is limited. To fill in this knowledge gap, we performed molecular dynamics (MD) simulations of full-length galectin-3. We systematically re-scaled the solute-solvent interactions in the Martini 3 force field to obtain the best possible agreement between available data from SAXS experiments and the ensemble of conformations generated in the MD simulations. The simulation conformations were found to be very diverse, as reflected, e.g., by (i) large fluctuations in the radius of gyration, ranging from about 2 to 5 nm, and (ii) multiple transient contacts made by amino acid residues in the NTD. Consistent with evidence from NMR experiments, contacts between the CRD and NTD were observed to not involve the carbohydrate-binding site on the CRD surface. Contacts within the NTD were found to be made most frequently by aromatic residues. Formation of fuzzy complexes with unspecific stoichiometry was observed to be mediated mostly by the NTD. Taken together, we offer a detailed picture of the conformational ensemble of full-length galectin-3, which will be important for explaining the biological functions of this protein at the molecular level.
半乳糖凝集素-3 是一种参与多种细胞内和细胞外过程的蛋白质。它已被确定为某些类型心脏病、肾病和癌症的诊断或预后生物标志物。半乳糖凝集素-3 由一个碳水化合物识别结构域 (CRD) 和一个无结构的 N 端结构域 (NTD) 组成,该 NTD 包含八个胶原样脯氨酸-甘氨酸丰富的串联重复序列。虽然 CRD 的结构已通过蛋白质晶体学解决,但目前对半乳糖凝集素-3 全长构象的了解有限。为了填补这一知识空白,我们对半乳糖凝集素-3 全长进行了分子动力学 (MD) 模拟。我们系统地重新调整了 Martini 3 力场中的溶剂-溶质相互作用,以获得 SAXS 实验数据和 MD 模拟生成的构象集合之间的最佳一致性。模拟构象非常多样化,例如,(i)旋轮半径的波动很大,范围约为 2 至 5nm,(ii)NTD 中的氨基酸残基发生多次瞬时接触。与 NMR 实验证据一致,CRD 和 NTD 之间的接触不涉及 CRD 表面的碳水化合物结合位点。发现 NTD 内的接触最常由芳香族残基形成。形成无特异性化学计量的模糊复合物主要由 NTD 介导。总之,我们提供了全长半乳糖凝集素-3 构象集合的详细图片,这对于在分子水平上解释该蛋白质的生物学功能非常重要。