Suppr超能文献

使用综合蛋白质化学计量学方法对丝氨酸蛋白酶的配体选择性进行建模可提高模型性能,并允许对特征进行多靶点依赖性解释。

Modelling ligand selectivity of serine proteases using integrative proteochemometric approaches improves model performance and allows the multi-target dependent interpretation of features.

作者信息

Ain Qurrat U, Méndez-Lucio Oscar, Ciriano Isidro Cortés, Malliavin Thérèse, van Westen Gerard J P, Bender Andreas

机构信息

Centre for Molecular Informatics, Department of Chemistry, Lensfield Road, CB2 1EW, University of Cambridge, UK.

出版信息

Integr Biol (Camb). 2014 Nov;6(11):1023-33. doi: 10.1039/c4ib00175c.

Abstract

Serine proteases, implicated in important physiological functions, have a high intra-family similarity, which leads to unwanted off-target effects of inhibitors with insufficient selectivity. However, the availability of sequence and structure data has now made it possible to develop approaches to design pharmacological agents that can discriminate successfully between their related binding sites. In this study, we have quantified the relationship between 12,625 distinct protease inhibitors and their bioactivity against 67 targets of the serine protease family (20,213 data points) in an integrative manner, using proteochemometric modelling (PCM). The benchmarking of 21 different target descriptors motivated the usage of specific binding pocket amino acid descriptors, which helped in the identification of active site residues and selective compound chemotypes affecting compound affinity and selectivity. PCM models performed better than alternative approaches (models trained using exclusively compound descriptors on all available data, QSAR) employed for comparison with R(2)/RMSE values of 0.64 ± 0.23/0.66 ± 0.20 vs. 0.35 ± 0.27/1.05 ± 0.27 log units, respectively. Moreover, the interpretation of the PCM model singled out various chemical substructures responsible for bioactivity and selectivity towards particular proteases (thrombin, trypsin and coagulation factor 10) in agreement with the literature. For instance, absence of a tertiary sulphonamide was identified to be responsible for decreased selective activity (by on average 0.27 ± 0.65 pChEMBL units) on FA10. Among the binding pocket residues, the amino acids (arginine, leucine and tyrosine) at positions 35, 39, 60, 93, 140 and 207 were observed as key contributing residues for selective affinity on these three targets.

摘要

丝氨酸蛋白酶参与重要的生理功能,其家族内部相似度很高,这导致选择性不足的抑制剂会产生不必要的脱靶效应。然而,序列和结构数据的可得性现在使得开发能够成功区分其相关结合位点的药理剂方法成为可能。在本研究中,我们使用蛋白质化学计量学建模(PCM),以综合方式量化了12,625种不同的蛋白酶抑制剂与其针对丝氨酸蛋白酶家族67个靶点的生物活性之间的关系(20,213个数据点)。对21种不同靶点描述符的基准测试促使使用特定的结合口袋氨基酸描述符,这有助于识别影响化合物亲和力和选择性的活性位点残基和选择性化合物化学类型。与使用所有可用数据上的纯化合物描述符训练的替代方法(QSAR)相比,PCM模型表现更好,R(2)/RMSE值分别为0.64±0.23/0.66±0.20对0.35±0.27/1.05±0.27对数单位。此外,PCM模型的解释确定了各种负责对特定蛋白酶(凝血酶、胰蛋白酶和凝血因子10)的生物活性和选择性的化学子结构,这与文献一致。例如,发现缺少叔磺酰胺会导致对FA10的选择性活性降低(平均降低0.27±0.65 pChEMBL单位)。在结合口袋残基中,观察到35、39、60、93、140和207位的氨基酸(精氨酸、亮氨酸和酪氨酸)是这三个靶点选择性亲和力的关键贡献残基。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验