Wang Caihua, Liu Juan, Luo Fei, Deng Zixing, Hu Qian-Nan
BMC Syst Biol. 2015;9 Suppl 1(Suppl 1):S2. doi: 10.1186/1752-0509-9-S1-S2. Epub 2015 Jan 21.
Cell proliferation, differentiation, Gene expression, metabolism, immunization and signal transduction require the participation of ligands and targets. It is a great challenge to identify rules governing molecular recognition between chemical topological substructures of ligands and the binding sites of the targets.
We suppose that the ligand-target interactions are determined by ligand substructures as well as the physical-chemical properties of the binding sites. Therefore, we propose a fragment interaction model (FIM) to describe the interactions between ligands and targets, with the purpose of facilitating the chemical interpretation of ligand-target binding. First we extract target-ligand complexes from sc-PDB database, based on which, we get the target binding sites and the ligands. Then we represent each binding site as a fragment vector based on a target fragment dictionary that is composed of 199 clusters (denoted as fragements in this work) obtained by clustering 4200 trimers according to their physical-chemical properties. And then, we represent each ligand as a substructure vector based on a dictionary containing 747 substructures. Finally, we build the FIM by generating the interaction matrix M (representing the fragment interaction network), and the FIM can later be used for predicting unknown ligand-target interactions as well as providing the binding details of the interactions.
The five-fold cross validation results show that the proposed model can get higher AUC score (92%) than three prevalence algorithms CS-PD (80%), BLM-NII (85%) and RF (85%), demonstrating the remarkable predictive ability of FIM. We also show that the ligand binding sites (local information) overweight the sequence similarities (global information) in ligand-target binding, and introducing too much global information would be harmful to the predictive ability. Moreover, The derived fragment interaction network can provide the chemical insights on the interactions.
The target and ligand bindings are local events, and the local information dominate the binding ability. Though integrating of the global information can promote the predictive ability, the role is very limited. The fragment interaction network is helpful for understanding the mechanism of the ligand-target interaction.
细胞增殖、分化、基因表达、代谢、免疫和信号转导都需要配体和靶点的参与。识别配体的化学拓扑子结构与靶点结合位点之间的分子识别规则是一项巨大挑战。
我们假设配体 - 靶点相互作用由配体子结构以及结合位点的物理化学性质决定。因此,我们提出一种片段相互作用模型(FIM)来描述配体与靶点之间的相互作用,目的是促进对配体 - 靶点结合的化学解释。首先,我们从sc - PDB数据库中提取靶点 - 配体复合物,在此基础上,我们得到靶点结合位点和配体。然后,我们基于一个靶点片段字典将每个结合位点表示为一个片段向量,该字典由根据4200个三聚体的物理化学性质聚类得到的199个簇(在本工作中表示为片段)组成。接着,我们基于一个包含747个子结构的字典将每个配体表示为一个子结构向量。最后,我们通过生成相互作用矩阵M(代表片段相互作用网络)构建FIM,FIM随后可用于预测未知的配体 - 靶点相互作用以及提供相互作用的结合细节。
五重交叉验证结果表明,所提出的模型能够获得比三种流行算法CS - PD(80%)、BLM - NII(85%)和RF(85%)更高的AUC分数(92%),证明了FIM具有显著的预测能力。我们还表明,在配体 - 靶点结合中,配体结合位点(局部信息)比序列相似性(全局信息)更重要,引入过多全局信息会对预测能力有害。此外,推导得到的片段相互作用网络可以为相互作用提供化学见解。
靶点与配体的结合是局部事件,局部信息主导结合能力。虽然整合全局信息可以提高预测能力,但作用非常有限。片段相互作用网络有助于理解配体 - 靶点相互作用的机制。