Picard Milan, Scott-Boyer Marie-Pier, Bodein Antoine, Leclercq Mickaël, Prunier Julien, Périn Olivier, Droit Arnaud
Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada.
Digital Transformation and Innovation Department, L'Oréal Advanced Research, Aulnay-sous-bois, France.
Comput Struct Biotechnol J. 2024 Jun 15;24:464-475. doi: 10.1016/j.csbj.2024.06.012. eCollection 2024 Dec.
The discovery of novel therapeutic targets, defined as proteins which drugs can interact with to induce therapeutic benefits, typically represent the first and most important step of drug discovery. One solution for target discovery is target repositioning, a strategy which relies on the repurposing of known targets for new diseases, leading to new treatments, less side effects and potential drug synergies. Biological networks have emerged as powerful tools for integrating heterogeneous data and facilitating the prediction of biological or therapeutic properties. Consequently, they are widely employed to predict new therapeutic targets by characterizing potential candidates, often based on their interactions within a Protein-Protein Interaction (PPI) network, and their proximity to genes associated with the disease. However, over-reliance on PPI networks and the assumption that potential targets are necessarily near known genes can introduce biases that may limit the effectiveness of these methods. This study addresses these limitations in two ways. First, by exploiting a multi-layer network which incorporates additional information such as gene regulation, metabolite interactions, metabolic pathways, and several disease signatures such as Differentially Expressed Genes, mutated genes, Copy Number Alteration, and structural variants. Second, by extracting relevant features from the network using several approaches including proximity to disease-associated genes, but also unbiased approaches such as propagation-based methods, topological metrics, and module detection algorithms. Using prostate cancer as a case study, the best features were identified and utilized to train machine learning algorithms to predict 5 novel promising therapeutic targets for prostate cancer: IGF2R, C5AR, RAB7, SETD2 and NPBWR1.
新型治疗靶点的发现,即药物可与之相互作用以产生治疗益处的蛋白质,通常是药物发现的首要且最重要的步骤。靶点发现的一种解决方案是靶点重新定位,这是一种将已知靶点用于新疾病的策略,可带来新的治疗方法、更少的副作用以及潜在的药物协同作用。生物网络已成为整合异构数据和促进生物或治疗特性预测的强大工具。因此,它们被广泛用于通过表征潜在候选物来预测新的治疗靶点,通常基于它们在蛋白质 - 蛋白质相互作用(PPI)网络中的相互作用以及它们与疾病相关基因的接近程度。然而,过度依赖PPI网络以及认为潜在靶点必然靠近已知基因的假设可能会引入偏差,从而限制这些方法的有效性。本研究通过两种方式解决这些局限性。首先,利用多层网络,该网络纳入了额外信息,如基因调控、代谢物相互作用、代谢途径以及几种疾病特征,如差异表达基因、突变基因、拷贝数改变和结构变异。其次,通过多种方法从网络中提取相关特征,包括与疾病相关基因的接近程度,但也包括无偏差方法,如基于传播的方法、拓扑度量和模块检测算法。以前列腺癌为例,识别出最佳特征并用于训练机器学习算法,以预测5个新的有前景的前列腺癌治疗靶点:IGF2R、C5AR、RAB7、SETD2和NPBWR1。