Department of Computer Engineering, Gachon University, 5-22Ho, IT college, 1324 Seongnam-daero, Seongnam-si, 13120, South Korea.
BMC Bioinformatics. 2018 Nov 21;19(1):446. doi: 10.1186/s12859-018-2490-x.
Diverse interactions occur between biomolecules, such as activation, inhibition, expression, or repression. However, previous network-based studies of drug repositioning have employed interaction on the binary protein-protein interaction (PPI) network without considering the characteristics of the interactions. Recently, some studies of drug repositioning using gene expression data found that associations between drug and disease genes are useful information for identifying novel drugs to treat diseases. However, the gene expression profiles for drugs and diseases are not always available. Although gene expression profiles of drugs and diseases are available, existing methods cannot use the drugs or diseases, when differentially expressed genes in the profiles are not included in their network.
We developed a novel method for identifying candidate indications of existing drugs considering types of interactions between biomolecules based on known drug-disease associations. To obtain associations between drug and disease genes, we constructed a directed network using protein interaction and gene regulation data obtained from various public databases providing diverse biological pathways. The network includes three types of edges depending on relationships between biomolecules. To quantify the association between a target gene and a disease gene, we explored the shortest paths from the target gene to the disease gene and calculated the types and weights of the shortest paths. For each drug-disease pair, we built a vector consisting of values for each disease gene influenced by the drug. Using the vectors and known drug-disease associations, we constructed classifiers to identify novel drugs for each disease.
We propose a method for exploring candidate drugs of diseases using associations between drugs and disease genes derived from a directed gene network instead of gene regulation data obtained from gene expression profiles. Compared to existing methods that require information on gene relationships and gene expression data, our method can be applied to a greater number of drugs and diseases. Furthermore, to validate our predictions, we compared the predictions with drug-disease pairs in clinical trials using the hypergeometric test, which showed significant results. Our method also showed better performance compared to existing methods for the area under the receiver operating characteristic curve (AUC).
生物分子之间存在多种相互作用,如激活、抑制、表达或抑制。然而,以前基于网络的药物重定位研究采用了二元蛋白质-蛋白质相互作用(PPI)网络上的相互作用,而没有考虑相互作用的特征。最近,一些使用基因表达数据进行药物重定位的研究发现,药物和疾病基因之间的关联是识别治疗疾病的新型药物的有用信息。然而,并非总是可以获得药物和疾病的基因表达谱。尽管有药物和疾病的基因表达谱,但现有的方法在谱中差异表达的基因不包含在其网络中时,无法使用药物或疾病。
我们开发了一种基于已知药物-疾病关联,根据生物分子之间相互作用的类型识别现有药物候选适应症的新方法。为了获得药物和疾病基因之间的关联,我们使用从各种提供不同生物途径的公共数据库中获得的蛋白质相互作用和基因调控数据构建了一个有向网络。该网络包括三种类型的边,取决于生物分子之间的关系。为了量化靶基因与疾病基因之间的关联,我们探索了从靶基因到疾病基因的最短路径,并计算了最短路径的类型和权重。对于每个药物-疾病对,我们构建了一个由受药物影响的每个疾病基因的值组成的向量。使用这些向量和已知的药物-疾病关联,我们为每个疾病构建了一个分类器来识别新型药物。
我们提出了一种使用从有向基因网络中推导的药物与疾病基因之间的关联来探索疾病候选药物的方法,而不是使用从基因表达谱中获得的基因调控数据。与需要基因关系和基因表达数据信息的现有方法相比,我们的方法可以应用于更多的药物和疾病。此外,为了验证我们的预测,我们使用超几何检验将预测与临床试验中的药物-疾病对进行了比较,结果表明有显著差异。我们的方法在接收器操作特性曲线(AUC)下的面积也优于现有方法。