Department of Bioengineering, University of California Los Angeles, Los Angeles, CA, USA.
Department of Computer Science, University of Pisa, Pisa, Italy.
Integr Biol (Camb). 2022 Mar 21;14(1):13-24. doi: 10.1093/intbio/zyac002.
With high drug attrition, protein-protein interaction (PPI) network models are attractive as efficient methods for predicting drug outcomes by analyzing proteins downstream of drug targets. Unfortunately, these methods tend to overpredict associations and they have low precision and prediction performance; performance is often no better than random (AUROC ~0.5). Typically, PPI models identify ranked phenotypes associated with downstream proteins, yet methods differ in prioritization of downstream proteins. Most methods apply global approaches for assessing all phenotypes. We hypothesized that a per-phenotype analysis could improve prediction performance. We compared two global approaches-statistical and distance-based-and our novel per-phenotype approach, 'context-specific interaction' (CSI) analysis, on severe side effect prediction. We used a novel dataset of adverse events (or designated medical events, DMEs) and discovered that CSI had a 50% improvement over global approaches (AUROC 0.77 compared to 0.51), and a 76-95% improvement in average precision (0.499 compared to 0.284, 0.256). Our results provide a quantitative rationale for considering downstream proteins on a per-phenotype basis when using PPI network methods to predict drug phenotypes.
由于药物淘汰率高,蛋白质-蛋白质相互作用 (PPI) 网络模型通过分析药物靶点下游的蛋白质,成为预测药物结果的有效方法而备受关注。不幸的是,这些方法往往会过度预测关联,并且它们的精度和预测性能较低;性能通常不比随机更好(AUROC~0.5)。通常,PPI 模型会识别与下游蛋白质相关的排名表型,但下游蛋白质的优先级排序方法有所不同。大多数方法应用全局方法来评估所有表型。我们假设逐表型分析可以提高预测性能。我们比较了两种全局方法(统计和基于距离的方法)和我们新颖的逐表型方法“特定于上下文的相互作用”(CSI)分析,用于严重副作用预测。我们使用了一个新的不良事件(或指定的医疗事件,DME)数据集,并发现 CSI 比全局方法提高了 50%(AUROC 从 0.51 提高到 0.77),平均精度提高了 76-95%(从 0.284 提高到 0.499,0.256)。我们的结果为在使用 PPI 网络方法预测药物表型时,基于逐表型考虑下游蛋白质提供了定量依据。