Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA.
Present address: Bioinformatics Research and Development Laboratory, Genomic Sciences and Precision Medicine Center, Medical College of Wisconsin, Milwaukee, WI, USA.
BMC Med Genomics. 2019 Jan 31;12(Suppl 1):15. doi: 10.1186/s12920-018-0449-4.
Predicting cellular responses to drugs has been a major challenge for personalized drug therapy regimen. Recent pharmacogenomic studies measured the sensitivities of heterogeneous cell lines to numerous drugs, and provided valuable data resources to develop and validate computational approaches for the prediction of drug responses. Most of current approaches predict drug sensitivity by building prediction models with individual genes, which suffer from low reproducibility due to biologic variability and difficulty to interpret biological relevance of novel gene-drug associations. As an alternative, pathway activity scores derived from gene expression could predict drug response of cancer cells.
In this study, pathway-based prediction models were built with four approaches inferring pathway activity in unsupervised manner, including competitive scoring approaches (DiffRank and GSVA) and self-contained scoring approaches (PLAGE and Z-score). These unsupervised pathway activity inference approaches were applied to predict drug responses of cancer cells using data from Cancer Cell Line Encyclopedia (CCLE).
Our analysis on all the 24 drugs from CCLE demonstrated that pathway-based models achieved better predictions for 14 out of the 24 drugs, while taking fewer features as inputs. Further investigation on indicated that pathway-based models indeed captured pathways involving drug-related genes (targets, transporters and metabolic enzymes) for majority of drugs, whereas gene-models failed to identify these drug-related genes, in most cases. Among the four approaches, competitive scoring (DiffRank and GSVA) provided more accurate predictions and captured more pathways involving drug-related genes than self-contained scoring (PLAGE and Z-Score). Detailed interpretation of top pathways from the top method (DiffRank) highlights the merit of pathway-based approaches to predict drug response by identifying pathways relevant to drug mechanisms.
Taken together, pathway-based modeling with inferred pathway activity is a promising alternative to predict drug response, with the ability to easily interpret results and provide biological insights into the mechanisms of drug actions.
预测细胞对药物的反应一直是个性化药物治疗方案的主要挑战。最近的药物基因组学研究测量了异质细胞系对众多药物的敏感性,并提供了有价值的数据资源,用于开发和验证预测药物反应的计算方法。目前大多数方法都是通过构建基于个体基因的预测模型来预测药物敏感性,但由于生物学变异性和难以解释新的基因-药物关联的生物学相关性,这些模型的可重复性较低。作为替代方案,基于基因表达的途径活性评分可以预测癌细胞对药物的反应。
在这项研究中,我们使用四种无监督方法推断途径活性,包括竞争评分方法(DiffRank 和 GSVA)和自包含评分方法(PLAGE 和 Z 分数),构建了基于途径的预测模型。这些无监督途径活性推断方法应用于使用癌症细胞系百科全书(CCLE)的数据预测癌细胞对 24 种药物的反应。
我们对 CCLE 中所有 24 种药物的分析表明,基于途径的模型在 24 种药物中的 14 种药物的预测中表现更好,同时作为输入的特征更少。进一步的研究表明,基于途径的模型确实捕捉到了涉及药物相关基因(靶标、转运体和代谢酶)的途径,而基因模型在大多数情况下无法识别这些药物相关基因。在这四种方法中,竞争评分(DiffRank 和 GSVA)比自包含评分(PLAGE 和 Z 分数)提供了更准确的预测结果,并捕获了更多涉及药物相关基因的途径。从最佳方法(DiffRank)中提取的前几个途径的详细解释突出了基于途径的方法通过识别与药物机制相关的途径来预测药物反应的优势。
综上所述,推断途径活性的基于途径的建模是一种很有前途的替代方案,可以轻松解释结果,并为药物作用机制提供生物学见解。