Chen Bo-Juen, Litvin Oren, Ungar Lyle, Pe'er Dana
Department of Biomedical Informatics, Columbia University, New York, New York, 10032, United States of America; Department of Biological Sciences, Department of Systems Biology, Columbia University, New York, New York, 10027, United States of America.
Department of Biological Sciences, Department of Systems Biology, Columbia University, New York, New York, 10027, United States of America.
PLoS One. 2015 Aug 14;10(8):e0133850. doi: 10.1371/journal.pone.0133850. eCollection 2015.
Recent screening of drug sensitivity in large panels of cancer cell lines provides a valuable resource towards developing algorithms that predict drug response. Since more samples provide increased statistical power, most approaches to prediction of drug sensitivity pool multiple cancer types together without distinction. However, pan-cancer results can be misleading due to the confounding effects of tissues or cancer subtypes. On the other hand, independent analysis for each cancer-type is hampered by small sample size. To balance this trade-off, we present CHER (Contextual Heterogeneity Enabled Regression), an algorithm that builds predictive models for drug sensitivity by selecting predictive genomic features and deciding which ones should-and should not-be shared across different cancers, tissues and drugs. CHER provides significantly more accurate models of drug sensitivity than comparable elastic-net-based models. Moreover, CHER provides better insight into the underlying biological processes by finding a sparse set of shared and type-specific genomic features.
最近在大量癌细胞系中进行的药物敏感性筛选为开发预测药物反应的算法提供了宝贵资源。由于更多样本能提供更强的统计效力,大多数预测药物敏感性的方法不加区分地将多种癌症类型合并在一起。然而,由于组织或癌症亚型的混杂效应,泛癌结果可能会产生误导。另一方面,每种癌症类型的独立分析因样本量小而受到阻碍。为了平衡这种权衡,我们提出了CHER(上下文异质性增强回归)算法,该算法通过选择预测性基因组特征并决定哪些特征应在不同癌症、组织和药物之间共享以及哪些不应共享,来构建药物敏感性预测模型。与基于弹性网络的可比模型相比,CHER能提供显著更准确的药物敏感性模型。此外,CHER通过找到一组稀疏的共享和特定类型的基因组特征,能更好地洞察潜在的生物学过程。