Center for Safe Artificial Intelligence for Healthcare, School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, USA.
Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland.
J Am Med Inform Assoc. 2021 Jan 15;28(1):42-51. doi: 10.1093/jamia/ocaa212.
Drug combination screening has advantages in identifying cancer treatment options with higher efficacy without degradation in terms of safety. A key challenge is that the accumulated number of observations in in-vitro drug responses varies greatly among different cancer types, where some tissues are more understudied than the others. Thus, we aim to develop a drug synergy prediction model for understudied tissues as a way of overcoming data scarcity problems.
We collected a comprehensive set of genetic, molecular, phenotypic features for cancer cell lines. We developed a drug synergy prediction model based on multitask deep neural networks to integrate multimodal input and multiple output. We also utilized transfer learning from data-rich tissues to data-poor tissues.
We showed improved accuracy in predicting synergy in both data-rich tissues and understudied tissues. In data-rich tissue, the prediction model accuracy was 0.9577 AUROC for binarized classification task and 174.3 mean squared error for regression task. We observed that an adequate transfer learning strategy significantly increases accuracy in the understudied tissues.
Our synergy prediction model can be used to rank synergistic drug combinations in understudied tissues and thus help to prioritize future in-vitro experiments. Code is available at https://github.com/yejinjkim/synergy-transfer.
药物组合筛选在识别具有更高疗效而安全性不降低的癌症治疗方案方面具有优势。一个关键的挑战是,不同癌症类型之间的体外药物反应累积观察数量差异很大,其中一些组织的研究比其他组织更少。因此,我们旨在开发一种针对研究较少组织的药物协同预测模型,以克服数据匮乏问题。
我们收集了一套全面的癌症细胞系的遗传、分子、表型特征。我们开发了一种基于多任务深度神经网络的药物协同预测模型,以整合多模态输入和多个输出。我们还利用从数据丰富的组织到数据匮乏的组织的迁移学习。
我们在数据丰富的组织和研究较少的组织中都显示出了提高协同预测准确性的效果。在数据丰富的组织中,用于二分类任务的预测模型准确性为 0.9577 AUROC,用于回归任务的准确性为 174.3 均方误差。我们观察到,适当的迁移学习策略显著提高了研究较少组织的准确性。
我们的协同预测模型可用于对研究较少的组织中的协同药物组合进行排序,从而有助于确定未来的体外实验优先级。代码可在 https://github.com/yejinjkim/synergy-transfer 上获得。