School of mathematics and physics, Anhui Jianzhu University, Hefei, China.
School of mathematics and physics, Anhui Jianzhu University, Hefei, China; Institute of Intelligent Machines, Hefei Institutes of Physical Science, CAS, Hefei, China.
Artif Intell Med. 2024 Jun;152:102864. doi: 10.1016/j.artmed.2024.102864. Epub 2024 Apr 2.
Predicting the response of tumor cells to anti-tumor drugs is critical to realizing cancer precision medicine. Currently, most existing methods ignore the regulatory relationships between genes and thus have unsatisfactory predictive performance. In this paper, we propose to predict anti-tumor drug efficacy via learning the activity representation of tumor cells based on a priori knowledge of gene regulation networks (GRNs). Specifically, the method simulates the cellular biosystem by synthesizing a cell-gene activity network and then infers a new low-dimensional activity representation for tumor cells from the raw high-dimensional expression profile. The simulated cell-gene network mainly comprises known gene regulatory networks collected from multiple resources and fuses tumor cells by linking them to hotspot genes that are over- or under-expressed in them. The resulting activity representation could not only reflect the shallow expression profile (hotspot genes) but also mines in-depth information of gene regulation activity in tumor cells before treatment. Finally, we build deep learning models on the activity representation for predicting drug efficacy in tumor cells. Experimental results on the benchmark GDSC dataset demonstrate the superior performance of the proposed method over SOTA methods with the highest AUC of 0.954 in the efficacy label prediction and the best R of 0.834 in the regression of half maximal inhibitory concentration (IC50) values, suggesting the potential value of the proposed method in practice.
预测肿瘤细胞对肿瘤药物的反应对于实现癌症精准医疗至关重要。目前,大多数现有的方法都忽略了基因之间的调控关系,因此预测性能并不理想。在本文中,我们提出通过学习基于基因调控网络(GRN)先验知识的肿瘤细胞活性表示来预测抗肿瘤药物的疗效。具体来说,该方法通过合成细胞-基因活性网络来模拟细胞生物系统,然后从原始的高维表达谱中推断肿瘤细胞的新的低维活性表示。模拟的细胞-基因网络主要包含从多个资源中收集的已知基因调控网络,并通过将它们与过度或低表达的热点基因连接起来融合肿瘤细胞。由此产生的活性表示不仅可以反映浅层的表达谱(热点基因),还可以挖掘治疗前肿瘤细胞中基因调控活性的深入信息。最后,我们在活性表示上构建深度学习模型来预测肿瘤细胞中的药物疗效。在基准 GDSC 数据集上的实验结果表明,与 SOTA 方法相比,该方法具有卓越的性能,在疗效标签预测中最高 AUC 为 0.954,在半最大抑制浓度(IC50)值的回归中最佳 R 为 0.834,表明该方法在实际应用中具有潜在价值。