Huang Yue, Rong Zhiwei, Zhang Liuchao, Xu Zhenyi, Ji Jianxin, He Jia, Liu Weisha, Hou Yan, Li Kang
Department of Biostatistics, School of Public Health, Harbin Medical University, Harbin, China.
Department of Biostatistics, School of Public Health, Peking University, Beijing, China.
Front Oncol. 2023 Jan 26;13:1047556. doi: 10.3389/fonc.2023.1047556. eCollection 2023.
The prediction of response to drugs before initiating therapy based on transcriptome data is a major challenge. However, identifying effective drug response label data costs time and resources. Methods available often predict poorly and fail to identify robust biomarkers due to the curse of dimensionality: high dimensionality and low sample size. Therefore, this necessitates the development of predictive models to effectively predict the response to drugs using limited labeled data while being interpretable. In this study, we report a novel Hierarchical Graph Random Neural Networks (HiRAND) framework to predict the drug response using transcriptome data of few labeled data and additional unlabeled data. HiRAND completes the information integration of the gene graph and sample graph by graph convolutional network (GCN). The innovation of our model is leveraging data augmentation strategy to solve the dilemma of limited labeled data and using consistency regularization to optimize the prediction consistency of unlabeled data across different data augmentations. The results showed that HiRAND achieved better performance than competitive methods in various prediction scenarios, including both simulation data and multiple drug response data. We found that the prediction ability of HiRAND in the drug vorinostat showed the best results across all 62 drugs. In addition, HiRAND was interpreted to identify the key genes most important to vorinostat response, highlighting critical roles for ribosomal protein-related genes in the response to histone deacetylase inhibition. Our HiRAND could be utilized as an efficient framework for improving the drug response prediction performance using few labeled data.
在开始治疗前基于转录组数据预测药物反应是一项重大挑战。然而,识别有效的药物反应标签数据既耗时又耗资源。由于维度灾难(高维度和低样本量),现有的方法预测效果往往不佳,并且无法识别稳健的生物标志物。因此,这就需要开发预测模型,以便在可解释的同时利用有限的标记数据有效地预测药物反应。在本研究中,我们报告了一种新颖的分层图随机神经网络(HiRAND)框架,用于使用少量标记数据和额外的未标记数据的转录组数据来预测药物反应。HiRAND通过图卷积网络(GCN)完成基因图和样本图的信息整合。我们模型的创新之处在于利用数据增强策略来解决标记数据有限的困境,并使用一致性正则化来优化未标记数据在不同数据增强下的预测一致性。结果表明,在各种预测场景中,包括模拟数据和多种药物反应数据,HiRAND都比竞争方法表现更好。我们发现,HiRAND在伏立诺他药物的预测能力在所有62种药物中表现最佳。此外,对HiRAND进行解释以识别对伏立诺他反应最重要的关键基因,突出了核糖体蛋白相关基因在组蛋白去乙酰化酶抑制反应中的关键作用。我们的HiRAND可以作为一个有效的框架,用于利用少量标记数据提高药物反应预测性能。