Department of Mathematics, Shanghai Normal University, Shanghai, China.
School of Mathematics and Statistics, Hainan Normal University, Haikou, China.
PLoS One. 2018 Oct 5;13(10):e0205155. doi: 10.1371/journal.pone.0205155. eCollection 2018.
Drug response prediction is a critical step for personalized treatment of cancer patients and ultimately leads to precision medicine. A lot of machine-learning based methods have been proposed to predict drug response from different types of genomic data. However, currently available methods could only give a "point" prediction of drug response value but fail to provide the reliability and distribution of the prediction, which are of equal interest in clinical practice. In this paper, we proposed a method based on quantile regression forest and applied it to the CCLE dataset. Through the out-of-bag validation, our method achieved much higher prediction accuracy of drug response than other available tools. The assessment of prediction reliability by prediction intervals and its significance in personalized medicine were illustrated by several examples. Functional analysis of selected drug response associated genes showed that the proposed method achieves more biologically plausible results.
药物反应预测是癌症患者个体化治疗的关键步骤,最终导致精准医学。已经提出了许多基于机器学习的方法来从不同类型的基因组数据中预测药物反应。然而,目前可用的方法只能对药物反应值进行“点”预测,但无法提供预测的可靠性和分布,这在临床实践中同样具有重要意义。在本文中,我们提出了一种基于分位数回归森林的方法,并将其应用于 CCLE 数据集。通过袋外验证,我们的方法在药物反应预测方面的准确性明显高于其他可用工具。通过预测区间评估预测可靠性及其在个性化医学中的意义,并通过几个示例进行了说明。对选定的药物反应相关基因的功能分析表明,该方法得出的结果更具生物学意义。