Taj Farzan, Stein Lincoln D
Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A1, Canada.
Adaptive Oncology, Ontario Institute for Cancer Research, Toronto, ON M5G 0A3, Canada.
Bioinform Adv. 2024 Jan 20;4(1):vbae010. doi: 10.1093/bioadv/vbae010. eCollection 2024.
A major challenge in cancer care is that patients with similar demographics, tumor types, and medical histories can respond quite differently to the same drug regimens. This difference is largely explained by genetic and other molecular variabilities among the patients and their cancers. Efforts in the pharmacogenomics field are underway to understand better the relationship between the genome of the patient's healthy and tumor cells and their response to therapy. To advance this goal, research groups and consortia have undertaken large-scale systematic screening of panels of drugs across multiple cancer cell lines that have been molecularly profiled by genomics, proteomics, and similar techniques. These large data drug screening sets have been applied to the problem of drug response prediction (DRP), the challenge of predicting the response of a previously untested drug/cell-line combination. Although deep learning algorithms outperform traditional methods, there are still many challenges in DRP that ultimately result in these models' low generalizability and hampers their clinical application.
In this article, we describe a novel algorithm that addresses the major shortcomings of current DRP methods by combining multiple cell line characterization data, addressing drug response data skewness, and improving chemical compound representation.
MMDRP is implemented as an open-source, Python-based, command-line program and is available at https://github.com/LincolnSteinLab/MMDRP.
癌症治疗中的一个主要挑战是,具有相似人口统计学特征、肿瘤类型和病史的患者对相同的药物治疗方案可能有截然不同的反应。这种差异在很大程度上可以由患者及其癌症之间的基因和其他分子变异性来解释。药物基因组学领域正在努力更好地理解患者健康细胞和肿瘤细胞的基因组与其对治疗的反应之间的关系。为了推进这一目标,研究团队和联盟对通过基因组学、蛋白质组学和类似技术进行分子特征分析的多种癌细胞系进行了大规模的系统药物筛选。这些大规模数据药物筛选集已应用于药物反应预测(DRP)问题,即预测先前未经测试的药物/细胞系组合反应的挑战。尽管深度学习算法优于传统方法,但DRP仍存在许多挑战,最终导致这些模型的泛化能力较低,并阻碍了它们的临床应用。
在本文中,我们描述了一种新颖的算法,该算法通过结合多种细胞系特征数据、解决药物反应数据偏态问题以及改进化合物表示来解决当前DRP方法的主要缺点。
MMDRP作为一个基于Python的开源命令行程序实现,可在https://github.com/LincolnSteinLab/MMDRP上获取。