SASTRA Deemed to be University, Thanjavur, Tamilnadu 613401, India.
SASTRA Deemed to be University, Thanjavur, Tamilnadu 613401, India.
Comput Methods Programs Biomed. 2022 Sep;224:107027. doi: 10.1016/j.cmpb.2022.107027. Epub 2022 Jul 20.
The prediction of multiple drug efficacies using machine learning prediction techniques based on clinical and molecular attributes of tumors is a new approach in the field of precision medicine of oncology. The selection of suitable and effective therapeutic drugs among the potential drugs is performed computationally considering the tumor features. In this study, we developed and validated machine learning models to predict the efficacy of five anti-cancer drugs according to the clinical and molecular attributes of 30 oral squamous cell carcinoma (OSCC) cohorts. This sounds a bit odd - consider: Ranking of the drugs was achieved using their apoptotic priming.
We developed multiple drug efficacy prediction models based on three types of tumor characteristics by applying machine learning methods, including multi-target regression (MTR) and support vector regression (SVR). The prediction accuracy of existing machine learning methods was enhanced by introducing novel pre-processing techniques to develop Enhanced MTR (E_MTR), Enhanced Log-based MTR (EL_MTR), Enhanced Multi-target SVR (EM_SVR), and Enhanced Log-based Multi-target SVR (ELM_SVR). As a unique capability, ELM_SVR and EL_MTR rank the drugs based on their predicted efficacy. All the drug efficacy prediction models were built using OSCC real samples and theoretical samples. The best model was selected was based on dataset size and evaluation metrics, such as error terms, residuals and parameter tuning, and cross-validated (CV) using 30 real samples and 340 theoretical samples.
When 30 real tumor samples were used for the train-test and CV methods, MTR models predicted the efficacy with less error than SVR models. Comparatively, using 340 theoretical samples for the train-test and CV methods, though MTR improved the performance, SVR predicted the efficacy with zero error. We found that, for small samples, the proposed MTR provided a 0.01 difference between actual apoptotic priming and predicted priming of five drugs. For large samples, the predicted values by the proposed SVR had a difference of 0.00001. The error terms (Actual vs. Predicted) also reveal that the enhanced log model is suitable when MTR is applied. Meanwhile, the enhanced model is suitable for SVR learning for multiple drug efficacy prediction. It was found that the predicted ranks of the drugs based on the multi-targeted efficacy prediction exactly match the actual rankings.
We developed efficient statistical and machine learning models using MTR and SVR analysis for anticancer drug efficacy, which will be useful in the field of precision medicine to choose the most suitable drugs in personalized manner. The performance results of the proposed enhanced ranking techniques are described as follows: i) EL_MTR is the best to predict multiple anticancer drug efficacies and improve the accuracy of ranking drugs, irrespective of sample size; and ii) ELM_SVR performs better than other MTR models with a large sample size and precise ranking process.
基于肿瘤临床和分子特征,利用机器学习预测技术预测多种药物疗效是肿瘤精准医学领域的一种新方法。考虑到肿瘤特征,通过计算在潜在药物中选择合适且有效的治疗药物。在这项研究中,我们开发并验证了机器学习模型,根据 30 个口腔鳞状细胞癌(OSCC)队列的临床和分子特征,预测五种抗癌药物的疗效。这听起来有点奇怪 - 考虑一下:根据细胞凋亡引发作用对药物进行排序。
我们通过应用机器学习方法(包括多目标回归(MTR)和支持向量回归(SVR)),基于三种肿瘤特征开发了多种药物疗效预测模型。通过引入新颖的预处理技术,开发了增强型 MTR(E_MTR)、增强型基于对数的 MTR(EL_MTR)、增强型多目标 SVR(EM_SVR)和增强型基于对数的多目标 SVR(ELM_SVR),提高了现有机器学习方法的预测准确性。作为一种独特的能力,ELM_SVR 和 EL_MTR 根据预测的疗效对药物进行排序。所有的药物疗效预测模型都是使用 OSCC 真实样本和理论样本构建的。根据数据集大小和评估指标(如误差项、残差和参数调整),选择最佳模型,并使用 30 个真实样本和 340 个理论样本进行交叉验证(CV)。
当使用 30 个真实肿瘤样本进行训练-测试和 CV 方法时,MTR 模型预测的疗效误差小于 SVR 模型。相比之下,当使用 340 个理论样本进行训练-测试和 CV 方法时,尽管 MTR 提高了性能,但 SVR 预测的疗效没有误差。我们发现,对于小样本,所提出的 MTR 在五种药物的实际细胞凋亡引发作用和预测引发作用之间提供了 0.01 的差异。对于大样本,所提出的 SVR 的预测值差异为 0.00001。误差项(实际值与预测值)还表明,在应用 MTR 时,增强对数模型是合适的。同时,增强模型适用于 SVR 学习以进行多种药物疗效预测。发现基于多靶向疗效预测的药物排序与实际排序完全匹配。
我们使用 MTR 和 SVR 分析开发了有效的统计和机器学习模型,用于抗癌药物疗效预测,这将有助于精准医学领域以个性化方式选择最合适的药物。所提出的增强排序技术的性能结果如下:i)EL_MTR 是预测多种抗癌药物疗效和提高药物排序准确性的最佳方法,无论样本大小如何;ii)ELM_SVR 在大样本量和精确排序过程中表现优于其他 MTR 模型。