Department of Bioinformatics, Pirogov Russian National Research Medical University, Moscow, Russia.
Department of Bioinformatics, Institute of Biomedical Chemistry, Moscow, Russia.
SAR QSAR Environ Res. 2024 Jan;35(1):1-9. doi: 10.1080/1062936X.2023.2289050. Epub 2024 Jan 29.
In silico prediction of cell line cytotoxicity considerably decreases time and financial costs during drug development of new antineoplastic agents. (Q)SAR models for the prediction of drug-like compound cytotoxicity in relation to nine breast cancer cell lines (T47D, ZR-75-1, MX1, Hs-578T, MCF7-DOX, MCF7, Bcap37, MCF7R, BT-20) were created by GUSAR software based on the data from ChEMBL database (v. 30). The separate datasets related with IC and IG values were used for the creation of (Q)SAR models for each cell line. Based on leave-one-out and 5F CV procedures, 24 reasonable (Q)SAR models were selected for the creation of a freely available web-application (BC CLC-Pred: https://www.way2drug.com/bc/) to predict substance cytotoxicity in relation to human breast cancer cell lines. The mean accuracies of prediction , RMSE, Balance Accuracy for the selected (Q)SAR models calculated by 5F CV were 0.599, 0.679 and 0.875, respectively. As a result, BC CLC-Pred provides simultaneous quantitative and qualitative predictions of IC and IG values for most of the nine breast cancer cell lines, which may be helpful in selecting promising compounds and optimizing lead compounds during the development of new antineoplastic agents against breast cancer.
基于 ChEMBL 数据库(v. 30)的数据,GUSAR 软件创建了用于预测 9 种乳腺癌细胞系(T47D、ZR-75-1、MX1、Hs-578T、MCF7-DOX、MCF7、Bcap37、MCF7R、BT-20)中药物样化合物细胞毒性的(QSAR)模型。分别使用与 IC 和 IG 值相关的数据集来为每个细胞系创建(QSAR)模型。基于留一法和 5F CV 程序,选择了 24 个合理的(QSAR)模型,用于创建一个免费的网络应用程序(BC CLC-Pred:https://www.way2drug.com/bc/),以预测与人类乳腺癌细胞系相关的物质细胞毒性。通过 5F CV 计算的选定(QSAR)模型的平均预测准确性、RMSE 和平衡准确性分别为 0.599、0.679 和 0.875。因此,BC CLC-Pred 为大多数 9 种乳腺癌细胞系提供了 IC 和 IG 值的同时定量和定性预测,这可能有助于在开发针对乳腺癌的新型抗肿瘤药物时选择有前途的化合物和优化先导化合物。