Toropova Alla P, Toropov Andrey A, Roncaglioni Alessandra, Benfenati Emilio
Department of Environmental Health Science, Istituto di Ricerche Farmacologiche Mario Negri (IRCCS), Via Mario Negri 2, 20156 Milano, Italy.
Toxics. 2025 Apr 16;13(4):309. doi: 10.3390/toxics13040309.
The optimal descriptors generated by the CORAL software are studied as potential models of cardiotoxicity. Two significantly different cardiotoxicity databases are studied here. Database 1 contains 394 hERG inhibitors (pIC50) and external 200 substances that are potential drugs, which were used to confirm the predictive potential of the approach for Database 1. Database 2 contains cardiotoxicity data for 13864 different compounds in a format where active is denoted as 1 and inactive is denoted as 0. The same model-building algorithms were applied to all three databases using the Monte Carlo method and Las Vegas algorithm. The latter was used to rationally distribute the available data into training and validation sets. The Monte Carlo optimization for the correlation weights of different molecular features extracted from SMILES was improved by including the conformity coefficient of the correlation prediction (CCCP). This improvement provided greater predictive potential in the considered models.
研究了由CORAL软件生成的最佳描述符作为心脏毒性的潜在模型。这里研究了两个显著不同的心脏毒性数据库。数据库1包含394种hERG抑制剂(pIC50)和200种外部潜在药物物质,用于确认该方法对数据库1的预测潜力。数据库2包含13864种不同化合物的心脏毒性数据,其格式为活性表示为1,非活性表示为0。使用蒙特卡罗方法和拉斯维加斯算法将相同的模型构建算法应用于所有三个数据库。后者用于将可用数据合理地分配到训练集和验证集。通过纳入相关预测的一致性系数(CCCP),改进了从SMILES中提取的不同分子特征的相关权重的蒙特卡罗优化。这种改进在所考虑的模型中提供了更大的预测潜力。