College of Computer Science, Huanggang Normal University, Huanggang, 438000, China.
Department of Pathology, Faculty of Veterinary and Animal Sciences, The Islamia University of Bahawalpur, Bahawalpur, Pakistan.
Mol Divers. 2024 Aug;28(4):1849-1868. doi: 10.1007/s11030-023-10690-y. Epub 2023 Jul 7.
The role of NLRP3 inflammasome in innate immunity is newly recognized. The NLRP3 protein is a family of nucleotide-binding and oligomerization domain-like receptors as well as a pyrin domain-containing protein. It has been shown that NLRP3 may contribute to the development and progression of a variety of diseases, such as multiple sclerosis, metabolic disorders, inflammatory bowel disease, and other auto-immune and auto-inflammatory conditions. The use of machine learning methods in pharmaceutical research has been widespread for several decades. An important objective of this study is to apply machine learning approaches for the multinomial classification of NLRP3 inhibitors. However, data imbalances can affect machine learning. Therefore, a synthetic minority oversampling technique (SMOTE) has been developed to increase the sensitivity of classifiers to minority groups. The QSAR modelling was performed using 154 molecules retrieved from the ChEMBL database (version 29). The accuracy of the multiclass classification top six models was found to fall within ranges of 0.99 to 0.86, and log loss ranges of 0.2 to 2.3, respectively. The results showed that the receiver operating characteristic curve (ROC) plot values significantly improved when tuning parameters were adjusted and imbalanced data was handled. Moreover, the results demonstrated that SMOTE offers a significant advantage in handling imbalanced datasets as well as substantial improvements in overall accuracy of machine learning models. The top models were then used to predict data from unseen datasets. In summary, these QSAR classification models exhibited robust statistical results and were interpretable, which strongly supported their use for rapid screening of NLRP3 inhibitors.
NLRP3 炎性小体在先天免疫中的作用是新发现的。NLRP3 蛋白是核苷酸结合和寡聚结构域样受体家族以及含有 pyrin 结构域的蛋白。已经表明,NLRP3 可能有助于多种疾病的发展和进展,如多发性硬化症、代谢紊乱、炎症性肠病和其他自身免疫和自身炎症性疾病。几十年来,机器学习方法在药物研究中的应用已经非常广泛。本研究的一个重要目标是应用机器学习方法对 NLRP3 抑制剂进行多项分类。然而,数据不平衡会影响机器学习。因此,开发了一种合成少数群体过采样技术 (SMOTE) 来提高分类器对少数群体的敏感性。使用从 ChEMBL 数据库(版本 29)中检索到的 154 个分子进行了 QSAR 建模。发现六级多类分类模型的准确性范围在 0.99 到 0.86 之间,对数损失范围在 0.2 到 2.3 之间。结果表明,调整参数并处理不平衡数据后,接收器操作特性曲线 (ROC) 图的值显著提高。此外,结果表明 SMOTE 在处理不平衡数据集方面具有显著优势,并且可以大大提高机器学习模型的整体准确性。然后使用顶级模型来预测来自未见数据集的数据。总之,这些 QSAR 分类模型表现出稳健的统计结果和可解释性,强烈支持它们用于快速筛选 NLRP3 抑制剂。