Department of Computer Sciences, Faculty of Informatics, Camagüey University, 74650, Camagüey City, Cuba.
Universidad Tecnológica Metropolitana (UTEM), 8940577, Santiago, Chile.
Mol Divers. 2024 Aug;28(4):1983-1994. doi: 10.1007/s11030-023-10638-2. Epub 2023 Apr 5.
Ubiquitin-proteasome system (UPS) is a highly regulated mechanism of intracellular protein degradation and turnover. The UPS is involved in different biological activities, such as the regulation of gene transcription and cell cycle. Several researchers have applied cheminformatics and artificial intelligence methods to study the inhibition of proteasomes, including the prediction of UPP inhibitors. Following this idea, we applied a new tool for obtaining molecular descriptors (MDs) for modeling proteasome Inhibition in terms of EC (µmol/L), in which a set of new MDs called atomic weighted vectors (AWV) and several prediction algorithms were used in cheminformatics studies. In the manuscript, a set of descriptors based on AWV are presented as datasets for training different machine learning techniques, such as linear regression, multiple linear regression (MLR), random forest (RF), K-nearest neighbors (IBK), multi-layer perceptron, best-first search, and genetic algorithm. The results suggest that these atomic descriptors allow adequate modeling of proteasome inhibitors despite artificial intelligence techniques, as a variant to build efficient models for the prediction of inhibitory activity.
泛素-蛋白酶体系统 (UPS) 是一种高度调控的细胞内蛋白质降解和周转机制。UPS 参与多种生物活性,如基因转录和细胞周期的调节。一些研究人员已经应用化学信息学和人工智能方法来研究蛋白酶体的抑制,包括预测 UPP 抑制剂。受此启发,我们应用了一种新的获得分子描述符 (MD) 的工具,用于根据 EC(µmol/L)来模拟蛋白酶体抑制,其中使用了一组新的称为原子加权向量 (AWV) 的 MD 和几种预测算法进行化学信息学研究。在本文中,提出了一组基于 AWV 的描述符作为数据集,用于训练不同的机器学习技术,如线性回归、多元线性回归 (MLR)、随机森林 (RF)、K-最近邻 (IBK)、多层感知机、最佳优先搜索和遗传算法。结果表明,尽管使用了人工智能技术,但这些原子描述符允许对蛋白酶体抑制剂进行充分的建模,作为构建高效预测抑制活性模型的一种选择。