Damavandi Sedigheh, Shiri Fereshteh, Emamjomeh Abbasali, Pirhadi Somayeh, Beyzaei Hamid
Department of Bioinformatics, Laboratory of Computational Biotechnology and Bioinformatics (CBB Lab), University of Zabol, Zabol, Iran.
Department of Chemistry, Faculty of Science, University of Zabol, Zabol, Iran.
BMC Chem. 2023 Jul 6;17(1):70. doi: 10.1186/s13065-023-00991-6.
Lactate dehydrogenase (LDH) is a tetramer enzyme that converts pyruvate to lactate reversibly. This enzyme becomes important because it is associated with diseases such as cancers, heart disease, liver problems, and most importantly, corona disease. As a system-based method, proteochemometrics does not require knowledge of the protein's three-dimensional structure, but rather depends on the amino acid sequence and protein descriptors. Here, we applied this methodology to model a set of LDHA and LDHB isoenzyme inhibitors. To implement the proteochemetrics method, the camb package in the R Studio Server programming environment was used. The activity of 312 compounds of LDHA and LDHB isoenzyme inhibitors from the valid Binding DB database was retrieved. The proteochemometrics method was applied to three machine learning algorithms gradient amplification model, random forest, and support vector machine as regression methods to find the best model. Through the combination of different models into an ensemble (greedy and stacking optimization), we explored the possibility of improving the performance of models. For the RF best ensemble model of inhibitors of LDHA and LDHB isoenzymes, and were 0.66 and 0.62, respectively. LDH inhibitory activation is influenced by Morgan fingerprints and topological structure descriptors.
乳酸脱氢酶(LDH)是一种四聚体酶,可将丙酮酸可逆地转化为乳酸。这种酶之所以重要,是因为它与癌症、心脏病、肝脏问题等疾病有关,最重要的是,还与冠状病毒病有关。作为一种基于系统的方法,蛋白质化学计量学不需要了解蛋白质的三维结构,而是依赖于氨基酸序列和蛋白质描述符。在这里,我们应用这种方法对一组LDHA和LDHB同工酶抑制剂进行建模。为了实施蛋白质化学计量学方法,我们使用了R Studio Server编程环境中的camb包。从有效的Binding DB数据库中检索了312种LDHA和LDHB同工酶抑制剂化合物的活性。蛋白质化学计量学方法被应用于三种机器学习算法梯度放大模型、随机森林和支持向量机,作为回归方法来寻找最佳模型。通过将不同模型组合成一个集成模型(贪心和堆叠优化),我们探索了提高模型性能的可能性。对于LDHA和LDHB同工酶抑制剂的RF最佳集成模型,其值分别为0.66和0.62。LDH抑制激活受摩根指纹和拓扑结构描述符的影响。