Suppr超能文献

通过机器学习方法预测新型选择性肿瘤坏死因子-α转换酶(TACE)抑制剂并表征相关分子描述符

Prediction of novel and selective TNF-alpha converting enzyme (TACE) inhibitors and characterization of correlative molecular descriptors by machine learning approaches.

作者信息

Cong Yong, Yang Xue-Gang, Lv Wei, Xue Ying

机构信息

Key Lab of Green Chemistry and Technology in Ministry of Education, College of Chemistry, Sichuan University, Chengdu 610064, People's Republic of China.

出版信息

J Mol Graph Model. 2009 Oct;28(3):236-44. doi: 10.1016/j.jmgm.2009.08.001. Epub 2009 Aug 8.

Abstract

The inhibition of TNF-alpha converting enzyme (TACE) has been explored as a feasible therapy for the treatment of rheumatoid arthritis (RA) and Crohn's disease (CD). Recently, large numbers of novel and selective TACE inhibitors have been reported. It is desirable to develop machine learning (ML) models for identifying the inhibitors of TACE in the early drug design phase and test the prediction capabilities of these ML models. This work evaluated four ML methods, support vector machine (SVM), k-nearest neighbor (k-NN), back-propagation neural network (BPNN) and C4.5 decision tree (C4.5 DT), which were trained and tested by using a diverse set of 443 TACE inhibitors and 759 non-inhibitors. A well-established feature selection method, the recursive feature elimination (RFE) method, was used to select the most appropriate descriptors for classification from a large pool of descriptors, and two evaluation methods, 5-fold cross-validation and independent evaluation, were used to assess the performances of these developed models. In this study, all these ML models have already achieved promising prediction accuracies. By using the RFE method, the prediction accuracies are further improved. In k-NN, the model gives the best prediction for TACE inhibitors (98.32%), and the SVM bears the best prediction for non-inhibitors (99.51%). Both the k-NN and SVM model give the best overall prediction accuracy (98.45%). To the best of our knowledge, the SVM model developed in this work is the first one for the classification prediction of TACE inhibitors with a broad applicability domain. Our study suggests that ML methods, particularly SVM, are potentially useful for facilitating the discovery of TACE inhibitors and for exhibiting the molecular descriptors associated with TACE inhibitors.

摘要

肿瘤坏死因子-α转换酶(TACE)的抑制作用已被探索作为治疗类风湿性关节炎(RA)和克罗恩病(CD)的一种可行疗法。最近,已报道了大量新型且具有选择性的TACE抑制剂。在药物设计早期阶段开发用于识别TACE抑制剂的机器学习(ML)模型并测试这些ML模型的预测能力是很有必要的。这项工作评估了四种ML方法,即支持向量机(SVM)、k近邻算法(k-NN)、反向传播神经网络(BPNN)和C4.5决策树(C4.5 DT),使用443种不同的TACE抑制剂和759种非抑制剂对其进行训练和测试。采用一种成熟的特征选择方法——递归特征消除(RFE)方法,从大量描述符中选择最合适的描述符用于分类,并使用两种评估方法,即五折交叉验证和独立评估,来评估这些开发模型的性能。在本研究中,所有这些ML模型都已取得了可观的预测准确率。通过使用RFE方法,预测准确率进一步提高。在k-NN中,该模型对TACE抑制剂的预测最佳(98.32%),而SVM对非抑制剂的预测最佳(99.51%)。k-NN和SVM模型的总体预测准确率均最佳(98.45%)。据我们所知,本研究中开发的SVM模型是首个用于TACE抑制剂分类预测且具有广泛适用域的模型。我们的研究表明,ML方法,尤其是SVM,对于促进TACE抑制剂的发现以及展示与TACE抑制剂相关的分子描述符具有潜在的作用。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验