支持向量机线性组合的元参数确定

Determination of Meta-Parameters for Support Vector Machine Linear Combinations.

作者信息

Jasial Swarit, Balfer Jenny, Vogt Martin, Bajorath Jürgen

机构信息

Department of Life Science Informatics, Bonn-Aachen International Center for Information Technology, Rheinische Friedrich-Wilhelms-Universität Bonn, Dahlmannstr. 2, 53113 Bonn, Germany tel: +49-228-2699-306; fax: +49-228-2699-341.

出版信息

Mol Inform. 2015 Feb;34(2-3):127-33. doi: 10.1002/minf.201400163. Epub 2015 Feb 17.

DOI:10.1002/minf.201400163

PMID:27490035

Abstract

Support vector machines (SVMs) are among the most popular machine learning methods for compound classification and other chemoinformatics tasks such as, for example, the prediction of ligand-target pairs or compound activity profiles. Depending on the specific applications, different SVM strategies can be used. For example, in the context of potency-directed virtual screening, linear combinations of multiple SVM models have been shown to enrich database selection sets with potent compounds compared to individual models. An open question concerning the use of SVM linear combinations (SVM-LCs) is how to best weight the models on a relative scale. Typically, linear weights are subjectively set. Herein, preferred weighting factors for SVM-LC were systematically determined. Therefore, weights were treated as meta-parameters and optimized by machine learning to enrich data set rankings with highly active compounds. The meta-parameter approach has been applied to 10 screening data sets and found to further improve SVM performance over other SVM-LCs and support vector regression (SVR) models. The results show that optimal weights depend on data set characteristics and chosen molecular representations. In addition, individual models often do not contribute to the performance of SVM-LCs. Taken together, these findings emphasize the need for systematic meta-parameter estimation.

摘要

支持向量机（SVM）是用于化合物分类和其他化学信息学任务（例如预测配体-靶点对或化合物活性谱）的最流行的机器学习方法之一。根据具体应用，可以使用不同的SVM策略。例如，在效价导向的虚拟筛选中，与单个模型相比，多个SVM模型的线性组合已被证明可以用强效化合物丰富数据库选择集。关于使用SVM线性组合（SVM-LC）的一个悬而未决的问题是如何在相对尺度上对模型进行最佳加权。通常，线性权重是主观设定的。在此，系统地确定了SVM-LC的优选加权因子。因此，权重被视为元参数，并通过机器学习进行优化，以用高活性化合物丰富数据集排名。元参数方法已应用于10个筛选数据集，发现与其他SVM-LC和支持向量回归（SVR）模型相比，它能进一步提高SVM的性能。结果表明，最佳权重取决于数据集特征和所选的分子表示。此外，单个模型通常对SVM-LC的性能没有贡献。综上所述，这些发现强调了系统进行元参数估计的必要性。

相似文献

Determination of Meta-Parameters for Support Vector Machine Linear Combinations.

Mol Inform. 2015 Feb;34(2-3):127-33. doi: 10.1002/minf.201400163. Epub 2015 Feb 17.

Exploring Alternative Strategies for the Identification of Potent Compounds Using Support Vector Machine and Regression Modeling.

J Chem Inf Model. 2019 Mar 25;59(3):983-992. doi: 10.1021/acs.jcim.8b00584. Epub 2018 Dec 14.

Support Vector Machine Classification and Regression Prioritize Different Structural Features for Binary Compound Activity and Potency Value Prediction.

ACS Omega. 2017 Oct 31;2(10):6371-6379. doi: 10.1021/acsomega.7b01079. Epub 2017 Oct 4.

Systematic artifacts in support vector regression-based compound potency prediction revealed by statistical and activity landscape analysis.

PLoS One. 2015 Mar 5;10(3):e0119301. doi: 10.1371/journal.pone.0119301. eCollection 2015.

Potency-directed similarity searching using support vector machines.

Chem Biol Drug Des. 2011 Jan;77(1):30-8. doi: 10.1111/j.1747-0285.2010.01059.x. Epub 2010 Nov 29.

Influence of Varying Training Set Composition and Size on Support Vector Machine-Based Prediction of Active Compounds.

J Chem Inf Model. 2017 Apr 24;57(4):710-716. doi: 10.1021/acs.jcim.7b00088. Epub 2017 Apr 10.

Evolution of Support Vector Machine and Regression Modeling in Chemoinformatics and Drug Discovery.

J Comput Aided Mol Des. 2022 May;36(5):355-362. doi: 10.1007/s10822-022-00442-9. Epub 2022 Mar 19.

Support vector machines with constraints for sparsity in the primal parameters.

IEEE Trans Neural Netw. 2011 Aug;22(8):1269-83. doi: 10.1109/TNN.2011.2148727. Epub 2011 Jul 5.

Comparison of confirmed inactive and randomly selected compounds as negative training examples in support vector machine-based virtual screening.

J Chem Inf Model. 2013 Jul 22;53(7):1595-601. doi: 10.1021/ci4002712. Epub 2013 Jul 3.

The construction of support vector machine classifier using the firefly algorithm.

Comput Intell Neurosci. 2015;2015:212719. doi: 10.1155/2015/212719. Epub 2015 Feb 23.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

支持向量机线性组合的元参数确定

Determination of Meta-Parameters for Support Vector Machine Linear Combinations.

作者信息

Jasial Swarit, Balfer Jenny, Vogt Martin, Bajorath Jürgen

机构信息

出版信息

Mol Inform. 2015 Feb;34(2-3):127-33. doi: 10.1002/minf.201400163. Epub 2015 Feb 17.

DOI:10.1002/minf.201400163

PMID:27490035

Abstract

摘要

支持向量机线性组合的元参数确定

Determination of Meta-Parameters for Support Vector Machine Linear Combinations.

作者信息

机构信息

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

支持向量机线性组合的元参数确定

Determination of Meta-Parameters for Support Vector Machine Linear Combinations.

作者信息

机构信息

出版信息

相似文献