用于多相催化预测建模的支持向量机：基于两个实际应用的全面介绍与过拟合研究

Support vector machines for predictive modeling in heterogeneous catalysis: a comprehensive introduction and overfitting investigation based on two real applications.

作者信息

Baumes L A, Serra J M, Serna P, Corma A

机构信息

Instituto de Tecnología Química (UPV-CSIC), av. Naranjos s/n, 46022 Valencia, Spain.

出版信息

J Comb Chem. 2006 Jul-Aug;8(4):583-96. doi: 10.1021/cc050093m.

DOI:10.1021/cc050093m

PMID:16827571

Abstract

This works provides an introduction to support vector machines (SVMs) for predictive modeling in heterogeneous catalysis, describing step by step the methodology with a highlighting of the points which make such technique an attractive approach. We first investigate linear SVMs, working in detail through a simple example based on experimental data derived from a study aiming at optimizing olefin epoxidation catalysts applying high-throughput experimentation. This case study has been chosen to underline SVM features in a visual manner because of the few catalytic variables investigated. It is shown how SVMs transform original data into another representation space of higher dimensionality. The concepts of Vapnik-Chervonenkis dimension and structural risk minimization are introduced. The SVM methodology is evaluated with a second catalytic application, that is, light paraffin isomerization. Finally, we discuss why SVMs is a strategic method, as compared to other machine learning techniques, such as neural networks or induction trees, and why emphasis is put on the problem of overfitting.

摘要

本文介绍了用于多相催化预测建模的支持向量机（SVM），逐步描述了该方法，并突出了使其成为一种有吸引力的方法的要点。我们首先研究线性支持向量机，通过一个基于从旨在应用高通量实验优化烯烃环氧化催化剂的研究中获得的实验数据的简单示例进行详细分析。选择这个案例研究是为了以直观的方式强调支持向量机的特征，因为所研究的催化变量较少。展示了支持向量机如何将原始数据转换到更高维度的另一个表示空间。引入了Vapnik-Chervonenkis维度和结构风险最小化的概念。支持向量机方法在第二个催化应用即轻质石蜡异构化中进行了评估。最后，我们讨论了与其他机器学习技术（如神经网络或归纳树）相比，为什么支持向量机是一种战略性方法，以及为什么强调过拟合问题。

相似文献

Support vector machines for predictive modeling in heterogeneous catalysis: a comprehensive introduction and overfitting investigation based on two real applications.

J Comb Chem. 2006 Jul-Aug;8(4):583-96. doi: 10.1021/cc050093m.

Integrating support vector machines and neural networks.

Neural Netw. 2007 Jul;20(5):590-7. doi: 10.1016/j.neunet.2006.12.003. Epub 2006 Dec 22.

Training a support vector machine in the primal.

Neural Comput. 2007 May;19(5):1155-78. doi: 10.1162/neco.2007.19.5.1155.

New support vector-based design method for binary hierarchical classifiers for multi-class classification problems.

Neural Netw. 2008 Mar-Apr;21(2-3):502-10. doi: 10.1016/j.neunet.2007.12.005. Epub 2007 Dec 8.

Subspace-based support vector machines for pattern classification.

Neural Netw. 2009 Jul-Aug;22(5-6):558-67. doi: 10.1016/j.neunet.2009.06.026. Epub 2009 Jul 2.

Two criteria for model selection in multiclass support vector machines.

IEEE Trans Syst Man Cybern B Cybern. 2008 Dec;38(6):1432-48. doi: 10.1109/TSMCB.2008.927272.

Predictive learning with structured (grouped) data.

Neural Netw. 2009 Jul-Aug;22(5-6):766-73. doi: 10.1016/j.neunet.2009.06.030. Epub 2009 Jul 2.

Support vector machine based training of multilayer feedforward neural networks as optimized by particle swarm algorithm: application in QSAR studies of bioactivity of organic compounds.

J Comput Chem. 2007 Jan 30;28(2):519-27. doi: 10.1002/jcc.20561.

Theoretical analysis for solution of support vector data description.

Neural Netw. 2011 May;24(4):360-9. doi: 10.1016/j.neunet.2011.01.007. Epub 2011 Feb 3.

Human detection in images via piecewise linear support vector machines.

IEEE Trans Image Process. 2013 Feb;22(2):778-89. doi: 10.1109/TIP.2012.2222901. Epub 2012 Oct 5.

引用本文的文献

Developing machine learning for heterogeneous catalysis with experimental and computational data.

Nat Rev Chem. 2025 Jul 18. doi: 10.1038/s41570-025-00740-4.

Research Progress in Epoxidation of Light Small-Molecule Olefins.

Molecules. 2025 Mar 17;30(6):1340. doi: 10.3390/molecules30061340.

Decoding the synergy: unveiling gradient boosting regression model for multivariate quantitation of pioglitazone, alogliptin and glimepiride in pure and tablet dosage forms.

BMC Chem. 2024 Nov 29;18(1):237. doi: 10.1186/s13065-024-01351-8.

Machine Learning Descriptors for Data-Driven Catalysis Study.

Adv Sci (Weinh). 2023 Aug;10(22):e2301020. doi: 10.1002/advs.202301020. Epub 2023 May 16.

Systematic Data-Driven Modeling of Bimetallic Catalyst Performance for the Hydrogenation of 5-Ethoxymethylfurfural with Variable Selection and Regularization.

Ind Eng Chem Res. 2022 Apr 13;61(14):4752-4762. doi: 10.1021/acs.iecr.1c03995. Epub 2022 Mar 31.

Model-Based Reasoning of Clinical Diagnosis in Integrative Medicine: Real-World Methodological Study of Electronic Medical Records and Natural Language Processing Methods.

JMIR Med Inform. 2020 Dec 21;8(12):e23082. doi: 10.2196/23082.

Machine learning dihydrogen activation in the chemical space surrounding Vaska's complex.

Chem Sci. 2020 Apr 7;11(18):4584-4601. doi: 10.1039/d0sc00445f. eCollection 2020 May 14.

tmQM Dataset-Quantum Geometries and Properties of 86k Transition Metal Complexes.

J Chem Inf Model. 2020 Dec 28;60(12):6135-6146. doi: 10.1021/acs.jcim.0c01041. Epub 2020 Nov 9.

Prediction of clinical and biomarker conformed Alzheimer's disease and mild cognitive impairment from multi-feature brain structural MRI using age-correction from a large independent lifespan sample.

Neuroimage Clin. 2020;28:102387. doi: 10.1016/j.nicl.2020.102387. Epub 2020 Aug 19.

Towards operando computational modeling in heterogeneous catalysis.

Chem Soc Rev. 2018 Nov 12;47(22):8307-8348. doi: 10.1039/c8cs00398j.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于多相催化预测建模的支持向量机：基于两个实际应用的全面介绍与过拟合研究

Support vector machines for predictive modeling in heterogeneous catalysis: a comprehensive introduction and overfitting investigation based on two real applications.

作者信息

Baumes L A, Serra J M, Serna P, Corma A

机构信息

Instituto de Tecnología Química (UPV-CSIC), av. Naranjos s/n, 46022 Valencia, Spain.

出版信息

J Comb Chem. 2006 Jul-Aug;8(4):583-96. doi: 10.1021/cc050093m.

DOI:10.1021/cc050093m

PMID:16827571

Abstract

摘要

用于多相催化预测建模的支持向量机：基于两个实际应用的全面介绍与过拟合研究

Support vector machines for predictive modeling in heterogeneous catalysis: a comprehensive introduction and overfitting investigation based on two real applications.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

用于多相催化预测建模的支持向量机：基于两个实际应用的全面介绍与过拟合研究

Support vector machines for predictive modeling in heterogeneous catalysis: a comprehensive introduction and overfitting investigation based on two real applications.

作者信息

机构信息

出版信息

相似文献

引用本文的文献