用复合机器学习方法预测化学反应的立体选择性

Predicting the stereoselectivity of chemical reactions by composite machine learning method.

作者信息

Chung Jihoon, Li Justin, Saimon Amirul Islam, Hong Pengyu, Kong Zhenyu

机构信息

Department of Industrial Engineering, Pusan National University, Busan, Korea.

Management, Entrepreneurship, and Technology, University of California, Berkeley, CA, USA.

出版信息

Sci Rep. 2024 May 27;14(1):12131. doi: 10.1038/s41598-024-62158-0.

DOI:10.1038/s41598-024-62158-0

PMID:38802415

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11130203/

Abstract

Stereoselective reactions have played a vital role in the emergence of life, evolution, human biology, and medicine. However, for a long time, most industrial and academic efforts followed a trial-and-error approach for asymmetric synthesis in stereoselective reactions. In addition, most previous studies have been qualitatively focused on the influence of steric and electronic effects on stereoselective reactions. Therefore, quantitatively understanding the stereoselectivity of a given chemical reaction is extremely difficult. As proof of principle, this paper develops a novel composite machine learning method for quantitatively predicting the enantioselectivity representing the degree to which one enantiomer is preferentially produced from the reactions. Specifically, machine learning methods that are widely used in data analytics, including Random Forest, Support Vector Regression, and LASSO, are utilized. In addition, the Bayesian optimization and permutation importance tests are provided for an in-depth understanding of reactions and accurate prediction. Finally, the proposed composite method approximates the key features of the available reactions by using Gaussian mixture models, which provide suitable machine learning methods for new reactions. The case studies using the real stereoselective reactions show that the proposed method is effective and provides a solid foundation for further application to other chemical reactions.

摘要

立体选择性反应在生命起源、进化、人类生物学和医学中发挥了至关重要的作用。然而，长期以来，大多数工业和学术研究在立体选择性反应的不对称合成中都采用试错法。此外，以往的大多数研究在定性上都集中于空间和电子效应对立体选择性反应的影响。因此，定量理解给定化学反应的立体选择性极其困难。作为原理证明，本文开发了一种新型的复合机器学习方法，用于定量预测对映选择性，该对映选择性表示从反应中优先生成一种对映体的程度。具体而言，使用了在数据分析中广泛应用的机器学习方法，包括随机森林、支持向量回归和套索回归。此外，还提供了贝叶斯优化和排列重要性测试，以深入理解反应并进行准确预测。最后，所提出的复合方法通过使用高斯混合模型来逼近现有反应的关键特征，这为新反应提供了合适的机器学习方法。使用实际立体选择性反应的案例研究表明，所提出的方法是有效的，并为进一步应用于其他化学反应奠定了坚实基础。

相似文献

Predicting the stereoselectivity of chemical reactions by composite machine learning method.用复合机器学习方法预测化学反应的立体选择性

Sci Rep. 2024 May 27;14(1):12131. doi: 10.1038/s41598-024-62158-0.

Predicting glycosylation stereoselectivity using machine learning.使用机器学习预测糖基化立体选择性。

Chem Sci. 2020 Dec 26;12(8):2931-2939. doi: 10.1039/d0sc06222g.

Predicting Reaction Yields via Supervised Learning.通过有监督学习预测反应产率。

Acc Chem Res. 2021 Apr 20;54(8):1856-1865. doi: 10.1021/acs.accounts.0c00770. Epub 2021 Mar 31.

Importance of Engineered and Learned Molecular Representations in Predicting Organic Reactivity, Selectivity, and Chemical Properties.在预测有机反应性、选择性和化学性质方面，工程化和学习的分子表示的重要性。

Acc Chem Res. 2021 Feb 16;54(4):827-836. doi: 10.1021/acs.accounts.0c00745. Epub 2021 Feb 3.

Machine learning studies on asymmetric relay Heck reaction-Potential avenues for reaction development.关于不对称接力Heck反应的机器学习研究——反应发展的潜在途径

J Chem Phys. 2022 Mar 21;156(11):114303. doi: 10.1063/5.0084432.

Causal Artificial Intelligence Models of Food Quality Data.食品质量数据的因果人工智能模型。

Food Technol Biotechnol. 2024 Mar;62(1):102-109. doi: 10.17113/ftb.62.01.24.8301.

Bayesian Optimization for Chemical Reactions.化学反应的贝叶斯优化

Chimia (Aarau). 2023 Feb 22;77(1-2):31-38. doi: 10.2533/chimia.2023.31.

Machine Learning-Based Boosted Regression Ensemble Combined with Hyperparameter Tuning for Optimal Adaptive Learning.基于机器学习的增强回归集成与超参数调整相结合，实现最优自适应学习。

Sensors (Basel). 2022 May 16;22(10):3776. doi: 10.3390/s22103776.

Machine Learning Models for Predicting Influential Factors of Early Outcomes in Acute Ischemic Stroke: Registry-Based Study.用于预测急性缺血性卒中早期预后影响因素的机器学习模型：基于登记处的研究

JMIR Med Inform. 2022 Mar 25;10(3):e32508. doi: 10.2196/32508.

Optimizing Machine Learning Algorithms for Landslide Susceptibility Mapping along the Karakoram Highway, Gilgit Baltistan, Pakistan: A Comparative Study of Baseline, Bayesian, and Metaheuristic Hyperparameter Optimization Techniques.优化巴基斯坦吉尔吉特-巴尔蒂斯坦喀喇昆仑公路沿线滑坡易发性制图的机器学习算法：基线、贝叶斯和元启发式超参数优化技术的比较研究

Sensors (Basel). 2023 Aug 1;23(15):6843. doi: 10.3390/s23156843.

本文引用的文献

Cross-validated permutation feature importance considering correlation between features.考虑特征间相关性的交叉验证排列特征重要性

Anal Sci Adv. 2022 Sep 7;3(9-10):278-287. doi: 10.1002/ansa.202200018. eCollection 2022 Oct.

Enhanced Structure-Based Prediction of Chiral Stationary Phases for Chromatographic Enantioseparation from 3D Molecular Conformations.基于3D分子构象的色谱对映体拆分手性固定相的增强结构预测

Anal Chem. 2024 Feb 13;96(6):2351-2359. doi: 10.1021/acs.analchem.3c04028. Epub 2024 Feb 3.

Explainable Supervised Machine Learning Model To Predict Solvation Gibbs Energy.可解释的监督机器学习模型，用于预测溶剂化吉布斯自由能。

J Chem Inf Model. 2024 Apr 8;64(7):2250-2262. doi: 10.1021/acs.jcim.3c00544. Epub 2023 Aug 21.

Explainable Solvation Free Energy Prediction Combining Graph Neural Networks with Chemical Intuition.结合图神经网络与化学直觉的可解释溶剂化自由能预测

J Chem Inf Model. 2022 Nov 28;62(22):5457-5470. doi: 10.1021/acs.jcim.2c01013. Epub 2022 Nov 1.

MLSolvA: solvation free energy prediction from pairwise atomistic interactions by machine learning.MLSolvA：通过机器学习从成对原子相互作用预测溶剂化自由能。

J Cheminform. 2021 Jul 31;13(1):56. doi: 10.1186/s13321-021-00533-z.

Graph-Based Approaches for Predicting Solvation Energy in Multiple Solvents: Open Datasets and Machine Learning Models.基于图的方法在多种溶剂中预测溶剂化能：开放数据集和机器学习模型。

J Phys Chem A. 2021 Jul 15;125(27):5990-5998. doi: 10.1021/acs.jpca.1c01960. Epub 2021 Jun 30.

Predicting glycosylation stereoselectivity using machine learning.使用机器学习预测糖基化立体选择性。

Chem Sci. 2020 Dec 26;12(8):2931-2939. doi: 10.1039/d0sc06222g.

Learning Atomic Interactions through Solvation Free Energy Prediction Using Graph Neural Networks.通过图神经网络预测溶剂化自由能来学习原子相互作用。

J Chem Inf Model. 2021 Feb 22;61(2):689-698. doi: 10.1021/acs.jcim.0c01413. Epub 2021 Feb 5.

Holistic prediction of enantioselectivity in asymmetric catalysis.整体预测手性催化中的对映选择性。

Nature. 2019 Jul;571(7765):343-348. doi: 10.1038/s41586-019-1384-z. Epub 2019 Jul 17.

Prediction of higher-selectivity catalysts by computer-driven workflow and machine learning.通过计算机驱动的工作流程和机器学习预测高选择性催化剂。

Science. 2019 Jan 18;363(6424). doi: 10.1126/science.aau5631.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用复合机器学习方法预测化学反应的立体选择性

Predicting the stereoselectivity of chemical reactions by composite machine learning method.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献