• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习和分子描述符助力不对称催化中的合理溶剂选择。

Machine learning and molecular descriptors enable rational solvent selection in asymmetric catalysis.

作者信息

Amar Yehia, Schweidtmann Artur M, Deutsch Paul, Cao Liwei, Lapkin Alexei

机构信息

Department of Chemical Engineering and Biotechnology , University of Cambridge , Philippa Fawcett Drive , Cambridge , CB3 0AS , UK . Email:

Aachener Verfahrenstechnik - Process Systems Engineering , RWTH Aachen University , Aachen , Germany.

出版信息

Chem Sci. 2019 May 30;10(27):6697-6706. doi: 10.1039/c9sc01844a. eCollection 2019 Jul 21.

DOI:10.1039/c9sc01844a
PMID:31367324
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6625492/
Abstract

Rational solvent selection remains a significant challenge in process development. Here we describe a hybrid mechanistic-machine learning approach, geared towards automated process development workflow. A library of 459 solvents was used, for which 12 conventional molecular descriptors, two reaction-specific descriptors, and additional descriptors based on screening charge density, were calculated. Gaussian process surrogate models were trained on experimental data from a Rh(CO)(acac)/Josiphos catalysed asymmetric hydrogenation of a chiral α-β unsaturated γ-lactam. With two simultaneous objectives - high conversion and high diastereomeric excess - the multi-objective algorithm, trained on the initial dataset of 25 solvents, has identified solvents leading to better reaction outcomes. In addition to being a powerful design of experiments (DoE) methodology, the resulting Gaussian process surrogate model for conversion is, in statistical terms, predictive, with a cross-validation correlation coefficient of 0.84. After identifying promising solvents, the composition of solvent mixtures and optimal reaction temperature were found using a black-box Bayesian optimisation. We then demonstrated the application of a new genetic programming approach to select an appropriate machine learning model for a specific physical system, which should allow the transition of the overall process development workflow into the future robotic laboratories.

摘要

在工艺开发中,合理选择溶剂仍然是一项重大挑战。在此,我们描述了一种混合的机理-机器学习方法,旨在实现自动化工艺开发工作流程。我们使用了一个包含459种溶剂的库,并计算了12种常规分子描述符、两种反应特定描述符以及基于筛选电荷密度的其他描述符。高斯过程代理模型是根据Rh(CO)(acac)/Josiphos催化的手性α-β不饱和γ-内酰胺不对称氢化反应的实验数据进行训练的。通过两个同时的目标——高转化率和高非对映体过量——在最初的25种溶剂数据集上训练的多目标算法,已经识别出能带来更好反应结果的溶剂。除了是一种强大的实验设计(DoE)方法外,由此产生的用于转化率的高斯过程代理模型在统计上具有预测性,交叉验证相关系数为0.84。在识别出有前景的溶剂后,使用黑箱贝叶斯优化方法确定了溶剂混合物的组成和最佳反应温度。然后,我们展示了一种新的遗传编程方法在为特定物理系统选择合适的机器学习模型方面的应用,这应该能够使整个工艺开发工作流程向未来的机器人实验室过渡。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/a1539e35393a/c9sc01844a-f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/80197c207a2b/c9sc01844a-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/9a69b1e2a827/c9sc01844a-s1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/183289d5488c/c9sc01844a-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/a372c2fba3aa/c9sc01844a-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/630a40dd8d13/c9sc01844a-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/83098e0cd385/c9sc01844a-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/34131ade1a74/c9sc01844a-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/1540ba02adc8/c9sc01844a-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/199b6961d45e/c9sc01844a-f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/a1539e35393a/c9sc01844a-f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/80197c207a2b/c9sc01844a-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/9a69b1e2a827/c9sc01844a-s1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/183289d5488c/c9sc01844a-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/a372c2fba3aa/c9sc01844a-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/630a40dd8d13/c9sc01844a-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/83098e0cd385/c9sc01844a-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/34131ade1a74/c9sc01844a-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/1540ba02adc8/c9sc01844a-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/199b6961d45e/c9sc01844a-f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dde5/6625492/a1539e35393a/c9sc01844a-f9.jpg

相似文献

1
Machine learning and molecular descriptors enable rational solvent selection in asymmetric catalysis.机器学习和分子描述符助力不对称催化中的合理溶剂选择。
Chem Sci. 2019 May 30;10(27):6697-6706. doi: 10.1039/c9sc01844a. eCollection 2019 Jul 21.
2
Applications of Machine Learning in Alloy Catalysts: Rational Selection and Future Development of Descriptors.机器学习在合金催化剂中的应用:描述符的合理选择和未来发展。
Adv Sci (Weinh). 2022 Apr;9(12):e2106043. doi: 10.1002/advs.202106043. Epub 2022 Mar 1.
3
Molecular Machine Learning for Chemical Catalysis: Prospects and Challenges.分子机器学习在化学催化中的应用:前景与挑战。
Acc Chem Res. 2023 Feb 7;56(3):402-412. doi: 10.1021/acs.accounts.2c00801. Epub 2023 Jan 30.
4
Probing machine learning models based on high throughput experimentation data for the discovery of asymmetric hydrogenation catalysts.基于高通量实验数据探索机器学习模型以发现不对称氢化催化剂。
Chem Sci. 2024 Jul 16;15(34):13618-13630. doi: 10.1039/d4sc03647f. eCollection 2024 Aug 28.
5
Automated classification of tropical shrub species: a hybrid of leaf shape and machine learning approach.热带灌木物种的自动分类:叶形与机器学习方法的结合
PeerJ. 2017 Sep 12;5:e3792. doi: 10.7717/peerj.3792. eCollection 2017.
6
Easy and fast prediction of green solvents for small molecule donor-based organic solar cells through machine learning.通过机器学习轻松快速地预测用于小分子供体基有机太阳能电池的绿色溶剂。
Phys Chem Chem Phys. 2023 Apr 12;25(15):10417-10426. doi: 10.1039/d3cp00177f.
7
Feedback in Flow for Accelerated Reaction Development.流场中的反馈促进反应开发。
Acc Chem Res. 2016 Sep 20;49(9):1786-96. doi: 10.1021/acs.accounts.6b00261. Epub 2016 Aug 15.
8
Machine Learning Descriptors for Data-Driven Catalysis Study.用于数据驱动催化研究的机器学习描述符
Adv Sci (Weinh). 2023 Aug;10(22):e2301020. doi: 10.1002/advs.202301020. Epub 2023 May 16.
9
Machine learning prediction of empirical polarity using SMILES encoding of organic solvents.基于有机溶剂 SMILES 编码的机器学习预测经验极性。
Mol Divers. 2023 Oct;27(5):2331-2343. doi: 10.1007/s11030-022-10559-6. Epub 2022 Nov 5.
10
Predicting Solvent-Dependent Nucleophilicity Parameter with a Causal Structure Property Relationship.用因果结构属性关系预测溶剂依赖性亲核性参数。
J Chem Inf Model. 2021 Oct 25;61(10):4890-4899. doi: 10.1021/acs.jcim.1c00610. Epub 2021 Sep 22.

引用本文的文献

1
Predictive design of crystallographic chiral separation.晶体学手性分离的预测设计。
Nat Commun. 2025 Aug 26;16(1):7977. doi: 10.1038/s41467-025-62825-4.
2
Advancing genetic engineering with active learning: theory, implementations and potential opportunities.通过主动学习推进基因工程:理论、实现与潜在机遇
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf286.
3
Machine Learning-Driven Optimization of Continuous-Flow Photoredox Amine Synthesis.机器学习驱动的连续流光催化胺合成优化

本文引用的文献

1
International chemical identifier for reactions (RInChI).反应的国际化学标识符(RInChI)。
J Cheminform. 2018 May 9;10(1):22. doi: 10.1186/s13321-018-0277-8.
2
Predictive and mechanistic multivariate linear regression models for reaction development.反应发展的预测性和机制性多元线性回归模型。
Chem Sci. 2018 Jan 23;9(9):2398-2412. doi: 10.1039/c7sc04679k. eCollection 2018 Mar 7.
3
Predicting reaction performance in C-N cross-coupling using machine learning.使用机器学习预测 C-N 交叉偶联反应性能。
Org Process Res Dev. 2025 May 21;29(6):1411-1422. doi: 10.1021/acs.oprd.4c00533. eCollection 2025 Jun 20.
4
Deep Reinforcement Learning-Based Self-Optimization of Flow Chemistry.基于深度强化学习的流动化学自优化
ACS Eng Au. 2025 May 13;5(3):247-266. doi: 10.1021/acsengineeringau.5c00004. eCollection 2025 Jun 18.
5
Surfactant representation using COSMO screened charge density for adsorption isotherm prediction using Physics-Informed Neural Network (PINN).使用COSMO筛选电荷密度表示表面活性剂以通过物理信息神经网络(PINN)预测吸附等温线。
J Cheminform. 2025 May 26;17(1):84. doi: 10.1186/s13321-025-01027-y.
6
Solvent Screening for Separation Processes Using Machine Learning and High-Throughput Technologies.利用机器学习和高通量技术进行分离过程的溶剂筛选
Chem Bio Eng. 2025 Mar 5;2(4):210-228. doi: 10.1021/cbe.4c00170. eCollection 2025 Apr 24.
7
Substrate specific closed-loop optimization of carbohydrate protective group chemistry using Bayesian optimization and transfer learning.使用贝叶斯优化和迁移学习对碳水化合物保护基化学进行底物特异性闭环优化。
Chem Sci. 2023 May 18;14(23):6319-6329. doi: 10.1039/d3sc01261a. eCollection 2023 Jun 14.
8
Accelerated Chemical Reaction Optimization Using Multi-Task Learning.基于多任务学习的加速化学反应优化
ACS Cent Sci. 2023 Apr 13;9(5):957-968. doi: 10.1021/acscentsci.3c00050. eCollection 2023 May 24.
9
A Brief Introduction to Chemical Reaction Optimization.化学反应优化简介。
Chem Rev. 2023 Mar 22;123(6):3089-3126. doi: 10.1021/acs.chemrev.2c00798. Epub 2023 Feb 23.
10
Transferrable selectivity profiles enable prediction in synergistic catalyst space.可转移的选择性概况能够在协同催化剂空间中进行预测。
Chem Sci. 2023 Jan 17;14(7):1885-1895. doi: 10.1039/d2sc05974f. eCollection 2023 Feb 15.
Science. 2018 Apr 13;360(6385):186-190. doi: 10.1126/science.aar5169. Epub 2018 Feb 15.
4
A Multiobjective Optimization Including Results of Life Cycle Assessment in Developing Biorenewables-Based Processes.在开发基于生物可再生资源的工艺过程中,将生命周期评估结果纳入多目标优化。
ChemSusChem. 2017 Sep 22;10(18):3632-3643. doi: 10.1002/cssc.201700927. Epub 2017 Sep 4.
5
Human versus Robots in the Discovery and Crystallization of Gigantic Polyoxometalates.人类与机器人在巨型多金属氧酸盐的发现和结晶中的作用。
Angew Chem Int Ed Engl. 2017 Aug 28;56(36):10815-10820. doi: 10.1002/anie.201705721. Epub 2017 Aug 3.
6
Self-optimisation and model-based design of experiments for developing a C-H activation flow process.用于开发碳氢活化流动过程的实验的自优化与基于模型的设计
Beilstein J Org Chem. 2017 Jan 24;13:150-163. doi: 10.3762/bjoc.13.18. eCollection 2017.
7
Quantifying the equilibrium partitioning of substituted polycyclic aromatic hydrocarbons in aerosols and clouds using COSMOtherm.使用COSMOtherm量化取代多环芳烃在气溶胶和云中的平衡分配。
Environ Sci Process Impacts. 2017 Mar 22;19(3):288-299. doi: 10.1039/c6em00636a.
8
Feedback in Flow for Accelerated Reaction Development.流场中的反馈促进反应开发。
Acc Chem Res. 2016 Sep 20;49(9):1786-96. doi: 10.1021/acs.accounts.6b00261. Epub 2016 Aug 15.
9
The Development of Multidimensional Analysis Tools for Asymmetric Catalysis and Beyond.多维分析工具在不对称催化及其他领域的发展
Acc Chem Res. 2016 Jun 21;49(6):1292-301. doi: 10.1021/acs.accounts.6b00194. Epub 2016 May 24.
10
Parameterization of phosphine ligands reveals mechanistic pathways and predicts reaction outcomes.膦配体的参数化揭示了反应机制途径并预测了反应结果。
Nat Chem. 2016 Jun;8(6):610-7. doi: 10.1038/nchem.2501. Epub 2016 May 16.