• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

pKalculator:一种用于C-H键的p预测器。

pKalculator: A p predictor for C-H bonds.

作者信息

Borup Rasmus M, Ree Nicolai, Jensen Jan H

机构信息

Department of Chemistry, University of Copenhagen, Copenhagen, DK-2100, Denmark.

出版信息

Beilstein J Org Chem. 2024 Jul 16;20:1614-1622. doi: 10.3762/bjoc.20.144. eCollection 2024.

DOI:10.3762/bjoc.20.144
PMID:39076289
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11285060/
Abstract

Determining the p values of various C-H sites in organic molecules offers valuable insights for synthetic chemists in predicting reaction sites. As molecular complexity increases, this task becomes more challenging. This paper introduces pKalculator, a quantum chemistry (QM)-based workflow for automatic computations of C-H p values, which is used to generate a training dataset for a machine learning (ML) model. The QM workflow is benchmarked against 695 experimentally determined C-H p values in DMSO. The ML model is trained on a diverse dataset of 775 molecules with 3910 C-H sites. Our ML model predicts C-H p values with a mean absolute error (MAE) and a root mean squared error (RMSE) of 1.24 and 2.15 p units, respectively. Furthermore, we employ our model on 1043 p -dependent reactions (aldol, Claisen, and Michael) and successfully indicate the reaction sites with a Matthew's correlation coefficient (MCC) of 0.82.

摘要

确定有机分子中各种C-H位点的p值,为合成化学家预测反应位点提供了有价值的见解。随着分子复杂性的增加,这项任务变得更具挑战性。本文介绍了pKalculator,这是一种基于量子化学(QM)的工作流程,用于自动计算C-H p值,该工作流程用于生成机器学习(ML)模型的训练数据集。该QM工作流程以二甲基亚砜中695个实验测定的C-H p值为基准进行测试。ML模型在包含775个分子、3910个C-H位点的多样化数据集上进行训练。我们的ML模型预测C-H p值的平均绝对误差(MAE)和均方根误差(RMSE)分别为1.24和2.15个p单位。此外,我们将模型应用于1043个与p相关的反应(羟醛缩合、克莱森缩合和迈克尔加成反应),并成功地以0.82的马修斯相关系数(MCC)指出了反应位点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57b4/11285060/6f27a297f1a9/Beilstein_J_Org_Chem-20-1614-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57b4/11285060/ca4c9bdcb676/Beilstein_J_Org_Chem-20-1614-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57b4/11285060/794e0174e44b/Beilstein_J_Org_Chem-20-1614-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57b4/11285060/de5936bdc63c/Beilstein_J_Org_Chem-20-1614-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57b4/11285060/6f27a297f1a9/Beilstein_J_Org_Chem-20-1614-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57b4/11285060/ca4c9bdcb676/Beilstein_J_Org_Chem-20-1614-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57b4/11285060/794e0174e44b/Beilstein_J_Org_Chem-20-1614-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57b4/11285060/de5936bdc63c/Beilstein_J_Org_Chem-20-1614-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57b4/11285060/6f27a297f1a9/Beilstein_J_Org_Chem-20-1614-g004.jpg

相似文献

1
pKalculator: A p predictor for C-H bonds.pKalculator:一种用于C-H键的p预测器。
Beilstein J Org Chem. 2024 Jul 16;20:1614-1622. doi: 10.3762/bjoc.20.144. eCollection 2024.
2
The SAMPL6 challenge on predicting aqueous pK values from EC-RISM theory.SAMPL6 挑战赛:从 EC-RISM 理论预测水相 pk 值。
J Comput Aided Mol Des. 2018 Oct;32(10):1151-1163. doi: 10.1007/s10822-018-0140-z. Epub 2018 Aug 2.
3
Prediction of 3-hydroxypyridin-4-one (HPO) hydroxyl pKa values.预测 3-羟基吡啶-4-酮 (HPO) 的羟基 pKa 值。
Dalton Trans. 2012 Jun 7;41(21):6549-57. doi: 10.1039/c2dt12396g. Epub 2012 Mar 6.
4
Extending pKa prediction accuracy: high-throughput pKa measurements to understand pKa modulation of new chemical series.拓展 pKa 预测精度:高通量 pKa 测量以理解新化学系列的 pKa 调节。
Eur J Med Chem. 2010 Sep;45(9):4270-9. doi: 10.1016/j.ejmech.2010.06.026. Epub 2010 Jun 23.
5
Machine learning meets pK .机器学习与药物解离常数相遇。
F1000Res. 2020 Feb 13;9. doi: 10.12688/f1000research.22090.2. eCollection 2020.
6
Best of both worlds: combining pharma data and state of the art modeling technology to improve in Silico pKa prediction.两全其美:结合制药数据与先进建模技术以改进计算机辅助pKa预测。
J Chem Inf Model. 2015 Feb 23;55(2):389-97. doi: 10.1021/ci500585w. Epub 2014 Dec 16.
7
Machine Learning and Pharmacometrics for Prediction of Pharmacokinetic Data: Differences, Similarities and Challenges Illustrated with Rifampicin.用于预测药代动力学数据的机器学习与药代动力学:以利福平为例说明差异、相似性及挑战
Pharmaceutics. 2022 Jul 22;14(8):1530. doi: 10.3390/pharmaceutics14081530.
8
Holistic Prediction of the pK in Diverse Solvents Based on a Machine-Learning Approach.基于机器学习方法的不同溶剂中 pK 的整体预测。
Angew Chem Int Ed Engl. 2020 Oct 19;59(43):19282-19291. doi: 10.1002/anie.202008528. Epub 2020 Aug 25.
9
Comparative Analysis of Machine Learning Models for Predicting the Mechanical Behavior of Bio-Based Cellular Composite Sandwich Structures.用于预测生物基蜂窝复合夹层结构力学行为的机器学习模型的比较分析
Materials (Basel). 2024 Jul 15;17(14):3493. doi: 10.3390/ma17143493.
10
SAMPL6: calculation of macroscopic pK values from ab initio quantum mechanical free energies.SAMPL6:从头算量子力学自由能计算宏观 pK 值。
J Comput Aided Mol Des. 2018 Oct;32(10):1203-1216. doi: 10.1007/s10822-018-0138-6. Epub 2018 Aug 6.

引用本文的文献

1
Computational tools for the prediction of site- and regioselectivity of organic reactions.用于预测有机反应位点和区域选择性的计算工具。
Chem Sci. 2025 Mar 4;16(13):5383-5412. doi: 10.1039/d5sc00541h. eCollection 2025 Mar 26.
2
Atom-based machine learning for estimating nucleophilicity and electrophilicity with applications to retrosynthesis and chemical stability.基于原子的机器学习用于估计亲核性和亲电性及其在逆合成和化学稳定性方面的应用
Chem Sci. 2025 Feb 25;16(13):5676-5687. doi: 10.1039/d4sc07297a. eCollection 2025 Mar 26.

本文引用的文献

1
DECIMER.ai: an open platform for automated optical chemical structure identification, segmentation and recognition in scientific publications.DECIMER.ai:一个用于科学出版物中光学化学结构自动识别、分割和识别的开放平台。
Nat Commun. 2023 Aug 19;14(1):5045. doi: 10.1038/s41467-023-40782-0.
2
Hybrid Machine Learning Approach to Predict the Site Selectivity of Iridium-Catalyzed Arene Borylation.用于预测铱催化芳烃硼化反应位点选择性的混合机器学习方法
J Am Chem Soc. 2023 Aug 9;145(31):17367-17376. doi: 10.1021/jacs.3c04986. Epub 2023 Jul 31.
3
Late-stage C-H functionalization offers new opportunities in drug discovery.
晚期碳氢键官能团化在药物研发中提供了新的机遇。
Nat Rev Chem. 2021 Aug;5(8):522-545. doi: 10.1038/s41570-021-00300-6. Epub 2021 Jul 13.
4
What the Heck?-Automated Regioselectivity Calculations of Palladium-Catalyzed Heck Reactions Using Quantum Chemistry.究竟怎么回事?——利用量子化学对钯催化的Heck反应进行区域选择性自动计算
ACS Omega. 2022 Dec 2;7(49):45617-45623. doi: 10.1021/acsomega.2c06378. eCollection 2022 Dec 13.
5
DECIMER 1.0: deep learning for chemical image recognition using transformers.DECIMER 1.0:使用Transformer进行化学图像识别的深度学习
J Cheminform. 2021 Aug 17;13(1):61. doi: 10.1186/s13321-021-00538-8.
6
rSCAN-3c: A "Swiss army knife" composite electronic-structure method.rSCAN-3c:一种“瑞士军刀”式的复合电子结构方法。
J Chem Phys. 2021 Feb 14;154(6):064103. doi: 10.1063/5.0040021.
7
RegioSQM20: improved prediction of the regioselectivity of electrophilic aromatic substitutions.RegioSQM20:改进对亲电芳香取代区域选择性的预测。
J Cheminform. 2021 Feb 12;13(1):10. doi: 10.1186/s13321-021-00490-7.
8
Machine learning models for hydrogen bond donor and acceptor strengths using large and diverse training data generated by first-principles interaction free energies.利用第一性原理相互作用自由能生成的大量多样训练数据构建的氢键供体和受体强度机器学习模型。
J Cheminform. 2019 Sep 11;11(1):59. doi: 10.1186/s13321-019-0381-4.
9
DECIMER: towards deep learning for chemical image recognition.DECIMER:迈向用于化学图像识别的深度学习
J Cheminform. 2020 Oct 27;12(1):65. doi: 10.1186/s13321-020-00469-w.
10
Holistic Prediction of the pK in Diverse Solvents Based on a Machine-Learning Approach.基于机器学习方法的不同溶剂中 pK 的整体预测。
Angew Chem Int Ed Engl. 2020 Oct 19;59(43):19282-19291. doi: 10.1002/anie.202008528. Epub 2020 Aug 25.