Blind prediction of cyclohexane-water distribution coefficients from the SAMPL5 challenge.

Author information

Bannan Caitlin C, Burley Kalistyn H, Chiu Michael, Shirts Michael R, Gilson Michael K, Mobley David L

Affiliations

Department of Chemistry, University of California, 147 Bison Modular, Irvine, CA, 92697, USA.

Department of Pharmaceutical Sciences, University of California, 147 Bison Modular, Irvine, CA, 92697, USA.

Publication information

J Comput Aided Mol Des. 2016 Nov;30(11):927-944. doi: 10.1007/s10822-016-9954-8. Epub 2016 Sep 27.

DOI:10.1007/s10822-016-9954-8

PMID:27677750

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5209301/

Abstract

In the recent SAMPL5 challenge, participants submitted predictions of cyclohexane/water distribution coefficients for a set of 53 small molecules. Distribution coefficients (log D) replace the hydration free energies that were a central part of the past five SAMPL challenges. A wide variety of computational methods were represented by the 76 submissions from 18 participating groups. Here, we analyze submissions by a variety of error metrics and provide details for a number of reference calculations we performed. As in the SAMPL4 challenge, we assessed the ability of participants to evaluate not just their statistical uncertainty but also their model uncertainty: how well they can predict the magnitude of their model or force field error for specific predictions. Unfortunately, this remains an area where prediction and analysis need improvement. In SAMPL4 the top performing submissions achieved a root-mean-squared error (RMSE) around 1.5 kcal/mol. If we anticipate accuracy in log D predictions to be similar to the hydration free energy predictions in SAMPL4, the expected error here would be around 1.54 log units. Only a few submissions had an RMSE below 2.5 log units in their predicted log D values. However, distribution coefficients introduced complexities not present in past SAMPL challenges, including tautomer enumeration, that are likely to be important in predicting biomolecular properties of interest to drug discovery; some decrease in accuracy would therefore be expected. Overall, the SAMPL5 distribution coefficient challenge provided great insight into the importance of modeling a variety of physical effects. We believe these types of measurements will be a promising source of data for future blind challenges, especially in view of the relatively straightforward nature of the experiments and the level of insight provided.
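The abstract does not spell out how an RMSE of 1.5 kcal/mol in hydration free energies maps to roughly 1.54 log units. The sketch below shows one plausible route, under two assumptions that are not stated in the abstract: the errors in the water and cyclohexane solvation free energies are independent and of equal size (so they add in quadrature), and the temperature is 298.15 K. The function names `kcal_to_log_units` and `rmse` are illustrative only.

```python
import math

# Gas constant in kcal/(mol*K).
R_KCAL = 1.987204e-3


def kcal_to_log_units(dg_kcal, temperature=298.15):
    """Convert a free energy (or free energy error) in kcal/mol to log10 units.

    Uses delta_G = -RT ln(10) * log D; only the magnitude matters for an error.
    The default temperature is an assumption, not a value from the abstract.
    """
    return dg_kcal / (R_KCAL * temperature * math.log(10))


def rmse(predicted, experimental):
    """Root-mean-squared error between two equal-length sequences."""
    if len(predicted) != len(experimental):
        raise ValueError("sequences must have the same length")
    squared = [(p - e) ** 2 for p, e in zip(predicted, experimental)]
    return math.sqrt(sum(squared) / len(squared))


# SAMPL4 hydration free energy RMSE quoted in the abstract.
hydration_rmse = 1.5  # kcal/mol

# Assumption: log D depends on two solvation free energies with independent
# errors of this size, so the transfer free energy error is sqrt(2) larger.
transfer_rmse = math.sqrt(2) * hydration_rmse  # about 2.1 kcal/mol

expected_log_units = kcal_to_log_units(transfer_rmse)
```

Under these assumptions `expected_log_units` comes out near 1.55; rounding the intermediate transfer error to 2.1 kcal/mol before converting gives 1.54, which matches the figure quoted in the abstract.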

Similar articles

Blind prediction of cyclohexane-water distribution coefficients from the SAMPL5 challenge.

J Comput Aided Mol Des. 2016 Nov;30(11):927-944. doi: 10.1007/s10822-016-9954-8. Epub 2016 Sep 27.

Assessing the accuracy of octanol-water partition coefficient predictions in the SAMPL6 Part II log P Challenge.

J Comput Aided Mol Des. 2020 Apr;34(4):335-370. doi: 10.1007/s10822-020-00295-0. Epub 2020 Feb 27.

Measuring experimental cyclohexane-water distribution coefficients for the SAMPL5 challenge.

J Comput Aided Mol Des. 2016 Nov;30(11):945-958. doi: 10.1007/s10822-016-9971-7. Epub 2016 Oct 7.

Extended solvent-contact model approach to blind SAMPL5 prediction challenge for the distribution coefficients of drug-like molecules.

J Comput Aided Mol Des. 2016 Nov;30(11):1019-1033. doi: 10.1007/s10822-016-9928-x. Epub 2016 Jul 23.

Calculation of distribution coefficients in the SAMPL5 challenge from atomic solvation parameters and surface areas.

J Comput Aided Mol Des. 2016 Nov;30(11):1079-1086. doi: 10.1007/s10822-016-9951-y. Epub 2016 Sep 1.

All-atom/coarse-grained hybrid predictions of distribution coefficients in SAMPL5.

J Comput Aided Mol Des. 2016 Nov;30(11):969-976. doi: 10.1007/s10822-016-9926-z. Epub 2016 Jul 26.

Prediction of cyclohexane-water distribution coefficients with COSMO-RS on the SAMPL5 data set.

J Comput Aided Mol Des. 2016 Nov;30(11):959-967. doi: 10.1007/s10822-016-9927-y. Epub 2016 Jul 26.

Predicting cyclohexane/water distribution coefficients for the SAMPL5 challenge using MOSCED and the SMD solvation model.

J Comput Aided Mol Des. 2016 Nov;30(11):1007-1017. doi: 10.1007/s10822-016-9945-9. Epub 2016 Aug 26.

The SAMPL5 challenge for embedded-cluster integral equation theory: solvation free energies, aqueous pKa, and cyclohexane-water log D.

J Comput Aided Mol Des. 2016 Nov;30(11):1035-1044. doi: 10.1007/s10822-016-9939-7. Epub 2016 Aug 23.

Prediction of cyclohexane-water distribution coefficients for the SAMPL5 data set using molecular dynamics simulations with the OPLS-AA force field.

J Comput Aided Mol Des. 2016 Nov;30(11):1045-1058. doi: 10.1007/s10822-016-9949-5. Epub 2016 Aug 31.

Cited by

ABCG2: A Milestone Charge Model for Accurate Solvation Free Energy Calculation.

J Chem Theory Comput. 2025 Mar 25;21(6):3032-3043. doi: 10.1021/acs.jctc.5c00038. Epub 2025 Mar 11.

Expanded ensemble predictions of toluene-water partition coefficients in the SAMPL9 log P challenge.

Phys Chem Chem Phys. 2025 Mar 19;27(12):6005-6013. doi: 10.1039/d4cp03621b.

Influence of Selective Deoxyfluorination on the Molecular Structure of Type-2 N-Acetyllactosamine.

J Org Chem. 2024 Sep 6;89(17):11875-11890. doi: 10.1021/acs.joc.4c00879. Epub 2024 Aug 23.

Development and test of highly accurate endpoint free energy methods. 3: partition coefficient prediction using a Poisson-Boltzmann method combined with a solvent accessible surface area model for SAMPL challenges.

Phys Chem Chem Phys. 2023 Dec 21;26(1):85-94. doi: 10.1039/d3cp04174c.

Predicting absolute aqueous solubility by applying a machine learning model for an artificially liquid-state as proxy for the solid-state.

J Comput Aided Mol Des. 2023 Dec;37(12):765-789. doi: 10.1007/s10822-023-00538-w. Epub 2023 Oct 25.

Artificial intelligence for natural product drug discovery.

Nat Rev Drug Discov. 2023 Nov;22(11):895-916. doi: 10.1038/s41573-023-00774-7. Epub 2023 Sep 11.

Best practices for constructing, preparing, and evaluating protein-ligand binding affinity benchmarks [Article v0.1].

Living J Comput Mol Sci. 2022;4(1). doi: 10.33011/livecoms.4.1.1497. Epub 2022 Aug 30.

An overview of the SAMPL8 host-guest binding challenge.

J Comput Aided Mol Des. 2022 Oct;36(10):707-734. doi: 10.1007/s10822-022-00462-5. Epub 2022 Oct 14.

Does Hamiltonian Replica Exchange via Lambda-Hopping Enhance the Sampling in Alchemical Free Energy Calculations?

Molecules. 2022 Jul 11;27(14):4426. doi: 10.3390/molecules27144426.

CACHE (Critical Assessment of Computational Hit-finding Experiments): A public-private partnership benchmarking initiative to enable the development of computational methods for hit-finding.

Nat Rev Chem. 2022 Apr;6(4):287-295. doi: 10.1038/s41570-022-00363-z. Epub 2022 Feb 15.
