Blind prediction of cyclohexane-water distribution coefficients from the SAMPL5 challenge.

Author information

Bannan Caitlin C, Burley Kalistyn H, Chiu Michael, Shirts Michael R, Gilson Michael K, Mobley David L

Affiliations

Department of Chemistry, University of California, 147 Bison Modular, Irvine, CA, 92697, USA.

Department of Pharmaceutical Sciences, University of California, 147 Bison Modular, Irvine, CA, 92697, USA.

Publication information

J Comput Aided Mol Des. 2016 Nov;30(11):927-944. doi: 10.1007/s10822-016-9954-8. Epub 2016 Sep 27.

Abstract

In the recent SAMPL5 challenge, participants submitted predictions for cyclohexane/water distribution coefficients for a set of 53 small molecules. Distribution coefficients (log D) replace the hydration free energies that were a central part of the past five SAMPL challenges. A wide variety of computational methods were represented by the 76 submissions from 18 participating groups. Here, we analyze submissions by a variety of error metrics and provide details for a number of reference calculations we performed. As in the SAMPL4 challenge, we assessed the ability of participants to evaluate not just their statistical uncertainty, but also their model uncertainty: how well they can predict the magnitude of their model or force field error for specific predictions. Unfortunately, this remains an area where prediction and analysis need improvement. In SAMPL4, the top-performing submissions achieved a root-mean-squared error (RMSE) around 1.5 kcal/mol. If we anticipate accuracy in log D predictions to be similar to the hydration free energy predictions in SAMPL4, the expected error here would be around 1.54 log units. Only a few submissions had an RMSE below 2.5 log units in their predicted log D values. However, distribution coefficients introduced complexities not present in past SAMPL challenges, including tautomer enumeration, that are likely to be important in predicting biomolecular properties of interest to drug discovery; some decrease in accuracy would therefore be expected. Overall, the SAMPL5 distribution coefficient challenge provided great insight into the importance of modeling a variety of physical effects. We believe these types of measurements will be a promising source of data for future blind challenges, especially in view of the relatively straightforward nature of the experiments and the level of insight provided.
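The abstract compares an error in kcal/mol with an error in log units without showing the conversion. The sketch below illustrates one plausible route, and it is not taken from the abstract: it assumes a temperature of 298.15 K, divides a free energy by RT ln 10 to express it in log10 units, and assumes that a transfer free energy carries two independent solvation errors (water and cyclohexane) that combine in quadrature. The function names are hypothetical.

```python
import math

# Gas constant in kcal/(mol*K).
R_KCAL_PER_MOL_K = 1.987204e-3


def free_energy_to_log_units(dg_kcal_per_mol: float,
                             temperature_k: float = 298.15) -> float:
    """Express a free energy magnitude (or error) in log10 units.

    Divides by RT ln(10). The sign convention of log D is ignored
    because only error magnitudes are of interest here.
    The default temperature of 298.15 K is an assumption.
    """
    return dg_kcal_per_mol / (R_KCAL_PER_MOL_K * temperature_k * math.log(10))


def combine_independent_errors(*errors: float) -> float:
    """Combine independent errors in quadrature (root sum of squares)."""
    return math.sqrt(sum(e * e for e in errors))


# RMSE of the best SAMPL4 hydration free energy submissions, per the abstract.
single_phase_error = 1.5  # kcal/mol

# Assumption: equal, independent errors for the water and cyclohexane phases.
transfer_error = combine_independent_errors(single_phase_error,
                                            single_phase_error)

print(f"single-phase error: {free_energy_to_log_units(single_phase_error):.2f} log units")
print(f"transfer error:     {free_energy_to_log_units(transfer_error):.2f} log units")
```

Under these assumptions a single 1.5 kcal/mol error corresponds to roughly 1.1 log units, and the combined two-phase error to about 1.55 log units, which is close to the 1.54 quoted in the abstract. The small difference may come from rounding or a different temperature; the abstract does not say.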
