Nedyalkova Miroslava, Heredia Diana, Barroso-Flores Joaquín, Lattuada Marco
Swiss National Center for Competence in Research (NCCR) Bio-inspired Materials, University of Fribourg, Chemin des Verdiers 4, Fribourg CH-1700, Switzerland.
Department of Inorganic Chemistry, Faculty of Chemistry and Pharmacy, University of Sofia 'St. Kl. Ohridski', Sofia 1504, Bulgaria.
ACS Omega. 2025 Jan 16;10(3):3128-3140. doi: 10.1021/acsomega.4c10413. eCollection 2025 Jan 28.
Arsonic acids (RAsO(OH)), prevalent in contaminated food, water, air, and soil, pose significant environmental and health risks due to their variable ionization states, which influence key properties such as lipophilicity, solubility, and membrane permeability. Accurate p prediction for these compounds is critical yet challenging, as existing models often exhibit limitations across diverse chemical spaces. This study presents a comparative analysis of p predictions for arsonic acids using a support vector machine-based machine learning (ML) approach and three density functional theory (DFT)-based models. The DFT models evaluated include correlations to the maximum surface electrostatic potential ( ), atomic charges derived from a solvation model (solvation model based on density), and a scaled solvent-accessible surface method. Results indicate that the scaled solvent-accessible surface approach yielded high mean unsigned errors, rendering it less effective. In contrast, the atomic charge-based method on the conjugated arsonate base provided the most accurate predictions. The ML-based approach demonstrated strong predictive performance, suggesting its potential utility in broader chemical spaces. The obtained values for p from show a weak prediction level, because the way of predicting p is related only to the electrostatic character of the molecule. However, p is influenced by many factors, including the molecular structure, solvation, resonance, inductive effects, and local atomic environments. cannot fully capture these different interactions, as it gives a simplistic view of the overall molecular potential field.
砷酸(RAsO(OH))在受污染的食物、水、空气和土壤中普遍存在,由于其可变的电离状态会影响诸如亲脂性、溶解度和膜通透性等关键特性,因而对环境和健康构成重大风险。准确预测这些化合物的pKa具有至关重要的意义,但也颇具挑战性,因为现有模型在不同化学空间中往往存在局限性。本研究对使用基于支持向量机的机器学习(ML)方法和三种基于密度泛函理论(DFT)的模型来预测砷酸的pKa进行了比较分析。所评估的DFT模型包括与最大表面静电势( )的相关性、源自溶剂化模型(基于密度的溶剂化模型)的原子电荷以及一种缩放的溶剂可及表面方法。结果表明,缩放的溶剂可及表面方法产生了较高的平均绝对误差,效果较差。相比之下,基于共轭砷酸根碱的原子电荷方法提供了最准确的预测。基于ML的方法表现出强大的预测性能,表明其在更广泛化学空间中的潜在效用。从 获得的pKa值显示出较弱的预测水平,因为预测pKa的方式仅与分子的静电特征相关。然而,pKa受许多因素影响,包括分子结构、溶剂化、共振、诱导效应和局部原子环境。 无法完全捕捉这些不同的相互作用,因为它对整体分子势场给出了过于简单的看法。