• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于有机溶剂 SMILES 编码的机器学习预测经验极性。

Machine learning prediction of empirical polarity using SMILES encoding of organic solvents.

机构信息

Department of Chemistry & Centre for Advanced Studies in Chemistry, Panjab University, Chandigarh, 160014, India.

出版信息

Mol Divers. 2023 Oct;27(5):2331-2343. doi: 10.1007/s11030-022-10559-6. Epub 2022 Nov 5.

DOI:10.1007/s11030-022-10559-6
PMID:36334165
Abstract

Machine learning based statistical models have played a significant role in increasing the speed and accuracy with which the chemical and physical properties of chemical compounds can be predicted as compared to the experimental, and traditional ab initio and quantum mechanical approaches. The transformative impact that these techniques have, in the field of chemical sciences has completely changed the way experiments are designed. The last decade has seen the prominence of computer-aided molecular design based on machine learning algorithms. The major challenge has been the generation of machine-readable data in the form of descriptors and observations for training the model, which can again be time-consuming and computationally expensive if atomic coordinates based molecular encoding approach is used. In this study, we have tried to solve this problem using SMILES representation of molecules for generating various topological, physicochemical, electronic and steric descriptors using open-source cheminformatics packages. With the aid of the data generated using these packages, we have been able to develop a simple and explainable quantitative structure property relationship model using artificial neural network based on 7 numerical descriptors and 1 categorical descriptor for predicting the empirical polarity of a wide diversity of organic solvents. Since polarity is the representation of various solute-solvent and solvent-solvent interactions taking place in an organic transformation, its intuition beforehand will definitely help a chemist in a better experimental design. An ANN algorithm based on 8 descriptors was successfully employed to predict the E(30) values of organic solvents.

摘要

基于机器学习的统计模型在提高化合物化学和物理性质预测的速度和准确性方面发挥了重要作用,与实验以及传统的从头算和量子力学方法相比。这些技术在化学科学领域产生的变革性影响彻底改变了实验设计的方式。过去十年见证了基于机器学习算法的计算机辅助分子设计的兴起。主要的挑战是生成可用于训练模型的机器可读数据,以描述符和观察值的形式呈现,如果使用基于原子坐标的分子编码方法,这可能会既耗时又昂贵。在这项研究中,我们试图使用分子的 SMILES 表示来解决这个问题,以便使用开源化学信息学软件包生成各种拓扑、物理化学、电子和立体描述符。借助这些软件包生成的数据,我们已经能够使用基于 7 个数值描述符和 1 个类别描述符的人工神经网络开发一个简单且可解释的定量构效关系模型,用于预测广泛多样的有机溶剂的经验极性。由于极性是有机转化中发生的各种溶质-溶剂和溶剂-溶剂相互作用的表示,因此事先了解极性肯定会帮助化学家进行更好的实验设计。成功地使用基于 8 个描述符的 ANN 算法来预测有机溶剂的 E(30) 值。

相似文献

1
Machine learning prediction of empirical polarity using SMILES encoding of organic solvents.基于有机溶剂 SMILES 编码的机器学习预测经验极性。
Mol Divers. 2023 Oct;27(5):2331-2343. doi: 10.1007/s11030-022-10559-6. Epub 2022 Nov 5.
2
Explainable Solvation Free Energy Prediction Combining Graph Neural Networks with Chemical Intuition.结合图神经网络与化学直觉的可解释溶剂化自由能预测
J Chem Inf Model. 2022 Nov 28;62(22):5457-5470. doi: 10.1021/acs.jcim.2c01013. Epub 2022 Nov 1.
3
Explainable Supervised Machine Learning Model To Predict Solvation Gibbs Energy.可解释的监督机器学习模型,用于预测溶剂化吉布斯自由能。
J Chem Inf Model. 2024 Apr 8;64(7):2250-2262. doi: 10.1021/acs.jcim.3c00544. Epub 2023 Aug 21.
4
Importance of Engineered and Learned Molecular Representations in Predicting Organic Reactivity, Selectivity, and Chemical Properties.在预测有机反应性、选择性和化学性质方面,工程化和学习的分子表示的重要性。
Acc Chem Res. 2021 Feb 16;54(4):827-836. doi: 10.1021/acs.accounts.0c00745. Epub 2021 Feb 3.
5
Comparative Analysis of Chemical Descriptors by Machine Learning Reveals Atomistic Insights into Solute-Lipid Interactions.基于机器学习的化学描述符对比分析揭示了溶质-脂质相互作用的原子水平见解。
Mol Pharm. 2024 Jul 1;21(7):3343-3355. doi: 10.1021/acs.molpharmaceut.4c00080. Epub 2024 May 23.
6
Predicting absolute aqueous solubility by applying a machine learning model for an artificially liquid-state as proxy for the solid-state.通过应用机器学习模型预测人工液态(作为固态的替代物)的绝对水溶解度。
J Comput Aided Mol Des. 2023 Dec;37(12):765-789. doi: 10.1007/s10822-023-00538-w. Epub 2023 Oct 25.
7
Solvent-Specific Featurization for Predicting Free Energies of Solvation through Machine Learning.通过机器学习进行溶剂特异性特征化,以预测溶剂化自由能。
J Chem Inf Model. 2019 Apr 22;59(4):1338-1346. doi: 10.1021/acs.jcim.8b00901. Epub 2019 Mar 18.
8
Prediction of non-ideal behavior of polarity/polarizability scales of solvent mixtures by integration of a novel COSMO-RS molecular descriptor and neural networks.通过整合新型COSMO-RS分子描述符和神经网络预测溶剂混合物极性/极化率标度的非理想行为。
Phys Chem Chem Phys. 2008 Oct 21;10(39):5967-75. doi: 10.1039/b807617k. Epub 2008 Aug 11.
9
Performance comparison of nonlinear and linear regression algorithms coupled with different attribute selection methods for quantitative structure - retention relationships modelling in micellar liquid chromatography.胶束液相色谱中非线性和线性回归算法与不同属性选择方法相结合的定量结构 - 保留关系建模的性能比较。
J Chromatogr A. 2020 Jul 19;1623:461146. doi: 10.1016/j.chroma.2020.461146. Epub 2020 Apr 29.
10
Predicting drug solubility in organic solvents mixtures: A machine-learning approach supported by high-throughput experimentation.预测有机溶剂混合物中的药物溶解度:一种基于高通量实验的机器学习方法。
Int J Pharm. 2024 Jul 20;660:124233. doi: 10.1016/j.ijpharm.2024.124233. Epub 2024 May 18.

引用本文的文献

1
Solvent Screening for Separation Processes Using Machine Learning and High-Throughput Technologies.利用机器学习和高通量技术进行分离过程的溶剂筛选
Chem Bio Eng. 2025 Mar 5;2(4):210-228. doi: 10.1021/cbe.4c00170. eCollection 2025 Apr 24.
2
Machine Learning-Aided Inverse Design and Discovery of Novel Polymeric Materials for Membrane Separation.机器学习辅助的用于膜分离的新型高分子材料逆设计与发现
Environ Sci Technol. 2025 Jan 21;59(2):993-1012. doi: 10.1021/acs.est.4c08298. Epub 2024 Dec 16.
3
Integrating structure annotation and machine learning approaches to develop graphene toxicity models.

本文引用的文献

1
Machine learning meets mechanistic modelling for accurate prediction of experimental activation energies.机器学习与机理建模相结合以精确预测实验活化能。
Chem Sci. 2020 Nov 5;12(3):1163-1175. doi: 10.1039/d0sc04896h. eCollection 2021 Jan 21.
2
Why 90% of clinical drug development fails and how to improve it?为什么90%的临床药物研发会失败以及如何改进?
Acta Pharm Sin B. 2022 Jul;12(7):3049-3062. doi: 10.1016/j.apsb.2022.02.002. Epub 2022 Feb 11.
3
Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead.
整合结构注释和机器学习方法以开发石墨烯毒性模型。
Carbon N Y. 2023 Feb;204:484-494. doi: 10.1016/j.carbon.2022.12.065. Epub 2022 Dec 26.
停止为高风险决策解释黑箱机器学习模型,转而使用可解释模型。
Nat Mach Intell. 2019 May;1(5):206-215. doi: 10.1038/s42256-019-0048-x. Epub 2019 May 13.
4
Artificial intelligence-enabled virtual screening of ultra-large chemical libraries with deep docking.基于深度对接的人工智能辅助超大规模化学库虚拟筛选。
Nat Protoc. 2022 Mar;17(3):672-697. doi: 10.1038/s41596-021-00659-2. Epub 2022 Feb 4.
5
A machine learning approach for predicting the nucleophilicity of organic molecules.一种用于预测有机分子亲核性的机器学习方法。
Phys Chem Chem Phys. 2022 Jan 19;24(3):1821-1829. doi: 10.1039/d1cp05072a.
6
Machine learning enabling prediction of the bond dissociation enthalpy of hypervalent iodine from SMILES.机器学习能够根据 SMILES 预测高价碘键离解焓。
Sci Rep. 2021 Oct 12;11(1):20207. doi: 10.1038/s41598-021-99369-8.
7
Influence of solvent structure and hydrogen bonding on catalysis at solid-liquid interfaces.溶剂结构和氢键对固液界面催化的影响。
Chem Soc Rev. 2021 Nov 15;50(22):12308-12337. doi: 10.1039/d1cs00539a.
8
Predicting Solvent-Dependent Nucleophilicity Parameter with a Causal Structure Property Relationship.用因果结构属性关系预测溶剂依赖性亲核性参数。
J Chem Inf Model. 2021 Oct 25;61(10):4890-4899. doi: 10.1021/acs.jcim.1c00610. Epub 2021 Sep 22.
9
Progress on open chemoinformatic tools for expanding and exploring the chemical space.用于拓展和探索化学空间的开放 chemoinformatic 工具的进展。
J Comput Aided Mol Des. 2022 May;36(5):341-354. doi: 10.1007/s10822-021-00399-1. Epub 2021 Jun 18.
10
Advancing Discovery in Chemistry with Artificial Intelligence: From Reaction Outcomes to New Materials and Catalysts.利用人工智能推动化学领域的发现:从反应结果到新材料和催化剂
Acc Chem Res. 2021 May 18;54(10):2335-2336. doi: 10.1021/acs.accounts.1c00232.