Lhasa Limited, Granary Wharf House, 2 Canal Wharf, Leeds LS11 5PS, United Kingdom.
J Chem Inf Model. 2020 Oct 26;60(10):4560-4568. doi: 10.1021/acs.jcim.0c00483. Epub 2020 Oct 8.
Prediction of whether a compound is "aromatic" is at first glance a relatively simple task-does it obey Hückel's rule (planar cyclic π-system with 4n + 2 electrons) or not? However, aromaticity is far from a binary property, and there are distinct variations in the chemical and biological behavior of different systems which obey Hückel's rule and are thus classified as aromatic. To that end, the aromaticity of each molecule in a large public dataset was quantified by an extension of the work of Raczyńska et al. Building on this data, a method is proposed for machine learning the degree of aromaticity of each aromatic ring in a molecule. Categories are derived from the numeric results, allowing the differentiation of structural patterns between them and thus a better representation of the underlying chemical and biological behavior in expert and (Q)SAR systems.
预测一个化合物是否具有“芳香性”乍一看是一项相对简单的任务——它是否符合休克尔规则(平面环状π系统,具有 4n+2 个电子)?然而,芳香性远非二元性质,具有不同化学和生物学行为的不同系统之间存在明显的差异,这些系统遵守休克尔规则,因此被归类为芳香性。为此,通过扩展 Raczyńska 等人的工作,对大型公共数据集的每个分子的芳香性进行了量化。在此基础上,提出了一种用于机器学习分子中每个芳环芳香度的方法。从数值结果中得出类别,允许区分它们之间的结构模式,从而更好地反映专家和(QSAR)系统中潜在的化学和生物学行为。