UFZ Department of Ecological Chemistry, Helmholtz Centre for Environmental Research, Permoserstrasse 15, 04318 Leipzig, Germany.
J Chem Inf Model. 2010 Jul 26;50(7):1223-32. doi: 10.1021/ci1001179.
An algorithm is introduced that enables a fast generation of all possible prototropic tautomers resulting from the mobile H atoms and associated heteroatoms as defined in the InChI code. The InChI-derived set of possible tautomers comprises (1,3)-shifts for open-chain molecules and (1,n)-shifts (with n being an odd number >3) for ring systems. In addition, our algorithm includes also, as extension to the InChI scope, those larger (1,n)-shifts that can be constructed from joining separate but conjugated InChI sequences of tautomer-active heteroatoms. The developed algorithm is described in detail, with all major steps illustrated through explicit examples. Application to approximately 72,500 organic compounds taken from EINECS (European Inventory of Existing Commercial Chemical Substances) shows that around 11% of the substances occur in different heteroatom-prototropic tautomeric forms. Additional QSAR (quantitative structure-activity relationship) predictions of their soil sorption coefficient and water solubility reveal variations across tautomers up to more than two and 4 orders of magnitude, respectively. For a small subset of nine compounds, analysis of quantum chemically predicted tautomer energies supports the view that among all tautomers of a given compound, those restricted to H atom exchanges between heteroatoms usually include the thermodynamically most stable structures.
引入了一种算法,能够快速生成由 InChI 代码中定义的可移动 H 原子和相关杂原子产生的所有可能的互变异构体。由 InChI 衍生的可能互变异构体集包括开链分子的(1,3)-移位和环系统的(1,n)-移位(其中 n 是大于 3 的奇数)。此外,我们的算法还扩展了 InChI 范围,包括那些可以由连接互变异构活性杂原子的单独但共轭 InChI 序列构建的更大的(1,n)-移位。详细描述了开发的算法,并通过显式示例说明了所有主要步骤。对来自 EINECS(欧洲现有商业化学物质清单)的约 72500 种有机化合物的应用表明,约 11%的物质以不同的杂原子互变异构形式存在。对其土壤吸附系数和水溶解度的附加 QSAR(定量结构-活性关系)预测显示,互变异构体之间的变化高达两个和 4 个数量级以上。对于九种化合物的一小部分,量子化学预测互变异构体能量的分析支持这样一种观点,即在给定化合物的所有互变异构体中,那些限制在杂原子之间的 H 原子交换的互变异构体通常包括热力学上最稳定的结构。