Franke R, Huebel S, Streich W J
Environ Health Perspect. 1985 Sep;61:239-55. doi: 10.1289/ehp.8561239.
For large and diverse data sets, simple QSAR methods based on linear and additive models can no longer be applied. In such cases topological methods using descriptors directly derivable from two-dimensional chemical structures provide a useful alternative. The results of such analyses can be used for lead optimization, to guide biological testing and even aid in the design of novel compounds. Various types of topological descriptors and algorithms are briefly discussed. Which of those is to be selected depends on the objective of the investigation and the properties of the data set. Two new methods, LOGANA and LOCON, are discussed in some more detail. With the help of these methods, substructural patterns ("topological pharmacophores") characteristic of compounds possessing a certain biological property can be evaluated. Both methods are designed in such a way that full use can be made of the data handling capacity of computers while maintaining an optimal impact of the experience of the researcher. They are model-free and do not require any mathematical knowledge. While LOGANA deals with semiquantitative or even qualitative biological data, LOCON can be applied to activity data on a continuous scale. The basic procedure in both cases consists in the stepwise combination of substructural descriptors by the logical operations "and," "or" and "not." With a simple example the utility of the methods is demonstrated.
对于大型多样的数据集,基于线性和加性模型的简单定量构效关系(QSAR)方法已不再适用。在这种情况下,使用可直接从二维化学结构导出的描述符的拓扑方法提供了一种有用的替代方法。此类分析的结果可用于先导化合物优化,指导生物学测试,甚至有助于新型化合物的设计。简要讨论了各种类型的拓扑描述符和算法。选择哪一种取决于研究目的和数据集的特性。对两种新方法LOGANA和LOCON进行了更详细的讨论。借助这些方法,可以评估具有特定生物学特性的化合物的亚结构模式(“拓扑药效团”)。这两种方法的设计方式是,在充分利用计算机数据处理能力的同时,保持研究人员经验的最佳影响。它们无需模型,也不需要任何数学知识。LOGANA处理半定量甚至定性的生物学数据,而LOCON可应用于连续尺度的活性数据。两种情况下的基本程序都是通过逻辑运算“与”、“或”和“非”逐步组合亚结构描述符。通过一个简单的例子展示了这些方法的实用性。