Klopman G, Rosenkranz H
Department of Chemistry, Case Western Reserve University, Cleveland, OH 44106.
Environ Health Perspect. 1991 Dec;96:67-75. doi: 10.1289/ehp.919667.
The most important criteria for the development and analysis of databases for elucidating the structural bases of toxicological activity include the integrity of the databases with respect to uniformity of the experimental protocol and interpretation of the test results and inclusion of chemicals representing different chemical classes and differing mechanisms of action. Within these criteria, it is demonstrated that when the chemicals are chosen at random, the larger the database, the better the predictivity of chemicals not included in the learning set. It is shown however, that when chemicals are selected on the basis of structural features, that a learning set of approximately 180 chemicals is as informative as a database consisting of 800 chemicals chosen at random.
数据库在实验方案一致性和测试结果解释方面的完整性,以及纳入代表不同化学类别和不同作用机制的化学物质。在这些标准范围内,结果表明,当随机选择化学物质时,数据库越大,对学习集中未包含的化学物质的预测性就越好。然而,结果显示,当根据结构特征选择化学物质时,大约180种化学物质的学习集与由800种随机选择的化学物质组成的数据库信息量相当。