Devillers J, Doucet-Panaye A, Doucet J P
a CTIS , Rillieux La Pape, France.
SAR QSAR Environ Res. 2015;26(4):263-78. doi: 10.1080/1062936X.2015.1026571. Epub 2015 Apr 11.
An attempt was made to derive structure-activity models allowing the prediction of the larvicidal activity of structurally diverse chemicals against mosquitoes. A database of 188 chemicals with their activity on Aedes aegypti larvae was constituted from analysis of original publications. The activity values were expressed in log 1/IC50 (concentration required to produce 50% inhibition of larval development, mmol). All the chemicals were encoded by means of CODESSA and autocorrelation descriptors. Partial least squares analysis, classification and regression tree, random forest and boosting regression tree analyses, Kohonen self-organizing maps, linear artificial neural networks, three-layer perceptrons, radial basis function artificial neural networks and support vector machines with linear, polynomial, radial basis function and sigmoid kernels were tested as statistical tools. Because quantitative models did not give good results, a two-class model was designed. The three-layer perceptron significantly outperformed the other statistical approaches regardless of the threshold value used to split the data into active and inactive compounds. The most interesting configuration included eight autocorrelation descriptors as input neurons and four neurons in the hidden layer. This led to more than 96% of good predictions on both the training set and external test set of 88 and 100 chemicals, respectively. From the overall simulation results, new candidate molecules were proposed which will be shortly synthesized and tested.
人们尝试推导结构-活性模型,以预测结构多样的化学物质对蚊子的杀幼虫活性。通过对原始文献的分析,构建了一个包含188种化学物质及其对埃及伊蚊幼虫活性的数据库。活性值以log 1/IC50(产生50%幼虫发育抑制所需的浓度,mmol)表示。所有化学物质均通过CODESSA和自相关描述符进行编码。测试了偏最小二乘法分析、分类与回归树、随机森林和提升回归树分析、Kohonen自组织映射、线性人工神经网络、三层感知器、径向基函数人工神经网络以及具有线性、多项式、径向基函数和Sigmoid核的支持向量机等统计工具。由于定量模型效果不佳,设计了一个两类模型。无论用于将数据分为活性和非活性化合物的阈值如何,三层感知器的表现均显著优于其他统计方法。最有趣的配置包括八个自相关描述符作为输入神经元,隐藏层中有四个神经元。这分别在包含88种和100种化学物质的训练集和外部测试集上实现了超过96%的良好预测。根据总体模拟结果,提出了新的候选分子,不久将进行合成和测试。