Plumb A Philip, Rowe Raymond C, York Peter, Brown Martin
Pharmaceutical and Analytical R&D, AstraZeneca R&D Charnwood, Bakewell Road, Loughborough, Leicestershire LE11 5RH, UK.
Eur J Pharm Sci. 2005 Jul-Aug;25(4-5):395-405. doi: 10.1016/j.ejps.2005.04.010.
The purpose of this study was to determine whether artificial neural network (ANN) programs implementing different backpropagation algorithms and default settings are capable of generating equivalent highly predictive models. Three ANN packages were used: INForm, CAD/Chem and MATLAB. Twenty variants of gradient descent, conjugate gradient, quasi-Newton and Bayesian regularization algorithms were used to train networks containing a single hidden layer of 3-12 nodes. All INForm and CAD/Chem models trained satisfactorily for tensile strength, disintegration time and percentage dissolution at 15, 30, 45 and 60 min. Similarly, acceptable training was obtained for MATLAB models using Bayesian regularization. Training of MATLAB models with other algorithms was erratic. This effect was attributed to a tendency for the MATLAB implementation of the algorithms to attenuate training in local minima of the error surface. Predictive models for tablet capping and friability could not be generated. The most predictive models from each ANN package varied with respect to the optimum network architecture and training algorithm. No significant differences were found in the predictive ability of these models. It is concluded that comparable models are obtainable from different ANN programs provided that both the network architecture and training algorithm are optimised. A broad strategy for optimisation of the predictive ability of an ANN model is proposed.
本研究的目的是确定实施不同反向传播算法和默认设置的人工神经网络(ANN)程序是否能够生成等效的高预测性模型。使用了三个ANN软件包:INForm、CAD/Chem和MATLAB。使用了梯度下降、共轭梯度、拟牛顿和贝叶斯正则化算法的20种变体来训练包含一个具有3 - 12个节点的隐藏层的网络。所有INForm和CAD/Chem模型在拉伸强度、崩解时间以及在15、30、45和60分钟时的溶出百分比方面都训练得令人满意。同样,使用贝叶斯正则化的MATLAB模型也获得了可接受的训练结果。使用其他算法对MATLAB模型进行训练时不稳定。这种效应归因于算法在MATLAB中的实现倾向于在误差表面的局部最小值处减弱训练。无法生成片剂顶裂和脆碎度的预测模型。每个ANN软件包中最具预测性的模型在最佳网络架构和训练算法方面各不相同。这些模型的预测能力没有发现显著差异。得出的结论是,只要网络架构和训练算法都经过优化,就可以从不同的ANN程序中获得可比的模型。提出了一种优化ANN模型预测能力的广泛策略。