Nantasenamat Chanin, Isarankura-Na-Ayudhya Chartchalerm, Tansila Natta, Naenna Thanakorn, Prachayasittikul Virapong
Department of Clinical Microbiology, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand.
J Comput Chem. 2007 May;28(7):1275-89. doi: 10.1002/jcc.20656.
The prediction of the excitation and the emission maxima of green fluorescent protein (GFP) chromophores were investigated by a quantitative structure-property relationship study. A data set of 19 GFP color variants and an additional data set consisting of 29 synthetic GFP chromophores were collected from the literature. Artificial neural network implementing the back-propagation algorithm was employed. The proposed computational approach reliably predicted the excitation and the emission maxima of GFP chromophores with correlation coefficient exceeding 0.9. The usefulness of quantum chemical descriptors was revealed by a comparative study with other molecular descriptors. Assignment of appropriate protonation state of the chromophore for the GFP color variants data set was shown to be necessary for good predictive performance. Results suggest that the confinement of the GFP chromophore has no significant influence on the predictive performance of the data set used. A comparative investigation with the traditional modeling methods, particularly multiple linear regression and partial least squares, reveals that artificial neural network is the most suitable modeling approach for the GFP spectral properties. It is anticipated that this methodology has great potential in accelerating the design and engineering of novel GFP color variants of scientific or industrial interest.
通过定量结构-性质关系研究,对绿色荧光蛋白(GFP)发色团的激发和发射最大值进行了预测。从文献中收集了19个GFP颜色变体的数据集以及由29个合成GFP发色团组成的另一个数据集。采用了实现反向传播算法的人工神经网络。所提出的计算方法可靠地预测了GFP发色团的激发和发射最大值,相关系数超过0.9。通过与其他分子描述符的比较研究,揭示了量子化学描述符的有用性。对于GFP颜色变体数据集,显示出为发色团指定合适的质子化状态对于良好的预测性能是必要的。结果表明,GFP发色团的受限对所用数据集的预测性能没有显著影响。与传统建模方法(特别是多元线性回归和偏最小二乘法)的比较研究表明,人工神经网络是用于GFP光谱性质的最合适建模方法。预计该方法在加速设计和工程化具有科学或工业价值的新型GFP颜色变体方面具有巨大潜力。