Bertinetto Carlo Giuseppe, Duce Celia, Micheli Alessio, Solaro Roberto, Tiné Maria Rosaria
Dept. of Chemistry and Industrial Chemistry, University of Pisa via Risorgimento 35, 56126 Pisa, Italy.
Dept. of Computer Science, University of Pisa, Largo B, Pontecorvo 3, 56127 Pisa, Italy.
Mol Inform. 2010 Sep 17;29(8-9):635-43. doi: 10.1002/minf.201000079. Epub 2010 Sep 21.
The glass transition temperature (Tg ) of acrylic and methacrylic random copolymers was investigated by means of Quantitative Structure-Property Relationship (QSPR) methodology based on Recursive Neural Networks (RNN). This method can directly take molecular structures as input, in the form of labelled trees, without needing predefined descriptors. It was applied to three data sets containing up to 615 polymers (340 homopolymers and 275 copolymers). The adopted representation was able to account for the structure of the repeating unit as well as average macromolecular characteristics, such as stereoregularity and molar composition. The best result, obtained on a data set focused on copolymers, showed a Mean Average Residual (MAR) of 4.9 K, a standard error of prediction (S) of 6.1 K and a squared correlation coefficient (R(2) ) of 0.98 for the test set, with an optimal rate with respect to the training error. Through the treatment of homopolymers and copolymers both as separated and merged data sets, we also showed that the proposed approach is particularly suited for generalizing prediction of polymer properties to various types of chemical structures in a uniform setting.
采用基于递归神经网络(RNN)的定量结构-性质关系(QSPR)方法,对丙烯酸和甲基丙烯酸无规共聚物的玻璃化转变温度(Tg)进行了研究。该方法能够直接将分子结构作为输入,以标记树的形式呈现,无需预先定义描述符。它被应用于三个数据集,其中包含多达615种聚合物(340种均聚物和275种共聚物)。所采用的表示方式能够解释重复单元的结构以及平均大分子特征,如立构规整性和摩尔组成。在一个聚焦于共聚物的数据集上获得的最佳结果显示,测试集的平均平均残差(MAR)为4.9 K,预测标准误差(S)为6.1 K,平方相关系数(R²)为0.98,相对于训练误差具有最优速率。通过将均聚物和共聚物分别作为独立数据集以及合并数据集进行处理,我们还表明,所提出的方法特别适合在统一环境中将聚合物性质的预测推广到各种类型的化学结构。