Park Jaehong, Shim Youngseon, Lee Franklin, Rammohan Aravind, Goyal Sushmit, Shim Munbo, Jeong Changwook, Kim Dae Sin
Innovation Center, Samsung Electronics Co., Ltd., 1 Samsungjeonja-ro, Hwaseong-si, Gyeonggi-do 18448, Korea.
Science and Technology Division, Corning Incorporated, Corning, New York 14831, United States.
ACS Polym Au. 2022 Jan 21;2(4):213-222. doi: 10.1021/acspolymersau.1c00050. eCollection 2022 Aug 10.
We present machine learning models for the prediction of thermal and mechanical properties of polymers based on the graph convolutional network (GCN). GCN-based models provide reliable prediction performances for the glass transition temperature ( ), melting temperature ( ), density (ρ), and elastic modulus () with substantial dependence on the dataset, which is the best for ( ∼ 0.9) and worst for ( ∼ 0.5). It is found that the GCN representations for polymers provide prediction performances of their properties comparable to the popular extended-connectivity circular fingerprint (ECFP) representation. Notably, the GCN combined with the neural network regression (GCN-NN) slightly outperforms the ECFP. It is investigated how the GCN captures important structural features of polymers to learn their properties. Using the dimensionality reduction, we demonstrate that the polymers are organized in the principal subspace of the GCN representation spaces with respect to the backbone rigidity. The organization in the representation space adaptively changes with the training and through the NN layers, which might facilitate a subsequent prediction of target properties based on the relationships between the structure and the property. The GCN models are found to provide an advantage to automatically extract a backbone rigidity, strongly correlated with , as well as a potential transferability to predict other properties associated with a backbone rigidity. Our results indicate both the capability and limitations of the GCN in learning to describe polymer systems depending on the property.
我们提出了基于图卷积网络(GCN)的用于预测聚合物热性能和力学性能的机器学习模型。基于GCN的模型对玻璃化转变温度( )、熔点( )、密度(ρ)和弹性模量( )提供了可靠的预测性能,且很大程度上依赖于数据集,其中对 的预测效果最佳( ∼ 0.9),对 的预测效果最差( ∼ 0.5)。研究发现,聚合物的GCN表示提供的性能预测与流行的扩展连接性圆形指纹(ECFP)表示相当。值得注意的是,GCN与神经网络回归相结合(GCN-NN)略优于ECFP。研究了GCN如何捕捉聚合物的重要结构特征以了解其性能。通过降维,我们证明了聚合物在GCN表示空间的主子空间中相对于主链刚性进行组织。表示空间中的这种组织随着训练并通过神经网络层自适应变化,这可能有助于基于结构与性能之间的关系对目标性能进行后续预测。发现GCN模型具有自动提取与 高度相关的主链刚性的优势,以及预测与主链刚性相关的其他性能的潜在可转移性。我们的结果表明了GCN在学习描述聚合物系统方面的能力和局限性,这取决于所关注的性能。