Materials Genome Institute, Shanghai University, and Shanghai Materials Genome Institute, Shanghai 200444, China.
Department of Chemistry, College of Sciences, Shanghai University, Shanghai 200444, China.
J Phys Chem B. 2021 Jan 21;125(2):601-611. doi: 10.1021/acs.jpcb.0c08674. Epub 2021 Jan 7.
Polymer band gap is one of the most important properties associated with electric conductivity. In this work, the machine learning model called support vector regression (SVR) was developed to predict the polymer band gap, where the training data of the polymer band gap were obtained from DFT computation while the descriptors were generated from Dragon. After feature selection with the maximum relevance minimum redundancy, the SVR model using 16 key features as inputs gave the optimal performance for predicting polymer band gaps. The determination coefficient () of the SVR model between the DFT computations and SVR predictions of polymer band gaps reached as high as 0.824 for the leave-one-out cross-validation and 0.925 for the independent test. Besides, the 16 key features were explored through correlation analysis and sensitivity analysis. The available model can be used to screen out the polymers with targeted band gaps before experiments, which is very helpful for rapid design of new polymers.
聚合物带隙是与电导率最相关的重要性质之一。在这项工作中,开发了一种称为支持向量回归(SVR)的机器学习模型来预测聚合物带隙,其中聚合物带隙的训练数据是从 DFT 计算中获得的,而描述符是从 Dragon 中生成的。在用最大相关性最小冗余进行特征选择后,使用 16 个关键特征作为输入的 SVR 模型在预测聚合物带隙方面表现出最佳性能。DFT 计算和 SVR 预测聚合物带隙之间的 SVR 模型的确定系数()在留一交叉验证中高达 0.824,在独立测试中高达 0.925。此外,还通过相关分析和敏感性分析探索了 16 个关键特征。该模型可用于在实验前筛选出具有目标带隙的聚合物,这对于快速设计新型聚合物非常有帮助。