Ghule Siddharth, Dash Soumya Ranjan, Bagchi Sayan, Joshi Kavita, Vanka Kumar
Physical and Materials Chemistry Division, CSIR-National Chemical Laboratory (CSIR-NCL), Dr. Homi Bhabha Road, Pashan, Pune 411008, India.
Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India.
ACS Omega. 2022 Mar 29;7(14):11742-11755. doi: 10.1021/acsomega.1c06856. eCollection 2022 Apr 12.
This study investigates four machine-learning (ML) models to predict the redox potentials of phenazine derivatives in dimethoxyethane using density functional theory (DFT). A small data set of 151 phenazine derivatives having only one type of functional group per molecule (20 unique groups) was used for the training. Prediction accuracy was improved by a combined strategy of feature selection and hyperparameter optimization, using the external validation set. Models were evaluated on the external test set containing new functional groups and diverse molecular structures. High prediction accuracies of > 0.74 were obtained on the external test set. Despite being trained on the molecules with a single type of functional group, models were able to predict the redox potentials of derivatives containing multiple and different types of functional groups with good accuracies ( > 0.7). This type of performance for predicting redox potential from such a small and simple data set of phenazine derivatives has never been reported before. Redox flow batteries (RFBs) are emerging as promising candidates for energy storage systems. However, new green and efficient materials are required for their widespread usage. We believe that the hybrid DFT-ML approach demonstrated in this report would help in accelerating the virtual screening of phenazine derivatives, thus saving computational and experimental costs. Using this approach, we have identified promising phenazine derivatives for green energy storage systems such as RFBs.
本研究使用密度泛函理论(DFT)研究了四种机器学习(ML)模型,以预测吩嗪衍生物在二甲氧基乙烷中的氧化还原电位。使用一个包含151种吩嗪衍生物的小数据集进行训练,每个分子仅有一种官能团(共20种独特的官能团)。通过使用外部验证集进行特征选择和超参数优化的组合策略,提高了预测准确性。在包含新官能团和多样分子结构的外部测试集上对模型进行了评估。在外部测试集上获得了大于0.74的高预测准确率。尽管模型是在仅含单一类型官能团的分子上进行训练的,但它们能够以良好的准确率(大于0.7)预测含有多种不同类型官能团的衍生物的氧化还原电位。此前从未报道过从如此小且简单的吩嗪衍生物数据集中预测氧化还原电位能有这种类型的表现。氧化还原液流电池(RFBs)正成为储能系统中有前景的候选者。然而,其广泛应用需要新的绿色高效材料。我们相信本报告中展示的混合DFT-ML方法将有助于加速吩嗪衍生物的虚拟筛选,从而节省计算和实验成本。使用这种方法,我们已经为诸如RFBs等绿色储能系统确定了有前景的吩嗪衍生物。