Suppr超能文献

用于聚合物带隙高精度预测的堆叠泛化方法。 (你提供的原文中“with”后面似乎缺少具体内容,我按照常见理解进行了补充翻译,若有偏差请指出。)

: Stacked Generalization with for Highly Accurate Predictions of Polymer Bandgap.

作者信息

Goh Kai Leong, Goto Atsushi, Lu Yunpeng

机构信息

School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, 50 Nanyang Avenue, Singapore 639798, Singapore.

出版信息

ACS Omega. 2022 Aug 15;7(34):29787-29793. doi: 10.1021/acsomega.2c02554. eCollection 2022 Aug 30.

Abstract

Recently, the Ramprasad group reported a quantitative structure-property relationship (QSPR) model for predicting the values of 4209 polymers, which yielded a test set score of 0.90 and a test set root-mean-square error (RMSE) score of 0.44 at a train/test split ratio of 80/20. In this paper, we present a new QSPR model named , which performs a two-level stacked generalization using the light gradient boosting machine. At level 1, multiple weak models are trained, and at level 2, they are combined into a strong final model. Four molecular fingerprints were generated from the simplified molecular input line entry system notations of the polymers. They were trimmed using recursive feature elimination and used as the initial input features for training the weak models. The output predictions of the weak models were used as the new input features for training the final model, which completes the model training process. Our results show that the best test set and the RMSE scores of at the train/test split ratio of 80/20 were 0.92 and 0.41, respectively. The accuracy scores further improved to 0.94 and 0.34, respectively, when the train/test split ratio of 95/5 was used.

摘要

最近,拉姆普拉萨德团队报告了一种用于预测4209种聚合物值的定量结构-性质关系(QSPR)模型,在80/20的训练/测试分割比例下,该模型的测试集得分0.90,测试集均方根误差(RMSE)得分为0.44。在本文中,我们提出了一种名为 的新QSPR模型,该模型使用轻梯度提升机进行两级堆叠泛化。在第1级,训练多个弱模型,在第2级,将它们组合成一个强大的最终模型。从聚合物的简化分子输入线性条目系统符号生成了四种分子指纹。使用递归特征消除对它们进行修剪,并将其用作训练弱模型的初始输入特征。弱模型的输出预测用作训练最终模型的新输入特征,从而完成 模型训练过程。我们的结果表明,在80/20的训练/测试分割比例下,最佳测试集 得分和RMSE得分分别为0.92和0.41。当使用95/5的训练/测试分割比例时,准确率得分分别进一步提高到0.94和0.34。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f4e6/9434625/99405708e0dc/ao2c02554_0002.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验