Department of Chemical Engineering , Bogazici University , Bebek , Besiktas, 34342 Istanbul , Turkey.
Department of Chemical and Biological Engineering , Koç University , Rumelifeneri Yolu , Sariyer, 34450 Istanbul , Turkey.
ACS Comb Sci. 2019 Apr 8;21(4):257-268. doi: 10.1021/acscombsci.8b00150. Epub 2019 Mar 13.
A database containing 2224 data points for CH storage or delivery in metal-organic frameworks (MOFs) was analyzed using machine-learning tools to extract knowledge for generalization. The database was first reviewed to observe the basic trends and patterns. It was then analyzed using decision trees and artificial neural networks (ANN) to extract hidden information and develop rules and heuristics for future studies. Five-fold cross validations were used in each analysis to test the validity of the models with data not seen before. Decision-tree analyses were carried out using six user-defined descriptors and two structural properties, separately. The crystal structure and the total degree of unsaturation were found to be the effective user-defined descriptors, whereas the pore volume and maximum pore diameter, as structural properties, were sufficient to determine the MOFs having high CH-storage capacity. Moreover, a high pore volume is always required, as expected. In ANN analyses, models were also developed by using user-defined descriptors and structural properties separately. It was observed that the user-defined descriptors were not sufficient to describe the CH-storage capacities of MOFs, whereas the structural properties in particular led to accurate CH-storage predictions with an RMSE of 26.8 and an R of 0.92 for testing.
分析了一个包含 2224 个金属有机骨架(MOF)中 CH 储存或输送数据点的数据库,使用机器学习工具从中提取知识以实现概括。首先对数据库进行了审查,以观察基本趋势和模式。然后使用决策树和人工神经网络(ANN)进行分析,以提取隐藏信息,并为未来的研究制定规则和启发式方法。在每次分析中都使用了五重交叉验证,以测试模型对以前未见数据的有效性。决策树分析分别使用了六个用户定义的描述符和两个结构特性。晶体结构和总不饱和度被发现是有效的用户定义描述符,而作为结构特性的孔体积和最大孔径足以确定具有高 CH 储存容量的 MOF。此外,正如预期的那样,高孔体积总是需要的。在 ANN 分析中,还分别使用用户定义的描述符和结构特性来开发模型。观察到用户定义的描述符不足以描述 MOF 的 CH 储存能力,而结构特性特别是导致了准确的 CH 储存预测,测试的 RMSE 为 26.8,R 为 0.92。