Liu Xujie, Wang Yang, Yuan Jiongpeng, Li Xiaojing, Wu Siwei, Bao Ying, Feng Zhenzhen, Ou Feilong, He Yan
School of Biomedical and Pharmaceutical Sciences, Guangdong University of Technology, Guangzhou 510006, China.
Bioengineering (Basel). 2022 Sep 30;9(10):517. doi: 10.3390/bioengineering9100517.
Metal-organic frameworks (MOFs) have been widely researched as drug delivery systems due to their intrinsic porous structures. Herein, machine learning (ML) technologies were applied for the screening of MOFs with high drug loading capacity. To achieve this, first, a comprehensive dataset was gathered, including 40 data points from more than 100 different publications. The organic linkers, metal ions, and the functional groups, as well as the surface area and the pore volume of the investigated MOFs, were chosen as the model's inputs, and the output was the ibuprofen (IBU) loading capacity. Thereafter, various advanced and powerful machine learning algorithms, such as support vector regression (SVR), random forest (RF), adaptive boosting (AdaBoost), and categorical boosting (CatBoost), were employed to predict the ibuprofen loading capacity of MOFs. The coefficient of determination (R) of 0.70, 0.72, 0.66, and 0.76 were obtained for the SVR, RF, AdaBoost, and CatBoost approaches, respectively. Among all the algorithms, CatBoost was the most reliable, exhibiting superior performance regarding the sparse matrices and categorical features. Shapley additive explanations (SHAP) analysis was employed to explore the impact of the eigenvalues of the model's outputs. Our initial results indicate that this methodology is a well generalized, straightforward, and cost-effective method that can be applied not only for the prediction of IBU loading capacity, but also in many other biomaterials projects.
金属有机框架(MOFs)由于其固有的多孔结构,已被广泛研究用作药物递送系统。在此,机器学习(ML)技术被应用于筛选具有高药物负载能力的MOFs。为实现这一目标,首先收集了一个综合数据集,其中包括来自100多篇不同出版物的40个数据点。选择所研究MOFs的有机连接体、金属离子、官能团以及表面积和孔体积作为模型的输入,输出为布洛芬(IBU)负载能力。此后,采用了各种先进且强大的机器学习算法,如支持向量回归(SVR)、随机森林(RF)、自适应提升(AdaBoost)和分类提升(CatBoost)来预测MOFs的布洛芬负载能力。SVR、RF、AdaBoost和CatBoost方法分别获得了0.70、0.72、0.66和0.76的决定系数(R)。在所有算法中,CatBoost最可靠,在稀疏矩阵和分类特征方面表现出卓越性能。采用夏普利加性解释(SHAP)分析来探究模型输出特征值的影响。我们的初步结果表明,该方法是一种通用、直接且经济高效的方法,不仅可用于预测IBU负载能力,还可应用于许多其他生物材料项目。