Usman Jamilu, Abba Sani I, Abdu Fahad Jibrin, Yogarathinam Lukka Thuyavan, Usman Abdullah G, Lawal Dahiru, Salhi Billel, Aljundi Isam H
Interdisciplinary Research Centre for Membranes and Water Security (IRC-MWS), King Fahd University of Petroleum and Minerals Dhahran 31261 Saudi Arabia
Department of Chemical Engineering, Prince Mohammad Bin Fahd University Al Khobar 31952 Saudi Arabia.
RSC Adv. 2024 Oct 1;14(43):31259-31273. doi: 10.1039/d4ra06078d.
Addressing global freshwater scarcity requires innovative technological solutions, among which desalination through thin-film composite polyamide membranes stands out. The performance of these membranes plays a vital role in desalination, necessitating advanced predictive modeling for optimization. This study harnesses machine learning (ML) algorithms, including support vector machine (SVM), neural networks (NN), linear regression (LR), and multivariate linear regression (MLR), alongside their ensemble techniques to predict and enhance average water flux (AWF) and average salt rejection (ASR) essential metrics of desalination efficiency. To ensure model interpretability and feature importance analysis, SHapley Additive exPlanations (SHAP) were employed, providing both global and local insights into feature contributions. Initially, the individual models were validated, with NN demonstrating superior performance for both AWF and ASR, achieving the lowest mean absolute error (MAE = 0.001) and root mean squared error (RMSE = 0.0111) for AWF and an MAE = 0.0107 and RMSE = 0.0982 for ASR. The accuracy of predictions improved significantly with ensemble models, as evidenced by the near-perfect Nash-Sutcliffe efficiency (NSE) values. Specifically, the NN ensemble (NN-E) and Linear Regression ensemble (LR-E) reached an MAE and RMSE of 0.001 and 0.0111, respectively, for AWF. For ASR, NN-E reduced the MAE to 0.0013 and the RMSE to 0.0089, while LR-E maintained competitive performance with an MAE of 0.0133 and an RMSE of 0.0936. SHAP analysis revealed that features such as MDP and TMC were critical drivers of performance, with MDP showing the most significant positive impact on ASR. These findings demonstrate the dominance of ensemble methods over individual algorithms in predicting key desalination parameters. The enhanced precision in estimating AWF and ASR offered by these neuro-intelligent ensembles, combined with the interpretability provided by SHAP analysis, can lead to significant environmental and operational improvements in membrane performance, optimizing resource usage and minimizing ecological impacts. This study paves the way for integrating intelligent ML ensembles and SHAP-based interpretability into the practical field of membrane technology, marking a step forward toward sustainable and efficient desalination processes.
解决全球淡水短缺问题需要创新的技术解决方案,其中通过薄膜复合聚酰胺膜进行海水淡化尤为突出。这些膜的性能在海水淡化中起着至关重要的作用,因此需要先进的预测模型来进行优化。本研究利用机器学习(ML)算法,包括支持向量机(SVM)、神经网络(NN)、线性回归(LR)和多元线性回归(MLR),以及它们的集成技术,来预测和提高平均水通量(AWF)和平均脱盐率(ASR),这是海水淡化效率的关键指标。为了确保模型的可解释性和特征重要性分析,采用了SHapley加性解释(SHAP),提供了关于特征贡献的全局和局部见解。最初,对各个模型进行了验证,结果表明NN在AWF和ASR方面均表现出卓越的性能,对于AWF实现了最低的平均绝对误差(MAE = 0.001)和均方根误差(RMSE = 0.0111),对于ASR则MAE = 0.0107,RMSE = 0.0982。集成模型显著提高了预测的准确性,近乎完美的纳什-萨特克利夫效率(NSE)值证明了这一点。具体而言,对于AWF,NN集成(NN-E)和线性回归集成(LR-E)的MAE和RMSE分别达到了0.001和0.0111。对于ASR,NN-E将MAE降低到0.0013,RMSE降低到0.0089,而LR-E保持了具有竞争力的性能,MAE为0.0133,RMSE为0.0936。SHAP分析表明,诸如MDP和TMC等特征是性能的关键驱动因素,其中MDP对ASR的正向影响最为显著。这些发现表明,在预测关键海水淡化参数方面,集成方法优于单个算法。这些神经智能集成所提供的在估计AWF和ASR方面的更高精度,结合SHAP分析提供的可解释性,能够在膜性能方面带来显著的环境和操作改进,优化资源利用并最小化生态影响。本研究为将智能ML集成和基于SHAP的可解释性整合到膜技术实际领域铺平了道路,标志着朝着可持续和高效海水淡化过程迈出了一步。