Ibrahim Ahmed Farid
Department of Petroleum Engineering and Geosciences, King Fahd University of Petroleum and Minerals, 31261, Dhahran, Saudi Arabia.
Center for Integrative Petroleum Research, King Fahd University of Petroleum and Minerals, 31261, Dhahran, Saudi Arabia.
Sci Rep. 2025 May 30;15(1):18972. doi: 10.1038/s41598-025-02304-4.
The global transition to clean energy has highlighted hydrogen (H) as a sustainable fuel, with underground hydrogen storage (UHS) in geological formations emerging as a key solution. Accurately predicting fluid interactions, particularly interfacial tension (IFT), is critical for ensuring reservoir integrity and storage security in UHS. IFT is key in fluid behavior, influencing structural and residual trapping capacities. However, measuring IFT for H-brine systems is challenging due to H's volatility and the complexity of reservoir conditions. This study applies machine learning (ML) techniques to predict IFT between H and brine across various salt types, concentrations, and gas compositions. A dataset was used with variables such as temperature, pressure, brine salinity, and gas composition (H, CH, CO). Several ML models, including Random Forests (RF), Gradient Boosting Regressor (GBR), Extreme Gradient Boosting Regressor (XGBoost), Artificial Neural Networks (ANN), Decision Trees (DT), and Linear Regression (LR), were trained and evaluated. RF, GBR, and XGBoost achieved R values over 0.99 in training, 0.97 in testing, and all exceeded 0.975 in validation. These top models achieved RMSE values below 1.3 mN/m and MAPE values under 1.5%, confirming their high predictive accuracy. Residual frequency analysis and APRE results further confirmed these ensemble models' low bias and high reliability, with error distributions centered near zero. DT performed slightly lower, with R values of 0.93, while LR struggled to model the non-linear behavior of IFT. A novel salt equivalency metric was introduced, transforming multiple salt variables into a single parameter and improving model generalization while maintaining high prediction accuracy (R = 0.98). Sensitivity analysis and SHAP (Shapley Additive Explanations) analysis revealed temperature as the dominant factor influencing IFT, followed by CO concentration and pressure, while divalent salts (CaCl, MgCl) exhibited a stronger impact than monovalent salts (NaCl, KCl). This study optimizes hydrogen storage by offering a generalized, high-accuracy ML model that captures nonlinear fluid interactions in H-brine systems. Integrating real-world experimental data with ML-driven insights enhances reservoir simulation accuracy, improves hydrogen injection strategies, and supports the global transition toward sustainable energy storage solutions.
全球向清洁能源的转型使氢气(H)成为一种可持续燃料,地质构造中的地下氢气储存(UHS)成为关键解决方案。准确预测流体相互作用,特别是界面张力(IFT),对于确保UHS中的储层完整性和储存安全性至关重要。IFT是流体行为的关键,影响结构和残余捕集能力。然而,由于氢气的挥发性和储层条件的复杂性,测量氢-盐水系统的IFT具有挑战性。本研究应用机器学习(ML)技术预测不同盐类型、浓度和气体组成下氢气与盐水之间的IFT。使用了一个包含温度、压力、盐水盐度和气体组成(氢气、甲烷、二氧化碳)等变量的数据集。对包括随机森林(RF)、梯度提升回归器(GBR)、极端梯度提升回归器(XGBoost)、人工神经网络(ANN)、决策树(DT)和线性回归(LR)在内的几种ML模型进行了训练和评估。RF、GBR和XGBoost在训练中的R值超过0.99,测试中的R值为0.97,验证中的R值均超过0.975。这些顶级模型的均方根误差(RMSE)值低于1.3 mN/m,平均绝对百分比误差(MAPE)值低于1.5%,证实了它们的高预测准确性。残差频率分析和平均百分比相对误差(APRE)结果进一步证实了这些集成模型的低偏差和高可靠性,误差分布集中在零附近。DT的表现略低,R值为0.93,而LR难以对IFT的非线性行为进行建模。引入了一种新的盐当量度量,将多个盐变量转换为一个单一参数,在保持高预测准确性(R = 0.98)的同时提高了模型的泛化能力。敏感性分析和SHAP(Shapley值加法解释)分析表明,温度是影响IFT的主导因素,其次是二氧化碳浓度和压力,而二价盐(氯化钙、氯化镁)的影响比一价盐(氯化钠、氯化钾)更强。本研究通过提供一个能够捕捉氢-盐水系统中非线性流体相互作用的广义高精度ML模型,优化了氢气储存。将实际实验数据与ML驱动的见解相结合,提高了储层模拟的准确性,改进了氢气注入策略,并支持全球向可持续储能解决方案的转型。