Saleh Khaled, Mabrouk Walid M, Metwally Ahmed
Geophysics Department, Faculty of Science, Cairo University, Giza, Egypt.
PetroShahd Company, Zahraa Maadi, Cairo, Egypt.
Sci Rep. 2025 Apr 29;15(1):14957. doi: 10.1038/s41598-025-97938-9.
Compressional sonic logs is one of the important logs for subsurface characterization, reservoir evaluation, and wellbore stability analysis. However, acquiring these logs is often challenging due to logistical constraints. This study explores the application of machine learning (ML) techniques to predict compressional sonic logs using conventional well logs from five wells. The methodology involves data preprocessing, feature selection, and training various regression models, including Random Forest, CatBoost, XGBoost, K-Nearest Neighbors (KNN), Support Vector Machines (SVM), and Deep Neural Networks (DNN). Model performance is optimized through hyperparameter tuning and evaluated using correlation coefficients and root mean square error (RMSE) metrics. Results indicate that ensemble models (Random Forest, CatBoost, XGBoost) achieve the highest accuracy, with correlation coefficients ranging from 89 to 89.6% and RMSE between 5.85 and 6.03. Additionally, feature engineering and data cleaning significantly improve model performance, while input scaling is essential for SVM, KNN, and DNN models. Incorporating blind well testing further enhances reliability. This study presents a robust ML-based workflow for predicting compressional sonic logs, offering a cost-effective solution for reservoir management and geomechanical analysis.
压缩声波测井是地下特征描述、储层评价和井筒稳定性分析的重要测井方法之一。然而,由于后勤限制,获取这些测井数据往往具有挑战性。本研究探索了机器学习(ML)技术在利用五口井的常规测井数据预测压缩声波测井方面的应用。该方法包括数据预处理、特征选择以及训练各种回归模型,包括随机森林、CatBoost、XGBoost、K近邻(KNN)、支持向量机(SVM)和深度神经网络(DNN)。通过超参数调整优化模型性能,并使用相关系数和均方根误差(RMSE)指标进行评估。结果表明,集成模型(随机森林、CatBoost、XGBoost)具有最高的准确率,相关系数在89%至89.6%之间,RMSE在5.85至6.03之间。此外,特征工程和数据清理显著提高了模型性能,而输入缩放对于SVM、KNN和DNN模型至关重要。纳入盲井测试进一步提高了可靠性。本研究提出了一种基于ML的稳健工作流程来预测压缩声波测井,为储层管理和地质力学分析提供了一种经济高效的解决方案。