• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

监督式机器学习和无监督数据压缩模型在利用钻井、岩石物理和测井数据进行孔隙压力预测中的应用。

Application of supervised machine learning and unsupervised data compression models for pore pressure prediction employing drilling, petrophysical, and well log data.

作者信息

Siddique Abu Bakker, Munshi Tanveer Alam, Rakin Nazmul Islam, Hashan Mahamudul, Chnapa Sushmita Sarker, Jahan Labiba Nusrat

机构信息

Department of Petroleum and Mining Engineering, Shahjalal University of Science and Technology, Sylhet, 3114, Bangladesh.

出版信息

Sci Rep. 2025 Jul 9;15(1):24706. doi: 10.1038/s41598-025-89199-3.

DOI:10.1038/s41598-025-89199-3
PMID:40634540
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12241424/
Abstract

Accurate determination of pore pressure is critical in the design of wells, determining a safe range of mud properties, and estimating the required mud weight to ensure wellbore stability. Conventional techniques for forecasting pore pressure, such as the Eaton, Bower, or compressibility methods, have certain constraints. These methods depend on empirical relationships and constants that can differ between basins. This study proposes an effective data-driven approach that utilizes machine learning algorithms to forecast reservoir pore pressure. A total of five machine learning algorithms, namely multivariable regression (MVR), polynomial regression (PR), random forest (RF), CatBoost regression, and multilayer perception (MLP), are applied in this research. Hybrid stacking modeling is employed for the first time to forecast pore pressure and to improve the accuracy and robustness of the results by combining different methodologies. Principal component analysis is also utilized (PCA) to extract features, hence expediting the entire process by reducing dimensionality. To accomplish the objectives, 1811 recordings are selected from the Volve Field, situated approximately 200 km west of Stavanger, Norway. These recordings encompass depth data; well logs, including NPHI, GR, DT, RD, RHOB, RS, and RT; drilling activities, specifically ROP; and petrophysical parameters, including BVW, K, PHIF, SW, and VCL. Pore pressure is used as the output level to generate data-driven models. 70% of the dataset is used for training the machine learning models, while the remaining 30% is reserved for testing the models to evaluate their performance and generalization capability. Data standardization is conducted to ensure that the utilized data is statistically well-distributed, devoid of measurement mistakes, and impervious to instrumental noise. Regression metrics, such as mean MAE, R, Adjusted R RMSE, MinE, and MaxE are employed to evaluate the efficacy of the models. The results suggest that the stacking model, which integrates CatBoost and Random Forest (RF) as base models and Polynomial Regression (PR) as the meta-model, achieves an R of 0.9846, an adjusted R of 0.9842, MAE of 11.20 and an RMSE of 22.747 on the testing dataset. This makes it the most accurate model for pore pressure prediction, followed closely by CatBoost. The MVR, exhibiting an R of 0.896 and an RMSE of 57.931, is the least efficient model. A thorough comparison of all analyzed models indicates that the algorithms, ranked by performance metrics, are Stack_2, CatBoost, Stack_1, RF, PR, Stack_3, MLP, and MVR. Hybrid stacking improves performance even without hyperparameter tuning. PCA significantly speeds up the entire process by lowering the number of dimensions, hence enhancing the cost-effectiveness of the procedure. Using a few petrophysical, drilling, and well log data, the methodology presented in this work can help engineers and researchers quickly and precisely determine the reservoir pore pressure, validating the safe and cost-effective drilling operations.

摘要

准确测定孔隙压力对于井的设计、确定泥浆性能的安全范围以及估算确保井筒稳定性所需的泥浆重量至关重要。传统的孔隙压力预测技术,如伊顿法、鲍尔法或压缩性方法,存在一定的局限性。这些方法依赖于经验关系和常数,而不同盆地之间这些关系和常数可能会有所不同。本研究提出了一种有效的数据驱动方法,利用机器学习算法来预测储层孔隙压力。本研究共应用了五种机器学习算法,即多变量回归(MVR)、多项式回归(PR)、随机森林(RF)、CatBoost回归和多层感知(MLP)。首次采用混合堆叠建模来预测孔隙压力,并通过结合不同方法提高结果的准确性和稳健性。还利用主成分分析(PCA)来提取特征,从而通过降维加快整个过程。为实现这些目标,从位于挪威斯塔万格以西约200公里处的沃尔夫油田选取了1811条记录。这些记录包括深度数据;测井数据,包括中子孔隙度指数(NPHI)、自然伽马(GR)、声波时差(DT)、电阻率(RD)、体积密度(RHOB)、饱和度(RS)和电阻率(RT);钻井活动,特别是机械钻速(ROP);以及岩石物理参数,包括束缚水体积(BVW)、渗透率(K)、孔隙度指数(PHIF)、含水饱和度(SW)和粘土含量(VCL)。孔隙压力用作输出量来生成数据驱动模型。数据集的70%用于训练机器学习模型,其余30%留作测试模型,以评估其性能和泛化能力。进行数据标准化以确保所使用的数据在统计上分布良好,没有测量错误,并且不受仪器噪声影响。采用回归指标,如平均平均绝对误差(MAE)、相关系数(R)、调整后的相关系数(Adjusted R)、均方根误差(RMSE)、最小误差(MinE)和最大误差(MaxE)来评估模型的有效性。结果表明,以CatBoost和随机森林(RF)作为基础模型、多项式回归(PR)作为元模型的堆叠模型在测试数据集上的相关系数(R)为0.9846,调整后的相关系数(Adjusted R)为0.9842,平均绝对误差(MAE)为11.20,均方根误差(RMSE)为22.747。这使其成为孔隙压力预测最准确的模型,其次是CatBoost。多变量回归(MVR)的相关系数(R)为0.896,均方根误差(RMSE)为57.931,是效率最低的模型。对所有分析模型的全面比较表明,如果按性能指标排序,这些算法依次为:Stack_2、CatBoost、Stack_1、RF、PR、Stack_3、MLP和MVR。即使不进行超参数调整,混合堆叠也能提高性能。主成分分析通过降低维数显著加快了整个过程,从而提高了该过程的成本效益。利用一些岩石物理、钻井和测井数据,本研究提出的方法可以帮助工程师和研究人员快速准确地确定储层孔隙压力,从而验证安全且经济高效的钻井作业。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/74f3797595e8/41598_2025_89199_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/6edbd847b124/41598_2025_89199_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/5d83e9c55b0b/41598_2025_89199_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/e2003b89d52b/41598_2025_89199_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/123ebc494cfd/41598_2025_89199_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/7c1a248be9f0/41598_2025_89199_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/a3e462493c3b/41598_2025_89199_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/911bf1bc7452/41598_2025_89199_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/bd9ead813a51/41598_2025_89199_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/f85714bb0657/41598_2025_89199_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/93091e016b74/41598_2025_89199_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/693e99713675/41598_2025_89199_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/d3dc47c45a41/41598_2025_89199_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/9d5ffe4114c0/41598_2025_89199_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/763194902b8e/41598_2025_89199_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/74f3797595e8/41598_2025_89199_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/6edbd847b124/41598_2025_89199_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/5d83e9c55b0b/41598_2025_89199_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/e2003b89d52b/41598_2025_89199_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/123ebc494cfd/41598_2025_89199_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/7c1a248be9f0/41598_2025_89199_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/a3e462493c3b/41598_2025_89199_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/911bf1bc7452/41598_2025_89199_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/bd9ead813a51/41598_2025_89199_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/f85714bb0657/41598_2025_89199_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/93091e016b74/41598_2025_89199_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/693e99713675/41598_2025_89199_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/d3dc47c45a41/41598_2025_89199_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/9d5ffe4114c0/41598_2025_89199_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/763194902b8e/41598_2025_89199_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19f3/12241424/74f3797595e8/41598_2025_89199_Fig15_HTML.jpg

相似文献

1
Application of supervised machine learning and unsupervised data compression models for pore pressure prediction employing drilling, petrophysical, and well log data.监督式机器学习和无监督数据压缩模型在利用钻井、岩石物理和测井数据进行孔隙压力预测中的应用。
Sci Rep. 2025 Jul 9;15(1):24706. doi: 10.1038/s41598-025-89199-3.
2
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
3
Short-Term Memory Impairment短期记忆障碍
4
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
5
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
6
Sexual Harassment and Prevention Training性骚扰与预防培训
7
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
8
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
9
Behavioral interventions to reduce risk for sexual transmission of HIV among men who have sex with men.降低男男性行为者中艾滋病毒性传播风险的行为干预措施。
Cochrane Database Syst Rev. 2008 Jul 16(3):CD001230. doi: 10.1002/14651858.CD001230.pub2.
10
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.

本文引用的文献

1
Hybrid physics-machine learning models for predicting rate of penetration in the Halahatang oil field, Tarim Basin.用于预测塔里木盆地哈拉哈塘油田机械钻速的物理-机器学习混合模型。
Sci Rep. 2024 Mar 12;14(1):5957. doi: 10.1038/s41598-024-56640-y.
2
Enhanced machine learning-ensemble method for estimation of oil formation volume factor at reservoir conditions.用于估算油藏条件下原油体积系数的增强型机器学习集成方法。
Sci Rep. 2023 Sep 14;13(1):15199. doi: 10.1038/s41598-023-42469-4.
3
Application of machine learning in predicting oil rate decline for Bakken shale oil wells.
机器学习在预测 Bakken 页岩油井产油量递减中的应用。
Sci Rep. 2022 Sep 28;12(1):16154. doi: 10.1038/s41598-022-20401-6.
4
A case study of petrophysical rock typing and permeability prediction using machine learning in a heterogenous carbonate reservoir in Iran.伊朗一个非均质碳酸盐岩油藏中利用机器学习进行岩石物理岩石分类和渗透率预测的案例研究。
Sci Rep. 2022 Mar 16;12(1):4505. doi: 10.1038/s41598-022-08575-5.
5
Data-Driven Modeling Approach for Pore Pressure Gradient Prediction while Drilling from Drilling Parameters.基于钻井参数的随钻孔隙压力梯度预测的数据驱动建模方法
ACS Omega. 2021 May 19;6(21):13807-13816. doi: 10.1021/acsomega.1c01340. eCollection 2021 Jun 1.