Mohammadian Erfan, Kheirollahi Mahdi, Liu Bo, Ostadhassan Mehdi, Sabet Maziyar
Key Laboratory of Continental Shale Hydrocarbon Accumulation and Efficient Development, Ministry of Education, Northeast Petroleum University, Daqing, 163318, China.
School of Mining Engineering, College of Engineering, University of Tehran, Tehran, Iran.
Sci Rep. 2022 Mar 16;12(1):4505. doi: 10.1038/s41598-022-08575-5.
Petrophysical rock typing (PRT) and permeability prediction are of great significance for various disciplines of oil and gas industry. This study offers a novel, explainable data-driven approach to enhance the accuracy of petrophysical rock typing via a combination of supervised and unsupervised machine learning methods. 128 core data, including porosity, permeability, connate water saturation (S), and radius of pore throats at 35% mercury injection (R) were obtained from a heterogeneous carbonate reservoir in Iran and used to train a supervised machine learning algorithm called Extreme Gradient Boosting (XGB). The algorithm output was a modified formation zone index (FZIM*), which was used to accurately estimate permeability (R = 0.97) and R (R = 0.95). Moreover, FZIM* was combined with an unsupervised machine learning algorithm (K-means clustering) to find the optimum number of PRTs. 4 petrophysical rock types (PRTs) were identified via this method, and the range of their properties was discussed. Lastly, shapely values and parameter importance analysis were conducted to explain the correlation between each input parameter and the output and the contribution of each parameter on the value of FZIM*. Permeability and R were found to be most influential parameters, where S had the lowest impact on FZIM*.
岩石物理岩石分类(PRT)和渗透率预测对石油和天然气行业的各个学科都具有重要意义。本研究提供了一种新颖的、可解释的数据驱动方法,通过监督学习和无监督学习方法相结合来提高岩石物理岩石分类的准确性。从伊朗一个非均质碳酸盐岩储层获取了128个岩心数据,包括孔隙度、渗透率、原生水饱和度(S)和汞注入量为35%时的喉道半径(R),并用于训练一种名为极端梯度提升(XGB)的监督学习算法。该算法的输出是一个修正的地层带指数(FZIM*),用于准确估计渗透率(R = 0.97)和R(R = 0.95)。此外,FZIM与一种无监督学习算法(K均值聚类)相结合,以找到PRT的最佳数量。通过该方法识别出4种岩石物理岩石类型(PRT),并讨论了它们的属性范围。最后,进行了Shapley值和参数重要性分析,以解释每个输入参数与输出之间的相关性以及每个参数对FZIM值的贡献。发现渗透率和R是最具影响力的参数,而S对FZIM*的影响最小。