Zhu Jie, Zhao Yuhan, Hu Qiujia, Zhang Yang, Shao Tangsha, Fan Bin, Jiang Yaodong, Chen Zhen, Zhao Meng
, China University of Mining and Technology, Beijing Campus, Beijing 100083, China.
PetroChina Huabei Oilfield Co, Renqiu 062550, China.
ACS Omega. 2022 Apr 7;7(15):13083-13094. doi: 10.1021/acsomega.2c00519. eCollection 2022 Apr 19.
It is of great significance to evaluate and predict coalbed methane (CBM) production for the exploitation and exploration of CBM. The flow characteristics of gas and water are very complicated and important in the process of CBM exploitation. In recent years, machine learning has been introduced to analyze CBM well production and its influence based on the historical production data. However, there are some problems with the determination of hyperparameters in machine learning algorithms. Some previous random forests (RF) models of CBM production prediction were suitable for individual CBM wells, but for different types of CBM wells, a large amount of time is needed to adjust the hyperparameters. Therefore, a genetic algorithm (GA) was applied to optimize RF, and a hybrid GA-RF algorithm was presented to solve this problem, which can automatically adjust two important hyperparameters, and , and adapt different types of CBM wells. Meanwhile, the Pearson method and RF were carried out in this work to analyze the data of CBM well production to avoid multicollinearity caused by the improper selection of the model's independent variables. The importance and correlation analysis of drainage control parameters, including casing pressure ( ), bottom-hole pressure ( ), stroke frequency ( ), liquid column depth ( ), daily decline of bottom-hole pressure ( ), and daily decline of casing pressure ( ) were obtained. It was found that the casing pressure, bottom-hole pressure, and stroke frequency had more effects on the gas production of CBM wells than other drainage control parameters. Furthermore, the correlation and importance order of the influencing factors were: > > > > > and > > > > > , respectively. A CBM production model based on the GA-RF algorithm was constructed to study and predict the gas production of CBM wells in Qinshui Basin, China. Compared with the production model based on RF, this model can automatically optimize its hyperparameters to adapt to different types of CBM wells, and the mean-square-error of the GA-RF algorithm can be reduced by 40-60% than that of RF. 93% of the training errors were less than 5%, and 89% of the prediction errors were less than 10%. The GA-RF model can spot promptly the main influencing factors of CBM production and has high accuracy for the production prediction of CBM wells.
评估和预测煤层气(CBM)产量对于煤层气的开发和勘探具有重要意义。在煤层气开采过程中,气水流动特性非常复杂且至关重要。近年来,基于历史生产数据引入机器学习来分析煤层气井产量及其影响因素。然而,机器学习算法中的超参数确定存在一些问题。以往一些用于煤层气产量预测的随机森林(RF)模型适用于单个煤层气井,但对于不同类型的煤层气井,需要大量时间来调整超参数。因此,应用遗传算法(GA)对RF进行优化,提出了一种混合GA - RF算法来解决此问题,该算法可以自动调整两个重要超参数,并适应不同类型的煤层气井。同时,在这项工作中采用皮尔逊方法和RF对煤层气井生产数据进行分析,以避免因模型自变量选择不当导致的多重共线性。获得了排水控制参数的重要性和相关性分析结果,包括套管压力()、井底压力()、冲次()、液柱深度()、井底压力日降()和套管压力日降()。研究发现,套管压力、井底压力和冲次对煤层气井产气的影响比其他排水控制参数更大。此外,影响因素的相关性和重要性顺序分别为:> > > > > > 和 > > > > > > 。构建了基于GA - RF算法的煤层气产量模型,用于研究和预测中国沁水盆地煤层气井的产气情况。与基于RF的产量模型相比,该模型可以自动优化其超参数以适应不同类型的煤层气井(此处括号内容原文缺失),并且GA - RF算法的均方误差比RF降低了40 - 60%。93%的训练误差小于5%,89%的预测误差小于10%。GA - RF模型能够迅速找出煤层气产量的主要影响因素,对煤层气井产量预测具有较高的准确性。