• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在自动化机器学习框架中使用特征选择方法进行数据驱动的总有机碳预测。

Data-driven total organic carbon prediction using feature selection methods incorporated in an automated machine learning framework.

作者信息

Macêdo Bruno da Silva, Wayo Dennis Delali Kwesi, Campos Deivid, De Santis Rodrigo Barbosa, Martinho Alfeu Dias, Yaseen Zaher Mundher, Saporetti Camila Martins, Goliatt Leonardo

机构信息

Department of Computer Science, Federal University of Lavras, Lavras, MG, 37200-000, Brazil.

Faculty of Chemical and Process Engineering Technology, Universiti Malaysia Pahang Al-Sultan Abdullah, 26300, Kuantan, Malaysia.

出版信息

Sci Rep. 2025 Mar 27;15(1):10658. doi: 10.1038/s41598-025-91224-4.

DOI:10.1038/s41598-025-91224-4
PMID:40148387
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11950205/
Abstract

An accurate assessment of shale gas resources is highly important for the sustainable development of these energy resources. Total organic carbon (TOC) analysis thus becomes fundamental for understanding the distribution and quality of hydrocarbon source rocks within a shale gas reservoir. The elevation of the TOC is often associated with the presence of source rocks, indicating the potential for oil and gas production. TOC assessment is performed using laboratory methods, which can be time-consuming and costly. Data-driven models have been successfully applied to model the relationship between TOC and other constituents and to predict the TOC content. However, these methods depend on extensive parameter adjustments that must be carefully conducted in different sedimentary environments. In this context, Automated Machine Learning (AutoML) is an alternative for accurately predicting TOCs, saving time-consuming fine-tuning steps in model development. This study aims to develop an AutoML strategy for estimating TOC using well log data. This procedure automatically preprocesses the search for the best method parameters, reducing the execution time. Among the methods evaluated, Extremely Randomized Trees (XT) performed best (R = 0.8632, MSE = 0.1806) in the test set. The proposed strategy provides a powerful data-driven method, which allows real-world use of the well to assist in data analysis and subsequent decision-making.

摘要

准确评估页岩气资源对这些能源的可持续发展至关重要。因此,总有机碳(TOC)分析成为了解页岩气藏内烃源岩分布和质量的基础。TOC的升高通常与烃源岩的存在相关,表明油气生产的潜力。TOC评估采用实验室方法进行,这可能既耗时又昂贵。数据驱动模型已成功应用于模拟TOC与其他成分之间的关系并预测TOC含量。然而,这些方法依赖于必须在不同沉积环境中仔细进行的大量参数调整。在这种情况下,自动机器学习(AutoML)是准确预测TOC的一种替代方法,可节省模型开发中耗时的微调步骤。本研究旨在开发一种使用测井数据估算TOC的AutoML策略。该过程会自动预处理以寻找最佳方法参数,从而减少执行时间。在所评估的方法中,极端随机树(XT)在测试集中表现最佳(R = 0.8632,MSE = 0.1806)。所提出的策略提供了一种强大的数据驱动方法,可在实际中利用测井数据协助数据分析和后续决策。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/2feceb3e969a/41598_2025_91224_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/b9e69138b48a/41598_2025_91224_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/f95b5b2001c4/41598_2025_91224_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/fe7dd09a6511/41598_2025_91224_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/0a1adab15f3c/41598_2025_91224_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/2a623980bb8f/41598_2025_91224_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/a7cf8e2d8868/41598_2025_91224_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/411b73034a01/41598_2025_91224_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/e3dd389f7147/41598_2025_91224_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/2feceb3e969a/41598_2025_91224_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/b9e69138b48a/41598_2025_91224_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/f95b5b2001c4/41598_2025_91224_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/fe7dd09a6511/41598_2025_91224_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/0a1adab15f3c/41598_2025_91224_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/2a623980bb8f/41598_2025_91224_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/a7cf8e2d8868/41598_2025_91224_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/411b73034a01/41598_2025_91224_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/e3dd389f7147/41598_2025_91224_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/454a/11950205/2feceb3e969a/41598_2025_91224_Fig9_HTML.jpg

相似文献

1
Data-driven total organic carbon prediction using feature selection methods incorporated in an automated machine learning framework.在自动化机器学习框架中使用特征选择方法进行数据驱动的总有机碳预测。
Sci Rep. 2025 Mar 27;15(1):10658. doi: 10.1038/s41598-025-91224-4.
2
Prediction of Total Organic Carbon in Organic-Rich Shale Rocks Using Thermal Neutron Parameters.利用热中子参数预测富含有机质页岩中的总有机碳
ACS Omega. 2023 Jan 24;8(5):4790-4801. doi: 10.1021/acsomega.2c06918. eCollection 2023 Feb 7.
3
Hierarchical automated machine learning (AutoML) for advanced unconventional reservoir characterization.用于高级非常规储层表征的分层自动机器学习(AutoML)。
Sci Rep. 2023 Aug 24;13(1):13812. doi: 10.1038/s41598-023-40904-0.
4
Experimental Simulation of Hydrocarbon Expulsion in Semi-open Systems from Variable Organic Richness Source Rocks.半开放系统中不同有机丰度源岩排烃的实验模拟
ACS Omega. 2021 May 24;6(22):14664-14676. doi: 10.1021/acsomega.1c01800. eCollection 2021 Jun 8.
5
Predicting Grape Sugar Content under Quality Attributes Using Normalized Difference Vegetation Index Data and Automated Machine Learning.基于归一化植被指数数据和自动化机器学习预测葡萄含糖量与品质属性。
Sensors (Basel). 2022 Apr 23;22(9):3249. doi: 10.3390/s22093249.
6
Application of Various Machine Learning Techniques in Predicting Total Organic Carbon from Well Logs.各种机器学习技术在测井总有机碳预测中的应用。
Comput Intell Neurosci. 2021 Aug 30;2021:7390055. doi: 10.1155/2021/7390055. eCollection 2021.
7
Hydrocarbon Generation from Low-Mature Saline Lacustrine Sediments Studied Using Machine Learning and Chemometric Methods: The Succession of the Sikeshu Sag, Junggar Basin, NW China.运用机器学习和化学计量学方法研究低成熟盐湖相沉积物的生烃作用:中国西北准噶尔盆地四棵树凹陷的演化
ACS Omega. 2023 Mar 7;8(11):10314-10334. doi: 10.1021/acsomega.2c07991. eCollection 2023 Mar 21.
8
Machine learning-based fracturing parameter optimization for horizontal wells in Panke field shale oil.基于机器学习的盘克油田页岩油水平井压裂参数优化
Sci Rep. 2024 Mar 13;14(1):6046. doi: 10.1038/s41598-024-56660-8.
9
Geochemical Evaluation of Paleocene Source Rocks in the Kohat Sub-Basin, Pakistan.巴基斯坦科哈特次盆地古新世烃源岩的地球化学评价
ACS Omega. 2024 Mar 15;9(12):14123-14141. doi: 10.1021/acsomega.3c09457. eCollection 2024 Mar 26.
10
Lithofacies Types and Physical Characteristics of Organic-Rich Shale in the Wufeng-Longmaxi Formation, Xichang Basin, China.中国西昌盆地五峰-龙马溪组富有机质页岩的岩相类型及物理特征
ACS Omega. 2023 May 15;8(20):18165-18179. doi: 10.1021/acsomega.3c01307. eCollection 2023 May 23.

本文引用的文献

1
Short-term streamflow modeling using data-intelligence evolutionary machine learning models.使用数据智能进化机器学习模型进行短期径流建模。
Sci Rep. 2023 Aug 24;13(1):13824. doi: 10.1038/s41598-023-41113-5.
2
Hierarchical automated machine learning (AutoML) for advanced unconventional reservoir characterization.用于高级非常规储层表征的分层自动机器学习(AutoML)。
Sci Rep. 2023 Aug 24;13(1):13812. doi: 10.1038/s41598-023-40904-0.
3
Artificial intelligence for the diagnosis of clinically significant prostate cancer based on multimodal data: a multicenter study.
基于多模态数据的人工智能诊断临床显著前列腺癌:一项多中心研究。
BMC Med. 2023 Jul 24;21(1):270. doi: 10.1186/s12916-023-02964-x.
4
Review of ML and AutoML Solutions to Forecast Time-Series Data.用于预测时间序列数据的机器学习和自动机器学习解决方案综述。
Arch Comput Methods Eng. 2022;29(7):5297-5311. doi: 10.1007/s11831-022-09765-0. Epub 2022 Jun 1.
5
Data-Driven Approaches to Predict Thermal Maturity Indices of Organic Matter Using Artificial Neural Networks.基于数据驱动的方法利用人工神经网络预测有机质热成熟度指标
ACS Omega. 2020 Sep 30;5(40):26169-26181. doi: 10.1021/acsomega.0c03751. eCollection 2020 Oct 13.
6
Mutual information between discrete and continuous data sets.离散数据集与连续数据集之间的互信息。
PLoS One. 2014 Feb 19;9(2):e87357. doi: 10.1371/journal.pone.0087357. eCollection 2014.