Tolstaya Ekaterina, Shakirov Anuar, Mezghani Mokhles, Safonov Sergey
Aramco Innovations LLC, 117127 Moscow, Russia.
EXPEC Advanced Research Center Saudi Aramco, Dhahran 34466, Saudi Arabia.
J Imaging. 2023 Jun 21;9(7):126. doi: 10.3390/jimaging9070126.
In this paper, we considered one of the problems that arise during drilling automation, namely the automation of lithology identification from drill cuttings images. Usually, this work is performed by experienced geologists, but this is a tedious and subjective process. Drill cuttings are the cheapest source of rock formation samples; therefore, reliable lithology prediction can greatly reduce the cost of analysis during drilling. To predict the lithology content from images of cuttings samples, we used a convolutional neural network (CNN). For training a model with an acceptable generalization ability, we applied dataset-cleaning techniques, which help to reveal bad samples, as well as samples with uncertain labels. It was shown that the model trained on a cleaned dataset performs better in terms of accuracy. Data cleaning was performed using a cross-validation technique, as well as a clustering analysis of embeddings, where it is possible to identify clusters with distinctive visual characteristics and clusters where visually similar samples of rocks are attributed to different lithologies during the labeling process.
在本文中,我们考虑了钻井自动化过程中出现的一个问题,即从钻屑图像中识别岩性的自动化。通常,这项工作由经验丰富的地质学家完成,但这是一个繁琐且主观的过程。钻屑是岩层样本最便宜的来源;因此,可靠的岩性预测可以大大降低钻井过程中的分析成本。为了从岩屑样本图像中预测岩性含量,我们使用了卷积神经网络(CNN)。为了训练具有可接受泛化能力的模型,我们应用了数据集清理技术,这些技术有助于发现不良样本以及标签不确定的样本。结果表明,在清理后的数据集上训练的模型在准确性方面表现更好。数据清理使用了交叉验证技术以及嵌入的聚类分析,在聚类分析中,可以识别具有独特视觉特征的聚类以及在标注过程中视觉上相似的岩石样本被归为不同岩性的聚类。