We investigate systems to automatically interpret cement evaluation logs using supervised machine learning (ML). Such systems can provide instant rough interpretations that may then be used as a basis for human interpretation. Here, we compare the performance of two approaches, one previously published and one new. The previous approach is based on deep convolutional neural networks (CNNs) that autonomously learn to extract features from well log data, whereas the new approach uses feature engineering, in which we use our own domain knowledge to extract features.
We base this work on a data set of approximately 60 km of well log data. Specialist interpreters have classified these logs according to the bond quality (BQ; six ordinal classes) and hydraulic isolation (HI; two classes) of solids outside the casing. We train the ML systems to reproduce these reference interpretations in segments of 1 m in length. The CNNs directly receive log data as a collection of 2D images and 1D curves. In the feature-engineering approach, we combine the extracted features with various classifiers.
For BQ, the CNNs' interpretation exactly matches the reference 51.6% of the time. It does not miss by more than one class 88.5% of the time. For HI, the CNNs match the reference 86.7% of the time. The best-performing feature-based classifier, which is an ensemble of individual classifiers, provides better results of 57.4, 89.5, and 88.9%, respectively.
Our results indicate two main reasons why feature-based classifiers may perform particularly well on this task. First, there is some subjectivity inherent in the well log interpretations that are used to train and test ML systems. Second, well logs comprise many different and complex pieces of data. For these reasons, this data set may be particularly liable to overfitting. This may favor approaches based on feature engineering, where we apply our domain knowledge to extract a few pieces of essential information from the data instead of leaving the job of understanding the data to an ML system that may misinterpret spurious patterns as generalizable. It may also favor simpler classifiers with less overfitting capacity.
This paper shows how petroleum researchers and engineers can implement automatic interpretation systems for cement evaluation logs using ML methods that are easier to apply and deploy while also performing better than an approach based on autonomous feature extraction. This approach could also be adapted for automatic interpretation of other types of well log data.