Implementing a Drilling Reporting Data Mining Tool Using Natural Language Processing Sentiment Analysis Techniques
- Peter Kowalchuk (Halliburton)
- Document ID
- Society of Petroleum Engineers
- SPE Middle East Oil and Gas Show and Conference, 18-21 March, Manama, Bahrain
- Publication Date
- Document Type
- Conference Paper
- 2019. Society of Petroleum Engineers
- 6.1.5 Human Resources, Competence and Training, 1.6 Drilling Operations, 6 Health, Safety, Security, Environment and Social Responsibility, 7.6.4 Data Mining, 6.1 HSSE & Social Responsibility Management, 7.6 Information Management and Systems, 7 Management and Information
- ai, ddr, reports, analytics, optimization
- 6 in the last 30 days
- 7 since 2007
- Show more detail
- View rights & permissions
|SPE Member Price:||USD 8.50|
|SPE Non-Member Price:||USD 25.00|
Drilling operations generate much information, such as daily drilling reports and reports generated by service companies, support personnel, and other stakeholders. These reports can be unstructured with information presented in a variety of formats. The extraction of this information is frequently challenging, which limits its use in future projects. Natural language processing provides an efficient way of mining and obtaining knowledge. This paper demonstrates how these techniques were used to analyze vast amounts of historical documents to quickly rank well complexity and determine which aspects of drilling operations were most critical.
Sentiment analysis can be used to classify documents and other pieces of information into separate categories. In social media, it is used to analyze the collective perception of a given trending item. The technique was used here to classify wells into two ranked categorized lists. First, a classification listed wells by drilling issues. Second, a complexity ranking was defined so that each well could be classified as easy or difficult to drill. To build the sentiment analysis tool, a random set of training wells and their respective documents were selected. From these documents, a list of words was identified in what became known as highlighting sessions. During these sessions, subject matter experts (SMEs) classified words found in the documents. This "bag of words" was then used to train a classifier capable of ranking the wells related to the documents. A probability was associated to each well, providing a likelihood of inclusion in a given category.
The methodology proved to be successful, ranking drilling documents in both defined category sets. Results show that the list of ranked wells can be used by SMEs to identify which wells are relevant and deserve detailed analysis. The list generated for both categories provided a guideline for further analysis, particularly identifying wells with little value. Results also showed the importance of correctly developing a list of words, an adequate training set, and the language used, as well as the need for SMEs to produce the final analysis. The technology showed promising results with real-world applications being conceivable with its current level of maturity. However, the results also indicated room for improving its effectiveness by refining the highlighting sessions, word lists, types of classifier used, and final ranking methodology.
The use of methods and technology to help improve and enable the analysis of unstructured data in the drilling space should increase over time. This paper shows how current technology can already be used in practical real-life cases to produce tangible value.
|File Size||1 MB||Number of Pages||14|
Antoniak, M., Dalgliesh, J., Verkruyse, M. 2016. Natural Language Processing Techniques on Oil and Gas Drilling Data. Presented at the SPE Intelligent Energy International Conference and Exhibition, Aberdeen, Scotland, UK, 6-8 September. SPE-181015-MS. https://doi.org/10.2118/181015-MS.
Castiñeira, D., Toronyi, R., and Saleri, N. 2018. Machine Learning and Natural Language Processing for Automated Analysis of Drilling and Completion Data. Presented at the SPE Kingdom of Saudi Arabia Annual Technical Symposium and Exhibition, Dammam, Saudi Arabia, 23-26 April. SPE-192280-MS. https://doi.org/10.2118/192280-MS.
Fayzullin, T., Kowalchuk, P., Goebel, T. 2013. Using Real-Time Operations Interventions in a Drilling and Subsurface Collaborative Environment. Presented at the SPE/IADC Middle East Drilling Technology Conference & Exhibition, Dubai, UAE, 7-9 October. SPE-166695-MS. https://doi.org/10.2118/166695-MS.
Hoffimann, J., Mao, Y., Wesley, A. 2018. Sequence Mining and Pattern Analysis in Drilling Reports with Deep Natural Language Processing. Presented at the SPE Annual Technical Conference and Exhibition, Dallas, Texas, USA, 24-26 September. SPE-191505-MS. https://doi.org/10.2118/191505-MS.
Pang, B. and Lee, L. 2008. Opinion Mining and Sentiment Analysis. Foundations and Trends in Information Retrieval. Vol. 2, Nos. 1–2: 1–135. Hanover, Massachusetts, USA: Now Publishers, Inc. https://www.nowpublishers.com/article/DownloadSummary/INR-011 (accessed 11 January 2019).
Ursem, L-J., Williams, J.H., Pellerin, N.M. 2003. Real Time Operations Centers; The people aspects of Drilling Decision Making. Presented at the SPE/IADC Drilling Conference, Amsterdam, The Netherlands, 19-21 February. SPE-79893-MS. https://doi.org/10.2118/79893-MS.