Machine learning on Crays to optimise petrophysical workflows in oil and
gas exploration
- URL: http://arxiv.org/abs/2010.02087v1
- Date: Thu, 1 Oct 2020 10:14:58 GMT
- Title: Machine learning on Crays to optimise petrophysical workflows in oil and
gas exploration
- Authors: Nick Brown, Anna Roubickova, Ioanna Lampaki, Lucy MacGregor, Michelle
Ellis, Paola Vera de Newton
- Abstract summary: In this paper we present work done, in collaboration with Rock Solid Images (RSI), using supervised machine learning on a Cray XC30.
We describe the use of mathematical models that have been trained using raw well log data.
We explore how the predictions from these models compare against the interpretations of human petrophysicists.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The oil and gas industry is awash with sub-surface data, which is used to
characterize the rock and fluid properties beneath the seabed. This in turn
drives commercial decision making and exploration, but the industry currently
relies upon highly manual workflows when processing data. A key question is
whether this can be improved using machine learning to complement the
activities of petrophysicists searching for hydrocarbons. In this paper we
present work done, in collaboration with Rock Solid Images (RSI), using
supervised machine learning on a Cray XC30 to train models that streamline the
manual data interpretation process. With a general aim of decreasing the
petrophysical interpretation time down from over 7 days to 7 minutes, in this
paper we describe the use of mathematical models that have been trained using
raw well log data, for completing each of the four stages of a petrophysical
interpretation workflow, along with initial data cleaning. We explore how the
predictions from these models compare against the interpretations of human
petrophysicists, along with numerous options and techniques that were used to
optimise the prediction of our models. The power provided by modern
supercomputers such as Cray machines is crucial here, but some popular machine
learning framework are unable to take full advantage of modern HPC machines. As
such we will also explore the suitability of the machine learning tools we have
used, and describe steps we took to work round their limitations. The result of
this work is the ability, for the first time, to use machine learning for the
entire petrophysical workflow. Whilst there are numerous challenges,
limitations and caveats, we demonstrate that machine learning has an important
role to play in the processing of sub-surface data.
Related papers
- Machine Learning Operations: A Mapping Study [0.0]
This article discusses the issues that exist in several components of the MLOps pipeline.
A systematic mapping study is performed to identify the challenges that arise in the MLOps system categorized by different focus areas.
The main value of this work is it maps distinctive challenges in MLOps along with the recommended solutions outlined in our study.
arXiv Detail & Related papers (2024-09-28T17:17:40Z) - Any-point Trajectory Modeling for Policy Learning [64.23861308947852]
We introduce Any-point Trajectory Modeling (ATM) to predict future trajectories of arbitrary points within a video frame.
ATM outperforms strong video pre-training baselines by 80% on average.
We show effective transfer learning of manipulation skills from human videos and videos from a different robot morphology.
arXiv Detail & Related papers (2023-12-28T23:34:43Z) - Combatting Human Trafficking in the Cyberspace: A Natural Language
Processing-Based Methodology to Analyze the Language in Online Advertisements [55.2480439325792]
This project tackles the pressing issue of human trafficking in online C2C marketplaces through advanced Natural Language Processing (NLP) techniques.
We introduce a novel methodology for generating pseudo-labeled datasets with minimal supervision, serving as a rich resource for training state-of-the-art NLP models.
A key contribution is the implementation of an interpretability framework using Integrated Gradients, providing explainable insights crucial for law enforcement.
arXiv Detail & Related papers (2023-11-22T02:45:01Z) - Time Series Predictions in Unmonitored Sites: A Survey of Machine Learning Techniques in Water Resources [10.307058787085094]
Prediction of dynamic environmental variables in unmonitored sites remains a long-standing challenge for water resources science.
Modern machine learning methods increasingly outperform their process-based and empirical model counterparts for hydrologic time series prediction.
We review relevant state-of-the art applications of machine learning for streamflow, water quality, and other water resources prediction.
arXiv Detail & Related papers (2023-08-18T18:30:33Z) - Advancing Reacting Flow Simulations with Data-Driven Models [50.9598607067535]
Key to effective use of machine learning tools in multi-physics problems is to couple them to physical and computer models.
The present chapter reviews some of the open opportunities for the application of data-driven reduced-order modeling of combustion systems.
arXiv Detail & Related papers (2022-09-05T16:48:34Z) - AI for Porosity and Permeability Prediction from Geologic Core X-Ray
Micro-Tomography [0.0]
We propose to use self-supervised pretraining of the very small CNN-transformer-based model to predict the physical properties of the rocks.
We show that this technique prevents overfitting even for extremely small datasets.
arXiv Detail & Related papers (2022-05-26T06:55:03Z) - SOLIS -- The MLOps journey from data acquisition to actionable insights [62.997667081978825]
In this paper we present a unified deployment pipeline and freedom-to-operate approach that supports all requirements while using basic cross-platform tensor framework and script language engines.
This approach however does not supply the needed procedures and pipelines for the actual deployment of machine learning capabilities in real production grade systems.
arXiv Detail & Related papers (2021-12-22T14:45:37Z) - Automated Machine Learning Techniques for Data Streams [91.3755431537592]
This paper surveys the state-of-the-art open-source AutoML tools, applies them to data collected from streams, and measures how their performance changes over time.
The results show that off-the-shelf AutoML tools can provide satisfactory results but in the presence of concept drift, detection or adaptation techniques have to be applied to maintain the predictive accuracy over time.
arXiv Detail & Related papers (2021-06-14T11:42:46Z) - tf.data: A Machine Learning Data Processing Framework [0.4588028371034406]
Training machine learning models requires feeding input data for models to ingest.
We present tf.data, a framework for building and executing efficient input pipelines for machine learning jobs.
We demonstrate that input pipeline performance is critical to the end-to-end training time of state-of-the-art machine learning models.
arXiv Detail & Related papers (2021-01-28T17:16:46Z) - Towards CRISP-ML(Q): A Machine Learning Process Model with Quality
Assurance Methodology [53.063411515511056]
We propose a process model for the development of machine learning applications.
The first phase combines business and data understanding as data availability oftentimes affects the feasibility of the project.
The sixth phase covers state-of-the-art approaches for monitoring and maintenance of a machine learning applications.
arXiv Detail & Related papers (2020-03-11T08:25:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.