Machine learning approach for automated probabilistic well operation optimization

Abstract

A methodology for providing set-point recommendations in an automated manner to optimize the operation of a well-producing fluid, by establishing a live synergy between physics-based simulation and real-time field data, through the employment of machine learning models. The machine learning models serve two distinct purposes in this approach: 1. Accelerate emulation of the numerical physics-based simulation to enable real-time solutions 2. Provide a probabilistic estimate of the unknown operating conditions of a well and updating the estimate based on the response to the set-point changes made, thus improving with each iteration.

Claims

1. A method of using a neural network based solution to provide real-time set-point recommendations for an operating well, comprising: step 1: collecting data from a sensor feed; mapping the sensor feed data with a well-specific design metadata of the operating well, the well-specific design metadata selected from the list consisting of the depth and operating pressures of the gas lift valves, the performance curves of the ESPs, deviation survey data, well design, and completion parameters; cleaning up the sensor feed data through outlier detection; step 2: using both the sensor feed data and the well-specific metadata to derive features to use in a physics-based simulation; running the physics-based simulation with synthetic data randomized to produce a first set of simulation data; using the first set of simulation data to train a neural network; running the physics-based simulation with well-specific data for the operating well, to produce well-specific simulation data; retraining the neural network using the well-specific simulation data; using the retrained neural network to create a second set of simulation data; step 3: applying inverse modeling to the second set of simulation data to produce a probability distribution of unknown input states for the operating well; step 4: use a highly probable value of the unknown input states as an input to the physics-based simulation to produce an automated set-point recommendation; and implementing the recommendation at the well.

2. The method of claim 1, wherein the step of cleaning up the sensor feed data comprises identifying and labeling of an observed state that can interfere with the method of providing real-time set-point recommendations.

3. The method of claim 2, further comprising training of a supervised machine learning model to identify the observed state.

4. The method of claim 3, wherein the supervised machine learning model comprises at least one of the groups consisting of a random forest, a neural network model based on a sequence model and a support vector classifier.

5. The method of claim 2, further comprising training the supervised machine learning model to act upon the observed state.

6. The method of claim 1, further comprising following implementation of the recommendation, updating the inverse modeling by re-doing steps 1 through 4.

Description

BRIEF DESCRIPTION OF THE FIGURES

(1) FIG. 1 is a summary of the workflow involved in the implementation of the current inventive subject matter.

(2) FIG. 2 is an example of gas life performance curves for possible operating for a well period.

(3) FIG. 3 is Table 1, showing an example of the parametric distribution.

DETAILED DESCRIPTION

(4) Referring now to FIG. 1, the inventive process can be described as four steps that operate in a cyclical manner:

(5) Step 1—Data gathering and transformation: A first and crucial step for an automated process is setting up a system to collect data from the sensor feed recording the pressures, temperature and other relevant signals from the wells. The name of the well, the timestamp at which the sensor value is recorded is stored. The sensor data is mapped to the associated well metadata, some examples of which include the well design and completion parameters such as the depth and operating pressures of the gas lift valves, the performance curves of the ESPs. Subsequently, the data is cleaned up to identify and remove outliers. Further transformation of data is performed to get features out of the raw data. Some examples of such features include estimation of the operating gas lift valve, identification of flowing or shut-in state of well and recording such information as a boolean filter, estimation of oil, water production rates from tank sensors and estimation of gas-oil or gas-liquid ratios set intervals. Events or states of the well which need special treatment for example interference from other wells, compressor instabilities, and shut-ins are labeled. These labels are used to train supervised machine learning models to identify and act upon such special states. Examples of such supervised machine learning models may include random forests, neural network based sequence models or support vector classifiers. Models are also developed to identify changes in set-point based on thresholds and isolate periods prior and posterior to such a change for evaluation.

(6) Step 2—Simulation: This step is explained with an example simulation setup for an optimization process for a gas lift operation. The objective of gas lift optimization is to maximize the oil production rate increments per marginal increment in the gas injection rate. FIG. 2 shows a gas lift performance curves generated for example cases resulting from a combination of parameters.

(7) Step 3—Inverse modeling: Simulation generates non-unique solutions, there can be several combinations of simulation inputs which provide the same output. In order to identify the most likely operating state of the input parameters, a probabilistic approach is used. Based on the transition of matching simulation cases producing the oil rate output at a given state, prior to and after a setpoint change, the probability of a given simulation case to be representative of the real well is learned and updated. The probability distribution of unknown states transitions over time as the well and reservoir transitions, but these can only transition to proximate states. A state space model is used to determine the probability distribution of unknown states at time t[0] based on the corresponding distribution at times t[−1], t[−2] . . . t[−n] in conjunction with the matching simulation cases. The learning improves more significantly as the well is perturbed changing its operating state which occurs during past changes in set points, as well as ongoing changes from implementing recommendations.

(8) Step 4—Recommendation: Based on the probability assigned to simulation cases from the inverse modeling step, the expected economic value of a set point change in either direction or by holding at the current state is assessed. The operating costs and the revenue generated at various set points are inputs to this model. Based on this, the model generates a recommendation to either change or to hold the current operating setpoint. This recommendation is communicated to the well site and/or the operator through a cloud-based well monitoring platform. The implementation can be done at the well site or through a remote controller triggered by manual input, or by any automated or manual control device.

(9) FIG. 2 depicts an example of gas life performance curves for possible operating for a well period.

(10) Estimating the optimum set point of gas injection using a physics-based model involves developing simulations which take various input parameters along with the gas injection rate and predicts the oil production rate. The input parameters can be either measured or assumed. To provide a recommendation for a certain well, a lookback period is used to measure the statistics of each parameter to account for the variance.

(11) The bottom-hole parameters usually are not measured for months at a time and have a greater degree of uncertainty, hence a wider or full range of possibility is considered for them, with each time-step, the degree of uncertainty of the parameter goes down. For the parameters which can be measured the data recorded by the sensors is captured and cleaned to remove outliers. Here is a sample (hypothetical) input matrix for one such run. To start with, a uniform distribution is used for each of these parameters.

(12) Additional to these time-dependent input parameters shown in Table 1 of FIG. 3, there are parameters which are fixed for a life of a well, such as the deviation survey data of the well. For the purposes of consistency, the deviation survey data has been translated and simplified into 6 points: 3 True vertical depth points at 0.70 and 90-degree inclination respectively of the well and their corresponding measured depths. Combined with the deviation survey, we have 15 input parameters per well per time period. For such an evaluation the total number of simulations to be run is 67.5 Million. It takes years to run this quantity of simulation per recommendation per well. To circumvent this problem, a neural network based solution is used for achieving high-speed computation with minimal loss of accuracy.

(13) The rate of physics-based simulation using a commercial wellbore fluid flow simulator has been observed to be between 12,000-50,000 simulations per day. At this rate, it takes years to generate recommendations. Also, considering the variation in deviation data of wells, the simulation data runs for one well may or may not be applicable for another well. Hence, a neural network-based approach was selected to solve this problem. The neural network is trained on randomly generated data samples in the order of 200,000-300,00 cases which can be generated in 15-20 days. The neural network is trained on the distribution of randomly generated synthetic well-deviations. This base neural network model can be a biased model as it tries to minimize the error across all cases. Such a model trained on randomly generated synthetic well data is observed to have an R-squared value of 0.8-0.96 when tested on simulations generated for real wells. A transfer learning protocol is used to generate well-specific models with high accuracy of R-squared which is consistently in the 0.950-0.999 zone. This is achieved by generating a relatively smaller sample of well-specific simulations in the order of 500-5000 and retraining the final layer of the neural network while freezing the other layers at the time of retraining. The Well specific neural network model achieved through transfer learning can generate the results for 1 Million simulations in less than 10 seconds while the physics-based simulator will take between 20-80 days for the same. This is what makes it possible to have real-time optimization recommendations for thousands of wells while incorporating the effects of uncertainty, variance, and transition.

(14) The system follows a closed loop after a change is made to the setpoint, the response to the change is assessed using the field data sensor stream and data processing technique as described in step 1. The model updates itself on a platform by following steps 1 through 4. Examples of such a platform could be cloud-based, a fully connected on-premises platform or an edge device.

(15) As used herein, and unless the context dictates otherwise, the term “coupled to” is intended to include both direct coupling (in which two elements that are coupled to each other contact each other) and indirect coupling (in which at least one additional element is located between the two elements). Therefore, the terms “coupled to” and “coupled with” are used synonymously.

(16) It should be apparent to those skilled in the art that many more modifications besides those already described are possible without departing from the inventive concepts herein. The inventive subject matter, therefore, is not to be restricted except in the spirit of the appended claims. Moreover, in interpreting both the specification and the claims, all terms should be interpreted in the broadest possible manner consistent with the context. In particular, the terms “comprises” and “comprising” should be interpreted as referring to elements, components, or steps in a non-exclusive manner, indicating that the referenced elements, components, or steps may be present, or utilized, or combined with other elements, components, or steps that are not expressly referenced. Where the specification claims refers to at least one of something selected from the group consisting of A, B, C . . . and N, the text should be interpreted as requiring only one element from the group, not A plus N, or B plus N, etc.

Machine learning approach for automated probabilistic well operation optimization

Assignee

Inventors

Cpc classification

Classification Explorer

G06F2113/08

PHYSICS

Classification Explorer

G06N3/096

PHYSICS

Classification Explorer

G06N3/08

PHYSICS

Classification Explorer

E21B2200/22

FIXED CONSTRUCTIONS

Classification Explorer

G01V99/005

PHYSICS

Classification Explorer

G01V2210/72

PHYSICS

Classification Explorer

G06N3/09

PHYSICS

Classification Explorer

G06F30/27

PHYSICS

Classification Explorer

G06N20/10

PHYSICS

Classification Explorer

E21B2200/20

FIXED CONSTRUCTIONS

Classification Explorer

E21B43/00

FIXED CONSTRUCTIONS

Classification Explorer

G06N5/01

PHYSICS

Classification Explorer

G06N20/20

PHYSICS

Classification Explorer

E21B47/10

FIXED CONSTRUCTIONS

Classification Explorer

G06F30/28

PHYSICS

International classification

Classification Explorer

G01V99/00

PHYSICS

Classification Explorer

E21B47/10

FIXED CONSTRUCTIONS

Classification Explorer

G06N3/08

PHYSICS

Classification Explorer

G06F30/27

PHYSICS

Classification Explorer

G06F30/28

PHYSICS

Abstract

Claims

Description