Prediction of Reservoir Fluid Properties from Mud-Gas Data
20220163503 · 2022-05-26
Inventors
- TAO YANG (STAVANGER, NO)
- Ibnu Hafidz Arief (Oslo, NO)
- Martin Niemann (Oslo, NO)
- Thibault Forest (Sandnes, NO)
- Knut Kristian Meisingset (Hafrsfjord, NO)
Cpc classification
E21B49/005
FIXED CONSTRUCTIONS
G01V9/007
PHYSICS
International classification
Abstract
The present disclosure relates to techniques for prediction of reservoir fluid properties of a hydrocarbon reservoir fluid, such as the density, the saturation pressure, the formation volume factor and the gas-oil ratio of the reservoir fluid. To predict the reservoir fluid properties, a model is generated by selecting a subset of available reservoir samples based on a degree of biodegradation of the samples, generating an input data set comprising input data and target data, the input data comprising measured or predicted mud-gas data; and generating a model using the input data. The application of this technique allows a continuous log of the selected property to be generated using mud-gas data collected during the well drilling process.
Claims
1. A method of generating a model for predicting at least one property of a fluid at a sample location within a hydrocarbon reservoir, comprising: providing an initial data set relating to a first plurality of sample locations, the initial data set comprising reservoir fluid properties data for each sample location; selecting a second plurality of sample locations as a subset of the first plurality sample locations, the selection being based on a degree of biodegradation of a fluid at the sample location; generating an input data set comprising input data and target data, the input data comprising mud-gas data, or a derivative thereof, for the second plurality of sample locations, and the target data comprising the at least one property of the hydrocarbon reservoir for each of the second plurality of sample locations; and generating a model using the input data set such that the model can be used to predict the at least one property of the fluid at the sample location based on measured mud-gas data for the sample location.
2. A method according to claim 1, wherein the at least one property comprises a gas-oil ratio of the fluid at the sample location.
3. A method according to claim 1, wherein the at least one property comprises a density of the fluid at the sample location.
4. A method according to claim 1, wherein the at least one property comprises a saturation pressure of the fluid at the sample location.
5. A method according to claim 1, wherein the at least one property comprises a formation volume factor of the fluid at the sample location.
6. A method according to claim 1, wherein the at least one property comprises a concentration of a C.sub.7+ hydrocarbon within the fluid at the sample location.
7. A method according to claim 1, wherein the mud-gas data is indicative of a concentration of C.sub.1 to C.sub.5 hydrocarbon gases at the sample location.
8. A method according to claim 1, wherein the input data further comprises an oil-related property of the fluid at the sample location.
9. A method according to claim 1, wherein generating the model comprises: training a machine learning algorithm with a first subset of the training data set; and testing the machine learning algorithm with a second, disjoint subset of the training data set.
10. A method according to claim 9, wherein the input data of the first subset comprises estimated mud-gas data, or a derivative thereof, and wherein the input data of the second subset comprises measured mud-gas data, or a derivative thereof.
11. A computer-based model for predicting at least one property of a fluid at a sample location within a hydrocarbon reservoir based on measured mud-gas data for that sample location, the computer-based model having been generated by a method according claim 1.
12. A tangible computer-readable medium storing the computer-based model of claim 11.
13. A method of predicting a value of a fluid property of a fluid at a sample location within a hydrocarbon reservoir, the method comprising: providing measured mud-gas data for the sample location; and predicting the values of a fluid property of the fluid at the sample location by supplying the measured mud-gas data to a model according to claim 11.
14. A method according to claim 13, further comprising: determining a quality for the measured mud-gas data; and providing an indication of confidence associated with the predicted value of the fluid property, wherein the indication of confidence is based on the quality of the measured mud-gas data.
15. A method according to claim 14, wherein confidence is reduced by one or more of: fluctuations of a component concentration greater than a threshold amplitude within a predetermined depth range; the missing of a predetermined number of preceding measurements; a C.sub.1, C.sub.4 or C.sub.5 concentration below a respective predetermined threshold.
16. A method according to claim 13, further comprising: determining whether the fluid has been biodegraded by comparison of a ratio of i-C.sub.4/n-C.sub.4 from the measured mud-gas data to a predetermined threshold and/or a ratio of i-C.sub.5/n-C.sub.5 from the measured mud-gas data to a predetermined threshold; and providing an indication of confidence associated with the predicted value of the fluid property, wherein the indication of confidence is based on whether the fluid has been biodegraded.
17. A method according to claim 13, wherein providing the measured mud-gas data comprises: processing a drilling fluid used whilst drilling through the sample location to release hydrocarbon gases from the drilling fluid; and analysing the released hydrocarbon gases to determine their composition, wherein the measured mud-gas data indicates the concentration of the released hydrocarbon gases.
18. A method of predicting a value of a fluid property of a fluid along a length of a well through a hydrocarbon reservoir, the method comprising: predicting a value of a fluid property of a fluid at a plurality of sample locations along a length of a well using a method according to claim 13.
19. A method according to claim 18, further comprising: sampling a fluid at a sample location along a length of a well; determining a value of the fluid property of the fluid at the sample location, based on the sampling of the fluid; and adjusting the predicted values of the fluid property at one or more of the sample locations based on the determined value of the fluid property.
20. A method according to claim 18, further comprising: displaying, using an electronic display screen, a graph plotting one or both of the predicting value and the adjusted predicted value of the fluid property against a location of the respective sample location for each of the plurality of sample locations along the length of the well.
Description
[0060] Certain preferred embodiments of the present disclosure will now be described in greater detail, by way of example only and with reference to the accompanying drawings, in which:
[0061]
[0062]
[0063]
[0064]
[0065]
[0066]
[0067]
[0068]
[0069]
[0070]
[0071]
[0072]
[0073]
[0074] Drilling fluid is a fluid used to aid the drilling of boreholes into the earth. The main functions of drilling fluid include providing hydrostatic pressure to prevent formation fluids from entering into the well bore, keeping the drill bit cool and clean during drilling, carrying out drill cuttings, and suspending the drill cuttings while drilling is paused and when the drilling assembly is brought in and out of the hole.
[0075] Drilling fluids are broadly categorised into water-based drilling fluid, non-aqueous drilling fluid, often referred to as oil-based drilling fluid, and gaseous drilling fluid. The present disclosure is particularly applicable to liquid drilling fluid, i.e. water-based drilling fluid or non-aqueous drilling fluid, which are commonly referred to as “drilling mud”.
[0076] Mud-gas logging entails gathering data from hydrocarbon gas detectors that record the levels of gases brought up to the surface in the drilling mud during a bore drilling operation.
[0077] Conventional mud-gas logging is used to identify the location of oil and gas zones as they are penetrated, which can be identified by the presence of gas in the mud system. This may be used to provide a general indication of the type of reservoir, as well as to determine where to take downhole fluid samples for more detailed analysis of the fluid composition.
[0078] The presence of hydrocarbon gas may be detected, for example, with a total gas detector.
[0079] Once the presence of hydrocarbon gas is detected, its composition may be examined for example with a gas chromatograph. The detection of the composition of the mud gas described below is sometimes referred to as “advanced mud-gas logging”.
[0080] The most common gas component present is usually methane (C.sub.1). The presence of heavier hydrocarbons, such as C.sub.2 (ethane), C.sub.3 (propane), C.sub.4 (butane) and C.sub.5 (pentane) may indicate an oil or a “wet” gas zone. Even heavier molecules, up to about C.sub.7 (heptane) or C.sub.8 (octane), may also be detectable, but are typically present only in very low concentrations. Consequently, the concentrations of these hydrocarbons are often not recorded. The composition of the mud gas can be examined in order to provide predictions of the C.sub.1 to C.sub.5 concentrations within the reservoir fluid.
[0081] The measured mud-gas data is usually referred to as “raw” mud-gas data and is not comparable to the actual composition of the reservoir, since the mud gas contains gases that do not originate from the reservoir (e.g. gases present in the drilling mud or remaining from previous injection when recycling the drilling mud) and also because lighter hydrocarbon (e.g. C.sub.1) are carried more easily by the drilling mud than heavier hydrocarbons (e.g. C.sub.2 to
[0082] Firstly, a recycling correction is made to eliminate contamination of the gas by gases originating from previous injections of the drilling mud. This correction is applied based a separate mud-gas measurement that was taken before the drilling mud was injected into the drilling string.
[0083] Secondly, an extraction efficiency correction step is applied to increase the concentration of intermediate components (from 0.sub.2 to such that the mud-gas data after this step closely resembles a corresponding reservoir fluid sample composition.
[0084] The mud-gas-data after hydrocarbon recycling correction and extraction efficiency correction is usually referred to as “fully corrected” mud-gas data.
[0085] Alternatively, or additionally, the mud gas composition may be examined to estimate certain fluid properties of the reservoir fluid. For example, broad, empirical correlations exist to correlate the mud-gas composition against certain fluid properties of the reservoir fluid. For example, extremely dry gas reservoirs should give mud gas that comprises mostly C.sub.1 and not much C.sub.2+, e.g. with each of the C.sub.1/C.sub.2, C.sub.1/C.sub.3, C.sub.1/C.sub.4 and C.sub.1/C.sub.5 ratios (for the raw mud-gas data) being greater than 50. Wet gas reservoirs will often have ratios between 20 and 50, and oil reservoirs will have ratios between 2 and 20.
[0086] Whilst the above correlations broadly indicate the type of reservoir, it has previously not been possible to use the mud gas composition to provide precise predictions of the reservoir fluid properties.
[0087] The following technique allows for the accurate determination of certain properties of the reservoir fluid that are influenced by the oil composition of the reservoir fluid, based on the mud-gas data. Such reservoir fluid properties include the gas-oil ratio, the density of the reservoir fluid, the saturation pressure of the reservoir fluid, and the formation volume factor of the reservoir fluid.
[0088] An exemplary mud-gas analysis tool 20 is shown schematically in
[0089] The tool 20 is coupled to a flow line 10 containing drilling mud returned from a borehole of a well. As discussed above, the drilling mud may be water-based mud or oil-based mud.
[0090] The tool 20 comprises a sampling probe 22 disposed with respect to the flow line 10 so as to collect a sample 24 of the drilling mud from the flow line 10. The drilling mud sample 24 is preferably a continuous sample, i.e. such that a portion of the flow of drilling mud within the flow line 10 is diverted through the mud-gas analysis tool 20.
[0091] The drilling mud sample 24 is supplied to a gas-separation chamber 26 where at least a portion of the gas carried by the drilling mud is released. The sample of drilling mud may be heated by a heater 28 upstream of the gas-separation chamber 26. Heating the drilling mud sample 24 helps to release the gas from the drilling mud sample 24. Typically, the mud sample 24 is heated to a temperature of around 80° C. to 90° C.
[0092] The released gas 30 is directed from the separation chamber 26 to a gas analysis unit (not shown), while the degassed mud 32 is returned to the flow line 10 or to another location for re-use.
[0093] The gas analyser may comprise a total gas detector, which may provide a basic quantitative indication as to how much gas is being extracted from the drilling mud by the tool 20. Total gas detection typically incorporates either a catalytic filament detector, also called a hotwire detector, or a hydrogen flame ionization detector.
[0094] A catalytic filament detector operates on the principle of catalytic combustion of hydrocarbons in the presence of a heated platinum wire at gas concentration below the lower explosive limit. The increasing heat due to combustion causes a corresponding increase in the resistance of the platinum wire filament. This resistance increase may be measured through the use of a Wheatstone bridge or equivalent detection circuit.
[0095] A hydrogen flame ionization detector functions on the principle of hydrocarbon molecule ionization in the presence of a very hot hydrogen flame. These ions are subjected to a strong electrical field resulting in a measurable current flow.
[0096] The gas analysis device may additionally or alternatively comprise an apparatus for detailed analysis of the hydrocarbon mixture. This analysis is usually performed by a gas chromatograph. However, several other detecting devices may also be utilised including a mass spectrometer, an infrared analyser or a thermal conductivity analyser.
[0097] A gas chromatograph is a rapid sampling, batch processing instrument that provides a proportional analysis of a series of hydrocarbons. Gas chromatographs can be configured to separate almost any suite of gases, but typically oilfield chromatographs are designed to separate the paraffin series of hydrocarbons from methane (C.sub.1) through pentane (C.sub.5) at room temperature, using air as a carrier. The chromatograph will report (in units or in mole percent) the quantity of each component of the gas detected.
[0098] A carrier gas stream 34, commonly comprising air, may be supplied to the separation chamber 26 and mixed with the released gas 30 to form a gas mixture 36 that is supplied to the gas analysis unit. The carrier gas stream 34 provides a continuous flow of carrier gas in order to provide a substantially continuous flow rate of the gas mixture 36 from separation chamber 26 to the gas analysis unit. Additionally, in the case of a gas analyser comprising a combustor, the use of air as the carrier gas may provide the necessary oxygen for combustion.
[0099] In some arrangements, the tool 20 may be configured to detect and/or remove H.sub.2S from the gas to prevent adverse effects that could influence hydrocarbon detection.
[0100] In some embodiments, non-combustibles gases, such as helium, carbon dioxide and nitrogen, can be detected by the gas analyser in conjunction with the logging of hydrocarbons.
[0101]
[0102] Three fluid samples 48, 50, 52 have been taken from the well site 40.
[0103] The first sample 48 is a mud-gas sample taken from the first well 44. That is to say, the first sample comprises measurements of the concentration of C.sub.1 to C.sub.5 gases released by the water-based drilling mud.
[0104] The second sample 50 is a mud-gas sample taken from the second well 46. That is to say, the second sample comprises measurements of the concentration of C.sub.1 to C.sub.5 gases released by the oil-based drilling mud.
[0105] The third sample 52 is a wireline sample, taken directly from the reservoir 42.
[0106]
[0107] It should be noted that whilst the third sample 52 also comprises data relating to the oil content of reservoir, the ratios composing
[0108] It will be apparent from
[0109] Various, empirically derived relationships exist that can be used to correct the proportional compositions of the mud gas in order to estimate the corresponding proportional compositions of the gases within the reservoir.
[0110]
[0111] As can be seen from
[0112] The following technique seeks to utilise a machine learning algorithm to produce a model that accurately estimates certain properties relating to the reservoir fluid, in particular the gas-oil ratio and the density of the reservoir fluid, based on the mud-gas data.
[0113] The general workflow for predicting the gas-oil ratio from mud-gas data comprises preparing and quality controlling a database containing reservoir sample data and mud-gas data, and training a machine learning model using the processed reservoir sample data from the database to predict the gas-oil ratio.
[0114]
[0115] In the following example, an input data set 102 comprising data relating to 1153 reservoir samples was prepared. This input data set 102 was generated by pre-processing a larger, initial data set to generate suitable data for training the machine-learning algorithm.
[0116] The initial data set in this example comprises reservoir fluid properties data from more than 2000 reservoir fluid samples. Reservoir samples may be obtained, for example, by downhole fluid sampling. However, other techniques could also be used, for example by taking a sample of well fluid after the well has been completed.
[0117] Most of the reservoir fluid samples used in this analysis were from the Norwegian Continental Shelf (NCS), while the remaining samples were from international sites operated by Equinor®. The majority of the reservoir properties data from the NCS is public and can be accessed from oil field data published by the Norwegian Petroleum Directorate (NPD).
[0118] The reservoir fluid properties data should include at least hydrocarbon composition data, which may be either in the form of direct measurements of the concentration of each hydrocarbon component within the sample, typically covering C.sub.1 to C.sub.36+ hydrocarbons. In some embodiments, the concentration data may be in the form of relative data (e.g. as a ratio of compositions of different hydrocarbons) or may be otherwise normalised. The reservoir fluid properties data may optionally also include concentrations of one or more other constituents within the well.
[0119] The reservoir fluid properties data may include one or more derived properties of the reservoir fluid sample. Such derived properties may include the target property to be determined by the machine-learning algorithm, e.g. a gas-oil ratio in this case. Other derived properties may include a density of the fluid.
[0120] The reservoir fluid properties data is sometimes referred to as PVT data, as it is commonly obtained in a pressure-volume-temperature (PVT) laboratory, where researchers will employ various instruments to determine reservoir fluid behaviour and properties from the reservoir samples.
[0121] The initial data set for the machine learning algorithm 100 was first screened to remove reservoir samples that might skew the machine learning algorithm. This was performed using various quality control algorithms, which were applied either manually or algorithmically. Early attempts to generate models based on all of the reservoir samples within the initial data set, i.e. without this step, were found to be significantly less accurate in their predictions of the gas-oil ratio.
[0122] A first screening process was applied in order to identify and remove samples where the reservoir fluid properties data was not suitable for training the machine learning algorithm. The first screening process comprised a two-stage process.
[0123] The present inventors chose to examine single-flash gas-oil measurements, since these were the most commonly reported gas-oil measurement (1734 of the samples in the initial data set) and have standard reference conditions. Thus, any samples for which a single-flash gas-oil measurement was not available were removed.
[0124] Next, outliers were removed. To do this, a visual inspection and a manual verification process were carried out. Additionally, a target filter was applied which removed samples showing evidence of oil-based mud contamination, and also data with gas-oil ratio less than 10 Sm.sup.3/Sm.sup.3 or higher than 20 000 Sm.sup.3/Sm.sup.3.
[0125] After the first screening process, 1383 samples remained.
[0126] In some embodiments, the first screening process may be omitted.
[0127] A second screening process was then carried out comprising identifying and removing samples where biodegradation has occurred.
[0128] Under certain conditions, living microorganisms (primarily bacteria, but also yeasts, moulds, and filamentous fungi) can metabolize various hydrocarbons within a reservoir. This process is known as biodegradation and dramatically affects the composition and properties of the fluid. Different classes of hydrocarbon have different susceptibilities to biodegradation. The early stages of biodegradation are characterized by the loss of n-alkanes, followed by loss of acyclic isoprenoids. The remaining compound classes (e.g., highly branched and cyclic saturated hydrocarbons as well as aromatic compounds) are most resistant to biodegradation, but will also eventually be consumed.
[0129] The inventors have identified that the reservoir samples from biodegraded reservoirs consistently represent a ‘branch’ which behaves different than the trends from the majority of non-biodegraded reservoir samples. In particular, the normalized C.sub.1 to C.sub.5 composition of a biodegraded reservoir appears similar to that of a lean gas reservoir, but has a much higher gas-oil ratio. As mud gas is indicative of only the concentrations of these lighter hydrocarbons, biodegraded reservoir samples were found to cause significant errors in the prediction of gas-oil ratio by machine learning models.
[0130] Whilst biodegradation is a well-recognised phenomenon, its effects on mud-gas data had not previously been recognised. After excluding samples containing biodegraded oils from the reservoir fluid database, the remaining fluid samples were found to demonstrate a much stronger consistency.
[0131] Biodegradation of a reservoir sample can be identified either by a direct measurement for the microorganisms, or by analysis of the hydrocarbon composition of the reservoir fluid. Samples showing above a threshold level of biodegradation were removed from the training data set. In this example, any sample showing detectable biodegradation was excluded from the input data set 102. However, it will be appreciated that a permissible level of biodegradation may be appropriate in some situations, or where only a limited number of samples is available.
[0132] In the present example, approximately 20% of samples were removed during the second screening, leaving the 1153 “good” samples.
[0133] In some embodiments, the second screening process may be omitted.
[0134] Application of the above two types of screening was found to significantly improve the accuracy of the model generated by the machine-learning algorithm.
[0135] Next, the input data set 102 was generated from the initial data set for the samples that passed the screening.
[0136] The input data set 102 comprises reservoir fluid properties data for each sample. However, measured mud-gas data was not available for each sample. The inventors therefore investigated how closely the C.sub.1-C.sub.5 composition predicted by correction of the measured mud-gas data, where available, correlated to the actual C.sub.1-C.sub.5 composition of the corresponding reservoir fluid sample.
[0137] The analysis indicated that there were no significant distributional differences between the composition of corrected mud-gas data and composition of the reservoir fluid properties data. When considering the mud-gas data and the reservoir fluid properties data at the level of independent observations, good correlation was also present. Several outliers were present. However, these cases were each identified as samples having low quality mud-gas data, as will be discussed in greater detail later.
[0138] This supports the idea that a machine learning model can be built based on the reservoir fluid properties data, which could then be used to predict a gas-oil ratio from measured mud-gas data.
[0139] The measured mud-gas data comprises measured hydrocarbon composition data for gas released from the drilling fluid from the sample location.
[0140] It will be appreciated that there is a lag-time between the drill bit passing through the sample location, and when the mud reaches the surface and is analysed. However, workers in this field will be familiar with the procedures for calculating the lag time to determine the depth to which the mud-gas sample corresponds. Therefore, this will not be discussed in detail.
[0141] The composition data for the mud-gas preferably comprises data for at least C.sub.1 to C.sub.4 hydrocarbons, and preferably at least C.sub.1 to C.sub.5 hydrocarbons (as is the case in the present example). In some cases, concentrations for up to C.sub.7 or greater hydrocarbons may be included.
[0142] The composition data may be stored either as a direct measurement of concentration (e.g. measured in ppm or similar units), or alternatively as a relative concentration (e.g. as a proportion of another hydrocarbon, usually C.sub.1). In some embodiments, the composition data may be normalised.
[0143] The mud-gas data may comprise data relating to the drilling operation, such as the type of drilling fluid used. Alternatively, or additionally, the composition data may be stored in an adjusted form, e.g. having been pre-corrected to give equivalent estimations of the gas composition of the sample itself.
[0144] The input data set 102 comprises target data and input data for each sample that passed the screening. The target data corresponds to the desired output of the model. The input data corresponds to the data that will be input into the eventual model.
[0145] The target data in this example is a gas-oil ratio, and in this example is the single-flash gas-oil measurement of the sample. As discussed above, this data was stored as part of the reservoir properties data within the initial data set. Alternatively, other measurements of gas-oil ratio may be used, or a gas-oil ratio may be derived from the reservoir composition data, i.e. based on the concentrations of the various hydrocarbons.
[0146] The input data is mud-gas data, i.e. data indicative of the composition of gases released from the drilling fluid from the sample location. Preferably, the data used is corrected mud-gas data, as discussed previously, i.e. such that the model is independent of factors such as the type of drilling mud used.
[0147] The input data may be the measured mud-gas data for the sample. However, as discussed above, this was not available in many cases and it has been found that corrected measured mud-gas data corresponds closely to the respective hydrocarbon composition of the reservoir properties data for the sample. Thus, the input data may use predicted data, i.e. predicted, corrected mud-gas data, which corresponds to the concentrations of the lighter hydrocarbons from the reservoir properties data.
[0148] In this example, the input data for each sample comprised the concentrations of C.sub.1 to C.sub.5 hydrocarbons from the reservoir fluid properties data for the sample.
[0149] It should be noted that the composition data may be provided in any suitable form, such as a direct measurement of concentration, a relative concentration, or a normalised concentration.
[0150] Next, a model generation is performed, in which a model is generated and validated based on the input data set 102.
[0151] The input data set 102 is first divided into a training data set 104 and a testing data set 106. The input data set 102 is preferably curated such that at least the testing data set 106 contains data that spans the various classes of the input data set 102 as a whole (e.g. dry gas reservoirs, wet gas reservoirs, oil reservoirs).
[0152] Typically, at least 50% of the input data set 102 should be used for training, and at least 10% of the input data set 102 should be used for testing. Common ratios include 50:50, 70:30, 75:25, 80:20, 90:10. However, it will be appreciated that other divisions may be used instead.
[0153] Generally the larger the training data set, the more accurate the model will be. However, if too small a test data set is used (or indeed if no test data set is used) then it is not possible to confidently verify the accuracy of the model, e.g. making it difficult to detect an over-fitted model (only accurate for the specific training data).
[0154] In this example, all samples from wells that did not have measured mud-gas data were used to form the training data set 104, and the samples with measured mud-gas data were used to form the testing data set 106. As a result, the training data set 104 included about 80% of the input data set 102 (i.e. 922 samples) and the testing data set 106 included the other 20% of the input data set 102 (i.e. 231 samples).
[0155] To generate a model, a machine learning algorithm is provided with the training data set 104, and a set of training parameters to control the machine learning algorithm.
[0156] The inventors identified that a Gaussian Process algorithm was the most accurate model, followed by Universal Kriging, Random Forest, KMean and Elastic Net. However, it will be appreciated that any suitable algorithm may be used. Those operating within this field will be familiar with the procedures for selecting and utilising a machine learning algorithm. Therefore, this will not be discussed in detail.
[0157] Model validation 108, e.g. cross-validation, may then then be performed. During the model validation 108, the model is tested to determine how well it predicts new data that was not used in estimating the model, in order to flag problems such as over fitting or selection bias. Model validation 108 is an optional step.
[0158] Cross-validation involves partitioning the training data set 104 into complementary subsets, performing the model fitting using one subset of the training data set 104, and validating the analysis on the other subset of the training data set 104. To reduce variability, most methods use multiple rounds of cross-validation, performed using different partitions, and the validation results are combined (e.g. averaged) over the rounds to give an estimate of the model's predictive performance (e.g. a mean average prediction error, MAPE).
[0159] In this example K-fold cross-validation, and particularly 4-fold cross-validation is used. In K-fold cross-validation, the training data 104 is separated in K disjoint subsets (in this case, four), known as “folds”. Then, cross-validation is performed by training the model on all of the data except for one fold, and validating the trained model using the fold that was not used for training. The best model is then selected as the model having the best predictive performance, e.g. the lowest MAPE.
[0160] A first testing step 110 is then performed, in which the model is tested using the training data set 104 as a whole. The results of the first testing step 110 is shown in
[0161] A second testing step 112 is then performed, in which the model is tested using the test data set 106. As discussed previously, this is a curated set of data that is broadly representative of the data as a whole, and was not used during the generation of the model.
[0162] The results of the second testing step 112 is shown in
[0163] The model has been found to predict a gas-oil ratio of the reservoir fluid based on C.sub.1 to C.sub.5 mud-gas data with MAPE of about 36% for the screened test data. This is acceptable for predictions made during the drilling phase, as such predictions were not previously possible.
[0164] It was further found that the gas oil ratio prediction accuracy could be improved significantly if any additional oil-related information is available in the drilling phase or wireline operations. For example, a second model for gas-oil ratio prediction was generated in a pressure gradient (indicative of fluid density) was added as an additional parameter of the input data.
[0165] The results of the application of the second model to a testing data set are shown in
[0166] Understanding the quality of the measured mud-gas data is important before performing a fluid property (e.g. gas-oil ratio) prediction because the mud-gas data quality will significantly impact prediction accuracy. The following characteristics of the mud-gas data values have been identified as indicating low quality or unreliable data: [0167] Large fluctuations of a component within a small depth range. [0168] First observations after missing measurements. [0169] C.sub.1 content below a given threshold. [0170] C.sub.4 or C.sub.5 content below a given threshold.
[0171] To quantify the quality of the mud-gas data, the inventors derived a quality control metric (QC metric) which ranged from 0 to 1. High quality mud-gas data would have QC metric value close to 1. If one or more of the above factors are found, then the QC metric would be reduced. Low-quality mud-gas data was indicated by QC metric close to 0. A single numeric quality measure between 0 and 1 can be plotted side-by-side with a predicted fluid property log (as will be discussed below) to visualize the confidence level associated with each prediction, based on mud-gas data quality.
[0172]
[0173] As can be seen, the samples having a higher QC metric correspond closely, whilst samples having a lower QC metric have poor correspondence. Thus, these factors provide a useful indication of the accuracy of a prediction of the gas-oil ratio.
[0174] As discussed above, different classes of hydrocarbon have different susceptibilities to biodegradation. In particular, n-alkanes are more susceptible to biodegradation than i-alkanes. C.sub.4 and C.sub.5 hydrocarbons are present in gas phase as both butane and isobutene and pentane and isopentane, respectively, which can be separately identified by the mud-gas logging tool 20. Thus, by examination of the i-C.sub.4/n-C.sub.4 and i-C.sub.5/n-C.sub.5.ratio, it is possible to estimate whether biodegradation has occurred (with a high i-C.sub.4/n-C.sub.4 ratio or high and i-C.sub.5/n-C.sub.5 ratio indicating likely biodegradation).
[0175] This may be included either as part of the QC metric analysis, or may be provided separately.
[0176] Mud-gas data is generated continuously during the drilling process. Therefore, by applying the machine learning model to the mud-gas data, it is possible to provide, at an early stage of the well installation procedure, a continuous log for the well bore of the predicted reservoir property, e.g. gas-oil ratio or fluid density. This is something that has not been possible previously until much later in the process.
[0177]
[0178] Whilst fluid property logs generated based on mud-gas data alone are useful, particularly because they will be available at an early stage of the proceedings, it is desirable to further increase the accuracy of these logs. Thus, in some embodiments, the fluid property log may be calibrated following the taking of subsequent samples, for example using the downhole fluid samples 202, 204.
[0179] In one embodiment, one or more downhole fluid samples 202, 204 may be taken providing an indication of the fluid property. The continuous fluid property log may then be adjusted based on a value of the fluid property derived from these samples. For example, a shift may be applied to the log so as to align the fluid property value of the log with a measure value of the fluid property indicated by the downhole fluid samples.
[0180] Where multiple samples are taken, the shift to be applied to the log between the sample locations may be calculated by interpolation of the shifts applied at each sample location.
[0181] This technique provides a highly precise, continuous log of a desired fluid property of the reservoir.
[0182] Whilst the above examples have been described in the context of a gas-oil ratio as the target reservoir fluid property, the same technique may also be employed to create a model for estimating other reservoir fluid properties of the reservoir fluid at a sample location, based on measured mud-gas data. Exemplary reservoir fluid properties include a fluid density of the reservoir fluid, a saturation pressure of the reservoir fluid, and a formation volume factor of the reservoir fluid.
[0183] In one example, two models were generated for the prediction of the fluid density of the reservoir fluid, based on measured mud-gas data. One model was generated for predicting stock tank oil density (i.e. after flashing the fluid to nominal atmospheric storage pressure and temperature), and another model was generated for predicting live reservoir fluid density (i.e. at reservoir pressure and temperature). Both models were generated in substantially the same manner as discussed above in respect of the first model for gas-oil ratio prediction.
[0184] These models were found to predict the density of the fluid within the reservoir based on measured C.sub.1 to C.sub.5 mud-gas data with a MAPE of about 3% for the stock tank oil density and a MAPE of about 5% for the live reservoir fluid density.
[0185] Furthermore, a similar technique has also been used to train a model to estimate the reservoir fluid composition and corresponding C.sub.7+ fraction properties. This is advantageous, as this information can be used to for an equations of state (EOS) model calculation. The EOS model for a particular fluid is an expression that describes the relationship between pressure, temperature and volume of the fluid and can be used to predict the phase behaviour of the fluid in order to derive further properties thereof.
[0186] It is normally considered necessary to know at least the following properties of the fluid in order to determine the equations of state: [0187] 1) The absolute composition of each of the C.sub.1 to C.sub.6 hydrocarbons and the absolute composition of the C.sub.7+ hydrocarbons combined; [0188] 2) The average hydrocarbon density of the C.sub.7+ hydrocarbons; and [0189] 3) The average hydrocarbon molecular weight of the C.sub.7+ hydrocarbons.
[0190] When determining the equations of state for a fluid, the C.sub.7+ hydrocarbons are usually grouped together because these hydrocarbons usually remain in the liquid/oil phase. A standard C.sub.7+ characterisation method can split the C.sub.7+ into multiple pseudo components for EOS calculation.
[0191] It will now be shown that each of these properties can be determined using the machine learning models discussed above, thus allowing the EOS model to predict all reservoir fluid properties at the sample location.
[0192]
[0193] A similar model (not shown) is then determined for estimating an absolute composition for C.sub.7+ hydrocarbons within the reservoir fluid, which can be used to calculate the absolute (percentage) composition of C.sub.6 hydrocarbons when combined data from the model used in
[0194] Thus, it is possible to estimate the absolute composition of each of the C.sub.1 to C.sub.6 hydrocarbons and the absolute composition of the C.sub.7+ hydrocarbons combined.
[0195]
[0196]
[0197] As can be seen, all of the models show a good accuracy for the prediction of these properties, suggesting that the use of such machine models should produce good approximations to apply the EOS model for reservoir fluid property prediction.
[0198] Although individual fluid property models (like density and GOR) were developed in the first examples, it will be appreciated that a physical model could be generated that would calculate all fluid properties. The EOS model approach in the second example demonstrates a good solution for predicting all reservoir fluid properties.
[0199] Whilst preferred embodiments have been described above, it will be appreciated that these have been provided by way of example only, and the scope of the invention is to be limited only by the following claims.