DETECTION OF STRUCTURAL ANOMALIES IN A PIPELINE NETWORK
20220205956 · 2022-06-30
Inventors
- Mark Leslie STEPHENS (Adelaide, AU)
- Luke DIX (Adelaide, AU)
- Chi ZHANG (Adelaide, AU)
- Jinzhe GONG (Adelaide, AU)
- Benjamin CAZZOLATO (Adelaide, AU)
- Martin F. LAMBERT (Adelaide, AU)
Cpc classification
E03B7/003
FIXED CONSTRUCTIONS
G01M5/0025
PHYSICS
E03B7/071
FIXED CONSTRUCTIONS
F17D5/06
MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
Y02A20/15
GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
International classification
Abstract
Methods of processing a data signal obtained from a sensor sensing a dynamic signal to detect a structural anomaly event are disclosed. In one embodiment, a method includes obtaining signal components attributable to fluid flow at a location within an operational pipeline network; processing the data signal to extract one or more features; characterising the one or more extracted features; and detecting an indication of a structural anomaly event proximal the location depending on the characterisation; wherein the structural anomaly event includes an occurrence and/or further development of a structural anomaly.
Claims
1. A method of processing a data signal obtained from a sensor sensing a dynamic signal including signal components attributable to fluid flow at a location within an operational pipeline network, the method including: processing the data signal to extract one or more features; characterising the one or more extracted features; and detecting an indication of a structural anomaly event proximal the location depending on the characterisation; wherein the structural anomaly event includes an occurrence and/or further development of a structural anomaly.
2. A method according to claim 1 wherein the operational pipeline network is a mains water distribution pipeline network.
3. A method according to claim 2 wherein the operational mains water distribution pipeline network includes a network of cast iron pipes and wherein detecting an indication of a structural anomaly structural anomaly event includes: a. detecting an indicator of a circumferential crack in a pipe; or b. detecting an indicator of a longitudinal crack in a pipe, depending on the characterisation of the one or more extracted features.
4. (canceled)
5. A method according to claim 1 wherein the data signal comprises one or more data segments obtained from sampling the dynamic signal.
6. A method according to claim 5 wherein the duration of the or each one or more segment is less than about 60 seconds.
7. A method according to claim 6 wherein the duration of each segment is less than about 30 seconds.
8. A method according to claim 7 wherein the duration of each segment is less than about 20 seconds.
9. A method according to claim 8 wherein the duration of each segment is less than about 15 seconds.
10. A method according to claim 1, wherein the dynamic signal is an acoustic signal and wherein the data signal is an uncompressed wave file.
11. The method of claim 5, wherein each segment comprises a sequence of overlapping data frames.
12. A method according to claim 1, wherein the one or more extracted features includes one or more frequency domain features extracted from a frequency domain characteristic of the data signal, said frequency domain characteristic including a power spectrum density derived from the data signal.
13. A method according to claim 12 wherein the one or more frequency domain features includes at least one of: a value of a median frequency; a value of a mean frequency; a value of a peak frequency; a standard deviation of a power spectral density; a kurtosis of a power spectral density; a skewness of a power spectral density; a spectral flatness; a spectral centroid; a spectral spread; and a distribution of a power spectral density.
14. A method according to claim 1, wherein the one or more extracted features includes one or more time domain features of the data signal.
15. A method according to claim 1 wherein the one or more extracted features includes a combination of one or more frequency domain features and one or more time domain features of the data signal.
16. A method according to claim 1, wherein characterising a feature to detect an indication of a structural anomaly event proximal the location depending on the feature includes one or more of: a. comparing values of an instance of the feature with a reference value, b. recognising a pattern associated with the feature to thereby discriminate the feature; and c. identifying an predetermined behavior and/or attribute of the feature to discriminate the feature.
17. (canceled)
18. (canceled)
19. A method according to claim 11, further including processing the data signal to derive the at least one power spectrum density, said processing including: framing a segment of the data signal as a sequence of data frames; and processing a selection of the data frames to determine a separate respective power spectrum for each processed segment.
20. The method according to claim 19 wherein extracting one or more frequency-domain features from the power spectrum to identify an indicator includes: processing each separate power spectrum to determine a respective first signal parameter value; and processing a set of the determined first signal parameter values to determine an indicator value for processing to characterise the data signal as indicating a structural anomaly event proximal the location depending on the first signal parameter value.
21. The method according to claim 20 wherein the first signal parameter includes a median frequency of one or more the power spectral densities derived from or for the data signal.
22. The method of claim 21, wherein the median frequency is a normalised median frequency.
23. The method of claim 20 wherein the first signal parameter includes a root-mean-squared noise derived for or from the data signal.
24. (canceled)
25. The method of claim 20 wherein processing a set of determined first signal parameter values to identify an indicator value includes identifying the indicator value as a first signal parameter value which n % of an ordered set of the first signal parameter values exceeds.
26. The method of claim 25 wherein n % is between about 50% and about 75%.
27. The method of claim 25 wherein n % is between about 75% and about 90%.
28. The method of claim 25 wherein n % is about 90%.
29. The method according to claim 12 wherein the frequency domain features includes: a. at least one power spectrum for each processed segment; or b. a temporal distribution of a plurality of the separate respective power spectrums for each processed segment
30. (canceled)
31. The method according to claim 29 further including detecting a change in a profile of the distribution of the or each power spectrum as an indicator of a structural anomaly event proximal the location.
32. (canceled)
33. A method according to claim 29 further including detecting a characteristic of the or each power spectrum as indicating a structural anomaly event proximal the location, and wherein the characteristic is detected using a neural network operable to receive a collection of prior power spectrums as an input and detect the characteristic indicating a structural anomaly event based on a variation in the or each power spectrum.
34. (canceled)
35. A system for processing a data signal obtained from a sensor sensing a dynamic signal including signal components attributable to fluid flow at a location within an operational pipeline network, the system including: a memory; a set of computer readable instructions stored in the memory. a processor for executing the set of computer readable instructions to: process the data signal to extract one or more features; characterise the one or more extracted features; and detect an indication of a structural anomaly event proximal the location depending on the characterisation; wherein the structural anomaly event includes an occurrence and/or further development of a structural anomaly.
36. An apparatus for processing a data signal obtained from a sensor sensing a dynamic signal including signal components attributable to fluid flow at a location within an operational pipeline network, the apparatus including: means for processing the data signal to extract one or more features; means for characterising the one or more extracted features; and means for detecting an indication of a structural anomaly event proximal the location depending on the characterisation; wherein the structural anomaly event includes an occurrence and/or further development of a structural anomaly.
37. A computer-readable storage device storing instructions that, when executed by a processor, cause the processor to perform operations comprising: obtaining a data signal from a sensor sensing a dynamic signal including signal components attributable to fluid flow at a location within an operational pipeline network: processing the data signal to extract one or more features; characterising the one or more extracted features; and detecting an indication of a structural anomaly event proximal the location depending on the characterisation; wherein the structural anomaly event includes an occurrence and/or further development of a structural anomaly.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
[0066] Embodiments of the present invention will be discussed with reference to the accompanying drawings wherein:
[0067]
[0068]
[0069]
[0070]
[0071]
[0072]
[0073]
[0074]
[0075]
[0076]
[0077]
[0078]
[0079]
[0080]
[0081]
[0082]
[0083]
[0084]
[0085]
[0086]
[0087]
[0088]
[0089]
[0090]
[0091]
[0092]
[0093]
[0094]
[0095]
[0096]
[0097]
[0098]
[0099]
[0100]
[0101]
[0102]
[0103]
[0104]
[0105]
[0106]
[0107]
[0108]
[0109]
[0110]
[0111]
[0112]
[0113]
[0114]
[0115]
[0116]
[0117]
DESCRIPTION OF EMBODIMENTS
[0118] Referring initially to
[0119] Before continuing further with a description of the system 100, it is to be understood that references to the term “operational pipeline network”, where used throughout this specification, are to be understood to denote a pipeline network 106 which is operationally transporting a fluid or a gas using one or more pipes. A “pipe network”, in this context, is to be understood as an interconnected arrangement of one or more pipes of the same or different pipe materials for transporting a fluid or a gas. Examples of pipe materials include metal (such as cast iron, ductile iron, mild steel, copper), ceramic, polymers, fibreglass and resin, reinforced concrete, prestressed concrete and cement materials.
[0120] In the description that follows, the operational pipeline network 106 will be described as a pipeline network for transporting and distributing water 108 (such as potable water) using a network of cast iron pipes 110. However, is to be appreciated that methods and systems of the present disclosure are not limited to water distribution pipeline networks nor to a network of cast iron pipes 106. Indeed, it is possible that embodiments of the disclosure may be applicable to other types of water pipeline networks, such as pipeline networks for transporting and/or distributing waste water, unprocessed water, salt (sea) water, river water, artesian water, spring water and irrigation water, as non-limiting examples. Furthermore, it is also possible that embodiments may be applicable for use with operational pipeline networks within an industrial environment, such as pipeline networks for transporting and/or distributing oil, gas, air, heating or cooling fluid, hydraulic fluid or lubricating fluid. Examples include, but are not limited to, petroleum refinery, storage and supply, processing factories, wine making and storage, minerals processing, gas supply networks, hot water heating supply networks, or airport fuel systems. It will thus be appreciated that although the following description refers to a water distribution pipeline network, methods and systems according to various embodiments of the present disclosure are not limited to a water distribution pipeline network 106 and thus may find application in processing sensed signal data to detect an indication of a structural anomaly event at or near a monitored location in various types of pipeline networks of different pipe materials.
[0121] Returning now to
[0122] It is to be noted that whilst system 100 is depicted as providing plural types of data communication channels C1, C2, C3 for data communication between the one or more sensing units 104 at locations L1, L2, and L3 respectively and one or more of the processing units 120, 122, it is not essential that plural types of data communication channels be provided. It will be appreciated that a number of different types of data communication channels may be suitable and that the type(s) and configuration of the or each data communication channel will depend on the communication capabilities of the sensing unit 104 and the communication requirements and implementation considerations arising from that. In the present case, to assist with the discussion that follows, three examples of sensing units 104 are shown at L1, L2, L3 respectively, with each sensing units 104 having different data communication requirements for data communication via a respective channel C1, C2, C3.
[0123] In the present case the plural sensing units 104 are spatially located across the pipeline network 108. Each sensing units 104 includes one or more sensors for sensing a dynamic signal 5 including signal components attributable to fluid flow at or near each respective monitored location (L1, L2 and L3). In embodiments, the spatial location of the or each sensing unit 104 may be set, or otherwise determined and/or varied according to different criteria. Suitable criteria include, but are not limited to, critical pipe distribution, pipe materials, pipe main break distribution information, critical customer travel and transport routes and other utility criticalities.
[0124] As will be described in more detail following, embodiments including plural sensing units 104 spatially located across the pipeline network 102, such as the embodiment shown as system 100, may provide for improved spatial localisation, investigation and repair of indicated structural anomalies. Such improvements may arise as a result of the data signal from the separately located sensing units 104 including information which may assist with positionally and/or directionally locating a structural anomaly relative to the acoustic sensor of each sensing unit 104.
[0125] Although system 100 is depicted as including plural sensing units 104, it will of course be appreciated that it is not essential that the system 100 include plural sensing units 104 since, in some embodiments, it is possible that a single sensing unit 104 is provided to sense a dynamic signal 5 including signal components attributable to fluid flow at a single monitored location of the operational pipeline network 106 for detecting an indication of a structural anomaly event proximal thereto. Nevertheless, it is preferred that plural sensing units 104 be provided to provide improved positional and/or directional locating of a structural anomaly.
[0126] In the present case, each sensing unit 104 is configured to sense a dynamic signal 5 including signal components which are attributable to vibro-acoustic energy caused by fluid flow at or near the respective monitored location, at the least, so as to generate a respective data signal. In this respect, in this specification references to a “dynamic signal” are to be understood to denote a time-varying signal including signal components attributable to the vibro-acoustic energy of fluid flowing within a pipe of the water distribution pipeline network 102, at the least. The time-varying signal components may include, for example, amplitude and/or frequency components of the “dynamic signal”. A dynamic signal will thus have amplitude related features and frequency related features. A non-limiting example of a “dynamic signal” is an acoustic signal, such as noise, acoustic pressure waves and acoustic vibration waves.
[0127] The dynamic signal 5, and thus the sensed data signal 10 (ref.
[0128] In relation to the noise and vibrations attributable to vibro-acoustic energy caused by fluid flow at or near the respective monitored location, where fluid flow is disrupted or modified by a structural anomaly which creates a “leak” (such as a crack, joint, hole or any other defect), at least some of the signal components of the sensed dynamic signal 5 attributable to such noise and vibrations may include noise and vibrations arising from the fluid contained within the pipe discharging through the breakage, such as noise arising from an acoustic wave radiating from the leak location. Noise attributable to an acoustic wave radiating from the leak location of this type may be detected by sensors located within the pipe, or in contact with the pipe, or proximal to the pipe. For example, it is possible that noise attributable to an acoustic wave radiating from the leak location be detected by a sensor located within a medium supporting the pipe, such as a soil medium.
[0129] Depending on the type of sensor employed by a sensing unit 104 to sense a dynamic signal 5 including coherent noise and vibrations arising from an acoustic wave radiating from the leak location, the sensor may be positioned internally within a pipe (that is, within the contained fluid), on or near fittings in mechanical communication with the pipe, or on or in surrounding physical elements (such as the soil or other medium in which the pipe is located). Examples of suitable sensor types include accelerometers, hydrophones, microphones, pressure transducers, optical sensors, and strain gauges. However, it is to be appreciated that any sensor suitable for sensing a dynamic signal 5 including components attributable to an acoustic wave radiating from the leak location may be used.
[0130] Irrespective of the type of sensing unit 104, each sensing unit 104 provides a data signal 10 in the form of an analog or digital domain representation of the respective sensed dynamic signal 5 for processing.
[0131] Providing the data signal 10 may involve one or more signal conversion and/or conditioning operations. In a typical case, for example, a sensing unit 104 will be configured to convert a sensed dynamic signal 5 to a digital signal, such that the signal conversion operations will include analog-to-digital (A/D) conversion. Examples of signal conditioning operations that may be performed on the sensed dynamic signal 5 include analog and/or digital domain bandpass filtering (e.g., low-pass filtering). For example, the or each sensing unit 104 may include in-built high-pass filter, such as a high-pass filter having a cut-off frequency of 30 Hz, and an in-built anti-aliasing filter. Particular cut-off frequencies may depend on the pipe material.
[0132] Turning now to
[0133] In the sensing unit 104 depicted in
[0134] In some embodiments, output data 202 includes the data signal 10 for output data communication to an external processing unit (such as processors 120, 122 shown in
[0135] The data signal 10 may include a continuous real-time signal, or it may include segmented data in the form of a series or set of data segments including time-limited periodic samples of the dynamic signal 5.
[0136] For example, in an embodiment, processor 202 periodically acquires a segment of a sensed signal 201 from transducer 200 to generate the data signal 202 as a sequence of wave files (such as a .wav file), with each wave file including a digital domain representation of a respective segment of sensed signal 201. For example, processor 202 may use a sampling rate of about 5 kHz to sample a 10 second segment of sensed signal 201 at 10 minute intervals. Other sampling rates and sample intervals may be used. For example, a sampling rate in the range of 256 Hz to 10 kHz may be suitable. Each resultant 10 second wave file is then communicated to, for example, an external processor, such as processor 122 (ref.
[0137] In other embodiments, it is possible that processor 202 locally processes sensed signal 201 for local processing to detect an indication of a structural anomaly event. In such embodiments, it is possible that processor 202 may generate the data signal 202 to encode a detection event for communication to a remote monitoring station (such as station 124 shown in
[0138] In some embodiments, a signal energy level threshold may be configured locally at each sensing unit 104 such that exceedance of the threshold may trigger an ad-hoc data transmission of data 202 encoding a detection event. Techniques for processing sensed signal data to obtain an indication of a structural anomaly event within an operational pipeline network will be described following. Turning now to
[0139]
[0140] Initially, a processor, such as the processor 122 shown in
[0141] As will be described following, extracting one or more features of the data signal 10 and characterising the one or more extracted features to detect an indication of a structural anomaly event proximal the location depending on the characterisation may involve different techniques. Examples of suitable techniques for processing a data signal 10 to extract and characterise one or more features thereof to detect an indication of a structural anomaly event proximal the location depending on the characterisation will now be presented.
EXAMPLE 1
Magnitude, Frequency and/or RMS Signal Feature Extraction and Characterisation
[0142]
[0143] In the description below, the schema 500 shown in
[0144] With reference initially to
[0145] Pre-processing block 502, obtains at step 702 (ref.
[0146] Segment S may have any suitable data format. One example data structure for S is shown in
[0147] In one particular example, the wave file is recorded as a file having a sampling rate of 4681 Hz and an approximate duration (t.sub.DUR) of 10 seconds to provide (N) 46,786 data points (d.sub.x), In this example, obtaining a data signal 10 by the pre-processing block 502 involves converting the UINT8 data segments S of data signal 10 into double precision values which are then normalised so as to have a minimum possible value of −1 and a maximum possible valve of +1. Normalisation normalises the sensed signal data relative to a maximum possible dynamic range of the sensor transducer 200 of the sensing unit 104 (ref.
where x is the original measurement of the dynamic response, and {tilde over (x)} is the normalised data with a minimum possible value of −1 and a maximum possible value of 1.
[0148] In the present case, pre-processing block 502 also includes a filter, such as a high-pass filter having a cut-off frequency and an in-built anti-aliasing filter. In an embodiment, the pre-processing block 502 includes a high-pass filter having a cut-off frequency of 30 Hz. However, it will be appreciated that the type and configuration of the filter may vary according to implementation considerations.
[0149] Turning now to
[0150] Windowing process 602 arranges, at step 704 (ref.
[0151] In certain cases, at least some of the data frames of the set A={a.sub.1, a.sub.2, . . . a.sub.n} will include data which has been contaminated by vibro-acoustic energy from non-leak/crack sources, such as traffic noise, water meter ticking noise or other sources of incoherent noise. In a particular example, data frames having features which may indicate that the data frame includes data which has been significantly contaminated by such noise may be attenuated or removed prior to further processing by any suitable technique. In the illustrated embodiment, noise processing process 604 attenuates or removes, from the set of data frames A, data frames (a) having features which may indicate, for example, that the data frame(s) includes data which has been contaminated by vibro-acoustic energy from non-leak/crack sources, such as environmental sources using a suitable noise processing technique.
[0152] One example of a suitable technique includes setting a signal threshold value such that any data frames having a peak signal value which exceeds the signal threshold value are taken to have been contaminated by incoherent noise and are removed before any following analysis. For example, a signal threshold value may be selected as 0.95 (for normalised data that has a possible maximum of 1). It will of course be appreciated that other signal threshold values may be used. In this respect, lowering the threshold for the peak-based pre-processing may remove more contaminated data frames. However, this may also increase the risk of removing useful data.
[0153] Another example of a suitable technique for attenuating or removing data frames having features which may indicate that the data frame includes data which has been contaminated, includes determining, at step 706, (ref.
where N is the total number of data points within a data frame a, and x.sub.i is the value of the i.sub.th data point of the data frame after normalisation.
[0154] In this example, data frames having an RMS value which exceed an RMS threshold value are removed from the set of data frames A={a.sub.1, a.sub.2, a.sub.3, . . . , a.sub.n} to thereby provide a subset of data frames B formed, at step 708, according to the RMS threshold value. One example of a RMS threshold value is the RMS value which 5% of the RMS values for the data frames exceed. Other threshold values may also be suitable.
[0155] Having constructed a subset B of data frames from which “noisy” frames may have been excluded, FFT process 606 transforms each data frame of the subset B into the frequency domain to generate, after square process 608, and at step 710, data (P.sub.1, . . . P.sub.N) representing the power spectral density (PSD) of each transformed remaining data frame(s) of the subset B.
[0156] Transforming each data frame of the subset B into a frequency domain representation comprising PSD data (P.sub.1, . . . P.sub.N) may involve applying a suitable tapering window function to each data frame of the subset B so as to enhance frequency resolution and reduce spectral leakage prior to applying the fast Fourier transform (FFT) process 606.
[0157] One example of a suitable tapering window function is a Hamming window defined by Equation (3).
where n is an integer from 0 to N−1, N is the total length (data points) of the window, w[n] is the n.sub.th value in the window. Applying a window to a segment S of data involves multiplying the window function with the data.
[0158] Suitable FFT 602 and square processes 608 for converting a time-domain data signal into a frequency domain signal data representing a PSD would be understood to a skilled person. A fast Fourier transform for data with discrete samples is defined by Equation (4).
where N is the total length (data points) of the data to be transformed, x.sub.n (n=0, . . . , N−1) are the data in the time domain, P.sub.k (k=0, . . . , N−1) are the transformed results in the frequency domain, j is the imaginary unit.
[0159] Feature analysis and detection block 508 processes, at step 714, the PSD data (P.sub.1, . . . P.sub.N) for each respective data frame to determine a value of a statistical parameter, which in this example is a normalised median frequency (MF) value, using median frequency determination process 610 to thereby provide a set of normalised MF values. Before continuing further, although the following description describes the use of a normalised MF value, it is to be appreciated that other statistical parameters may be used, either separately or in combination with the MF value. For example, in relation to time-dependent features, it is possible that standard deviation, mean and percentile-based ranges could be used either separately or in combination with the MF value.
[0160] In the present case, the dimensional MF values are normalised with respect to the Nyquist frequency (ie. half of the sampling frequency) so as to provide normalised MF values in the range of 0 to 1. The MF value can be determined by finding the frequency bin which divides the PSD into two halves as shown in Equation (4).
where P.sub.k is the PSD value at the kth frequency bin, N is the total number of frequency bins in a PSD, and M is the frequency bin that divides the PSD into two halves. The MF is the frequency corresponding to the M.sub.th frequency bin. The normalised MF is the MF divided by the Nyquist frequency, which is half of the sampling frequency.
[0161] Median frequency determination process 610 then processes at step 714 (ref.
[0162] Before proceeding further, although the below description describes the use of an L90 value, it is to be appreciated that the present disclosure is not intended to be so limited. In other words, in some embodiments, other LXX values (where XX is a two-digit number) may be extracted, such as L95, L85, L80, L75 as further non-limiting examples. Other LXX values may also be suitable.
[0163] Having extracted, for the data signal 10, a feature in the form of the L90 MF value, MF analyser 612 then characterises, at step 716, the L90 MF value for a segment (or a set of L90 MF values for plural segments S) of the data signal 10 to determine whether the data signal 10 includes an indication of a structural anomaly event proximal the location, depending on a characterisation of the L90 MF value or indeed on a characterisation a set of L90 MF values including a current L90 MF value.
[0164] In this respect, the applicants have found that an L90 MF value which can be characterised as having a persistent increase over time may indicate an occurrence and/or further development of a structural anomaly event, such as a new through-wall crack, in pipes proximal to a sensing unit 104. Accordingly, in one embodiment, L90 MF values for plural segments S of the signal data 10 sampled at different times are analysed to detect and/or recognise changes in the set of L90 MF values as indicating the occurrence and/or further development of a structural anomaly event. In one example, threshold L90 MF values may be set for automated alarm generation, with the selection of a threshold based on, for example, historical L90 MF values for a normal (ie. no-fault) condition at each sensing location, or indeed to classify a leak as being due to a particular crack type.
[0165] In another example, embodiments may process plural segments S of the data signal 10 obtained at a sampling interval such that S={S.sub.t+S.sub.t+Δt+S.sub.t+2Δt+ . . . +S.sub.t+nΔt} where Δt is the sampling interval, and apply statistical techniques, pattern recognition techniques, change detection techniques, digital filtering, or combinations thereof to a resultant set of L90 MF values characterising the data signal 10 to thereby detect an indication of a structural anomaly event proximal the location depending on the characterisation.
[0166] Although the above examples involves characterising either an extracted discrete L90 MF value or an extracted temporal set of L90 MF values to detect an indication of a structural anomaly event proximal the location depending on a characterisation of the L90 MF value or values, it may be desirable to determine other statistical indicators to validate and/or supplement an indication obtained from an L90 MF value. For example, in some embodiments it is possible that a suitable RMS value may extracted for the or each segment S and used either separately or in combination with the L90 MF value or values to values to detect an indication of a structural anomaly event. One example of a suitable RMS value is an L90 RMS value.
[0167] A discrete L90 RMS value for a single segment S of a data signal 10, or indeed a temporal set of L90 RMS values for a set of plural segments S={S.sub.t+S.sub.t+Δt+S.sub.t+2Δt+ . . . +S.sub.t+nΔt} where Δt is the sampling interval, of the data signal 10 may be determined using a similar approach to that described in relation to the L90 MF value above.
[0168] To determine an L90 RMS value for a single segment of a data signal 10, the data signal 10 is first conditioned via filter process 502 and windowing process 503. A RMS value is then determined for each of the resulted data frames using Equation (1) to obtain a set of RMS values. The set of RMS values are then ranked from lowest to highest, and a feature is extracted as an L90 RMS value for which 90% of the set RMS values exceeds. In this case, the L90 RMS value is the extracted feature for the data signal 10, and detection of an indication of a structural anomaly event proximal the location depends on its characterisation. For an L90 RMS value for a single segment of the data signal 10, characterisation of the L90 RMS value may involve characterising the L90 RMS values as exceeding a signal magnitude threshold, or falling with one or more ranges.
[0169] For a set of L90 RMS values obtained for plural segments S of the data signal 10, detection of an indication of a structural anomaly event proximal the location depending on the characterisation of the L90 RMS values may involve statistical techniques, pattern recognition techniques, CUSUM processing, digital filtering (such as Kalman filtering techniques), p-value, or combinations thereof, to the set of L90 RMS values to identify changes in the acoustic wave that relate to a damaged or crack pipe.
[0170] In a further example, a set of L90 RMS values and a set of L90 MF values may be extracted from a data set comprising a collection of segments collected at different times for a particular sensing location. The or each extracted set of determined L90 RMS and/or L90 MF values may then characterised so as to detect an indication of a structural anomaly event proximal the location depending on the characterisation. For example, a set of extracted L90 RMS and L90 MF values which can be characterised as having a persistent increase in the L90 RMS and L90 MF values may indicate the detection of an occurrence and/or further development of a structural anomaly event, such as a new through-wall crack, in the pipes surrounding the sensor or sensors of the sensing unit 104.
[0171]
[0172]
[0173] The above examples relates to the application of statistical based characterisation of frequency and/or RMS signal features of a data signal which have been extracted using a process described. However, it is to be appreciated that other methods may be used to characterise the extracted features so to obtain indication of the occurrence, attributes and/or further development of a structural anomaly event within an operational pipeline network. Other suitable methods may involve, machine learning characterisation and detection techniques, pattern recognition techniques, CUSUM processing, digital filtering (such as Kalman filtering techniques), or combinations thereof, to identify, by characterising the frequency and/or RMS signal features of the data signal 10, changes in the acoustic wave that relate to a damaged or crack pipe.
EXAMPLE 2
Predictive Frequency and/or RMS Signal Extraction and Characterisation
[0174]
[0175] As shown in
[0176] In this example, each segment of a set of plural segments S={S.sub.t+S.sub.t+Δt+S.sub.t+2Δt+ . . . +S.sub.t+nΔt} where Δt is the sampling interval, are separately partitioned into a selectable number of frames (e.g., N=256 windows for a 10 second signal) by windowing process 602 using the windowing process described above with reference to
[0177] Feature extraction process 502 shown in
[0178] The above described operations thus involve deriving a PSD for each frame of a segment S, and processing each derived PSD to determine a median frequency (MF) and an RMS value for each frame of the segment S, with each MF and RMS value determined for the frames of a segment then being stored in the corresponding MF Vector 804 or RMS Vector 806 for further processing by analysis and detection function 508.
[0179] In certain embodiments, and as is shown in
[0180] At step 908, boundary generation process 816 determines a standard deviation for the values of corresponding frames in the stored MF and RMS vectors (using each of the stored vectors for all segments and the variation in the MF and RMS values in each corresponding frame) and generates four additional vectors as lower and upper bounds for the MF and RMS vectors 812, 814 respectively. Alternatively, a selectable percentage offset from the MF and RMS vectors 812, 814 may be generated to establish two additional vectors to act as the lower and upper bounds for both the MF and RMS vectors 812, 814.
[0181] Continuing now with reference to
[0182] A “current” PSD is then derived from the spectrum for each signal component within each frame of the current segment S. An MF value of each PSD for each frame of the current segment S is then extracted (using, in this example, the MF determination process 610 of segment feature extraction function 506) and used to construct, at step 902, a “current MF vector” 804 (ref.
[0183] Having constructed the current MF vector 804 and the current RMS vector 806, boundary processor 818 then characterises the current MF vector 804 and/or the current RMS vector 806 to detect an “alert condition” indication of a structural anomaly event proximal the location depending on the characterisation the extracted features.
[0184] In certain embodiments, characterising the current MF vector 804 and/or the current RMS vector 806 involves assessing, at step 904, differences between the MF and RMS values in each frame of the current MF vector 804 and the current RMS vector 806 respectively and the corresponding frame values of the MF reference vector 812 and the RMS reference vector 814 respectively. Assessment of differences between current MF/RMS values and reference MF/RMS values may be performed using any suitable technique.
MF and/or RMS Boundary Processing
[0185] One example of a suitable technique involves boundary processing process 818 comparing the frame values of the MF vector 804 and/or the RMS vector 806 respectively and the corresponding frame values of the reference MF/RMS vectors 812, 814 to determine if the MF/RMS value, in any frame, exceeds the respective establish upper bound and/or is less than the respective established lower bound discussed above. In this way, for example, a metric, such as a percentage out of bounds, may be determined using the current and reference MF/RMS value for corresponding frames. Alternatively or additionally, a number of out of bounds occurrences across all frames may be counted and a peak and/or average percentage out of bounds determined.
[0186] If the current MF/RMS frame values are within the respective established respective MF/RMS upper and lower bounds generated by boundary generation process 816, no anomaly indication is raised by alert processor 820. In this case, the data stored in data store 808 may be updated such that data for the “oldest” segment is shifted out of the data store 808 and frame values from the current RMS vector 806 and MF vector 804 included as each new segment S becomes available in a FIFO “sliding window” type manner. Reference vectors 812, 814 are thereafter updated according to the updated values.
[0187] If a MF and/or RMS frame value of the MF vector 804 and/or the RMS vector 806 respectively is below the corresponding lower bound, the current MF and/or RMS values may be stored in data store 808, or other suitable storage, for visualisation and assessment of signal level shifts at specific measurement sensors and an anomaly alert raised by alert processor 820. If particular frame values of the current MF/RMS values are below the corresponding lower bound, then the reference MF/RMS values may be updated as for the case where the current MF/RMS values are all within the bounds.
[0188] Occurrences above the respective upper bound, as statistically quantified in terms of a number of these occurrences (ie. a count), a peak and average percentage above the upper bound of the occurrences and/or an integrated total MF/RMS value of the current MF/RMS values above the reference MF 812/RMS 814 values upper bound, may be used to raise a structural anomaly event indication, such as a leak alert (possibly subject to an assessment of the noise as Gaussian).
[0189] In certain embodiments, if the current MF/RMS frame values (one or more of them) are above the respective upper bound then the reference MF/RMS values may be subject to additional or alternative methods of adjustment before the data signal 10 for the next sample is obtained and analysed.
[0190] For example, in certain embodiments, in a first method of adjustment the reference MF/RMS frame values (for time<t where t is the current time) are “frozen” or “retained” and not updated using the current MF/RMS values (for time=t) before obtaining the data signal for the next recording sample t+Δt (where Δt is the recording time interval). The new or next “current” MF/RMS values for time t+Δt are then processed to determine differences between current MF/RMS values and reference MF/RMS values above. Quantified statistics for the number of upper bound exceedance occurrences (the count), the peak and average percentage above the upper bound of the occurrences and the integrated total power of the current MF/RMS values above the reference MF/RMS values upper bound are used to re-confirm a structural anomaly indication, in the form of an alert, for the new or next “current” MF/RMS values at time t+Δt.
[0191] In a second method of adjustment, the reference MF/RMS values (for time<t where t is the current time) are not “frozen” or “retained” and are instead updated using the current MF/RMS values (for time=t) before moving to the next recording time (t+Δt (where Δt is the recording time interval). The new or next “current” MF/RMS values for time t+Δt are then subject to the methods described above for assessing differences between current MF/RMS values and reference MF/RMS values.
[0192] In embodiments, an operational response indication or requirement may also be set in associated with an alert, depending on statistical quantification of the exceedance of the upper bound, and analysis of the associated spectrum, with associated different response levels (matching practical response capability). For example, if the peak percentage above the upper bound is 10% or less (operator selectable) then an operational response time indication or requirement may be set as slow (for example, >2 weeks). However, if a peak percentage above the upper bound is 100% or more (operator selectable) then the operational response indication or requirement may be set as “fast” (for example, <24 hours).
Rate of Change Processing
[0193] In certain embodiments, a set of statistics may be generated to characterise a rate of growth of extracted features of the data signal 10 within the framework of the established MF/RMS values and their respective frames.
[0194] For example, in certain embodiments a rate of change (ROC) of a peak and average percentage above the upper bound of the occurrences and/or the integrated total MF/RMS value, also above the upper bound, for the new or next “current” MF/RMS values may be determined by establishing differences between each parameter for the new or next “current” (time=t+Δt) relative to the updated reference MF/RMS value (including the effect of the changes at time t) and quantifying a percentage shift and/or percentage shift in time.
[0195] The above is repeated for time=t+2Δt, t+3Δt . . . t+nΔt.
[0196] Depending on the sampling interval Δt, a selectable number of consecutive data signals may be used to assess the ROC of the extracted feature or features, such as indicative statistic parameters in order to classify a leak/crack event. For example, if the ROC: [0197] reduces to 0 (±a % tolerance) after a selectable number of consecutive recordings (the assessment period) then the fault may be assigned a diagnosis as either a likely joint leak or circumferential crack; or [0198] continues to increase (selectable bands can be set—e.g., 10% growth rate for pre-determined number of consecutive recordings) then the fault may be assigned a diagnosis as a likely longitudinal crack.
[0199]
[0200] As shown, the functional block diagram 920 involves obtaining a set of acoustic measurements performing signal analysis and feature extraction 504/506, establishing benchmarks 924 for, ROC analysis 926, comparing on-going data with the established benchmarks 928 to provide statistical measures 930, 932 and an alert process 934 for conditionally generating an alert for investigation and rectification 936 depending on the statistical measures 930, 932.
[0201] In this example, signal analysis and feature extraction 504/506 involves the same processes described above in relation to the segment processing function 504 and feature extraction process 506. ROC analysis 926 may involve one or more targets to detect and characterise a structural anomaly as a longitudinal crack, a circumferential crack or a joint leak.
[0202] Establishing the benchmarks (BM) 924 may involve establishing short period BMs and/or longer period BMs. Short period BMs may be based on, for example, a daily or weekly period, whereas longer period BMs may be based on a monthly period. It will of course be appreciated that other suitable periods may be used to establish the benchmarks. For example, a suitable period may be set, or adjusted, depending on parameters such as the time of year, weather, or water demand/usage patterns.
[0203] Comparison 238 of on-going data with BMs may involve using measurements over, for example, a previous week, month, or plural months, depending on the ANN model structure established. However, longer and “rolling” data through BMs may be used to provide a continuous ROC indication.
[0204] The ROC of the peak and average percentage above the upper bound of the occurrences and the integrated total MF/RMS value, also above the upper bound, for the new or next “current” MF/RMS values may also be determined by establishing the differences between each parameter for the current (time=t) and new or next “current” (time=t+Δt) and quantifying a percentage shift and/or percentage shift in time.
[0205] In certain embodiments, the ROC is assessed relative to the established benchmark MF/RMS values within their respective frames (e.g., frequency windows) or for benchmarks established using integrated totals across all frames. For example, benchmarks may be established for each measurement location and can be established over durations of days, weeks or months.
[0206] Benchmarks established over shorter periods may reflect (and account for) dynamic changes in system and environmental noise at specific locations. Benchmarks established over longer periods are less reactive to dynamic changes in system and environmental noise at specific locations and are more sensitive to changes in MF/RMS within their respective frames and can be used to detect early crack development.
[0207] Measurement data for a period of days, weeks or months may be compared with the relevant established benchmarks to determine quantified statistics for the ROC that is occurring.
[0208]
[0209] Specific longitudinal crack events are known to only result in growth in specific frequency and/or MF frames (e.g., frequency windows). Integrated total median frequency/RMS value deviations from benchmarks are too coarse for detection and frequency band specific statistics must be calculated.
[0210]
[0211] In certain embodiments, it is possible that ROC may be represented visually for short and/or long period benchmarks as 2D heat-maps, as shown in the examples depicted in
[0212] The lower three plots depict a slow growing (over more than 30 days) longitudinal crack event differenced relative to a week, month and two month benchmarks. In certain embodiments, the gradual growth of the longitudinal crack and stronger manifestation relative to the long period benchmark enable its specific diagnosis by, for example, visual inspection, pattern recognition using an ANN, or an/or classification using an CNN or SVM with a suitably trained classifier.
[0213] While the above examples relate to frequency and/or MF processed data, analogous methods are applicable to RMS and other magnitude data.
Processing of PSD Reconstructed from MF/RMS
[0214] In certain embodiments, it is possible that a PSD characteristic for each of plural segments S may be constructed from the corresponding RMS and MF vectors generated over a selected number of intervals (i.e., acoustic data measurement intervals) and processed to provide a characteristic or representation including features for indicating a structural anomaly event depending on the characterisation or representation.
[0215] For example, functional block diagram 950 for one approach for processing a PSD characteristic reconstructed from corresponding RMS and MF vectors generated over a selected number of intervals using PSD generation function 952 and PSD processing function 954 is shown in
[0216] In certain embodiments, PSD generation function 952 generates a mapping of PSDs generated over a selected number of intervals for processing by PSD processing function 954. In this respect,
[0217]
Use of MF and/or RMS in Machine Learning
[0218]
[0219] In the example depicted in
[0220] As shown in
[0221] In the example shown in
[0222] One example of such a departure is evident, in
[0223] In this respect,
[0224] In the example shown in
[0225] In the example shown in
CUSUM Processing
[0226]
[0227] The example schema 1000 depicted in
[0228] In general terms, a CUSUM algorithm accumulates differences between the “current” frame values for the MF/RMS vectors 804, 806 and the corresponding reference frame values of the reference MF/RMS vectors 812, 814 as follows:
C.sub.i.sup.+=max[0,C.sub.i−1.sup.++z.sub.i−K.sub.i] Equation (6)
where z.sub.i is the frame value of the i.sup.th frame of either the current MF vector 804 or the current RMS vector 806, and K.sub.i is the corresponding frame value of the corresponding reference vector 812 or 814 respectively. Since K.sub.i represents a “normal” value for the relevant frame window, z.sub.i-K.sub.i represents the deviation from the normal value. In embodiments, for a particular frame value z.sub.i, the reference value K.sub.i is a reference value obtained or determined for the same frame for the previous 30 days.
[0229] In embodiments, when the CUSUM C.sub.i.sup.+ is characterised as exceeding a user-defined threshold T, the corresponding data signal 10 is identified as an outlier indicating a structural anomaly event proximal the location at which the data signal 10 was sensed. In this respect, an outlier region (OR) by CUSUM may be defined by:
OR(T)={z.sub.i:C.sub.i.sup.+>T} Equation (7)
where the threshold T represents either the increase in median frequency or RMS value required to characterise the extracted MF/RMS vectors 808, 806 as indicating the structural anomaly event.
[0230] In one embodiment employing a CUSUM algorithm, each 10 second wave file is broken into n frames, with each frame having an associated MF value and an associated RMS value.
[0231] Turning to
[0232] In certain embodiments, a CUSUM algorithm is applied for each 10 second wave files (with n frames) with an initialization C.sub.i.sup.+ being 0. Equation (6) is then applied iteratively until C.sub.n.sup.+ for the last frame is calculated. C.sub.n.sup.+ is a representation of accumulated MF or RMS for a particular day, and as is shown in
[0233] Although the above described examples involve particular techniques for processing MF and/or RMS noise features extracted for each of the plural frames of a data signal 10 to characterise the features as indicating a structural anomaly, it is possible that other techniques may be used. Other suitable techniques may involve statistical techniques, machine learning, pattern recognition techniques, digital filtering (such as Kalman filtering techniques), or combinations thereof. For example, differences between the current median frequency/RMS values and reference median frequency/RMS values may be assessed by applying Kalman filtering to vector representations of signal magnitude versus time.
EXAMPLE 3
Predictive Power Spectral Extraction and Characterisation
[0234] Turning now to
[0235] In the example depicted in
[0236] With reference to
[0237] To obtain a single representative median PSD, the median PSD value for each frequency bin is obtained by analysing all the power density values for that particular frequency bin across all the available PSDs. The median value is the value at the 50th percentile, meaning that 50% of the values are smaller than the median and 50% are larger than the median. In certain embodiments, the median value can be obtained by ranking all the power density values for a particular frequency bin (across the set of PSDs) from the lowest to the highest, and then selecting the middle one in the rank as the median value. Combining all the median power density values for all frequency bins then provides the representative median PSD for the data segment S.
[0238] The representative (mean and/or median) PSD may be determined for each segment S in a set of historical data segments (the historical data segment set can be defined by a “sliding window” manner, e.g. the previous 30 data segments relative to the current segment). A set of representative PSDs determined in this way may then be used to determine a reference PSD.
[0239] In this example, the or each mean PSD for each specific frequency bin is determined from the set of the representative PSDs. The combination of the mean power density results for all the frequency bins available provides a reference PSD for the set of historical data segments. That is, the or each reference PSD comprises a vector containing N power density values (e.g., power per frequency=dB/Hz) determined over N discrete (selectable) frequency bands covering the full range of frequencies recorded by the set of segments.
[0240] In certain embodiments, multiple frequency bins can be combined to form a single frequency band. In such a case, the power densities of all the bins used to form the band may be combined to represent the power density of that band. The result is a vector representation of the mean and/or median reference PSD with M discrete power density values over M discrete (selectable) frequency bands. In other embodiments, a frequency band may contain a single frequency bin. The more general concept of “frequency band” is used in the description that follows.
[0241] A standard deviation of the values in the mean and/or median reference PSD vectors may be determined (using all segments and the variation in the power density values in each discrete frequency band) and used to establish two additional vectors in the form of a lower and upper bound vectors for the mean and/or median reference PSD vectors.
[0242] Alternatively, a selectable percentage offset from the mean and/or median reference PSD vector values is used to establish two additional vectors as lower and upper bound vectors.
[0243] A segment S for a current sample time may then be obtained and a “current” PSD vector 1102 extracted. The current PSD vector 1102 may be determined using the same discrete frequency bands used to establish the corresponding reference PSD vector 1106. Differences between the current PSD vector 1102 and a corresponding reference PSD vector 1106 may then be characterised to detect an indication of a structural anomaly event. Any suitable method may be used to characterise the difference for detection purposes.
[0244] In a first example of a suitable method for characterising the difference for detection purposes, a mathematical comparison of the values of the current PSD vector 1102 and reference PSD vector 1106 may be undertaken by boundary processor 818 to determine if the PSD of the current segment, in any frequency band, either exceeds an establish upper bound or is less than an established lower bound. A percentage out of bounds may then be determined using the current and reference power density values of the respective vectors for the corresponding frequency bands. The number of out of bounds occurrences across all frequency bands may be counted and the peak and average percentage out of bounds recorded. An integration of the power densities of the current PSD vector 1102 that are above the upper bound of the power densities of the reference PSD vector 1106 may also be performed to determine the total power that is above the upper bound for the current PSD vector 1102.
[0245] In certain embodiments, if the current PSD vector 1102 values are within the established reference PSD upper and lower bounds then no leak or anomaly alert is raised. In such a case, the selected number of segments (ie. past) is updated with the oldest segment removed and the current segment included. The reference segments are thereby updated and shifted (by one recording time interval) as each new segment becomes available in a “sliding window” manner.
[0246] Occurrences below the lower bound may be stored for visualisation and assessment of signal level shifts at specific measurement sensors and an anomaly alert raised accordingly.
[0247] If the current PSD vector has power density(s) below the lower bound then the reference PSD vector is updated as for the case where the current PSD has all power density(s) within the bounds.
[0248] Occurrences above the upper bound (statistically quantified in terms of the number of these occurrences (the count), the peak and average percentage above the upper bound of the occurrences and the integrated total power of the current PSD vector above the reference PSD upper bound) are used to raise a leak/crack alert (subject to assessment of the signal as Gaussian).
[0249]
[0250] Plot diagrams 1110-A includes a plot of the characteristic of a current median PSD 1116-A prior to a leak/crack event on 11 Dec. 2007. As shown, plot 1116-A substantially falls within the upper 1112 and lower 1114 bounds across the frequency range of 0 Hz to 2500 Hz.
[0251] On the other hand, plot diagrams 1110-B includes a plot 1116-B of the characteristic of the current median PSD after a pipe crack event on 12 Dec. 2017. As shown, plot 1116-B exceeds the upper bound 1112 across substantially the frequency range of 0 Hz to 2500 Hz and, in this example, indicates a structural anomaly event for processing as a leak/crack alert by alert processor 820.
[0252] Although the above described examples involved particular techniques for processing a current median PSD plot feature extracted for a data signal 10 to characterise the current median PSD as indicating a structural anomaly, it is possible that other techniques may be used. Other suitable techniques may involve, for example, statistical techniques, machine learning (such as pattern classification or recognition techniques), digital filtering (such as Kalman filtering techniques), or combinations thereof.
Use of Kalman Filtering Techniques in PSD Processing
[0253] In certain embodiments, a Kalman filtering technique may involve obtaining a data signal 10 for a current recording time and then deriving a current PSD using the above described approach. The current PSD may be determined using the same discrete frequency bands used to establish a reference PSD. Differences between the current PSD and the reference PSD may then be assessed by applying Kalman filtering to the PSDs.
[0254] In certain embodiments, a Kalman filtering technique may detect a structural anomaly event for processing as a leak/crack alert by an alert processor when differences between the current PSD and the reference PSD exceed a Kalman filter estimate based on a threshold.
Machine Learning Processing of RMS Magnitude and PSD
[0255] In certain embodiments, an ANN or RNN machine learning approach can be trained to patterns in noise magnitude data with deviations from these trained patterns diagnosable as leaks.
[0256] With reference now to
[0257] The generated banded PSD training data sets are then used to establish predicted patterns (labelled as ‘ANN Prediction’ in
[0258] In certain embodiments, a departure from the ANN prediction may indicate a crack/leak event. Statistical average and peak percentage departures can be used to quantify the departure and urgency of operational response as previously described in this document.
[0259]
[0260]
[0261] In another example of a machine learning approach, as depicted in
[0262] Different trained ANN predictive models may be derived depending on the number of frequency bands used and the number of historical days, weeks or months used to provide training data. Analogously to the ROC and benchmark approach described above with reference to
[0263]
[0264]
[0265]
Integration of ROC and ANN/RNN Techniques
[0266] In certain embodiments, the statistical quantifications described above for the ROC methods may be applied to the above described ANN (and RNN) predictive methods. Furthermore, the use of the above described benchmarks derived using mean/median and standard deviation methods in the ROC method may be replaced with an ANN predicted “benchmark” based method.
[0267]
[0268] Signal analysis and feature extraction 1232 may involve the same processes described above in relation to the segment processing function 504 and feature extraction process 506. ANN 1234 may involve the same target for ROC analysis, namely, to detect and characterise a structural anomaly as a longitudinal crack, a circumferential crack or a joint leak.
[0269] Establishing the ANN model structure 1236 may involve establishing an ANN 1234 model based on a weekly period to, for example, use data from a previous week to derive an ANN prediction for, for example, a next day. Alternatively, the ANN model structure 1236 may involve establishing an ANN 1234 model based on a monthly period to, for example, use data from a previous month to derive an ANN prediction for, for example, a next day. Alternatively, establishing the ANN model structure 1236 may involve establishing an ANN 1234 model based on a period comprising plural months to, for example, use data from plural previous months to derive an ANN prediction for, for example, a next day. Irrespective of which approach for establishing the ANN model structure 1236 is adopted, each model provides predictions on a band-by-band basis. It will of course be appreciated that other suitable periods may be used to establish the ANN model structure 1236. For example, a suitable period may be set, or adjusted, depending on parameters such as the time of year, weather, or water demand/usage patterns.
[0270] Training the ANN models 1238 may involve using all the data from a single sensing unit 104 (ref.
[0271] Comparison of on-going data with ANN predictions 1240 may involve using measurements over, for example, a previous week, month, or plural months, depending on the ANN model structure established. However longer and “rolling” data through ANN predictions maybe used to provide a continuous ROC indication.
[0272]
[0273] An average prediction obtained using the model can be taken over a selected number of historical previous time periods (e.g., 50) and maximum and minimum deviations from this average determined with a selectable threshold added above the maximum and below the minimum to set detection limits.
[0274] A single or more exceedances of the thresholds can be used to generate an operational alert and/or an integrated total of exceedance between a measured MF vector or PSD (current) and predicted range. A typical non-event indication would exhibit a PSD within the threshold alerts bands (pink) whereas and a typical event indication (rapidly developing crack) would exhibit a measured PSD above the maximum threshold across a range of frequency bands.
[0275] Returning to
[0276] For example, if the non-Gaussian signal is continuous within a data signal 10 then the current PSD derived from the signal will be structured. This structure may be assessed by analysing, for example, the number of power density “spikes” (out of upper bound occurrences), within the established frequency bands, such that if they occur in greater than a predetermined and/or selectable number of non-adjacent frequency bands (selectable) a non-leak noise source may be indicated.
[0277] Furthermore, two or more power density “spikes” with the peak frequencies being integer multiples of each other (selectable tolerance for defining integer) may also be indicative of a source other than through-wall pipe cracks.
[0278] If the non-Gaussian signal is not continuous then this is taken to eliminate a leak either from the pipe and/or upstream of water meters connected to the pipe. However, a leak downstream of system water meters could not be ruled out. At this point, in embodiments visual classification of the spectrum(s) may be undertaken manually or by using visual pattern recognition machine learning tools (with these tools trained, for example, to standard pipe leak and non-pipe leak noise signals). In this respect,
[0279] The operational response to the leak/crack alert is adjusted based on statistical quantification of the exceedance of the upper bound, and analysis of the associated spectrum, with 3 different response levels (matching practical response capability). For example, if the peak percentage above the upper bound is 10% or less (operator selectable) then the operational response time is slow (>2 weeks). However, if the peak percentage above the upper bound is 100% or more (operator selectable) then the operational response time is fast (<24 hours). Similarly, bounds can be established for the machine learning approaches and operational response levels established.
EXAMPLE 4
Machine Learning Characterisation and Detection
[0280] As set out above, certain embodiments of the disclosure involve machine learning characterisation and detection. Embodiments which employ machine learning characterisation and detection may include statistical checks which are applied to a data signal 10 or features extracted from a data signal 10. In this respect, prior to proceeding further, in the below description, reference will be made to the data signal 10 as comprising a wave file. However, it is to be appreciated that the application of the method exemplified below is not to be so limited.
[0281] The following example relates to the application of a machine learning characterisation and detection of features which have been extracted using a process similar to that described above in relation to Example 2 above. However, it is to be appreciated that similar machine learning characterisation and detection techniques to those described below could equally be applied to the features extracted for other examples presented above.
[0282]
[0283] In the illustrated embodiment, a training data set 1202 comprising a set of data signals 10 (such as a wave file) including, for example, a leak/crack induced signal from cracked pipes, environmental noise forms, and no significant noise. The training data set 1201 is subjected to feature extraction process 1204 to extract time-domain and/or frequency domain features of the data signals 10, such as means, standard deviations, entropy, dominant frequencies and other statistics (e.g., noise distribution). The features are thus extracted from historical wave files including measured noises which may or may not include leak/crack induced signals. In this respect, it is not essential to use leak noise and no leak noise files from the same sensing location and wave files from plural sensing units can be used to significantly increase the number of sound files available for training and thus the size of the training data set 1202.
[0284] A signal classifier 1208 including, for example, a decision tree algorithm(s) or a supporting vector machine algorithm, or other classification algorithms, is then trained by training process 1206 to machine learn features associated with signals including leaks/crack or not. Training of the classifier 1208 by training process 1206 may be validated by validation process 1210 to check for accuracy at the completion of each classification step.
[0285] Turning now to
[0286] In the example depicted in
[0287] At step 1224, frequency domain features are extracted by feature extraction process 1204 (ref.
[0288] At step 1226, mel-frequency cepstral coefficients are determined. Methods for determining mel-frequency cepstral coefficients for a data signal 10 in the form of a wave file would be well understood to a skilled person.
[0289] In certain embodiments, only features that contribute to the classification are selected as the input for the machine learning model. The feature can be selected by filter method (features are evaluated individually and selected before running a machine learning model), wrapped method (feature selection is treated as an optimization problem where a machine learning model is run iteratively to search for the optimal combination of features) or embedded method (the feature selection is based the feature importance which is a result of machine learning model).
[0290] The results of the training and performance of the trained signal classifier 1208, when applied to current (or test) data, may be visualised using a decision tree map or confusion matrix, such as the decision tree 2300 illustrated in
[0291] The depicted training decision tree map 2300 represents the numbers of true positive (leak/crack=−1), true negative (no leak/crack=1), false positive (leak/crack=−1) and false negative (leak/crack=1) occurrences after feature extraction and then training of the “decision tree” (for example) training classifier for 1800 events sets (physically confirmed leak/crack (main break type) and no leak/crack wave files) with twenty-seven features extracted from a raw acoustic wave file. In the example shown in
[0307] The sensitivity of the classifier training to the extracted features can be determined using “Violin” and “Box” plots, amongst other diagnostic techniques, as shown by way of example in
[0308] The example “Violin” plot presented in
[0309] With reference now to
[0310] Returning again now to
[0311] During this classification, based on the extracted features, signal classifier 1208 then makes a prediction, for processing by alert processor 820, as to whether the current data signal 10 includes features indicating a structural anomaly event, such as a leak/crack event, or whether it does not include such features.
[0312] Accuracy of the prediction may be validated (either by field leak/crack localisation and the repair of a water system fault (including main breaks) or not (with no subsequent water system fault emerging). Validation of the prediction (whether accurate or not) may provide a further training information that may be used to update 1212 the historical wave files prior to the training process 1206 re-training the signal classifier 1208. A time period before re-training the signal classifier 1208 may be varied depending on the number of wave files that are being recorded and associated with physical outcomes. Examples of predictions for signal magnitude and PSD plots for leak/crack and no leak/crack events are shown in
[0313]
[0314] As the number of validated leak/crack events and their associated wave files in a training set increases, it is envisaged that “true positive” outcomes will increase as re-training of the classifier is undertaken. Adjustments to the feature extraction process can also be made to weight existing features that are determined and used for classification and/or to add additional mathematically determined features which are more or less sensitive to particular features. Training has been conducted for wave files including circumferential and longitudinal cracks on pipes as well as a full range of environmental noises (typically illustrated in the figures above). Specific training for sets of water system faults, including main breaks but also including faults downstream of water meters, at valves and at joints, together with all or classes of environmental noise has been undertaken.
[0315] In embodiments, a signal classifier may include a Convolutional Neural Network (CNN). Unlike a decision tree or support vector machine (SVM) that use the features as inputs, a CNN does not require “hand-designed” low-level features and it is capable of learning mid to high-level information from the input data by the millions of parameters inside the deep neural network. CNN model takes the 2-dimensional spectrogram as the input, so that the loss of information from the raw wave file is minimum.
[0316] Training a CNN model requires a substantial amount of labelled data, which can be very expensive or sometime impractical. However, the machine learning models that use selected features require less data to train. These two types of model are complementary in terms of the size of training data required.
[0317]
[0318]
[0319]
[0320] Wave files confirmed as pipe cracks or other environmental noise by the field investigation may be included in the data set to retrain the CNN classifier 2104.
[0321] A data set of wave files with manual label may be collected for CNN classifier training and validating. Each wave file in this data set may contain a signal that is induced by unknown pipe breaks or known environmental sources. Data augmentation can be applied, if required, to increase the data size. The data is randomly split into training data and validation data. A CNN classifier that is validated by the validation data is then selected to process wave files.
[0322] With reference now to
[0323] In certain embodiments, a CNN training model may be validated by validation data that the training model hasn't seen during training. One example of a confusion matrix arising from such a validation process is shown in
[0324] In view of the above, it will be appreciated that embodiments of the disclosure may involve various techniques for processing a data signal from one or more sensors to obtain an indication of an occurrence and/or further development of a structural anomaly event at one or more locations within an operational pipeline network. Embodiments of the present disclosure process the data signal to extract one or more features and characterise the one or more extracted features to detect an indication of a structural anomaly event proximal the location depending on the characterisation.
[0325] As described above, various one or more features may be extracted and characterised using different techniques. Furthermore, having detected a leak/crack event, embodiments may then classify the leak/crack event depending on the characterisation of the one or more features. In this respect,
Circumferential Cracks:
[0326] Spectral analysis: up to 3 new peaks with new energy in an associated PSD with the power density in any 50 Hz band being above a 15% threshold established using historical (no crack) PSDs; [0327] Signal magnitude: between a 20 to 30 dB rise in acoustic noise level above sensed background acoustic noise level; [0328] Environmental noise is greater than 80% “drowned out” by a new noise from the crack which has formed; [0329] Crack growth time to 100% crack development: between 1 to 10 hrs=>20 dB/10 hrs=2 dB/hr (slowest rise rate) and 30 dB/1 hr=30 dB/hr (fastest rise rate); and [0330] Have an elevated noise plateau after 90% crack development which remains stable (±10% variation) until repair.
Longitudinal Cracks:
[0331] Spectral analysis: one new peak when new (very short) crack is first detected in spectra or PSD with this peak occurring in a 50 Hz band anywhere between 100 to 2400 Hz with additional power intensity measured in this band above a 15% threshold established using historical (no crack) PSDs; [0332] Spectral analysis: up to 10 new peaks with new energy in the associated PSD, in bands anywhere between 50 Hz to 300 Hz wide, when crack and associated noise has reached “stable” state described below; [0333] Signal magnitude: up to 5 dB rise in acoustic noise level above the logger's background acoustic noise level during a period of slow and “stable” crack evolution (see below) [0334] Signal Magnitude: up to a further 5 to 15 dB rise in acoustic noise level above the sensing unit's sensed background noise level during a period of faster and “unstable” crack evolution (see below); [0335] Evolution time between first detection of new power density in PSD (and associated sustained magnitude rise<5 dB), and then “near stable” crack period, through to “unstable” crack period during which the noise level increases before failure (see below), is currently in a range between 70 to 220 days (slowest growth rate); [0336] Evolution time for noise from a crack to grow from an environmental noise level+5 dB (from above) to an environmental noise level +10 to +20 dB (during “unstable” crack development period) in a range between 5 to 20 days=>5 dB/20 days=0.25 dB/day (slowest rise rate) and 15 dB/5 days=3.0 (fastest rise rate) during the period of “unstable” crack growth; and [0337] Typically a continuous or semi-continuous increase in noise from the first occurrence of the crack to the repair or failure.
[0338] Throughout the specification and the claims that follow, unless the context requires otherwise, the words “comprise” and “include” and variations such as “comprising” and “including” will be understood to imply the inclusion of a stated integer or group of integers, but not the exclusion of any other integer or group of integers.
[0339] The reference to any prior art in this specification is not, and should not be taken as, an acknowledgement of any form of suggestion that such prior art forms part of the common general knowledge.
[0340] It will be appreciated by those skilled in the art that the invention is not restricted in its use to the particular application described. Neither is the present invention restricted in its preferred embodiment with regard to the particular elements and/or features described or depicted herein. It will be appreciated that the invention is not limited to the embodiment or embodiments disclosed, but is capable of numerous rearrangements, modifications and substitutions without departing from the scope of the invention as set forth and defined by the following claims.