METHOD AND SYSTEM FOR MONITORING A REMOTE SYSTEM
20220012644 · 2022-01-13
Inventors
- David CLIFTON (Oxford (Oxfordshire), GB)
- Patrick THOMPSON (Oxford (Oxfordshire), GB)
- Heloise GREEFF (Oxford (Oxfordshire), GB)
- Achut MANANDHAR (London (Greater London), GB)
Cpc classification
G16H40/20
PHYSICS
F04B2201/12
MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
G05B23/0235
PHYSICS
F04B9/14
MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
International classification
F04B9/14
MECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
Abstract
This disclosure relates to methods and apparatus for monitoring a remote system. In one arrangement, a plurality of measurement data units are obtained. Each measurement data unit represents a time series of measurements made by a sensor system at the remote system. A first trained machine learning model is used to identify a subset of the measurement data units that have a higher average probability of corresponding to an abnormal state of the remote system than the other measurement data units. Data representing the identified measurement data units is sent over a communications network to a central data processing system. An abnormal state of the remote system is detected by using a second trained machine learning model at the central data processing system to process the data representing the identified measurement data units.
Claims
1. A method of monitoring a remote system, comprising: obtaining a plurality of measurement data units, each measurement data unit representing a time series of measurements made by a sensor system at the remote system; using a first trained machine learning model to identify a subset of the measurement data units that have a higher average probability of corresponding to an abnormal state of the remote system than the other measurement data units; sending data representing the identified measurement data units over a communications network to a central data processing system; and detecting an abnormal state of the remote system by using a second trained machine learning model at the central data processing system to process the data representing the identified measurement data units.
2. The method of claim 1, wherein the first trained machine learning model estimates a probability of the remote system being in the abnormal state during a time period corresponding to each measurement data unit and the identification of the subset of measurement data units comprises identifying measurement data units corresponding to time periods in which the estimated probability is above a predetermined threshold.
3. The method of claim 2, wherein the predetermined threshold is dynamically adjusted by the central data processing system via the communications network.
4. The method of claim 3, wherein the predetermined threshold is lowered by the central data processing system in response to the second trained machine learning model detecting an increase in a probability of an abnormal state of the remote system.
5. The method of any of claim 2, wherein the first trained machine learning model uses logistic regression to estimate the probabilities.
6. The method of claim 1, wherein the remote system comprises a mechanical apparatus.
7. The method of claim 6, wherein the sensor system comprises an accelerometer.
8. The method of claim 7, wherein the accelerometer is attached to a moving part of the mechanical apparatus.
9. The method of claim 6, wherein the mechanical apparatus is a hand-operated water pump comprising a movable handle for hand-operating the water pump.
10. The method of claim 9, wherein the sensor system comprises an accelerometer configured to measure a component of acceleration of the handle parallel to a longitudinal axis of the handle.
11. The method of claim 1, further comprising pre-processing the measurement data units before the first trained machine learning model identifies the subset of the measurement data units.
12. The method of claim 11, wherein the pre-processing comprises determining a period of a largest periodic component of the times series of measurements in each measurement data unit, and the pre-processing comprises removing measurement data units in which a period of the determined largest periodic component is below a predetermined threshold period.
13. The method of claim 12, wherein the remote system comprises a hand-operated water pump and the predetermined threshold period equals 0.5 s.
14. The method of claim 11, wherein the pre-processing comprises applying a high pass filter to each measurement data unit.
15. The method of claim 11, wherein the pre-processing comprises transforming the measurement data units to represent the time series of measurements in the frequency domain.
16. The method of claim 1, wherein the second trained machine learning model comprises a support vector machine or a random forest classifier model.
17. The method of claim 1, wherein the remote system comprises an electrical infrastructure system.
18. The method of claim 1, wherein the remote system comprises a biological system.
19. The method of claim 18, wherein the biological system is a human or animal and the sensor system is configured to measure one or more parameters relevant to a state of health of the human or animal.
20. A system for monitoring a remote system, comprising: a local data acquisition unit comprising a sensor system and a local data processing unit; and a central data processing system; wherein: the local data processing unit is configured to: obtain a plurality of measurement data units, each measurement data unit representing a time series of measurements made by the sensor system at the remote system; use a first trained machine learning model to identify a subset of the measurement data units that have a higher average probability of corresponding to an abnormal state of the remote system than the other measurement data units; and send data representing the identified measurement data units over a communications network to the central data processing system; and the central data processing system is configured to: detect an abnormal state of the remote system by using a second trained machine learning model to process the data representing the identified measurement data units received from the local data acquisition unit.
Description
[0022] Embodiments of the invention will now be described, by way of example only, with reference to the accompanying drawings in which corresponding reference symbols indicate corresponding parts, and in which:
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031]
[0032]
[0033] The present disclosure relates to methods and systems for monitoring a remote system, such as an apparatus that is installed in a location where infrastructure such as high-speed internet and reliable power supplies are not readily available. Embodiments described below are particularly applicable to cases where the remote system comprises a mechanical apparatus having at least one moving part, such as a hand-operated water pump, but the principle may also be applied in other scenarios, including for remote monitoring of electrical infrastructure systems that are disconnected from the mains, such as off grid solar batteries, as well as for monitoring biological systems such as humans or animals in remote areas or using minimalist hardware or energy consumption.
[0034]
[0035] A sensor system 6 is provided for measuring physical characteristics associated with the remote system 2. In some embodiments, the sensor system 6 comprises an accelerometer attached to the remote system, for example to a moving part 4 of the remote system 2 or to another part which is affected by operation of the remote system (e.g. due to vibrations from operation propagating to that part).
[0036] In an embodiment, the sensor system 6 is provided as part of a local data acquisition unit 10. The local data acquisition unit 10 comprises the sensor system 6 and a local data processing unit 8. As will be described in detail below, the sensor system 6 generates measurement data units by making measurements at the remote system 2 and the local data processing unit 8 processes the measurement data units and sends data derived from the measurement data units over a communications network.
[0037]
[0038] In the embodiment shown, each sensor system 6 comprises multiple sensor elements 6A-6D. Each sensor element 6A-6D may obtain a different item of measurement information, such as acceleration data relative to a different one of plural axes, or measurement information concerning characteristics of the environment around the remote system 2, such as temperature, humidity, or rainfall data. In an embodiment where the remote system 2 comprises an electrical infrastructure system such as an off grid solar battery, the sensor system 6 may comprise a power meter configured to measure one or more characteristics of an electrical output from the electrical infrastructure system. In an embodiment where the remote system 2 comprises a biological system such as a human or animal, the sensor system 6 may obtain one or more of the following: heart rate, respiratory rate, temperature, blood oxygenation, systolic blood pressure, diastolic blood pressure, electrocardiogram, blood glucose, temperature, blood constituent levels, pupil size, pain score, Glasgow coma score, and/or analyze a sample from the human or animal.
[0039]
[0040] In step S1, a plurality of measurement data units are obtained from a sensor system 6 at a remote system 2. Each measurement data unit comprises a time series of measurements made by the sensor system 6. The time series of measurements may be univariate (e.g. where the sensor system 6 comprises a single sensor element) or multivariate (e.g. where the sensor system 6 comprises multiple sensor elements as in the example of
[0041] In one specific embodiment, each local data acquisition unit 10 comprises a sensor system 6 comprising an IC-based, 96 Hz accelerometer, an 8-bit microprocessor, and a GSM modem. The estimated power consumption of this implementation of the local data acquisition unit 10 in a run and a sleep mode is compared in Table I.
TABLE-US-00001 TABLE I Estimated power consumption of components of an example local data acquisition unit 10 embedded at a remote system operating at 25° C. Component Run Mode Sleep Mode Microprocessor (8-bit at 8 MHz) 20 mA 5 mA Accelerometer at 96 Hz 200 μA 200 μA GSM Module (for SMS) 250 mA 40 μA
As indicated schematically in
[0042] In step S2, the measurement data units are pre-processed before being provided to the step S3 where the measurement data units will be processed by a first trained machine learning model.
[0043]
[0044] In this example, the pre-processing comprises a step S201 based on peak and trough detection. In an embodiment, the peak and trough detection is used to determine a period of a largest (e.g. largest amplitude) periodic component of the times series of measurements in each measurement data unit. The pre-processing then removes measurement data units in which a period of the largest periodic component is below a predetermined threshold period. In the specific case of hand-operated pump monitoring, for example, it has been found beneficial to remove measurement data units corresponding to time periods in which the period of the largest periodic component is less than 0.5 s. This eliminates contributions that are less likely to be informative about the state of the remote system, for example because the contributions correspond to children playing on a hand-operated pump rather than the pump being used in the intended way.
[0045] In an embodiment, the pre-processing further comprises a step S202 comprising applying a high pass filter to each measurement data unit. With wishing to be bound by theory, the inventors believe this filtering is beneficial because changes in the underlying condition of the remote system 2 being monitored (e.g. a hand-operated pump) are not affected to a large extent by the relatively low frequency motion imparted to the moving part 5 (e.g. the handle of the hand-operated pump) directly by manual interaction by the user. In the context of the hand-operated pump, the high-pass filtering thus removes low-frequency components associated with the manual pumping tempo that are not strongly indicative of deterioration of the pump, while retaining information about fast-moving components such as vibrations.
[0046] In an embodiment, the pre-processing further comprises a step S203 comprising applying windowing in the time series domain. This may be used due to resource limitations at the local data acquisition unit 10, such as an 8-bit microprocessor and limited battery. In an embodiment, a phase-corrected 4-point moving average (MA) finite impulse response (FIR) filter to represent the shape of the recording, which is then removed from the original signal. The filter calculates the average of a number of points from the input signal such that each point of the output signal, y, is calculated as follows:
where x is the input signal and M is the number of points used in the moving average.
[0047] In an embodiment, the pre-processing further comprises a step S204 comprising transforming the measurement data units to represent the time series of measurements in the frequency domain. In an embodiment, Fast-Fourier transforms (FFTs) are used to decompose the signal into a sum of sinusoidal basis functions used to describe the frequency content within the time-series waveform. In one particular implementation, the recorded measurement data units were partitioned into 1.3 s windows with 50% overlap. This creates 128 samples per window, equivalent to 64 frequency components with a resolution of 0.75 Hz per component for a sampling frequency of 96 Hz. To account for truncated waveforms with discontinuous endpoints resulting from the finite windows, a 128-point Hamming window function was applied. The final result after such FFT application is a feature vector with 64 frequency components per window, up to the Nyquist rate.
[0048] In an embodiment, the pre-processing further comprises a step S205 in which selected features from the frequency domain representation provided by step S204 are output from the pre-processing. In one particular implementation, a subset of 20 features was selected by uniformly sampling across frequency bins 3 to 60, discarding low frequency components, equivalent to 0 to 2.25 Hz, which represent the pumping motion of the user, where a full hand-operated pump stroke has a median period of 1.1 s. Artefacts of this pumping motion can be seen in
[0049] In step S3 of
[0050] In an embodiment, the first trained machine learning model estimates a probability of the remote system being in the abnormal state during a time period corresponding to each measurement data unit and the identification of the subset of measurement data units comprises identifying measurement data units corresponding to time periods in which the estimated probability is above a predetermined threshold.
[0051] In an embodiment, the first trained machine learning model uses logistic regression to estimate the probabilities. This is explained in detail below with reference to a specific example.
[0052] In step S4 of
[0053] In step S5 of
[0054] In an embodiment, the central data processing system 12 generates a web application to allow users to configure the system 20 (e.g. to adjust data transmission choices or protocols) and/or define user alert protocols. In an embodiment, the central data processing system 12 allows a user to tune either or both of the type and the size of data to be transmitted from the remote system 2 to the central data processing system 12. In an embodiment, the central data processing system 12 is configured to be capable of pro-actively requesting more data and/or more detailed data, such as requesting feature vectors or raw accelerometer data rather than novelty scores.
[0055] In an embodiment, the central data processing system 12 dynamically adjusts the predetermined threshold used for identifying the subset of measurement data units described above with reference to step S3 of
[0056] A detailed example specific to monitoring hand-operated pumps is now described.
[0057] Unlike in the context of patient-monitoring, there are no standardized labeling protocols for rural infrastructure conditions. Two attributes to classify a state of a hand-operated pump are introduced:
[0058] 1) Short-Term Water Quantity: a hand-operated pump is either classed as normal (C1) or abnormal (C0). A hand-operated pump is considered normal when water flows from the spout while pumping and abnormal when no water flows from the spout while pumping.
[0059] 2) Mechanical Performance: ten sub-categories, shown in Table II, are used to identify the mechanical attributes that describe the functionality and physical condition of the hand-operated pump. The data was labeled using notes collected during in-person, contemporaneous observations. This level of labeling is limited in that it allows for only two classes. Certain conditions, like those with average or low flow, are not entirely normal nor entirely abnormal. However, it is believed the proposed labels are adequately descriptive for the purposes of the present demonstration.
TABLE-US-00002 TABLE II Description of the mechanical condition and short-term water quantity classification labels assigned to each recording. Condition Description Water Flow Label Excellent working condition High flow 1 Noisy but working Average to Low flow 1 Dry borehole No flow 0 Rising main leak No flow 0 Broken U-seal No flow 0 Worn U-seal Average to low flow 1 Handpump body leaking Average to low flow 1 Worn bush bearing Average Flow 1 Stiff handle Average Flow 1 Other Average to low flow 1
[0060] In this example, vibrations of an operating Afridev hand-operated pump were measured via a retrofitted sensor system 6 comprising a consumer grade accelerometer with a sampling frequency of 96 Hz as a sensor element. Each sensor system 6 was housed in a waterproof casing and mounted with tamper-proof bolts inside the handle 4 of the pump at a position close to the pump body, as shown schematically in
[0061] Following 5 minutes of inactivity, the local data acquisition units 10 switch to a low power state to preserve battery life, restarting after 10 s of continuous motion. For a regularly used hand-operated pump 2, operating nearly constantly for 8 to 12 hours per day, this translates to about 1 gigabyte of data per hand-operated pump 2 per month. All of the hand-operated pumps 2 in the region managed in this example were located in areas with sufficient network coverage to transmit the data via the telecommunications network. However, to preserve battery and cost of data transmission, the data was stored locally on a micro-SD card and downloaded manually for the purposes of this demonstration.
[0062] Three data sets were collected from pumps at a site in Kwale, Kenya. The data sets contained high-frequency (96 Hz) three axes accelerometery readings from a local data acquisition units 10 mounted inside the handle 4 of the pumps, as described above. Data from the Y-axis was found to be the most informative. Thus, embodiments are preferably provided in which an accelerometer is attached to the handle and each measurement data unit comprises at least a measurement of acceleration parallel to the longitudinal axis of the handle 4 by the accelerometer.
[0063] A significant difference in the spectra of deep and shallow hand-operated pumps was observed, as depicted in
[0064] The first data set, D.sub.m, represents a general inter-hand-operated pump system consisting of twelve different hand-operated pumps 2 of varying operating depths ranging between 6 m to 53 m, and was included to establish the baseline performance of a general classifier. The second data set, D.sub.d,1, represents a deep operating inter-hand-operated pump system consisting of eight different hand-operated pumps 2 operating at depths between 33 m to 54 m. The third data set, D.sub.d,2, represents a deep-operating intra-hand-operated pump system of one hand-operated pump operating at 54 m. Although the implementation of a region-wide intra-hand-operated pump system is unfeasible, this data set was selected to investigate the influence of different failure types, while controlling for the hand-operated pump.
[0065] The data sets contained recordings from eight different common hand-operated pump failure types. All the data sets were balanced and randomly divided into a training-and-validation set (80%) and a test set (20%).
[0066] As a first layer of condition monitoring, the identification of the subset of measurement data units in step S3 of
[0067] In the present example, a logistic regression (LR) model was formulated using the sigmoidal hypothesis function, h(x.sub.n), with a probability that a given example is of class 1:
[0068] where w is a set of weights assigned to each input feature, x.sub.n. The decision threshold, T, is used to assign a given example to class 1 based on whether the hypothesis function is greater than or less than T. This threshold can be varied to change the size of the data subsets that was subsequently transmitted to the offline classifier (the second trained machine learning model). As the value of T is decreased, the size of the subset s increases, as more of the novelty scores are deemed abnormal.
[0069] The LR model was trained using 5-fold cross-validation (CV), where each training set, D.sub.t, was randomly subdivided into 5 equal subsets to construct 5 independent training-and-validation sets. The LR regularization parameter, λ, for each independent LR model was optimized by maximizing the area under receiver operator curve (AUROC) on the held-out validation sets.
[0070] In this example, the second trained machine learning model implementing the functionality of step S5 of
[0071] For comparison purposes in the present example, the second trained machine learning model was implemented using the LR model described above and tested using the novelty filtered data output from step S3 of
[0072] The second machine learning model was also implemented using an SVM classifier model. The SVM classifier model was trained using the radial basis function, exp(−γ∥x−x′∥.sup.2), to project the individual scores from the novelty filter where two classes may be linearly separable. The SVM classifier model was also trained using the 5-fold CV method, using different training and validation sets. The SVM hyperparameters: the kernel bandwidth, γ, and penalty cost factor, C, were optimized using grid search, where γ=2.sup.a for a ∈ [−10, −9, . . . 5] and C=2.sup.b for b ∈ [−5, −4, . . . 10], by maximizing the sum of all AUCs over all CV folds. The grid search was done independently for each CV fold. Once this was completed, we repeated the process to perform a fine grid search, where a.sub.opt ∈ [a.sub.opt−1, a.sub.opt−0.75, . . . a.sub.opt+1] and b.sub.opt ∈ [b.sub.opt−1, b.sub.opt−0.75, . . . b.sub.opt+1]. The refined hyperparameters, γ* and C*, from the fine grid search was used to train the SVM models from the training sets from each of the data sets.
[0073] The second machine learning model was also implemented using a Random Forest (RF). The RF classifier model was trained using a random selection of a subset of features, Θ.sub.k, and a random subset of the training data, D(t), to grow each decision tree, T At each node, t, of the tree, the split s.sub.t=s* to separate the input vector, X, was chosen to minimize the impurity, i(t), in class labels by minimizing the misclassification such that i.sub.E(t)=1−max{p.sub.c}, where p.sub.c is the probability of a class C. The importance of the variable input feature X for predicting the output is based on their weighted impact on decreasing the impurity of that node for all N.sub.T trees in the forest:
where v.sub.s is the variable used in split s.sub.t.
[0074] The RF hyperparameters: the number of threes, N.sub.T, the number of feature vectors in each decision tree, and the proportion of training data to be bootstrapped were again optimized using a grid search.
[0075] Following the analysis above, a condition score was produced in-situ at the remote system 2, Q.sub.n,i. Due to the lightweight processing requirement of the local data acquisition unit 10 used in this example, the temporal dependence of the accelerometer observations were not considered at the remote system 2. This was done during postprocessing by aggregating the classifier scores over consecutive examples to varying degrees by applying a moving average (MA) window and increasing the size of the window from 7 s to 27 s. This produced three lightweight condition scores, Q.sub.n,i, per data set with i=1 . . . 3 equivalent to [raw on-board score, 7 s MA window score, 27 s MA window score].
[0076] The in-situ condition score, Q.sub.n,1, was then used to filter the transmitted data such that data summaries sent to the central data processing system 12 contain only abnormal examples, as labeled (identified) by the on-board first trained machine learning model. Finally, condition scores were produced for each of the three offline classifier methods (LR, SVM, RF) using the novelty filtered data.
[0077] The ability of the above example implementation to verify CM reliability was assessed using the receiver operating characteristic (ROC) to compare the performance. This metric compares the actual and predicted outputs for each class. The true positive rate (TPR), or sensitivity, of a classifier is defined to be the probability of detection, such that
and the false positive rate (FPR), or fall-out, is defined to be the probability of a false alarm, such that
Optimizing the area under the ROC (AUC) will maximize hand-operated pump failure detection while simultaneously minimizing false alarms, which can be costly in real-life. In the ideal case, the classifier would be very sensitive (TPR=1) with no false alarms (FPR=0).
[0078] For the classification performed by the first trained machine learning model on the remote device 2, the performance of Q.sub.n,i, was compared to a baseline control score, Q.sub.n,lab, generated in the lab using the same original data but assuming no processing or power constraints as would be experienced on-board the local data acquisition unit 10.
Performance of First Trained Machine Learning Model (at Remote Device)
[0079] Table III shows that the intra-hand-operated pump classifier, {(Q.sub.n,i, D.sub.d,2} pairs, performs substantially better than the inter-hand-operated pump classifiers, {Q.sub.n,i, D.sub.m/D.sub.d,1} pairs, achieving up to 86.2 percent AUROC compared to 65.7 percent.
TABLE-US-00003 TABLE III Results for field-based, Q.sub.n, i, and lab-simulated, Q.sub.n, lab, on-board condition classification scores, given the mean AUC of 20 iterations (one standard deviation). D.sub.m D.sub.d, 1 D.sub.d, 2 Q.sub.n, 1 61.32 ± 0.7 61.55 ± 0.4 68.87 ± 0.3 Q.sub.n, 2 64.76 ± 0.4 65.08 ± 0.2 77.31 ± 0.2 Q.sub.n, 3 65.26 ± 0.1 65.67 ± 0.0 86.21 ± 0.0 Q.sub.n, lab 68.89 ± 0.3 69.80 ± 0.2 88.01 ± 0.2
[0080] However, the performance of the general inter-hand-operated pump classifier is sufficient to use as a lightweight novelty filter since the large scale implementation of pump-specific classifiers would be too costly and unrealistic to roll-out across entire region-wide rural water supply networks.
[0081] In all three cases, the lab generated scores, Q.sub.n,lab, outperform those generated by the on-pump classifier, Q.sub.n,i, by 7.5 to 12.1 percent. Given the limitations of the embedded system, it was expected that the accuracy of the on-pump classifiers would suffer compared to the lab-simulated results. Due to the lightweight processing requirement of the on-board classifier, the temporal dependence of the accelerometer observations have not been considered. However, post-processing of the ROC scores indicate that the classifier performance improves when temporal correlation is incorporated by aggregating the classifier scores over consecutive examples (to varying degrees as the moving average window size is increased 7 s to 27 s).
[0082] This type of post-processing is fairly lightweight and can be easily implemented on-board the hand-operated pump to improve on-pump novelty scores, which will bring it nearly on par with the lab-simulated results.
[0083]
[0084] The case studies shown in
Performance of Second Trained Machine Learning Model (at Central Data Processing System)
[0085] Table IV shows that in all three cases the LR classifier is sufficiently lightweight in that it reaches the optimum classification accuracy by using only 6 to 15 percent of the flagged data (the identified subset of measurement data units) from the first trained machine learning model, compared to 89 to 98 percent required by the RF classifier and 97 to 100 percent for the SVM. In all three cases, the RF classifier outperforms the LR and SVM classifiers.
[0086] As before, the LR classifier shows little difference in performance between a general, D.sub.m, or depth-specific, D.sub.d,1, inter-hand-operated pump data set. The RF classifier does marginally better for depth-specific, D.sub.d,1, data set. Both the LR and RF classifier performance improve.
TABLE-US-00004 TABLE IV Results for lab-based offline condition classification of on-pump processed data, given the mean AUC of 20 iterations (one standard deviation). D.sub.m D.sub.d, 1 D.sub.d, 2 Offline LR 59.6 ± 0.1 60.0 ± 0.0 79.4 ± 0.0 scores at RF 71.8 ± 2.5 70.9 ± 2.0 86.8 ± 1.4 100% data SVM 60.8 ± 0.2 65.2 ± 0.0 75.0 ± 0.0 Max LR 60.9 ± 1.8 60.7 ± 2.3 80.8 ± 2.7 offline at 8% at 6% at 15% scores RF 73.0 ± 1.7 72.1 ± 2.3 87.5 ± 1.5 at 89% at 98% at 91% SVM 60.8 ± 0.6 65.3 ± 0.5 75.0 ± 0 at 99% at 97% at 100%
[0087] Unlike the RF and LR classifiers, the SVM classifier performance benefits more from the depth-specific data set, D.sub.d,1, than the general data set, D.sub.m, since two-class SVMs are trained to have a low misclassification rate. The SVM classifier performance is likely to increase as we continue to collect more depth-specific data. This is suggested by the significant reduction in variance for the SVM classifier as the proportion of data is increased.
[0088]
[0089] Conversely, however, in all three cases the RF classifier achieves the highest overall accuracy, with relatively little data, and benefits minimally from more data both in improving prediction accuracy or decreasing prediction variance.
[0090] Overall, more heavyweight offline machine learning methods (as performed by the second trained machine learning model as opposed to the first trained machine learning model) offer a 10 percent improvement from the raw on-pump generated condition scores, Q.sub.n,i, in the above example. The three cases show that there is a trade-off between accuracy and specificity. Whilst the RF classifier may offer a higher overall prediction accuracy, both LR and SVM can dramatically reduce the variability in predictions as the proportion of data supplied is increased. This is an important trade-off to note since it may have a direct impact on operational cost.
Improving Real-Time Performance
[0091] To ensure the system is suitable for real-time implementation within the constraints of the limited resources, such as battery life and data transmission, two additional design factors were considered that facilitate leaner implementation with minimal effect on overall system performance.
[0092] For optimizing the run time cost model of operating such a large-scale distributed system, the impact was considered of potentially distributed run time plans and the machine learning characteristics of each classifier as a direct trade-off of its overall prediction accuracy.
[0093] 1) Prediction Run Time: Time implementation of complex, region-wide monitoring systems should aim to optimize machine learning approaches by being sensitive to memory use and parallelism.
[0094] 2) Number of Features:
[0095] In summary, existing systems for monitoring remote systems are not suitable for monitoring rural infrastructure that often operate in harsh environments and with constraints on data-transmission and battery life. In the present disclosure, an appropriate set of labels is described that can be used as the basis for monitoring the condition of hand-operated pumps (Table II). Embodiments are described in which low-cost, lightweight machine learning methods (the first trained machine learning models) are implemented on-board monitored pumps with minimum bandwidth and battery requirements to apply novelty filtering (see
[0096] The inventors found that distributed inference using logistic regression (LR) to implement the first machine learning model (on-board the remote system 2) followed by random forests (RF) to implement the second machine learning model (at the central data processing system 12) provided the best performing monitoring for rural infrastructure while optimizing limited resources. We found that the combination of LR and RF provides the optimal prediction run time and can both be successfully implemented with less than half the number of features to be transmitted from the remote system to the central data processing system.
[0097] The embodiments described above focus on application of the system to monitoring hand-operated pumps, but the same overall architecture can be applied to other rural infrastructure types, such as off-grid home solar systems or agriculture monitoring systems.