METHOD FOR PREDICTING FAULTS IN POWER PACK OF COMPLEX EQUIPMENT BASED ON A HYBRID PREDICTION MODEL

20220341996 · 2022-10-27

    Inventors

    Cpc classification

    International classification

    Abstract

    A method for predicting faults in power pack of complex equipment based on a hybrid prediction model is provided. The method includes steps of analyzing the typical faults of the power pack of complex equipment, extracting the core set of attributes therein, decomposing the time series of the power pack into a linear part and a non-linear part, using an Autoregressive Integrated Moving Average model to forecast the linear part, using an Artificial Neural Network model to forecast the residual obtained, and the predictions of the power pack are obtained by summing the predictions of the non-linear component with the linear component. The method further includes using the hybrid prediction model and the parallel parameters of the core attributes in combination with the upper and lower limits to obtain information on the operation status of the power pack.

    Claims

    1. A method for predicting faults in power pack of complex equipment based on a hybrid prediction model, the prediction method being directed at operational data under steady-state conditions of the complex equipment, wherein: the hybrid prediction model is a fault prediction model consisting of a combination of the Autoregressive Integrated Moving Average (ARIMA) model and Artificial Neural Network (ANN) model; the ARIMA model is used to forecast the time series with a linear variation pattern of power pack; the ANN model is used to forecast the time series with a non-linear pattern of variation of power pack; the hybrid prediction model integrating the predictions of time series with a linear pattern of variation of power pack and the predictions of time series with a non-linear pattern of variation of power pack, and using parallel parameters of the core attributes for condition monitoring; comprising steps of: decomposing the original time series of the power pack into a linear part and a nonlinear part, using the ARIMA model to forecast the linear part and obtain predictions, and the difference between the original time series of the power pack and the linear predictions is made to obtain the residual e(t) which containing the nonlinear change pattern; using the ANN model to forecast the e(t) and obtain predictions; the predictions of the power pack are obtained by summing the predictions of the non-linear component with the linear component; S1: analyzing the power pack failures and extracting the core set of attributes; S1.1: establishing an evaluation indicator system for the set of attributes contained in the power pack in the complex equipment; the complex equipment containing a power pack, a CPU board, a KZB board, an I/O board, an ADA board, an angular velocity sensor, a crosswind sensor and a tilt sensor; S1.2: using a rough set-based difference matrix to analyze the correlations between attributes and attributes approximation; S1.2.1: calculating the difference matrix M(T) based on the definition of the difference matrix; S1.2.2: calculating the difference function ƒM(T) based on the obtained difference matrix M(T); S1.3: obtaining the core set of attributes based on the minimum disjunction paradigm; according to the difference function ƒM(T), using the minimal disjunction paradigm to reduce the attributes and obtain the core set of attributes; S2: using ARIMA model to forecast the time series with linear variation pattern and obtain the residual which containing non-linear information; S2.1: differencing the original time series of the sampled power pack to obtain a smoothed time series; S2.2: ARIMA model identification; plotting the autocorrelation function and partial autocorrelation function plots of the smoothed time series; obtaining a sensory awareness of the autoregressive order n and moving average order m of the ARIMA model based on the autocorrelation function and partial autocorrelation function plots; obtaining the model order (n, m) computationally using the Akaike Information Criterion criterion and the Bayesian Information Criterion; S2.3: hyperparameter estimation:using the least squares method to estimate the hyperparameters of the ARIMA model; S2.4: ARIMA model validation:testing the residual and discerning whether the residual is a white noise time series, i.e. whether it satisfies a random normal distribution and is not autocorrelated; S2.5: using ARIMA model to forecast the time series with linear variation pattern; S2.6: differentiating the original time series of the power pack from the linear predictions to obtain the residual e(t) containing the non-linear variation pattern; S3: using the ANN model to forecast the nonlinear part and obtain predictions; S3.1: the core set of attributes will be used as input and the residual e(t) containing the non-linear pattern of variation obtained through the ARIMA model will be used as output to obtain the training and test sets; S3.2: data normalisation processing to prevent order-of-magnitude impacts; S3.3: establishing an ANN model, training and testing the model; S3.4: evaluating the performance of the ANN model; S3.5: using the ANN model to forecast the nonlinear part and obtain predictions e′(t); S4: obtaining the predictions for the linear and non-linear components using the ARIMA model and the ANN model respectively, and summing the predictions of the two components to obtain the predictions for the power pack; S4.1: using the ARIMA model alone to forecast the single parameter of the extracted core set of attributes and obtain the predictions, evaluating the prediction errors; S4.2: using the ANN model alone to forecast the single parameter of the extracted core set of attributes and obtain the predictions, evaluating the prediction errors; S4.3: using the hybrid prediction model to forecast the single parameter of the extracted core set of attributes and obtain the predictions, evaluating the prediction errors; S4.4: comparing the prediction errors of the three models, selecting the predictions of the hybrid prediction model as the final result; S5: using the parallel parameters of the core attributes combined with upper and lower limits to monitor the operating status of the power pack and obtain the status monitoring results; S5.1: calculating the upper and lower limits of the extracted set of core attributes; S5.2: using the ANN model to forecast the time series of the parallel parameters of the core attributes of the power pack and obtain the predictions of the parallel parameters of the core attributes, comparing the predictions with the upper and lower limits and evaluating prediction errors; S5.3: using the hybrid prediction model to forecast the time series of the parallel parameters of the core attributes of the power pack and obtain the predictions of the parallel parameters of the core attributes, comparing the predictions with the upper and lower limits and evaluating prediction errors; S5.4: obtaining the comparison results and confirming that using the hybrid prediction model and monitoring the parallel parameters of the core attributes in combination with the upper and lower limits can reduce the false alarm rate of the power pack effectively.

    Description

    DESCRIPTION OF DRAWINGS

    [0055] FIG. 1 shows an overall flow diagram of the fault prediction method of the present invention based on the hybrid prediction model;

    [0056] FIG. 2 shows a flow diagram of the ARIMA model fault prediction of the present invention;

    [0057] FIG. 3 shows a flowchart of the ANN model fault prediction of the present invention;

    [0058] FIG. 4 shows an overall flow diagram of the hybrid prediction model based on the present invention combined with the parallel parameters of the core attributes.

    DETAILED DESCRIPTION

    [0059] Detailed description of the present invention is described below in detail in combination with accompanying drawings and technical solutions.

    [0060] The present invention provides a method for predicting faults in power pack of complex equipment based on a hybrid prediction model, the prediction method being directed at operational data under steady-state conditions of the complex equipment. The hybrid prediction model is a fault prediction model consisting of a combination of the Autoregressive Integrated Moving Average (ARIMA) model and Artificial Neural Network (ANN) model.

    [0061] As shown in FIG. 1, the fault prediction method of the present invention comprises the following steps:

    [0062] S1: Analyzing the typical failure modes of complex equipment power pack and extracting the core attributes of the evaluation indicators of complex equipment power pack, and using the rough set based difference matrix to obtain the core attributes set, and dividing the time series X of the obtained core attributes into a linear part Lt and a non-linear part Nt.

    [0063] S2: Using the ARIMA model to forecast the linear part Lt and obtain the predictions L′t and its residual e(t) from the original data series, which implies the information of the non-linear time series.

    [0064] S3: Using the ANN model to forecast the residual e(t), which contains information about the non-linear time series, and obtain the predictions e′(t).

    [0065] S4: The predictions of the obtained linear and non-linear time series are summed to obtain the final predictions of the power pack, i.e., X′=e′(t)+L′t.

    [0066] S5: Using the hybrid prediction model and the parallel parameters of the core attributes combined with upper and lower limits warning to monitor the operating status of the power pack and obtain the status monitoring results;

    [0067] In this embodiment, analyzing typical failure modes of complex equipment power pack to obtain five scenarios, including: power supply pack state normal, ±15V power pack hidden state, power supply 26V01 hidden state, power supply 26V02 hidden state, main power supply 26V hidden state.

    [0068] In this embodiment, the present invention obtains the core set of attributes of the power pack, as follows, the difference matrix M. In this embodiment, the present invention obtains the core set of attributes of the power pack, the difference elements in the difference matrix M(T) are a set consisting of conditional attributes, and as there are lots of difference elements in this difference matrix, they are represented by letters k.sub.i for convenience of representation.

    TABLE-US-00001 U x.sub.1 x.sub.2 x.sub.3 x.sub.4 x.sub.5 x.sub.6 x.sub.7 x.sub.8 x.sub.9 x.sub.10 x.sub.11 x.sub.12 x.sub.13 x.sub.14 x.sub.15 x.sub.16 x.sub.17 x.sub.18 x.sub.19 x.sub.20 D x.sub.1 0 1 x.sub.2 0 0 1 x.sub.3 0 0 0 1 x.sub.4 0 0 0 0 1 x.sub.5 k.sub.1 k.sub.14 k.sub.23 k.sub.39 0 2 x.sub.6 k.sub.2 Ø k.sub.24 Ø 0 0 2 x.sub.7 k.sub.3 Ø k.sub.25 Ø 0 0 0 2 x.sub.8 k.sub.4 k.sub.15 k.sub.26 k.sub.40 0 0 0 0 2 x.sub.9 k.sub.5 k.sub.16 k.sub.27 k.sub.41 k.sub.48 k.sub.60 k.sub.67 k.sub.74 0 3 x.sub.10 k.sub.6 Ø k.sub.28 Ø k.sub.49 Ø Ø k.sub.75 0 0 3 x.sub.11 k.sub.7 Ø k.sub.29 Ø k.sub.50 Ø Ø k.sub.76 0 0 0 3 x.sub.12 Ø k.sub.17 k.sub.30 k.sub.42 k.sub.51 k.sub.61 k.sub.68 k.sub.77 0 0 0 0 3 x.sub.13 k.sub.8 k.sub.18 k.sub.31 k.sub.43 k.sub.52 k.sub.62 k.sub.69 k.sub.78 k.sub.85 k.sub.92 k.sub.97 k.sub.102 0 4 x.sub.14 k.sub.9 k.sub.19 k.sub.32 k.sub.44 k.sub.53 k.sub.63 k.sub.70 Ø k.sub.86 k.sub.93 k.sub.98 k.sub.103 0 0 4 x.sub.15 .sup. k.sub.10 k.sub.20 k.sub.33 k.sub.45 k.sub.54 k.sub.64 k.sub.71 k.sub.79 Ø k.sub.94 k.sub.99 k.sub.104 0 0 0 4 x.sub.16 .sup. k.sub.11 Ø k.sub.34 Ø k.sub.55 Ø Ø k.sub.80 k.sub.87 Ø Ø k.sub.105 0 0 0 0 4 x.sub.17 Ø Ø k.sub.35 Ø k.sub.56 Ø Ø k.sub.81 k.sub.88 Ø Ø k.sub.106 k.sub.108 k.sub.112 k.sub.116 Ø 0 5 x.sub.18 .sup. k.sub.12 k.sub.21 k.sub.36 k.sub.46 k.sub.57 k.sub.65 k.sub.72 k.sub.82 k.sub.89 k.sub.95 .sup. k.sub.100 Ø k.sub.109 k.sub.113 k.sub.117 k.sub.120 0 0 5 x.sub.19 .sup. k.sub.13 Ø k.sub.37 Ø k.sub.58 Ø Ø k.sub.83 k.sub.90 Ø k.sub.93 k.sub.107 k.sub.110 k.sub.114 k.sub.118 Ø 0 0 0 5 x.sub.20 Ø k.sub.22 k.sub.38 k.sub.47 k.sub.59 k.sub.66 k.sub.73 k.sub.84 k.sub.91 k.sub.96 k.sub.94 Ø k.sub.111 k.sub.115 k.sub.119 k.sub.121 0 0 0 0 5

    [0069] When the values of the decision attributes are different:

    [0070] (1) The first case is that, firstly, the conditional attributes that make x.sub.i and x.sub.j (j≠j) obtain different values, which constitute the difference elements m.sub.ij. The meaning is that in this set of conditional attributes, any one of the conditional attributes can distinguish x.sub.i from x.sub.j, so take one of them, and the relation is called the disjunction relation “V”, taking x.sub.1 and x.sub.5 as an example, the conditional attributes that distinguish x.sub.1 from x.sub.5 are c.sub.5, c.sub.6, c.sub.7, c.sub.8, and any one of them can distinguish x.sub.1 from x.sub.5, so take one of them, and it is called the disjunction relation, as c.sub.5Vc.sub.6Vc.sub.7Vc.sub.8; secondly, the only one that can distinguish x.sub.1 from x.sub.6 is c.sub.5.

    [0071] Then the elements that can distinguish (x.sub.1, x.sub.5 and x.sub.6) are c.sub.5 and (c.sub.5Vc.sub.6Vc.sub.7Vc.sub.8) simultaneously. This logical relation is called conjunction and it is written as: c.sub.5 ∧(c.sub.5Vc.sub.6Vc.sub.7Vc.sub.8).

    [0072] 2) The other opposite case is where the unconditional attributes distinguish between x.sub.i and x.sub.j taking values, in which case it is the empty set.

    [0073] Two cases can be disregarded when the decision attributes are the same.

    [0074] 3) The first case is where the elements on the main diagonal of the difference matrix are equal, i.e., U.sub.i=U.sub.j.

    [0075] 4) The other case is where the conditional attributes do not have the ability to make the decision attributes distinguishable regardless of whether they take the same value or not.

    [0076] In this embodiment, the difference matrix for both cases in steps 3) and 4) is the empty set Ø instead of 0.

    [0077] In this embodiment, the conditional attributes C consists of 13 evaluation indicators of the power pack, i.e. C=c.sub.i=1, 2, . . . , 13), and x.sub.i and x.sub.j are sampled historical data of the complex equipment power pack. The decision attribute D is a typical failure modes set of the complex equipment power pack and containing five modes: power supply state normal set to 1, ±15V power supply hidden state set to 2, power supply 26V01 hidden state set to 3, power supply 26V02 hidden state set to 3 and main power supply 26V hidden state set to 5, i.e. D=(1,2,3,4,5).

    [0078] Thus the conditional attributes that separate all individuals x.sub.i and x.sub.j should satisfy the “conjunction” of the differential elements of all columns, and the conjunction of all differential elements also determine ƒM(T).

    [0079] The specific k.sub.i representation elements are as follows.

    [0080] k.sub.1=k.sub.23=k.sub.24˜k.sub.29=k.sub.32˜k.sub.35=k.sub.37=k.sub.51=k.sub.53=k.sub.57=k.sub.58=k.sub.59=c.sub.5, c.sub.6, c.sub.7, c.sub.8;

    [0081] k.sub.2=k.sub.3=k.sub.4=k.sub.5=k.sub.6=k.sub.7=k.sub.9=k.sub.11 k.sub.12 k.sub.11=k.sub.13=k.sub.15 k.sub.17=k.sub.19=k.sub.21=k.sub.22=k.sub.40=k.sub.42=k.sub.44=k.sub.46=k.sub.47=k.sub.61=k.sub.63 k.sub.65 k.sub.66=k.sub.68=k.sub.70=k.sub.72=k.sub.73=k.sub.75=k.sub.76=k.sub.77=k.sub.80=k.sub.81=k.sub.82=k.sub.83=k.sub.84=k.sub.93=k.sub.95=k.sub.96=k.sub.98=k.sub.99=k.sub.100=k.sub.101=k.sub.102=k.sub.103=k.sub.105=k.sub.106=k.sub.107=k.sub.112=k.sub.113=k.sub.114=k.sub.115=k.sub.120=k.sub.121=c.sub.5;

    k.sub.16=k.sub.20=k.sub.41=k.sub.45=k.sub.60=k.sub.64=k.sub.67=k.sub.71=k.sub.87=k.sub.88=k.sub.90=k.sub.94=k.sub.99=k.sub.116=k.sub.118=c.sub.8;

    [0082] k.sub.5=k.sub.10=k.sub.31=k.sub.74=k.sub.79=k.sub.86=k.sub.89=k.sub.91=k.sub.119=c.sub.5, c.sub.8;

    [0083] k.sub.8=k.sub.78=k.sub.109=k.sub.111=c.sub.5, c.sub.6, c.sub.7;

    [0084] k.sub.43=k.sub.54=k.sub.62=k.sub.69=k.sub.92=k.sub.97=k.sub.102=k.sub.111=c.sub.6, c.sub.7;

    [0085] k.sub.14=k.sub.30=k.sub.38,39=k.sub.48,50=k.sub.52=k.sub.55,56=k.sub.58=k.sub.85=c.sub.6, c.sub.7, c.sub.8;

    [0086] As shown in FIG. 2, using ARIMA model to forecast the linearly varying time series, it includes the following steps:

    [0087] S1: smoothing the original time series of the sampled power pack.

    [0088] In the embodiment, in step S1, smoothing the original time series of the sampled power pack, specifically, the first order difference is used to smooth the unsteady time series data acquired due to running in a complex environment in the field and the processed time series need to pass the ADF test and the KPSS test.

    [0089] S2: ARIMA model identification:

    [0090] Plotting the autocorrelation function and partial autocorrelation function plots of the smoothed time series; obtaining a sensory awareness of the autoregressive order n and moving average order m of the ARIMA model based on the autocorrelation function and partial autocorrelation function plots; obtaining the model order (n, m) computationally using the Akaike Information Criterion and the Bayesian Information Criterion.

    [0091] S3: Hyperarameter estimation: using the least squares method to estimate the hyperarameter of the ARIMA model.

    [0092] S4: Model validation: testing the residual and discerning whether the residual is a white noise time series, i.e. whether it satisfies a random normal distribution and is not autocorrelated.

    [0093] In this embodiment, in step S4, the purpose of testing the residual is to ensure that the order of the model is appropriate, the residual is the difference between the original time series and the time series fitted by the model, it includes the following steps:

    [0094] 1) In the graph of the results of the residual test. The purpose of standardising the residual is to see if the residual is close to a normal distribution, and ideally the residual should be close to a normal distribution.

    [0095] 2) The autocorrelation and partial autocorrelation of the residual is tested based on the autocorrelation function (ACF) plot and the partial autocorrelation function (PACF) plot, generally, there are no points outside the boundaries.

    [0096] 3) To test whether the residual is close to a normal distribution, ideally, the input sample quantile should be close to the standard normal quantile.

    [0097] S5: Using ARIMA model to forecast the time series with linear variation pattern.

    [0098] In the embodiment, in step S5, using the ARIMA model that has been determined to forecast the time series with linear variation pattern, and the ratio of the amount of data using training data to test data is 3:1;

    [0099] S6: Differentiating the original time series of the power pack from the linear predictions to obtain the residual e(t) containing the non-linear variation pattern;

    [0100] In the embodiment, in step S6, the obtained residual e(t) containing the non-linear variation pattern e(t)=ƒ(e(t−1), e(t−2), . . . e(t-n))+a(t);

    [0101] The above a(t) is the random error.

    [0102] As shown in FIG. 3, using the ANN model to forecast the time series with non-linear variation pattern and obtain predictions, it includes the following steps.

    [0103] S1: The core set of attributes is used as input and the residual e(t), which contains the non-linear pattern of variation, obtained using the ARIMA model, is used as output to obtain the training and test sets.

    [0104] In the embodiment, in step S1: the ratio of the amount of data using training data to test data is 3:1.

    [0105] S2: Data normalisation processing to prevent order-of-magnitude impacts.

    [0106] S3: Establishing an ANN model, training and testing the model.

    [0107] In the embodiment, in step S3: establishing the ANN model, which includes the following steps.

    [0108] 1) Establishing a neural network with three inputs, three outputs and four hidden layers.

    [0109] 2) Setting the number of iterations of the model to 1000, the training target=le-6, and the learning rate=0.01.

    [0110] 3) Training the network, conducting simulation tests with the trained ANN model and renormalization of the predicted data.

    [0111] S4: Evaluating the performance of the ANN model.

    [0112] In the embodiment, in step S4, evaluating model performance, the evaluation indicators contain: mean absolute error, mean square error and mean absolute percentage error.

    [0113] Mean absolute error is the average of the absolute values of the deviations of all individual observations from the arithmetic mean, which avoids the problem of errors cancelling each other out and therefore accurately reflects the magnitude of the actual prediction error.

    [0114] mean squared error is the mathematical expectation of the square of the difference between one estimate of the overall parameter determined from a subsample, reflecting a measure of the degree of difference between the estimate and the estimated quantity, and can also be obtained as a standard error, again to measure the deviation of the observation from the true value.

    [0115] mean absolute percentage error is the value of the average percentage deviation of the predicted outcome from the true outcome, which is a percentage value and therefore easier to understand than other statistics.

    [0116] S5: Using the ANN model to forecast the nonlinear part and obtain predictions e′(t).

    [0117] As shown in FIG. 4, using the parallel parameters of the core attributes combined with upper and lower limits warning to monitor the operating status of the power pack and obtain the status monitoring results, it includes the following steps:

    [0118] S1: Analyzing the typical failure modes and extracting the core attributes, and extracting the core attributes set based on the attribute reduction method of the rough set based difference matrix set.

    [0119] In the embodiment, in step S1, an attribute reduction method based on the rough set based difference matrix, it includes the following steps:

    [0120] 1) Calculating the difference matrix M(T) based on the definition of the difference matrix.

    [0121] 2) Calculating the difference function ƒM(T) based on the obtained difference matrix M(T).

    [0122] 3) Based on ƒM(T) in (2), using the minimal disjunction paradigm to reduce the attributes and obtain the core set of attributes.

    [0123] S2: Calculating the upper and lower limits of the core set of attributes extracted by the attribute reduction method of the rough set based difference matrix, evaluating the prediction errors.

    [0124] 1) The upper and lower limits of the core set of attributes are as following: upper limit=standard value+one tenth of the standard value and lower limit=standard value−one tenth of the standard value.

    [0125] 2) The evaluation indicators contain: mean absolute error, mean square error and mean absolute percentage error.

    [0126] S3: Using the ANN model to forecast the time series of the parallel parameters of the core attributes of the power pack and obtain the predictions of the parallel parameters of the core attributes, comparing the predictions with the upper and lower limits and evaluating prediction errors.

    [0127] In the embodiment, using the ANN model alone to forecast the time series of the parallel parameters of the core attributes of the power pack and obtain the predictions of the parallel parameters of the core attributes, comparing the predictions with the upper and lower limits and evaluating prediction errors, it includes the following steps.

    [0128] 1) Using the ANN model that has been trained to forecast the parallel parameters of the core attributes that have been extracted.

    [0129] 2) Calculating the upper and lower limits of the core set of attributes, the effective values=standard value±one tenth of the standard value, and exceeding the effective values means exceeding the limits.

    [0130] 3) Using the ANN model that has been trained to forecast the single parameter of the core attributes that has been extracted.

    [0131] 4) Comparing the predictions of the single parameter of the core attributes with the effective values to obtain an early warning indication of whether the predictions are out of bounds.

    [0132] 5) Finding that using the single parameter of the core attributes to monitor condition gave an early warning signal, but not for condition monitoring using the parallel parameters of the core attributes.

    [0133] 6) Evaluating the prediction errors.

    [0134] S4: Using the hybrid prediction model ARIMA-ANN to forecast the time series of the parallel parameters of the core attributes of the power pack and obtain the predictions of the parallel parameters of the core attributes, comparing the predictions with the upper and lower limits and evaluating prediction errors.

    [0135] In the embodiment, using the ARIMA-ANN model to forecast the time series of the parallel parameters of the core attributes of the power pack and obtain the predictions of the parallel parameters of the core attributes, comparing the predictions with the upper and lower limits and evaluating prediction errors, it includes the following steps.

    [0136] 1) Using the ARIMA-ANN model that has been trained to forecast the parallel parameters of the core attributes that have been extracted.

    [0137] 2) Calculating the upper and lower limits of the core set of attributes, the effective values=standard value±one tenth of the standard value, and exceeding the effective values means exceeding the limits.

    [0138] 3) Using the ARIMA-ANN model that has been trained to forecast the single parameter of the core attributes that has been extracted.

    [0139] 4) Comparing the predictions of the single parameter of the core attributes with the effective values to obtain an early warning indication of whether the predictions are out of bounds.

    [0140] 5) Finding that using the single parameter of the core attributes to monitor condition gave an early warning signal, but not for condition monitoring using the parallel parameters of the core attributes.

    [0141] 6) Evaluating the prediction errors.

    [0142] S5: By comparing the error evaluation indicators of the ANN model alone with those of the hybrid prediction model ARIMA-ANN, it can be obtained that the prediction accuracy of the hybrid prediction model is higher than that of the single model; by comparing the condition monitoring results of the single parameter of the core attributes with those of the parallel parameters of the core attributes, it can be obtained that the condition monitoring method using the parallel parameters of the core attributes can significantly reduce the false alarm rate of the power pack.