Method for prediction of key performance parameter of an aero-engine transition state acceleration process based on space reconstruction

Abstract

A method for prediction of key performance parameters of an aero-engine transition state acceleration process based on space reconstruction. Aero-engine transition state acceleration process test data provided by a research institute is used for establishing a training dataset and a testing dataset; dimension increase is conducted on the datasets based on the data space reconstruction of an auto-encoder; model parameters optimization is conducted by population optimization algorithms which is represented by particle swarm algorithm; and random forest regression algorithm performing well on high-dimensional data is used for carrying out regression on transition state performance parameters, which realizes effective real-time prediction from the perspective of engineering application.

Claims

1. A method for prediction of key performance parameters of an aero-engine transition state acceleration process based on space reconstruction, comprising the following steps: step 1: preprocessing aero-engine test data (1) aero-engine transition state acceleration process test data comprises 10 kinds of parameters: compressor inlet relative speed PNNC2.sub.g, engine inlet temperature T.sub.2, engine inlet pressure P.sub.2, compressor outlet total pressure P.sub.3, fuel flow WFB, fan physical speed N.sub.f, compressor physical speed N.sub.c, exhaust gas temperature T.sub.5, simulated altitude H and simulated Mach Ma, which are taken as one sample; (2) data storage and reading: the aero-engine transition state acceleration process test data comprises data collected at multiple aero-engine commissioning process sites; combining the data collected at multiple aero-engine commissioning process sites for the aero-engine acceleration process and storing the collected data uniformly, then establishing an aero-engine performance parameter test database; (3) linear resampling: analyzing aero-engine transition state acceleration process test data; sampling different time intervals, a linear resampling method is adopted to resample the aero-engine transition state acceleration process test data to make sampling frequencies of signal identical; (4) data screening and cleaning: conducting visualization processing on the linearly resampled aero-engine transition state acceleration process test data, and conducting cleaning on acceleration curves which obviously do not meet objective conditions; step 2: conducting Random Forest regression model parameters selection the Random Forest regression model has two key parameters: ntree which is the number of Regression Trees in the Random Forest regression model; and mtry which is the feature number of Regression Trees in the Random Forest regression model, includes the number of branches of each Regression Tree; two-dimensional grid search is selected for ergodic calculation of the parameters ntree and mtry, and Mean Square Error (MSE) is selected for a fitness function; and an optimization range of the two-dimensional grid search is determined by the following principles: (1) the optimization range of ntree is determined by the out-of-bag (OOB) error rate, wherein the out-of-bag error rate is an error rate caused by the Forest regression model of data which is not selected as training sample for a single Decision Tree at a time; variation curves of OOB calculated for compressor physical speed N.sub.c and engine exhaust gas temperature EGT as predicted test data feature parameters as well as the parameter ntree are obtained; and therefore the optimization range of ntree is determined to be 50˜500; (2) the optimization range of mtry is determined to be from the natural number 1 to the total feature number of the test data; the Random Forest regression model parameters selection is finally determined to be ntree=300 and mtry=D/3 by the grid optimization algorithm, wherein D is the number of input variables of the Random Forest regression model; step 3: establishing a training database by using a sparse auto-encoder after determining the parameters of the Random Forest regression model, determining related parameters of Sparse Auto-Encoder (SAE) by using an Sparse Auto-Encoder Random Forest (SAE-RF) hybrid model; establishing an input vector of the SAE-RF hybrid model by using SAE with the structure of 10-dim-10; and optimizing the parameters of SAE by a dispersed-continuous hybrid particle swarm algorithm, wherein parameters of SAE comprise learning rate a and reconstructed dimension dim; using the particle swarm algorithm for optimization as follows: in two-dimensional parameter searching space, there is a population X=(X.sub.1, X.sub.2, . . . , X.sub.n) composed of n parameter combinations, wherein the position of the k.sup.th parameter combination in the parameter searching space is expressed as a two-dimensional vector X.sub.k=(x.sub.k1,x.sub.k2); assuming the k.sup.th parameter combination has the velocity V.sub.k=(V.sub.k1,V.sub.k2).sup.T in the parameter searching space, the local best parameter thereof is P.sub.k=(P.sub.k1,P.sub.k2).sup.T, and the global best parameter of the parameter combination is P.sub.g=(P.sub.g1,P.sub.g2).sup.T; and in each iteration, the iterative formulas of the velocity and the position of the parameter combination are expressed as:
V.sub.k.sup.t+1=wV.sub.k.sup.t+c.sub.1r.sub.1(P.sub.k.sup.t−X.sub.k.sup.t)+c.sub.2r.sub.2(P.sub.g.sup.t−X.sub.k.sup.t)
X.sub.k.sup.t+1=X.sub.k.sup.t+.sub.k.sup.t+1 where, w is inertia weight, t is the current number of iterations, r.sub.1,r.sub.2 are random numbers with uniform distribution in [0,1], and c.sub.1,c.sub.2 are learning factor constants; a K-fold cross-validation method is used for the estimation of generalization ability in parameters selection, and the specific steps of optimizing the SAE-RF hybrid model parameters based on the dispersed-continuous hybrid particle swarm algorithm are as follows: (1) randomly producing a group of {α, dim} as the initial position of particles, and determining inertia weight and learning factor; (2) evenly splitting a training sample into k mutually exclusive subsets S.sub.1, S.sub.2, . . . , S.sub.k; (3) taking the value of the initial position of the population as a parameter to train the SAE-RF hybrid model, and calculating the average value of k accuracies, which is the accuracy of the K-fold cross-validation; (4) taking the accuracy of the K-fold cross-validation as the fitness of the particle swarm algorithm, calculating the local best position and the global best position of the population, and iterating and updating the position and velocity; (5) repeating the step (2) until the fitness requirements are met or the maximum number of iterations is reached; (6) completing the parameter optimization, and taking the result as the parameter of the final SAE-RF hybrid model; step 4: building an SAE-RF regression model, predicting the aero-engine test data, and evaluating the prediction effect normalizing the data sample by a maximum value method to avoid the SAE-RF model error caused by magnitude difference; and normalizing the features of the aero-engine transition state acceleration process test data after sparse representation into the interval [1,2] according to the following formula: $x_{k} = \frac{x_{k} - x_{\min}}{x_{\max} - x_{\min}} + 1$ conducting regression prediction on the compressor physical speed N.sub.c and engine exhaust gas temperature EGT by using the aero-engine transition state acceleration process test data after dimension reconstruction, and calculating the response evaluating indicators; evaluating indicators comprise: (1) relative error RE formula of the relative error is: $RE = \frac{.Math. \hat{y_{i}} - y_{i} .Math.}{y_{i}}, i = 1, 2 .Math. N$ where, ŷ.sub..Math. is the predicted value of the sample at the i.sup.th moment, y.sub.i is the observed value of the sample at the i.sup.th moment, and N is the length of the sample; (2) Mean Square Error MSE mean square error is a measure reflecting the difference between the predicted sequence and the observed sequence, and is calculated according to the following formula: $MSE = \frac{1}{N} {.Math.}_{i = 1}^{N} {(y_{i} - \hat{y_{i}})}^{2}$ where, ŷ.sub..Math. is the predicted value of the sample at the i.sup.th moment, y.sub.i is the observed value of the sample at the i.sup.th moment, and N is the length of the sample.

Description

DESCRIPTION OF DRAWINGS

(1) FIG. 1 is a flow chart of establishing an aero-engine transition state acceleration process key performance parameter prediction model.

(2) FIG. 2 is a curve graph showing a relationship between the out-of-bag error rate and the number of Decision Trees of the model.

(3) FIG. 3 is a diagram of a best parameter result of a Random Forest regression model for grid search.

(4) FIG. 4 is a curve graph showing a parameter optimization process of a particle swarm algorithm, wherein the upper part is a diagram of the optimized result of the particle swarm algorithm predicting compressor physical speed, and the lower part is a diagram of the optimized result of the particle swarm algorithm predicting exhaust gas temperature.

(5) FIG. 5 shows predicted curves and observed curves of 10 groups of samples, wherein the upper part is a diagram of predicted results of compressor physical speed, and the lower part is a diagram of predicted results of exhaust gas temperature.

(6) FIG. 6 is a schematic diagram of degree of deviation of predicted values from observed values of 10 groups of samples, wherein the upper part is a schematic diagram of degree of deviation of predicted values of compressor physical speed, and the lower part is a schematic diagram of degree of deviation of predicted values of exhaust gas temperature.

(7) FIG. 7 is a distribution diagram of relative errors of 10 groups of samples, wherein the upper part is a distribution diagram of relative errors of compressor physical speed prediction, and the lower part is a distribution diagram of relative errors of exhaust gas temperature prediction.

(8) FIG. 8 is a distribution diagram of mean square errors of predicted sequence and observed sequence of 10 groups of samples, wherein the upper part is a diagram of mean square errors of compressor physical speed prediction, and the lower part is a diagram of mean square errors of exhaust gas temperature prediction.

DETAILED DESCRIPTION

(9) Specific embodiment of the present invention is further described below in combination with accompanying drawings and the technical solution.

(10) The data used in the present invention is 100 groups of transition state acceleration process bench test data of a certain type of aero-engine, which are provided by a domestic research institute.

(11) Step 1: preprocessing aero-engine test data;

(12) (1) Aero-engine test data comprises 10 groups of parameters: compressor inlet relative speed PNNC2.sub.g, engine inlet temperature T.sub.2, engine inlet pressure P.sub.2, compressor outlet total pressure P.sub.3, fuel flow WFB, fan physical speed N.sub.f, compressor physical speed N.sub.c, exhaust gas temperature T.sub.5, simulated altitude H and simulated Mach Ma;

(13) (2) Data integration: reading, integrating and storing txt files of 100 groups of data, and establishing an aero-engine test database.

(14) (3) Resampling: resampling the data first due to different sampling intervals. The specific steps are as follows: inserting the proposed new sampling frequency as an interpolation into the time series of the original data by using an interpolation method, and counting the number of original data between nominal sampling points. If only one original data is included, taking the original data as the data corresponding to the sampling point; if two original data are included, calculating the average value of the two original data, and taking the average value as the data corresponding to the time point; and if no original data is included, taking the average value of the data corresponding to the previous time point and the next time point of the time point in the nominal time series as the data of the time point.

(15) (4) Data screening and cleaning: conducting visualization processing on the data in order to conduct simple clustering and cleaning on acceleration curves.

(16) Step 2: conducting Random Forest regression model parameters selection;

(17) In the present invention, the parameter optimization range based on grid research is determined according to FIG. 2, finally the optimization range of 50 to 500 and the optimization step length of 10 for ntree are selected; selecting the optimization range of 1 to D and the optimization step length of 1 for mtry, wherein D is the dimension of the input vector of the model, i.e. the dimension of the test data samples. Calculating the value of the fitness function by the 3-fold cross-validation method, splitting 90 groups of test data samples into three parts, and conducting three prediction tasks on each part.
m.sub.k=saerftrain(x.sup.tr.sup.k,y.sup.tr.sup.k,ntree,mtry)k=1,2,3

(18) Taking the average value of mean square errors of the three tasks as the value of the fitness function corresponding to this group of parameters:

(19) ${mse}_{k} = \frac{1}{N} {.Math.}_{i = 1}^{N} {(saerfregression (x^{{te}_{k}}, m_{k}, ntree, mtry) - y^{{te}_{k}})}^{2}$ $k = 1, 2, 3$ $MSE = ({mse}_{1} + {mse}_{2} + {mse}_{3}) / 3$

(20) The final optimization result is shown in FIG. 3. Considering the influence factors such as time cost and calculated amount, the final parameter selection results are ntree=300, mtry=D/3, wherein D is the dimension of the input vector of the model.

(21) Step 3: establishing a training database by using a sparse auto-encoder;

(22) The important parameters of the sparse auto-encoder in the present invention comprise learning rate α and reconstructed dimension dim, wherein α is a continuous value and dim is a dispersed integer value, so a dispersed-continuous hybrid particle swarm algorithm is used for optimization in two dimensions of parameters, and the 3-fold cross-validation method is also used. Setting the number of groups in the initial population to 10 and the maximum number of iterations to 50, and randomly setting the initial position of particles to {[0,1],{1,2 . . . 50}}. Attention is needed to limit dim to not lower than 1 or larger than 50 during random setting of the initial velocity of particles. Setting the inertia weight to

(23) $\frac{1}{2 \ln 2}$
and the learning factor to c.sub.1=c.sub.2=0.5+ln2.

(24) The parameter optimization result of SAE is shown in FIG. 4. The local best solution at the number of iterations when the minimum value of the fitness function is reached is selected as the parameter result: setting the parameters to dim=46, α=0.7060 during compressor physical speed prediction; and setting the parameters to dim=20, α=1.7428 during exhaust gas temperature prediction.

(25) Step 4: building an SAE-RF regression model, predicting the aero-engine test data, and evaluating the prediction effect.

(26) Considering the large magnitude difference of the reconstructed aero-engine test data, conducting normalization processing by the maximum value method to increase the convergence rate and avoid reduction in the prediction accuracy caused by the magnitude difference. Normalizing the features of the test data after sparse representation into the interval [1,2] according to the following formula:

(27) $x_{k} = \frac{x_{k} - x_{\min}}{x_{\max} - x_{\min}} + 1$

(28) Among 100 groups of test data provided by a domestic research institute and used in the present invention, taking 90 groups as training data and the remaining 10 groups as predicted data to respectively complete prediction tasks for compressor physical speed and exhaust gas temperature in the transition state acceleration process key parameters, calculating the relative error distribution and mean square error of 10 groups of test data, and evaluating the prediction effect of the model.

(29) As shown in FIG. 5 and FIG. 6, curves of the predicted values of the model and the observed values of the test of compressor physical speed nearly coincide, and the predicted values have almost no deviation from the observed values; and for the curves of the predicted values of the model and the observed values of the test of exhaust gas temperature, the effect is slightly poorer than the prediction effect of the compressor physical speed, and the predicted values have a large degree of deviation from the observed values at the start of the sample test. The reason is that the measurable test data features are closely related to the compressor physical speed, but the thermal part of the aero-engine gas path has complex structure, the thermodynamic relationship is difficult to describe, and the operating conditions of the sensor are harsh. FIG. 7 shows the distribution of relative errors at each observation time point of 10 groups of predicted samples. As shown in FIG. 8, even if the prediction effect of exhaust gas temperature is slightly poorer than that of compressor physical speed, the mean square errors of the 10 groups of predicted data samples can be completely controlled below the requirement of aero-engine key performance parameter prediction software.

(30) TABLE-US-00001 TABLE 1 Mean Square Errors of Predicted Samples Mean square error Mean square error Sample of compressor of exhaust gas number physical speed temperature 91 4.4756 × 10.sup.−5 1.5095 × 10.sup.−2 92 6.2384 × 10.sup.−5 1.2713 × 10.sup.−2 93 7.0633 × 10.sup.−5 2.6651 × 10.sup.−2 94 1.3170 × 10.sup.−4 2.4940 × 10.sup.−2 95 9.3826 × 10.sup.−5 1.9236 × 10.sup.−2 96 1.2296 × 10.sup.−4 1.6477 × 10.sup.−2 97 7.8499 × 10.sup.−5 2.8127 × 10.sup.−2 98 7.9962 × 10.sup.−5 2.3424 × 10.sup.−2 99 7.8157 × 10.sup.−5 1.2246 × 10.sup.−2 100 8.3156 × 10.sup.−5 1.8875 × 10.sup.−2

(31) In conclusion, after the sparse auto-encoder based on the particle swarm algorithm optimization conducting dimension reconstruction on the aero-engine transition state acceleration process test data, the accuracy of predicting the key parameters such as compressor physical speed and exhaust gas temperature by the Random Forest regression algorithm can reach the desired effect. Therefore the present invention can be used in the fields of state prediction and fault diagnosis of an aero-engine.

Method for prediction of key performance parameter of an aero-engine transition state acceleration process based on space reconstruction

Assignee

Inventors

Cpc classification

Classification Explorer

G06F18/214

PHYSICS

Classification Explorer

G06F2111/08

PHYSICS

Classification Explorer

G06F18/217

PHYSICS

Classification Explorer

G06F2218/12

PHYSICS

Classification Explorer

G06F30/27

PHYSICS

Classification Explorer

G06N20/10

PHYSICS

Classification Explorer

G06N3/082

PHYSICS

Classification Explorer

G06F18/24323

PHYSICS

Classification Explorer

G06N3/006

PHYSICS

Classification Explorer

G06F2119/08

PHYSICS

Classification Explorer

G01M15/00

PHYSICS

Classification Explorer

G06F30/15

PHYSICS

Classification Explorer

G06N5/01

PHYSICS

Classification Explorer

G06N20/20

PHYSICS

International classification

Classification Explorer

G01M15/00

PHYSICS

Classification Explorer

G06F30/27

PHYSICS

Classification Explorer

G06N20/10

PHYSICS

Classification Explorer

G06K9/62

PHYSICS

Classification Explorer

G06N5/00

PHYSICS

Abstract

Claims

Description