A METHOD FOR PREDICTION OF KEY PERFORMANCE PARAMETER OF AN AERO-ENGINE TRANSITION STATE ACCELERATION PROCESS BASED ON SPACE RECONSTRUCTION
20200184131 ยท 2020-06-11
Inventors
- Shuo ZHANG (Dalian City, CN)
- Xiaoyu SUN (Dalian City, CN)
- Jibang LI (Dalian City, CN)
- Ximing SUN (Dalian City, CN)
Cpc classification
G06F18/214
PHYSICS
G06F18/217
PHYSICS
G06F30/27
PHYSICS
G06N3/082
PHYSICS
G06N3/006
PHYSICS
G06N5/01
PHYSICS
International classification
G06F30/27
PHYSICS
Abstract
A method for prediction of key performance parameters of an aero-engine transition state acceleration process based on space reconstruction. Aero-engine transition state acceleration process test data provided by a research institute is used for establishing a training dataset and a testing dataset; dimension increase is conducted on the datasets based on the data space reconstruction of an auto-encoder; model parameters optimization is conducted by population optimization algorithms which is represented by particle swarm algorithm; and random forest regression algorithm performing well on high-dimensional data is used for carrying out regression on transition state performance parameters, which realizes effective real-time prediction from the perspective of engineering application.
Claims
1. A method for prediction of key performance parameters of an aero-engine transition state acceleration process based on space reconstruction, comprising the following steps: step 1: preprocessing aero-engine test data (1) aero-engine transition state acceleration process test data comprises 10 kinds of parameters: compressor inlet relative speed PNNC2.sub.g, engine inlet temperature T.sub.2, engine inlet pressure P.sub.2, compressor outlet total pressure P.sub.3, fuel flow WFB, fan physical speed N.sub.f, compressor physical speed N.sub.c, exhaust gas temperature T.sub.5, simulated altitude H and simulated Mach Ma, which are taken as one sample; (2) data storage and reading: the aero-engine transition state acceleration process test data comprises data collected at multiple aero-engine commissioning process sites; combining the data collected at multiple aero-engine commissioning process sites for the aero-engine acceleration process and storing the data uniformly, then establishing an aero-engine performance parameter test database; (3) linear resampling: analyzing aero-engine acceleration process test data; because sampling time intervals are different, a linear resampling method is adopted to resample the aero-engine acceleration process test data to make the sampling frequencies of signal identical; (4) data screening and cleaning: conducting visualization processing on the linearly resampled aero-engine transition state acceleration process test data, and conducting cleaning on acceleration curves which obviously do not meet objective conditions; step 2: conducting Random Forest regression model parameters selection the Random Forest regression model has two key parameters: ntree which is the number of Regression Trees in the Random Forest regression model; and mtry which is the feature number of Regression Trees in the Random Forest regression model, i.e. the number of branches of each Regression Tree; two-dimensional grid search is selected for ergodic calculation of the parameters ntree and mtry, and MSE is selected for a fitness function; and the optimization range of the two-dimensional grid search is determined by the following principles: (1) the optimization range of ntree is determined by the out-of-bag error rate, wherein the out-of-bag error rate is the error rate caused by the regression of the data which is not selected as the training sample for a single Decision Tree at a time; variation curves of 00B calculated for compressor physical speed N.sub.c and engine exhaust gas temperature EGT as the predicted test data feature parameters as well as the parameter ntree are obtained; and therefore the optimization range of ntree is determined to be 50500; (2) the optimization range of mtry is determined to be from the natural number 1 to the total feature number of the test data; the Random Forest regression model parameters selection is finally determined to be ntree=300 and mtry=D/3 by the grid optimization algorithm, wherein D is the number of input variables of the model; step 3: establishing a training database by using a sparse auto-encoder after determining the parameters of the Random Forest regression model, determining related parameters of SAE by using an SAE-RF hybrid model; establishing the input vector of the model by using SAE with the structure of 10dim10; and optimizing the parameters of SAE by the dispersed-continuous hybrid particle swarm algorithm, wherein the important parameters of SAE comprise learning rate a and reconstructed dimension dim; using the particle swarm algorithm for optimization as follows: in two-dimensional parameter searching space, there is a population X=(X.sub.1, X.sub.2, . . . , X.sub.n) composed of n parameter combinations, wherein the position of the k.sup.th parameter combination in the parameter searching space is expressed as a two-dimensional vector X.sub.k=(X.sub.k1, X.sub.k2); assuming the k.sup.th parameter combination has the velocity V.sub.k=(V.sub.k1, V.sub.k2).sup.T in the parameter searching space, the local best parameter thereof is P.sub.k=(P.sub.k1,P.sub.k2).sup.T and the global best parameter of the parameter combination is P.sub.g=(P.sub.g1,P.sub.g2).sup.T; and in each iteration, the iterative formulas of the velocity and the position of the parameter combination are expressed as:
V.sub.k.sup.t+1=wV.sub.k.sup.t+c.sub.1r.sub.1(P.sub.k.sup.tX.sub.k.sup.t)+c.sub.2r.sub.2(P.sub.g.sup.tX.sub.k.sup.t) X.sub.k.sup.t+1=X.sub.k.sup.t+V.sub.k.sup.t+1 where, w is inertia weight, t is the current number of iterations, r.sub.1, r.sub.2 are random numbers with uniform distribution in [0,1], and c.sub.1, c.sub.2 are learning factor constants; the K-fold cross-validation method is used for the estimation of generalization ability in parameters selection, and the specific steps of optimizing the SAE-RF hybrid model parameters based on the dispersed-continuous hybrid particle swarm algorithm are as follows: (1) randomly producing a group of {a, dim} as the initial position of particles, and determining inertia weight and learning factor; (2) evenly splitting the training sample into k mutually exclusive subsets S.sub.1, S.sub.2, . . . , S.sub.k; (3) taking the value of the initial position of the population as a parameter to train the SAE-RF hybrid model, and calculating the average value of k accuracies, which is the accuracy of the K-fold cross-validation; (4) taking the accuracy of the K-fold cross-validation as the fitness of the particle swarm algorithm, calculating the local best position and the global best position of the population, and iterating and updating the position and velocity; (5) repeating the step (2) until the fitness requirements are met or the maximum number of iterations is reached; (6) completing the parameter optimization, and taking the result as the parameter of the final SAE-RF hybrid model; step 4: building an SAE-RF regression model, predicting the aero-engine test data, and evaluating the prediction effect normalizing the data sample by a maximum value method to avoid the model error caused by magnitude difference; and normalizing the features of the aero-engine transition state acceleration process test data after sparse representation into the interval [1,2] according to the following formula:
Description
DESCRIPTION OF DRAWINGS
[0037]
[0038]
[0039]
[0040]
[0041]
[0042]
[0043]
[0044]
DETAILED DESCRIPTION
[0045] Specific embodiment of the present invention is further described below in combination with accompanying drawings and the technical solution.
[0046] The data used in the present invention is 100 groups of transition state acceleration process bench test data of a certain type of aero-engine, which are provided by a domestic research institute.
[0047] Step 1: preprocessing aero-engine test data;
[0048] (1) Aero-engine test data comprises 10 groups of parameters: compressor inlet relative speed PNNC2.sub.g, engine inlet temperature T.sub.2 , engine inlet pressure P.sub.2, compressor outlet total pressure P.sub.3 , fuel flow WFB, fan physical speed N.sub.f , compressor physical speed N.sub.c, exhaust gas temperature T.sub.5, simulated altitude H and simulated Mach Ma;
[0049] (2) Data integration: reading, integrating and storing txt files of 100 groups of data, and establishing an aero-engine test database.
[0050] (3) Resampling: resampling the data first due to different sampling intervals. The specific steps are as follows: inserting the proposed new sampling frequency as an interpolation into the time series of the original data by using an interpolation method, and counting the number of original data between nominal sampling points. If only one original data is included, taking the original data as the data corresponding to the sampling point; if two original data are included, calculating the average value of the two original data, and taking the average value as the data corresponding to the time point; and if no original data is included, taking the average value of the data corresponding to the previous time point and the next time point of the time point in the nominal time series as the data of the time point.
[0051] (4) Data screening and cleaning: conducting visualization processing on the data in order to conduct simple clustering and cleaning on acceleration curves.
[0052] Step 2: conducting Random Forest regression model parameters selection;
[0053] In the present invention, the parameter optimization range based on grid research is determined according to
m.sub.k=saerftrain(x.sup.tris k, y.sup.tr.sup.
[0054] Taking the average value of mean square errors of the three tasks as the value of the fitness function corresponding to this group of parameters:
[0055] The final optimization result is shown in
[0056] Step 3: establishing a training database by using a sparse auto-encoder;
[0057] The important parameters of the sparse auto-encoder in the present invention comprise learning rate a and reconstructed dimension dim, wherein a is a continuous value and dim is a dispersed integer value, so a dispersed-continuous hybrid particle swarm algorithm is used for optimization in two dimensions of parameters, and the 3-fold cross-validation method is also used. Setting the number of groups in the initial population to 10 and the maximum number of iterations to 50, and randomly setting the initial position of particles to {[0,1],{1,2 . . . 50}}. Attention is needed to limit dim to not lower than 1 or larger than 50 during random setting of the initial velocity of particles. Setting the inertia weight to
and the learning factor to c.sub.1=c.sub.2=0.5+In2.
[0058] The parameter optimization result of SAE is shown in
[0059] Step 4: building an SAE-RF regression model, predicting the aero-engine test data, and evaluating the prediction effect.
[0060] Considering the large magnitude difference of the reconstructed aero-engine test data, conducting normalization processing by the maximum value method to increase the convergence rate and avoid reduction in the prediction accuracy caused by the magnitude difference. Normalizing the features of the test data after sparse representation into the interval [1,2 ] according to the following formula:
[0061] Among 100 groups of test data provided by a domestic research institute and used in the present invention, taking 90 groups as training data and the remaining 10 groups as predicted data to respectively complete prediction tasks for compressor physical speed and exhaust gas temperature in the transition state acceleration process key parameters, calculating the relative error distribution and mean square error of 10 groups of test data, and evaluating the prediction effect of the model.
[0062] As shown in
TABLE-US-00001 TABLE 1 Mean Square Errors of Predicted Samples Mean square error Mean square error Sample of compressor of exhaust gas number physical speed temperature 91 4.4756 10.sup.5 1.5095 10.sup.2 92 6.2384 10.sup.5 1.2713 10.sup.2 93 7.0633 10.sup.5 2.6651 10.sup.2 94 1.3170 10.sup.4 2.4940 10.sup.2 95 9.3826 10.sup.5 1.9236 10.sup.2 96 1.2296 10.sup.4 1.6477 10.sup.2 97 7.8499 10.sup.5 2.8127 10.sup.2 98 7.9962 10.sup.5 2.3424 10.sup.2 99 7.8157 10.sup.5 1.2246 10.sup.2 100 8.3156 10.sup.5 1.8875 10.sup.2
[0063] In conclusion, after the sparse auto-encoder based on the particle swarm algorithm optimization conducting dimension reconstruction on the aero-engine transition state acceleration process test data, the accuracy of predicting the key parameters such as compressor physical speed and exhaust gas temperature by the Random Forest regression algorithm can reach the desired effect. Therefore the present invention can be used in the fields of state prediction and fault diagnosis of an aero-engine.