Method for prediction of key performance parameter of an aero-engine transition state acceleration process based on space reconstruction
11436395 · 2022-09-06
Assignee
Inventors
Cpc classification
G06F18/214
PHYSICS
G06F18/217
PHYSICS
G06F30/27
PHYSICS
G06N3/082
PHYSICS
G06N3/006
PHYSICS
G06N5/01
PHYSICS
International classification
G06F30/27
PHYSICS
Abstract
A method for prediction of key performance parameters of an aero-engine transition state acceleration process based on space reconstruction. Aero-engine transition state acceleration process test data provided by a research institute is used for establishing a training dataset and a testing dataset; dimension increase is conducted on the datasets based on the data space reconstruction of an auto-encoder; model parameters optimization is conducted by population optimization algorithms which is represented by particle swarm algorithm; and random forest regression algorithm performing well on high-dimensional data is used for carrying out regression on transition state performance parameters, which realizes effective real-time prediction from the perspective of engineering application.
Claims
1. A method for prediction of key performance parameters of an aero-engine transition state acceleration process based on space reconstruction, comprising the following steps: step 1: preprocessing aero-engine test data (1) aero-engine transition state acceleration process test data comprises 10 kinds of parameters: compressor inlet relative speed PNNC2.sub.g, engine inlet temperature T.sub.2, engine inlet pressure P.sub.2, compressor outlet total pressure P.sub.3, fuel flow WFB, fan physical speed N.sub.f, compressor physical speed N.sub.c, exhaust gas temperature T.sub.5, simulated altitude H and simulated Mach Ma, which are taken as one sample; (2) data storage and reading: the aero-engine transition state acceleration process test data comprises data collected at multiple aero-engine commissioning process sites; combining the data collected at multiple aero-engine commissioning process sites for the aero-engine acceleration process and storing the collected data uniformly, then establishing an aero-engine performance parameter test database; (3) linear resampling: analyzing aero-engine transition state acceleration process test data; sampling different time intervals, a linear resampling method is adopted to resample the aero-engine transition state acceleration process test data to make sampling frequencies of signal identical; (4) data screening and cleaning: conducting visualization processing on the linearly resampled aero-engine transition state acceleration process test data, and conducting cleaning on acceleration curves which obviously do not meet objective conditions; step 2: conducting Random Forest regression model parameters selection the Random Forest regression model has two key parameters: ntree which is the number of Regression Trees in the Random Forest regression model; and mtry which is the feature number of Regression Trees in the Random Forest regression model, includes the number of branches of each Regression Tree; two-dimensional grid search is selected for ergodic calculation of the parameters ntree and mtry, and Mean Square Error (MSE) is selected for a fitness function; and an optimization range of the two-dimensional grid search is determined by the following principles: (1) the optimization range of ntree is determined by the out-of-bag (OOB) error rate, wherein the out-of-bag error rate is an error rate caused by the Forest regression model of data which is not selected as training sample for a single Decision Tree at a time; variation curves of OOB calculated for compressor physical speed N.sub.c and engine exhaust gas temperature EGT as predicted test data feature parameters as well as the parameter ntree are obtained; and therefore the optimization range of ntree is determined to be 50˜500; (2) the optimization range of mtry is determined to be from the natural number 1 to the total feature number of the test data; the Random Forest regression model parameters selection is finally determined to be ntree=300 and mtry=D/3 by the grid optimization algorithm, wherein D is the number of input variables of the Random Forest regression model; step 3: establishing a training database by using a sparse auto-encoder after determining the parameters of the Random Forest regression model, determining related parameters of Sparse Auto-Encoder (SAE) by using an Sparse Auto-Encoder Random Forest (SAE-RF) hybrid model; establishing an input vector of the SAE-RF hybrid model by using SAE with the structure of 10-dim-10; and optimizing the parameters of SAE by a dispersed-continuous hybrid particle swarm algorithm, wherein parameters of SAE comprise learning rate a and reconstructed dimension dim; using the particle swarm algorithm for optimization as follows: in two-dimensional parameter searching space, there is a population X=(X.sub.1, X.sub.2, . . . , X.sub.n) composed of n parameter combinations, wherein the position of the k.sup.th parameter combination in the parameter searching space is expressed as a two-dimensional vector X.sub.k=(x.sub.k1,x.sub.k2); assuming the k.sup.th parameter combination has the velocity V.sub.k=(V.sub.k1,V.sub.k2).sup.T in the parameter searching space, the local best parameter thereof is P.sub.k=(P.sub.k1,P.sub.k2).sup.T, and the global best parameter of the parameter combination is P.sub.g=(P.sub.g1,P.sub.g2).sup.T; and in each iteration, the iterative formulas of the velocity and the position of the parameter combination are expressed as:
V.sub.k.sup.t+1=wV.sub.k.sup.t+c.sub.1r.sub.1(P.sub.k.sup.t−X.sub.k.sup.t)+c.sub.2r.sub.2(P.sub.g.sup.t−X.sub.k.sup.t)
X.sub.k.sup.t+1=X.sub.k.sup.t+.sub.k.sup.t+1 where, w is inertia weight, t is the current number of iterations, r.sub.1,r.sub.2 are random numbers with uniform distribution in [0,1], and c.sub.1,c.sub.2 are learning factor constants; a K-fold cross-validation method is used for the estimation of generalization ability in parameters selection, and the specific steps of optimizing the SAE-RF hybrid model parameters based on the dispersed-continuous hybrid particle swarm algorithm are as follows: (1) randomly producing a group of {α, dim} as the initial position of particles, and determining inertia weight and learning factor; (2) evenly splitting a training sample into k mutually exclusive subsets S.sub.1, S.sub.2, . . . , S.sub.k; (3) taking the value of the initial position of the population as a parameter to train the SAE-RF hybrid model, and calculating the average value of k accuracies, which is the accuracy of the K-fold cross-validation; (4) taking the accuracy of the K-fold cross-validation as the fitness of the particle swarm algorithm, calculating the local best position and the global best position of the population, and iterating and updating the position and velocity; (5) repeating the step (2) until the fitness requirements are met or the maximum number of iterations is reached; (6) completing the parameter optimization, and taking the result as the parameter of the final SAE-RF hybrid model; step 4: building an SAE-RF regression model, predicting the aero-engine test data, and evaluating the prediction effect normalizing the data sample by a maximum value method to avoid the SAE-RF model error caused by magnitude difference; and normalizing the features of the aero-engine transition state acceleration process test data after sparse representation into the interval [1,2] according to the following formula:
Description
DESCRIPTION OF DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
DETAILED DESCRIPTION
(9) Specific embodiment of the present invention is further described below in combination with accompanying drawings and the technical solution.
(10) The data used in the present invention is 100 groups of transition state acceleration process bench test data of a certain type of aero-engine, which are provided by a domestic research institute.
(11) Step 1: preprocessing aero-engine test data;
(12) (1) Aero-engine test data comprises 10 groups of parameters: compressor inlet relative speed PNNC2.sub.g, engine inlet temperature T.sub.2, engine inlet pressure P.sub.2, compressor outlet total pressure P.sub.3, fuel flow WFB, fan physical speed N.sub.f, compressor physical speed N.sub.c, exhaust gas temperature T.sub.5, simulated altitude H and simulated Mach Ma;
(13) (2) Data integration: reading, integrating and storing txt files of 100 groups of data, and establishing an aero-engine test database.
(14) (3) Resampling: resampling the data first due to different sampling intervals. The specific steps are as follows: inserting the proposed new sampling frequency as an interpolation into the time series of the original data by using an interpolation method, and counting the number of original data between nominal sampling points. If only one original data is included, taking the original data as the data corresponding to the sampling point; if two original data are included, calculating the average value of the two original data, and taking the average value as the data corresponding to the time point; and if no original data is included, taking the average value of the data corresponding to the previous time point and the next time point of the time point in the nominal time series as the data of the time point.
(15) (4) Data screening and cleaning: conducting visualization processing on the data in order to conduct simple clustering and cleaning on acceleration curves.
(16) Step 2: conducting Random Forest regression model parameters selection;
(17) In the present invention, the parameter optimization range based on grid research is determined according to
m.sub.k=saerftrain(x.sup.tr.sup.
(18) Taking the average value of mean square errors of the three tasks as the value of the fitness function corresponding to this group of parameters:
(19)
(20) The final optimization result is shown in
(21) Step 3: establishing a training database by using a sparse auto-encoder;
(22) The important parameters of the sparse auto-encoder in the present invention comprise learning rate α and reconstructed dimension dim, wherein α is a continuous value and dim is a dispersed integer value, so a dispersed-continuous hybrid particle swarm algorithm is used for optimization in two dimensions of parameters, and the 3-fold cross-validation method is also used. Setting the number of groups in the initial population to 10 and the maximum number of iterations to 50, and randomly setting the initial position of particles to {[0,1],{1,2 . . . 50}}. Attention is needed to limit dim to not lower than 1 or larger than 50 during random setting of the initial velocity of particles. Setting the inertia weight to
(23)
and the learning factor to c.sub.1=c.sub.2=0.5+ln2.
(24) The parameter optimization result of SAE is shown in
(25) Step 4: building an SAE-RF regression model, predicting the aero-engine test data, and evaluating the prediction effect.
(26) Considering the large magnitude difference of the reconstructed aero-engine test data, conducting normalization processing by the maximum value method to increase the convergence rate and avoid reduction in the prediction accuracy caused by the magnitude difference. Normalizing the features of the test data after sparse representation into the interval [1,2] according to the following formula:
(27)
(28) Among 100 groups of test data provided by a domestic research institute and used in the present invention, taking 90 groups as training data and the remaining 10 groups as predicted data to respectively complete prediction tasks for compressor physical speed and exhaust gas temperature in the transition state acceleration process key parameters, calculating the relative error distribution and mean square error of 10 groups of test data, and evaluating the prediction effect of the model.
(29) As shown in
(30) TABLE-US-00001 TABLE 1 Mean Square Errors of Predicted Samples Mean square error Mean square error Sample of compressor of exhaust gas number physical speed temperature 91 4.4756 × 10.sup.−5 1.5095 × 10.sup.−2 92 6.2384 × 10.sup.−5 1.2713 × 10.sup.−2 93 7.0633 × 10.sup.−5 2.6651 × 10.sup.−2 94 1.3170 × 10.sup.−4 2.4940 × 10.sup.−2 95 9.3826 × 10.sup.−5 1.9236 × 10.sup.−2 96 1.2296 × 10.sup.−4 1.6477 × 10.sup.−2 97 7.8499 × 10.sup.−5 2.8127 × 10.sup.−2 98 7.9962 × 10.sup.−5 2.3424 × 10.sup.−2 99 7.8157 × 10.sup.−5 1.2246 × 10.sup.−2 100 8.3156 × 10.sup.−5 1.8875 × 10.sup.−2
(31) In conclusion, after the sparse auto-encoder based on the particle swarm algorithm optimization conducting dimension reconstruction on the aero-engine transition state acceleration process test data, the accuracy of predicting the key parameters such as compressor physical speed and exhaust gas temperature by the Random Forest regression algorithm can reach the desired effect. Therefore the present invention can be used in the fields of state prediction and fault diagnosis of an aero-engine.