ADAPTIVE MANIFOLD PROBABILITY DISTRIBUTION-BASED BEARING FAULT DIAGNOSIS METHOD

Abstract

The invention provides an adaptive manifold probability distribution-based bearing fault diagnosis method, including constructing transferable domains and transfer tasks; converting a data sample in each transfer task into frequency domain data via Fourier transform, inputting the frequency domain data into a GFK algorithm model, and calculating a manifold feature representation matrix related to a bearing fault in each transfer task by using the GFK algorithm model; calculating a cosine distance between centers of a target domain and a source domain in each transfer task according to a manifold feature representation, and defining a target function of in-domain classifier learning; then solving the target function, to obtain a probability distribution matrix of the target domain; and selecting a label corresponding to the largest probability value corresponding to each data sample in the target domain from the probability distribution matrix as a predicted label of the data sample in the target domain.

Claims

1. An adaptive manifold probability distribution-based bearing fault diagnosis method, comprising steps of: 1) constructing a plurality of transferable domains, wherein each transferable domain comprises data samples of a plurality of fault types of a bearing, and different transferable domains conform to different conditional distributions and marginal distributions; and defining any two transferable domains to form one transfer task, wherein in each transfer task, one transferable domain is designated as a source domain, the other transferable domain is a target domain, and each data sample in the source domain is marked with a fault type label; 2) converting each data sample in the source domain and the target domain in each transfer task into frequency domain data via Fourier transform, inputting the frequency domain data into a Geodesic Flow Kernel (GFK) algorithm model, and calculating a manifold feature representation matrix related to a bearing fault in each transfer task by using the GFK algorithm model; 3) calculating a cosine distance between centers of a target domain and a source domain in each transfer task according to a manifold feature representation, and defining a target function of in-domain classifier learning; 4) solving the target function according to the solved cosine distance and a constraint that an element in a probability distribution matrix of the target domain needs to satisfy, to obtain a probability distribution matrix of the target domain; and 5) selecting a fault type label corresponding to the largest probability value corresponding to each data sample in the target domain from the probability distribution matrix as a predicted fault type label of the data sample in the target domain, to complete bearing fault diagnosis.

2. The bearing fault diagnosis method according to claim 1, wherein constructing a transferable domain comprises acquiring vibration signals of the bearing under each fault type in different working conditions as the data samples, and then grouping data samples in one same working condition into one transferable domain.

3. The bearing fault diagnosis method according to claim 1, wherein calculating a manifold feature representation matrix related to a bearing fault in each transfer task by using the GFK algorithm model in step 2) comprises steps of: A1) defining a source domain custom-character .sub.s={X.sub.s, Y.sub.s} and a target domain .sub.t={X.sub.t}, wherein X.sub.s={x.sub.i.sup.s}.sub.i=1.sup.m denotes a sample set of the source domain .sub.s, x.sub.i.sup.s denotes an i.sup.th data sample in the source domain .sub.s, Y.sub.s={y.sub.i.sup.s}.sub.i=1.sup.m denotes a fault type label set in the source domain custom-character .sub.s, y.sub.i.sup.s denotes an i.sup.th fault type label in the source domain .sub.s, X.sub.t={x.sub.j.sup.t}.sub.j=1.sup.n denotes a sample set of the target domain .sub.t, x.sub.j.sup.t denotes a j.sup.th data sample in the target domain .sub.t, m denotes a total quantity of data samples in the source domain custom-character .sub.s, and n denotes a total quantity of data samples in the target domain .sub.t; obtaining a subspace data set P.sub.S of the source domain .sub.s and a subspace data set P.sub.t of the target domain .sub.t by using a principal component analysis (PCA) method, combining P.sub.S and P.sub.t into one combined matrix P.sub.s+t , and then calculating a sine angle α.sub.d between P.sub.S and P.sub.s+t and a sine angle β.sub.d between P.sub.t and P.sub.s+t, so that a consistency measurement function C(d) of P.sub.S and P.sub.t is:
C(d)=0.5 [sin α.sub.d+sin β.sub.d]; A2) calculating an optimal subspace dimensionality d* by using a greedy algorithm:
d*=min{d|C(d)=1}; A3) selecting first d* dimensions of feature vector matrices in P.sub.S and P.sub.t according to the optimal subspace dimensionality d* as preprocessed data after dimensionality reduction; A4) calculating a manifold feature conversion core matrix G according to the preprocessed data after dimensionality reduction, and obtaining a manifold feature representation matrix W according to the manifold feature conversion core matrix G:
W=√{square root over (G)}X, wherein X=[X.sub.s, X.sub.t]; and A5) obtaining a manifold feature representation matrix W.sub.s of the source domain custom-character .sub.s and a manifold feature representation matrix W.sub.t of the target domain .sub.t according to the manifold feature representation matrix W: ${\begin{matrix} \begin{matrix} W_{s} = {w_{i}^{s}}_{i = 1}^{m} = W_{1 : m} & w_{i}^{s} \in W_{s}, i \in {1, 2, .Math., m} \end{matrix} \\ \begin{matrix} W_{t} = {w_{j}^{t}}_{j = 1}^{n} = W_{m + 1 : m + n} & w_{j}^{t} \in W_{t}, j \in {1, 2, .Math., n} \end{matrix} \end{matrix},$ wherein w.sub.i.sup.s denotes an i.sup.th element in W.sub.s, and w.sub.j.sup.t denotes a j.sup.th element in W.sub.t.

4. The bearing fault diagnosis method according to claim 3, wherein the target function of the in-domain classifier learning in step 3) is defined as custom-character , so that:
=Σ.sub.j=1.sup.nΣ.sub.c=1.sup.cP.sub.cjD.sub.cj, wherein C denotes a total quantity of fault type labels, P.sub.cj is an element in the probability distribution matrix P∈.sup.C×n and denotes a probability that w.sub.j.sup.t in W.sub.t is a c.sup.th fault type label, and D.sub.cj is a cosine distance between centers of the target domain custom-character .sub.t and the source domain .sub.s, and also denotes a distance between w.sub.j.sup.t and a data set center e.sub.c corresponding to the c.sup.th fault type label.

5. The bearing fault diagnosis method according to claim 4, wherein the cosine distance in step 3) is D.sub.cj, and D.sub.cj is calculated by using the following formula: $D_{cj} = \frac{2_{j}^{t} e_{c}}{.Math. w_{j}^{t} .Math. .Math. e_{c} .Math.},$ wherein e.sub.c denotes the data set center corresponding to the c.sup.th fault type label in the source domain custom-character .sub.s.

6. The bearing fault diagnosis method according to claim 5, wherein $\begin{matrix} e_{c} = \frac{1}{m^{(c)}} {.Math.}_{i = 1}^{m} w_{i}^{s} & I (y_{i}^{s} = c), \end{matrix}$ wherein m.sup.(c) denotes a quantity of data samples corresponding to the c.sup.th fault type label in the source domain custom-character .sub.s, w.sub.i.sup.s is an i.sup.th element in W.sub.s, and denotes a manifold feature representation of the i.sup.th data sample in the source domain .sub.s, and I(y.sub.i.sup.s=c) is an indicator function, $I (y_{i}^{s} = c) = {\begin{matrix} 1, & True \\ 0, & False \end{matrix} .$

7. The bearing fault diagnosis method according to claim 4, wherein the constraint that the element P.sub.cj in the probability distribution matrix P of the target domain needs to satisfy in step 4) is: $s . t . {\begin{matrix} {.Math.}_{c = 1}^{C} P_{cj} = 1, \forall j \in {1, 2, .Math., n} \\ 0 \leq P_{cj} \leq 1 \\ {.Math.}_{j = 1}^{n} P_{cj} \geq 1, \forall c \in {1, 2, .Math., C} \end{matrix} .$

8. The bearing fault diagnosis method according to claim 7, wherein the target function is solved in the step 4) by using a linear programming solving method.

9. The bearing fault diagnosis method according to claim 7, wherein a calculation formula for selecting a fault type label corresponding to the largest probability value corresponding to each data sample in the target domain from the probability distribution matrix as a predicted fault type label of the data sample in the target domain in step 5) is: ${\hat{y}}_{j}^{t} (x) = \underset{l}{\arg \max} \frac{P_{cj}}{{.Math.}_{c = 1}^{C} P_{cj}}, l \in {1, 2, .Math., C},$ wherein ŷ.sub.j.sup.t(x) denotes a predicted fault type label of the j.sup.th data sample in the target domain custom-character .sub.t.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

[0033] FIG. 1 is a schematic structural diagram of a bearing fault simulation test rig according to an embodiment of the present invention;

[0034] FIG. 2 is a flowchart of an adaptive manifold probability distribution-based bearing fault diagnosis method of the present invention;

[0035] FIG. 3 is a principle diagram of a GFK algorithm;

[0036] FIG. 4 is an exemplary diagram of a probability distribution matrix;

[0037] FIG. 5 is a principle block diagram of a bearing fault diagnosis method according to the present invention;

[0038] FIG. 6 is a diagram of fault diagnosis confusion matrices of four transfer tasks; and

[0039] FIG. 7 is an effect diagram of extensibility of a bearing fault diagnosis method according to the present invention.

[0040] In the drawings: 1, drive motor, 2, plum coupling, 3, normal bearing, 4, test bearing, 5, bearing seat, 6, cushioning device, 7, dynamometer, and 8, load adjustment device.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

[0041] The present invention is further described below with reference to the accompanying drawings and specific embodiments, to enable a person skilled in the art to better understand and implement the present invention. However, the embodiments are not used to limit the present invention.

[0042] This embodiment discloses an adaptive manifold probability distribution-based bearing fault diagnosis method. The method is described below with reference to actual experimental data.

[0043] First, the experimental data (bearing data) in this embodiment is acquired by using a bearing fault simulation test rig shown in FIG. 1. The test rig includes a drive motor 1, a plum coupling 2, a normal bearing 3, a test bearing 4, a bearing seat 5, a cushioning device 6, a dynamometer 7, a load adjustment device 8, and an acceleration sensor. The load adjustment device 8 is used for adjusting a load, to simulate operating states of the bearing in different load conditions. The model of the used drive motor 1 is ABBQABP-90S-4A. The model of the used test bearing 4 is 6205-2RSSKF. The rotational speed of the drive motor 1 is set to 961 rpm. A sampling frequency is 10 kHz. In an experimental process, the acceleration sensor is placed on the bearing seat 5 at which the test bearing 4 is located, is located at the 12 o'clock direction from the test bearing, and vibration signals are acquired by using a data acquisition system NIPX1e-1082.

[0044] In addition, a cut is separately machined in an inner ring, a ball, and an outer ring of the test bearing 4 through wire cutting to simulate a bearing fault. The width of the cut denotes a fault size. The fault size includes 0.3 mm and 0.4 mm. Bearing fault types of the test bearing 4 are first classified as normal (NO), an inner-ring fault (IF), a ball fault (BF), and an outer-ring fault (OF). Each of the inner-ring fault (IF), the ball fault (BF), and the outer ring fault (OF) corresponds to the foregoing two fault sizes. In addition, four compound faulty bearings are machined with a width of 0.2 millimeters, that is, an inner ring/outer-ring fault (IOF), an inner ring/ball fault (IBF), an outer ring/ball fault (OBF), and an inner ring/outer ring/ball fault (IOBF), to implement more realistic simulation of actual fault conditions of the test bearing. The inner ring/outer-ring fault (IOF) denotes that fault occurs in both the inner ring and the outer ring. The inner ring/ball fault (IBF) denotes that fault occurs in both the inner ring and the ball. The outer ring/ball fault (OBF) denotes that fault occurs in both the outer ring and the ball. The inner ring/outer ring/ball fault (IOBF) denotes that fault occurs in all of the inner ring, the outer ring, and the ball. Therefore, there are a total of 1+2*3+4=11 fault types in this embodiment. That is, the test bearing has a total of 11 fault types. Each fault type corresponds to one fault type label.

[0045] Referring to FIG. 2, the adaptive manifold probability distribution-based bearing fault diagnosis method in this embodiment includes the following steps.

[0046] 1) A plurality of transferable domains are constructed. Each transferable domain includes data samples of 11 fault types of a bearing. Different transferable domains conform to different conditional distributions and marginal distributions. Any two transferable domains are defined to form one transfer task. In each transfer task, one transferable domain is designated as a source domain custom-character .sub.s={X.sub.x, Y.sub.s}, the other transferable domain is a target domain .sub.t={X.sub.t}, and each data sample in the source domain .sub.s is marked with a fault type label. No data sample in the target domain .sub.t is marked with a fault type label.

[0047] X.sub.s={X.sub.i.sup.s}.sub.i=1.sup.m denotes a sample set of the source domain custom-character .sub.s, x.sub.i.sup.s denotes an i.sup.th data sample in the source domain .sub.s, Y.sub.s={y.sub.i.sup.s}.sub.i=1.sup.m denotes a fault type label set in the source domain .sub.s, y.sub.i.sup.s denotes an i.sup.th fault type label in the source domain .sub.s, X.sub.t={X.sub.j.sup.t}.sub.j=1.sup.n denotes a sample set of the target domain custom-character .sub.t, x.sub.j.sup.t denotes a j.sup.th data sample in the target domain .sub.t, m denotes a total quantity of data samples in the source domain .sub.s, and n denotes a total quantity of data samples in the target domain .sub.t.Math.X.sub.s∈.sup.m×.sup.r, Y.sub.s∈.sup.m×1, X.sub.t∈ custom-character .sup.n×r, and r denotes a quantity of sample features. Basic assumption: A feature space is χ.sub.s=χ.sub.t, a label space is .sub.s=.sub.t, a conditional distribution is Q(y.sup.s(x)|x.sup.s)≠Q(y.sup.t(x)|x.sup.t), and an marginal distribution is P(x.sup.s)≠P(x.sup.t).

[0048] A method for constructing a transferable domain is: acquiring vibration signals of the bearing under each fault type in different working conditions as the data samples, and then grouping data samples in one same working condition into one transferable domain. Each transferable domain corresponds to one working condition.

[0049] Different working conditions refer to different load working conditions.

[0050] For example, four transferable domains may be constructed, and are separately marked as L0, L1, L2, and L3. L0 denotes a working condition that a radial load applied in the experiment is 0 kN. L1 denotes a working condition that a radial load applied in the experiment is 1 kN. L2 denotes a working condition that a radial load applied in the experiment is 2 kN. L3 denotes a working condition that a radial load applied in the experiment is 3 kN. Vibration time domain signal data of the test bearing of each fault type in the foregoing four load states is first acquired as data samples by using the acceleration sensor. 200 data samples are acquired in each fault type. Data samples in one same working condition are then grouped into one transferable domain. Each of the transferable domains L0, L1, L2, and L3 includes 11 fault types. Each fault type includes 200 data samples. That is, a total quantity of data samples in each transferable domain is 11*200=2200. In this case, in X.sub.s={x.sub.i.sup.s}.sub.i=1.sup.m and X.sub.t={x.sub.j.sup.t}.sub.j=1.sup.n, m=n=2200. Refer to Table 1 for the composition of data samples in each transferable domain.

TABLE-US-00001 TABLE 1 Composition of 11 bearing fault types in each transferable domain Fault Sample size/millimeter Fault type Label quantity Symbol — Normal 1 200 NO 0.2 Inner ring/outer 2 200 IOF0.2 ring fault 0.2 Inner ring/ball fault 3 200 IBF0.2 0.2 Outer ring/ball fault 4 200 OBF0.2 0.2 Inner ring/outer 5 200 IOBF0.2 ring/ball fault 0.3 Inner ring fault 6 200 IF0.3 0.3 Ball fault 7 200 BF0.3 0.3 Outer ring fault 8 200 OF0.3 0.4 Inner ring fault 9 200 IF0.4 0.4 Ball fault 10 200 BF0.4 0.4 Outer ring fault 11 200 OF0.4

[0051] One transfer task is formed between any two of the four transferable domains L0, L1, L2, and L3. A total of 12 transfer tasks may be established, that is: L0.fwdarw.L1, L0.fwdarw.L2, L0.fwdarw.L3; L1.fwdarw.L0, L1.fwdarw.L2, L1.fwdarw.L3; L2.fwdarw.L0, L2.fwdarw.L1, L2.fwdarw.L3; L3.fwdarw.L0, L3.fwdarw.L1, and L3.fwdarw.L2. In each transfer task, on the left side of an arrow is a source domain custom-character .sub.s marked with a fault type label, and on the right side is a target domain .sub.t that is not marked with a fault type label, which is a diagnosis object of a fault type to be recognized. Refer to Table 2 for specific settings of the transfer tasks.

TABLE-US-00002 TABLE 2 Settings of 12 transfer tasks Single- Fault domain Transfer Source Target size sample Health task domain domain (mm) quantity status 1 L1 L0 0.2/0.3/0.4 2200 NO/IF/OF/BF/IOF/ IBF/OBF/IOBF 2 L2 L0 0.2/0.3/0.4 2200 NO/IF/OF/BF/IOF/ IBF/OBF/IOBF 3 L3 L0 0.2/0.3/0.4 2200 NO/IF/OF/BF/IOF/ IBF/OBF/IOBF 4 L0 L1 0.2/0.3/0.4 2200 NO/IF/OF/BF/IOF/ IBF/OBF/IOBF 5 L2 L1 0.2/0.3/0.4 2200 NO/IF/OF/BF/IOF/ IBF/OBF/IOBF 6 L3 L1 0.2/0.3/0.4 2200 NO/IF/OF/BF/IOF/ IBF/OBF/IOBF 7 L0 L2 0.2/0.3/0.4 2200 NO/IF/OF/BF/IOF/ IBF/OBF/IOBF 8 L1 L2 0.2/0.3/0.4 2200 NO/IF/OF/BF/IOF/ IBF/OBF/IOBF 9 L3 L2 0.2/0.3/0.4 2200 NO/IF/OF/BF/IOF/ IBF/OBF/IOBF 10 L0 L3 0.2/0.3/0.4 2200 NO/IF/OF/BF/IOF/ IBF/OBF/IOBF 11 L1 L3 0.2/0.3/0.4 2200 NO/IF/OF/BF/IOF/ IBF/OBF/IOBF 12 L2 L3 0.2/0.3/0.4 2200 NO/IF/OF/BF/IOF/ IBF/OBF/IOBF

[0052] 2) Each data sample in the source domain and the target domain in each transfer task is converted into frequency domain data via Fast Fourier transform (FFT), the frequency domain data is inputted into a GFK algorithm model, and a manifold feature representation matrix related to a bearing fault in each transfer task is calculated by using the GFK algorithm model, to implement cross-domain data distribution alignment in variable working conditions.

[0053] Further, when original data samples of the bearing are converted into frequency domain signals via FFT, one-sided spectrum information of the frequency domain is kept.

[0054] 3) A cosine distance between centers of a target domain and a source domain in each transfer task according to a manifold feature representation is calculated, and a target function of in-domain classifier learning is defined.

[0055] 4) The target function is solved according to the solved cosine distance and a constraint that an element in a probability distribution matrix of the target domain needs to satisfy, to obtain the probability distribution matrix of the target domain.

[0056] 5) A fault type label corresponding to the largest probability value corresponding to each data sample in the target domain from the probability distribution matrix is selected as a predicted fault type label of the data sample in the target domain, to complete bearing fault diagnosis.

[0057] In an embodiment, referring to FIG. 5, a method for calculating a manifold feature representation matrix related to a bearing fault in each transfer task by using the GFK algorithm model in the foregoing step 2) includes the following steps.

[0058] A1) A source domain custom-character .sub.s={X.sub.s, Y.sub.s} and a target domain .sub.t={X.sub.t} are defined. A subspace data set P.sub.S of the source domain .sub.s and a subspace data set P.sub.t of the target domain .sub.t are obtained by using a PCA method. P.sub.S and P.sub.t are combined into one combined matrix P.sub.s+t. A sine angle α.sub.d between P.sub.S and P.sub.s+t and a sine angle β.sub.d between P.sub.t and P.sub.s+t are then calculated, so that a consistency measurement function C(d) of P.sub.s and P.sub.t is:

C(d)=0.5[sin α.sub.d+sin β.sub.d].

[0059] Here, when the source domain is more similar to the target domain, the value of C(d) is larger, while it is ensured that a variance captured in one subspace can be transferred to another subspace, that is: α.sub.d≠π/2, and β.sub.d≠π/2.

[0060] A2) An optimal subspace dimensionality d* is calculated by using a greedy algorithm:

d*=min{d|C(d)=1}.

[0061] A3) First d* dimensions of feature vector matrices in data sets P.sub.S and P.sub.t are selected according to the optimal subspace dimensionality d* as preprocessed data after dimensionality reduction.

[0062] An optimal subspace dimensionality is calculated by using a greedy algorithm. Eigenvectors more related to bearing faults are selected, and redundant unrelated features are eliminated, thereby improving the performance of bearing fault diagnosis and helping to increase the operation speed.

[0063] A4) A manifold feature conversion core matrix G is calculated according to the preprocessed data after dimensionality reduction, and a manifold feature representation matrix W is obtained according to the manifold feature conversion core matrix G:

W=√{square root over (G)}X,

[0064] where X=[X.sub.s, X.sub.t].

[00007] $G = [\begin{matrix} P_{S} U_{1} & R_{S} U_{2}] \end{matrix} [\begin{matrix} Λ_{1} & Λ_{2} \\ Λ_{2} & Λ_{3} \end{matrix}] [\begin{matrix} U_{1}^{T} & P_{S}^{T} \\ U_{2}^{T} & R_{S}^{T} \end{matrix}] .$

[0065] The manifold feature conversion core matrix G is

[0066] In the foregoing formula, Λ.sub.1, Λ.sub.2, and Λ.sub.3 are all diagonal matrices. A diagonal element of Λ.sub.1 is

[00008] $λ_{1 i} = 1 + \frac{\sin (2 θ_{i})}{2 θ_{i}} .$

A diagonal element of Λ.sub.2 is

[00009] $λ_{2 i} = \frac{\cos (2 θ_{i}) - 1}{2 θ_{i}} .$

A diagonal element of Λ.sub.3is

[00010] $λ_{3 i} = 1 - \frac{\sin (2 θ_{i})}{2 θ_{i}} .$

U.sub.1∈ custom-character .sup.d×d and U.sub.2∈.sup.(r−d)×d denote a pair of orthogonal matrices, and may be calculated by using the following singular value decomposition: P.sub.S.sup.TP.sub.t=U.sub.1ΓV.sup.T, R.sub.S.sup.TP.sub.t=−U.sub.2ΣV.sup.T. R.sub.S∈.sup.rx(r−d) is an orthogonal complement of P.sub.S. Γ and Σ are both diagonal matrices. Diagonal elements of Γ and Σ are respectively cos θ.sub.i and sin θ.sub.i (i=1, 2 , . . . d*). θ.sub.i (0≤θ.sub.1≤θ.sub.2≤ . . . θ.sub.d*≤π/2) is an angle between subspaces P.sub.S and P.sub.t.

[0067] It may be understood that a feature in a manifold space usually has adequate geometric properties, so that inter-domain data distribution differences can be reduced. Therefore, features in the original space are first transformed into a Grassmann manifold custom-character (d), and referring to FIG. 3, original d*-dimensional subspaces (eigenvectors) are seen as basic elements thereof, to promote classifier learning. P.sub.S and P.sub.t are seen as two points in (d). One geodesic flow Φ(t)(0≤t≤1) may be built between the two points. Converted features are represented as: w=g (x)=Φ(t).sup.Tx . Any two converted eigenvectors w.sub.i and w.sub.j are chosen. An inner product of the eigenvectors define one kernel function: custom-character w.sub.i.sup.s, w.sub.j.sup.t=∫.sub.0.sup.1(Φ)(t).sup.Tx.sub.i.sup.s).sup.T(Φ)(t).sup.Tx.sub.j.sup.t)dt=x.sub.i.sup.sTGx.sub.j.sup.t. G∈.sup.r×r is a positive semi-definite matrix obtained through singular value decomposition. A converted manifold feature is represented as W=√{square root over (G)}X.

[0068] A5) A manifold feature representation matrix W.sub.s of the source domain custom-character .sub.s and a manifold feature representation matrix W.sub.t of the target domain .sub.t are obtained according to the manifold feature representation matrix W:

[00011] ${\begin{matrix} \begin{matrix} W_{s} = {w_{i}^{s}}_{i = 1}^{m} = W_{1 : m} & w_{i}^{s} \in W_{s}, i \in {1, 2, .Math., m} \end{matrix} \\ \begin{matrix} W_{t} = {w_{j}^{t}}_{j = 1}^{n} = W_{m + 1 : m + n} & w_{j}^{t} \in W_{t}, j \in {1, 2, .Math., n} \end{matrix} \end{matrix},$

[0069] where w.sub.i.sup.s denotes an i.sup.th element in W.sub.s, and w.sub.j.sup.t denotes a j.sup.th element in W.sub.t.

[0070] It may be understood that the manifold feature representation matrix W is a combined matrix of the matrices W.sub.s and W.sub.t. A set of the first to m.sup.th pieces of data in W forms W.sub.s, and a set of the remaining (m+1).sup.th to (m+n).sup.th pieces of data forms W.sub.t.

[0071] In an embodiment, the target function of the in-domain classifier learning in step 3) is defined as custom-character , the target function is :

custom-character =Σ.sub.j=1.sup.nΣ.sub.c=1.sup.cP.sub.cjD.sub.cj,

[0072] where C denotes a total quantity of fault type labels, P.sub.cj is an element in the probability distribution matrix P∈ custom-character .sup.c×n of the target domain, and denotes a probability that W.sub.t in w.sub.j.sup.t is a c.sup.th fault type label, and D.sub.cj is a cosine distance between centers of the target domain .sub.t and the source domain .sub.s, and also denotes a distance between w.sub.j.sup.t and a data set center e.sub.c corresponding to the c.sup.th fault type label. For example, referring to Table 1, in this case, C=11. A total quantity of data samples in the target domain is n=2200. Each fault type label corresponds to one fault type. If c=1, it represents the first fault type label, and a fault type corresponding to the label is normal. If c=6, it represents the sixth fault type label, and a fault type corresponding to the label is an inner-ring fault.

[0073] In an embodiment, the cosine distance in step 3) is D.sub.cj, and D.sub.cj is calculated by using the following formula:

[00012] $D_{cj} = \frac{w_{j}^{t} e_{c}}{.Math. w_{j}^{t} .Math. .Math. e_{c} .Math.},$

[0074] where e.sub.c denotes the data set center corresponding to the c.sup.th fault type label in the source domain custom-character .sub.s, and w.sub.j.sup.t denotes a j.sup.th element in W.sub.t.

[0075] Further, e.sub.c is calculated by using the following formula:

[00013] $e_{c} = \frac{1}{m^{(c)}} {.Math.}_{i = 1}^{m} w_{i}^{s} I (y_{i}^{s} = c) .$

[0076] m.sup.(c) denotes a quantity of data samples corresponding to the c.sup.th fault type label in the source domain custom-character .sub.s. w.sub.i.sup.s is an i.sup.th element in W.sub.s, and denotes a manifold feature representation of the i.sup.th data sample in the source domain .sub.s.

[0077] I(y.sub.i.sup.s=c) is an indicator function,

[00014] $I (y_{i}^{s} = c) = {\begin{matrix} 1, & True \\ 0, & False \end{matrix} .$

That is, if the i.sup.th label y.sub.i.sup.s in the source domain custom-character .sub.s is the c.sup.th fault type label, the value of I is equal to 1, or otherwise the value of I is equal to 0.

[0078] In an embodiment, the constraint that the element P.sub.cj in the probability distribution matrix P of the target domain needs to satisfy in step 4) is:

[00015] $s . t . {\begin{matrix} {.Math.}_{c = 1}^{C} P_{cj} = 1, \forall j \in {1, 2, .Math., n} \\ 0 \leq P_{cj} \leq 1 \\ {.Math.}_{j = 1}^{n} P_{cj} \geq 1, \forall c \in {1, 2, .Math., C} \end{matrix} .$

[0079] For a probability distribution of w.sub.j.sup.t, a sum of all probability values that w.sub.j.sup.t belong to different classes is 1, and it is therefore obtained that the constraint is: Σ.sub.c=1.sup.CP.sub.cj=1,∀j∈{1,2, . . . , n}.

[0080] In addition, unmarked data samples in the target domain do not necessarily completely conform to a 0 to 1 distribution. However, in fact, learned classifiers can readily differentiate between partial fault data that are collected in different working conditions. Assuming that samples having the same fault type label usually exhibit obvious clustering, in an optimal case, all samples to be recognized in the target domain tend to conform to a 0-1 distribution. In this way, the constraint Σ.sub.j=1.sup.nP.sub.cj≥1, ∀c∈{1,2, . . . , C} is obtained. The constraint replaces a conventional constraint P.sub.cj=max(P.sub.lj) , l∈{1,2, . . . , C}, ∀c∈{1,2, . . . , C},∃j. Compared with the conventional constraint, the foregoing constraint is easier to program and implement.

[0081] In an embodiment, the target function is solved in the step 4) by using a linear programming solving method. That is, according to the target function and three constraints, an optimization objective of the target function of in-domain classifier learning may be denoted as:

[00016] $\min = {.Math.}_{j = 1}^{n} {.Math.}_{c = 1}^{C} P_{cj} D_{cj} s . t . {\begin{matrix} {.Math.}_{c = 1}^{C} P_{cj} = 1, \forall j \in {1, 2, .Math., n} \\ 0 \leq P_{cj} \leq 1 \\ {.Math.}_{j = 1}^{n} P_{cj} \geq 1, \forall c \in {1, 2, .Math., C} \end{matrix}$

[0082] An optimization problem of the target function is actually converted into a linear programming problem. The probability distribution matrix P∈ custom-character .sup.C×n of the data sample in the target domain may be obtained by solving the problem. FIG. 4 is an exemplary diagram of a probability distribution matrix.

[0083] In an embodiment, a calculation formula for selecting a fault type label corresponding to the largest probability value corresponding to each data sample in the target domain from the probability distribution matrix P as a predicted fault type label of the data sample in the target domain in step 5) is:

[00017] ${\hat{y}}_{j}^{t} (x) = \underset{l}{\arg \max} \frac{P_{cj}}{{.Math.}_{c = 1}^{C} P_{cj}}, l \in {1, 2, .Math., C} .$

[0084] ŷ.sub.j.sup.t(x) denotes a predicted fault type label of the j.sup.th data sample in the target domain custom-character .sub.t.

[0085] It may be understood that no data sample in the target domain is marked with a fault type label. By means of the probability distribution matrix P, the largest probability that each data sample in the target domain belong to a fault type label may be obtained, and a fault type label corresponding to the largest probability value is used as a predicted fault type label of an unmarked data sample.

[0086] The bearing fault diagnosis method in the foregoing embodiment is compared with five conventional diagnosis algorithms. The five conventional diagnosis algorithms are: a K-Nearest Neighbor (KNN) algorithm; PCA; Subspace Alignment (SA); Transfer Component Analysis (TCA); and GFK. Refer to Table 3 for comparison results of fault diagnosis accuracy of the diagnosis methods:

TABLE-US-00003 TABLE 3 Fault diagnosis accuracy of experimental data This Transfer task KNN PCA SA TCA GFK embodiment L0.fwdarw.L1 99.45 97.82 99.27 98.64 98.91 99.50 L0.fwdarw.L2 94.23 90.41 94.32 97.14 94.05 99.09 L0.fwdarw.L3 86.41 90.41 87.09 88.95 97.41 97.23 L1.fwdarw.L0 95.14 95.73 99.95 96.36 99.41 99.82 L1.fwdarw.L2 95.23 95.73 95.45 99.59 97.95 99.23 L1.fwdarw.L3 87.55 93.36 89.45 94.50 90.36 97.09 L2.fwdarw.L0 88.91 90.41 98.27 89.36 96.86 99.68 L2.fwdarw.L1 99.64 97.55 99.95 99.45 96.68 99.41 L2.fwdarw.L3 92.41 97.00 90.82 98.00 89.55 97.36 L3.fwdarw.L0 74.64 87.73 88.82 89.41 88.91 99.27 L3.fwdarw.L1 90.73 99.05 91.82 89.27 98.50 99.27 L3.fwdarw.L2 92.36 99.68 92.64 98.73 98.95 99.23 Average 91.39 94.57 93.99 94.95 95.62 98.85 accuracy Average 6 4 5 3 2 1 accuracy rank

[0087] As can be seen from the diagnosis results in Table 3, the diagnosis method in this embodiment achieves the highest accuracy in 6 out of the 12 transfer tasks, achieves the highest average diagnosis accuracy (98.85%), obtains a diagnosis accuracy of not less than 97.00% for all the transfer tasks, and exhibits a high accuracy and robustness of diagnosis results in a plurality of working conditions.

[0088] FIG. 6 is a diagram of fault diagnosis confusion matrices of four transfer tasks in this embodiment. FIG. 6 shows that when the diagnosis method in this embodiment is used to perform different transfer diagnosis tasks, the method has an excellent fault recognition capability for a bearing with a single fault. Diagnosis errors generally all occur during fault diagnosis of a bearing with a compound fault. This is caused by the complexity of vibration signals of a bearing with a compound fault.

[0089] FIG. 7 is an effect diagram of extensibility of an adaptive manifold probability distribution-based bearing fault diagnosis method in this embodiment. FIG. 7 indicates that compared with KNN and SVM classifiers, an in-domain classifier provided in the present invention can effectively improve the fault diagnosis performance of four transfer learning methods, that is, has better extensibility.

[0090] The adaptive manifold probability distribution-based bearing fault diagnosis method in this embodiment has the following advantages:

[0091] (1) A manifold feature representation matrix related to a bearing fault is acquired by using a GFK algorithm. That is, a geodesic flow kernel is built in a Grassmann manifold space to acquire a manifold feature representation of data, and an optimal subspace dimensionality is automatically calculated, thereby effectively reducing distribution differences in inter-domain data.

[0092] (2) In-domain classifier learning is implemented by calculating a cosine distance and solving a target function and a probability distribution matrix. A parameter-free in-domain classifier is built based on a probability distribution of samples. The in-domain classifier is combined with a GFK manifold learning algorithm, so that the diagnosis accuracy and the diagnosis efficiency of bearing fault diagnosis in variable working conditions are improved. In addition, the in-domain classifier may also be combined with another existing data alignment and feature extraction algorithm, thereby implementing adequate extensibility.

[0093] (3) A process of constructing data distribution alignment and in-domain classifier learning is very simple. Complex model selection and hyperparameter tuning are not required. The characteristic can better satisfy actual requirements of fault diagnosis in different working conditions.

[0094] (4) Compared with a deep learning-based bearing fault diagnosis method, the diagnosis method in this embodiment has high interpretability, relatively low requirements for computer hardware resources, a faster execution speed, and excellent diagnosis accuracy and model universality, is particularly suitable for multi-scenario, multi-fault bearing fault diagnosis in variable working conditions, and can be widely applied to fault diagnosis tasks in variable working conditions for complex systems such as machinery, electric power, chemical industry, and aviation. A final classifier of an existing model may be replaced with the in-domain classifier provided in this embodiment, thereby further improving the fault diagnosis performance.

[0095] The adaptive manifold probability distribution-based bearing fault diagnosis method in this embodiment effectively improves the diagnosis accuracy and the diagnosis efficiency of bearing fault diagnosis, so that use requirements can be effectively satisfied.

[0096] The foregoing embodiments are merely preferred embodiments used to fully describe the present invention, and the protection scope of the present invention is not limited thereto. Equivalent replacements or variations made by a person skilled in the art to the present invention all fall within the protection scope of the present invention. The protection scope of the present invention is as defined in the claims.

ADAPTIVE MANIFOLD PROBABILITY DISTRIBUTION-BASED BEARING FAULT DIAGNOSIS METHOD

Inventors

Cpc classification

Classification Explorer

G06N7/01

PHYSICS

Classification Explorer

G06N20/00

PHYSICS

Classification Explorer

G01M13/045

PHYSICS

International classification

Classification Explorer

G01M13/045

PHYSICS

Abstract

Claims

Description