AUTOMATIC MODULATION CLASSIFICATION METHOD BASED ON DEEP LEARNING NETWORK FUSION

20240112037 ยท 2024-04-04

    Inventors

    Cpc classification

    International classification

    Abstract

    The present invention discloses an automatic modulation classification method based on deep learning network fusion, comprising: acquiring a WBFM sample signal within a data set RML 2016.10a, and selecting a proper threshold ? to separate a WBFM signal during a silence period; expanding a new WBFM signal to 1000 by adopting a data enhancement method, and expanding an original data set; dividing the data set expanded in the step S2 into a training set, a verification set and a test set; respectively calculating amplitude, phase and a fractional order Fourier transformation result for data in the step S3; building a multi-channel feature fusion network model composed of an LSTM network and an FPN network; performing network model training, after the end of training, inputting verification set data into a trained network model for verification, and calculating prediction accuracy; and performing parameter fine adjustment on the network model through said test set, improving prediction precision, and taking a final model as an automatic modulation classification model. The present invention enables the improvement to the average classification accuracy rate of communication signals.

    Claims

    1. An automatic modulation classification method based on deep learning network fusion, comprising the following steps: S1, acquiring a WBFM sample signal within a RML 2016.10a data set, and selecting a proper threshold ? to separate a WBFM signal during a silence period; S2, expanding a new WBFM signal to 1000 by adopting a data enhancement method, and expanding an original data set; S3, dividing the data set expanded in said step S2 into a training set, a verification set and a test set; S4, respectively calculating amplitude, phase and a fractional order Fourier transformation result for data in said step S3; S5, building a multi-channel feature fusion network model composed of an LSTM network and an FPN network; S6, performing network model training, after the end of training, inputting verification set data into a trained network model for verification, and calculating prediction accuracy; and S7, performing parameter fine adjustment on the network model by means of said test set to improve prediction precision, and taking a final model as an automatic modulation classification model.

    2. The automatic modulation classification method based on deep learning network fusion according to claim 1, wherein said step S1 includes the following sub steps: selecting all data samples with a WBFM label, and normalizing the zero centers of the acquired WBFM sample signals, giving the maximum value of the instantaneous amplitude spectral density ? max = max .Math. "\[LeftBracketingBar]" fft [ N s .Math. A ( i ) .Math. i = 1 N s A ( i ) - 1 ] 2 .Math. "\[RightBracketingBar]" , where A(i) is an instantaneous amplitude value at each sampling time, N.sub.s is the number of sampling points, fft(.Math.) is a Fourier transformation operator, max(.Math.) presents a maximum value; and selecting a proper threshold ?, on ?.sub.max>? judging that the signal is not a WBFM signal in a silence period, then acquiring said sample signal.

    3. The automatic modulation classification method based on deep learning network fusion according to claim 1, wherein said step S2 includes the following sub step: the modulation mode of said RML 2016.10a data set being I/Q modulation, enabling a single sample signal to be represented as x.sub.i?I,Q?; changing said single sample signal as x.sub.i=?I,?Q?, x.sub.i=??I,Q?, x.sub.i=??I,?Q?, so as to expand said WBFM signal to 1000 sample data.

    4. The automatic modulation classification method based on deep learning network fusion according to claim 1, wherein said step S3 includes the following sub step: dividing the data set expanded in said S2 to said training set as 60%, said verification set as 20% and said test set as 20%, and randomly disarranging said raining set data.

    5. The automatic modulation classification method based on deep learning network fusion according to claim 1, wherein said step S4 includes the following sub steps: converting IQ signals into amplitude phase information with the amplitude as follows:
    A.sub.i=?{square root over (I.sub.i.sup.2+Q.sub.i.sup.2)} where, I.sub.i and Q.sub.i represent an imaginary part of i.sup.th data and a real part of i.sup.th data, respectively, A.sub.i represents a amplitude of i.sup.th data; performing L2 norm normalization, where the L2 norm of an amplitude of i.sup.th data is defined as:
    A.sub.norm=?{square root over (A.sub.1.sup.2+A.sub.2.sup.2+ . . . A.sub.N.sup.2)} the amplitude after said L2 norm normalization being as follows: A = A i A norm ; a phase calculation formula being as follows:
    ?.sub.i=arctan(Q.sub.i/I.sub.i) wherein arctan is an arctangent function; acquiring said fractional order Fourier transformation result for data, with its calculation formula as follows: X p ( u ) = .Math. "\[LeftBracketingBar]" F p [ s ( t ) ] .Math. "\[RightBracketingBar]" = .Math. "\[LeftBracketingBar]" ? - ? + ? ( I i + jQ i ) K p ( t , u ) dt .Math. "\[RightBracketingBar]" K p ( t , u ) = { 1 - j cot ? e j ? [ ( t 2 + u 2 ) cot ? - 2 tu csc ? ] , ? ? n ? ? ( t - u ) , ? = 2 n ? ? ( t + u ) , ? = ( 2 n + 1 ) ? wherein F.sup.p is a fractional Fourier transformation operator, s(t) is an original signal, K.sub.p(t,u) is a conversion kernel, t is a time domain, u is a fractional order Fourier domain, ? is a rotation angle, cot is a cotangent function, csc is a cosecant function, ? is a circular constant, ?(t) is an impulse function, n is a positive integer.

    6. The automatic modulation classification method based on deep learning network fusion according to claim 1, wherein said step S5 includes the following sub step: said input to said LSTM network being an amplitude of i.sup.th data and a phase of i.sup.th data, an output form said LSTM network being a dimensional feature graph; said input to said FPN network being an imaginary part of i.sup.th data, a real part of i.sup.th data, and a fractional order Fourier transformation result of i.sup.th data.

    7. The automatic modulation classification method based on deep learning network fusion according to claim 1, wherein said step S5 includes the following sub steps: building said LSTM network with an input layer, two LSTM layers, a Dense layer and an output layer, where an input data matrix is N?128?2, an output matrix is N?M, N is the number of samples, and M is the number of feature points; and building said FPN network with three input layers, three Conv2d layers and two Dense layers, where an input data matrix is N?3?128?1, an output matrix is N?M?1, N is the number of samples, and M is the number of feature points.

    8. The automatic modulation classification method based on deep learning network fusion according to claim 1, wherein said LSTM network model further includes a forget gate, an input gate, an output gate and output memory information; the calculation formula of said forget gate is as follows:
    f.sub.?=?(W.sub.f.Math.[h.sub.?-1,x.sub.?]+b.sub.f) where W.sub.f represents a forget gate weight matrix, x.sub.? represents an input matrix at a time step length ?, h.sub.?-1 represents an output of a hidden layer at a previous time; b.sub.f represents a forget gate deviation; sigmoid function is ? ( x ) = 1 1 + e - x , f.sub.??(0, 1), with e as a natural constant; the calculation formula of said input gate is as follows:
    i.sub.?=?(W.sub.i.Math.[h.sub.?-1,x.sub.?]+b.sub.i where W.sub.i represents an input gate weight matrix, b.sub.i represents an input gate deviation, i.sub.??(0,1); the calculation formula of said output gate is as follows:
    o.sub.?=?(W.sub.o.Math.[h.sub.?-1,x.sub.?]+b.sub.o) wherein W.sub.o represents an input gate weight matrix, b.sub.o represents an output gate deviation o.sub.??(0,1); the calculation formula of said output memory information is as follows:
    C.sub.?=f.sub.?*C.sub.?-1+i.sub.?*tanh(W.sub.Q.Math.[h.sub.?-1,x.sub.?]+b.sub.Q) wherein W.sub.Q represents a memory unit weight matrix, b.sub.Q represents a memory unit deviation, a hidden output at a time ? is h.sub.?=o.sub.? tanh(C.sub.?), with tanh as a hyperbolic tangent function.

    9. The automatic modulation classification method based on deep learning network fusion according to claim 1, wherein said step S6 includes the following sub steps: in a deep learning training process, an optimizer being set to be Adam, a loss function being a cross entropy function, adopting a dynamic learning rate scheme with an initial learning rate set to 0.001; if no reduction of the loss function of said verification set at the tenth round of training, multiplying said learning rate by a coefficient 0.8 to improve the training efficiency; and if no reduction of the loss function of said verification set within 80 rounds of training, stopping training and saving the model.

    10. The automatic modulation classification method based on deep learning network fusion according to claim 9, wherein said cross entropy function is as follows:
    loss=??[p.sub.i log {tilde over (p)}.sub.i+(1?p.sub.i)log(1?{tilde over (p)}.sub.i)] wherein {tilde over (p)}.sub.i represents a true value of a signal state, p.sub.i represents a predicted value of a signal state, log represents a logarithmic operation.

    Description

    BRIEF DESCRIPTION OF THE DRAWINGS

    [0051] FIG. 1 is a multi-channel fusion feature (MFF) network diagram of the present invention.

    [0052] FIG. 2 is a flow chart of the present invention.

    [0053] FIG. 3 is a confusion matrix diagram of the present invention.

    [0054] FIG. 4 is a comparison diagram of classification accuracy rates of different deep learning network models according to the present invention.

    DETAILED DESCRIPTION OF SOME EMBODIMENTS

    [0055] In order to more clearly understand the technical features, purposes and effects of the present invention, we shall describe the specific embodiments of the present invention with reference to the accompanying drawings.

    [0056] The process of the invention is as shown in FIG. 2, comprising the following steps: [0057] S1, acquiring a WBFM sample signal within a RML 2016.10a data set, and selecting a proper threshold ? to separate a WBFM signal during a silence period; [0058] S2, expanding a new WBFM signal to 1000 by adopting a data enhancement method, and expanding an original data set; [0059] S3, dividing the data set expanded in the step S2 into a training set, a verification set and a test set; [0060] S4, respectively calculating amplitude, phase and a fractional order Fourier transformation result for data in the step S3; [0061] S5, building a multi-channel feature fusion (MFF) network model composed of an LSTM network and an FPN network; As shown in FIG. 1, using the training set in the step S4 as an input, which to the LSTM network is an amplitude of i.sup.th data and a phase of i.sup.th data, and which to the FPN network is an imaginary part of i.sup.th data, a real part of i.sup.th data, and a fractional order Fourier transformation result of i.sup.th data; [0062] S6, performing network model training, after the end of training, inputting verification set data into a trained network model for verification, and calculating prediction accuracy; and [0063] S7, performing parameter fine adjustment on the network model by means of the test set to improve prediction precision, and taking a final model as an automatic modulation classification model.

    [0064] Specifically, the step S1 includes the following sub steps: [0065] selecting all data samples with a WBFM label, and normalizing the zero centers of the acquired WBFM sample signals, giving the maximum value of the instantaneous amplitude spectral density

    [00005] ? max = max .Math. "\[LeftBracketingBar]" fft [ N s .Math. A ( i ) .Math. i = 1 N s A ( i ) - 1 ] 2 .Math. "\[RightBracketingBar]" , [0066] where A(i) is an instantaneous amplitude value at each sampling time, N.sub.s is the number of sampling points, fft(.Math.) is a Fourier transformation operator, max (.Math.) presents a maximum value; and [0067] selecting a proper threshold ?, on ?.sub.max>? judging that the signal is not a WBFM signal in a silence period, then acquiring the sample signal.

    [0068] Specifically, the step S2 includes the following sub step: [0069] the modulation mode of the RML 2016.10a data set being I/Q modulation, enabling a single sample signal to be represented as x.sub.i=?I,Q?; changing the single sample signal as x.sub.i=?I,?Q?, x.sub.i=??I,Q?, x.sub.i=??I,?Q?, so as to expand the WBFM signal to 1000 sample data.

    [0070] Specifically, the step S3 includes the following sub step: [0071] dividing the data set expanded in the S2 to the training set as 60%, the verification set as 20% and the test set as 20%, and randomly disarranging the raining set data.

    [0072] Specifically, the step S4 includes the following sub steps: [0073] converting IQ signals into amplitude phase information with the amplitude as follows:


    A.sub.i=?{square root over (I.sub.i.sup.2+Q.sub.i.sup.2)} [0074] where, I.sub.i and Q.sub.i represent an imaginary part of i.sup.th data and a real part of i.sup.th data, respectively, A.sub.i represents a amplitude of i.sup.th data; [0075] And then performing L2 norm normalization, where the L2 norm of an amplitude of i.sup.th data is defined as:


    A.sub.norm=?{square root over (A.sub.1.sup.2+A.sub.2.sup.2+ . . . A.sub.N.sup.2)} [0076] the amplitude after the L2 norm normalization being as follows:

    [00006] A = A i A norm ; [0077] a phase calculation formula being as follows:


    ?.sub.i=arctan(Q.sub.i/I.sub.i) [0078] wherein arctan is an arctangent function; [0079] acquiring the fractional order Fourier transformation result for data, with its calculation formula as follows:

    [00007] X p ( u ) = .Math. "\[LeftBracketingBar]" F p [ s ( t ) ] .Math. "\[RightBracketingBar]" = .Math. "\[LeftBracketingBar]" ? - ? + ? ( I i + jQ i ) K p ( t , u ) dt .Math. "\[RightBracketingBar]" K p ( t , u ) = { 1 - j cot ? e j ? [ ( t 2 + u 2 ) cot ? - 2 tu csc ? ] , ? ? n ? ? ( t - u ) , ? = 2 n ? ? ( t + u ) , ? = ( 2 n + 1 ) ? [0080] wherein F.sup.p is a fractional Fourier transformation operator, s(t) is an original signal, K.sub.p(t,u) is a conversion kernel, t is a time domain, u is a fractional order Fourier domain, ? is a rotation angle, cot is a cotangent function, csc is a cosecant function, ? is a circular constant, ?(t) is an impulse function, n is a positive integer.

    [0081] Thus, completing the extraction of amplitude, phase and fractional Fourier transformation information.

    [0082] Further, the specific method for building LSTM and FPN network structures in step S5 is as follows: [0083] building the LSTM network with an input layer, two LSTM layers, a Dense layer and an output layer, where an input data matrix is N?128?2, an output matrix is N?M, N is the number of samples, and M is the number of feature points; and [0084] building the FPN network with three input layers, three Conv2d layers and two Dense layers, where an input data matrix is N?3?128?1, an output matrix is N?M?1, N is the number of samples, and M is the number of feature points.

    [0085] Specifically, the LSTM network model further includes a forget gate, an input gate, an output gate and output memory information; the calculation formula of the forget gate is as follows:


    f.sub.?=?(W.sub.f.Math.[h.sub.?-1,x.sub.?]+b.sub.f) [0086] where W.sub.f represents a forget gate weight matrix, x.sub.? represents an input matrix at a time step length ?, h.sub.?-1 represents an output of a hidden layer at a previous time; b.sub.f represents a forget gate deviation; sigmoid function is

    [00008] ? ( x ) = 1 1 + e - x ,

    f.sub.??(0,1), with e as a natural constant; [0087] the calculation formula of the input gate is as follows:


    i.sub.?=?(W.sub.i.Math.[h.sub.?-1,x.sub.?]+b.sub.i [0088] where W.sub.i represents an input gate weight matrix, b.sub.i represents an input gate deviation, i.sub.??(0,1); [0089] the calculation formula of the output gate is as follows:


    o.sub.?=?(W.sub.o.Math.[h.sub.?-1,x.sub.?]+b.sub.o) [0090] wherein W.sub.o represents an input gate weight matrix, b.sub.o represents an output gate deviation o.sub.??(0,1); [0091] the calculation formula of the output memory information is as follows:


    C.sub.?=f.sub.?*C.sub.?-1+i.sub.?*tanh(W.sub.Q.Math.[h.sub.?-1,x.sub.?]+b.sub.Q) [0092] wherein W.sub.Q represents a memory unit weight matrix, b.sub.Q represents a memory unit deviation, a hidden output at a time ? is h.sub.?=o.sub.? tanh (C?), with tanh as a hyperbolic tangent function.

    [0093] Further, the specific method for training model in step S6 is as follows: [0094] in a deep learning training process, an optimizer being set to be Adam, a loss function being a cross entropy function, adopting a dynamic learning rate scheme with an initial learning rate set to 0.001; if no reduction of the loss function of the verification set at the tenth round of training, multiplying the learning rate by a coefficient 0.8 to improve the training efficiency; and if no reduction of the loss function of the verification set within 80 rounds of training, stopping training and saving the model.

    [0095] In the specific implementation process, we prepare a processor of NVIDIA (NVIDIA) ORCE RTX 2070 GPU and a software platform of Pycharm so as to build a simulation experiment platform.

    [0096] The optimal confusion matrix of the MFF network is shown in FIG. 3. It can be seen from FIG. 3 that the classification accuracy rates of 16 QAM signals and 64 QAM signals increase. Comparative experiments to the networks such as CNN, ResNet (Residual Network), LSTM and CLDNN (Convolutional Long Short-Term Deep Neural Network) are performed on the MFF network. The signal classification accuracy rates of different network models are shown in FIG. 4. It can be seen from FIG. 4 that the CNN network is poor in performance when processing time signal data with an average classification accuracy rate of only 78%. The ResNet and the CLDNN network have repeatedly made use of the feature information, but insufficient use of the feature information, with average classification accuracy rates of 90% and 88%, respectively. The average classification accuracy rate of the MFF network can reach 94% due to its full extraction of the time, space, deep and shallow features of the sample signal, thereby solving the problem of the signal confusion of 16 QAM signals and 64 QAM, and increasing the average classification accuracy rate.

    [0097] In the present invention, data cleaning is performed by means of a judgment method and a data enhancement method, and an automatic modulation classification method based on multi-channel feature fusion is adopted, so as to enable the obtention of feature information of a sample signal, thereby improving the average classification accuracy rate of communication signals.

    [0098] The basic principle and main features of the invention and the advantages of the invention are shown and described above. A person skilled in the art should understand that the present invention is not limited by the above embodiments, and the above embodiments and the description are merely illustrative of the principle of the present invention. Without departing from the spirit and scope of the present invention, the present invention also has various changes and improvements, and these changes and improvements fall within the scope of the present invention as claimed.