METHOD FOR DESIGNING COMPLEX-VALUED CHANNEL EQUALIZER
20230254187 · 2023-08-10
Abstract
The present invention discloses a method for designing a complex-valued channel equalizer for digital communication systems, including: constructing a channel equalizer by using a complex-valued neural network; collecting the output signal y(n)=[y(n), y(n−1), . . . , y(n−m+1)].sup.T of the nonlinear channel of the underlying digital communication system as the input of the complex-valued neural network, with s(n−τ) as the desired output; taking the mean squared error as the loss function for training the complex-valued neural network, which is optimized by the proposed adaptive complex-valued L-BFGS algorithm; and finally using the trained network to implement the channel equalizer. The present invention uses a multi-layer feedforward complex-valued neural network to construct the complex-valued channel equalizer, and proposes a new adaptive complex-valued L-BFGS algorithm for efficient training of that network.
Claims
1. A method for designing a complex-valued channel equalizer, comprising: constructing a complex-valued channel equalizer by using a complex-valued neural network, wherein the complex-valued channel equalizer is applied to a digital communication system; collecting an output signal y(n)=[y(n), y(n−1), . . . , y(n−m+1)].sup.T of a nonlinear channel of the digital communication system as an input of complex-valued neural network and s(n−τ) as a corresponding desired output, and taking the mean squared error as the loss function J(z, z*); and optimizing the loss function J(z, z*) to obtain an appropriate complex-valued neural network model, and finally using the complex-valued neural network to implement channel equalization.
2. The method for designing a complex-valued channel equalizer according to claim 1, wherein in the digital communication system, a complex-valued digital sequence of the nonlinear channel is denoted by s(n), a channel order is h, a channel mapping is f (⋅), a noiseless channel output is ŷ(n), an additive noise is v(n), an actual channel output y(n) is a superposition result of the noiseless channel output and the additive noise and is taken as the input data of the equalizer, an equalizer order is m, and a decision delay of the equalizer is assumed to be τ; a state ns of the channel is a data set generated by a limited character set transmitted through the channel, and a size of the data set is determined by the equalizer order and the channel order; a modulation order of a transmitted signal is M; and thus a transmission function of the channel is described by
3. The method for designing a complex-valued channel equalizer according to claim 1, wherein the “optimizing the loss function J(z, z*)” specifically comprises: optimizing the loss function J(z, z*) by an adaptive complex-valued L-BFGS algorithm, in the adaptive complex-valued L-BFGS algorithm, calculating and comparing an approximation degree between the inverse of quasi-Newton matrix and the inverse of Hessian matrix along a particular vector, and selecting a memory size corresponding to the highest degree of approximation as the optimal memory size at the current iteration.
4. The method for designing a complex-valued channel equalizer according to claim 3, wherein a specific implementation process of “in the adaptive complex-valued L-BFGS algorithm, calculating and comparing the approximation degree between the inverse of quasi-Newton matrix and the inverse of Hessian matrix along a particular vector, and selecting a memory size corresponding to the highest degree of approximation as the optimal memory size at the current iteration” is as follows: separately calculating
5. The method for designing a complex-valued channel equalizer according to claim 4, wherein a specific process of “appropriately adjusting the value of M according to a change trend of the optimal memory size selected in several previous iterations” comprises: given a window size W, calculating a predicted value m.sub.f,t* of the memory size at the current iteration according to a formula
6. The method for designing a complex-valued channel equalizer according to claim 4, wherein a new search direction is designed by superimposing a plurality of directions, at the t.sup.th iteration, multiple memory sizes can be selected, so that a plurality of directions are calculated, and then these directions are superimposed to obtain the search direction at the current iteration.
7. The method for designing a complex-valued channel equalizer according to claim 6, wherein a specific process “a new search direction is designed by superimposing a plurality of directions, at the t.sup.th iteration, multiple memory sizes can be selected, so that a plurality of directions are calculated, and then these directions are superimposed to obtain the search direction at the current iteration” is: given a positive integer K, calculating an average memory size according to K memory sizes m.sub.w1* , m.sub.w2* , . . . , m.sub.wK* selected at the w.sup.th iteration,
8. A computer device, comprising: a memory, a processor, and a computer program stored in the memory and configured to be executed by the processor, wherein the processor is configured to execute the program to implement the steps in the method according to claim 1.
9. A computer-readable storage medium, on which a computer program is stored, wherein a processor is configured to execute the program to implement the steps in the method according to claim 1.
10. A processor, configured to execute a program, wherein the program is executed to implement the method according to claim 1.
Description
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
[0042] The present invention is further described below with reference to the accompanying drawings and specific embodiments, to enable a person skilled in the art to better understand and implement the present invention. However, the embodiments are not used to limit the present invention.
[0043] In practice, artificial neural networks provide a promising method for implementing an equalizer. In particular, complex-valued neural networks are well suited to the design of complex-valued channel equalizers. They have attracted much attention because of their fast learning and their capability of directly processing complex signals, and research on them has increased over the past few years. In some application fields, such as image processing and complex signal processing, complex-valued neural networks have been shown to outperform real-valued neural networks with the same structure in many respects.
[0044] For a complex-valued neural network, complex gradient-type algorithms and complex Newton-type algorithms are currently the most popular training algorithms. Complex gradient descent algorithms are widely applied due to their low computational complexity. However, the objective function contains a large number of saddle points, which makes training more difficult when gradient-type algorithms are adopted. To resolve this problem, several improved algorithms have been proposed. For example, a complex step size can be adopted to change the search direction so that the training algorithm is able to escape from a saddle point. Other complex gradient algorithms have also been established; for example, the complex conjugate gradient algorithm can significantly reduce the impact of saddle points on training. Compared with gradient-type algorithms, Newton-type algorithms usually converge faster and reach higher accuracy. In the Newton method, however, the inverse of the Hessian matrix needs to be calculated, which causes high computational complexity. The quasi-Newton method addresses this problem by approximating the inverse of the Hessian matrix, thereby greatly reducing the computational cost. A common quasi-Newton method is the BFGS algorithm. Because a large-scale matrix needs to be stored, the BFGS algorithm still has very high memory consumption in practice. The improved L-BFGS algorithm only needs to save a limited set of vector pairs, which greatly reduces the memory requirement and thus makes the quasi-Newton algorithm more practical.
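As a rough illustration of why L-BFGS only needs a few stored vector pairs, the following sketch implements the standard two-loop recursion. It is not the adaptive algorithm of the present invention. The sketch assumes that complex parameter vectors are handled through the real inner product Re(a^H b); the names `rdot` and `two_loop_direction` are hypothetical.

```python
def rdot(a, b):
    """Real inner product Re(a^H b) of two complex vectors given as lists."""
    return sum((x.conjugate() * y).real for x, y in zip(a, b))


def two_loop_direction(grad, s_list, y_list, m):
    """Standard L-BFGS two-loop recursion using the m most recent pairs.

    grad   : current gradient (list of complex numbers)
    s_list : parameter differences s_k = z_{k+1} - z_k, oldest first
    y_list : gradient differences  y_k = g_{k+1} - g_k, oldest first
    Returns -H * grad, where H is the implicit inverse-Hessian
    approximation built from the stored pairs (assumed formulation).
    """
    pairs = list(zip(s_list, y_list))[-m:] if m > 0 else []
    q = list(grad)
    alphas = []
    for s, y in reversed(pairs):                 # newest pair to oldest pair
        rho = 1.0 / rdot(y, s)
        alpha = rho * rdot(s, q)
        q = [qi - alpha * yi for qi, yi in zip(q, y)]
        alphas.append((alpha, rho))
    if pairs:                                    # initial scaling gamma * I
        s, y = pairs[-1]
        gamma = rdot(s, y) / rdot(y, y)
        q = [gamma * qi for qi in q]
    for (s, y), (alpha, rho) in zip(pairs, reversed(alphas)):  # oldest to newest
        beta = rho * rdot(y, q)
        q = [qi + (alpha - beta) * si for qi, si in zip(q, s)]
    return [-qi for qi in q]
```

Only the last m pairs are ever touched, so the storage grows with m times the number of parameters rather than with the square of the number of parameters.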
[0045] The complex-valued L-BFGS algorithm is an optimization algorithm in the complex domain and can be used for solving unconstrained optimization problems with complex variables. It has been found that the memory size significantly affects the performance of the complex-valued L-BFGS algorithm. Experiments have shown that a memory size that is either too large or too small degrades the performance of the algorithm. Therefore, the present invention provides an adaptive memory size adjustment strategy for the complex-valued L-BFGS algorithm, so that the optimal memory size can be found at each iteration, and applies it to the design of a complex-valued channel equalizer. This enables the design of a high-performance complex-valued channel equalizer and effectively reduces the computational complexity of the design method.
[0046] A specific solution of the present invention is as follows:
[0047] A complex-valued equalizer is first constructed by using a complex-valued neural network. The complex-valued equalizer is applied to a digital communication system model. To further understand the channel equalization, a complete digital communication system described in the present invention is given in
[0048] A quantity of the states ns of the channel is determined by:
[0049] ns = M^{h+m}, that is, M raised to the power (h+m).
[0050] In this case, an input sequence s(n)=[s(n), . . . , s(n−m+1−h)].sup.T has ns states; after transmission through a noiseless channel, the desired channel output ŷ(n) also has ns states.
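The state count can be checked by enumeration. The following is a minimal sketch, assuming a hypothetical 4-QAM alphabet (M = 4), channel order h = 1 and equalizer order m = 2; the function name `channel_states` is illustrative only.

```python
from itertools import product


def channel_states(alphabet, h, m):
    """Enumerate every input vector [s(n), ..., s(n-m+1-h)] over a finite alphabet."""
    return list(product(alphabet, repeat=h + m))


# Hypothetical example values, not taken from the patent text.
qam4 = [1 + 1j, 1 - 1j, -1 + 1j, -1 - 1j]
states = channel_states(qam4, h=1, m=2)
n_s = len(states)  # number of distinct input vectors, M ** (h + m)
```

Each input vector has h + m entries and each entry takes one of M values, which is where the exponent comes from.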
[0051] The equalizer is used to reconstruct the sequence transmitted by the transmit end from the observed channel output vector y(n)=[y(n), y(n−1), . . . , y(n−m+1)].sup.T, which is corrupted by additive noise. This vector is a distorted signal with m delays, where m is the equalizer order.
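The pairing of observation windows with delayed symbols can be sketched as follows. This is an illustrative helper under the assumption that y and s are equal-length sequences indexed from 0 and that τ is non-negative; `build_training_pairs` is a hypothetical name.

```python
def build_training_pairs(y, s, m, tau):
    """Pair each window [y(n), y(n-1), ..., y(n-m+1)] with the target s(n - tau).

    y   : received (distorted, noisy) channel outputs
    s   : transmitted symbols, same length as y
    m   : equalizer order (window length)
    tau : decision delay, assumed non-negative
    """
    inputs, targets = [], []
    # Start where both the full window and the delayed symbol are available.
    for n in range(max(m - 1, tau), len(y)):
        inputs.append([y[n - k] for k in range(m)])
        targets.append(s[n - tau])
    return inputs, targets


# Toy data for illustration only.
y_toy = [10 + 0j, 11 + 1j, 12 + 0j, 13 - 1j, 14 + 0j]
s_toy = [0j, 1 + 0j, 2 + 0j, 3 + 0j, 4 + 0j]
X, D = build_training_pairs(y_toy, s_toy, m=3, tau=1)
```

Each row of `X` is one network input and the matching entry of `D` is its desired output.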
[0052] Then an adaptive memory size adjustment method of complex-valued L-BFGS algorithm is provided, and is used for implementing efficient learning of multi-layer feedforward complex-valued neural networks, thereby greatly improving the performance of the complex-valued equalizer. The output signal y(n)=[y(n), y(n−1), . . . , y(n−m+1)].sup.T of the nonlinear channel is collected as the input of complex-valued neural network and s(n−τ) is acquired as the desired output, and a loss function J(z,z*) is constructed by using a mean squared error. The loss function J(z,z*) is optimized by using the complex-valued L-BFGS algorithm with the adaptive memory size adjustment to obtain an appropriate complex-valued neural network, and channel equalization is finally implemented by using the complex-valued neural network. The overall block diagram of the method is shown in
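A mean squared error over complex outputs can be sketched as below. The exact normalization used by the invention is not reproduced in this text, so the plain average of squared error magnitudes is an assumption, and `mse_loss` is a hypothetical name.

```python
def mse_loss(outputs, targets):
    """Mean of |d - o|^2 over all samples, for complex outputs o and targets d."""
    errors = [d - o for o, d in zip(outputs, targets)]
    # e * conj(e) is the squared magnitude; its imaginary part is zero.
    return sum((e * e.conjugate()).real for e in errors) / len(errors)


# Toy values for illustration only.
loss_value = mse_loss([1 + 1j, 0j], [1 + 0j, 1j])
```

The loss is real-valued even though the network parameters are complex, which is why it is written as a function of both z and z*.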
[0053] In the adaptive complex-valued L-BFGS algorithm, the approximation degree between the inverse of quasi-Newton matrix and the inverse of real Hessian matrix along a particular vector is calculated and compared, and a memory size corresponding to the highest degree of approximation is selected as the optimal memory size at the current iteration. A specific implementation process of the algorithm is as follows:
are separately calculated according to the loss function J(z, z*), where
in a multi-step quasi-Newton formula are respectively calculated by using
[0054] according to the formula:
[0055] a matrix Ĥ.sub.t.sup.m is calculated, where the matrix is used for depicting the degree of approximation between the inverse of quasi-Newton matrix and the inverse of Hessian matrix; herein, the superscript H denotes conjugate transpose,
[0056] given an upper bound M, the approximation degree between Ĥ.sub.t.sup.m and the inverse of Hessian matrix is calculated for m=1, 2, . . . , M according to a formula
[0057] according to the foregoing calculated e.sub.m, a corresponding value of m achieving the minimum of e.sub.m is chosen as the optimal memory size m* of the complex-valued L-BFGS algorithm.
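The selection step itself reduces to an argmin over the candidate memory sizes. The sketch below takes the values e.sub.m as given, because the formula that produces them is not reproduced in this text; `select_memory_size` is a hypothetical name, and breaking ties in favor of the smaller memory size is an assumption.

```python
def select_memory_size(errors):
    """Return m* in 1..M that minimizes e_m, where errors[m - 1] holds e_m.

    Ties go to the smallest m (assumed tie-breaking rule).
    """
    best_index = min(range(len(errors)), key=lambda i: errors[i])
    return best_index + 1


# Toy approximation errors for M = 4, for illustration only.
m_star = select_memory_size([0.9, 0.4, 0.7, 0.4])
```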
[0058] when the optimal memory size is determined, a search direction at the current iteration is calculated with the help of:
[0059] a search direction
is calculated; and then, using a line search condition:
[0060] a step size is calculated, where k.sub.1 takes value from the interval (0, 0.5), and k.sub.2 takes value from the interval (k.sub.1, 1), to obtain a parameter update formula
The flowchart is shown in
[0061] To effectively reduce the computational complexity of the adaptive complex-valued L-BFGS algorithm, the present invention further provides an adaptive adjustment strategy for the upper bound M of memory size. The value of M may be appropriately adjusted according to a change trend of the optimal memory size found in several previous iterations (for example, previous seven iterations). The flowchart is shown in
[0062] given a window size W, a predicted value m.sub.f,t* of the memory size at the current iteration is calculated according to a formula
where m.sub.t* denotes the optimal memory size at iteration t; and
[0063] given constants c.sub.1 and c.sub.2: if m.sub.f,t*<c.sub.1M, then M=M−1; if m.sub.f,t*>(1−c.sub.2)M, then M=M+1; otherwise, M is kept unchanged.
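The adjustment of the upper bound can be sketched as follows. The prediction formula is not reproduced in this text, so a plain average over the last W optimal memory sizes is assumed here; the lower limit `M_min` and the function names are also assumptions made for illustration.

```python
def predict_memory_size(history, W):
    """Assumed predictor: mean of the last W optimal memory sizes."""
    window = history[-W:]
    return sum(window) / len(window)


def adjust_upper_bound(M, m_pred, c1, c2, M_min=1):
    """Shrink, grow, or keep the upper bound M according to the prediction.

    M_min is an assumed floor so that M never drops below 1.
    """
    if m_pred < c1 * M:            # optimum tends to sit near the bottom
        return max(M - 1, M_min)
    if m_pred > (1 - c2) * M:      # optimum tends to sit near the top
        return M + 1
    return M


# Toy history of optimal memory sizes over seven iterations.
history = [3, 4, 5, 5, 6, 6, 6]
m_pred = predict_memory_size(history, W=7)
new_M = adjust_upper_bound(6, m_pred, c1=0.3, c2=0.2)
```

A smaller M means fewer candidate memory sizes to evaluate at the next iteration, which is where the saving in computation comes from.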
[0064] To further improve the performance of the adaptive complex-valued L-BFGS algorithm, the present invention provides the design of a new search direction by superimposing a plurality of directions. At the t.sup.th iteration, a plurality of memory sizes (for example, m.sub.t1*, m.sub.t2*, . . . , m.sub.tK*) are selected, so that a plurality of directions can be calculated, and then these directions are superimposed to obtain the search direction at the current iteration. The flowchart is shown in
[0065] given a positive integer K, an average memory size is calculated according to K memory sizes m.sub.w1* , m.sub.w2* , . . . , m.sub.wK* selected in the w.sup.th iteration,
[0066] the predicted value
of the memory size at the current iteration is calculated by using an average value
[0067] M is adjusted according to the predicted value by using the foregoing adaptive adjustment method; and
[0068] K memory sizes at the t.sup.th iteration are calculated according to the modified M, to obtain K directions, and then these directions are superimposed to obtain the search direction at the current iteration.
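The superposition step can be sketched as below. The text does not reproduce how the K directions are weighted, so an unweighted sum is assumed; `direction_for` stands for any routine that returns a search direction for a given memory size, and all names are hypothetical.

```python
def average_memory_size(memory_sizes):
    """Average of the K memory sizes selected at one iteration."""
    return sum(memory_sizes) / len(memory_sizes)


def superimpose_directions(direction_for, memory_sizes):
    """Sum the directions obtained for each selected memory size (assumed unweighted)."""
    total = None
    for m in memory_sizes:
        d = direction_for(m)
        total = list(d) if total is None else [a + b for a, b in zip(total, d)]
    return total


# Toy direction routine for illustration only; a real one would run L-BFGS with memory m.
def toy_direction(m):
    return [m * (1 + 1j), -m + 0j]


sizes = [2, 3, 5]
combined = superimpose_directions(toy_direction, sizes)
mean_size = average_memory_size(sizes)
```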
[0069] The present invention constructs a complex-valued channel equalizer using a multi-layer feedforward complex-valued neural network and designs an adaptive complex-valued L-BFGS algorithm for implementing efficient learning of the model, to eventually implement channel equalization. For a given feedforward complex-valued neural network,
are first calculated by using forward propagation and backward propagation, appropriate M and the optimal memory size m* are calculated by using the foregoing method, the adjustment amounts of the weights and the bias of the feedforward complex-valued neural network are calculated through Wirtinger calculus, a step size is obtained by using a line search method, and efficient learning of the multi-layer feedforward complex-valued neural network is eventually implemented, to form a final channel equalizer.
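To illustrate how Wirtinger calculus yields parameter gradients, the following sketch uses a single linear complex neuron o = w·x + b with loss J = |d − o|², a deliberate simplification of the multi-layer network described above. For this case ∂J/∂w* = −(d − o)·x* and ∂J/∂b* = −(d − o); the code checks the first against finite differences. All names are hypothetical.

```python
def neuron_loss(w, b, x, d):
    """Squared error |d - (w*x + b)|^2 for one complex sample."""
    e = d - (w * x + b)
    return (e * e.conjugate()).real


def wirtinger_grads(w, b, x, d):
    """Conjugate Wirtinger gradients (dJ/dw*, dJ/db*) of the loss above."""
    e = d - (w * x + b)
    return -e * x.conjugate(), -e


def numeric_conj_grad(f, w, eps=1e-6):
    """Finite-difference estimate of dJ/dw* = 0.5 * (dJ/dRe(w) + 1j * dJ/dIm(w))."""
    d_re = (f(w + eps) - f(w - eps)) / (2 * eps)
    d_im = (f(w + 1j * eps) - f(w - 1j * eps)) / (2 * eps)
    return 0.5 * (d_re + 1j * d_im)


# Toy values for illustration only.
w0, b0, x0, d0 = 0.5 - 0.2j, 0.1j, 1 + 2j, -1 + 1j
gw, gb = wirtinger_grads(w0, b0, x0, d0)
gw_num = numeric_conj_grad(lambda w: neuron_loss(w, b0, x0, d0), w0)
```

For a real-valued loss, the steepest-descent update moves each parameter against its conjugate gradient, for example w ← w − η·∂J/∂w* with a step size η.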
[0070] Referring to
[0071] The key ideas of the present invention are as follows:
[0072] An adaptive adjustment method of the optimal memory size in a complex-valued L-BFGS algorithm is established by considering the approximation degree between quasi-Newton matrix and Hessian matrix. A data prediction measure is used to estimate an upper bound of the memory size, to avoid unnecessary calculation, thereby reducing the computational complexity of the design of channel equalizer. A method is proposed to build a new search direction in a multidirectional superposition manner, thereby improving the performance of the adaptive complex-valued L-BFGS algorithm. By means of the algorithm provided in the present invention, efficient learning of multi-layer feedforward complex-valued neural networks can be implemented, thereby resolving the problem of channel equalization.
[0073] The adaptive complex-valued L-BFGS algorithm provided in the present invention resolves the problem of selecting the optimal memory size, thereby overcoming the instability of the algorithm. Compared with a conventional complex-valued L-BFGS algorithm, it converges faster and reaches a more accurate solution. It also avoids unnecessary calculation, which greatly reduces the computational complexity. For the channel equalization problem, the presented algorithm achieves better performance than previous algorithms.
[0074] The foregoing embodiments are merely preferred embodiments used to fully describe the present invention, and the protection scope of the present invention is not limited thereto. Equivalent replacements or variations made by a person skilled in the art to the present invention all fall within the protection scope of the present invention. The protection scope of the present invention is as defined in the claims.