Method for recovering original signal in reduced complexity DS-CDMA system

10404318 ยท 2019-09-03

Assignee

Inventors

Cpc classification

International classification

Abstract

Disclosed is a method for recovering an original signal in a DS-CDMA system based on complexity reduction. In such a method, first, a partial derivative for rMLL is calculated by using a partial derivative generated by applying fast Fourier transform (FFT) to a reduced negative marginal log likelihood (rMLL) obtained by applying a law of log determinant to a Gaussian process regression (GPR) scheme used for the multi-user detection and thereafter, integrating stochastic gradient descent (SGD). Thereafter, the rMLL is calculated by using the partial derivative for the rMLL and a hyper-parameter is updated to a convergence point until an error gap is converged by repeated calculation of the rMLL. Next, a kernel function used for estimating a matched filter is calculated by using the hyper-parameter estimated through the convergence and the original signal for each of multi-users is recovered by using the kernel function.

Claims

1. A method for recovering an original signal through multi-user detection in a synchronous mobile communication system, comprising: calculating a partial derivative of a reduced negative marginal log likelihood (rMLL), thereafter, integrating stochastic gradient descent (SGD) by applying a partial derivative to a fast Fourier transform (FFT) of the rMLL, the rMLL is obtained by applying a law of log determinant to a Gaussian process regression (GPR) scheme used for the multi-user detection; calculating the rMLL by using the partial derivative for the rMLL; updating a hyper-parameter to a convergence point until an error gap is converged by repeated calculation of the rMLL; calculating a kernel function used for estimating a matched filter by using the hyper-parameter estimated through the convergence point; and recovering the original signal for each of multi-users by using the kernel function.

2. The method of claim 1, wherein: the partial derivative follows an relationship equation below, 1 rMLL = - 2 x ~ k 2 ( i ) exp ( 2 2 2 2 2 ) 2 1 3 Where rMLL ( ) = 1 2 n .Math. k x ^ k 2 ( i ) SE ( k ) and represents Fourier transform for the rMLL, .sub.1 represents an output-scale size, .sub.2 represents a time-scale of a received signal, y, from one moment up to a next moment, is a frequency representation of a received signal, y, in a frequency domain, a tilde symbol represents Fourier transform of an original signal x(i) in {tilde over (x)}(i), k, n, i are integers, and custom character.sub.SE()=.sub.2.sub.1.sup.2{square root over (2)}exp (2.sup.2.sup.2.sub.2.sup.2) and represents Fourier transform for the kernel function.

3. The method of claim 2, wherein: the error gap (RMSE) by repeated computation of the rMLL is evaluated through a relationship equation below, RMSE = .Math. j = 1 n j ( k ) - j ( k - 1 ) n Where custom character.sub.j.sup.(k) and custom character.sub.j.sup.(k-1) represent a value of the rMLL at a target location j after k.sup.th and (k1).sup.th repetitions, respectively, and n represents the number of repetitions, and j is an integer.

4. The method of claim 3, wherein: the update is performed by using the relationship equation below, 2 ( k ) 2 ( k - 1 ) + ( k ) 2 ( k - 1 ) rMLL 1 ( k ) 1 ( k - 1 ) + ( k ) 1 ( k - 1 ) rMLL Where (k)=1/(k+1) represents a Robbins-Monroe decay function for k.sup.th repetition.

5. The method of claim 4, wherein: the kernel function follows the relationship equation below, k ( y i , y j ) = 1 2 exp ( - .Math. y i - y j .Math. 2 2 2 2 ) .

6. The method of claim 4, wherein: applying the law of log determinant to the Gaussian process regression (GPR) scheme is convergence to a constant using a law of log determinant log |P| while minimizing the following relationship equation which is a negative log likelihood for estimating a hyper-parameter set () constituted by .sub.1 and .sub.2, 1 and 2 . - log p ( x ( i ) | y , ) = 1 2 x T ( i ) P - 1 x ( i ) + 1 2 log .Math. P .Math. + n 2 log ( 2 ) Where p(x(i)|y, w)=custom character(x(i)|.sup.Tw, .sub.noise.sup.2I) and represents a joint Gaussian distribution, the matched filter w is a random variable and follows the Gaussian distribution like p(w)=custom character(w|0, .sub.w.sup.2I) which is a probability of w, .sub.noise and .sub.w represent standard deviations of noise and the matched filter, respectively, I.sub.n represents a unit matrix having a size of n, =[(y.sub.1), (y.sub.2)], . . . , (y.sub.n) represents a vector of non-linear mapping to a higher dimensional space for a received signal) x(i), and P represents a covariance matrix.

Description

DESCRIPTION OF THE DRAWINGS

(1) FIG. 1 is a diagram illustrating a synchronous DS-CDMA system model to which the present invention is applied.

(2) FIG. 2 is a flowchart of a method for recovering an original signal in a DS-CDMA system according to an exemplary embodiment of the present invention.

MODE FOR INVENTION

(3) In the following detailed description, only certain exemplary embodiments of the present invention have been shown and described, simply by way of illustration. As those skilled in the art would realize, the described embodiments may be modified in various different ways, all without departing from the spirit or scope of the present invention. Accordingly, the drawings and description are to be regarded as illustrative in nature and not restrictive. Like reference numerals designate like elements throughout the specification.

(4) Throughout the specification, unless explicitly described to the contrary, the word comprise and variations such as comprises or comprising, will be understood to imply the inclusion of stated elements but not the exclusion of any other elements. In addition, the terms -er, -or and module described in the specification mean units for processing at least one function and operation and can be implemented by hardware components or software components and combinations thereof.

(5) First, an exemplary embodiment of the present invention may be applied to a synchronous DS-CDMA system illustrated in FIG. 1. In such a system, all users transmit symbols at the same symbol rate. In such a system, symbol k is transmitted at moment j. Each user's symbol s.sub.j(l) is multiplied by a spreading code c.sub.l. In this case, the spreading code c.sub.l is a sequence of n.sub.S pseudorandom binary values considered as a chip. Resulting signals are amplified by different gains a.sub.l through a multiplier 11 of a transmitter 10 and then summed and transmitted in various forms via a summer 12.

(6) The transmitted signal is received via a channel response h(z) 20 in a chip period. In this case, additive white Gaussian noise (AWGN) is added during transmission and is indicated through a summer 12 in a system model in FIG. 1.

(7) Meanwhile, a multi-user detector (MUD) 31 of a receiver 30 restores a transmission bit for a specific user by using the received signal, that is, a chip signal output through the summer 12, and outputs the restored transmission bit as custom character

(8) A. Assumption

(9) An input data set having n training points is referred to as D={x.sub.t,y.sub.t|.sub.t=1, . . . , n}. Herein, {x.sub.tcustom character.sup.d} represents a vector of an original signal and y.sub.t represents a column vector of the received signal at a time step t. A relationship of the vectors is shown in [Equation 1].
y.sub.t=SAx.sub.t+n.sub.t[Equation 1]

(10) Where S represents a UV matrix (each column of such a matrix containing a spreading code for each user). A represents a VV diagonal matrix and contains a size for a user. The size represents a fading degree of the signal transmitted over a channel (the fading degree indicates how far the user is from the receiver). Last, n.sub.t represents the additive white Gaussian noise (AWGN) added to the channel with time progress.

(11) In the receiver, an original signal x.sub.t(i) of an i-th i.sup.th user needs to be restored as shown in [Equation 2].
{circumflex over (x)}.sub.t(i)=sgn(w.sub.i.sup.Ty.sub.t)[Equation 2]

(12) Where w.sub.i represents is a matched filter for the i.sup.th user custom character. Event though w.sub.i is an optimal non-linear vector, such a vector may be estimated as shown in [Equation 3] by using an MMSE method.

(13) w i * = argmin w i �� [ ( x t ( i ) - w i T y t ) 2 ] = C yy - 1 C yx [ Equation 3 ]

(14) Where C.sub.yy=custom character[y.sub.ty.sub.t.sup.T] represents an autocorrelation of the received vector and C.sub.yx=custom character[y.sub.tx.sub.t(i)] represents a cross-correlation between the received vector and the original. [Equation 3] is known as distributed MMSE and may be solved without finding spreading sequences of other users. However, the problem of the solving means is that a huge training data set for an inverse matrix is required and high computational complexity is also required.

(15) B. Deriving MUD of Gaussian Process

(16) =[(y.sub.1), (y.sub.2)], . . . , (y.sub.n) represents the vector of non-linear mapping to a higher dimensional space of the received signal and () represents a corresponding mapping function. When an original signal vector x(i)=[x.sub.1(i), x.sub.2(i), . . . , x.sub.n(i)].sup.T is applied to the received signal vector y=[y.sub.1, y.sub.2, . . . , y.sub.n], the original signal vector to the received signal vector has a joint Gaussian distribution as shown in [Equation 4].
p(x(i)|y,w)=custom charactercustom character(x(i)|.sup.Tw,.sub.noise.sup.2I)[Equation 4]

(17) Where the matched filter w is actually a random variable. A probability of w follows a Gaussian distribution like p(w)=custom character(w|0,.sub.w.sup.2I) and .sub.noise and .sub.w represent standard deviations of noise and the matched filter, respectively and I.sub.n represents a unit matrix having a size of n. By applying a Bayes' law to [Equation 4], a posterior distribution of w is computed as shown in [Equation 5].

(18) p ( w | x ( i ) , y ) = p ( w ) p ( x ( i ) | w , y ) p ( x ( i ) | y ) [ Equation 5 ]

(19) Theoretically, [Equation 3] may be transformed into a nonlinear form using maximum a posterior (MAP) estimation of the random variable w for [Equation 5]. The transformation is expressed as shown in [Equation 6].

(20) w * = argmin w { .Math. x ( i ) - T w .Math. 2 + .Math. w .Math. 2 } [ Equation 6 ]

(21) Where =(.sub.noise/.sub.w).sup.2. A equation term w.sup.2 is integrated into a MAP as a regularizer to skip an over-fitting problem. By discovering w*, the estimation of an original signal {circumflex over (x)}(i) may be obtained as shown in [Equation 7].
{circumflex over (x)}(i)=k.sup.TP.sup.1x(i)[Equation 7]

(22) Where in the case of k(y.sub.i,y.sub.j)=((y.sub.i).sup.T(y.sub.j)), k=[k(y,y.sub.1), k(y,y.sub.2), . . . , k(y,y.sub.n)] represents a kernel function of the non-linear transformation and P=K+.sub.noise.sup.2I (where K represents a covariance matrix which is K.sub.ij=k(y.sub.i,y.sub.j))). Due to a priority of a processing speed, a square exponential kernel function is adopted to compute the estimate of the original signal. Such a kernel function is shown in [Equation 8].

(23) 0 k ( y i , y j ) = 1 2 exp ( - .Math. y i - y j .Math. 2 2 2 2 ) [ Equation 8 ]

(24) Where .sub.1 represents an output-scale size and .sub.2 represents a time-scale of y from one moment to a next moment. A set ={.sub.1,.sub.2} is known as a set of hyper-parameters. Next, the estimation of may be computed as shown in [Equation 9].

(25) * = argmin p ( x ( i ) | y , ) [ Equation 9 ]

(26) In order to solve [Equation 9], O(n.sup.3) for the computational complexity and O(n.sup.2) for storage when the size of the storage is n are assumed. Most of the complexity is caused by computing the inverse matrix and a log matrix. Obviously, such a disadvantage is a burden on the DS-CDMA system. In order to solve such a problem, a complexity reduction method is proposed to significantly accelerate a computation process. By using such a method, the computation and storage complexity corresponds to O(n log n) and O(n), respectively.

(27) C. Reduction of Complexity

(28) The proposed complexity reduction method is a combination of three techniques and three techniques are fast Fourier transform (FFT), a law of log determinant and application of stochastic gradient descent (SGD). By definition, the hyper-parameter set * may be estimated as shown in [Equation 10] by minimizing negative log likelihood log p(x(i)|y,).

(29) - log p ( x ( i ) | y , ) = 1 2 x T ( i ) P - 1 x ( i ) + 1 2 log .Math. P .Math. + n 2 log ( 2 ) [ Equation 10 ]

(30) Due to the high complexity for computing the inverse matrix P.sup.1, it is necessary to develop an approximation method. It may be a better solving means to approximately minimize an upper bound of the equation term in terms of the approximation, instead of trying to minimize the negative log likelihood. Analytically, in [Equation 10], a better computation focuses on two terms, that is, a data-fit term expressed as x.sup.T(i)P.sup.1x(i) and the log determinant log |P|. First, a simplification derivation needs to be applied to the terms to reduce the equation. To this end, the law of the log determinant is used to calculate the log determinant {circumflex over (P)} of the empirical covariance matrix {circumflex over (P)}, which simplifies [Equation 10] to [Equation 11].

(31) - log p ( x ( i ) | y , ) = 1 2 x T ( i ) P - 1 x ( i ) + 1 2 log P ^ + n 2 log ( 2 ) [ Equation 11 ]

(32) Where {circumflex over (P)} is calculated according to [Equation 12] based on the empirical covariance matrix and a constant .

(33) P ^ = log .Math. P ^ .Math. - w . r . t P ^ = 1 n .Math. k = 1 n + 1 [ x k ( i ) - x _ ( i ) ] [ x k ( i ) - x _ ( i ) ] T = ( n 2 ) - log ( n 2 ) [ Equation 12 ]

(34) Where () represents a Digamma function. After multiple recalculation, the term {circumflex over (P)} a converges to the constant according to a central limit theorem. Minimizing the negative log marginal likelihood after the passage of time due to such convergence may involve only minimizing the reduced negative marginal log likelihood (rMLL) as shown in [Equation 13].

(35) - log p ( x ( i ) | y , ) = 1 2 x T ( i ) P - 1 x ( i ) [ Equation 13 ]

(36) A gap of the approximation step with respect to the actual solving means may be very small and may be measured using a mean square error of [Equation 14].

(37) �� ( P ^ - log .Math. P .Math. ) 2 = - 2 log ( 1 - 1 n ) [ Equation 14 ]

(38) Nevertheless, solving the inverse matrix P in [Equation 13] still costs computationally more. Therefore, another method is required in order to achieve the object. Since the covariance matrix P has a positive-definite value, it is possible to perform the transformation using FFT. Such a technique is used for taking the calculation from a spatial-temporal domain to a frequency domain. It is worth mentioning that the cost of the FFT is only O(n log n). Obviously, the cost is much better than the method in the related art. Hereinafter, such transformation will be described in detail.

(39) First, the squared exponent kernel k(y.sub.i,y.sub.j) in Equation (8) needs to be rewritten into a Fourier transform representation as shown in [Equation 15].
custom character.sub.SE()=.sub.2.sub.1.sup.2{square root over (2)}exp(2.sup.2.sup.2.sub.2.sup.2)[Equation 15]

(40) Where is a frequency representation of the received signal y in the frequency domain. It is assumed that generates a function {tilde over (P)}=P.sup.1. Under the frequency domain, the Parseval theorem is applied to derive the Fourier transform for [Equation 13].

(41) rMLL ( ) = ( - log p ( x ( i ) | y , ) ) = 1 2 n x T ( i ) * x o ( i ) [ Equation 16 ]

(42) Where a tilde symbol represents the Fourier transform of x(i) in {tilde over (x)}(i) and x.sub.o(i) represents a data vector in a periodic domain. In a next step, a convolution theorem is continuously applied to a limit custom character.sub.SE1 and last Fourier transform of rMLL may be expressed as shown in [Equation 17].

(43) rMLL ( ) = 1 2 n .Math. n k * x ~ k 2 ( i ) = 1 2 n .Math. k x ^ k 2 ( i ) SE ( k ) [ Equation 17 ]

(44) In the form of [Equation 17], which is the set of hyper-parameters may be estimated using a slope-based technique. In this case, the stochastic gradient descent (SGD) is selected because of a characteristic that the SGD is less sensitive to fast convergence and local minima. In order to integrate the SGD, a partial derivative of [Equation 17] is required for each hyper-parameter. Such an equation is given by [Equation 18].

(45) 2 rMLL = x ~ k 2 ( i ) exp ( 2 2 2 2 2 ) ( 2 2 3 / 2 2 1 2 - 1 2 2 2 2 1 ) 1 rMLL = - 2 x ~ k 2 ( i ) exp ( 2 2 2 2 2 ) 2 1 3 [ Equation 18 ]

(46) Subsequently, an update procedure is applied to update the hyper-parameters to corresponding convergence points.

(47) Such a process is expressed as shown in [Equation 19].

(48) 0 2 ( k ) 2 ( k - 1 ) + ( k ) 2 ( k - 1 ) rMLL 1 ( k ) 1 ( k - 1 ) + ( k ) 1 ( k - 1 ) rMLL [ Equation 19 ]

(49) Where (k)=1/(k+1) represents a Robbins-Monroe decay function for k.sup.th repetition. Such a function is chosen instead of exact line search or backtracking line search or backtracking line search primarily due to a performance problem. Further, in order to control the number of update repetitions, an error function is defined based on a root mean square error (RMSE) method to measure the convergence. It should be noted that the RMSE method is stricter than the means square error (MSE) method which is generally used. By using such an error function, an error gap between a current repetition value and a previous value may be evaluated as shown in [Equation 20].

(50) RMSE = .Math. j = 1 n j ( k ) - j ( k - 1 ) n [ Equation 20 ]

(51) Where custom character.sub.j.sup.(k) and custom character.sub.j.sup.(k-1) represent a value of rMLL at a target location j after k.sup.th and (k1).sup.th repetitions, respectively. In the proposed method, an RMSE threshold is limited to 10.sup.11 which yields a solving means that is close to an actual value. Obviously, the computation may be performed without the inverse matrix. By the end of the method, a required set * of hyper-parameters is obtained. In addition, due to the fact that the method does not need to maintain a historical covariance matrix, which is primarily used to inverse the covariance matrix, the computational complexity is only O(n log n) and the storage complexity is O(n).

(52) Hereinafter, a method for recovering the original signal in the DS-CDMA system using various equations derived through the above process will be described.

(53) FIG. 2 is a flowchart of a method for recovering an original signal in a DS-CDMA system according to an exemplary embodiment of the present invention.

(54) Referring to FIG. 2, first, a receiver 30 collects a reception chip received through a channel 20 from a transmitter 10 (S100).

(55) Thereafter, the hyper-parameter to be estimated is initialized in the exemplary embodiment of the present invention (S110).

(56) Next, a partial derivative of rMLL is calculated using a finally derived [Equation 18] by applying fast Fourier transform, a law of log determinant, and stochastic gradient descent (SGD) to GPR according to the exemplary embodiment of the present invention (S120).

(57) Subsequently, the rMLL is calculated using the calculated partial derivative result (S130).

(58) Then, the RMSE is evaluated using [Equation 20] that calculates the error gap between the current repetition value and the previous value (S140).

(59) After determining whether an RMSE evaluation result converges to a preset threshold value (S150), if the RMSE evaluation result does not converge to the preset threshold value, the update is performed using [Equation 19] to update the RMSE evaluation result to the convergence point corresponding to the hyper-parameter (S160).

(60) However, when it is determined in step S150 that the RMSE evaluation result converges to the preset threshold value, the kernel function is calculated using [Equation 8] (S170) and the original signal for a specific user is finally restored using the calculated kernel function (S180).

(61) As described above, in the exemplary embodiment of the present invention, by applying the fast Fourier transform, the law of the log determinant, and the stochastic gradient descent (SGD) to the existing GPR, the regression may be significantly accelerated so that the computational complexity and storage complexity in the case of using the existing GPR are reduced and better BER is maintained.

(62) While this invention has been described in connection with what is presently considered to be practical example embodiments, it is to be understood that the invention is not limited to the disclosed embodiments, but, on the contrary, is intended to cover various modifications and equivalent arrangements included within the spirit and scope of the appended claims.