METHOD, COMPUTER PROGRAM, SYSTEM, AND COMMUNICATION DEVICE FOR OPTIMIZING THE CAPACITY OF COMMUNICATION CHANNELS

Abstract

The invention relates to a method for optimizing a capacity of a communication channel in a communication system comprising at least a transmitter (10), a receiver (11), and the communication channel (12) between the transmitter and the receiver. The transmitter (10) uses a finite set of symbols Ω={ω.sub.1, . . . , ω.sub.N} having respective positions on a constellation, to transmit a message including at least one symbol on said communication channel (11). The communication channel (11) is characterized by a conditional probability distribution ρ.sub.Y|X(y|x), where y is the symbol received at the receiver (12) while x is the symbol transmitted by the transmitter. More particularly, the conditional probability distribution ρ.sub.Y|X(y|x) is obtained, for each possible transmitted symbol x, by a mixture model using probability distributions represented by exponential functions. An optimized input distribution p.sub.x(x) is computed, based on parameters of the mixture model, to define optimized symbols positions and probabilities to be used at the transmitter for optimizing the capacity of the channel.

Claims

1. A method for optimizing a capacity of a communication channel in a communication system comprising at least a transmitter, a receiver, and said communication channel between the transmitter and the receiver, the transmitter using a finite set of symbols Ω={ω.sub.1, . . . , ω.sub.N} having respective positions on a constellation, to transmit a message including at least one symbol on said communication channel, the communication channel being characterized by a conditional probability distribution p.sub.Y|X(y|x) , where y is the symbol received at the receiver while x is the symbol transmitted by the transmitter, wherein said conditional probability distribution p.sub.Y|X(y|x) is obtained, for each possible transmitted symbol x, by a mixture model using probability distributions represented by exponential functions, and an optimized input distribution p.sub.x(x) is computed, based on parameters of said mixture model, to define optimized symbols positions and probabilities to be used at the transmitter for optimizing the capacity of the channel.

2. The method of claim 1, wherein said optimized symbols positions and probabilities are obtained at the transmitter and at the receiver.

3. The method according to claim 1, wherein the transmitter transmits messages conveyed by a signal belonging to a finite set of signals corresponding respectively to said symbols ω.sub.1, . . . , ω.sub.N, each signal being associated with a transmission probability according to an optimized input signal probability distribution corresponding to said optimized input distribution p.sub.x(x), And the transmitter takes messages to be transmitted and said optimized input signal probability distribution as inputs, and outputs a transmitted signal on the communication channel.

4. The method according to claim 3, wherein the communication channel takes the transmitted signal as an input, and outputs a received signal intended to be processed at the receiver, said conditional probability distribution p.sub.Y|X(y|x) being related thus to a probability of outputting a given signal y when the input x is fixed.

5. The method according to claim 4, wherein an estimation of said conditional probability distribution p.sub.Y|X(y|x) is taken as input, to output the optimized input signal probability distribution p.sub.x(x) to be obtained at least at the transmitter the conditional probability distribution estimation being used for computing the optimized input signal probability distribution, the conditional probability distribution estimation being approximated by said mixture model.

6. The method according to claim 4, wherein the receiver takes the received signal, the optimized input signal probability distribution p.sub.x(x) and an estimation of the channel conditional probability distribution p.sub.Y|X(y|x) as inputs and performs an estimation of a message conveyed in said received signal.

7. The method according to claim 1, wherein said mixed model follows a conditional probability distribution p.sub.Y|X(y|x) which is decomposable into a basis of probability distributions exponential functions g(y|x;θ), where θ is a parameter set, such that:
p.sub.Y|X(y|x)=Σ.sub.j=1.sup.K w.sub.jg(y|x;θ.sub.j) (E) where K is a predetermined parameter, the sets {θ.sub.j}, {w.sub.j} are parameters representing respectively a mean vector coordinates and covariance matrix parameters.

8. The method of claim 7, wherein the derivative of the probability distributions exponential functions g(y|x;θ) are given by g(y|x;θ)=h(y,θ)exp(x.sup.Ty−α(x,θ)), where h(y,θ) is a function of y and θ, and α(x,θ) is the moment generating function, x and y being vectors, such that said derivative is given by: $\frac{\partial}{\partial x} g (y | x; θ) = h (y, θ) (y - \frac{\partial}{\partial x} a (x, θ)) \exp (x^{T} y - a (x, θ))$

9. The method according to claim 7, wherein said distribution p.sub.Y|X(y|x) is approximated by a finite set of continuous functions minimizing a metric defined by Kullback-Leibler divergence, by determining parameters set {θ.sub.j},{w.sub.j} which minimize the Kullback-Leibler divergence between an analytical observation of p.sub.Y|X(y|x) and its expression given by:
p.sub.Y|X(y|x)=Σ.sub.j=1.sup.K w.sub.jg(y|x;θ.sub.j).

10. The method according to claim 1, wherein the input distribution p.sub.x(x) is represented as a list of N constellation positions as {(x.sub.1,π.sub.1), . . . , (x.sub.N,π.sub.N)}, where x.sub.i and π.sub.i denote respectively constellation positions and probability weights, And wherein said input distribution p.sub.x(x) is estimated by solving an optimization problem at the transmitter given by: $(x^{*}, π^{*}) = \underset{\hat{x}, π}{argmax} I (x, π) subject to {.Math.}_{i = 1}^{N} π_{i} = 1 {.Math.}_{i = 1}^{N} {.Math. x_{i} .Math.}^{2} π_{i} \leq P 0 < π_{i} < 1, for i = 1, .Math., N$ Where: I(x,π) is a mutual information as a function of position vector x=[x.sub.1, . . . , x.sub.N].sup.T and weight vector π=[π.sub.1, . . . , π.sub.N].sup.T, optimal values are tagged with the superscript *, and P denotes a total transmit power.

11. The method of claim 10, wherein said mixed model follows a conditional probability distribution p.sub.Y|X(y|x) which is decomposable into a basis of probability distributions exponential functions g(y|x;θ), where θ is a parameter set, such that:
p.sub.Y|X(y|x)=Σ.sub.j=1.sup.K w.sub.jg(y|x;θ.sub.j) (E) where K is a predetermined parameter, the sets {θ.sub.j},{w.sub.j} are parameters representing respectively a mean vector coordinates and covariance matrix parameters; and wherein the mutual information is expressed as: $I (x, π) = \frac{1}{M} {.Math.}_{i = 1}^{N} {.Math.}_{m = 1}^{M} π_{i} \log \frac{p_{Y | X} (y_{i, m} | x_{i})}{{.Math.}_{j = 1}^{N} π_{j} p_{Y | X} (y_{i, m} | x_{j})}, where p_{Y | X} (y | x) = {.Math.}_{j = 1}^{K} w_{j} g (y | x; θ_{j}),$ and argument y.sub.i,m are samples from the distribution p.sub.Y|X(y|x.sub.i).

12. The method of claim 11, wherein an alternating optimization is performed iteratively to calculate both p.sub.x(x) and p.sub.Y|X(y|x)=Σ.sub.j=1.sup.K w.sub.jg(y|x;θ.sub.j), so as to derive from said calculations optimized positions π.sup.(t) described from a preceding iteration t−1 to a current iteration t as follows: at first, positions π.sup.(t) are optimized for a fixed set of symbol positions x.sup.(t−1) and previous position values π.sup.(t−1); then, symbol positions x.sup.(t) are optimized for the thusly determined π.sup.(t) and previous values of x.sup.(t−1), And repeating iteratively these two steps until a stopping condition occurs on the mutual information I(x,π).

13. A computer program comprising instructions causing a processing circuit to implement the method as claimed in claim 1, when such instructions are executed by the processing circuit.

14. A system comprising at least a transmitter, a receiver, and a communication channel between the transmitter and the receiver, wherein the transmitter at least is configured to implement the method according to claim 1.

15. A communication device comprising a processing circuit configured to perform the optimization method according to claim 1.

Description

BRIEF DESCRIPTION OF DRAWINGS

[0046] More details and advantages of possible embodiments of the invention will be presented below with reference to the appended drawings.

[0047] FIG. 1 is an overview of a system according to an example of embodiment of the invention.

[0048] FIG. 2 shows possible steps of an optimization method according to an embodiment of the invention.

[0049] FIG. 3 shows schematically a processing circuit of a communication device to perform the optimization method of the invention.

DESCRIPTION OF EMBODIMENTS

[0050] Referring to FIG. 1, a system according to the present invention comprises in an example of embodiment a transmitter 10, a receiver 12, a transmission channel 11 and an input signal probability distribution optimizer 13.

[0051] The transmitter 10 transmits messages conveyed by a signal belonging to a finite set of signals, each associated with a transmission probability according to an (optimized) input signal probability distribution. The transmitter 10 takes the messages and the (optimized) input signal probability distribution as inputs, and outputs the signal to be transmitted on the channel. The channel 11 takes the transmitted signal as an input, and outputs a received signal which is processed at the receiver 12 in order to decode the transmitted message. It is characterized by a channel conditional probability distribution of the probability of outputting a given signal when the input is fixed. The probability distribution can generally be defined on a discrete or continuous input and/or output alphabet. Here, as an example, the continuous output alphabet is considered, and the probability distribution is called a probability density function in this case.

[0052] The input signal probability distribution optimizer 13 takes the conditional probability distribution estimation as an input, and outputs the optimized input signal probability distribution to the transmitter 10 and receiver 12.

[0053] It is worth noting here that the optimizer 13 can be a same module which is a part of both the transmitter and the receiver. It can be alternatively a module which is a part of a scheduling entity (e.g. a base station or other) in a telecommunication network linking said transmitter and receiver through the communication channel. More generally, a communication device such as the transmitter 10, the receiver 12, or else any device 13 being able to perform the optimization method, can include such a module which can have in practice the structure of a processing circuit as shown on FIG. 3. Such a processing circuit can comprise typically an input interface IN to receive data (at least data enabling the estimation of the conditional probability distribution), linked to a processor PROC cooperating with a memory unit MEM (storing at least instructions of a computer program according to the invention), and an output OUT to send results of optimization computations.

[0054] More particularly, the conditional probability distribution estimation is used for computing the optimized input signal probability distribution at the input signal probability distribution optimizer 13. In particular, it is shown hereafter that the optimization is made more efficient when the conditional probability distribution estimation is approximated by a mixture of exponential distributions.

[0055] The receiver 12 takes the received signal, the optimized input signal probability distribution and the estimated channel conditional probability distribution as inputs and performs an estimation of the message conveyed in the received signal.

[0056] The transmission channel 11 is represented by a model, hereafter, that follows a conditional probability distribution p.sub.Y|X(y|x) that can be decomposed into a basis of probability distributions functions p(y|x;θ), where θ is a parameter set. For example, the distribution function is the exponential family and the parameters are essentially the mean and variance for the scalar case, and more generally the mean vector and covariance matrix for the multi-variate case, such that:

p.sub.Y|X(y|x)=Σ.sub.j=1.sup.K w.sub.jp(y|x;θ.sub.j) (E)

where K, and the sets {θ.sub.j},{w.sub.j} are parameters.

[0057] For example, three examples of channels following the model can be cited hereafter.

[0058] Channels might have random discrete states when the channel fluctuates randomly in time according to discrete events, such as: [0059] interference by bursts that changes the signal to noise ratio from one transmission to another, [0060] shadowing effects that change the received signal power, [0061] an approximation of a random fading channel by a discrete distribution, where the channel coefficient α is random and follows p(α)=Σ.sub.j=1.sup.K w.sub.jδ(α−α.sub.j), where α.sub.j is one out of n possible values of the channel coefficient α occurring with a probability w.sub.j, and δ(.) is the Kronecker function. Thus, in case of Gaussian noise with variance σ.sub.η.sup.2, such fading channel leads to the probability distribution is noted as

[00004] $p_{Y | X} (y | x) = {.Math.}_{j = 1}^{K} w_{j} \frac{e^{- \frac{{.Math. y - α_{j} x .Math.}^{2}}{2 σ_{η}^{2}}}}{\sqrt{2 σ_{η}^{2}}}$

[0062] In case of channel estimation impairments (typically when the transmission channel is imperfectly known), residual self-interference is obtained on the received signal. In general, the channel model is obtained as ={circumflex over (α)}x+η−vx, which leads to:

[00005] $p_{Y | X} (y | x) = {.Math.}_{j = 1}^{K} w_{j} \frac{e^{- \frac{{.Math. y - \hat{α} x .Math.}^{2}}{2 (σ_{η}^{2} + σ_{v}^{2} {.Math. x .Math.}^{2})}}}{\sqrt{2 (σ_{η}^{2} + σ_{v}^{2} {.Math. x .Math.}^{2})}}$

[0063] Therefore, it is shown here that, from any known continuous distribution p.sub.Y|X(y|X), this distribution can be approximated by a finite set of continuous functions.

[0064] The approximation is done by minimizing a metric. One relevant metric is the Kullback-Leibler divergence that allows getting a measure of the difference between two distributions. Thus, when knowing p.sub.Y|X(y|x) analytically, it is possible to find parameters set {θ.sub.j},{w.sub.j} that minimize the Kullback-Leibler divergence between p.sub.Y|X(y|x) and an approximated expression in the form of equation (E) given above.

[0065] From an estimated histogram of p.sub.Y|X(y|x), it can be approximated by a finite set of continuous functions, in the same way as with a known continuous distribution, by using the Kullback-Leibler divergence as a metric.

[0066] The function p.sub.Y|X(y|x) is bi-variate with variables x and y which spans in general in a continuous domain.

[0067] Hereafter a focus is made on symbols x belonging to a finite alphabet Ω={ω.sub.1, . . . , ω.sub.N} of cardinality N.

[0068] It is further assumed that the derivative of the probability distributions functions g(y|x;θ) is known. For example, when g(y|x;θ) is from the exponential family, it can be written:

g(y|x; θ)=h(y,θ)exp(x.sup.Ty−α(x,θ)),

where h(y,θ) is a function of y and θ, and α(x,θ) is the moment generating function, x and y being vectors in this general case. Thus,

[00006] $\frac{\partial}{\partial x} g (y | x; θ) = h (y, θ) (y - \frac{\partial}{\partial x} a (x, θ)) \exp (x^{T} y - a (x, θ))$

[0069] For example, in the scalar Gaussian case, the probability density function is thus decomposed as follows:

[00007] $\frac{\partial}{\partial x} \frac{e^{- \frac{{.Math. y - α_{j} x .Math.}^{2}}{2 σ_{η}^{2}}}}{\sqrt{2 σ_{η}^{2}}} = \frac{2 α_{j} (y - α_{j} x)}{2 σ_{η}^{2}} \frac{e^{- \frac{{.Math. y - α_{j} x .Math.}^{2}}{2 σ_{η}^{2}}}}{\sqrt{2 σ_{η}^{2}}}$

[0070] The input signal distribution optimizer 13 relies on the estimation of the channel probability distribution in the form of equation (E). When the functional basis chosen for the estimation of the channel is the exponential family, closed form expression can be derived and the algorithm converges to the optimal solution.

[0071] The capacity approaching input is deemed to be discrete for some channels. For the case of continuous capacity achieving input (that is the case for more general channels), the input distribution p.sub.X(x) can be represented as a list of N particles as [0072] {(x.sub.1,π.sub.1), . . . , (x.sub.N,π.sub.N)},
where x.sub.i and π.sub.i denote the positions (i.e., represented by a set of coordinates or by a complex number in a 2-dimension case) and weights, respectively. The optimization problem in the transmitter can be written as

[00008] $\begin{matrix} (x^{*}, π^{*}) = \underset{x, π}{argmax} I (x, π) & (1) \end{matrix}$ $\begin{matrix} subject to {.Math.}_{i = 1}^{N} π_{i} = 1 & (2) \end{matrix}$ $\begin{matrix} {.Math.}_{i = 1}^{N} {.Math. x_{i} .Math.}^{2} π_{i} \leq P & (3) \end{matrix}$ $\begin{matrix} 0 < π_{i} < 1, for i = 1, .Math., N & (4) \end{matrix}$

[0073] Where: [0074] I(x, π) is the mutual information as a function of position vector x=[x.sub.1, . . . , x.sub.N].sup.T and weight vector π=[π.sub.1, . . . , π.sub.N].sup.T, [0075] the optimal values are shown with the superscript *, and [0076] P denotes the total transmit power constraint, which is set arbitrarily. In general, this value is defined by a power budget of the transmitter which is related to the physical limit of the power amplifier or is related to a maximum radiated power allowed by regulation.

[0077] The constraint (2) sets the total probability of particles to 1. Constraints (3) and (4) guarantee the total transmit power to be less than or equal to P, and the magnitude of particle probabilities to be positive values less than 1, respectively. The mutual information I({circumflex over (x)},π), involves an integration on continuous random variables, but can be approximated by Monte-Carlo integration (the main principle of which is to replace the expectation function, which usually involves an integration, by a generation of samples which are realizations of said random variable and an averaging of the obtained values) as

[00009] $\begin{matrix} I (x, π) = \frac{1}{M} {.Math.}_{i = 1}^{N} {.Math.}_{m = 1}^{M} π_{i} \log \frac{p_{Y | X} (y_{i, m} | x_{i})}{{.Math.}_{j = 1}^{N} π_{j} p_{Y | X} (y_{i, m} | x_{j})}, & (5) \end{matrix}$

where M denotes the number of samples (i.e., the number of realizations of the random variables generated from their probability distribution), and where

p.sub.Y|X(y|x)=Σ.sub.j=1.sup.K w.sub.jg(y|x;θ.sub.j), (6)

denoting thus a decomposition of the conditional probability p.sub.Y|X(y|x) into a basis of functions g() involving θ.sub.j.

[0078] The argument y.sub.i,m in (5) are the samples from the distribution P.sub.Y|X(y|x.sub.i).

[0079] Hereafter, an alternating optimization method is proposed, described from iteration t−1 to t as follows: [0080] at first, optimize π.sup.(t) for a fixed set of particles x.sup.(t−1) and a previous value π.sup.(t−1); [0081] then, optimize x.sup.(t) for the obtained π.sup.(t) and a previous value x.sup.(t−1).

[0082] These two steps are detailed hereafter respectively as S1 and S2. They can intervene after an initialization step S0 of an algorithm presented below.

[0083] Step S1: Optimization of π.sup.(t) for a Fixed Set of Particles x.sup.(t−1) and a Previous Value π.sup.(t−1)

[0084] The optimization in (1) is concave with respect to it for fixed values of x. So, for a given x.sup.(t−1), (1) is solved for it by writing the Lagrangian and solving for π.sub.i for i=1, . . . , N as

[00010] $\begin{matrix} π_{i}^{(t)} = \frac{\exp (β {.Math. x_{i}^{(t - 1)} .Math.}^{2} + \frac{1}{M} {.Math.}_{m = 1}^{M} \log q (x_{i}^{(t - 1)} | y_{i, m}))}{{.Math.}_{j = 1}^{N} \exp (β {.Math. x_{j}^{(t - 1)} .Math.}^{2} + \frac{1}{M} {.Math.}_{m = 1}^{M} \log q (x_{j}^{(t - 1)} | y_{j, m}))}, where & (7) \end{matrix}$ $q (x_{i} | y_{i, m}) = \frac{π_{i}^{(t - 1)} p_{Y | X} (y_{i, m} | x_{i})}{{.Math.}_{j = 1}^{N} π_{j}^{(t - 1)} p_{Y | X} (y_{i, m} | x_{j})} .$

[0085] Here, the expression

[00011] $\frac{1}{M} {.Math.}_{m = 1}^{M} \log q (x_{i}^{(t - 1)} | y_{i, m})$

is the approximation of the mathematical expectation E[logq(x.sub.i.sup.(t−1)|y.sub.i)] according to the random variable y.sub.i. The approximation is performed by the above mentioned Monte-Carlo integration, i.e., by generating M samples according to the distribution of y.sub.i. The term

[00012] $\frac{1}{M} {.Math.}_{m = 1}^{M} \log q (x_{i}^{(t - 1)} | y_{i, m})$

can be advantageously replaced by a numerical integration or a closed form expression when available.

[0086] In (7), β denotes the Lagrangian multiplier that can be determined by replacing (7) in (3) with equality for the maximum total transmit power P, and resulting to the non-linear equation

[00013] $\begin{matrix} {.Math.}_{i = 1}^{N} \exp (β {.Math. x_{i}^{(t - 1)} .Math.}^{2} + \frac{1}{M} {.Math.}_{m = 1}^{M} \log q (x_{i}^{(t - 1)} | y_{i, m})) [P - {.Math. x_{i}^{(t - 1)} .Math.}^{2}] = 0. & (8) \end{matrix}$

[0087] The non-linear equation (8) can be solved using different tools, e.g., gradient descent based approaches such as Newton-Raphson, or by selecting several values of 16, computing the left part of the equation in (8) and keeping the closest one to 0 in absolute value. And the values of π.sub.i.sup.(t) are obtained from (7).

[0088] Step S2: Optimization of x.sup.(t) for a Fixed π.sup.(t) and Previous x.sup.(t−1)

[0089] The Lagrangian for the optimization in (1) with a given weight vector π.sup.(t) can be given by:

custom-character (x;β,π.sup.(t))=I(x,π.sup.(t))+β(P−Σ.sub.i=1.sup.N |x.sub.i|.sup.2π.sub.i.sup.(t)). (9)

[0090] The position vector x is obtained such that the Kullback-Leibler divergence D(p.sub.Y|X(y|x.sub.i)∥p.sub.Y(y)) penalized by the second term in (9) is maximized. This way the value of Lagrangian custom-character (x; β, π.sup.(t), i.e., penalized mutual information, is greater than or equal to the previous values after each update of the position and weight vectors. This is achieved by gradient ascent based methods, i.e.:

[00014] $x_{i}^{(t)} = x_{i}^{(t - 1)} + λ_{t} \frac{\partial}{\partial x_{i}} D (p_{Y .Math. X} (y | x_{i}) .Math. p_{Y} (y)) |_{x^{(t - 1)}, π^{(t)}}$

where the step size λ.sub.t is a positive real number.

[0091] In the aforementioned gradient ascent based methods, it is required to compute the derivative of the term D(p.sub.Y|X(y|x.sub.i)∥p.sub.Y(y)) by Monte-Carlo integration as

[00015] $\frac{\partial}{\partial x_{i}} D (p_{Y .Math. X} (y | x_{i}) .Math. p_{Y} (y)) |_{x^{(t - 1)}, π^{(t)}} \approx \frac{1}{M} {.Math.}_{m = 1}^{M} h (y_{i, m}, x_{i}^{(t - 1)}) [1 + \log \frac{p_{Y | X} (y_{i, m} | x_{i}^{(t - 1)})}{{.Math.}_{j = 1}^{N} π_{j}^{(t)} p_{Y | X} (y_{i, m} | x_{j}^{(t - 1)})} - π_{i}^{(t)} \frac{p_{Y | X} (y_{i, m} | x_{i}^{(t - 1)})}{{.Math.}_{j = 1}^{N} π_{j}^{(t)} p_{Y .Math. X} (y_{i, m} .Math. x_{j}^{(t - 1)})}] . where h (y_{i, m}, x_{i}) = \frac{\partial}{\partial x_{i}} \log p_{Y .Math. X} (y_{i, m} | x_{i}) .$

[0092] Using (6), it can be obtained:

[00016] $h (y_{i, m}, x_{i}) = \frac{\partial}{\partial x_{i}} \log ({.Math.}_{j = 1}^{K} w_{j} g (y_{i, m} | x_{i}; θ_{j})) = \frac{w_{i} \frac{\partial}{\partial x_{i}} g (y_{i, m} | x_{i}; θ_{i})}{{.Math.}_{j = 1}^{K} w_{j} g (y_{i, m} | x_{i}; θ_{j})}$

[0093] Thus, when g(y|x;θ.sub.j) is known in a closed form and its derivative is known in a closed form, the equation can be computed.

[0094] Finally the x.sup.(t) values are obtained and the iteration can continue until a stopping condition is met. The stopping condition is for example an execution time, or if I(x.sup.(t),π.sup.(t))−I(x.sup.(t−1),π.sup.(t−1)) is lower than a given threshold, typically small.

[0095] An example of algorithm is detailed hereafter, with reference to FIG. 2.

[0096] Step S0: Initialization Step [0097] Step S01: Get the input parameters [0098] P, the power limit of the constellation [0099] The initial constellation of N symbols [0100] The stopping criterion threshold ϵ [0101] Step S02: Get the channel conditional probability distribution in the form P.sub.Y|X(y|x)=Σ.sub.j=1.sup.K w.sub.jg(y|x;θ.sub.j), where K, w.sub.j are sacalar parameters and θ.sub.j is a parameter set, and where the expression of

[00017] $\frac{\partial}{\partial x_{i}} g (y | x; θ_{j})$

is known. [0102] Step S02: Set t=0; Set all π.sub.i.sup.(0)=1/N; Set all x.sub.i.sup.(0) from an initial constellation C0; Set I.sup.(−1)=0; Set t=1

[0103] Step S1: Iterative Step t

[0104] Step S10: Samples Generation [0105] S101: For all i in [1,N], generate M samples y.sub.i,m from the distribution p.sub.Y|X(y|x=x.sub.i.sup.(t−1)) [0106] S102: For all i in [1,N], for all j in [1,N], compute p.sub.Y|X(y.sub.i,m|x.sub.j.sup.(t−1))

[0107] Step S11: Compute the Stopping Condition

[00018] $I^{(t - 1)} = \frac{1}{M} {.Math.}_{i = 1}^{N} {.Math.}_{m = 1}^{M} π_{i}^{(t - 1)} \log \frac{p_{Y | X} (y_{i, m} .Math. x_{i}^{(t - 1)})}{{.Math.}_{j = 1}^{N} π_{j}^{(t - 1)} p_{Y .Math. X} (y_{i, m} .Math. x_{j}^{(t - 1)})}$ [0108] S111: Compute [0109] S112: I.sup.(t−1)−I.sup.(t−2)<ϵ, stop the iterative algorithm (S113). Otherwise, go to S121.

[0110] Step S12: Update the Probabilities π.sub.i.sup.(t) [0111] S121: For all i in [1,N], and m in [1,M], compute

[00019] $q (x_{i}^{(t - 1)} | y_{i, m}) = \frac{π_{i}^{(t - 1)} p_{Y | X} (y_{i, m} .Math. x_{i}^{(t - 1)})}{{.Math.}_{j = 1}^{N} π_{j}^{(t - 1)} p_{Y | X} (y_{i, m} | x_{j}^{(t - 1)})}$ [0112] S122: Compute β by solving:

[00020] ${.Math.}_{i = 1}^{N} \exp (β {.Math. x_{i}^{(t - 1)} .Math.}^{2} + \frac{1}{M} {.Math.}_{m = 1}^{M} \log q (x_{i}^{(t - 1)} .Math. y_{i, m})) [P - {.Math. x_{i}^{(t - 1)} .Math.}^{2}] = 0$ [0113] For example by using a Newton-Raphson descent, and/or [0114] by using a line-search strategy (taking several β values, computing the above expression and selecting the closest to 0); [0115] S123: For all i in [1,N], compute

[00021] $π_{i}^{(t)} = \frac{\exp (β {.Math. x_{i}^{(t - 1)} .Math.}^{2} + \frac{1}{M} {.Math.}_{m = 1}^{M} \log q (x_{i}^{(t - 1)} .Math. y_{i, m}))}{{.Math.}_{j = 1}^{N} \exp (β {.Math. x_{j}^{(t - 1)} .Math.}^{2} + \frac{1}{M} {.Math.}_{m = 1}^{M} \log q (x_{j}^{(t - 1)} .Math. y_{i, m}))},$

[0116] Step S2: Update the Symbols x.sub.i.sup.(t) Position with New π.sub.i.sup.(t) and Previous x.sub.i.sup.(t−1) [0117] S21: For all i in [1,N], for all j in [1,N], compute

[00022] $\frac{\partial}{\partial x_{i}} g (y_{i, m} | x_{i}; θ_{i}),$

which is obtained from the known expression of

[00023] $\frac{\partial}{\partial x_{í}} g (y | x; θ_{i})$

by substituting y by y.sub.i,m and x by x.sub.i [0118] S22: For all i in [1,N], compute

[00024] $x_{i}^{(t)} = x_{i}^{(t - 1)} + λ_{t} \frac{1}{M} {.Math.}_{m = 1}^{M} h (y_{i, m}, x_{i}^{(t - 1)}) [1 + \log \frac{p_{Y .Math. X} (y_{i, m} | x_{i}^{(t - 1)})}{{.Math.}_{j = 1}^{N} π_{j}^{(t)} p_{Y | X} (y_{i, m} .Math. x_{j}^{(t - 1)})} - π_{i}^{(t)} \frac{p_{Y | X} (y_{i, m} .Math. x_{i}^{(t - 1)})}{{.Math.}_{j = 1}^{N} π_{j}^{(t)} p_{Y | X} (y_{i, m} | x_{j}^{(t - 1)})}]$

where h(y.sub.i,m,x.sub.i.sup.(t−1)) is the value of the function

[00025] $h (y_{i, m}, x_{i}) = \frac{w_{i} \frac{\partial}{\partial x_{i}} g (y_{i, m} | x_{i}; θ_{i})}{{.Math.}_{j = 1}^{K} w_{j} g (y_{i, m} | x_{i}; θ_{j})} for x_{i} = x_{i}^{(t - 1)}$

[0119] Next step S3 is an incrementing of t to loop, for a next iteration, to step S101.

[0120] An artificial intelligence can thus be programmed with such an algorithm to optimize the capacity of a given communication channel (one or several communication channels) in a telecommunication network.

METHOD, COMPUTER PROGRAM, SYSTEM, AND COMMUNICATION DEVICE FOR OPTIMIZING THE CAPACITY OF COMMUNICATION CHANNELS

Assignee

Inventors

Cpc classification

Classification Explorer

H04L25/03171

ELECTRICITY

Classification Explorer

H04L27/3405

ELECTRICITY

Classification Explorer

H04L1/0001

ELECTRICITY

International classification

Classification Explorer

H04L27/34

ELECTRICITY

Classification Explorer

H04L1/00

ELECTRICITY

Classification Explorer

H04L25/03

ELECTRICITY

Abstract

Claims

Description