Blind channel estimation method for an MLSE receiver in high speed optical communication channels

Abstract

A method for performing blind channel estimation for an MLSE receiver in a communication channel, according to which Initial Metrics Determination Procedure (IMDP) is performed using joint channel and data estimation in a decision directed mode. This is done by generating a bank of initial metrics that assures convergence, based on initial coarse histograms estimation, representing the channel and selecting a first metrics set M from the predefined bank. Then an iterative decoding procedure is activated during which, a plurality of decision-directed adaptation learning loops are carried out to perform an iterative histograms estimation procedure for finely tuning the channel estimation. Data is decoded during each iteration, based on a previous estimation of the channel during the previous iteration. If convergence is achieved, ISI optimization that maximizes the amount of ISI that is compensated by the MLSE is performed.

Claims

1. A method for performing blind channel estimation for an MLSE receiver in high speed optical communication channel, comprising: a) performing Initial Metrics Determination Procedure (IMDP) using joint channel and data estimation in a decision directed mode, by: a.1) generating a bank of initial metrics that with at least one metric having convergence tendency, based on an initial coarse histograms generated by a set of FIR filters representing parameters of said channel; a.2) selecting a first metric from said bank of initial metrics; a.3) activating an iterative decoding procedure during which, a plurality of decision-directed adaptation learning loops are carried out for a selected metric, to perform an iterative histograms estimation for finely tuning the channel estimation, while during each iteration, decoding samples of the signal received via said channel by an MLSE decoder, based on a previous estimation of said channel during the previous iteration; a.4) checking whether the resulting metrics are converged using sampled standard deviation of the central moments, and if convergence is not achieved, selecting the next metrics set from said bank, otherwise; a.5) performing ISI optimization by said MLSE receiver using metrics for which convergence has been achieved; b) if the initial metrics bank is run out of metrics sets, repeating said IMDP over again; and c) using said decision-directed adaptation loops for tracking variations of said channel during steady state operation.

2. A method according to claim 1, wherein the checking whether the resulting metrics are converged is performed using a Z-test.

3. A method according to claim 1, wherein the convergence tendency of the histogram set is monitored by using the sampled standard deviation of the central moments after a predetermined number of iterations.

4. A method according to claim 1, wherein the convergence tendency of the histogram set is monitored, based on a training sequence.

5. A method according to claim 1, wherein the ISI optimization is performed by: a) collecting several channel estimates, while each time, setting a different Match Point (MP)-shift between the stream of ADC samples and the stream of the corresponding decision bits; and b) selecting the MP-shift that yields the minimal variances-average of the histograms.

Description

BRIEF DESCRIPTION OF THE DRAWINGS

(1) The above and other characteristics and advantages of the invention will be better understood through the following illustrative and non-limitative detailed description of preferred embodiments thereof, with reference to the appended drawings, wherein:

(2) FIG. 1a (prior art) illustrates an example of 4-state trellis diagram

(3) FIG. 1b shows the architecture of blind MLSE equalizer;

(4) FIG. 2 shows an Initial Metrics Determination Procedure (IMDP), according to an embodiment of the invention;

(5) FIGS. 3a-3c show FIR filter examples for generation of metrics bank;

(6) FIG. 4 shows a block diagram an experimental setup used;

(7) FIGS. 5a and 5b illustrate two normalized histograms during Phase # 1 of IMDP, for a channel memory depth of one symbol;

(8) FIGS. 6a-6c show three normalized histogram sets obtained after phase # 2 of IMDP;

(9) FIGS. 7a-7c show IMDP convergence monitoring during phase #2;

(10) FIG. 7d shows IMDP convergence monitoring during phase #3;

(11) FIGS. 8a-8e show the results of ISI optimization during phase #4;

(12) FIG. 9 illustrates Phase # 1 of IMDP Normalized histogram for a 40 km link;

(13) FIGS. 10a-10d present Phase #2 of the IMDP, which consists of 8 iterations;

(14) FIGS. 11a-11d show the convergence process during phase #2 of the IMDP, for a 40 km long optical link;

(15) FIGS. 12a-12e show the histogram sets representing the outcome of the ISI optimization Phase # 4 of IMDP for a 40 km long optical channel; and

(16) FIG. 13 shows Experimental BER curves comparing the training and the IMDP.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

(17) The present invention proposes a novel, simple and fast blind channel estimation method for direct-detection optical systems, based on blind channel acquisition algorithm, for MLSE equalization in high speed optical communications. It performs joint channel and data estimation in decision directed mode.

(18) The blind channel acquisition algorithm is referred herein as Initial Metrics Determination Procedure (IMDP). The initialization of the IMDP is based on the approximate Discrete Time Equivalent (DTE) model, exploiting the most relevant physical properties of the fiber and the nonlinear photo-detector.

(19) Blind MLSE Architecture and Decoding Principles

(20) For a non-coherent system, maximum likelihood sequence estimation is proven to be the most effective stochastic technique for mitigating optical channel impairments such as chromatic dispersion and polarization mode dispersion. While CD is a deterministic phenomenon for a given link, PMD is stochastic in nature, and therefore an adaptive equalizer that performs PMD tracking is required for proper estimation. Moreover, the adaptation properties of the MLSE can be also exploited for CD compensation when the amount of CD is not perfectly known. Basically, expensive tunable optical dispersion compensation may be replaced by the adaptive MLSE. To ensure sufficient tracking, the adaptation rate must be fast enough, comparing to temporal variations of the channel. Since PMD changes in the scale of 100 sec-1 m sec, the adaptation rate must be at least ten times faster, meaning that every 10 sec a new channel estimation must be obtained.

(21) The channel estimates are called metrics, and are obtained by taking the (negative) logarithm of the conditional probability density functions (PDFs) of the received samples r.sub.n given the transmitted sequence [a.sub.n, a.sub.n1, . . . , a.sub.nN.sub.isi.sub.+1] of N.sub.isi consecutive symbols:
M.sub.i(r.sub.n|a.sub.n,a.sub.n1, . . . , a.sub.nN.sub.isi.sub.+1)=log(f.sub.channel(r.sub.n|a.sub.n,a.sub.n1, . . . , a.sub.nN.sub.isi.sub.+1)), l=0,1, . . . , V.sup.N.sup.isi.sup.+1[Eq. 1]
where V represents the vocabulary size at the receiver (Rx) side.

(22) The key idea of the MLSE processor is to choose the path .sub.opt with the smallest running metric .sub.l.sup.(k) among V.sup.N candidate sequences of length N:

(23) $\begin{matrix} _{opt} = \min_{0 k V^{N}} {_{l}^{(k)}}_{l}^{(k)} = {.Math.}_{n = 0}^{N - 1} M_{l}^{(k)} (r_{n} .Math. a_{n}, a_{n - 1}, .Math., a_{n - N_{isi} + 1}) & [Eq . 2] \end{matrix}$
and produce the most likely sequence by tracing the trellis back. Practical implementations often resort to the computationally efficient Viterbi algorithm. Here, the Histogram Method is used to approximate the PDFs in [Eq. 1]. Since blind equalization is pursued, the histograms are collected in decision directed manner, as shown on FIG. 1b.

(24) In FIG. 1b, The data path consists of the MLSE decoder which processes the samples coming from the Analog-To-Digital Converter (ADC), based on current channel estimation, and passes the outcome bits (or symbols) to the data aggregator for further processing. At the control path, there are three blocks that carry the channel estimation task in the following way. First, the properly delayed incoming samples are attributed to the output of the MLSE decoder, denoted here as message. Each incoming sample assigned to a group of N.sub.isi+1 consequent bits (or symbols) in the message form an event. Next, the events are counted, and histogram set H, containing V.sup.N.sup.isi.sup.+1 branches is obtained by:
H={H.sub.l(r.sub.n,|a.sub.n,a.sub.n1, . . . , a.sub.nN.sub.isi.sub.+1), l=0,1, . . . , V.sup.N.sup.isi.sup.+1}
H.sub.l(r.sub.n|a.sub.n,a.sub.n1, . . . , a.sub.nN.sub.isi.sub.+1)f.sub.channel(r.sub.n|a.sub.n,a.sub.n1, . . . , a.sub.nN.sub.isi.sub.+1)[Eq.3]

(25) The signal is quantized to N.sub.ADC bits; therefore, each histogram consists of at most 2.sup.N.sup.ADC bins. Finally, after a proper normalization (the sum of all bins in each histogram is unity), and log operation, the branch metrics given by
M={M.sub.l(r.sub.n|a.sub.n,a.sub.n1, . . . , a.sub.nN.sub.isi.sub.+1), l=0, . . . , V.sup.N.sup.isi.sup.+1}
are obtained, thereby forming the current channel estimate. In the steady state (tracking mode), the histograms, and thus the metrics, are updated iteratively, based on the observed data.

(26) Blind Channel AcquisitionInitial Metrics Determination Procedure (IMDP)

(27) The algorithmic flowchart of the blind MLSE acquisition stage, referred herein as Initial Metrics Determination Procedure (IMDP), is illustrated in FIG. 2. The IMDP can be divided into four main phases. At the first phase, the metrics set M is taken from the predefined bank. Then, an iterative decoding procedure is activated, and several (X) decision-directed adaptation loops (later on being used as the tracking loops) are carried out. The third phase's goal is to check whether the resulting metrics are converged. If convergence is not achieved, the next metrics set from the bank is taken. Otherwise, additional optimization procedure that maximizes the amount of ISI that is compensated by the MLSE is used. If the initial metrics bank is run out of metrics j>J.sub.max, then interrupt is generated to the Central Processing Unit (CPU), which may decide to start the IMDP.

(28) Definition of the Metrics Bank custom character

(29) The Approximate Overall Channel DTE Model

(30) Direct detection optical channel systems are nonlinear in nature, mainly due to the square-law operation in the photo-detector and the intensity dependence of the fiber refractive index (the Kerr effecta change in the refractive index of a material in response to an applied electric field.). Thus, the noiseless incoming sample is represented by a nonlinear combination of transmitted symbol a.sub.n and past N.sub.isi.sup.(channel) symbols:

(31) $\begin{matrix} r_{n} = (a_{n}, a_{n - 1}, .Math., a_{n - N_{isi}^{(channel)} + 1}) & [Eq . 4] \end{matrix}$

(32) For the purposes of coarse channel estimation, it is assumed that the predominant nonlinearity comes from the square-law detection, and the fiber non-linearity Kerr effect can be neglected. At the photo-detection input point, the Discrete Time Equivalent model (DTE) accounting for the transmitter shaping, Optical Fiber (OF), CD and first order PMD, is given by:

(33) $\begin{matrix} H_{DTE}^{Tx + fiber} [n] (\begin{matrix} \sqrt{}_{n} & 0 \\ 0 & \sqrt{1 -}_{n +} \end{matrix}) * h_{CD} [n] * h_{Tx} [n] * h_{OF} [n] & [Eq . 5] \end{matrix}$
where

(34) $_{n - k} = {\begin{matrix} 1, n = k \\ 0, else \end{matrix}$
is the discrete Kronecker delta function, and * denotes the convolution operation. The effect of first order PMD in [Eq. 5] is represented by a discrete time 22 diagonal matrix with power splitting coefficient and Differential Group Delay (DGD the difference in propagation time between the two eigenmodes X and Y polarizations.) . In order to be compatible with the DTE model, in [Eq. 5] is rounded up to the nearest value which is multiple of the symbol duration. It should be stressed here, that the latter adjustment does not represent the exact PMD behavior, but is certainly sufficient for the purpose of coarse channel estimation, pursued here to obtain only a starting point for the initial MLSE metrics. Chromatic dispersion can also be represented by a Finite Impulse Response (FIR) filter with N.sub.CD taps (the filter length):

(35) 0 $\begin{matrix} h_{CD} [n] = \sqrt{- j W} e^{j {Wn}^{2}}, - \frac{N_{CD} - 1}{2} n \frac{N_{CD} - 1}{2} W = \frac{c}{f_{s}^{2} .Math. CD .Math._{0}^{2}}, N_{CD} = 2 .Math. .Math. \frac{1}{2 W} .Math. + 1 & [Eq . 6] \end{matrix}$
where c is the speed of light, .sub.0 is the wavelength of the optical carrier, CD is the amount of chromatic dispersion and f.sub.s is the sampling frequency. By denoting the scalar part of H.sub.DTE.sup.Tx+fiber[n] by
[n] custom character h.sub.CD[n]*h.sub.Tx[n]*h.sub.OF[n][Eq.7]
the signal at the photo-detector input can be written as:

(36) $\begin{matrix} x [n] = (H_{DTE}^{Tx + fiber} [n] * a_{n}) + z_{ASE} [n] = s_{n} + z_{ASE} [n] & [Eq . 8] \end{matrix}$
spontaneous emission, that has been optically amplified by a laser source) noise vector coming from optical amplifiers (in both polarizations) and s.sub.n is the DTE signal component given by:

(37) $\begin{matrix} s_{n} = (\begin{matrix} {.Math.}_{k = - \frac{N_{Ch} - 1}{2}}^{\frac{N_{Ch} - 1}{2}} [k] .Math. \sqrt{} .Math. a_{n - k} \\ {.Math.}_{k = - \frac{N_{Ch} - 1}{2}}^{\frac{N_{Ch} - 1}{2}} [k] .Math. \sqrt{1 -} .Math. a_{n - k +} \end{matrix}) & [Eq . 9] \end{matrix}$
where N.sub.Ch represents the length of scalar impulse response [n] in units of symbol duration:
N.sub.Ch=N.sub.CD+N.sub.Tx+N.sub.OF2[Eq.10]
where N.sub.Tx and N.sub.OF are the impulse response lengths of the transmitter (Tx) and optical filter respectively. Similarly, the overall length of the channel impulse response (including the PMD effect is N.sub.overall=N.sub.ch+.

(38) The recorded signal at the Photo-Detector (PD) output, is given by:
u .sub.n=R.Math.(Tr{s.sub.n.Math.s.sub.n.sup.H})*h.sub.Rx[n]+w.sub.n=r.sub.n+w.sub.n[Eq.11]
where Tr denotes the trace operation, H represents the Hermitian conjugate operation, R is the PD responsivity, h.sub.Rx[n] is the photo-detector electronic impulse response, and w.sub.n represents all the noises present in the system: signal-spontaneous, spontaneous-spontaneous, thermal, shot and dark current. The expanded expression of the signal term accounting for the trace operation:
y.sub.n custom character Tr{s.sub.n.Math.s.sub.n.sup.H}[Eq.12]
is given by:

(39) $\begin{matrix} y_{n} = {.Math. {.Math.}_{k = - \frac{N_{Ch} - 1}{2}}^{\frac{N_{Ch} - 1}{2}} [k] .Math. \sqrt{} .Math. a_{n - k} .Math.}^{2} + {.Math. {.Math.}_{k = - \frac{N_{Ch} - 1}{2}}^{\frac{N_{Ch} - 1}{2}} [k] .Math. \sqrt{1 -} .Math. a_{n + - k} .Math.}^{2} & [Eq . 13] \\ y_{n} = {.Math.}_{k = - \frac{N_{Ch} - 1}{2}}^{\frac{N_{Ch} - 1}{2}} .Math. {.Math. [k] .Math.}^{2} .Math. {.Math. a_{n - k} .Math.}^{2} + {.Math.}_{k = - \frac{N_{Ch} - 1}{2}}^{\frac{N_{CD} - 1}{2}} (1 -) .Math. {.Math. [k] .Math.}^{2} .Math. {.Math. a_{n + - k} .Math.}^{2} + {{.Math.}_{k = - \frac{N_{Ch} - 1}{2}}^{\frac{N_{Ch} - 1}{2}} {.Math.}_{l = - \frac{N_{Ch} - 1}{2} k l}^{\frac{N_{Ch} - 1}{2}} .Math. [k] .Math.^{} [l] .Math. a_{n - k} .Math. a_{n - l}^{}} + .Math.e {{.Math.}_{k = - \frac{N_{Ch} - 1}{2}}^{\frac{N_{Ch} - 1}{2}} {.Math.}_{l = - \frac{N_{Ch} - 1}{2} k l}^{\frac{N_{Ch} - 1}{2}} (1 -) .Math. [k] .Math.^{} [l] .Math. a_{n + - k} .Math. a_{n - - l}^{}} & [Eq . 14] \end{matrix}$
where custom character e is the real part of the complex signal, and designates the complex conjugate. Thus, according to [Eqs.11-14] the operator

(40) $(a_{n}, a_{n - 1}, .Math., a_{n - N_{isi}^{(channel)} + 1})$
in [Eq.4] is given by:

(41) $\begin{matrix} r_{n} = (a_{n}, a_{n - 1}, .Math., a_{n - N_{isi}^{(channel)} + 1}) = R .Math. y_{n} * h_{Rx} [n] & [Eq . 15] \end{matrix}$

(42) Equations [Eqs.11-14] will be used in the following sections to derive a coarse FIR approximation of the function

(43) $(a_{n}, a_{n - 1}, .Math., a_{n - N_{isi}^{(channel)} + 1}),$
which is shown to be a good initial guess for the initialization of the MLSE acquisition process.

(44) Definition of the Metrics Bank custom character for Phase #1

(45) The key function that enables the blind MLSE processing is the proper definition of the metrics bank custom character {M.sup.(j), j=0, . . . , J.sub.max1}, which allows operation in decision directed mode. These can be obtained by preparing a predetermined metrics bank, for example by transmitting a known data (training sequence) followed by generating and storing several metric sets for different channel conditions, as described in FIG. 1b. In turn, while deployed in the system, the IMDP, described in FIG. 2, can be activated, and a proper initial set of metrics can be selected from the bank custom character .

(46) The present invention proposes a novel approach for the definition of the metrics bank custom character based on Method of Moments (MoM), combined with knowing the physical behavior of the optical fiber. Since only coarse channel representation is needed, it may be assumed that the branch histograms H.sub.l(r.sub.n|a.sub.n,a.sub.n1, . . . , a.sub.nN.sub.isi.sub.+1) have nearly a Gaussian shape and differ from each other only by the mean and variance. The mean values depend on the channel memory length N.sub.isi.sup.(channel), the data vocabulary size V, and the dominant noise mechanism in the system. To ensure proper operation, the decoder is designed such that the channel memory length is at most as the memory length of the decoder: N.sub.isiN.sub.isi.sup.(channel).

(47) In this case, there are V.sup.N.sup.isi.sup.+1 branches, whereas the variance of each histogram is associated with the noise power that is present in the corresponding combination describing the branch. For example, in a memory-less channel with binary vocabulary (V=2) there are two histograms, representing the corresponding conditional PDFs, and simple hard decision scheme can be used. When V=2 and N.sub.isi.sup.(channel)=1 there are four distinct histograms, with four different mean values. Generally, when N.sub.isiN.sub.isi.sup.(channel), the actual number of histograms in the given MLSE decoder is constant, N.sub.br=V.sup.N.sup.isi.sup.+1, and consists of different groups, while all the members of such a group are identical. Continuing the example (V=2 and N.sub.isi.sup.(channel)=1), for N.sub.isi=4 there are 32 branches. These branches can be divided into four groups, associated with the four different mean values mentioned above.

(48) Based on the argumentation above, the problem of selecting the proper set of metrics bank custom character can be formulated as follows: Finding the set of V.sup.N.sup.isi.sup.+1 mean values and corresponding variances that, together with Gaussianity assumption and correct ordering, lead to conditional PDFs that coarsely but still reliably describe the channel, i.e. result in BER that is low enough (<10.sup.2) to allow operation in decision directed mode. Thus, a bank of metrics custom character is sought, which are derived from histogram sets {H.sup.(j), j=0, . . . , J.sub.max}, having Gaussian shapes with the mean values vectors .sub.j and corresponding variances vectors .sub.j.sup.2. Hence, the metrics in have the following form:
M.sup.(j)={square root over (2)}.sub.j({tilde over (r)}.sub.n.sub.j).sup.2./(2.sub.j.sup.2), j=0, . . . , J.sub.max1[Eq.16]
where the .1 represents the element-wise (Matlab-like) vector division operation.

(49) The values .sub.j can be determined by the FIR approximation of the operator () given by [Eq.15]. Without loss of generality, the following analysis is restricted to the simplest On-Off-Keying (OOK) modulation format, i.e., V=2.

(50) It is assumed that the Non-Return-to-Zero (NRZ) shaping pulse at the transmitter (Tx) is represented by the following impulse response h.sub.Tx[n]=K.sub.1.sub.n in the DTE model, where N.sub.Tx=1 in [Eq.10], and the constant K.sub.1 depends on the transmitted power. It is also assumed that the bandwidth of the optical filter is wide enough, such that at the sampling point, the DTE impulse response of the OF is h.sub.OF[n]=K.sub.2.sub.n, where N.sub.OF=1 and K.sub.2 depends on the OF shape.

(51) In practice, the length of h.sub.OF[n], N.sub.OF, may be longer than a single symbol duration, especially in the environment of concatenated optical filtering (with optical add drop multiplexers). Consequently, according to [Eq.10] the length of the scalar impulse response is dominated by the length of h.sub.CD[n], N.sub.CD and [Eq.7] can be rewritten as:

(52) $\begin{matrix} [n] K_{1} K_{2} h_{CD} [n], - \frac{N_{ch} - 1}{2} n \frac{N_{ch} - 1}{2}, N_{ch} = N_{CD} & [Eq . 17] \end{matrix}$

(53) Using similar argumentation, it can be assumed that h.sub.Rx[n]=K.sub.3.sub.n. For an OOK format a.sub.n=|a.sub.n|.sup.2, and substituting [Eq.6], [Eq.14] and [Eq.17] to [Eq.15] yields:

(54) $\begin{matrix} r_{n} = K {.Math.}_{k = - \frac{N_{CD} - 1}{2}}^{\frac{N_{CD} - 1}{2}} a_{n - k} + (1 -) K {.Math.}_{k = - \frac{N_{CD} - 1}{2}}^{\frac{N_{CD} - 1}{2}} a_{n + - k} + K {.Math.}_{k = - \frac{N_{CD} - 1}{2}}^{\frac{N_{CD} - 1}{2}} {.Math.}_{l = - \frac{N_{CD} - 1}{2} k l}^{\frac{N_{CD} - 1}{2}} \cos (.Math. W .Math. {(k - l)}^{2}) .Math. a_{n - k} .Math. a_{n - l} + (1 -) K {.Math.}_{k = - \frac{N_{CD} - 1}{2}}^{\frac{N_{CD} - 1}{2}} {.Math.}_{l = - \frac{N_{CD} - 1}{2} k l}^{\frac{N_{CD} - 1}{2}} \cos (.Math. W .Math. {(k - l)}^{2}) .Math. a_{n + - k} .Math. a_{n + - l} & [Eq . 18] \end{matrix}$
where K custom character RK.sub.1.sup.2K.sub.2.sup.2K.sub.3W, K>0 is the non-negative proportionality constant that depends on the responsivity and shapes of the transmitter (Tx), optical and receiver (Rx) filters.

(55) The first two terms of [Eq.18] represent the linear part of r.sub.n, and can be regarded as the sum of the responses of two FIR filters with rectangular shapes, relatively delayed by :

(56) $\begin{matrix} {\hat{b}}_{k} = K .Math. (\frac{k}{N_{CD}}) + (1 -) .Math. K .Math. (\frac{k -}{N_{CD}}) & [Eq . 19] \end{matrix}$

(57) Thus, as a first order approximation of (), a metrics bank may be defined by quantizing and building all possible combinations of the coefficients, corresponding to various delays . The last two terms of [Eq.18] account for nonlinear interaction between the transmitted symbols, and may be viewed as a data dependent FIR filters, whose coefficients are proportional to cos(.Math.W.Math.(kl).sup.2):

(58) 0 $\begin{matrix} {\tilde{b}}_{k} (a_{n - \frac{N_{CD} - 1}{2}}, .Math., a_{n + \frac{N_{CD} - 1}{2} +}) = K {.Math.}_{l = - \frac{N_{CD} - 1}{2}, k 1}^{\frac{N_{CD} - 1}{2}} \cos (.Math. W .Math. {(k - l)}^{2}) .Math. a_{n - l} + (1 -) K {.Math.}_{l = \frac{N_{CD} - 1}{2} -, k l}^{\frac{N_{CD} - 1}{2} -} \cos (.Math. W .Math. {(k - l)}^{2}) .Math. a_{n - l} & [Eq . 20] \end{matrix}$

(59) These two terms contribute to the overall sum only when the corresponding data-dependent coefficients are non-zero. On average this filter can be approximated as:

(60) $\begin{matrix} E {{\tilde{b}}_{k} (a_{n - \frac{N_{CD} - 1}{2}}, .Math., a_{n + \frac{N_{CD} - 1}{2} +})} = K {.Math.}_{l = - \frac{N_{CD} - 1}{2}, k 1}^{\frac{N_{CD} - 1}{2}} \cos (.Math. W .Math. {(k - l)}^{2}) .Math. E {a_{n - l}} + (1 -) K {.Math.}_{l = - \frac{N_{CD} - 1}{2} -, k l}^{\frac{N_{CD} - 1}{2} -} \cos (.Math. W .Math. {(k - l)}^{2}) .Math. E {a_{n - l}} & [Eq . 21] \end{matrix}$
where E{} represents the mathematical expectation operator. For OOK modulation format E{a.sub.n}=0.5, thus on average the contribution of the last two terms in [Eq.18], is at most half of the first two terms.

(61) [Eq. 19] and [Eq. 20] summarize the exact mathematical model of the overall channel DTE FIR. For pragmatic acquisition purposes a coarse approximation is proposed. A closer examination of [Eq. 19] and [Eq. 20] reveals that while [Eq.19] represents rectangular shape, [Eq.20] represents the sum of half period cosine terms multiplied by the random data samples. It can be shown empirically (by plotting the sum of [Eq.19] and [Eq.20] for various data, CD and PMD values) that the FIR-equivalent filter can be approximated by either pre-cursor dominating ISI (next bit effect), post-cursor dominating ISI (previous bit effect) or symmetrical ISI filters.

(62) Consequently, the bank of metrics custom character , can be generated by the following set of FIR filters 11, {b.sub.j, j=0, . . . , J.sub.max}, where b.sub.j is given by:

(63) $\begin{matrix} b_{j} [n] = {\begin{matrix} \frac{c^{n}}{{.Math.}_{l = 1}^{m + 1} c^{l}}, j = 0, .Math., N_{isi} - 1, m = (j + 1) \mod N_{isi} \\ \frac{c^{- n}}{{.Math.}_{l = 1}^{m + 1} c^{- l}}, j = N_{isi}, .Math., 2 N_{isi} - 1, m = (j + 1) \mod N_{isi} \\ \begin{matrix} \frac{c^{.Math. n .Math.}}{{.Math. N_{isi} / 2 .Math.}_{- m + 1}}, j = 2 N_{isi}, .Math., 3 N_{isi} - 1, \\ m = (j + 1) \mod N_{isi}, m 0, m N_{isi} - 1 \end{matrix} \\ \underset{l = ({.Math. N_{isi} / 2 .Math.}_{- m + 1})}{.Math.} c^{.Math. l .Math.} \end{matrix} & [Eq . 22] \end{matrix}$

(64) The indexes in [Eq.22] are summarized as follows: j represents the serial number of each element in the proposed set of FIR set custom character , n represents the discrete time axis of the impulse response b.sub.j, and m is related to the impulse response length. The number of coefficients in each element of (the FIR length) is determined by the memory depth of the MLSE engine, N.sub.isi. The design parameter c in [Eq.22] describes the distribution of ISI in each element of custom character (the FIR shape), which, in turn, determines the value of the mean vector .sub.j in [Eq.16]. The actual FIR shape is found to be less critical since only coarse channel model is required for the acquisition stage. Therefore, its value is selected to optimize implementation complexity. In this work, c=2 was used, and satisfactory results are obtained as presented in the below examples.

(65) FIGS. 3a-3c illustrate examples of FIRs corresponding to each line in [Eq.22] for c=2 and N.sub.isi=4, for j=0 (increasing exponent) j=4 (decaying exponent and for j=8 (symmetrically decaying exponent), respectively.

(66) Practically, the MLSE decoder memory length is typically small (N.sub.isi<5), and the number of elements in custom character is finite and not too large. For example, in the ASIC, N.sub.isi=4, resulting in J.sub.max=10 matrices in the bank as dictated by [Eq.22]: 4 matrices with pre-cursor ISI, 4 with post-cursor ISI and 2 with symmetric ISI behavior. The overall acquisition time of the IMDP, in the worst case (when all matrices in the bank should be examined) increases linearly with J.sub.max.

(67) The mean vectors in [Eq.16] can be obtained using [Eq.22] as follows:
.sub.j=A.Math.b.sub.j.Math.(2.sup.N.sup.ADC1)[Eq.23]
where, in the simplest case, A is the V.sup.N.sup.isi.sup.+1(N.sub.isi+1) matrix having all possible combinations of symbols in the vocabulary in increasing order, and N.sub.ADC is the nominal bit count of the ADC.

(68) Similarly, the vector of variances values .sub.j.sup.2 is calculated as follows:
.sub.j.sup.2=S.Math.b.sub.j.sup.2[Eq.24]
where S is the V.sup.N.sup.isi.sup.+1(N.sub.isi+1) matrix, with all possible combinations of symbols in the vocabulary in increasing order like A, but the values of various vocabulary symbols are replaced by the variance values, corresponding to these symbols in ISI free scenario. The variance values are typically derived from the Signal to Noise Ratio conditions in the system. In an optically amplified system the standard deviation for 1, is higher than for 0, depending on the OSNR conditions.

(69) For example, taking V=2, N.sub.isi=1, the matrices A and S have the following form:

(70) $\begin{matrix} A = [\begin{matrix} 0 \\ 0 \\ 1 \\ 1 \end{matrix} | \begin{matrix} 0 \\ 1 \\ 0 \\ 1 \end{matrix}] S = [\begin{matrix} Var (0) \\ Var (0) \\ Var (1) \\ Var (1) \end{matrix} | \begin{matrix} Var (1) \\ Var (1) \\ Var (0) \\ Var (1) \end{matrix}] & [Eq . 25] \end{matrix}$
where Var(0) and Var(1) are determined according to the worst case OSNR the system is designed to tolerate (typically slightly below the pre- Forward Error Correction value), that depend both on signal and noise power in the system.

(71) The Convergence Test (Phase #3) and Convergence Criterion

(72) In order to verify whether the X learning loops during phase #2, a channel estimate M that describes the channel reliably enough is provided, such that successful operation in decision directed mode is possible (BER<10.sup.2) and the histograms in the corresponding histogram set H must possess certain statistical properties.

(73) The only assumption that forms the basis of derivation of these properties is that the transmitted symbols are equiviprobable, i.e.:

(74) $\begin{matrix} P (a_{i}) = \frac{1}{V}, i & [Eq . 26] \end{matrix}$

(75) That this assumption is also needed for using MLSE instead of Maximum A Posteriori Probability (MAPa mode of the posterior distribution.) algorithm, and generally hold in practical systems which employ source coding and scrambling. In turn, [Eq.26] implies that the probability to transmit any combination of N.sub.isi+1 consecutive symbols is:

(76) $\begin{matrix} p = \frac{1}{V^{N_{isi} + 1}} & [Eq . 27] \end{matrix}$

(77) Therefore, if the decoder works correctly and the channel estimate M is reliable, there are N.sub.br=V.sup.N.sup.isi.sup.+1 branches in histogram set H, each having an equal probability p to appear. In other words, the probability to assign the observation at the decoder input u.sub.n to the correct combination of N.sub.isi+1 consecutive decisions at the decoder output .sub.i is p , i.e., the probability of the event u.sub.n.sub.i, i=0, . . . , V.sup.N.sup.isi.sup.+1 is Binomially distributed and is given by:
P(u.sub.n.sub.i)=p, i=0, . . . , N.sub.br1[Eq.28]

(78) Thus the total number of events in each branch is given by:

(79) $\begin{matrix} m_{0}^{(i)} \underset{u_{n}_{i}}{.Math.}_{u_{n}_{i}}, i = 0, .Math., N_{br} - 1 & [Eq . 29] \end{matrix}$
is a Gaussian random variable with an expectation value Np and variance Np(1p):

(80) $\begin{matrix} f_{m_{0}^{(i)}} (m_{0}^{(i)}) = \frac{1}{\sqrt{2 N p (1 - p)}} \exp {- \frac{(m_{0}^{(i)} - N p)}{2 N p (1 - p)}}, i = 0, .Math., N_{br} - 1 & [Eq . 30] \end{matrix}$
where N is the total number of observations, used to build the whole histogram set H. Hence, a widely used Z-test (a statistical test for which the distribution of the test statistic under the null hypothesis can be approximated by a normal distribution) is proposed here as a convergence criterion for each branch H.sub.iH, 0iN.sub.br1. Based on [Eq.30] the null hypothesis is:
m.sub.0.sup.(i)=Np, i=0, . . . , N.sub.br1[Eq.31]
and the Z-statistics is given by:

(81) $\begin{matrix} z = \frac{(m_{0}^{(i)} - N p)}{\sqrt{N p (1 - p)}}, i = 0, .Math., N_{br} - 1 & [Eq . 32] \end{matrix}$

(82) The two-tailed P-value (the probability of obtaining a test statistic result at least as extreme as the one that was actually observed), or the probability that successfully converged metrics would be classified as non-converged is given by:

(83) $\begin{matrix} .Math. - 2 Q (z) = \frac{2}{\sqrt{2}}_{z}^{} e^{- \frac{x^{2}}{2}} d x & [Eq . 33] \end{matrix}$

(84) Thus based on , the practical convergence test translates into:
thr.sub.lowm.sub.0.sup.(i)thr.sub.high, i=0, . . . , N.sub.br1[Eq.34]
i.e., to check whether the obtained event count in each branch lies between the two threshold values, defined by [Eq.34], where:

(85) 0 $\begin{matrix} {thr}_{low} = N p - \sqrt{N p (1 - p)} .Math. Q^{- 1} (\frac{.Math.}{2}) {thr}_{high} = N p + \sqrt{N p (1 - p)} .Math. Q^{- 1} (\frac{.Math.}{2}) & [Eq . 35] \end{matrix}$

(86) Therefore, meeting the conditions in [Eq.35] indicates that the detected symbols obey the equiviprobability assumption of [Eq.26].

(87) Convergence Monitoring During Phase #2

(88) Based on the argumentation in section 4.3, it is possible to use the sampled standard deviation of the central moments after d-th iteration, designated as std(m.sub.0)[d], in order to monitor the convergence tendency of the histogram set H during phase #2:

(89) $\begin{matrix} std (m_{0}) [d] \sqrt{\frac{1}{N_{br} - 1} {.Math.}_{i = 0}^{N_{br} - 1} {(m_{0}^{(i)} [d] - N p)}^{2}}, d = 0, .Math., X - 1 & [Eq . 36] \end{matrix}$

(90) The idea behind [Eq.36] is that if during phase #2 H has the tendency to converge, after X iterations, each branch will have a similar number of events, and std(m.sub.0)[X1] will go to zero.

(91) In addition, an additional figure of merit is proposed, based on training a sequence, for illustrational purposes only. In this case the histogram set, H.sub.training is known, and one can measure the closeness of the obtained set H.sub.blind by means of sample Kullback-Leibler (KLa non-symmetric measure of the difference between two probability distributions P and Q) distance:

(92) $\begin{matrix} D_{KL} (i) D_{KL} (H_{i}^{training} .Math. .Math. H_{i}^{blind}) = {.Math.}_{m = 0}^{2^{N} ADC - 1} \ln (\frac{H_{i}^{training} (m)}{H_{i}^{blind} (m)}) .Math. H_{i}^{training} (m) & [Eq . 37] \end{matrix}$

(93) After treating all the KL-distances in H.sub.blind as a vector in a linear space, a Euclidian norm of KL distances D.sub.KL(i) can be used to monitor the convergence process during phase #2:

(94) $\begin{matrix} D_{ED} (H_{training} .Math. .Math. H_{blind}) \sqrt{{.Math.}_{i = 0}^{N_{br} - 1} {[D_{KL} (i)]}^{2}} & [Eq . 38] \end{matrix}$

(95) Furthermore, the Bit Error Rate (BER), obtained by direct error counting will be used to illustrate that the proposed figure of merit behaves correctly, i.e. convergence in terms of std(m.sub.0) results also in BER convergence.

(96) Match Point (MP) and ISI Optimization (Phase #4)

(97) According to [Eq.4], the incoming sample is a nonlinear combination of a current symbol a.sub.n and N.sub.isi.sup.(channel) previous symbols. The MLSE equalizer operates perfectly, if the memory length of the decoder N.sub.isi is greater than the channel memory, i.e. N.sub.isiN.sub.is.sup.(channel). However, in practical scenarios, the opposite statement holds, i.e. N.sub.isi<N.sub.isi.sup.(channel). In this case, the MLSE equalizer performs sub-optimally, since it takes care only for the first N.sub.isi terms, leaving some portion of residual ISI uncompensated. This residual ISI is treated by the decoder as noise, and is reflected into the variances of the branch histograms:
.sub.l.sup.2=.sub.noise.sup.2(l)+.sub.ADC.sup.2+.sub.residual ISI.sup.2, 0lN.sub.br[Eq.39]
where .sub.noise.sup.2(l) is the receiver random noise (both thermal and optical induced noises), and .sub.ADC.sup.2 is ADC related noise that includes quantization, jitter, etc. Usually, if the decoder is designed correctly, the amount of the residual ISI is small and the effect on the performance is negligible. i.e., .sub.residual ISI.sup.2 custom character .sub.noise.sup.2(l)+.sub.ADC.sup.2, 0lN.sub.br. But, when the amount of the impairments in the channel is high the residual ISI may dominate.

(98) If a simple FIR channel with N.sub.isi.sup.(channel) coefficients is used, the noiseless received sample r.sub.n is given by:

(99) $\begin{matrix} r_{n} = {.Math.}_{k = 0}^{N_{isi}^{(channel)} - 1} b_{k} a_{n - k} & [Eq . 40] \end{matrix}$

(100) The ISI in the system can be divided into two groups: the ISI handled by the MLSE with memory of N.sub.isi symbols, and the residual ISI. The handled ISI should be selected according to a peak-distortion criterion:

(101) $\begin{matrix} \max_{n_{0} (0, L)} {.Math.}_{n = n_{0}}^{n_{0} + N_{isi}} .Math. b_{n} .Math. & [Eq . 41] \end{matrix}$

(102) Thus, there is a subset of L=N.sub.isi.sup.(channel)N.sub.isi taps that is not compensated, and generates the residual ISI noise with variance .sub.residual ISI.sup.2. For the system with V equiprobable symbols (symbols with equal probabilities)

(103) $\begin{matrix} _{residual_ISI}^{2} =_{a}^{2} {.Math.}_{n = n_{0}}^{n_{0} - 1} {.Math. b_{n} .Math.}^{2} +_{a}^{2} {.Math.}_{n = n_{0} + N_{isi} + 1}^{N_{isi}^{(channel)}} {.Math. b_{n} .Math.}^{2} & [Eq . 42] \end{matrix}$
where .sub.a.sup.2 is the variance of the transmitted constellation, is given by:

(104) $\begin{matrix} _{a}^{2} = \frac{1}{V} {.Math.}_{k = 0}^{V - 1} a_{k}^{2} - {(\frac{1}{V} {.Math.}_{k = 0}^{V - 1} a_{k})}^{2}, a_{k} Vocabulary & [Eq . 43] \end{matrix}$

(105) In the case of an OOK system, the received signal is described in [Eq.18], and the peak-distortion criteria can be extended:

(106) $\begin{matrix} \max_{n_{0} (0, L)} {.Math.}_{n = n_{0}}^{n_{0} + N_{isi}} .Math. E {{\hat{b}}_{n} + {\tilde{b}}_{n}} .Math. & (1) \end{matrix}$
where {circumflex over (b)}.sub.n is the given by [Eq.19] and {tilde over (b)}.sub.n is the data-dependent FIR that can be approximated by [Eq.20]. Consequently, .sub.residual ISI.sup.2 can be approximated by [Eq.42] where b.sub.n is replaced by E{{circumflex over (b)}.sub.n+{tilde over (b)}.sub.n}.

(107) The optimal n.sub.0 is called Match Point (MP), and in practice the ISI optimization is done by collecting several channel estimates (histograms), while each time, a different MP-shift n.sub.0 is set between the stream of ADC samples and the stream of the corresponding decision bits. Thus, each histogram represents a selection of a different subset of the channel ISI to be compensated by the MLSE. The contribution of .sub.noise.sup.2(l) and .sub.ADC.sup.2 in [Eq.39] is the same, averagely. Therefore, the variances-average of the histograms' changes between these n.sub.0 shifts, and is determined by the .sub.residual ISI.sup.2. Hence, the selected n.sub.0 (the correct MP-shift) is the one that yields the minimal variances-average of the histograms:

(108) $\begin{matrix} MP = \min_{n_{0}}_{average}^{2} (n_{0})_{average}^{2} (n_{0}) = .Math._{l}^{2} (n_{0}) .Math. = \frac{1}{N_{br}} {.Math.}_{l = 0}^{N_{br} - 1}_{l}^{2} (n_{0}) & [Eq . 45] \end{matrix}$

(109) Experimental Setup and ASIC Parameters

(110) The proposed IMDP method was implemented within the Q ASIC and was verified experimentally using the following optical setup, shown in FIG. 4. The Pseudo-Random Bits Sequence (PRBS) of length 2.sup.311 was generated and amplified by the driver, to modulate the optical carrier. A Mach-Zehnder Modulator (MZM) following a 1550 nm Distributed Feedback (DFB) laser was used. The optical channel was a Standard Single Mode Fiber (SSMF) including an optical amplifier and an ASE noise source. Optical spectrum analyzer was used to measure the Optical Signal-To-Noise Ratio (OSNR). A 50 GHz optical filter was used in order to reduce the amount of received noise at the PIN Photo Detector (PD). The received electrical signal was processed by the ASIC, which has a built-in Pseudo-Random Binary Sequence (PRBS) checker that was used to obtain the BER results.

(111) The ASIC has an ADC with nominal resolution of N.sub.ADC=5 bits and an Effective Number Of Bits (ENOB) of 3.8 bits. An analog Phase-Lock Loop (PLL) was used to recover the symbols clock, while the data was sampled at the symbol rate of 28 Gsymbol/sec. The MLSE equalizer memory depth is N.sub.isi=4 symbols, the principle architecture of which is shown in FIG. 1b.

(112) Experimental Examples

(113) The operation of the proposed blind channel acquisition algorithm (IMDP), the outcome of the intermediate procedure phases (FIG. 2) is described and analyzed for two cases: a back-to-back channel and a 40 km long channel.

(114) The phases of the IMDP for a Back to back channel are illustrated in FIGS. 5-8.

(115) FIGS. 5a and 5b illustrate normalized histograms (a) H.sub.#1.sup.(0) and H.sub.#1.sup.(1) and during Phase # 1 of IMDP, for a channel memory depth of one symbol. Two different histogram sets H.sub.#1.sup.(0) and H.sub.#1.sup.(1), that can represent PDFs describing a channel with memory depth of one symbol, are shown. The histograms on the left hand side correspond to the increasing exponent j=0 in [Eq.22], whereas the histograms on the right hand side represent the decreasing exponent channel j=4 in [Eq.22] for N.sub.isi=4. In fact, the branch (or histogram) labeled 01 in FIG. 5a has different mean and variance values, compared to the same branch (or histogram) in FIG. 5b. The same is true for branch 10.

(116) On the other hand, the edge branches 00 and 11 have the same mean and variance values due to the symmetry presented in [Eq.22]. Both histogram sets H.sub.#1.sup.(0) and H.sub.#1.sup.(1) and contain 32 branches each, which are divided into 4 groups, whereas each group is described by its mean and variance values (which coincide with the 4 histograms shown in FIGS. 5a-5b). H.sub.#1.sup.(0) and H.sub.#1.sup.(1) are the outcome of phase #1, and serve as a starting point for phase #2 of the IMDP.

(117) FIGS. 6a-6c illustrate normalized histogram sets obtained after phase # 2 of IMDP. FIGS. 6a and 6b show the histograms sets after 8 iterations for H.sub.#2.sup.(0)(b) H.sub.#2.sup.(1). FIG. 6c shows the histograms set H.sub.training obtained by using a training sequence. It is clear that H.sub.#1.sup.(0) does not converge, whereas H.sub.#1.sup.(1) provides a good initial guess. The histogram sets H.sub.#2.sup.(0) and H.sub.#2.sup.(1) are obtained, starting from H.sub.#1.sup.(0) and H.sub.#1.sup.(1), respectively.

(118) By comparing FIG. 6a to FIG. 6c, it is clear that the initial guess H.sub.#1.sup.(0) was not successful. On the other hand, comparing FIG. 6b to FIG. 6c, it is clear that that H.sub.#1.sup.(1) is a better guess which converges to histograms map similar to H.sub.training, and therefore, is considered to be a successful initial guess. In addition to the visual effect, the similarity between H.sub.#2.sup.(1) and H.sub.training can be quantitatively measured by means of the parameter D.sub.ED defined in [Eq.38]. In addition, the convergence rate can be quantified by using [Eq.35].

(119) FIGS. 7a-7d show IMDP convergence monitoring during phase #2. FIG. 7a shows D.sub.ED convergence following [Eq.38]. The values of D.sub.ED throughout the 8 iterations of phase #2 are shown. In the experiment, X=8 was the worst case for IMDP number of iterations to converge. It is clear that H.sub.#1.sup.(0) (circles) diverges, whereas H.sub.#1.sup.(1) (squares) converges to zero in terms of D.sub.ED, meaning that the obtained H.sub.#2.sup.(1) converges to H.sub.training.

(120) FIG. 7b shows a standard deviation of central moments [Eq.36], presenting the convergence rate in terms of standard deviation of the central moments, as defined in [Eq.35]. Similarly, convergence of H.sub.#1.sup.(1) (squares) and divergence of H.sub.#1.sup.(0) (circles) are observed. The (final) value of std(m.sub.0) for H.sub.#2.sup.(1) also goes to zero, indicating that all the histograms in H.sub.#2.sup.(1) have similar number of observations.

(121) FIG. 7c shows convergence in terms of BER, presenting the bit error rate convergence during phase #2, compared to the BER obtained by training (solid line). The convergence in terms of BER is slower during the acquisition phase #2, as compared to D.sub.ED and to std(m.sub.0) convergences. However, it is shown in the following figures that the final BER convergence, at the end of the IMDP, is similar to the training case.

(122) FIG. 7d shows Phase #3 of IMDP during which, the convergence criterion of [Eq.34] is checked, presenting the validation of convergence criterion of [Eq.34], which forms phase #3 of the IMDP, as well as the values of the moments m.sub.0.sup.(i) for various histograms in H.sub.#2.sup.(0) (circles) and H.sub.#2.sup.(1) (squares). It is clearly seen that m.sub.0.sup.(i) of H.sub.#2.sup.(1) lie within the upper and lower thresholds defined by of [Eq.35], as opposed to the H.sub.#2.sup.(0) counterpart. In hardware implementation, [Eq.34] is applied only to the m.sub.0.sup.(i) of H.sub.#2.sup.(1), to save complexity and the duration of the acquisition process. The zero-th moments of H.sub.#2.sup.(0) are presented only for clarity and comparison insight.

(123) FIGS. 8a-8e show the results of ISI optimization during phase #4 of IMDP, for the histograms set H.sub.#2.sup.(1).

(124) Histogram sets are obtained for different shifts: (a) MP.sub.shift=2, BER=4.26.Math.10.sup.1, custom character .sub.i.sup.2=89.21, (b) MP.sub.shift=1, BER=3.15.Math.10.sup.3, .sub.hu 2=12.21, (c) MP.sub.shift=0, BER=1.77.Math.10.sup.3, .sub.i.sup.2=10.4, (d) MP.sub.shift=1, BER=1.89.Math.10.sup.3, .sub.i.sup.2=10.41, (e) MP.sub.shift=2, BER=1.82.Math.10.sup.3, .sub.i.sup.2=10.53

(125) In each sub-plot, the titles contain the MP-shift, the BER and the average histograms variance calculated according to [Eq.45]. In this simple back-to-back case, the major portion of ISI comes from the frequency response of the analog front-end of the ASIC. It can be seen in FIGS. 8a-8e, that the effective smearing is between 1 and 2 symbol periods, resulting in three histogram sets, identified in FIGS. 8c-8e. The resulting BER in FIGS. 8c-8e is similar, since the memory depth of the implemented MLSE is higher than the effective smearing, N.sub.isi>N.sub.isi.sup.channel. The optimal shift is chosen such that the residual ISI is minimal according to [Eq.45], which also results in the lowest BER. In this case, the optimal shift is zero, which also complies with the fact that the histogram set from FIG. 8c is very close in terms of D.sub.ED to the histogram set obtained by using the training sequence, shown in FIG. 6c, and having the same shift.

(126) The phases of the IMDP for a 40 km optical link are illustrated in FIGS. 9-12. As predicted by [Eq.6], the initial histogram sets representing the channel with memory of one symbol duration N.sub.isi.sup.channel=1, H.sub.#1.sup.(0) and H.sub.#1.sup.(1) shown in FIGS. 5a-5b would not be sufficient, since they result in high initial BER.

(127) FIG. 9 illustrates Phase # 1 of IMDP Normalized histogram H.sub.#1.sup.(2) corresponding to N.sub.isi.sup.channel=2, j=1 for a 40 km link. H.sub.#1.sup.(0) and H.sub.#1.sup.(1) correspond to N.sub.isi.sup.channel=1 are shown FIGS. 5a-5b. Therefore, if the link length is known a-priory, one can directly start the IMDP from histogram sets with higher channel memory, e.g., N.sub.isi.sup.channel=2 in this case. However, it is assumed that there is no side information about the link length (up to maximal length that the current hardware supports, which is about 50 km). Consequently, the whole IMDP is repeated starting from three histogram sets: H.sub.#1.sup.(0) and H.sub.#1.sup.(1), shown in FIG. 5, and H.sub.#1.sup.(2) shown in FIG. 9.

(128) FIGS. 10a-10c present Phase #2 of the IMDP, which also consists of 8 iterations, resulting in the histogram sets H.sub.#2.sup.(0), H.sub.#2.sup.(1) and H.sub.#2.sup.(1), respectively. The histogram set obtained by using the training sequence H.sub.training is shown in FIG. 10d, for comparison. It is clear that H.sub.#1.sup.(0) and H.sub.#1.sup.(1) diverge, whereas H.sub.#1.sup.(2) provides a similar histogram map as with training. The histogram sets without sufficient channel memory, H.sub.#1.sup.(0) and H.sub.#1.sup.(1) diverge, whereas H.sub.#1.sup.(2) provides a good initial guess. The titles of FIGS. 10a-10c contain the std(m.sub.0) values after 8.sup.th iteration, indicating quantitatively that only H.sub.#1.sup.(2) has successful convergence.

(129) FIGS. 11a-11d show the convergence process during phase #2 of the IMDP, for a 40 km long optical link. The convergence in terms of D.sub.ED (defined by [Eq.45]) for the three histogram sets H.sub.#1.sup.(0) (circles) H.sub.#1.sup.(1) (squares) and H.sub.#1.sup.(2) (triangles) is shown on FIG. 11a. FIG. 11b shows the standard deviation of central moments defined by [Eq.36]. FIG. 11c shows convergence in terms of BER. FIG. 11d shows Phase #3 of IMDP by checking the convergence criterion defined by [Eq.34].

(130) All the three sets appear to stabilize around a constant D.sub.ED value, but as already known from FIGS. 10a-10d, only H.sub.#2.sup.(2) tends to resemble the H.sub.training. The fact that H.sub.#1.sup.(0) and H.sub.#1.sup.(1) stabilize around a constant D.sub.ED value, does not imply that they converged to a correct channel estimation. Rather the opposite is true, and a closer look at FIGS. 10a-10b reveals that the resulting histogram shapes in both sets almost uniformly spread throughout the ensemble range (the x-axis).

(131) Despite the fact that H.sub.#2.sup.(2) converges, the final D.sub.ED value (after 8 iterations) for H.sub.#2.sup.(2) is higher than for H.sub.#2.sup.(0) and H.sub.#2.sup.(1) which eventually diverge. The reason for this is that H.sub.#1.sup.(2) converged to a suboptimal solution. H.sub.#2.sup.(2) is indeed quantitatively far from H.sub.training, since the D.sub.ED between them is not close to zero. This suboptimal solution will be improved during phase #4 of the IMDP. Thus, the KL-distance does not immediately show success, since several suboptimal solutions are possible, and only the optimal reference PDF (or its histogram representative) is relevant for comparison.

(132) On the other hand, by observing the intermediate values of the std(m.sub.0) criterion (shown in FIG. 11b), one can conclude that H.sub.#1.sup.(0) and H.sub.#1.sup.(1) diverge, whereas H.sub.#2.sup.(1) converges (to a valid suboptimal solution). The BER convergence, shown on FIG. 11c, also reveals that the first two histogram sets diverge (BER=0.5) and the latter set converges to a suboptimal solution, indicated by a slightly higher BER than the one obtained with H.sub.training (solid purple curve).

(133) The practical way to conclude whether a given histogram set is converged to a valid (possibly suboptimal) solution, without observing the resulting histogram sets H.sub.#2.sup.(0), H.sub.#2.sup.(1) and H.sub.#2.sup.(2), is to assure that all the zero-th moments of the resulting histograms within the set lie within a predefined range, given by [Eq.34]. FIG. 11d shows that only H.sub.#2.sup.(1) meets [Eq.33], and thus is the only selected metric that is being processed in phase #4.

(134) FIGS. 12a-12e show the histogram sets representing the outcome of the ISI optimization Phase # 4 of IMDP for a 40 km long optical channel. As indicated by both BER and the average standard deviations (ref formula for mean (vars)), the MP-shift of one symbol (shown in) obtains the best performance.

(135) Histogram sets, obtained for different shifts are the following: FIG. 12a: MP.sub.shift=2, BER=3.01.Math.10.sup.1, custom character .sub.i.sup.2=54.36, FIG. 12b: MP.sub.shift=1, BER=2.1.Math.10.sup.1, .sub.i.sup.2=30.40, FIG. 12c: MP.sub.shift=0, BER=2.16.Math.10.sup.3, .sub.i.sup.2=7.07, FIG. 12d: MP.sub.shift=1, BER=1.15.Math.10.sup.3, .sub.i.sup.2=6.32, FIG. 12e: MP.sub.shift=2, BER=2.94.Math.10.sup.3, .sub.i.sup.2 custom character =8.11

(136) In addition, in FIG. 12d, D.sub.ED(H.sub.trainingH.sub.#4.sup.(1))=0.127, is the lowest achieved value, which verifies that optimal solution is obtained.

(137) FIG. 13 shows the experimental measurements, comparing the BER results for various OSNR values, obtained by the data aided approach (training sequence) vs. the proposed blind channel acquisition algorithm (IMDP). The pre-FEC BER level of 10.sup.3 is also shown for convenience. It can be seen that the proposed blind IMDP technique achieves BER values that are in very good agreement with the BER values obtained by the use of training sequence. Thus, it indicates that a reliable channel estimation is obtained by the proposed blind technique, for OSNR values that result in BER<10.sup.2.

(138) The proposed IPMD requires neither additional hardware nor additional complicated calculations. The full blind equalization scheme was implemented in an Application Specific Integrated Circuit (ASIC) and was validated experimentally at the full data rate of 428 Gbit/sec. The overall blind channel acquisition time is measured to be a few milliseconds, which makes it suitable for use in reconfigurable optical network environment that requires 50 msec recovery time.

(139) The above examples and description have of course been provided only for the purpose of illustration, and are not intended to limit the invention in any way. As will be appreciated by the skilled person, the invention can be carried out in a great variety of ways, employing more than one technique from those described above, all without exceeding the scope of the invention.

Blind channel estimation method for an MLSE receiver in high speed optical communication channels

Assignee

Inventors

Cpc classification

Classification Explorer

H04L25/03292

ELECTRICITY

Classification Explorer

H04L25/0238

ELECTRICITY

Classification Explorer

H04L1/0054

ELECTRICITY

Classification Explorer

H04B10/697

ELECTRICITY

Classification Explorer

H04B10/60

ELECTRICITY

International classification

Classification Explorer

H04L1/00

ELECTRICITY

Classification Explorer

H04L25/03

ELECTRICITY

Classification Explorer

H04B10/60

ELECTRICITY

Classification Explorer

H04L25/02

ELECTRICITY

Abstract

Claims

Description