Method and apparatus for correcting optical waveform distortion and optical signal receiving apparatus
12451964 ยท 2025-10-21
Assignee
Inventors
Cpc classification
H04B10/6163
ELECTRICITY
H04B10/2507
ELECTRICITY
International classification
Abstract
This method includes optimizing, by a gradient descent method, a first parameter used in back propagation processing and associated with XPM and a second parameter used in the back propagation processing and associated with SPM and XPM, wherein the back propagation processing is processing to estimate a waveform at a time of transmission by alternately calculating linear terms and nonlinear terms in a nonlinear Schrdinger equation after receiving an optical signal whose waveform shape changed in a transmission line and digitizing a waveform of the received optical signal, and correct, for each channel of plural channels in the transmission line at a time of wavelength-division multiplexing transmission, waveform distortion caused by SPM that occurs in the channel and waveform distortion caused by XPM that occurs in relation with channels other than the channel; and executing the back propagation processing by using the optimized first and second parameters.
Claims
1. A method for correcting optical waveform distortion, comprising: optimizing, by a gradient descent method, a first parameter that is used in back propagation processing and is associated with cross-phase modulation and a second parameter that is used in the back propagation processing and is associated with self-phase modulation and the cross-phase modulation, wherein the back propagation processing is processing to estimate a waveform at a time of transmission by alternately calculating linear terms and nonlinear terms in a nonlinear Schrdinger equation after receiving an optical signal whose waveform shape changed in a transmission line and digitizing a waveform of the received optical signal, and correct, for each channel of plural channels in the transmission line at a time of wavelength-division multiplexing transmission, waveform distortion caused by the self-phase modulation that occurs in the channel and waveform distortion caused by the cross-phase modulation that occurs in relation with channels other than the channel; and executing the aforementioned back propagation processing by using the optimized first and second parameters.
2. The method according to claim 1, wherein the waveform distortion caused by the cross-phase modulation is corrected under approximation that an initial waveform is maintained for an intensity, independent of a propagation distance, and a delay proportional to the propagation distance occurs along a time axis.
3. The method according to claim 1, wherein the second parameter comprises the group velocity dispersion D.sub.2 and nonlinear coefficient g, and the first parameter comprises a walk-off parameter d.sub.n.
4. An apparatus, comprising: a memory; and a processor coupled to the memory and configured to optimize, by a gradient descent method, a first parameter that is used in back propagation processing and is associated with cross-phase modulation and a second parameter that is used in the back propagation processing and is associated with self-phase modulation and the cross-phase modulation, wherein the back propagation processing is processing to estimate a waveform at a time of transmission by alternately calculating linear terms and nonlinear terms in a nonlinear Schrdinger equation after receiving an optical signal whose waveform shape changed in a transmission line and digitizing a waveform of the received optical signal, and correct, for each channel of plural channels in the transmission line at a time of wavelength-division multiplexing transmission, waveform distortion caused by the self-phase modulation that occurs in the channel and waveform distortion caused by the cross-phase modulation that occurs in relation with channels other than the channel.
5. An optical receiver, comprising: a memory; and a processor coupled to the memory and configured to execute back propagation processing by using a first parameter that is used in back propagation processing and is associated with cross-phase modulation and a second parameter that is used in the back propagation processing and is associated with self-phase modulation and the cross-phase modulation, wherein the back propagation processing is processing to estimate a waveform at a time of transmission by alternately calculating linear terms and nonlinear terms in a nonlinear Schrdinger equation after receiving an optical signal whose waveform shape changed in a transmission line and digitizing a waveform of the received optical signal, and correct, for each channel of plural channels in the transmission line at a time of wavelength-division multiplexing transmission, waveform distortion caused by the self-phase modulation that occurs in the channel and waveform distortion caused by the cross-phase modulation that occurs in relation with channels other than the channel, and wherein the first and second parameters are optimized by a gradient descent method.
Description
BRIEF DESCRIPTION OF DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
(21)
(22)
(23)
(24)
(25)
(26)
(27)
(28)
(29)
(30)
(31)
(32)
(33)
(34)
(35)
(36)
(37)
(38)
(39)
(40)
(41)
(42)
(43)
(44)
(45)
(46)
(47)
MODES TO IMPLEMENT PRESENT INVENTION
Basic Idea in Embodiment of Present Invention
(48) A technique of correcting nonlinear waveform distortion according to the present embodiment targeting at a WDM signal whose frequency interval is will be described.
[Expression 3]
A.sub.p(t)=.sub.nA.sub.p,n(t)e.sup.i.sup.
(49) In this regard, A.sub.p, n(t) represents an envelope amplitude of a baseband (that means that the center frequency is 0) related to a polarization component p of the channel n.
(50) Moreover, only the linear terms of the nonlinear Schrdinger equation expressed in expression (1) are written out as follows.
(51)
(52) A solution of expression (4) for a waveform obtained by using, as an input, a waveform A.sub.p, n at a coordinate z=0 (z=0, t) in a longitudinal direction of an optical fiber and after propagation over a distance h can be separated for each channel in a frequency domain and is as follows.
(53)
(54) Here, D.sub.2(h)=.sub.2h/2 and D.sub.3=.sub.3h/6 are respective cumulative values of second-order and third-order group velocity dispersions. Furthermore, T.sub.n(h, .sub.n)=(.sub.2.sub.n+.sub.3.sub.n.sup.2/2)h represents group delay (walk-off) produced in the signal of the channel number n. The waveform of expression (5) is represented in a time domain using an operator F that represents Fourier transform as follows.
(55)
(56) On the other hand, only nonlinear terms in the nonlinear Schrdinger equation (1) are written out as follows.
(57)
(58) Here, a term of propagation loss on the right side is originally a linear term, yet is included to take into account a change of nonlinearity that occurs as a signal intensity attenuates due to propagation loss. In a case where an envelope amplitude B.sub.p is defined as A.sub.p=B.sub.p(z)exp(z/2), B.sub.p represents an amplitude from which the attenuation amount due to propagation loss is separated, and expression (7) is expressed for B.sub.p as follows.
(59)
(60) Here, (z)=.sub.0exp(z) holds.
(61) Expression (8) expresses that, when a signal amplitude attenuates due to the propagation loss, a nonlinear effect in the nonlinear Schrdinger equation can be described in such a way that the original nonlinear coefficient .sub.0 attenuates in the longitudinal direction. When expression (8) is decomposed for each channel, the amplitude of the channel number 0 is expressed as follows.
(62)
(63) On the right side of expression (9), a term including |B.sub.p, 0| (p=1, 2) that is a time waveform of the signal intensity of the channel number 0 causes SPM, and a term including |B.sub.p, n| (p=1, 2) that is a time waveform of the signal intensity of the channel number n0 causes XPM.
(64) As proposed by Non-Patent Literature 4, by introducing some assumptions related to a change of the waveform at a time of propagation in expression (9), it is possible to obtain an approximate solution. The first assumption is that, supposing that a time waveform intensity of the signal of the channel number 0 is invariable with respect to the distance z, B.sub.p, 0(z, t)|.sup.2=|B.sub.p, 0(0, t)|.sup.2 is assumed, and this assumption is usually used for calculation of the split-step Fourier method.
(65) The second assumption is that, while a time waveform of the signal intensity of the channel number n0 is invariable with respect to the distance z, |B.sub.p, n(z, t)|.sup.2=|B.sub.p, n(0, td.sub.nz)|.sup.2 is assumed taking the occurrence of the group delay due to walk-off into account. This equation expresses that, while the shape of the intensity does not depend on a propagation distance and is kept as the initial waveform, the delay d.sub.nz proportional to the propagation distance occurs in a time domain. Here, do expressed below represents a parameter that represents walk-off, and corresponds to a reciprocal of the group velocity.
(66)
(67) As a result of introduction of the assumptions, it is possible to integrate expression (9) with z in the frequency domain, and obtain a following solution for the waveform after propagation over the distance h.
(68)
(69) In this regard, when the integration of the length h is performed with respect to the distance z to derive expressions (11) to (13), an integration interval is [h/2, h/2]. Moreover, symbols used for these expressions are defined as follows.
(70)
(71) The linear step of the split-step Fourier method is described by expression (6), and the nonlinear step that takes effects of both of SPM and XPM into account is described by expression (11) for the signal waveform of the channel number 0. Hereinafter, a method for performing calculation of back-propagating an optical signal waveform received through a certain optical fiber transmission line based on these expressions, and estimating a transmitted waveform will be described.
(72) To describe a procedure of back propagation calculation,
(73)
(74) Note that, even in a case where the ratio of the lengths of respective steps is equally partitioned in the span, the present embodiment is applicable as is. Moreover, although a difference in how to split steps in a span influences a waveform distortion correction result according to the methods that are described in Non-Patent Literatures 2 and 4 and do not perform the learning, Non-Patent Literatures 5 to 8 and the present embodiment, which perform the learning, do no more than influence only initial value setting of the learning, and have no difference after the learning converges and is optimized. Moreover, in a case where the number of steps per span is one, the entire one span is calculated as one step.
(75) Next, when calculation of each step is performed, calculation is performed based on the symmetrical type split-step Fourier method (see Non-Patent Literature 3). That is, one step whose distance is h is equally split into a first half and a second half, and calculation that uses expression (6) as the linear step is first performed on a distance h/2 of the first half of the step.
(76) Next, by using a waveform obtained as an output of the linear step as an input, calculation according to expression (11) of the nonlinear step is performed on the distance h of the entire step. Lastly, by using a waveform obtained as an output of the nonlinear step as an input, calculation according to expression (6) of the linear step is performed on the distance h/2 of the second half of the step. Although this operation is repeated per step, calculation is performed for the linear step of a second half of a certain step together with the linear step of a first half of a next step.
(77) In the example in
(78) Although a signal waveform of each channel independently develops in the linear step of expression (6), signal waveforms of other channels are taken into account to perform calculation taking XPM into account when development of a signal waveform of one channel is calculated in the nonlinear step indicated in expressions (11) to (13).
(79) In a case where the input and output waveforms in a j-th linear step L.sup.(j) are put as z.sub.p, n.sup.(j) and y.sub.p, n.sup.(j), input/output waveforms of a nonlinear step N.sup.(j) are y.sub.p, n.sup.(j1) and z.sub.p, n.sup.(j). Moreover, as illustrated in
(80) Next, a method according to the present embodiment for optimizing parameters used for the back propagation calculation by the gradient descent method, and maximizing the performance of the nonlinear waveform distortion correction will be described. A specific learning method will be described below. An error function J() is defined as follows.
[Expression 16]
J()=.sub.t|e.sub.t()|.sup.2(16)
(81) Here, represents a vector including the parameters used for the back propagation calculation, and e.sub.t()=y.sub.t()d.sub.t represents an error at a time t. y.sub.t() represents a value of the signal waveform at the time t after the back propagation calculation and is regarded as a function of the parameter , and d.sub.t represents a value of a desired signal at the time t. Here, a waveform of a transmitted signal is used as the desired signal.
(82) Here, a form of the error function in equation (16) is referred to as a Mean Squared Error (MSE). A dataset is composed by plural sets of the waveform d.sub.t of the transmitted signal and a waveform y.sub.t() obtained after calculation for a back propagation transmission line of the parameter is performed, and the parameter is optimized so as to minimize the error function J() by the stochastic gradient descent method based on repeated calculation. As described in Non-Patent Literature 1, an update formula for the parameter based on the stochastic gradient descent method can be obtained as follows.
(83)
(84) In this regard, .sub.1 represents a parameter vector that is obtained as an i-th update result, represents a minute positive number for determining a learning speed, and J represents a gradient of the error function for the parameter and is calculated as represented in a following expression.
(85)
(86) Here, the desired signal, that is, the transmitted signal does not depend on the parameter , and a following expression is held.
(87)
(88) Therefore, the update formula for the parameter can be eventually obtained as follows.
(89)
(90) When all parameters that are elements of and are being subjected to the back propagation calculation are updated using expression (20), y.sub.t/ that is a gradient for each parameter is used for the output signal y.sub.t. A calculation formula for calculating this is derived according to the differential chain rule.
(91) According to expression (6), the relationship between the input and output waveforms z.sub.p, n.sup.(j) and y.sub.p, n.sup.(j) in the linear step L.sub.p, n.sup.(j) is as follows.
(92)
(93) By directly differentiating expression (21), the following expression can be obtained.
(94)
(95) Consequently, it is possible to calculate gradients of D.sub.2.sup.(j) and D.sub.3.sup.(j) of the output waveform y.sub.p, n.sup.(j) using the input waveform z.sub.p, n.sup.(j). The gradients obtained here are sent to the next step according to the differential chain rule, and is finally modified in a form of gradients for the output waveform y.sub.p, n=y.sub.p, n.sup.(m). A gradient of y.sub.p, n.sup.(j) for an arbitrary parameter included in a step before L.sub.p, n.sup.(j) can be obtained as follows.
(96)
(97) Consequently, it is possible to calculate the gradient of the output waveform y.sub.p, n.sup.(j) for the parameter using a gradient z.sub.p, n.sup.(j)/ output from an immediately prior nonlinear step. In this regard, a walk-off value T.sub.n.sup.(j) is expressed as follows using walk-off parameters d.sub.n.sup.(j) and d.sub.n.sup.(j+1) included in the nonlinear steps N.sub.p, n.sup.(j) and N.sub.p, n.sup.(j+1) before and after the linear step L.sub.p, n.sup.(j).
(98)
(99) Here, h.sub.j represents an interval width of the nonlinear step N.sub.p, n.sup.(j).
(100) Next, a relationship between the input and output waveforms y.sub.p, n.sup.(j) and z.sub.p, n.sup.(j) in the nonlinear step N.sub.p, n.sup.(j) for the signal waveform of the channel 0 is as follows according to expression (11).
(101)
(102) In this regard, P.sub.p, n.sup.(j1)=|y.sub.p, n.sup.(j1)|.sup.2 represents the intensity of an input waveform y.sub.p, 0.sup.(j1), and also has the following relationship.
[Expression 29]
{tilde over (p)}.sub.p,n.sup.(j1)()=[P.sub.p,n.sup.(j1)(t)]29)
(103) Then, the gradient of the output waveform z.sub.p, n.sup.(j) for parameters g.sup.j), .sup.j), .sub.0.sup.j), .sub.n.sup.(j), and d.sub.n.sup.(j) used in the nonlinear step N.sub.p, n.sup.(j) can be obtained as follows by directly differentiating expression (26).
(104)
(105) Consequently, it is possible to calculate the gradient using the output waveform z.sub.p, n.sup.(j), the intensity P.sub.p, n.sup.(j1) of the input waveform, and a frequency waveform P.sub.p, n.sup.(j1) thereof (P is a symbol with above P).
(106) Moreover, for the gradient of z.sub.p, n.sup.(j) for the arbitrary parameter included in a step before N.sub.p, n.sup.(j), the following equations can be obtained by differentiating expression (26) with .
(107)
(108) Note that, although expressions (26) to (36) describe the calculation formulae in the nonlinear steps for the signal of the channel number 0, it is possible to describe a calculation formula likewise for a signal of a general channel number.
(109) To sum up, expressions (22) and (23) are used in the linear step and equations (30) to (34) are used in the nonlinear step to respectively calculate the gradient of the output waveform of that step, which is represented by the parameters used in that step, and the calculation result is passed to a next step. Moreover, expression (24) is used in the linear step and expressions (35) and (36) are used in the nonlinear step to update the gradient passed from the previous step to a gradient of an output of that step, and the updated gradient is passed to a next step. By continuing such calculation from an input side to an output side, it is possible to calculate a gradient of the final output waveform y.sub.p, n=y.sub.p, n.sup.(m), which is represented by all parameters used for the back propagation calculation, and update the parameters according to expression (20).
(110) Note that, in
(111) Furthermore, in the example described below, for a signal other than a channel number n=0, calculation for the nonlinear step is not performed, and only calculation for the linear step is performed. Although XPM nor SPM is corrected in a case where calculation for the nonlinear step is ignored, it has been confirmed that it is possible to ignore the nonlinear waveform distortion produced in waveforms of channels other than the channel number n=0 when an influence caused by XPM on the channel of the channel number n=0 due to channels other than the channel number n=0 is calculated.
(112) According to the above-described method, it is possible to optimize the transmission line parameters for correcting the nonlinear waveform distortion including XPM by the gradient descent method, and maximize the correction effect.
(113) A method for reducing a calculation amount by selecting parameters to be optimized, and further applying approximation thereto will be described below.
(114) Expression (23) expresses a method for calculating a gradient for the third-order group velocity dispersion of a transmission line, however, an influence of the third-order group dispersion on a waveform of a single channel is little to such a degree that this influence can be ignored for a signal whose symbol rate is several tens of Gbaud or less, therefore, a certain initial value can be set and then fixed without performing the learning, or a third-order group velocity dispersion effect itself can be also ignored. Here, it is assumed that the third-order group velocity dispersion is fixed without performing the learning after the initial value is set. In this regard, although the third-order group velocity dispersion causes walk-off between channels to change at a second-order with respect to a frequency difference between the channels, this effect is taken into account to set walk-off T.sub.n.sup.(j) and an initial value of the walk-off parameter d.sub.n.sup.(j).
(115) Although expressions (32) and (33) express methods for calculating a gradient related to a loss coefficient of each channel, the loss coefficient is a parameter that can be easily measured, and therefore it is assumed that the initial value is kept fixed without calculating the gradient.
(116) Next, how the gradient propagates is restricted.
(117) An output waveform y.sub.1, 0.sup.0) of the linear step L.sub.1, 0.sup.(0) is sent to a nonlinear step N.sub.p, n.sup.(1) (p=1, 2; n=0, 1). An output waveform z.sub.p, n.sup.(1) is calculated in each nonlinear step according to expressions (26) to (28), and a gradient z.sub.p, n.sup.(1)/ for =D.sub.2.sup.(0) is calculated according to expressions (35) and (36) and sent to a subsequent step. The gradient for D.sub.2.sup.(0) propagates to a nonlinear step N.sub.1, 0.sup.(2) via a linear step L.sub.p, n.sup.(1) (p=1, 2; n=0, 1).
(118) The solid lines in
(119)
(120) To sum up the above-described method with the approximation, calculation can be performed in the linear step by using expression (21) as the development of the waveform, expression (22) as the calculation formula of the gradient related to second-order group velocity dispersion used in that step, and expression (24) as the update formula for the gradient related to the arbitrary parameter having propagated from a previous step. Moreover, in the nonlinear step, it is possible to use expressions (26) to (28) as the development of the waveform, expressions (30), (31), and (34) as the calculation formulae of the gradients related to the parameters g.sup.(j), .sup.(j), and d.sub.n.sup.(j) used in that step, and expressions (35) and (37) as the update formulae for the gradients related to the arbitrary parameter having sent from a previous step.
(121)
(122) Next, although the waveform is developed from y.sub.p, n.sup.(j) to z.sub.p, n.sup.(j+1) according to expressions (26) to (28) in a nonlinear step N.sup.(j+1), the gradient of the waveform (z.sub.p, n.sup.(j+1))/g.sup.(j+1), z.sub.p, n.sup.(j+1)/.sup.(j+1), z.sub.p, n.sup.(j+1)/.sub.0.sup.(j+1), z.sub.p, n.sup.(j+1)/.sub.n.sup.(j+1), and z.sub.p, n.sup.(j+1/d.sub.n.sup.(j+1)) for each transmission line parameter (g.sup.(j+1), .sup.(j+1), .sub.0.sup.(j+1), .sub.n.sup.(j+1), and d.sub.n.sup.(j+1), used in N.sup.(j+1) is calculated by arithmetic operations according to expressions (30) to (34). Moreover, the gradients of the waveforms for all of the transmission line parameters included in a step before N.sup.(j+1) are calculated by arithmetic operations according to expressions (35) and (37), and y.sub.p, n.sup.(j)/ is updated to z.sub.p, n.sup.(j+1)/. Note that expression (36) may be used instead of expression (37). z.sub.p, n.sup.(j+1)/g.sup.(j+1), z.sub.p, n.sup.(j+1)/.sup.(j+1), z.sub.p, n.sup.(j+1/.sub.0.sup.(j+1), z.sub.p, n.sup.(j+1)/.sub.n.sup.(j+1), and z.sub.p, n.sup.(j+1/d.sub.n.sup.(j+1)are included in z.sub.p, n.sup.(j+1)/, and are collectively sent to L.sup.(j+1) that is a next step.
(123) By repeating such calculation per step, y.sub.t/ that is the gradient of a final output y.sub.t, for all of the parameters in the transmission line is calculated.
(124) Furthermore,
(125) Note that, in a case where it is found that the loss coefficients .sub.0 and .sub.n of the optical fiber and a value of the coefficient that corresponds to cross-polarization cross phase modulation is one, learning these transmission line parameters may be omitted. Moreover, in a case where the third-order dispersion effect in a channel can be ignored, the learning of D.sub.3 may be omitted.
System Configuration According to Embodiment
(126)
(127)
EXAMPLE
(128) An effect of the nonlinear waveform distortion correction for a WDM optical signal by a method described in the present embodiment will be described based on a specific example that is based on optical transmission simulation that uses numerical value calculation. An optical signal that is a target to study is a signal obtained by performing 9-channel wavelength division multiplexing on Dual-Polarization (DP) 64-Quadrature Amplitude Modulation (QAM) signal whose symbol rate is 32 Gbaud at 50 GHz in frequency interval, random noise is given to this signal, and an SN ratio is set to 25 dB. Assume that the spectrum of a transmitted signal is one that a root Nyquist filter whose roll-off factor is 0.05 is applied. Channel numbers 4 to +4 are allocated to WDM signals of nine channels in order from a lower frequency, and signal quality of the center channel number 0 is focused upon to test an operation of the nonlinear waveform distortion correction.
(129) One span includes a Standard Single-Mode Fiber (SSMF) whose length is 80 km and an optical amplifier that amplifies the propagation loss of the optical fiber, transmission lines of four spans to 10 spans is assumed, and calculation for transmitting an optical signal using these transmission lines is performed. In an ideal transmission line, assuming that SSMFs of all spans have the same parameters, second-order and third-order group velocity dispersion values are put as 16.641 ps/nm/km and 0.06 ps/nm.sup.2/km, respectively, a nonlinear coefficient is put as 1.3 W.sup.1km.sup.1, and a propagation loss coefficient is put as 0.192 dB/km. On the other hand, in a realistic transmission line, various parameters are different per span or per spot, power of an optical signal also fluctuates from an ideal state, and an effect of the group velocity dispersion and a magnitude of the nonlinear effect eventually vary. A transmission line for which the second-order group velocity dispersion value and the nonlinear coefficient are fluctuated per step as illustrated in
(130) Numerical value calculation for performing optical transmission simulation handles waveform data that was sampled 32 times the symbol rate on the time axis. By setting one of 4, 6, 8 and 10 as the number of spans and using the split-step Fourier method in which 800 is set as the number of steps per span in the ideal transmission line having the same parameters in each span or in a transmission line having the parameters illustrated in
(131) Transmission signal data whose number of symbols is 16384 is generated based on a random bit pattern for respective conditions of the numbers of transmission spans and launch power to study, transmission simulation is performed, and an output waveform is stored. Sets of input waveforms and output waveforms of transmission lines are used as datasets, 200 datasets in total are used, and parameters of the back propagation calculation are learned by the gradient descent method that uses the above expressions such that the nonlinear waveform distortion correction is optimized. In the back propagation calculation accompanied by learning of the parameters according to the present example, the number of steps per span is set to one. By contrast with this, in the back propagation calculation that is not accompanied by the learning, both of cases where the numbers of steps per span are one and two will be considered, and the effect is compared with those in the cases where the learning is not performed.
(132) While the gradient descent method is used to learn each parameter, a method called AdaBelief proposed in Non-Patent Literature 9 is used as a specific implementation method for updating each parameter. Note that the learning is possible using methods other than AdaBelief. The update formula of AdaBelief is obtained as follows by putting .sub.0=0 and .sub.0=0.
(133)
(134) In this regard, J.sub.i represents a gradient of an error function for the parameter at a time of i-th update, and b.sub.1=0.9, b.sub.2=0.999, and e=10.sup.8 are constants. in expression (38) represents a learning coefficient, and a suitable value is set thereto for each parameter. In the present example, the learning coefficient for the parameter D.sub.2 corresponding to the second-order dispersion value is put as =1.0, =2.010.sup.6 is used for the parameter g corresponding to the nonlinear coefficient, =2.010.sup.5 is used for the parameter corresponding to the cross-polarization cross phase modulation coefficient, =2.010.sup.4 is used for the parameter d.sub.n corresponding to the walk-off, and update according to expression (38) is repeated 30000 times to perform the learning. Note that the number of datasets is 200, and therefore an arrangement order of the datasets is rearranged at random every update is performed 200 times to perform repeated learning. Note that this operation does not cause overtraining. Moreover, when similar learning is performed by changing conditions such as a signal modulation format and the number of channels, the learning may be performed 30000 times or more until the parameters converge if necessary. All of 16384 symbols included in the dataset are used as y.sub.t and d.sub.t in expression (20). Another waveform that has a different bit pattern and whose number of symbols is 262144 is used for testing work after the learning to evaluate signal quality after the nonlinear waveform distortion correction.
(135) First, a result of the nonlinear waveform distortion correction in the ideal transmission line whose parameters have the same values in all spans will be described.
(136)
(137)
(138)
[Expression 39]
Q.sup.2[dB]=20 log.sub.10[{square root over (2)}erfc.sup.1(2BER)](39)
(139) The results in
(140) As for a case of the nonlinear waveform distortion correction by the physical phenomenon-specialized type neural network reported in Non-Patent Literature 8, Non-Patent Literature 8 reports a result that, while a result of 1 step/span is not substantially different from the result of 2 steps/span in a case of a transmission line of 12 spans, a reasonable correction effect could be obtained even though the performance deteriorated even in a case of a configuration of 0.5 step/span, i.e., a case where the number of steps was six. In view of the above, in a case where the method according to the present embodiment is used, it is considered that it is possible to obtain a sufficient correction effect with the number of steps less than that of 2 steps/span, and obtain a reasonable correction effect from a configuration less than 1 step/span.
(141)
(142)
(143) Next,
(144)
(145)
(146) Similar to
(147) The effect of the embodiment was tested by a loop transmission experiment in addition to the above-described simulation results.
(148) As a configuration of the Transmitter (Tx), continuous light of 11 channels with different wavelengths, is output from a wavelength tunable light source, is synthesized by a 161 polarization maintaining coupler, is then input to a Lithium Niobate (LN) dual-polarization IQ modulator, is modulated to an optical signal waveform by electric signals applied to the modulator, and is output. The center frequency is 193.1 THz (the wavelength is 1552.524 nm), the frequency of the continuous light of the 11 channels is set at an interval of 50 GHz from 192.85 THz to 193.35 THz, and a channel number n=5, 4, . . . , 4, and 5 is assigned in order from a lower frequency. The electric signals of four channels to be applied to the modulator are generated by an arbitrary waveform generator whose sampling rate is 64 GSample/s, and each channel corresponds to an X polarization I channel component, an X polarization Q channel component, a Y polarization I channel component, and a Y polarization Q channel component of a dual-polarization IQ modulation signal. These electric signals of the four channels are respectively amplified by a driver amplifier, and applied to the modulator. An optical signal output from the modulator is a dual-polarization QAM signal whose symbol rate is 32 Gbaud, and that has a root Nyquist waveform whose roll-off factor is 0.1. As modulation formats, a uniform distribution 16-QAM signal whose number of bits per single polarization single symbol is four bits, and a Probabilistically Shaping (PS) 64-QAM signal whose number of bits per single polarization single symbol is five bits are used. For each modulation format of 16 QAM and PS-64 QAM, four patterns of the signal waveform that is modulated based on a random bit pattern and that includes 65536 symbols per single polarization are generated. When one of the four patterns is selected, the Tx repeatedly transmits the waveform of this pattern. Note that, although the WDM signals of the 11 channels obtained by modulation are modulated to the same waveforms across all channels, performing long distance transmission through a transmission line in which group velocity dispersion occurs causes the walk-off (group delay between channels), and therefore random XPM between waveforms occurs after the long distance transmission.
(149) Optical power of the WDM signals of the 11 channels generated by the Tx is amplified by the optical amplifier, and is then adjusted by a Variable Optical Attenuator (VOA). After that, optical noise outside a signal band is removed by a Band-Pass Filter (BPF), and the WDM signals are input to an Acoustic Optical Modulator (AOM) that is a switch that switches a loop transmission operation. The signal is input to and output from a loop transmission line through a 3 dB coupler. The loop transmission line includes in order from an input side an optical amplifier, a BPF, a VOA, an SSMF whose length is 84.1 km, an optical amplifier, a BPF, a VOA, an SSMF whose length is 80.5 km, an optical amplifier, an isolator (rightward arrow), a polarization scrambler (Pol. Scrambler), and an AOM. That is, one loop includes the SSMFs of two spans. In the experiment, the transmission distance is set to one of six, eight, 10, 12, 14, and 16 spans, and a signal is transmitted over each distance to test an effect of the nonlinear waveform distortion correction. Note that the SSMF whose length is 84.1 km and the SSMF whose length is 80.5 km have slightly different dispersion characteristics, a measurement result represents that a group velocity dispersion value and a dispersion slope value at 193.1 THz in frequency in a case of the former SSMF are 17.14 ps/nm/km and 0.062 ps/nm.sup.2/km, respectively, and the group velocity dispersion value and the dispersion slope value in a case of the latter SSMF are 16.55 ps/nm/km and 0.058 ps/nm 2/km, respectively, and the group velocity dispersion value and the dispersion slope were estimated as 16.85 ps/nm/km and 0.060 ps/nm.sup.2/km, respectively, as the average characteristics of one loop of the loop transmission line.
(150) As for the signal output from the loop transmission line, only one channel of the 11 channels is extracted by the BPF whose passband is 50 GHz, and is amplified by the optical amplifier, and then is input to a Receiver (Rx). The Rx is a digital coherent receiver that includes a 4-channel real time oscilloscope of 80 GSample/s whose electric band is 33 GHz, a Local Oscillator (LO), and an optical front end, demodulates, by offline digital signal processing, a real time waveform acquired by the oscilloscope, and then performs signal processing for the nonlinear waveform distortion correction offline likewise. In this regard, in the experiment, only signal quality of the center channel whose channel number is n=0 is focused upon, the learning for the nonlinear waveform distortion correction according to the embodiment is performed to maximize this signal quality, and the signal quality after the correction is evaluated.
(151) To perform the back propagation calculation for the nonlinear waveform distortion correction offline after the WDM signal is received, waveforms of all of the 11 channels are received for each channel. Normal demodulation processing that performs up to evaluation on signal quality without performing the back propagation calculation includes dispersion compensation, application of the same root Nyquist filter as that applied at a time of transmission as a matched filter, polarization rotation and demultiplexing of dual-polarization components, resampling to 2 samples/symbols, retiming, carrier frequency estimation and compensation, carrier-phase recovery and 3-tap feed forward-type linear adaptive equalization processing, and symbol decision and acquisition of a bit pattern. Here, the 3-tap equalization processing is butterfly-type 22 MIMO processing that can handle a dual-polarization signal, and is effective to compensate for the polarization crosstalk that occurs due to birefringence in a transmission line, XPM, and the like. On the other hand, the signal waveform demodulated in this way greatly changes from the waveform at the time of reception as a result of application of the root Nyquist filter, and the back propagation calculation cannot be applied as is. Hence, the root Nyquist filter is not applied to the waveform after the dispersion compensation is applied during the normal demodulation processing, the processing performed in the subsequent demodulation process is performed likewise, and the waveform immediately after reception is reproduced as closely as possible.
(152) Incidentally, originally, the optical signals of all of the channels should be simultaneously received by using plural transceivers, the back propagation calculation for the nonlinear waveform distortion correction according to the present embodiment should be performed by using waveforms of all of the channels without demodulating them, and the demodulation should be performed finally. However, in the experiment conducted herein, since the one Rx is used to receive and demodulate each channel in order, the measured waveforms of all of the channels are not synchronized between the channels. Especially, in a WDM signal after transmission through an optical fiber of a long distance, the group delay (walk-off) between the channels occurs in addition to the linear waveform distortion by the effect of the group velocity dispersion, and when the back propagation calculation is performed, it is necessary to start calculation while keeping an accurate group delay amount at the time of reception. However, if the back propagation calculation is performed as is without demodulating the waveform received in an asynchronous manner, the aforementioned condition is not satisfied. Hence, a procedure is adopted that the signals of all of the channels are independently received once, demodulation including dispersion compensation is performed, known pilot symbols are detected from resulting waveforms, timings of all of the channels are synchronized, the compensated group velocity dispersion values are allocated again to give the walk-off to each channel, a WDM signal waveform that would be obtained at a time of collective reception is reproduced, and the back propagation calculation is started.
(153) In the experiment, under each condition of a different number of transmission spans and launch power, a signal of each modulation format is transmitted, received, and demodulated, and a transmitted waveform that is common to all channels and a collectively received waveform of all channels described above are synthesized to generate a dataset. Four datasets are generated for each modulation format in association with four different waveform patterns. Note that, to correctly process the walk-off that occurs during the back propagation calculation, measurement is performed such that symbols much more than 65536 symbols included in one period of the transmitted waveform are included at a time of reception. A frequency difference between the center signal whose channel number is n=0 and an edge channel whose channel number is n=5 is 250 GHz, the maximum number of transmission spans is 16. Therefore, a maximum value of walk-off is estimated as approximately 43500 ps. This walk-off value corresponds to approximately 1400 symbols with respect to a 32-Gbaud signal, and therefore by performing measurement under a condition including the number of symbols greater than the 1400 symbols, it is possible to correctly calculate a walk-off influence when the back propagation is performed using a dataset having a finite time width. Hence, a waveform obtained by adding 5000 symbols to each of both edges in the time domain of the waveform whose one period is 65536 symbols is used to form a dataset.
(154) Prior to the experiment, the following new facts were found by advance study based on simulation. That is, by learning the transmission line parameters only for a waveform obtained for high launch power (e.g., launch power per channel is +3 dBm) equal to or more than a certain value using a signal of a modulation format (e.g., DP-16QAM) having certain complexity, it is possible to apply the resulting parameters to the nonlinear waveform distortion correction for a signal of an arbitrary modulation format whose launch power is equal to or less than the values used for the learning. Based on this fact, in the experiment, the DP-16-QAM signal whose launch power per channel is +3 dBm is transmitted, received, and demodulated to form a dataset, and learn transmission line parameters used for the back propagation calculation. Next, a DP-PS-64-QAM signal whose launch power per channel is 5 dBm to +2 dBm is transmitted, received, and demodulated to form a dataset, perform the nonlinear waveform distortion correction using the previously obtained transmission line parameters, and evaluate an improvement amount of signal quality.
(155) At a time of learning the transmission line parameters, in one learning step, one of four datasets of the DP-16-QAM signal is selected at random to perform the back propagation calculation, then continuous 1024 symbols among 65536 symbols are selected at random, an amplitude waveform of these symbols is put as an output signal waveform y.sub.t in expression (20), a corresponding transmitted signal waveform is used as the desired signal d.sub.t, and a gradient of each parameter is calculated from an error signal to update parameters. By performing such learning, it is possible to randomize the learning process for a limited number of datasets, and advance the learning without causing overfitting. Note that above-described AdaBelief is used as an algorithm of the gradient descent method used for learning.
(156)
(157) The learning was finished, and the transmission line parameters to perform the nonlinear waveform correction was performed were obtained for each number of transmission spans. An effect of the nonlinear waveform distortion correction for the PS-64-QAM signals of the 11 channels whose launch power range per channel was a range of 4 dBm to +2 dBm was tested based on these parameters.
(158) Hereinafter, a calculation amount required in a case where the nonlinear waveform distortion correction according to the embodiment is performed and a calculation amount required in a case where a conventional technique is used are compared to indicate that, under a condition of a certain number of channels or less, the calculation amount required for the method according to the embodiment is substantially the same as the calculation amount required for the conventional method. Note that the calculation amount in a case where the back propagation calculation is performed fixing the optimized parameters obtained by finishing the learning will be focused upon hereinafter, although the nonlinear waveform distortion correction according to the embodiment means that the waveform correction is performed by the back propagation calculation after learning optimized values of the transmission line parameters by using the gradient descent method in order to perform the back propagation calculation of 1 step/span while taking both of distortions caused by SPM and XPM into account. The calculation amount in this case is the same as that in a case of the back propagation calculation that takes XPM into account in 1 step/span, and does not perform the learning. Moreover, the conventional technique supposes that only the correction of the distortion caused by SPM is taken into account, the distortion caused by XPM is not corrected, and the back propagation calculation of 2 steps/span is performed. As a precondition for deriving the calculation amount, it is assumed that results of numerical parameters that can be fixed irrespectively of an input waveform are calculated in advance and stored in a Look-Up Table (LUT), and are read and used every time a different waveform is input, and the number of times of arithmetic operations necessary for that calculation itself is not taken into account.
(159) The number of spans is put as S, a data length is put as N, the number of channels to be taken into account for calculation for XPM correction is put as C, and the number of times of calculation per channel and per polarization is calculated. A calculation load of the DSP is mainly a product arithmetic operation, and therefore the number of times of product arithmetic operations is calculated. Assuming that the number of times of product arithmetic operations of real numbers required for a product of complex numbers,
ab=Re[a]Re[b]Im[a]Im[b]+i(Re[a]Im[b]+Re[b]Im[a]),
is four, a total value of the numbers of times of product arithmetic operations of the real numbers is calculated. Furthermore, the number of times of product operations of complex numbers to perform FFT on a complex number signal whose size is N=2.sup.n, is generally N(log.sub.2N2)/2, and therefore the number of times of production operations of the real numbers is 2N(log.sub.2N2) that is four times N(log.sub.2N2)/2.
(160) First, a calculation amount required for expression (6) that is the linear step is estimated. Assuming that A represents a symbol with above A, the number of times of product operations of real numbers to calculate A.sub.p, n(0, )=FA.sub.p, n(0, t) is 2N(log.sub.2N2). A value of exp(h/2+i(D.sub.2.sup.2+D.sub.3.sup.3T.sub.n)) is irrelevant to an input waveform and therefore is stored in the LUT, and the number of times of product arithmetic operations of real numbers to multiply this value to A.sub.p, n(0, ) is 4N. Lastly taking an arithmetic operation required for inverse FFT into account, a total number of times of production operations of real numbers for the entire linear step is 4N+22N(log.sub.2N2)=4N(log.sub.2N1).
(161) Next, the calculation amount required for calculation according to expressions (11) to (13) that are nonlinear steps is estimated. When the nonlinear phase shift amount , which is a real number, is obtained, the number of times of product operations of the real numbers required for calculation of e.sup.i is 6N in total, because, in following fourth-order Taylor expansion, 2N is required for calculation of .sup.2/2, 2N is required for calculation of (.sup.2).sup.2/24 by reusing a result of .sup.2, and 2N is required for calculation of .sup.2/6.
(162)
(163) Moreover, the number of times of product operations of real numbers to calculate B.sub.p, 0(0, t)e.sup.i that is a product of complex numbers is 4N. Next, when phase shift .sub.SPM(t) caused by SPM is calculated according to expression (12), P.sub.p, 0(t)=Re[B.sub.p, 0(t)].sup.2+Im[B.sub.p, 0(t)].sup.2 holds, therefore the number of times of product operations of real numbers is 2N, and, taking multiplication of a coefficient gH.sub.0 into account, the number of times of product operations of real numbers is 3N in total. Furthermore, because N times of product operations is required when an intensity P.sub.3p, 0(t) of an orthogonal polarization component to be additionally supplied is multiplied with a coefficient gH.sub.9, the number of times of product operations of the real numbers to calculate .sub.SPM(t) is 4N in the end.
(164) Next, a calculation amount required to calculate phase shift .sub.XPM(t) caused by XPM according to expression (13) is estimated. The intensity waveform P.sub.p, 0(t) has already been obtained at a time when .sub.SPM(t) is calculated, and the number of times of product operations of the real numbers to calculate P.sub.p, 0() by applying FFT to this intensity waveform P.sub.p, 0(t) is 2N(log.sub.2N2). P.sub.p, 0() does not appear in expression (13), yet needs to be supplied to perform the nonlinear waveform distortion correction of other channels, and therefore a calculation amount of P.sub.p, 0() is taken into account. On the other hand, assuming that P.sub.p, n() in a case of n0, which appears in expression (13), is separately calculated and supplied, a calculation amount necessary therefor is not taken into account. When P.sub.p, n() and P.sub.3-p, n() are multiplied with 2 g and g that are coefficients of real numbers, product arithmetic operations of real numbers need to be performed 2N times for each multiplication, that is, 4N times of product operations of real numbers is required in total. Assuming that H.sub.n(h, ) in expression (14) is stored in the LUT, the number of times of product operations of real numbers required to multiply a result of 2gP.sub.p, n()+gP.sub.3p, n() with H.sub.n(h, ) is 4N, and therefore the number of times of product operations of the real numbers required to obtain H.sub.n(h, ) 2gP.sub.p, n()+.sub.qP.sub.3p, n()) is 8N in total. This arithmetic operation is required for C-1 channels of n0, and therefore the number of times of product operations of the real numbers is 8N(C1). Finally taking into account a calculation amount for applying inverse FFT, the calculation amount required to calculate .sub.XPM(t) is 2N (log.sub.2N2)+8N (C1)+2N (log.sub.2N2)=4N(log.sub.2N+2C4). In view of the above, the calculation amount in a case where .sub.XPM(t) is not taken into account for one nonlinear step is 6N+4N+4N=14N, and the calculation amount in a case where .sub.XPM(t) is taken into account to compensate XPM is 14N+4N(log2N+2C4).
(165) In the back propagation calculation where the number of steps per span is M in a transmission line whose number of spans is S, the number of linear steps is MS+1 in total, and the number of nonlinear steps is MS in total. The above-described result represents that the number of times of production operations of real numbers in a case where XPM compensation is not performed is as follows.
4N(log.sub.2N1)(MS+1)+14NMS
The number of times of production operations of the real numbers in a case where XPM compensation is performed is as follows.
4N(log.sub.2N1)(MS+1)+{14N+4N(log.sub.2N+2C4)}MS
(166)
(167) Although the embodiment of the present invention has been described above, the present invention is not limited to this. Although, for example, the example where the stochastic gradient descent method is used has been described, various variations of the gradient descent method are applicable. Furthermore, as described above, it may be possible to obtain a sufficient effect even by taking influences of both of SPM and XPM into account only for part of steps instead of taking the influences of SPM and XPM into account for all steps.
(168) Note that the DSP includes an arithmetic operation unit and a memory. Furthermore, not only the DSP, but also another processor may execute the above-described processing. Furthermore, a program for causing the processor to execute the above-described processing is recorded in a non-volatile memory, and executed when commands included in the program is read out and executed by the processor at a time of execution. Furthermore, a dedicated circuit or a combination of the dedicated circuit and the DSP or the like may execute the above-described processing.
(169) The aforementioned embodiments are summarized as follows.
(170) A method for correcting optical waveform distortion, which relates to a first aspect in the present embodiments, is an optical waveform distortion correction method for correcting optical waveform distortion by estimating a waveform at a time of transmission by alternately calculating linear terms and nonlinear terms in a nonlinear Schrdinger equation after receiving an optical signal whose waveform shape changed by a nonlinear optical effect and a group velocity dispersion effect of an optical fiber that is a transmission line and digitizing a waveform of the received optical signal, characterized in that calculation is performed taking into account not only waveform distortion caused by self-phase modulation that occurs in a channel but also waveform distortion caused by cross-phase modulation that occurs between channels in a time of wavelength-division multiplexing transmission, parameters used for the calculation are optimized by a gradient descent method, and the number of steps per one span of the transmission line is less than 2. It is possible to improve the accuracy while suppressing a calculation load.
(171) The aforementioned number of steps per one span of the transmission line may be equal to or less than 1. Even when the number of steps is reduced like this, it is possible to improve the accuracy.
(172) Furthermore, the aforementioned parameters may include second-order group velocity dispersion, a nonlinear coefficient and walk-off.
(173) A method for correcting optical waveform distortion, which relates to a second aspect in the present embodiment, includes (A) a step of optimizing, by a gradient descent method, a first parameter that is used in back propagation processing and is associated with cross-phase modulation and a second parameter that is used in the back propagation processing and is associated with self-phase modulation and the cross-phase modulation, wherein the back propagation processing is processing to estimate a waveform at a time of transmission by alternately calculating linear terms and nonlinear terms in a nonlinear Schrdinger equation after receiving an optical signal whose waveform shape changed in a transmission line and digitizing a waveform of the received optical signal, and correct, for each channel of plural channels in the transmission line at a time of wavelength-division multiplexing transmission, waveform distortion caused by the self-phase modulation (SPM) that occurs in the channel and waveform distortion caused by the cross-phase modulation (XPM) that occurs in relation with channels other than the channel; and (B) a step of executing the aforementioned back propagation processing by using the optimized first and second parameters.
(174) As described above, by optimizing not only the second parameter (e.g., D2, D3, g, , and .sub.0) but also the first parameter (e.g., .sub.n and d.sub.n) and executing the back propagation processing to correct the waveform distortion caused by the SPM and XPM by using the optimized first and second parameters, even when the calculation load is suppressed by decreasing the number of steps per one span of the transmission line, it becomes possible to obtain sufficient calculation accuracy.
(175) Incidentally, the aforementioned waveform distortion caused by the cross-phase modulation may be corrected under approximation that an initial waveform is maintained for an intensity, independent of a propagation distance, and a delay proportional to the propagation distance occurs along a time axis. It is possible to further suppress the calculation load of the correction (also called compensation) by such approximation.
(176) Furthermore, the aforementioned second parameter may include group velocity dispersion D.sub.2 and a nonlinear coefficient g, and the aforementioned first parameter may include a walk-off parameter d.sub.n. When limiting parameters to be optimized, it is possible to further suppress the calculation load.