Abstract
This work proposes a novel and powerful adaptive digital back propagation (A-DBP) method with a fast adaption process. Given that the total transmission distance is known, the proposed A-DBP algorithm blindly compensates for the linear and nonlinear distortions of optical fiber transmission systems and networks, without knowing the launch power and channel parameters. An adjoint-based optimization (ABO) technique is proposed to significantly accelerate the parameters estimation of the A-DBP. The ABO algorithm utilizes a sequential quadratic programming (SQP) method coupled with an adjoint sensitivity analysis (ASA) approach to rapidly solve the A-DBP training problem. The design parameters are optimized using the minimum overhead of only one extra system simulation. Regardless of the number of A-DBP design parameters, the derivatives of the training objective function with respect to all parameters are estimated using only one extra adjoint system simulation per optimization iterate. This is contrasted with the traditional finite-difference (FD)-based optimization methods whose sensitivity analysis calculations cost per iterate scales linearly with the number of parameters. The robustness, performance, and efficiency of the proposed A-DBP algorithm are demonstrated through applying it to mitigate the distortions of 4-span and 20-span optical fiber communication systems. Coarse-mesh A-DBPs with less number of virtual spans are also used to significantly reduce the computational complexity of the equalizer, achieving compensation performance higher than that obtained using the coarse-mesh DBP with the exact channel parameters and full number of virtual spans.
© 2022 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement
1. Introduction
Optical fibers play a vital role in modern telecommunication systems and networks [1]. An optical fiber link imposes linear and nonlinear distortions on the propagating light-wave signal due to the inherent dispersive nature and nonlinear behavior of the fiber [2,3]. Although linear impairments can be effectively compensated in coherent optical fiber systems using linear equalizers, the mitigation of nonlinear distortions is considered a major challenge [4]. These nonlinear distortions impede the higher data rate transmission over longer distances. Developing an efficient and computationally inexpensive digital signal processing (DSP) technique to perfectly compensate for the nonlinear fiber impairments is therefore essential and of preeminent importance.
Various DSP-based compensation techniques have been proposed in literature for mitigating the distortions imposed by optical fiber communication systems. These techniques include least mean square equalizers [5], Volterra-series-based equalizers [6], neural network-based equalizers [7], and digital back propagation (DBP) [8–14]. The DBP is one of the most attractive candidates that can principally be used to effectively eliminate the deterministic impairments in optical fiber systems with coherent detection. The basic idea of the DBP is to transmit the received distorted signal through a virtual fiber whose loss, dispersion, and nonlinear parameters are the same in magnitude as that of the actual transmission fiber, but with opposite signs such that the deterministic distortion effects of the transmission link can be fully inverted [9,10]. However, the compensation of fiber impairments using the DBP requires prior knowledge of the physical fiber-optic channel parameters. Such information can only be available at the receiver in the case of a point-to-point transmission scenario. Precise information of the physical fiber channel parametrization may not be available in the case of a dynamic optical network. For instance, transmitted carriers in elastic optical networks (EONs) may propagate through several different routes until reaching the same receiver point [15]. These routes are determined according to the dynamic network configurations. Hence, it becomes challenging and very difficult to determine the fiber channel parameters in such scenarios [16]. Moreover, the physical parameters of optical fiber links and in-line erbium-doped fiber amplifiers (EDFAs) are subject to slight variations, from their nominal values, due to environmental changes.
Many approaches have been proposed for estimating the DBP parameters without knowing the launch power and channel parameters. A semi-blind DBP method is introduced in [17], where the dispersion and nonlinear parameters of the DBP are tuned according to information derived from previously detected data after being recovered using the forward error correction (FEC). However, such approach requires significant delay and hence, its use is limited for high-speed transmission systems. To overcome this delay, adaptive methods that can self-determine the fiber nonlinearity coefficient utilizing gradient-based algorithms are proposed in [16,18,19]. Starting from randomly guessed nonlinear coefficients, the DBP is trained using the known transmitted data to minimize the error vector magnitude [16], the phase-noise variance [18], or the Godard’s error function [19]. More advanced adaptive DBP approaches, based on the assisted filter DBP [11] and the perturbation theory [20], are proposed in [21,22]. These advanced approaches are capable of estimating not only the nonlinear compensation parameters, but also the dispersion compensation parameter and other parameters, e.g. the linear filter bandwidth in case of the assisted filter DBP.
However, in all of these adaptive DBP methods [16,18,19,21,22], the classical finite-difference (FD) approach is utilized to determine the sensitivity information required by the optimizer, during the DBP parameters estimation. The computational cost for obtaining the sensitivity information using FD methods is very expensive $- $ it scales linearly with the number of design parameters. Such overhead together with the expensive computational cost of a single run of the DBP system make the adaptive DBP training cost prohibitive and impractical, especially when a large number of design parameters needs to be estimated. This intensive computational cost can though be significantly reduced through evaluating the sensitivities using an adjoint sensitivity analysis (ASA) approach [23].
The ASA method estimates the sensitivities of the desired objective function with respect to all design parameters using at most one extra system simulation. ASA approaches significantly reduces the computational time cost required by FD approaches, at the expense of using extra memory storage [23]. Various ASA algorithms have been developed in literature based on Maxwell’s equations or the wave equation for high frequency structures [24–28]. An ASA technique is also proposed to estimate the sensitivities of low frequency power applications, e.g., switched reluctance motors [29]. Recently, we proposed an ASA method based on the nonlinear Schrödinger equation (NLSE) to evaluate the sensitivities of a repeaterless single fiber span optical communication system [30,31]. However, the use of this method is limited to forward propagation problems with single fiber span, since it cannot account for the effects of in-line amplifiers. In this paper, we extend this approach to the sensitivity analysis of the general multi-span DBP model. The extended ASA includes both the effect of the inverse nonlinear Schrödinger equation (INLSE) and the in-line amplifier inverse effect. Based on this extended sensitivity analysis approach, we propose an adaptive DBP (A-DBP) scheme, with fast adaption process, to blindly compensate for the linear and nonlinear distortions of optical fiber transmission channels. An adjoint-based optimization (ABO) algorithm, based on the sequential quadratic programming (SQP) method, is proposed to train and optimize the parameters of the A-DBP. All sensitivity calculations required by the ABO algorithm are estimated using the extended ASA approach, which significantly accelerates the A-DBP training process. Regardless of the number of A-DBP design parameters, full gradient information of the A-DBP training problem with respect to all parameters is obtained using only one extra adjoint DBP system simulation. The signal and system parameters required for the DBP such as launch power, fiber lengths, dispersion, loss and nonlinear coefficients of all the fiber spans can be estimated using the proposed approach.
The robustness and efficiency of the proposed A-DBP algorithm is investigated through applying it to mitigate the distortions induced in 4-span and 20-span fiber-optic communication systems. Thanks to the proposed ABO algorithm, the A-DBP could rapidly be trained achieving the optimum compensation performance that is obtained using a conventional DBP with the correct parameters of the channel. Moreover, to reduce the computational complexity of the compensation process, we demonstrate the compensation of the 4-span optical communication scenario using a coarse-mesh A-DBP model with less number of virtual spans.
The remainder of the article is organized as follows: Section 2 provides a brief description of the multi-span DBP mathematical model. In Section 3, we derive the general adjoint system simulation corresponding to the original system simulation of the DBP. A full description of the modified split-step Fourier scheme (SSFS) algorithm required for solving the derived adjoint simulation is also given. Section 3 also provides a complete computational complexity analysis of the modified SSFS algorithm. A detailed description of the proposed ABO algorithm is given in Section 4. In Section 5, we investigate the efficiency of the proposed A-DBP scheme through applying the ABO algorithm to train the DBP for blindly mitigating the inherent distortions in typical 4-span and 20-span fiber-optic communication systems. We also demonstrate the performance of the ABO algorithm training as compared to the finite-difference-based optimization algorithms. Moreover, in Section 5, the proposed ABO algorithm is applied to train coarse-mesh A-DBP models with fewer spans, in order to reduce the computational cost of the compensation of the 4-span optical communication scenario. Finally, conclusions of the work are drawn in Section 6.
2. Multi-span DBP mathematical model
The signal evolution within a DBP span (i.e. virtual fiber + loss element) is described by the inverse nonlinear Schrödinger equation (INLSE) coupled with the inverse gain effect, given as follows [9,10]:
Notice that the analysis throughout this paper considers a single channel with a single polarization system. However, the presented formulation can be extended to a system with polarization-multiplexed signal. In the case of dual polarization, the system state vector ${\boldsymbol V}$ would have $4$ independent components at each spatial cell, rather than $2$, corresponding to the real and imaginary parts of the x and y polarization components of the normalized complex-envelope field. In other words, the system state vector becomes: ${\boldsymbol V} = {\boldsymbol V}({Z,t} )= {[{{\boldsymbol v}_{x|re}^T(t )\; {\boldsymbol v}_{y|re}^T(t )\; \; {\boldsymbol v}_{x|im}^T(t )\; \; {\boldsymbol v}_{y|im}^T(t )\; } ]^T}$, where ${{\boldsymbol v}_{q|re}}$ and ${{\boldsymbol v}_{q|im}}$ represent the real and imaginary parts of the q polarization component of the complex field, respectively, and q is x or y. The system matrices of Eq. (6) would have the same forms but with $4$ block matrices instead of $2$ block matrices. For example, the matrices ${{\boldsymbol B}_3}$ and ${\boldsymbol \varGamma }({\boldsymbol V} )$ becomes: $\left[ {\begin{array}{cccc} {{{\tilde{{\boldsymbol \beta }}}_3}}&0&0&0\\ 0&{{{\tilde{{\boldsymbol \beta }}}_3}}&0&0\\ 0&0&{{{\tilde{{\boldsymbol \beta }}}_3}}&0\\ 0&0&0&{{{\tilde{{\boldsymbol \beta }}}_3}} \end{array}} \right]$ and $\left[ {\begin{array}{cccc} 0&0&{ - \widetilde {\tilde{{\boldsymbol \gamma }}}}&0\\ 0&0&0&{ - \widetilde {\tilde{{\boldsymbol \gamma }}}}\\ {\widetilde {\tilde{{\boldsymbol \gamma }}}}&0&0&0\\ 0&{\widetilde {\tilde{{\boldsymbol \gamma }}}}&0&0 \end{array}} \right]$, respectively, where $0$ is $p \times p$ zero matrix and the ${k^{th}}\; $ diagonal component of the diagonal matrix $\widetilde {\tilde{{\boldsymbol \gamma }}}({\boldsymbol V} )$ becomes: $\frac{8}{9}{\gamma _k}{P_0}[{v_{x|re}^2({kh} )+ v_{y|re}^2({kh} )+ v_{x|im}^2({kh} )+ v_{y|im}^2({kh} )} ]$, for $k = 0,\; 1, \ldots ,\; p - 1$.
Next, we derive an adjoint system simulation corresponding to Eq. (6). Using these 2 simulations (the original and adjoint simulations), the full gradient of a general objective function with respect to all DBP design parameters are estimated.
3. ASA for the DBP
Our objective is to estimate the sensitivities of the following objective function [23]:
where the integral kernel $\psi $ is a predefined function, ${\boldsymbol x} \in {\mathrm{\mathbb{R}}^N}$ is the vector of design parameters, ${\boldsymbol V}$ is the system state vector, and ${T_m}$ is half the computational window width. For example, F may represent the mean-square difference between the output of the DBP ${v_{out}}(t )$ and the transmitter output ${u_{tx}}(t )$, i.e. which we would wish to minimize. The analytic derivative of Eq. (10) with respect to the $l$th parameter ${x_l}$, $l = 1,\; 2,\; \ldots ,\; N$, is given by:Figure 1 depicts the adjoint DBP simulation (17) as opposed to the original DBP simulation (6). As shown, each adjoint fiber span (AFS) has the same absolute values of the corresponding dispersion coefficients (${\beta _{3,j}}$ and ${\beta _{2,j}}$), but with reversed signs, as compared to the original DBP simulation. Since the dispersion coefficients used in the original DBP problem (6) have their signs opposite to those of the actual physical fiber spans, the dispersion coefficients used in the adjoint DBP simulation are identical to the corresponding parameters of the physical fiber spans. The gain coefficients ($- {\alpha _j}$) of the adjoint problem are identical to the corresponding gain parameters in the original DBP simulation. All loss elements $({{g_j}} )$ also remain the same in the adjoint simulation. The input of the adjoint problem ${\lambda _{in}}$ is obtained by differentiating the kernel of the objective function integral $\psi $ with respect to the original DBP field v. The nonlinear behavior of the adjoint DBP problem is completely different from the original DBP nonlinear effect. Moreover, the nonlinear coefficient of each AFS depends on the value of the corresponding original field v at the original DBP simulation, rather than the adjoint field $\lambda $, as shown in Fig. 1.
Once the original DBP response ${\boldsymbol V}$ and its corresponding adjoint response ${\boldsymbol \lambda }$ are evaluated for every time step in the entire computational domain, through solving Eqs. (6) and (17), respectively, the adjoint sensitivities of the objective function F are evaluated through the following formula, obtained by using Eqs. (16) and (15) in Eq. (12):
Note that all these derivation steps are valid and hold for the case of dual-polarized system. The derived adjoint problem would also have a similar form to Eq. (17) except for the fact that all system matrices would consist of $4$ block matrices rather than $2$ block matrices. The main difference would only be on the structure of the adjoint nonlinear matrix ${{\boldsymbol \varGamma }^\mathrm{\lambda }}({\boldsymbol V} )$ whose new form would become as follows:
3.1 Adjoint DBP problem solution
In the operator notation, the adjoint DBP problem (17) may be rewritten as:
3.2 Algorithm
First, the output of the fiber-optic link is passed through a DBP block and the adjoint excitation ${\hat{{\boldsymbol \lambda }}_{in}}$ is calculated (See Fig. 1). The algorithm of the introduced M-SSFS utilized for solving the adjoint DBP problem (17) is then summarized as follows:
Step 0: The forward distance parameter is set to: $z = 0$, the current span index is set to: $j = 1$, and the initial adjoint field ${\lambda ^l}$ is calculated, according to the desired objective function as: ${\lambda ^l} = {\lambda _{in}} = \frac{{\partial \psi }}{{\partial {v_{re}}}} + i\frac{{\partial \psi }}{{\partial {v_{im}}}}$, where ${v_{re}}$ and ${v_{im}}$ are the real and imaginary parts of the original DBP problem output $.$ Notice that, in practice, the objective function F is defined in terms of the DBP output signal ${{\boldsymbol V}_{out}}$. We therefore assume that the adjoint excitation ${\boldsymbol Q}_{in}^\lambda (t )= \partial \psi /\partial {\boldsymbol V}$ is nonzero only at the input of the adjoint DBP problem, i.e., at $z = 0$ or $Z = {L_{tot}}$.
Step 1: At the beginning, the loss step is performed as follows:
Step 2: The parameters of the current adjoint DBP and amplifier span are updated, and the step-size h utilized for the current span is determined. A first half-linear step is then performed as follows:Step 3: We then apply a modified nonlinear step that includes the following operations:
Step 4: After applying the nonlinear step, the distance parameter is updated as: $z = z + h$. If z is still less than the length of the current span, a full linear-step is performed as follows:
3.3 Complexity analysis
The computational complexity of each adjoint nonlinear step in the M-SSFS algorithm is larger than that in the conventional SSFS algorithm due to the extra step required to diagonalize the adjoint nonlinear matrix ${{\boldsymbol \varGamma }^\lambda }({\boldsymbol V} )$. The adjoint linear step though has the same complexity of the linear step in the conventional SSFS algorithm.
To analyze the complexity of the M-SSFS algorithm, we consider a block size of ${N_s}$ samples. One step of the adjoint linear step, Eq. (36) or Eq. (38), requires the evaluation of two ${N_s} - $ point complex fast Fourier transforms (FFTs) and ${N_s}$ complex multiplications. Since each complex FFT operation costs $0.5{N_s}{log _2}({{N_s}} )$ complex multiplications, the complexity of one linear step is approximately $[{{N_s} + {N_s}{{log }_2}({{N_s}} )} ]$ complex multiplications [33–35]. Note that one complex multiplication involves 4-real multiplications [36]. Therefore, the total number of real multiplications per one adjoint linear step is $4{N_s}[{1 + {{log }_2}({{N_s}} )} ]$.
As for the first adjoint nonlinear sub-step, evaluating the components of the ${{\boldsymbol P}^{ - 1}}$ matrix costs $9{N_s} - $ real multiplications, and multiplying by the vector ${\hat{{\boldsymbol \lambda }}^l}$ requires $4{N_s} - $ real multiplication. Notice that the squared magnitude function evaluation requires two real multiplications, and the complexity of a real division operation is the same as that of a real multiplication [36]. To calculate the components of the ${{\boldsymbol D}_\mu }$ matrix, extra $2{N_s} - $ real multiplications are needed. Multiplying ${{\boldsymbol D}_\mu }$ by the current step h needs $2{N_s}$ extra real multiplications. The exponential function $\exp ({{{\boldsymbol D}_\mu }h} )$ is then implemented with a lookup table, while the multiplication in $\exp ({{{\boldsymbol D}_\mu }h} )\times \hat{{\boldsymbol W}}$ requires $2{N_s}$ more complex multiplications or $8{N_s}$ more real multiplications. Thus, the total cost of the second adjoint nonlinear sub-step is $12{N_s} - $ real multiplications. Evaluating the ${\boldsymbol P}$ matrix components in the third sub-step requires extra $5{N_s} - $ real multiplications, and the multiplication of ${\boldsymbol P} \times \hat{{\boldsymbol W}}$ needs extra $12{N_s} - $ real multiplications ($4{N_s} - $ real multiplications $+ $ $2{N_s} - $ complex multiplications). Therefore, each adjoint nonlinear step (37) in the M-SSFS algorithm requires a total of $42{N_s} - $ real multiplications.
Suppose that the entire $M $-span adjoint DBP simulation is performed using ${K_{NLS}}$ nonlinear steps. The total number of linear steps needed will thus be ${K_{NLS}} + M$, where the step length used in the first and last linear steps of each span is a half-step, i.e., $h/2$. However, all intermediate linear steps within a span use full length step h. Also, the total number of real multiplications required by the amplifier step (35) is $2{N_s}M$, where each amplifying step costs $2{N_s} - $ real multiplications. In total, the number of real multiplications for the symmetric M-SSFS algorithm is:
where M, ${N_s}$, ${K_{NLS}}$ are the number of spans, the number of samples, and the number of performed nonlinear steps, respectively. The computational complexity of the M-SSFS algorithm as compared to the complexity of the conventional SSFS algorithm are summarized in Table 1. As can be seen, an overhead of $32{N_s} - $ real multiplications per step are required by the M-SSFS algorithm, as opposed to the conventional SSFS algorithm, which is due to the extra diagonalization step needed within each adjoint nonlinear step.To better compare between the computational cost of the conventional SSFS method and the M-SSFS method cost, we define an M-SSFS overhead parameter $\tau $ as the ratio of the modified SSFS cost over the conventional SSFS cost. Taking into account that the number of performed nonlinear steps ${K_{NLS}}$ is usually much larger than the number of spans M in practical simulations, it can be shown that the M-SSFS overhead parameter is given as:
Next, we introduce the adjoint-based optimization algorithm proposed for accelerating the optimization of the adaptive digital back propagation parameters.4. Adjoint-based optimization algorithm
In order to train and optimize the design parameters of an adaptive digital back propagation (A-DBP), a constrained nonlinear optimization problem has to be solved. The general formulation of this training problem is given as:
The first order optimality condition of problem (41) is obtained utilizing the Lagrangian function [37]. Since the Lagrangian function is applied on equality constrains, we introduce an active set $\mathrm{{\cal A}}({\boldsymbol x} )$ that contains all indices of the inequality constraints ${c_j}({\boldsymbol x} )$ that are equal to $0$ at the current design vector ${\boldsymbol x}$, defined as [37]: $\mathrm{{\cal A}}({\boldsymbol x} )= l\; :{c_l}({\boldsymbol x} )= 0$. The Lagrangian function corresponding to problem (41) is thus formulated as follows:
SQP is one of the most effective methods for the numerical solution of nonlinear constrained optimization problems. Its basic idea is to model the objective function by its local quadratic function approximation at the current design point ${{\boldsymbol x}_k}$, and to linearize the most violated constraint(s), i.e., active constraints, at point ${{\boldsymbol x}_k}$, in order to construct a quadratic programing (QP) subproblem. The QP subproblem is then solved, using Newton or quasi-Newton method, to obtain a new solution iterate ${{\boldsymbol x}_{k + 1}}$. The procedure is repeated iteratively until converging to a local optimum solution ${{\boldsymbol x}^\ast }$ that minimizes the objective function and satisfies all problem constraints.
A framework of the SQP method is summarized as follows. Starting from a current iterate solution ${{\boldsymbol x}_k}$, the SQP algorithm determines the search direction of the next iterate by solving the following QP subproblem [38–40]:
4.1 ABO algorithm
A summary of the proposed ABO algorithm is as follows:
- 1. Define the training objective function $F({{\boldsymbol x},{\boldsymbol V}} )$, the initial solution ${{\boldsymbol x}_0}$, the initial Lagrangian multipliers ${{\boldsymbol \pi }_0}$, the stopping gradient criterion $\varepsilon $, and the problem constraints ${c_j}({\boldsymbol x} )$, $j = 1,\; 2,\; \ldots ,\; m$.
- 2. Set: $k = 0$, ${{\boldsymbol x}_k} = {{\boldsymbol x}_0}$, ${{\boldsymbol \pi }_k} = {{\boldsymbol \pi }_0}$, and ${{\boldsymbol B}_k} = {\boldsymbol I}$. Evaluate current constraints ${c_j}({{{\boldsymbol x}_k}} )$, $j = 1,\; 2,\; \ldots ,\; m$, determine current active constraints vector ${{\boldsymbol C}_k}$, update $\mathrm{{\cal A}}({{{\boldsymbol x}_k}} )$, and obtain current Jacobian matrix ${{\boldsymbol J}_k}$.
- 4. Check whether the current solution satisfies stopping criteria (i.e. satisfying all constraints and having a gradient norm value less than $\varepsilon $), or not. If satisfied, the optimal solution is assigned: ${{\boldsymbol x}^\ast } = {{\boldsymbol x}_k}$, and the algorithm terminates. Otherwise, continue to next step.
- 5. Solve the quasi-Newton-KKT system of equations (46) to obtain the next search direction and determine the new solution set (${{\boldsymbol x}_{k + 1}},\; {{\boldsymbol \pi }_{k + 1}}$), where ${{\boldsymbol x}_{k + 1}} = {{\boldsymbol x}_k} + {{\boldsymbol s}_k}$.
- 6. Evaluate next constraints ${c_j}({{{\boldsymbol x}_{k + 1}}} )$, $j = 1,\; 2,\; \ldots ,\; m$, determine next active constraints vector ${{\boldsymbol C}_{k + 1}}$, update $\mathrm{{\cal A}}({{{\boldsymbol x}_{k + 1}}} )$, and obtain next Jacobian matrix ${{\boldsymbol J}_{k + 1}}$. Determine the next objective function ${F_{k + 1}}$ and its gradient ${\nabla _{\boldsymbol x}}{F_{k + 1}}$, using the ASA approach.
- 7. Evaluate the next Hessian matrix approximation ${{\boldsymbol B}_{k + 1}}$, using the BFGS formula (47). Set next parameters as current, i.e., $\mathrm{{\cal A}}({{{\boldsymbol x}_k}} )= \mathrm{{\cal A}}({{{\boldsymbol x}_{k + 1}}} )$, ${{\boldsymbol C}_k} = {{\boldsymbol C}_{k + 1}}$, ${{\boldsymbol J}_k} = {{\boldsymbol J}_{k + 1}}$, ${F_k} = {F_{k + 1}}$, ${\nabla _{\boldsymbol x}}{F_k} = {\nabla _{\boldsymbol x}}{F_{k + 1}}$, ${{\boldsymbol B}_k} = {{\boldsymbol B}_{k + 1}}$, ${{\boldsymbol x}_k} = {{\boldsymbol x}_{k + 1}}$, and ${{\boldsymbol \pi }_{k + 1}} = {{\boldsymbol \pi }_k}$. Update the iteration index: $k = k + 1$.
Next, we investigate the robustness and efficiency of the proposed ABO algorithm for the training of the adaptive DBP, through a number of numerical examples.
5. Results
For all results presented in this section, we consider Monte-Carlo simulations of the single-channel single-polarization fiber-optic communication system, shown in Fig. 2. The system is operating at $28$ Gbaud, and the transmitted data have a root raised cosine pulse shaping with a roll-off factor of $0.1$. The modulation format used in the simulation is a $16$-quadrature amplitude modulation ($16$-QAM) format. The transmission channel is a multi-span optical fiber system. Each span consists of a standard single-mode fiber (SSMF) followed by an inline erbium doped-fiber amplifier (EDFA). The simulation parameters of the fiber are given in Table 2. In order to simulate the scenario of a network communication, we assume that the lengths of fiber spans are unknown at the receiver. Although the fiber parameters may change slightly from span to span due to temperature and environmental fluctuations, we assume that they are the same for each span (except the length). The length of each fiber span is different, with values varying between $50$ km and $125\; $ km. This is the typical range of the fiber span lengths in North America [3]. The EDFA at the end of each fiber span has a gain that fully compensates for the respective fiber loss, with a noise figure ${N_f} = 4.77$ dB. In our simulations, we particularly consider two cases: 4-span and 20-span optical fiber communication systems. Their actual fiber lengths are respectively given in km as: $L = {[{{L_1}\; {L_2}\; {L_3}\; {L_4}} ]^T} = {[{50\; \; \; 80\; \; \; 85\; \; \; 125} ]^T}$, and $L = [50\; 65\; 80\; 125\; 50\; 80\; 85\; 75\; 125\; 50\; 50\; 50\; 50\; 80\; 85\; 80\; 80\; 80$
$80\; 80{]^T}$. The forward propagation in these scenarios are simulated using the split-step Fourier scheme (SSFS) algorithm [32], with an adaptive step-size guaranteeing that the maximum nonlinear phase per step does not exceed $0.01$ radian [42].
A Gaussian band pass filter (BPF) with $50$ GHz bandwidth is used before the coherent receiver. The front-end coherent receiver translates the in-phase and quadrature-phase components of the received optical signal into electrical signal components. An analog to digital converter (ADC) is utilized before the digital signal processing (DSP) unit. The sampling rate for the fiber-optic system simulation is $8$ samples per symbol whereas after the ADC, it is reduced to $2$ samples per symbol. The chromatic dispersion compensation (CDC) block is used to estimate the total length of the transmission fiber link ${L_{tot}}$. It is only applied in the training mode (see Fig. 2), i.e., during the training process of the adaptive digital back propagation (A-DBP). The CDC can be performed either in time domain or frequency domain [3]. In this paper, we used the frequency domain approach using the fast Fourier transforms (FFTs) with $({\beta _2} \times {L_{tot}}$) as an unknown parameter to be optimized. The optimum $({\beta _2} \times {L_{tot}}$) is found by minimizing the BER. Once the value of the total accumulated dispersion (${\beta _2} \times {L_{tot}}$) introduced by the channel is estimated, we divide it by the dispersion coefficient ${\beta _2}$ in order to obtain an estimation of the total transmitted distance ${L_{tot}}$. After the training, the A-DBP switches to the transmission mode where it blindly compensates for the linear and nonlinear distortions induced by the fiber channel. The output of the A-DBP passes through a matched filter (i.e. a root raised cosine filter) to limit the noise. The bit error rate (BER) is calculated by comparing the symbol sequence after the filter with that at the transmitter and symbols that cross the boundaries are counted as error symbols.
Unless otherwise specified, the A-DBP training problem is defined as:
5.1 4-span A-DBP
We first investigate the accuracy and efficiency of the proposed ABO algorithm in training and optimizing the parameters of the A-DBP. We study the ability of the ABO to converge to the ideal DBP solution (ideal DBP refers to the case in which the receiver knows exactly the values of fiber-optic system parameters). We also compare our ABO algorithm to other finite-difference-based optimization algorithms in terms of convergence rate and computational complexity.
The ABO algorithm is first applied to train a 4-span A-DBP system for mitigating the fiber impairments of the 4-span fiber communication scenario. The virtual fibers of the A-DBP have the same transmission fibers’ parameters given in Table 2. However, their lengths are assumed to be unknown to the receiver. The number of known symbols used to train the A-DBP is $4096$. The average launch power at the training process is ${P_{av}} = 7$ dBm. The total length of the channel (estimated using the CDC unit which provides the best performance when the accumulated dispersion of CDC is $-{\beta _2} \times {L_{tot}}$) is ${L_{tot}} = 340$ km. Initially, the 4 spans of the A-DBP are assumed to have an equal length of ${L_{tot}}/4$. Clearly, this initial point is a feasible design point, i.e., it satisfies all the constraints of the training problem (48).
Figure 3 illustrates the evolution of the objective function in Eq. (48) with number of optimization iterates. The ABO algorithm terminates after $6$ iterates with the optimal solution given in Table 3. These values are quite close to the actual fiber span lengths. The Euclidian norm of the objective function gradient at this point is $4.1 \times {10^{ - 16}}$. As can be seen, the ABO algorithm is capable of training the A-DBP, and successfully converges to the physical fiber lengths. Note that the launch power during the training should be chosen such that the nonlinear effects are not negligible. In other words, the training launch power should be reasonably high, i.e., the range of powers for which the nonlinear effects are the dominant source of distortions. For example, if we choose a different launch power within the fiber nonlinearity dominant region, say $6$ or $9\,$ dBm, it would have a minimum impact on the estimated fiber lengths. However, if a low launch training power is chosen, say ${P_{av}} < 5\,$ dBm, the ABO algorithm will not converge to the correct lengths of the physical fiber. Rather, it will terminate at any local minima that satisfies the LC constraint, i.e., any solution that equalizes the total accumulated dispersion.
Once the training of the A-DBP is completed, we have an estimate of fiber lengths (column 3 of Table 3), which is used for the A-DBP. This scheme is referred to as ‘trained A-DBP’. We then transmit ${2^{17}}$ unknown data symbols through the 4-span communication system, and the trained A-DBP is switched into the transmission mode. If the exact parameters of the fiber channel are known at the receiver, the INLSE (Eq. (1)) can be solved using the SSFS approach. This scheme is referred to as ‘ideal DBP’. For both trained A-DBP and ideal DBP, we use the adaptive step-size that guarantees that the maximum nonlinear phase per step does not exceed $0.01$ radian. Figure 4 shows the bit error rate (BER) in the case of the initial A-DBP (i.e. equal virtual span lengths) and the trained A-DBP, as compared to the ideal DBP case, as a function of the average launch power. As compared to the initial A-DBP, the trained A-DBP effectively mitigates the distortions at the higher launch power ranges, achieving the same BER performance as the ideal DBP. Using the trained A-DBP, the BER is found to be zero for average fiber launch powers larger than $- 4$ dBm and less than $14$ dBm. For low launch power (${P_{av}} \le - 8$ dBm), the fiber nonlinear distortion is negligible, and the distortions are mainly due to fiber CD effects and amplifier noise [2,3]. Therefore, the initial A-DBP can provide a BER performance comparable to the trained A-DBP (as well as the ideal DBP) at this region. As the power slightly increases, the signal to noise ratio increases resulting in a BER reduction. However, at higher power levels (${P_{av}} \ge \; \; 2$ dBm), the fiber nonlinearity becomes the dominant source of distortions and the initial A-DBP cannot mitigate the impairments anymore. Contrarily, the trained A-DBP can still provide a BER performance benefit up to average launch power of $14$ dBm. For very high launch power (${P_{av}} > 14$ dBm), both the trained A-DBP and the ideal DBP fails to mitigate the distortions due to the severe signal-noise nonlinear interactions [43,44].
In order to investigate the computational efficiency of the introduced ABO, the same gradient-based optimization algorithm is utilized to train the initial A-DBP. However, at this time the required sensitivity information is obtained using the central-finite difference (CFD), the forward-finite difference (FFD), or the backward-finite difference (BFD) approximations. These optimizers are respectively denoted as SQP-CFD, SQP-FFD, and SQP-BFD. The three algorithms (SQP-CFD, SQP-FFD, and SQP-BFD) terminate after $6 - $ iterations with an objective function gradient norm of $1.19 \times {10^{ - 16}}$, $2.9 \times {10^{ - 14}}$, and $2.6 \times {10^{ - 14}}$, respectively. The optimal design points obtained are given in Table 3. Owing to the proper definition of the training problem (48), all three optimizers converge to the ideal DBP solution as well as the ABO does. However, the solutions of the ABO and SQP-CFD algorithms are slightly better (closer to the ideal solution) than the SQP-FFD and SQP-BFD solutions. This is because the derivatives estimations of both the adjoint and CFD approaches are more accurate than the FFD and BFD estimations [23]. Figure 3 shows the objective function evolution in case of using the three finite-differences-based optimizers as compared to that of the ABO algorithm. Obviously, there are slight differences between routes that each algorithm takes towards the optimal solution. This is due to the slight differences between the gradient estimations of each sensitivity technique.
Table 4 compares the four optimizers (ABO, SQP-CFD, SQP-FFD, and SQP-BFD) in terms of the total number of SSFS simulations performed during the training process of the initial A-DBP. The total number of symbols used during the training process is $4096$, i.e., a total of $4096 \times 2$ training samples is used. Substituting in Eq. (40), the modified SSFS overhead parameter for these settings is $\tau \approx 1.48$. In other words, the complexity of one adjoint DBP simulation is approximately equivalent to $1.48 \times $ complexity of single original DBP simulation. At each ABO algorithm iterate, we need to perform $1 - $ original and $1 - $ adjoint DBP simulations, to evaluate current objective function value and its gradient vector. This is equivalent to performing $2.48$ conventional SSFS simulation. Contrarily, each SQP-CFD iterate requires $({2N + 1} )- $ original DBP simulations to determine current objective function value and the gradient vector, where N is the number of design parameters. Each of the SQP-FFD and SQP-BFD needs to run $({N + 1} )- $ original DBP simulations every optimization iterate, to obtain the current objective function value and a less accurate (as compared to the ABO and SQP-CFD algorithms) gradient vector estimation. As shown in Table 4, the proposed ABO algorithm saves around $39 - $ system simulations, as opposed to the SQP-CFD algorithm for the same accuracy. As compared to SQP-FFD/BFD, the ABO saves around $15 - $ system simulations; besides, the accuracy of SQP-FFD/BFD to estimate the fiber lengths is not so good (See Table 3).
In order to better understand the gain achieved by the ABO algorithm in terms of the computational cost, we define a simulation saving factor $\eta $ as the ratio of the total number of conventional SSFS simulations required by a certain optimizer to train an A-DBP over the number of equivalent conventional SSFS simulations needed by the ABO to train the same A-DBP. For this example, the value of $\eta $ is calculated to be $3.63$ and $2.02$ for the SQP-CFD and SQP-FFD/BFD, respectively, when the number of design parameters is $N = \; 4$. In other words, our ABO algorithm is $3.63$ times faster than the SQP-CFD in training the 4-span A-DBP.
5.2 20-span A-DBP
In order to further demonstrate the efficiency of the ABO algorithm, we train a 20-span A-DBP system to blindly compensate for the impairments of the 20-span optical fiber communication scenario. The average launch power during training is ${P_{av}} = 6$ dBm, and the number of training symbols is $4096$. The total length of the channel (estimated using the CDC) is ${L_{tot}} = 1500$ km. The length of each A-DBP’s span is initially set to $75$ km. The ABO algorithm requires $22$ iterates until successfully converging to the actual fiber span lengths, terminating at gradient norm of $1.05 \times {10^{ - 16}}$.
Figure 5 shows the BER performance of the 20-span transmission system versus a sweep of the average launch power. The BER obtained using the ideal DBP at the receiver is compared to the BER achieved in case of utilizing the initial A-DBP or the trained A-DBP. The number of actual transmitted symbols is ${2^{17}}$. As compared to the 4-span system, the overall BER performance significantly deteriorates because of the longer transmission distance that causes much higher linear and nonlinear signal distortions. The initial A-DBP cannot provide BER less than ${10^{ - 2}}$ for any average launch power value. However, the trained A-DBP achieves the same BER performance obtained using the ideal DBP. The BER is found to be zero using the trained A-DBP when the average launch power is larger than $2$ dBm and less than $10$ dBm.
To investigate the computational cost of the training, we again re-train the initial A-DBP using the SQP-CFD, SQP-FFD, and SQP-BFD optimizers. The three algorithms terminate after $22 - $ iterations with gradient norms of $5.81 \times {10^{ - 16}}$, $1.09 \times {10^{ - 14}}$, and $2.02 \times {10^{ - 14}}$, respectively. Similar to the 4-span results, the solutions of the ABO and SQP-CFD algorithms are slightly more accurate than the SQP-FFD and SQP-BFD solutions. However, as shown in Table 4, the proposed ABO algorithm needs only around 55 conventional SSFS simulations to train the A-DBP, as opposed to $902$ and $462$ simulations required by the SQP-CFD and the SQP-FFD/BFD algorithms. For this example, the value of the simulation saving factor $\eta $ is calculated to be $16.5$ and $8.47$ for the SQP-CFD and SQP-FFD/BFD, respectively. In other words, the ABO algorithm is $16.5$ times faster than the SQP-CFD, and $8.47$ times faster than the SQP-FFD/BFD, in training the 20-span A-DBP, achieving the same high accuracy obtained using the SQP-CFD algorithm. Clearly, the time saving gain achieved by the ABO algorithm increases as the number of design parameters increases. This is because the number of extra simulations required by the finite-differences-based optimizers is linearly proportional to the number of design parameters. The ABO algorithm, on the other hand, always requires one extra simulation of the adjoint system (Eq. (17)) per iterate to evaluate the full gradient information regardless of the number of parameters.
5.3 Optimum low-complexity A-DBP
Typically, the computational cost of the DBP is expensive. It scales directly with the number of propagation steps and hence, DBP approaches with step size larger than the physical fiber span lengths have drawn significant attention [11,12,19,21,22]. In this subsection, we therefore investigate the reduction of the A-DBP computational cost at the expense of a little bit lower equalization performance. For the ideal DBP, we will replace the small step-size used in the SSFS simulation with a relatively larger step. Since the SSFS simulation has an error of order $O({{h^3}} )$, a coarse-mesh ideal DBP simulation, with relatively larger step-size, will not be identical to the inverse response of the fiber channel. As a result, the equalization performance with a coarse-mesh ideal DBP model degrades as opposed to the fine-mesh ideal DBP model. For the 4-span fiber-optic communication scenario, Fig. 6(a) shows the BER performance of the coarse-mesh ideal DBP as compared to the fine-mesh ideal DBP model. The adaptive step-size used in the coarse-mesh model is chosen such that the maximum nonlinear phase change within each step is $0.1$ radian, as opposed to $0.01$ radian in the fine-mesh case. As shown in Fig. 6(a), the BER performance of the coarse-mesh ideal DBP deteriorates significantly for average launch power above $0$ dBm. However, as depicted in Fig. 6(b), the coarse-mesh ideal DBP requires only a total of 4-SSFS steps at ${P_{av}} = 0$ dBm, as compared to $53 - $ SSFS steps required by the fine-mesh ideal DBP model at the same launch power.
In order to enhance the performance of the coarse-mesh ideal DBP model with a computational cost within the same range, we apply our ABO algorithm to train coarse-mesh A-DBP models with $2 - $, $3 - $, and 4-virtual spans. The fiber-optic system has 4 spans with parameters as given in Tables 2 and 3. We thus wish to compensate for the dispersion and nonlinear impairments of the 4-span fiber-optic system with M virtual spans, where $M\; < = \; 4$. We consider here the length and nonlinear coefficient of each virtual fiber span as design parameters. In other words, the design vector ${\boldsymbol x} = {[{{L_1}\; {L_2}\; \cdots {L_M}\; \; {\gamma_1}\; {\gamma_2}\; \cdots {\gamma_M}} ]^T}$ contains the lengths and nonlinear coefficients of all fiber spans. However, the proposed ABO algorithm is general and could be utilized to estimate all other A-DBP parameters, e.g., average launch power ${P_{av}}$, dispersion coefficients (${\beta _{3,j}}\; $ and ${\beta _{2,j}}$), loss coefficients ${\alpha _j}$, and amplifiers’ gains ${g_j}$. The A-DBP training problem is modified as follows:
Figure 7(a) shows the Q-factor performance of the 4-span fiber-optic communication system as a function of fiber launch power for various cases of DBP equalization schemes. The Q-factor in $\textrm{dB}$ is calculated using [3]:
where $erf{c^{ - 1}}({\cdot} )$ stands for the inverse of the complementary error function, and the $BER$ is obtained using the Monte-Carlo simulations. The total number of SSFS steps required by each equalizer versus a sweep of the launch power is also compared in Fig. 7(b). We keep coarse-mesh ideal DBP as our reference for comparison. As can be seen, any $M$-span trained A-DBP with $M = 2$, $3$, and $4$, outperforms the coarse-mesh ideal DBP. From Figs. 7(a) and (b), we see that the Q-factor of the 2-span A-DBP is $1$ dB higher than that of the coarse-mesh ideal DBP and its computational cost is lower. The optimum launch power is $- 2$ dBm and at this power, the 2-span A-DBP requires $2$ SSFS steps while the coarse-mesh ideal DBP requires $4$ SSFS steps. Q-factor increments of $2.2\; \textrm{dB}$ and $2.7\; \textrm{dB}$ are achieved by the 3-span and 4-span A-DBPs, respectively. The 3-span and 4-span A-DBPs though require 2-extra SSFS steps as compared to the coarse-mesh ideal DBP. Hence, the 2-span A-DBP provides a better trade-off between performance and computational cost, i.e. $1$ dB performance advantage and $50\%$ reduction in computational cost as compared to our reference system. It is worth emphasizing that for practical BER of ${10^{ - 2}}$ or ${10^{ - 3}}$ levels, the performance advantage of the 2-span A-DBP is reduced. The computational cost required by the 2-span A-DBP is still, however, half the coarse-mesh ideal DBP cost.It should conceptually be clear that the ABO algorithm can be used to train a $5$ or $10 - $ (virtual) span A-DBP models for the 20-(real) span fiber-optic system. These models could then be utilized to significantly reduce the computational complexity required by a 20-span coarse-mesh ideal DBP model for mitigating the distortions of the 20-span optical fiber communication scenario, achieving a similar or even better compensation performance.
6. Conclusion
We propose a computationally efficient adaptive digital back propagation (A-DBP) method to blindly mitigate the linear and nonlinear impairments-induced in optical fiber communication systems and networks. An efficient adjoint-based optimization (ABO) algorithm is introduced for accelerating the parameters extraction of the A-DBP. The ABO algorithm utilizes the sequential quadratic programming (SQP) optimization technique for solving the A-DBP training problem. The ABO algorithm also exploits an introduced adjoint sensitivity analysis (ASA) approach to rapidly evaluate the required sensitivity information at each optimization iterate. The efficiency and robustness of the proposed A-DBP method are illustrated through 4-span and 20-span optical fiber communication examples. As compared to the A-DBP trained using finite-difference (FD)-based SQP algorithms, the ABO algorithm is shown to be $8.47$ times faster than the backward/forward FD-based optimizers, and $16.5$ times faster than the more accurate central FD-based optimizer in the 20-span example. This computational complexity gain further increases with the number of parameters. Moreover, the reduction of the A-DBP computational cost at the expense of slightly lower equalization performance is demonstrated. In the 4-span example, an optimized $2 - $ virtual span coarse-mesh A-DBP model is shown to provide the 1 dB performance advantage over the reference coarse-mesh ideal DBP system, while reducing the computational complexity of the reference system by $50\%$.
Funding
Natural Sciences and Engineering Research Council of Canada (RGPIN-2016-05451, RGPIN-2016-06579).
Disclosures
The authors declare no conflicts of interest.
Data availability
Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.
References
1. I. Kaminow, T. Li, and A. E. Willner, Optical Fiber Telecommunications Volume VIB: Systems and Networks (Academic Press, 2013).
2. G. P. Agrawal, Nonlinear Fiber Optics (Academic press, 2007).
3. S. Kumar and M. J. Deen, Fiber Optic Communications: Fundamentals and Applications (John Wiley & Sons, 2014).
4. M. Kuschnerov, F. N. Hauske, K. Piyawanno, B. Spinnler, M. S. Alfiad, A. Napoli, and B. Lankl, “Dsp for coherent single-carrier receivers,” J. Lightwave Technol. 27(16), 3614–3622 (2009). [CrossRef]
5. V. Curri, R. Gaudino, A. Napoli, and P. Poggiolini, “Electronic equalization for advanced modulation formats in dispersion-limited systems,” IEEE Photonics Technol. Lett. 16(11), 2556–2558 (2004). [CrossRef]
6. F. P. Guiomar and A. N. Pinto, “Simplified volterra series nonlinear equalizer for polarization-multiplexed coherent optical systems,” J. Lightwave Technol. 31(23), 3879–3891 (2013). [CrossRef]
7. M. M. T. Maghrabi, S. Kumar, and M. H. Bakr, “Dispersion compensation of fiber optic communication system with direct detection using artificial neural networks (anns),” Opt. Commun. 409, 109–116 (2018). [CrossRef]
8. R. J. Essiambre and P. J. Winzer, “Fibre nonlinearities in electronically pre-distorted transmission,” in Proc. 31st Eur. Conf. Opt. Commun., pp. 191–192 (2005).
9. E. Ip and J. M. Kahn, “Compensation of dispersion and nonlinear impairments using digital backpropagation,” J. Lightwave Technol. 26(20), 3416–3425 (2008). [CrossRef]
10. X. Li, X. Chen, G. Goldfarb, E. Mateo, I. Kim, F. Yaman, and G. Li, “Electronic post-compensation of wdm transmission impairments using coherent detection and digital signal processing,” Opt. Express 16(2), 880–888 (2008). [CrossRef]
11. L. B. Du and A and J. Lowery, “Improved single channel backpropagation for intra-channel fiber nonlinearity compensation in long-haul optical communication systems,” Opt. Express 18(16), 17075–17088 (2010). [CrossRef]
12. D. S. Millar, S. Makovejs, C. Behrens, S. Hellerbrand, R. I. Killey, P. Bayvel, and S. J. Savory, “Mitigation of fiber nonlinearity using a digital coherent receiver,” IEEE J. Sel. Top. Quantum Electron. 16(5), 1217–1226 (2010). [CrossRef]
13. D. Rafique and A. D. Ellis, “Impact of signal-ASE four-wave mixing on the effectiveness of digital backpropagation in 112 Gb/s PM-QPSK systems,” Opt. Express 19(4), 3449–3454 (2011). [CrossRef]
14. J. Shao, S. Kumar, and X. Liang, “Digital back propagation with optimal step size for polarization multiplexed transmission,” IEEE Photonics Technol. Lett. 25(23), 2327–2330 (2013). [CrossRef]
15. D. Sharma and S. Kumar, “An overview of elastic optical networks and its enabling technologies,” Int. J. Eng. Technol. 9(3), 1643–1649 (2017). [CrossRef]
16. E. P. da Silva, R. Asif, K. J. Larsen, and D. Zibar, “Nonlinear compensation with modified adaptive digital backpropagation in flexigrid networks,” in CLEO: Science and Innovations, pp. SM2M–5 (2015).
17. T. Tanimura, T. Hoshida, T. Tanaka, L. Li, S. Oda, H. Nakashima, Z. Tao, and J. C. Rasmussen, “Semi-blind nonlinear equalization in coherent multi-span transmission system with inhomogeneous span parameters,” in Optical Fiber Communication Conference (OFC), p. OMR6 (2010).
18. C. Lin, A. Napoli, B. Spinnler, V. Sleiffer, D. Rafique, M. Kuschnerov, M. Bohn, and B. Schmauss, “Adaptive digital back-propagation for optical communication systems,” in Optical Fiber Communication Conference (OFC), p. M3C-4 (2014).
19. L. Jiang, L. Yan, Z. Chen, A. Yi, Y. Pan, W. Pan, and B. Luo, “Low-complexity and adaptive nonlinearity estimation module based on godard’s error,” IEEE Photonics J. 8(1), 1–8 (2016). [CrossRef]
20. X. Liang and S. Kumar, “Multi-stage perturbation theory for compensating intra-channel nonlinear impairments in fiber-optic links,” Opt. Express 22(24), 29733–29745 (2014). [CrossRef]
21. F. Zhang, Q. Zhuge, M. Qiu, X. Zhou, M. Y. S. Sowailem, T. M. Hoang, M. Xiang, and D. V. Plant, “Blind adaptive digital backpropagation for fiber nonlinearity compensation,” J. Lightwave Technol. 36(9), 1746–1756 (2018). [CrossRef]
22. J. Zhou, Y. Wang, and Y. Zhang, “Blind backpropagation method for fiber nonlinearity compensation with low computational complexity and high performance,” Opt. Express 28(8), 11424–11438 (2020). [CrossRef]
23. M. Bakr, A. Elsherbeni, and V. Demir, Adjoint Sensitivity Analysis of High Frequency Structures with MATLAB (The Institution of Engineering and Technology (IET), 2017).
24. M. H. Bakr, O. S. Ahmed, M. H. El Sherif, and T. Nomura, “Time domain adjoint sensitivity analysis of electromagnetic problems with nonlinear media,” Opt. Express 22(9), 10831–10843 (2014). [CrossRef]
25. N. K. Georgieva, S. Glavic, M. H. Bakr, and J. W. Bandler, “Feasible adjoint sensitivity technique for em design optimization,” IEEE Trans. Microwave Theory Techn. 50(12), 2751–2758 (2002). [CrossRef]
26. M. M. T. Maghrabi, M. H. Bakr, S. Kumar, A. Elsherbeni, and V. Demir, “Fdtd-based adjoint sensitivity analysis of high frequency nonlinear structures,” IEEE Trans. Antennas Propagat. 68(6), 4727–4737 (2020). [CrossRef]
27. G. Veronis, R. W. Dutton, and S. Fan, “Method for sensitivity analysis of photonic crystal devices,” Opt. Lett. 29(19), 2288–2290 (2004). [CrossRef]
28. M. R. Abdelhafez and M. A. Swillam, “Efficient sensitivity analysis approach based on finite element solutions of photonic structures,” Opt. Commun. 313, 430–435 (2014). [CrossRef]
29. E. Sayed, M. H. Bakr, B. Bilgin, and A. Emadi, “Adjoint sensitivity analysis of switched reluctance motors,” Electric Power Components and Systems 46(18), 1959–1968 (2018). [CrossRef]
30. M. M. T. Maghrabi, M. H. Bakr, and S. Kumar, “Adjoint sensitivity analysis approach for the nonlinear schrödinger equation,” Opt. Lett. 44(16), 3940–3943 (2019). [CrossRef]
31. M. M. T. Maghrabi, M. H. Bakr, and S. Kumar, “Nonlinear Schrödinger Equation-Based Adjoint Sensitivity Analysis,” in 2020 International Applied Computational Electromagnetics Society Symposium (ACES), pp. 1–2 (2020).
32. R. Glowinski, S. J. Osher, and W. Yin, Splitting Methods in Communication, Imaging, Science, and Engineering (Springer, 2017).
33. B. Spinnler, “Equalizer design and complexity for digital coherent receivers,” IEEE J. Sel. Top. Quantum Electron. 16(5), 1180–1192 (2010). [CrossRef]
34. Y. Gao, J. H. Ke, K. P. Zhong, J. C. Cartledge, and S. S. Yam, “Assessment of intrachannel nonlinear compensation for 112 gb/s dual- polarization 16qam systems,” J. Lightwave Technol. 30(24), 3902–3910 (2012). [CrossRef]
35. F. Zhang, Q. Zhuge, M. Qiu, W. Wang, M. Chagnon, and D. V. Plant, “Xpm model-based digital backpropagation for subcarrier-multiplexing systems,” J. Lightwave Technol. 33(24), 5140–5150 (2015). [CrossRef]
36. H. Alt and J. V. Leeuwen, “The complexity of basic complex operations,” Computing 27(3), 205–215 (1981). [CrossRef]
37. S. S. Rao, Engineering Optimization: Theory and Practice (John Wiley & Sons, 2019).
38. U. M. G. Palomares and O. L. Mangasarian, “Superlinearly convergent quasi-newton algorithms for nonlinearly constrained optimization problems,” Mathematical Programming 11(1), 1–13 (1976). [CrossRef]
39. S. Han, “Superlinearly convergent variable metric algorithms for general nonlinear programming problems,” Mathematical Programming 11(1), 263–282 (1976). [CrossRef]
40. J. Nocedal and S. Wright, Numerical Optimization (Springer Science & Business Media, 2006).
41. M. Bakr, Nonlinear optimization in electrical engineering with applications in MATLAB (The Institution of Engineering and Technology (IET), 2013).
42. J. Shao, X. Liang, and S. Kumar, “Comparison of split-step Fourier schemes for simulating fiber optic communication systems,” IEEE Photonics J. 6(4), 1–15 (2014). [CrossRef]
43. J. P. Gordon and L. F. Mollenauer, “Phase noise in photonic communications systems using linear amplifiers,” Opt. Lett. 15(23), 1351–1353 (1990). [CrossRef]
44. S. Kumar, “Analysis of nonlinear phase noise in coherent fiber-optic transmission systems,” J. Lightwave Technol. 27(21), 4722–4733 (2009). [CrossRef]