Adaptive multi-layer filters incorporated with Volterra filters for impairment compensation including transmitter and receiver nonlinearity

Manabu Arikawa; Manabu Arikawa; Kazunori Hayashi; Kazunori Hayashi

doi:10.1364/OE.435161

1. Introduction

Coherent detection and digital signal processing (DSP) in optical fiber communications have paved the way for the adaptation of advanced modulation formats such as higher-order quadrature amplitude modulation (QAM) and probabilistic constellation shaping [1–3]. In addition, DSP provides the possibility of compensating of various effects that occur in fiber transmission systems in the digital domain flexibly, including carrier recovery [4], accumulated chromatic dispersion (CD) [5], polarization demultiplexing while compensating for polarization mode dispersion (PMD) [6], and fiber Kerr nonlinearity [7]. Both linear and nonlinear impairments due to imperfections in optical and electrical components can also occur in a transmitter (Tx) and receiver (Rx). These impairments are becoming non-negligible, especially for signals with a high symbol rate where high frequency devices are used [8–11].

Compensation of the impairments that occur in a Tx and Rx has been investigated regarding linear [12–15] and nonlinear impairments [16–24]. Characteristics of these impairments depend on the components used in a Tx and Rx, which are usually unknown beforehand. Thus, an adaptive approach or learning is required to deal with Tx and Rx impairments. Linear impairments that occur in a Tx and Rx are mainly a timing skew between in-phase (I) and quadrature (Q) components, a gain imbalance between IQ components, and a phase deviation of IQ from $\pi /2$. Receiver-side adaptive filters can compensate for these linear impairments that occur in a Tx [12,15] and those occur in an Rx [13–15]. Nonlinear impairments are mainly caused by digital-to-analog converters (DACs), electronic driver amplifiers, and a Mach-Zehnder modulator in a Tx, as well as electronic trans-impedance amplifiers (TIAs) and analog-to-digital converters (ADCs) in an Rx.

To compensate for nonlinear impairments that occur mainly in a Tx, digital pre-distortion in the Tx side has been investigated on the basis of Volterra filters [16–19] and neural networks [21–23]. These pre-distortion approaches enable adaptive equalization by using a signal with a high signal-to-noise ratio (SNR) without the effect from other impairments that occur in a fiber transmission; however, they can only resolve Tx nonlinearity. Different approaches are required to compensate for nonlinear impairments that occur in an Rx. Moreover, other effects such as CD accumulate in a signal though fiber propagation. Nonlinear impairments are not mutually commutative with other effects, so compensating for fiber nonlinearity uses a split-step back propagation based on the nonlinear Schrödinger equation [7]. Therefore, to compensate for nonlinear impairments together with other impairments, the order in which all the relevant impairments occur should be considered unless one lumped adaptive nonlinear filter is used. However, a conventional DSP uses a block-wise compensation to effectively deal with various impairments that have different causes and models [5]. From this point of view, mutual non-commutativity of nonlinear impairments that occur in a Tx and Rx with other impairments has not been resolved in these previous approaches, preventing compensation of both Tx and Rx nonlinearity at the same time. Recently, combination of pre-distortion at the Tx side and adaptive nonlinear equalization at the Rx side has been reported [24], where a nonlinear equalizer for Rx nonlinearity compensation is positioned at the first of impairment compensation blocks and a nonlinear equalizer for Tx nonlinearity compensation is positioned at the last. Four real-valued nonlinear filters were used for both the Rx and Tx nonlinear equalizers and no impairment compensation blocks that have IQ cross terms were included in this DSP. It is reasonable to use four real-valued nonlinear filters since Tx and Rx nonlinearity usually affects IQ components independently in coherent optical transmission systems. Whereas, the absence of IQ cross terms prevents compensation of IQ phase deviation. If nonlinear equalizers that have IQ cross terms are used, this problem will be resolved, though this straightforward approach increases the number of parameters of nonlinear equalizers greatly, resulting in high computational complexity.

Linear processes are also not mutually commutative in general in the case of multi-input multi-output (MIMO). For example, CD can be represented by a convolution of a complex-valued input signal with a complex-valued response function. This complex-valued linear model is denoted as strictly-linear (SL) [14]. The Jones matrix of CD is diagonal with same elements, and thus CD is commutative with other SL processes such as PMD. Complex-valued linear models can be described as real-valued MIMO models with a restriction on the real-valued IQ basis representation. The response of CD is non-diagonal on the real-valued IQ basis representation and thus not commutative with linear processes that cannot be described as MIMO models with the restriction. These real-valued linear MIMO processes that can be described only without the restriction are equivalent to the models with a convolution of complex-valued signals and their complex-conjugate with complex-valued response functions. These linear processes are denoted as widely-linear (WL). IQ MIMO processes such as IQ skew are WL. Thus, CD and IQ skew are not mutually commutative. To compensate for all the relevant linear impairments in optical fiber communication systems including IQ skew, IQ imbalance, and IQ phase deviation that occur in both Tx and Rx at the receiver side, we have proposed an adaptive multi-layer (ML) filter architecture that considers the order in which the impairments occur [15]. The ML filter architecture unfolds an adaptive filter to ones being different in type and size to compensate for corresponding impairments. The coefficients of the ML filters are adaptively controlled by gradient calculation with back propagation, which is similar to the learning of neural networks and can be applied to any differentiable parameterized function [25,26], to minimize a loss that is composed of the last layer outputs.

In this study, we extended the adaptive ML filter architecture by incorporating nonlinear filters to compensate for both Tx and Rx nonlinearity when other impairments such as CD coexist. Volterra filters and neural networks are both nonlinear functions and back propagation can be applied to both of them. A deep neural network (DNN) slightly outperforms in compensating nonlinearity with memory effects when nonlinear compensation is performed after conventional linear impairment compensation [20]. From the view point of commutativity of impairments, this previous work is regarded as a nonlinearity compensation in a Tx. Although DNNs have the ability to approximate a complicated nonlinear function, random initialization of parameters is usually required before learning [27], resulting completely random outputs at an initial phase. Regarding impairment compensation in optical fiber communications, dominant sources to prevent demodulation are linear effects that occur in fiber propagation such as CD, though Tx and Rx nonlinearity cannot be ignored. Therefore, initializing nonlinear filters that compensate for Tx and Rx nonlinearity as a certain linear or even an identity function instead of a random function can help convergence at the beginning of adaptive control. In the case of the Volterra filter, which is easily initialized as a linear filter, an optimum nonlinear function can be smoothly and steadily obtained by adaptive control from the initial state. Here, we introduced Volterra filters into the adaptive ML filters. Considering the order in which all the relevant impairments occur, the ML filters consist of SL and WL filter layers to compensate for relevant linear impairments, and the two Volterra filter layers, each of which works as to compensate for nonlinearity that occurs in an Rx and Tx, respectively, are appropriately positioned in the ML filters. The coefficients including the Volterra filter layers were adaptively controlled by a gradient calculation with back propagation and stochastic gradient descent (SGD). In this ML filter architecture including Volterra filter layers, a Volterra filter itself compensates only for Rx or Tx nonlinearity and is not required to compensate for any other effects and their interaction with nonlinearity, which expands a temporal spread. Thus, the Volterra filter layers in the ML filters can be implemented with short memory taps, though the number of coefficients and computational complexity of a Volterra filter increase drastically with the increase in length of the memory taps [28]. We evaluated the performance of the adaptive ML filters including Volterra filter layers through simulations with a simple model and experiments where more realistic Tx and Rx nonlinearity was induced by tuning the output amplitude of electronic amplifiers. The adaptive ML filters were used in receiver-side signal processing for the transmission of a 23 Gbaud polarization-division-multiplexed (PDM) 64QAM signal over one span of a 100-km single-mode fiber (SMF). The results demonstrated that the proposed architecture could compensate for the nonlinearity that occurs in both Tx and Rx simultaneously and effectively under the accumulation of CD.

2. Theory

We first review the nonlinear impairments that occur in optical fiber communication systems with coherent detection. We then show the ML filter architecture including Volterra filter layers in an appropriate order and its adaptive control. The coefficients are updated by a gradient calculation with back propagation and SGD to minimize a loss function that is composed of the last layer outputs. Although an adaptive Volterra filter is well-known [29], adaptive control of the filter coefficients with its direct input and output is insufficient when incorporating it into the adaptive ML filters. We derive the back propagation of the Volterra filter layers in the ML filters, in other words, calculating the gradients of a loss in terms of filter coefficients and inputs, when gradients in terms of filter outputs are given, to update the coefficients in all the layers including the Volterra filter ones.

Figure 1 shows a schematic diagram of a conventional wavelength-division multiplexed (WDM) transmission system with coherent detection. The transmitted data are encoded and mapped to a certain modulation format. A DAC and electric driver amplifiers generate electric signals of four streams corresponding to the IQ components of two polarizations. A continuous-wave (CW) light source from a laser diode (LD) is modulated by a modulator driven with the electric signals. The modulated signal is multiplexed with other WDM signals and transmitted to a fiber. On this Tx side, some nonlinearity occurs in the DAC and driver amplifiers [30]. A Mach-Zehnder modulator also has nonlinear sinusoidal characteristics [31]. In a fiber propagation, CD and PMD accumulate in the signal. Fiber Kerr nonlinearity also occurs, though we ignore it here for simplicity. After fiber transmission, the signal is demultiplexed and received by a coherent receiver and sampled by an ADC. Demodulation and decoding are performed in the digital domain to recover the transmitted data. On this Rx side, TIAs in a coherent receiver and ADC induce nonlinearity.

Fig. 1. Schematic diagram of a WDM transmission system with coherent detection. Nonlinear impairments occur in Tx and Rx. ENC: encoder, DAC: digital-to-analog converter, MOD: modulator, LD: laser diode, SMF: single-mode fiber, EDFA: erbium-doped fiber amplifier, CRx: coherent receiver, ADC: analog-to-digital converter, DEM: demodulation, DEC: decoder.

Abstract

1. Introduction

2. Theory

3. Simulation

4. Experiment

5. Conclusion

Appendix A: Simulation with laser phase noise

Appendix B: Converged coefficients of Volterra filters in experiment

Appendix C: Performances of adaptive control of Volterra filters in different OSNR

Funding

Acknowledgments

Disclosures

Data availability

References

Data availability

Cited By

Figures (16)

Equations (24)

Optics Express