
Upper and lower bounds to the information rate transferred through the Pol-Mux channel


Abstract

Pol-Mux transmission is a well-established technique that enhances spectral efficiency by transmitting simultaneously over the horizontal and vertical polarizations of the electric field. However, cross-coupling between the two polarizations impairs transmission. Under the assumption that the cross-coupling matrix is a Markov process with free-running state, we propose upper and lower bounds to the information rate that can be transferred through the channel. Simulation results show that the two bounds are tight for values of the cross-coupling power of practical interest and modulation formats up to 16-QAM (quadrature amplitude modulation).

© 2018 Optical Society of America under the terms of the OSA Open Access Publishing Agreement

1. Introduction

Simultaneous transmission of modulated signals over the horizontal and vertical polarizations of the electric field is a well-established technique [1–3] that improves spectral efficiency by using the same frequency twice. In essence, this technique relies on the principle of MIMO (multiple-input multiple-output) systems, which became popular after the seminal paper [4]. To cancel the interference arising from non-ideal orthogonality between the horizontal and vertical polarizations, linear processing can be adopted [5], even though it is well known that non-linear techniques achieve better performance in the presence of interference and additive noise, see e.g. [6, 7].

Either implicitly or explicitly, most of the receivers studied in the literature assume that the MIMO channel matrix is static or quasi-static. However, the experimental results of [6] show that the coherence time of the channel is quite small, on the order of 10 to 30 symbol intervals for 112 Gb/s dual-polarization QPSK (quadrature phase shift keying), hence tracking the channel becomes an issue. Tracking techniques can be based on pilot symbols, as proposed, for instance, in [8]. However, independently of the channel tracking method, a low coherence time of the channel matrix, hence a fast time-varying channel, makes the channel estimate noisy (in practice, only a short time window spanning a few signal samples can be used for channel estimation at a given time instant), thus impacting the information rate that can be transmitted through the channel. This observation motivates the study of the information rate transferred through the Pol-Mux channel. The capacity of the fading MIMO channel is a classical topic in information theory, see e.g. [9], and, in that context, the information rate of channels with free-running state has also been studied [10]. In the context of optical transmission, the information rate is well studied for the phase noise channel, at least for the channel model with free-running state, see e.g. [11–13], but less has been done for the Pol-Mux channel, which can indeed be seen as a variant of the phase noise channel where

  • the modulus is not constant
  • the channel is MIMO.
Therefore, starting from the lower bound for the phase noise channel of [13], we adapt it here to the Pol-Mux channel and introduce a new upper bound based on the Kalman filter.

2. Channel model

Lowercase characters indicate possibly complex scalars and column vectors, while uppercase characters indicate matrices. The notation $a_k^{k+i}$ indicates the column vector (or matrix, when the elements are vectors) made by the chunk of sequence $(a_k, a_{k+1}, \cdots, a_{k+i})^T$, while $\{a_k\}$ indicates the semi-infinite sequence $(a_0, a_1, \cdots)$. The notation $I_m$ indicates the $m \times m$ identity matrix, and the superscript $H$ denotes Hermitian transposition. The output of the Pol-Mux channel at time $k$ is

$$y_k = M_k x_k + w_k, \qquad k = 1, 2, \ldots,$$
where $x_k$ is the $k$-th sample of the i.i.d. complex input modulation vector sequence, with zero mean vector and covariance matrix
$$E\{x_k x_k^H\} = I_2,$$
$M_k$ is the channel matrix, and $w_k$ is the $k$-th element of the i.i.d. complex Gaussian vector noise sequence with zero mean vector and covariance matrix
$$E\{w_k w_k^H\} = \sigma^2 I_2.$$
For small to moderate polarization crosstalk, the matrix $M_k$ can be modelled as [6]
$$M_k = \begin{pmatrix} 1 & \lambda_{1,k} \\ \lambda_{2,k} & 1 \end{pmatrix},$$
where
$$\lambda_k = (\lambda_{1,k}, \lambda_{2,k})^T$$
is the $k$-th element of a complex Gaussian random vector sequence, which is hereafter modelled as a free-running 1-causal ARMA (autoregressive moving average) process, hence
$$\lambda_k = \sum_{i=1}^{p} b_i v_{k-i} + \sum_{i=1}^{q} a_i \lambda_{k-i},$$
where $v_k$ is the $k$-th sample of a white Gaussian random vector sequence with zero mean and covariance matrix
$$E\{v_k v_k^H\} = \begin{pmatrix} 1 & \rho \\ \rho & 1 \end{pmatrix}.$$
In other words, $\{\lambda_k\}$ is the filtered version of $\{v_k\}$, where the filter is made of two shift registers, one for $\{v_{1,k}\}$ and the other for $\{v_{2,k}\}$, each with $m$ memories, with 1-causal feedback taps $a_1^q$ and 1-causal forward taps $b_1^p$, where
$$m = \max\{p, q\}.$$
Using the z-transform, one writes
$$\lambda(z) = v(z)\,\frac{b(z)}{1 - a(z)},$$
where
$$b(z) = \sum_{i=1}^{m} b_i z^{-i}, \qquad a(z) = \sum_{i=1}^{m} a_i z^{-i}.$$
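To make the generation of the cross-coupling process concrete, the following Python sketch (ours, not from the paper; the function name and interface are assumptions, and $\rho$ is taken real, as in the simulations of Section 4) drives the 1-causal ARMA recursion above with correlated white complex Gaussian noise and forms the channel output $y_k = M_k x_k + w_k$.

import numpy as np

rng = np.random.default_rng(0)

def run_polmux_channel(x, a, b, rho, sigma2):
    """Generate y_k = M_k x_k + w_k with ARMA cross-coupling (illustrative sketch).

    x      : (N, 2) complex array of transmitted symbol pairs
    a, b   : feedback taps a_1..a_m and forward taps b_1..b_m (zero-padded to m = max{p, q})
    rho    : correlation between the driving noises v_{1,k} and v_{2,k} (real here)
    sigma2 : additive-noise variance per polarization
    """
    N, m = x.shape[0], len(a)
    L = np.linalg.cholesky(np.array([[1.0, rho], [rho, 1.0]]))
    # unit-variance circular complex Gaussian driving noise with correlation rho
    v = ((rng.standard_normal((N, 2)) + 1j * rng.standard_normal((N, 2))) / np.sqrt(2)) @ L.T
    omega = np.zeros((N + m, 2), dtype=complex)   # shift-register content, zero initial state
    lam = np.zeros((N, 2), dtype=complex)
    y = np.zeros((N, 2), dtype=complex)
    for k in range(N):
        # omega_k = v_k + sum_i a_i * omega_{k-i}
        omega[k + m] = v[k] + sum(a[i] * omega[k + m - 1 - i] for i in range(m))
        # lambda_k = sum_i b_i * omega_{k-i}
        lam[k] = sum(b[i] * omega[k + m - 1 - i] for i in range(m))
        Mk = np.array([[1.0, lam[k, 0]], [lam[k, 1], 1.0]])
        wk = np.sqrt(sigma2 / 2) * (rng.standard_normal(2) + 1j * rng.standard_normal(2))
        y[k] = Mk @ x[k] + wk
    return y, lam

For the first-order model used in Section 4, the taps a = [z_p] and b = [1 - z_p] reproduce $\lambda(z) = v(z)(1 - z_p)z^{-1}/(1 - z_p z^{-1})$.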

To cast the model in the framework of linear dynamic systems we need to define the state of the system. To this aim, let us define the vector sequence

$$\omega_k = (\omega_{1,k}, \omega_{2,k})^T = v_k + \sum_{i=1}^{m} a_i \omega_{k-i}, \qquad k = 0, 1, \ldots,$$
hence $\omega_{k-m}^{k-1}$ is the content of the two shift registers at the $k$-th channel use. Note that $\lambda_k$ depends only on $\omega_{k-m}^{k-1}$ as
$$\lambda_k = \sum_{i=1}^{m} b_i \omega_{k-i}$$
and that, given $\omega_{k-m}^{k-1}$, the sequence $\lambda_k$ is independent of $\lambda_1^{k-1}$. Therefore one can take
$$s_k = \left(1, (\omega_{1,k-m}^{k-1})^T, 1, (\omega_{2,k-m}^{k-1})^T\right)^T$$
as the state of the linear dynamic system at time k, thus writing the measurement equation and the state transition equation as
$$y_k = H_k s_k + w_k,$$
$$s_{k+1} = F s_k + \left(0, v_{1,k}, (0_1^{m-1})^T, 0, v_{2,k}, (0_1^{m-1})^T\right)^T,$$
with
$$H_k = \begin{bmatrix} x_{1,k} & x_{2,k}\,(b_1^m)^T & 0 & (0_1^m)^T \\ 0 & (0_1^m)^T & x_{2,k} & x_{1,k}\,(b_1^m)^T \end{bmatrix},$$
where $0_1^m$ is a column vector of $m$ zeros, and the $2(m+1) \times 2(m+1)$ state transition matrix is
$$F \triangleq \begin{bmatrix} F_{m+1} & O_{m+1} \\ O_{m+1} & F_{m+1} \end{bmatrix},$$
where
$$F_{m+1} \triangleq \begin{bmatrix} 1 & (0_1^{m-1})^T & 0 \\ 0 & (a_1^{m-1})^T & a_m \\ 0_1^{m-1} & I_{m-1} & 0_1^{m-1} \end{bmatrix},$$
and $O_m$ is the all-zero square matrix of size $m \times m$. The state transition probability is
$$p(s_{k+1} \mid s_k) = g_c(F s_k, Q;\, s_{k+1}),$$
where $g_c(\mu, \Sigma_m; x)$ indicates an $m$-dimensional complex Gaussian probability density function over the complex vector space spanned by $x$, with mean vector $\mu$ and covariance matrix $\Sigma_m$, and $Q$ is the covariance matrix of the process noise $\left(0, v_{1,k}, (0_1^{m-1})^T, 0, v_{2,k}, (0_1^{m-1})^T\right)^T$, that is
$$Q \triangleq \begin{bmatrix} Q_1 & Q_\rho \\ Q_\rho & Q_1 \end{bmatrix},$$
with
$$Q_1 = \begin{bmatrix} 0 & 0 & (0_1^{m-1})^T \\ 0 & 1 & (0_1^{m-1})^T \\ 0_1^{m-1} & 0_1^{m-1} & O_{m-1} \end{bmatrix}, \qquad Q_\rho = \begin{bmatrix} 0 & 0 & (0_1^{m-1})^T \\ 0 & \rho & (0_1^{m-1})^T \\ 0_1^{m-1} & 0_1^{m-1} & O_{m-1} \end{bmatrix}.$$
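The block structure of these matrices can be assembled literally from the definitions above. The following Python sketch (our helper, with assumed names; $\rho$ is taken real) builds $F$, $Q$, and the map from a candidate symbol pair $x_k$ to $H_k$.

import numpy as np

def build_state_space(a, b, rho):
    """Build F, Q and the map x_k -> H_k from the definitions above (illustrative sketch).

    a, b : length-m arrays of feedback taps a_1..a_m and forward taps b_1..b_m
    rho  : correlation between the two process-noise components (real here)
    """
    m = len(a)
    # F_{m+1}: first row keeps the leading 1, second row applies the feedback taps,
    # the remaining rows shift the register content
    F1 = np.zeros((m + 1, m + 1))
    F1[0, 0] = 1.0
    F1[1, 1:] = a
    F1[2:, 1:m] = np.eye(m - 1)
    O = np.zeros((m + 1, m + 1))
    F = np.block([[F1, O], [O, F1]])

    # Q_1 and Q_rho have their only non-zero entry in position (2, 2)
    Q1 = np.zeros((m + 1, m + 1)); Q1[1, 1] = 1.0
    Qr = np.zeros((m + 1, m + 1)); Qr[1, 1] = rho
    Q = np.block([[Q1, Qr], [Qr, Q1]])

    def H(xk):
        """Measurement matrix H_k for the transmitted pair x_k = (x_{1,k}, x_{2,k})."""
        b_row = np.asarray(b, dtype=complex)
        zeros = np.zeros(m + 1, dtype=complex)
        row1 = np.concatenate(([xk[0]], xk[1] * b_row, zeros))
        row2 = np.concatenate((zeros, [xk[1]], xk[0] * b_row))
        return np.vstack((row1, row2))

    return F, Q, H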
The joint source and channel output probability, given the hidden state, is
$$p(y_k, x_k \mid s_k) = p(x_k \mid s_k)\, p(y_k \mid x_k, s_k) = p(x_k)\, p(y_k \mid x_k, s_k),$$
where
$$p(y_k \mid x_k, s_k) = g_c(H_k s_k, \sigma^2 I_2;\, y_k).$$

The conditional probability of channel output given the hidden state is

$$p(y_k \mid s_k) = \sum_{x_k \in \mathcal{X}_k} p(x_k)\, p(y_k \mid x_k, s_k).$$

3. Upper and lower bounds to the information rate by the Kalman filter

Let

$$I(x; y) = H(x) - H(x \mid y),$$
where, for conventional M-QAM (Multi-Level Quadrature Amplitude Modulation) and M-PSK (Multi-Level Phase Shift Keying)
$$H(x) = \log_2 M.$$
For the conditional entropy, by the chain rule one writes
$$H(x \mid y) = \lim_{N\to\infty} \frac{1}{N} \sum_{k=1}^{N} H(x_k \mid x_1^{k-1}, y_1^N),$$
which, by the Shannon-McMillan-Breiman theorem, can be evaluated as
$$H(x \mid y) = -\lim_{N\to\infty} \frac{1}{N} \sum_{k=1}^{N} \log_2 p(x_k \mid x_1^{k-1}, y_1^N).$$

Since conditioning does not increase entropy, we have the following upper and lower bounds to the conditional entropy

$$H(x \mid y) = \lim_{N\to\infty}\frac{1}{N}\sum_{k=1}^{N} H(x_k \mid x_1^{k-1}, y_1^N) \le \lim_{N\to\infty}\frac{1}{N}\sum_{k=1}^{N} H(x_k \mid x_1^{k-1}, y_1^k) = -\lim_{N\to\infty}\frac{1}{N}\sum_{k=1}^{N} \log_2 p(x_k \mid x_1^{k-1}, y_1^k),$$
$$H(x \mid y) = \lim_{N\to\infty}\frac{1}{N}\sum_{k=1}^{N} H(x_k \mid x_1^{k-1}, y_1^N) \ge \lim_{N\to\infty}\frac{1}{N}\sum_{k=1}^{N} H(x_k \mid x_1^{k-1}, x_{k+1}^N, y_1^N) = -\lim_{N\to\infty}\frac{1}{N}\sum_{k=1}^{N} \log_2 p(x_k \mid x_1^{k-1}, x_{k+1}^N, y_1^N),$$
which can be used in a straightforward way in the right-hand side of (22), together with (23), to get lower and upper bounds to the information rate.
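In code, this bookkeeping amounts to averaging the per-symbol log-probabilities and subtracting the resulting entropy estimates from $H(x)$. A minimal Python sketch (ours, with assumed argument names) is:

import numpy as np

def info_rate_bounds(Hx, log2p_causal, log2p_smoothed):
    """Assemble the bounds on I(x;y) from per-symbol probabilities (sketch).

    Hx             : source entropy H(x)
    log2p_causal   : samples of log2 p(x_k | x_1^{k-1}, y_1^k)
    log2p_smoothed : samples of log2 p(x_k | x_1^{k-1}, x_{k+1}^N, y_1^N)
    """
    H_cond_upper = -np.mean(log2p_causal)     # conditioning on less -> entropy upper bound
    H_cond_lower = -np.mean(log2p_smoothed)   # conditioning on more -> entropy lower bound
    return Hx - H_cond_upper, Hx - H_cond_lower   # (lower, upper) bounds on the information rate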

Let us consider the upper bound (26). The probabilities inside the logarithm can be evaluated by the Kalman filter as follows. The knowledge of the past transmitted symbols appearing in the conditioning is imported into the Kalman filter by including all the conditioning variables in the measurement, hence by updating the Kalman filter in data-aided mode. Let us write the channel output as

$$y_k = H_k s_k + w_k = h_k(s_k) + w_k.$$
The predicted measurement at time k is
$$\hat{y}_k = H_k \hat{s}_k,$$
where $\hat{s}_k$ denotes the state predicted by the Kalman filter at time $k$, that is, the expectation of the hidden state given the past measurements
$$\hat{s}_k = E\{s_k \mid y_1^{k-1}, x_1^{k-1}\}.$$
As the innovation process we take
$$u_k = y_k - \hat{y}_k = H_k(s_k - \hat{s}_k) + w_k.$$
Starting from an initial pair $(\hat{\Sigma}_1, \hat{s}_1)$, where
$$\hat{\Sigma}_k = E\{(s_k - \hat{s}_k)(s_k - \hat{s}_k)^H\},$$
for k = 1, 2, ⋯, the state prediction vector and the prediction error covariance matrix evolve as
$$\hat{s}_{k+1} = F(\hat{s}_k + K_k u_k),$$
$$\hat{\Sigma}_{k+1} = F \Sigma_k F^T + Q,$$
where
$$\Sigma_k = \left((\hat{\Sigma}_k)^{-1} + \sigma^{-2} H_k^H H_k\right)^{-1},$$
$$K_k = \sigma^{-2} \Sigma_k H_k^H.$$
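A minimal Python sketch of one step of this recursion follows (our helper names; the gain is written, consistently with the covariance update above, as $K_k = \sigma^{-2}\Sigma_k H_k^H$).

import numpy as np

def kalman_step(s_hat, Sigma_hat, Hk, yk, F, Q, sigma2):
    """One data-aided Kalman prediction/update step following the recursion above (sketch).

    s_hat, Sigma_hat : predicted state and prediction-error covariance at time k
    Hk               : measurement matrix built from the known symbol x_k
    yk               : received sample y_k
    Returns the predictions for time k+1 and the posterior covariance at time k.
    """
    # posterior covariance: ((Sigma_hat_k)^{-1} + sigma^{-2} H_k^H H_k)^{-1}
    Sigma = np.linalg.inv(np.linalg.inv(Sigma_hat) + (Hk.conj().T @ Hk) / sigma2)
    # gain and innovation
    K = Sigma @ Hk.conj().T / sigma2
    u = yk - Hk @ s_hat
    # predictions for time k+1
    s_hat_next = F @ (s_hat + K @ u)
    Sigma_hat_next = F @ Sigma @ F.T + Q
    return s_hat_next, Sigma_hat_next, Sigma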

The desired probability is evaluated as

$$p(x_k \mid x_1^{k-1}, y_1^k) = p(x_k \mid y_k, x_1^{k-1}, y_1^{k-1}) = \frac{p(x_k \mid x_1^{k-1}, y_1^{k-1})\, p(y_k \mid x_k, x_1^{k-1}, y_1^{k-1})}{\sum_{x_k \in \mathcal{X}_k} p(x_k \mid x_1^{k-1}, y_1^{k-1})\, p(y_k \mid x_k, x_1^{k-1}, y_1^{k-1})} = \frac{p(x_k)\, p(y_k \mid x_1^{k}, y_1^{k-1})}{\sum_{x_k \in \mathcal{X}_k} p(x_k)\, p(y_k \mid x_1^{k}, y_1^{k-1})},$$
where, using the predicted state and the prediction error covariance matrix computed by the Kalman filter, one has
$$p(y_k \mid x_1^{k}, y_1^{k-1}) = \int_S p(s_k, y_k \mid x_1^{k}, y_1^{k-1})\, ds_k = \int_S p(s_k \mid x_1^{k}, y_1^{k-1})\, p(y_k \mid s_k, x_1^{k}, y_1^{k-1})\, ds_k = \int_S p(s_k \mid x_1^{k-1}, y_1^{k-1})\, p(y_k \mid s_k, x_k)\, ds_k = \int_S g_c(\hat{s}_k, \hat{\Sigma}_k;\, s_k)\, g_c(H_k s_k, \sigma^2 I_2;\, y_k)\, ds_k = g_c\!\left(H_k \hat{s}_k,\; H_k \hat{\Sigma}_k H_k^H + \sigma^2 I_2;\; y_k\right).$$
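This evaluation can be coded directly. The following Python sketch (ours, with assumed helper names and a uniform prior over the constellation) computes the predictive densities $g_c(H_k\hat{s}_k, H_k\hat{\Sigma}_k H_k^H + \sigma^2 I_2; y_k)$ for every candidate symbol pair and normalizes them into the posterior.

import numpy as np

def gc(mu, Sigma, y):
    """Circularly symmetric complex Gaussian density g_c(mu, Sigma; y)."""
    d = y - mu
    return np.exp(-np.real(d.conj() @ np.linalg.solve(Sigma, d))) / \
           (np.pi ** len(y) * np.real(np.linalg.det(Sigma)))

def symbol_posterior(yk, s_hat, Sigma_hat, H_of_x, constellation, sigma2):
    """p(x_k | x_1^{k-1}, y_1^k) over the candidate symbol pairs (sketch, uniform prior).

    H_of_x        : callable returning H_k for a candidate pair x_k
    constellation : iterable of candidate pairs x_k
    """
    I2 = np.eye(2)
    lik = np.array([gc(Hx @ s_hat, Hx @ Sigma_hat @ Hx.conj().T + sigma2 * I2, yk)
                    for Hx in (H_of_x(x) for x in constellation)])
    return lik / lik.sum()   # the uniform prior p(x_k) cancels in the ratio

The contribution of time $k$ to the upper bound on the conditional entropy is then $-\log_2$ of the posterior entry associated with the symbol pair actually transmitted.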
Similarly, for the lower bound to the conditional entropy, one has
$$p(x_k \mid x_1^{k-1}, x_{k+1}^N, y_1^N) = \frac{p(x_k)\, p(y_k \mid x_1^N, y_1^{k-1}, y_{k+1}^N)}{\sum_{x_k \in \mathcal{X}_k} p(x_k)\, p(y_k \mid x_1^N, y_1^{k-1}, y_{k+1}^N)},$$
with
$$p(y_k \mid x_1^N, y_1^{k-1}, y_{k+1}^N) = \int_S p(s_k, y_k \mid x_1^N, y_1^{k-1}, y_{k+1}^N)\, ds_k = \int_S p(s_k \mid x_1^N, y_1^{k-1}, y_{k+1}^N)\, p(y_k \mid s_k, x_1^N, y_1^{k-1}, y_{k+1}^N)\, ds_k = \int_S p(s_k \mid x_1^{k-1}, y_1^{k-1}, x_{k+1}^N, y_{k+1}^N)\, p(y_k \mid s_k, x_k)\, ds_k = \int_S g_c(\hat{s}_{fb,k}, \hat{\Sigma}_{fb,k};\, s_k)\, g_c(H_k s_k, \sigma^2 I_2;\, y_k)\, ds_k = g_c\!\left(H_k \hat{s}_{fb,k},\; H_k \hat{\Sigma}_{fb,k} H_k^H + \sigma^2 I_2;\; y_k\right),$$
where $\hat{s}_{fb,k}$ and $\hat{\Sigma}_{fb,k}$ are the estimates produced by combining a forward and a backward Kalman filter as
$$\hat{s}_{fb} = \hat{\Sigma}_b(\hat{\Sigma}_f + \hat{\Sigma}_b)^{-1} \hat{s}_f + \hat{\Sigma}_f(\hat{\Sigma}_f + \hat{\Sigma}_b)^{-1} \hat{s}_b,$$
$$\hat{\Sigma}_{fb} = \left(\hat{\Sigma}_f^{-1} + \hat{\Sigma}_b^{-1}\right)^{-1}.$$
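A direct Python transcription of this forward-backward combination (our helper, assumed names) is:

import numpy as np

def combine_forward_backward(s_f, Sigma_f, s_b, Sigma_b):
    """Combine forward and backward Kalman estimates as in the two equations above (sketch)."""
    W = np.linalg.inv(Sigma_f + Sigma_b)
    s_fb = Sigma_b @ W @ s_f + Sigma_f @ W @ s_b
    Sigma_fb = np.linalg.inv(np.linalg.inv(Sigma_f) + np.linalg.inv(Sigma_b))
    return s_fb, Sigma_fb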

4. Simulation results

The consideration of realistic spectra of the cross-polarization coefficients is beyond the scope of the present paper and is left to future studies. For practical methods to estimate the strength of the cross-polarization interference the reader is referred to [6], where the strength of the interference is given by its autocorrelation at time zero. In the following we express the strength of the interference through the SIR (signal-to-interference ratio), which is the inverse of the interference autocorrelation at time zero. To derive simulation results, we set $\rho = 0$ and, for each of the two random coefficients appearing in the Pol-Mux matrix, we take the first-order ARMA model

$$\lambda(z) = v(z)\,\frac{(1 - z_p)\, z^{-1}}{1 - z_p z^{-1}},$$
where $-1 < z_p < 1$ is the pole of the first-order ARMA model. The filtered sequence has zero mean, unit power spectral density at frequency zero, and power
$$E\{|\lambda_k|^2\} = \frac{1 - z_p}{1 + z_p},$$
hence the SIR is
$$\mathrm{SIR} = \frac{1 + z_p}{1 - z_p}.$$

In the common case where $z_p$ is close to 1, the filtered sequence is a first-order low-pass random sequence with $-3$ dB normalized bandwidth

$$B_{-3} \approx \frac{1 - z_p}{2\pi}.$$
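As a sanity check on the values used in the figures, the following short Python helpers (ours) map $z_p$ to the SIR and to the normalized bandwidth above, and reproduce the SIR values quoted below.

import numpy as np

def sir_db(zp):
    """SIR in dB of the first-order model with pole zp."""
    return 10 * np.log10((1 + zp) / (1 - zp))

def b3_normalized(zp):
    """Approximate -3 dB normalized bandwidth, valid for zp close to 1."""
    return (1 - zp) / (2 * np.pi)

print(round(sir_db(0.977), 1))   # 19.3 dB, the moderate-interference case of Fig. 1
print(round(sir_db(0.887), 1))   # 12.2 dB, the strong-interference case of Fig. 2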

Figure 1 gives the upper and lower bounds to the information rate of 4-QAM, 16-QAM, and 64-QAM obtained with $z_p = 0.977$, corresponding to SIR = 19.3 dB. With such moderate interference the two bounds are close to each other, even for 64-QAM. Moreover, at high values of SNR (signal-to-noise ratio) the information rates reach the maximum value allowed by the constellation sizes, i.e., the value achievable on the pure AWGN (additive white Gaussian noise) channel: 4 bits for 2 × 4-QAM, 8 bits for 2 × 16-QAM, and 12 bits for 2 × 64-QAM.

Fig. 1 Upper and lower bounds to the information rate for various modulation formats and $z_p = 0.977$. The signal-to-noise ratio is $\mathrm{SNR} = 1/\sigma^2$.

Figure 2 gives the same upper and lower bounds obtained with $z_p = 0.887$, that is, SIR = 12.2 dB. In practice this appears to be a strong interference condition, since the minimum SIR reported in the experimental results of [6] is around 14 dB. In this case, the information rate with 64-QAM at high SNR remains well below the information rate achieved on the AWGN channel, thus confirming that the Pol-Mux interference becomes the limiting factor for the information rate transferred through the channel. We also note that the spread between the upper and lower bounds becomes large with 64-QAM at high SNR, where the capability of tracking the MIMO channel becomes crucial. In essence, the lower bound gives up the blind part of the tracking, thus sacrificing some tracking capability, while the upper bound upgrades the blind tracking to data-aided tracking, thus granting more tracking capability than can actually be achieved.

Fig. 2 Upper and lower bounds to the information rate for various modulation formats and $z_p = 0.887$. The signal-to-noise ratio is $\mathrm{SNR} = 1/\sigma^2$.

5. Conclusions

We have proposed upper and lower bounds to the information rate of the Pol-Mux channel and shown simulation results for a specific channel model. The results show that, with moderate interference, the bounds are so close that they virtually pin down the exact information rate. With strong interference and modulation formats of high spectral efficiency there is still some spread between the two bounds, leaving room for future investigation.

References and links

1. M. S. A. S. Al Fiad, M. Kuschnerov, S. L. Jansen, T. Wuth, D. van den Borne, and H. de Waardt, “11 × 224-Gb/s POLMUX-RZ-16QAM transmission over 670 km of SSMF with 50-GHz channel spacing,” IEEE Photon. Technol. Lett. 22(15), 1150–1152 (2010).

2. V. A. J. M. Sleiffer, M. S. A. S. Al Fiad, D. van den Borne, M. Kuschnerov, V. Veljanovski, M. Hirano, Y. Yamamoto, T. Sasaki, S. L. Jansen, T. Wuth, and H. de Waardt, “10 × 224-Gb/s POLMUX-16QAM transmission over 656 km of Large-Aeff PSCF with a spectral efficiency of 5.6 b/s/Hz,” IEEE Photon. Technol. Lett. 23(20), 1427–1429 (2011).

3. P. Boffi, M. Ferrario, L. Marazzi, P. Martelli, P. Parolari, A. Righetti, R. Siano, and M. Martinelli, “Stable 100-Gb/s POLMUX-DQPSK transmission with automatic polarization stabilization,” IEEE Photon. Technol. Lett. 21(11), 745–747 (2009).

4. G. J. Foschini and M. J. Gans, “On limits of wireless communications in a fading environment when using multiple antennas,” Wireless Pers. Commun. 6(3), 311–335 (1998).

5. S. J. Savory, “Digital coherent optical receivers: Algorithms and subsystems,” IEEE J. Sel. Top. Quantum Electron. 16(5), 1164–1179 (2010).

6. L. Li, Z. Tao, L. Liu, W. Yan, S. Oda, T. Hoshida, and J. C. Rasmussen, “Nonlinear polarization crosstalk canceller for dual-polarization digital coherent receivers,” presented at the Optical Fiber Communication Conference, collocated with the National Fiber Optic Engineers Conference (OFC/NFOEC), IEEE, Piscataway, NJ, USA, 21 March 2010.

7. P. Layec, A. Ghazisaeidi, G. Charlet, J.-C. Antona, and S. Bigo, “Generalized maximum likelihood for cross-polarization modulation effects compensation,” J. Lightwave Technol. 33(7), 1300–1307 (2015).

8. J. Li, R. Schmogrow, D. Hillerkuss, P. C. Schindler, M. Nazarathy, C. Schmidt-Langhorst, S.-B. Ezra, I. Tselniker, C. Koos, W. Freude, and J. Leuthold, “A self-coherent receiver for detection of PolMUX coherent signals,” Opt. Express 20(19), 21413–21433 (2012).

9. R. H. Etkin and D. N. C. Tse, “Degrees of freedom in some underspread MIMO fading channels,” IEEE Trans. Inf. Theory 52(4), 1576–1608 (2006).

10. L. Barletta, M. Magarini, S. Pecorino, and A. Spalvieri, “Upper and lower bounds to the information rate transferred through first-order Markov channels with free-running continuous state,” IEEE Trans. Inf. Theory 60(7), 3834–3844 (2014).

11. L. Barletta, M. Magarini, and A. Spalvieri, “Estimate of information rates of discrete-time first-order Markov phase noise channels,” IEEE Photon. Technol. Lett. 23(21), 1582–1584 (2011).

12. L. Barletta, M. Magarini, and A. Spalvieri, “The information rate transferred through the discrete-time Wiener’s phase noise channel,” J. Lightwave Technol. 30(10), 1480–1486 (2012).

13. L. Barletta, M. Magarini, and A. Spalvieri, “A new lower bound below the information rate of Wiener phase noise channel based on Kalman carrier recovery,” Opt. Express 20(23), 25471–25477 (2012).
