Partially pruned DNN coupled with parallel Monte-Carlo algorithm for path loss prediction in underwater wireless optical channels

Zihao Du; Zihao Du; Zihao Du; Wenmin Ge; Wenmin Ge; Wenmin Ge; Guangbin Song; Guangbin Song; Guangbin Song; Guangbin Song; Yizhan Dai; Yizhan Dai; Yizhan Dai; Yufan Zhang; Yufan Zhang; Yufan Zhang; Jianmin Xiong; Jianmin Xiong; Jianmin Xiong; Bowen Jia; Bowen Jia; Bowen Jia; Bowen Jia; Yan Hua; Yan Hua; Yan Hua; Dongfang Ma; Dongfang Ma; Zejun Zhang; Zejun Zhang; Zejun Zhang; Jing Xu; Jing Xu; Jing Xu; Jing Xu

doi:10.1364/OE.455992

1. Introduction

With the global economy flourishing nowadays, land resources have gradually been unable to satisfy the increasing demands of human life. Considering the abundant natural resources that the ocean contains (e.g., fishery, mineral, oil, and gas), it has become a second home for humans and is playing an indispensable role in human activities. Meanwhile, aiming to carry out the investigation, research, and exploitation of oceans effectively and flexibly, underwater wireless communication (UWC) has already become an essential and widely used technical means over the past decades [1].

As an emerging UWC technology, underwater wireless optical communication (UWOC) has developed rapidly and attracted lots of attention in the last few years [2]. Compared with the conventional UWC methods, including underwater acoustic communication and underwater radio-frequency communication, UWOC has been increasingly regarded as an appropriate alternative for the underwater wireless sensor networks, uncrewed marine vehicles, ocean resource exploration, and seabed environmental monitoring, which is owing to its advantages of tremendous bandwidth over hundreds of GHz, achievable transmission distance up to hundreds of meters, low latency, low cost as well as high security [3–6]. Given this situation, a number of studies have been reported to analyze and validate the feasibility of UWOC systems via diverse experiments and numerical simulations [7].

Since the two major physical mechanisms, namely absorption and scattering, strongly corrupt the light propagation in marine medium, the transmission performance of UWOC systems is significantly governed by the underwater channel characteristics. The two mentioned inherent optical properties (IOPs), arising from water molecules, particulates, and dissolved matters, will affect the amplitude, phase, arrival angle, spot size, and energy distribution of light beam at the receiving end, which can eventually result in temporal dispersion and optical power reduction [8–9]. However, such channel characteristics and corresponding effects are more difficult to be accurately described than those of free space optical (FSO) communication and visible light communication (VLC), due to the difficulties of conducting field experiments in highly dynamic and complex waters [10]. As a substitute, analytical and numerical approaches have been developed and gradually become the alternative methods to mathematically study and analyze the underwater channel characteristics of UWOC links [11]. In this case, one of the main targets is to evaluate the overall optical path loss, which can be quite helpful for further link budget calculations to some extent.

Within the past few decades, many approaches have been reported in the literature to evaluate the underwater optical path loss. Among these, Beer-Lambert’s law is the most widely used model in the line-of-sight (LOS) UWOC systems because of its simplicity [12]. But considering its rough assumption that the scattered photons are annihilated and cannot be captured by the detector after multiple scattering events, Beer-Lambert’s law will severely underestimate the total received optical power, especially in the turbid water. Besides, the radiative transfer equation (RTE), which takes both absorption and scattering effects into account, has been employed as a deterministic solution to fully describe underwater light propagation. However, it should be pointed out that since the RTE is a complicated integrodifferential equation containing plenty of independent variables, it cannot be analytically solved for underwater conditions. Therefore, several numerical approaches have been proposed to obtain an approximate solution of the RTE [13]. Table 1 outlines four typical numerical RTE methods that have been studied in recent years [13–17], and the partially pruned deep neural network method proposed by us, namely PPDNN, is also presented here for comparison, which will be comprehensively discussed in the following sections. The assessment items used here are from Ref. [13].

Table 1. Comparison of numerical RTE methods

View Table | View all tables in this article

The previously reported methods mentioned in Table 1 can be generally divided into two categories: probabilistic and deterministic methods [15]. Among these, the Monte-Carlo (MC) simulation is the most popular probabilistic approach widely applied to numerically solve the RTE owing to its simplicity, intuitiveness, and understandability. Nevertheless, its main drawback lies in that it is very time-consuming, especially when accurately simulating a long-distance or strong-scattering channel since millions of photons are required to assure sufficient received photons. On the other hand, the deterministic methods, such as the conventional discrete ordinate method [14], the direct RTE solver [15], and Markov chains [16] mentioned in Table 1, will partly suffer from the disadvantages of mathematical complexity and poor universality. Besides, they are more difficult to be programmed and only specific to some given parameters such as a certain volume scattering function (VSF), whereas the MC simulation can directly use the look-up table method without any probability recalculations for a different VSF [13]. Meanwhile, even though the deterministic ones can obtain the path loss more quickly, yet the final accuracy highly depends on the granularity of the parameter discretization process, and the performance will gradually become worse as the transmission distance increases [17–18].

Inspired by the deep learning methods applied to solve the Navier-Stoke equation [19–20], the deep neural network (DNN) method is first introduced here to effectively solve the RTE and determine the path loss for UWOC systems. Specifically, a partially pruned DNN model is proposed to predict the received optical power in underwater wireless optical channels, and the required model datasets are generated by the parallel MC simulation. It is worth noting that the deterministic methods are not suitable for datasets generation due to the aforementioned drawbacks. Once trained, the proposed PPDNN can instantly provide a predicted value of the optical path loss without additional bulky computation or iterative operation. The major contributions of this paper are summarized as follows:

(1) A parallel algorithm is newly employed to speed up the MC simulation via vectorization operation and graphics processing unit (GPU) acceleration. Additionally, a comprehensive MC simulation with a detailed scattering angle generation process is introduced.
(2) A DNN model is trained and employed to learn how to solve the RTE and predict the path loss under different transmission distances, fields of view (FOV), and detector radiuses at the receiving end, with an interesting finding that the mean square errors (MSEs) between the predictive and the reference ones are significantly lower than those obtained by other methods for three typical water types.
(3) A pruning strategy is utilized to compress the size of the original DNN model [21], and the simulation results indicate that the MSEs are all lower than 0.2 while the sparsity of weights can be appropriately increased to 0.9, 0.7, and 0.5 for clear water, coastal water, and harbor water, respectively. Finally, the occupied storage space of the proposed PPDNN can be reduced by at least 40% of the original size.

The rest of this paper is well-organized as follows: Section 2 introduces the complete algorithm flow of the proposed parallel MC simulation; Section 3 describes the overall structure of the DNN model as well as the pruning algorithm; Section 4 obtains the simulation results under different parameter settings, and some discussions are presented; Section 5 draws the conclusions of this paper.

2. Parallel MC algorithm

As noted in Ref. [22], there are four typical MC methods proposed for photon migration simulations, namely albedo-weight (AW), albedo-rejection (AR), absorption-scattering path length rejection, and microscopic Beer-Lambert law, respectively. In our simulations, the most favored AW method was chosen since there is no wasted computation [22]. To provide a better pre-understanding before introducing the parallel MC algorithm, a simplified flowchart generalizing the process of a typical single-step MC simulation and some corresponding sketches are presented in Fig. 1. Firstly, a laser diode (LD) light source with a Gaussian beam distribution is commonly employed to propagate along the z-axis in the x-y plane with radius W and divergence angle θ. The initial 3D emission coordinates (x₀, y₀, z₀) and direction cosines (u_x₀, u_y₀, u_z₀) of each photon shown in Fig. 1 can be expressed as:

(1)$$\left\{ \begin{array}{rll} ({x_0},{y_0},{z_0}) &= &({r_0}\cos {\alpha_0},{r_0}\sin {\alpha_0},0)\\ ({u_{{x_0}}},{u_{{y_0}}},{u_{{z_0}}}) &= &(\cos {\varphi_0}\sin {\theta_0},\sin {\varphi_0}\sin {\theta_0},\cos {\theta_0}) \end{array} \right.$$

where (r₀, α₀) is the polar coordinate of the emitted photon. φ₀ and θ₀ are the initial azimuth angle and scattering angle, respectively.

Fig. 1. (a) Flowchart of the single-step MC algorithm; (b) Initial 3D emission coordinate and direction cosines of a single photon; (c) Trajectory of a single photon.

Download Full Size | PDF

Since it has been assumed that the irradiance distribution of the light beam strictly conforms to the ideal 2D Gaussian distribution, x₀ and y₀ will independently decrease smoothly from the center outwards in the form of the Gaussian function ${e^{ - 2{r^2}/{W^2}}}$. Therefore, r₀ is subject to Rayleigh distribution, and its probability distribution function (PDF) and cumulative distribution function (CDF) can be given as:

(2)$$\left\{ \begin{aligned} {PDF_{{r_0}}} &= \frac{{{r_0}}}{{{\sigma_0}^2}}{e^{ - \frac{{{r_0}^2}}{{2{\sigma_0}^2}}}}\\ {CDF_{{r_0}}} &= 1 - {e^{ - \frac{{{r_0}^2}}{{2{\sigma_0}^2}}}} \end{aligned} \right.{,^{}}{\sigma _0} = \frac{W}{2}$$

Besides, the expressions of all parameters needed in Eq. (1) can be defined below [23]:

(3)$$\left\{\begin{aligned} {{r_0}} &= \sqrt { - \ln {\xi_{{r_0}}}/2 \cdot W}\\ {{\alpha_0}} &= 2\pi {\xi_{{\alpha_0}}}\\ {\theta_0} &= {{\tan }^{ - 1}}({\raise0.7ex\hbox{${{r_0}}$} \!\mathord{\left/ {\vphantom {{{r_0}} W}} \right.}\!\lower0.7ex\hbox{$W$}} \cdot \tan \theta )\\ {\varphi_0} &= 2\pi {\xi_{{\varphi_0}}} \end{aligned} \right.$$

in which ξ_r₀, ξ_α₀, and ξ_φ₀ obey uniform distribution in the range (0,1), independently. In our simulations, W and θ are set as 0.01 m and 2.5°, respectively.

After the photon initialization, the random path length or step size of the photon can be given as [24]:

(4)$$s ={-} \frac{1}{c}\ln {\xi _s}$$

where ξ_s ∼ U (0,1), and c is the attenuation coefficient of light underwater, which is equal to the sum of the absorption coefficient a and the scattering coefficient b. After the photon travels a certain distance s, the new weight w₁ and coordinates (x₁, y₁, z₁) can be updated as:

(5)$$\left\{ \begin{array}{rll} {w_1} &= &{w_0} \cdot {W_{albedo}}\\ ({x_1},{y_1},{z_1}) &= &({x_0},{y_0},{z_0}) + ({u_{{x_0}}},{u_{{y_0}}},{u_{{z_0}}}) \cdot s \end{array} \right.$$

where the scattering albedo W_albedo= b/c [25]. Considering the calculation efficiency and accuracy, the initial weight w₀ is typically set as 1, and the weight threshold W_t is set as 10⁻¹⁰ in our simulations. That is, for any photon with a weight lower than 10⁻¹⁰, it will be omitted to speed up the whole simulation. Additionally, the new direction cosines (u_x₁, u_y₁, u_z₁) after each scattering event can be given as [23] [26]:

(6)$$\begin{aligned}&\left\{ \begin{aligned} {u_{{x_1}}} &= \frac{{\sin {\theta_1}}}{{\sqrt {1 - {u_{{z_0}}}^2} }}({u_{{x_0}}}{u_{{z_0}}}\cos {\varphi_1} - {u_{{y_0}}}\sin {\varphi_1}) + {u_{{x_0}}}\cos {\theta_1}\\ {u_{{y_1}}} &= \frac{{\sin {\theta_1}}}{{\sqrt {1 - {u_{{z_0}}}^2} }}({u_{{y_0}}}{u_{{z_0}}}\cos {\varphi_1} - {u_{{x_0}}}\sin {\varphi_1}) + {u_{{y_0}}}\cos {\theta_1}\\ {u_{{z_1}}} &= {-} \sin {\theta_1}\cos {\varphi_1}\sqrt {1 - {u_{{z_0}}}^2} + {u_{{z_0}}}\cos {\theta_1} \end{aligned} \right.{,^{}}|{{u_{{z_0}}}} |\le 1 - {10^{ - 5}}\\& \left\{ \begin{aligned} {u_{{x_1}}} &= \cos {\varphi_1}\sin {\theta_1}\\ {u_{{y_1}}} &= \sin {\varphi_1}\sin {\theta_1}\\ {u_{{z_1}}} &= \textrm{sign} ({u_{{z_0}}})\cos {\theta_1} \end{aligned} \right.{,^{}}|{{u_{{z_0}}}} |> 1 - {10^{ - 5}}\end{aligned}$$

It is important to stress that the expression of (u_x₁, u_y₁, u_z₁) depends on the value of |u_z₀| as shown in Eq. (6) [23]. Parameters φ₁ and θ₁ are the new azimuth and scattering angles, respectively. Owing to symmetry, φ₁ is simply subject to the uniform distribution in the range (0,1). As for the distribution of θ₁, lots of analytical models have been proposed in previous studies, such as single term Henyey-Greenstein (STHG) [27], two-term Henyey-Greenstein (TTHG) [28], and Fournier–Forand (FF) phase function [29]. In our work, the FF phase function is chosen and employed to randomly generate θ₁ since it fits Petzold’s measurements [30] well and can be expressed as [31]:

(7)$$\left\{ \begin{aligned} &{\widetilde \beta_{FF}}(\theta ,\mu ,{n_p})= \{ [\delta (1 - {\delta^\upsilon }) - \upsilon (1 - \delta )]{\sin^{ - 2}}\frac{\theta }{2} + \upsilon (1 - \delta ) - (1 - {\delta^\upsilon })\} \\ &\cdot \frac{1}{{4\pi {{(1 - \delta )}^2}{\delta^\upsilon }}} + \frac{{1 - \delta_\pi^\upsilon }}{{16\pi ({\delta_\pi } - 1)\delta_\pi^\upsilon }} \cdot (3{\cos^2}\theta - 1)\\ &\upsilon = \frac{{3 - \mu }}{2}\\ &\delta = \frac{4}{{3{{({n_p} - 1)}^2}}}{\sin^2}\frac{\theta }{2} \end{aligned} \right.$$

in which µ and n_p are the slope parameter of hyperbolic distribution and the real index of refraction of the particles. δ_π is δ evaluated at θ = 180° [31]. In our simulations, µ and n_p are set as 3.5835 and 1.10, respectively [29]. As it is difficult to analytically obtain the inverse CDF expression of the FF phase function, the tabulated method is appropriately applied to generate the scattering angle θ₁ in our simulations [32]. In such case, the procedure can be described as follows: (1) According to the normalized form of the FF phase function, we can first obtain the sampled FF-CDF values by numerical integration (e.g., the trapezoidal rule) and hence produce a lookup table of FF-CDF values and corresponding scattering angle values. (2) Because of the distribution range (0, 1) of the FF-CDF, a uniformly distributed random number between 0 and 1 is generated and can be uniquely matched to a nearest scattering angle value by using the bisection search algorithm. Through repeating the above processes, plenty of random scattering angles can be quickly obtained. As indicated in Fig. 2(a), the probability distribution of scattering angles generated by the tabulated method turns out to be an excellent fit to the FF phase function (solid red line), especially in the top 99% of the FF-CDF, which means that the small scattering angles of 0°∼30° can match the FF phase function values very well.

oe-30-8-12835-i001

Fig. 2. (a) FF phase function (solid red line) compared with the normalized histogram of 10⁹ scattering angles generated by the tabulated method; (b) Time-consuming comparison of parallel and single-step MC simulation with 10⁵ received photons needed.

Download Full Size | PDF

As mentioned before, the common single-step MC simulation is extremely time-consuming due to tremendous repetitions. To overcome this shortcoming, we propose a parallel MC algorithm that employs vectorization operation and GPU acceleration to reduce the running time, as shown in Algorithm 1. In general, the parallel operation packs a mass of photons and emits them simultaneously, and then the states of all photons, such as coordinates, direction cosines, step sizes, and weights, will be updated together at each iteration no matter these photons are eliminated, received, or ongoing. Meanwhile, the GPU is used to accelerate the whole process. Compared with the conventional single-step MC, parallel MC considers the independence of photons, thus making better use of computing resources via vectorization operations. As shown in Fig. 2(b), the running time between single-step and parallel MC simulation under three typical water types reported by Petzold is compared [30]. As the single-step MC requires such a long running time for turbid water and long-distance transmission, a comparison is made for a maximum distance of 20 m here. It can be observed that the parallel MC simulation can effectively reduce the running time by at least 95% or acquire a time gain over 20, and the trend becomes more significant as the water becomes more turbid or the transmission distance increases. In view of this, the parallel MC method will be used to generate adequate datasets fast in our work.

3. Partially pruned DNN model

After generating training and testing datasets by the above parallel MC simulation, the DNN model is developed for predicting the received optical power in underwater wireless optical channels. The DNN model before pruned can be theoretically expressed as:

(8)$${P_p} = {\textrm{W}_4} \times {f_3}(\ldots {f_1}({\textrm{W}_\textrm{1}}\textrm{X} + {\textrm{b}_\textrm{1}})\ldots ) + {\textrm{b}_4}$$

in which X denotes the vector (L, R, FOV) of the input layer. P_p denotes the predicted received optical power, and Θ = {W_i, b_i | i = 1∼4} are the model parameters to be optimized. In our work, three hidden layers containing 10, 20, and 20 neurons, respectively, are employed for deep learning. The corresponding activation function is rectified linear unit (ReLU), while the output layer only employs the linear function to get a regression value. Besides this, the Adam gradient descent method and the MSE loss function are typically used.

To reduce the size of the original DNN model, the pruning algorithm has been applied [21]. To be more specific, the DNN model is first trained to obtain a relatively high accuracy through the mini-batch method. Afterwards, the weights of each layer are sorted in descending order. According to the preset sparse value, some low weights are set to 0 by comparing absolute values, namely pruning. Then the DNN model will be retrained to further fine-tune the remained weights. In this way, the training MSE performance can converge to a relatively low value again. Instead of merely pruning partial weights in one step to achieve the desired sparsity, we use the automated gradual pruning algorithm proposed in [33]:

(9)$${s_t} = {s_f} + ({s_i} - {s_f}){(1 - \frac{{t - {t_0}}}{{n \times dt}})^3}{\kern 7pt}\textrm{for} {\kern 7pt}t \in \{{{t_0},{t_0} + dt,\ldots ,{t_0} + n \times dt} \}$$

where s_i is the initial sparsity value set as 0, and s_f is the final sparsity value, while the pruning process starts at epoch t₀ with pruning frequency dt. That is, the original DNN model will be pruned every dt epochs so that it can be trained to gradually increase its sparsity without a sudden performance fluctuation. As shown in Fig. 3, the final PPDNN will only remain weights with large absolute values indicated by solid red lines. In addition, the weights connecting the input and output layer will not be pruned since they are relatively critical [34]. After trained and partially pruned, the final PPDNN will be employed to predict the received optical power.

Fig. 3. Schematic diagram of the proposed PPDNN model.

Download Full Size | PDF

4. Simulation results and discussions

The parameters of the parallel MC simulation used for training data generation are given in Table 2, and the values in parentheses represent the step intervals. During the testing process, two datasets are properly chosen with the parameters (R, FOV) set as (0.15 m, 25°) and (0.75 m, 180°), respectively, which represents two different conditions for a certain water type. Meanwhile, the maximum testing value of parameter L is set as 132 m, 96 m, and 60 m for three water types, respectively. Note that the parameter L of testing datasets is further increased to be distinguished from the training datasets.

Table 2. Parallel MC simulation parameters for training data generation

View Table | View all tables in this article

We first compared the prediction performance between the original unpruned DNN model and three typical models, including linear regression (LR), support vector regression (SVR) [35], and XGBoost [36], and the two aforementioned testing datasets are used here. The MSE is properly employed as an effective indicator of prediction performance. For instance, Fig. 4 indicates the differences between the predicted and reference values with different methods in harbor water. Besides, the coefficient of determination R² is also given for a better understanding of the goodness of fit. It can be found that the DNN model has the best agreement as its testing MSE value is the lowest and R² is closest to 1, while LR, SVR, and XGBoost cannot yield satisfactory results with the optimal parameters after traversal. More specifically, the LR model based on the least square method is not suitable for such a nonlinear relationship. Meanwhile, the SVR and XGBoost models with optimized parameters are no longer feasible as the transmission distance exceeds the range of training datasets. In such a case, no matter how the other two parameters (R, FOV) are selected, the fitting performance of SVR and XGBoost will be poor. However, since the DNN model can establish a potential correlation between the parameters (L, R, FOV) and the optical power P_p more accurately, it can lead to better fit performance. In Table 3, the MSE values of three water types under different methods are indicated. Compared to other common models, the original DNN model obtains minimum MSE values and has a good predictive ability for the received optical power under different water types.

Fig. 4. Comparison of predicted values and reference values with different methods in harbor water.

Download Full Size | PDF

Table 3. Testing MSE values of three water types with different methods

View Table | View all tables in this article

Since the original DNN model is redundant, the pruning algorithm mentioned in section 3 is employed to compress its size. In Fig. 5, the MSE results in the training process with different NN models and pruning strategies are compared, where the harbor water is chosen as an example, and the final sparsity is properly set as 0.5. The blue curve represents the training process of the original unpruned DNN model, and the dotted red curve shows the sparsity curve in which the pruning process starts at the 600th epoch and stops at the 1500th epoch with a pruning interval of 100 epochs. As the black curve shows, the artificial neural network (ANN) model containing only one hidden layer of 100 neurons is more unstable than other DNN models due to its lack of deep-network fitting ability. The red curve represents the partial pruning strategy, in which the weights of hidden layers connecting the input layers and output layers are not pruned. Besides, it can be observed that the red curve has the smoothest training convergence process by employing the pre-trained weights obtained from the originally unpruned DNN model. However, as the green curve shows, the training process without the pre-trained weights would be slower and rougher. That is, the pre-trained weights instead of random initial weights can further accelerate the convergence process since the pruning process can be regarded as a fine-tuning process for the weights in some way. Additionally, compared with other strategies, the purple curve indicates that the full pruning strategy, in which the weights of all layers are pruned, will lead to a significant degradation of MSE performance, which proves the selectivity of pruning is necessary.

Fig. 5. MSE performance of training process for different NN and pruning strategies in harbor water.

Download Full Size | PDF

The final training MSE values of different strategies are presented in Table 4. It can be found that the PPDNN with pre-trained weights can reduce the number of weights from 701 to 401. Even though the original DNN model is compressed to 57.2%, the final training MSE value increases only by 1.36%. At the same time, although the ANN model with only one hidden layer is simpler, its final MSE value is a little higher than the PPDNN. Using the same pruning algorithm, the final MSE results for three typical water types with different sparsity are obtained. In Fig. 6, it indicates that the lower sparsity is needed as the water quality deteriorates. However, even if the sparsity increases to 0.9, the final testing MSE performance will hardly increase for clear water. As well, the MSE values are also almost unchanged when the sparsity increases to 0.7 and 0.5 for coastal water and harbor water, respectively. But as the sparsity is further improved, the MSE performance of harbor water will be further degraded. That is because the scattering and absorption effects are more obvious in turbid water. In this case, any changes in the three variables (L, FOV, R) of the input layer will significantly affect the final optical power loss so that the PPDNN needs more weights to fit the entire process.

Fig. 6. Impact of different sparsity on the MSE performance of testing datasets for three typical water types.

Download Full Size | PDF

Table 4. Number of parameters and final training MSE values of different methods

View Table | View all tables in this article

Furthermore, more detailed results are shown in Fig. 7. The corresponding MC simulation results are used as a reference, and “0.15 m/20°” in figures represents the values of (R, FOV). It can be found that the predicted values are in good agreement with the reference values, and all testing MSE values are lower than 0.2. In general, the obtained results have proved the feasibility of the pruning method for different water types.

Fig. 7. Comparison of PPDNN predicted values and MC reference values.

Download Full Size | PDF

To further understand the state of weight pruning, for instance, the heatmaps of pruned DNN model for clear water with sparsity 0.9 are shown in Fig. 8. The x-axis and y-axis indicate the neurons for the corresponding layers, and the color block represents the weights between the corresponding two neurons. It can be observed that lots of blocks have zero values owing to the large sparsity, leading to a significant size compression and relatively low space complexity. Meanwhile, it is worth noting that some biases are also trained to zero since lots of weights are deleted, further decreasing the number of parameters needed for the DNN model. Table 5 shows the final occupied storage space of the PPDNN for three typical water types. According to Table 5, it can be observed that more turbid water whose sparsity cannot be too large requires more parameters, resulting in a larger storage space correspondingly. Nevertheless, the compression ratios of the three water types are all less than 40%, which can be used to decrease the occupied storage space effectively.

Fig. 8. Heatmap of the pruned weights and corresponding biases for clear water (sparsity = 0.9).

Download Full Size | PDF

Table 5. Occupied storage space of DNN models for different water types

View Table | View all tables in this article

5. Conclusions

In this paper, a partial pruned DNN model coupled with parallel MC simulation has been proposed to predict the path loss in underwater wireless optical channels. The parallel algorithm is employed to speed up the MC simulation and generate essential datasets for the DNN model. Compared with LR, SVR, and XGBoost methods, the DNN model has the best MSE performance. As well, the partial pruned DNN model is trained and employed to predict the path loss in various water types, resulting in a relatively low MSE value below 0.2 and a compression ratio of storage space less than 40%. The proposed PPDNN can be employed to instantly provide a reference value for the design of UWOC systems without intensive time consumption. The proposed path loss prediction scheme can be deployed on devices owing to the relatively small required storage space. In the future, more channel parameters and corresponding datasets will be included in the proposed PPDNN, and it is well encouraged to consider more conditions like bubbles and turbulence to build a universal prediction model in our follow-up work.

Funding

National Natural Science Foundation of China (61971378); Strategic Priority Research Program of the Chinese Academy of Sciences (XDA22030208); Zhoushan-Zhejiang University Joint Research Project (2019C81081).

Disclosures

The authors declare no conflicts of interest.

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

References

1. I. U. Khan, B. Iqbal, L. Songzou, H. Li, G. Qiao, and S. Khan, “Full-duplex Underwater Optical Communication Systems: A Review,” in Proceedings of IEEE Conference on International Bhurban Conference on Applied Sciences and Technologies (IEEE, 2021), pp. 886–893.

2. S. Zhu, X. Chen, X. Liu, G. Zhang, and P. Tian, “Recent progress in and perspectives of underwater wireless optical communication,” Prog. Quantum Electron. 73, 100274 (2020). [CrossRef]

3. C. Lee, C. Zhang, M. Cantore, R. M. Farrell, S. H. Oh, T. Margalith, J. S. Speck, S. Nakamura, J. E. Bowers, and S. P. DenBaars, “4 Gbps direct modulation of 450 nm GaN laser for high-speed visible light communication,” Opt. Express 23(12), 16232–16237 (2015). [CrossRef]

4. J. Wang, C. Lu, S. Li, and Z. Xu, “100 m/500 Mbps underwater optical wireless communication using an NRZ-OOK modulated 520 nm laser diode,” Opt. Express 27(9), 12171–12181 (2019). [CrossRef]

5. M. Doniec, A. Xu, and D. Rus, “Robust real-time underwater digital video streaming using optical communication,” in Proceedings of IEEE International Conference on Robotics and Automation (IEEE, 2013), pp. 5117–5124.

6. Z. Zeng, S. Fu, H. Zhang, Y. Dong, and J. Cheng, “A Survey of Underwater Optical Wireless Communications,” IEEE Commun. Surv. Tutorials 19(1), 204–238 (2017). [CrossRef]

7. M. A. Khalighi, C. Gabriel, T. Hamza, S. Bourennane, P. Leon, and V. Rigaud, “Underwater wireless optical communication; Recent advances and remaining challenges,” in Proceedings of IEEE International Conference on Transparent Optical Networks (IEEE, 2014), pp. 1–4.

8. M. A. A. Ali, “Analyzing of Short Range Underwater Optical Wireless Communications Link,” Int. J. Electron. Commun. Technol. 4(3), 125–132 (2013).

9. S. Arnon, J. Barry, G. Karagiannidis, R. Schober, and M Uysal, Advanced Optical Wireless Communication Systems (Cambridge University, 2012).

10. L. Liu, S. Zhou, and J. Cui, “Prospects and problems of wireless communication for underwater sensor networks,” Wirel. Commun. Mob. Comput. 8(8), 977–994 (2008). [CrossRef]

11. S. Jaruwatanadilok, “Channel Modeling and Performance Evaluation using Vector Radiative Transfer Theory,” IEEE J. Sel. Areas Commun. 26(9), 1620–1627 (2008). [CrossRef]

12. J. H. Smart, “Underwater optical communications systems Part 1: Variability of water optical parameters,” in Proceedings of IEEE Conference on Military Communications (IEEE, 2005), pp. 1140–1146.

13. C. T. Geldard, J. Thompson, and W. O. Popoola, “An Overview of Underwater Optical Wireless Channel Modelling Techniques,” in Proceedings of IEEE International Symposium on Electronics and Smart Devices (IEEE, 2019), pp. 1–4.

14. C. D. Mobley, B. Gentili, H. R. Gordon, Z. Jin, G. W. Kattawar, A. Morel, P. Reinersman, K. Stamnes, and R. H. Stavn, “Comparison of numerical models for computing underwater light fields,” Appl. Opt. 32(36), 7484–7504 (1993). [CrossRef]

15. C. Li, K. H. Park, and M. S. Alouini, “On the Use of a Direct Radiative Transfer Equation Solver for Path Loss Calculation in Underwater Optical Wireless Channels,” IEEE Wirel. Commun. Lett. 4(5), 561–564 (2015). [CrossRef]

16. T. Zhou, J. Ma, T. Lu, G. Hu, T. Fan, X. Zhu, X. Zhu, and W. Chen, “Simulation and verification of pulsed laser beam propagation underwater using Markov chains [Invited],” Chinese Opt. Lett. 17(10), 100003 (2019). [CrossRef]

17. E. Illi, F. El Bouanani, and F. Ayoub, “A high accuracy solver for RTE in underwater optical communication path loss prediction,” in Proceedings of IEEE International Conference on Advanced Communication Technologies and Networking (IEEE, 2018), pp. 1–8.

18. E. Illi, F. El Bouanani, K. H. Park, F. Ayoub, and M. S. Alouini, “An Improved Accurate Solver for the Time-Dependent RTE in Underwater Optical Wireless Communications,” IEEE Access 7, 96478–96494 (2019). [CrossRef]

19. S. H. Rudy, S. L. Brunton, J. L. Proctor, and J. N. Kutz, “Data-driven discovery of partial differential equations,” Sci. Adv. 3(4), 1–7 (2017). [CrossRef]

20. N. Thuerey, K. Weißenow, L. Prantl, and X. Hu, “Deep Learning Methods for Reynolds-Averaged Navier–Stokes Simulations of Airfoil Flows,” AIAA J. 58(1), 25–36 (2020). [CrossRef]

21. M. G. Augasta and T. Kathirvalavakumar, “Pruning algorithms of neural networks - A comparative study,” Open Comput. Sci. 3(3), 105–115 (2013). [CrossRef]

22. A. Sassaroli and F. Martelli, “Equivalence of four Monte Carlo methods for photon migration in turbid media,” J. Opt. Soc. Am. A. 29(10), 2110–2117 (2012). [CrossRef]

23. W. Cox, Simulation, modeling, and design of underwater optical communication systems (North Carolina State University, 2012).

24. Y. Zhang, Y. Wang, and A. Huang, “Analysis of underwater laser transmission characteristics under Monte Carlo simulation,” in Proceedings of IEEE Conference on OCEANS-MTS/IEEE Kobe Techno-Oceans (IEEE, 2018), pp. 1–5.

25. C. F. Bohren and D. R. Huffman, Absorption and scattering of light by small particles (John Wiley & Sons, 2008).

26. Y. Ding, B. Song, and N. Li, “Monte Carlo method-based dynamic simulation of underwater optical transmission characteristics,” Proc. SPIE 10255, 102555C (2017). [CrossRef]

27. D. Toublanc, “Henyey–Greenstein and Mie phase functions in Monte Carlo radiative transfer computations,” Appl. Opt. 35(18), 3270–3274 (1996). [CrossRef]

28. V. I. Haltrin, “One-parameter two-term Henyey-Greenstein phase function for light scattering in seawater,” Appl. Opt. 41(6), 1022–1028 (2002). [CrossRef]

29. G. R. Fournier and J. L. Forand, “Analytic phase function for ocean water,” Proc. SPIE 2258, 94–201 (1994). [CrossRef]

30. T. J. Petzold, Volume scattering functions for selected ocean waters (Scripps Institution of Oceanography La Jolla Ca Visibility Lab, 1972).

31. C.D. Mobley, L.K. Sundman, and E. Boss, “Phase function effects on oceanic light fields,” Appl. Opt. 41(6), 1035–1050 (2002). [CrossRef]

32. J. Zhang, “On sampling of scattering phase functions,” Astron. Comput. 29, 100329 (2019). [CrossRef]

33. M. Zhu and S. Gupta, “To prune, or not to prune: exploring the efficacy of pruning for model compression,” arXiv preprint arXiv:1710.01878 (2017).

34. Y. Zhao and N. Chi, “Partial pruning strategy for a dual-branch multilayer perceptron-based post-equalizer in underwater visible light communication systems,” Opt. Express 28(10), 15562–15572 (2020). [CrossRef]

35. A. J. Smola and B. Schölkopf, “A tutorial on support vector regression,” Stat. Comput. 14(3), 199–222 (2004). [CrossRef]

36. T. Chen and C. Guestrin, “Xgboost: A scalable tree boosting system,” in Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining, 2016, pp. 785–794.

Reference	Method	Mathematical difficulty	Programming difficulty	Universality	Running time
[13]	Monte-Carlo	Simple	Simple	General	Slow
[14]	Discrete ordinate	Highly mathematical	Difficult to program	Analytic solution for certain conditions	Fast
[15]	Direct solver
[16]	Markov chains
This work	PPDNN	Simple	Moderate	General	Instant

		Water types [30]
Symbol	Physical meaning	Clear water	Coastal water	Harbor water
c (m^-1)	Attenuation coefficient	0.151	0.398	2.19
W_albedo	Albedo	0.245	0.55	0.84
L (m)	Transmission distance	4∼80 (4)	3∼60 (3)	2∼30 (2)
R (m)	Detector radius	0.1∼0.5 (0.1)
FOV (°)	Field of view	10∼180 (10)

	Method
Water type	LR	SVR	XGBoost	DNN
Clear water	1.955	3.031	7.785	0.05696
Coastal water	2.237	1.271	9.142	0.08725
Harbor water	5.135	9.408	6.115	0.18030

Strategies	Number of parameters	Final training MSE
ANN (with one hidden layer)	501	0.008048
Unpruned DNN	701	0.004420
Partial pruned DNN w/ pre-trained weights	401	0.004480
Partial pruned DNN w/o pre-trained weights	401	0.007776
Full pruned DNN w/ pre-trained weights	376	0.102900

		Occupied storage space
Water types	Number of parameters after pruning	Before pruning	After pruning	Compression ratio
Clear water	154 (sparsity = 0.9)	9.04KB	2.64KB	29.2%
Coastal water	277 (sparsity = 0.7)	8.81KB	3.13KB	35.5%
Harbor water	395 (sparsity = 0.5)	8.97KB	3.57KB	39.8%

Partially pruned DNN coupled with parallel Monte-Carlo algorithm for path loss prediction in underwater wireless optical channels

Abstract

1. Introduction

2. Parallel MC algorithm

3. Partially pruned DNN model

4. Simulation results and discussions

5. Conclusions

Funding

Disclosures

Data availability

References

Data availability

Cited By

Figures (8)

Tables (5)

Equations (9)

Optics Express