Taxonomy of hybridly polarized Stokes vortex beams

Gauri Arora; Ankit Butola; Ruchi Rajput; Rohit Agarwal; Krishna Agarwal; Alexander Horsch; Dilip K Prasad; Paramasivam Senthilkumaran

doi:10.1364/OE.512409

1. Introduction

Phase singularities are associated with orbital angular momentum (OAM) [1–3], and circular polarization is related to the spin angular momentum (SAM) of light [4]. The superposition of two beams in orthogonal polarization states, with at least one of the beams carrying a phase singularity, leads to the formation of Stokes singularities [5–7]. Recently, a similar superposition of two half-skyrmions (merons) of opposite polarity has been theoretically shown to form topological bimeronic beams [8]. Beams with Stokes singularities are called Stokes singular beams (SSBs); in these beams, light’s spin and orbital angular momentum are coupled. Several methods, such as interferometric techniques [9], spatial light modulators (SLMs) [10,11], stress-engineered optics [12], and spatially inhomogeneous wave plates [13], have been reported for generating these beams. The polarization distribution associated with these beams adds an extra degree of freedom in optical communication, which increases the channel capacity [14–16]. The exotic properties of these beams have been exploited in laser beam shaping [17], super-resolution microscopy [18], chirality measurement [19], image processing [20], robust beam engineering [21], and Möbius strip generation [22], among others.

The role of SSBs in various applications depends on the net OAM (mode indices) present in these beams. The Stokes index used to define SSBs does not provide complete information about the vector beams in terms of their net OAM content. Therefore, it is necessary to identify these singularities based on their superposition modes before deploying them in a particular application. However, the detection techniques for these beams are limited to a few, including Stokes polarimetry [23], interferometric [24], and diffraction techniques [25–28]. These detection techniques cannot deal with all the degenerate states associated with Stokes singular beams. In addition, unavoidable experimental fluctuations in phase, amplitude, and polarization, together with other beam fluctuations, make detection a challenging task. Recently, artificial intelligence (AI) has emerged as a powerful tool to boost scientific research in the field of optics and photonics. AI-based techniques are also being adopted for the identification of multi-singularity structured fields and the orbital angular momentum of vortex beams, among others [29,30]. In the vector regime, machine learning-based detection of vector vortex beams is presented in [31]. However, the classification method adopted there is based on Stokes polarimetry, which does not consider the degeneracy present in SSBs. In addition, that report focuses on the classification of vector vortex beams, i.e., a subset of Stokes singularities. To date, there is no report that deals with identifying all the Stokes singularities and their degenerate states.

Here, we present the detection of all Stokes singularities by exploiting the diffraction and polarization transformation patterns of the singular beams, assisted by a deep neural network. A deep neural network provides an excellent framework for detecting hybridly polarized beams due to its ability to recognize features in the intensity images that are hard to discern and would not be detectable merely by intensity-sensitive measurements. The method employed here is based on the diffraction of Stokes singular beams through an equilateral triangular aperture in combination with polarization transformation. The resultant total and component intensities of a particular Stokes singular beam after diffraction are utilized for classification. A total of 15 classes of beams, defined by the type of Stokes singularity and the associated mode indices, are first simulated and then generated experimentally.

Next, the classification of all Stokes singular beams is performed and compared using five different deep neural networks. These networks are trained with mixed simulation and experimental datasets which consist of 15300 images (15000 simulated and 300 experimentally acquired). A total of 90% of the simulated images and 50% of the experimentally acquired images are used for training and validation. Finally, the optimized trained network is tested on the remaining 10% of the simulated images and 50% of the experimental images. Separate testing accuracies for the simulated and experimental datasets are shown to demonstrate the advantage of the current framework. We found that ResNet-18 offers the best testing accuracy, i.e., 97.67% and 98.67% for simulated and experimental data, respectively. Furthermore, the current experimental and computational approach addresses both detection and classification tasks, thus enabling the recognition of total and component intensities of the Stokes singular beams, which could be relevant in applications such as super-resolution optical microscopy and optical communication. Our findings demonstrate the soundness of the proposed method for the detection of all Stokes singularities and offer a novel identification method that paves the way to scalable microscopy applications through Stokes beam illumination.

2. Methods

2.1 Stokes degenerate states

Any state of polarization of light can be defined using four Stokes parameters $S_0$, $S_1$, $S_2$, and $S_3$ [23]

$$\begin{aligned} S_0 &= |E_x|^2+|E_y|^2, & S_1 &= |E_x|^2-|E_y|^2,\\ S_2 &= 2\,\mathrm{Re}(E_x^*E_y), & S_3 &= 2\,\mathrm{Im}(E_x^*E_y). \end{aligned} \tag{1}$$

For inhomogeneously polarized beams, the Stokes parameters are functions of $x$ and $y$. Here, the Stokes parameter $S_0$ gives the total intensity, and the other Stokes parameters give differences in the intensities of orthogonal polarization components. Using these Stokes parameters, the Stokes fields $S_{jk}=S_j+iS_k$ can be constructed, namely $S_{12}$, $S_{23}$, and $S_{31}$. The corresponding Stokes phases are $\phi _{12}$, $\phi _{23}$, and $\phi _{31}$, where $\phi _{jk}$ is the argument of the Stokes field, defined as $\phi _{jk}=\tan ^{-1} (S_k/S_j)$. The Stokes singularities are points of phase singularity in these Stokes phases. The Stokes singularities $\phi _{12}$, $\phi _{23}$, and $\phi _{31}$ provide information about the phase singularities present in orthogonal polarization basis states, namely, the circular (R, L), linear (H, V), and linear (D, A) bases, respectively. Here, R, L, H, V, D, and A represent the right circular, left circular, horizontal, vertical, diagonal, and anti-diagonal polarization states. Various types of Stokes singularities are reported in the literature, including point and line singularities in the 2D transverse plane of the beam, and singularities present in 3D fields, which include Möbius strips, links, and knots [22]. In this article, we deal with point Stokes singularities present in the 2D transverse plane of the beam.
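
As a rough illustration of Eq. (1) and the Stokes phases, the following minimal NumPy sketch computes the Stokes parameters from the complex field components and evaluates a Stokes phase. The function names are hypothetical and not from the original work, and `np.arctan2` is used in place of the plain inverse tangent so that the phase is resolved over the full $(-\pi, \pi]$ range; that choice is an assumption of this sketch.

```python
import numpy as np

def stokes_parameters(Ex, Ey):
    """Return (S0, S1, S2, S3) from complex field components, following Eq. (1)."""
    Ex = np.asarray(Ex, dtype=complex)
    Ey = np.asarray(Ey, dtype=complex)
    cross = np.conj(Ex) * Ey              # E_x^* E_y
    S0 = np.abs(Ex) ** 2 + np.abs(Ey) ** 2
    S1 = np.abs(Ex) ** 2 - np.abs(Ey) ** 2
    S2 = 2.0 * cross.real
    S3 = 2.0 * cross.imag
    return S0, S1, S2, S3

def stokes_phase(Sj, Sk):
    """Argument of the Stokes field S_jk = S_j + i S_k (quadrant-resolved)."""
    return np.arctan2(Sk, Sj)

# Example: equal-amplitude, in-phase x and y components (diagonal polarization)
S0, S1, S2, S3 = stokes_parameters([1 / np.sqrt(2)], [1 / np.sqrt(2)])
phi_12 = stokes_phase(S1, S2)
```

Applied to 2D arrays of $E_x(x,y)$ and $E_y(x,y)$, the same functions return spatial maps of the Stokes parameters and phases.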

At a $\phi _{12}$ Stokes singular point, the polarization azimuth is indeterminate; hence, these singularities are also known as polarization singularities. The other two types of Stokes singularities are called Poincaré vortices. Under the paraxial approximation, a Stokes singular beam can be mathematically expressed as the superposition of two beams in orthogonal polarization states such that at least one of them carries a phase singularity, and is given by

$$\left|\psi_{l,m}(r,\theta)\right\rangle = \cos\left(\chi+\frac{\pi}{4}\right)\left|\psi_{P}^{l}(r,\theta)\right\rangle + \sin\left(\chi+\frac{\pi}{4}\right)\exp(i2\gamma)\left|\psi_{Q}^{m}(r,\theta)\right\rangle \tag{2}$$

where

$$\left|\psi_{P}^{l}(r, \theta)\right\rangle = \psi^{l}(r)\exp(il\theta)\hat{P} \tag{3}$$

$$\left|\psi_{Q}^{m}(r, \theta)\right\rangle = \psi^{m}(r)\exp(im\theta)\hat{Q}. \tag{4}$$

Here, $2\chi$ and $2\gamma$ determine the weighting factor and the phase difference between the two superposing beams, respectively. The variables $l$ and $m$ represent the topological charges of the vortex beams used to construct SSBs, and $\hat {P}$, $\hat {Q}$ represent orthogonal polarization basis states. The state ${\left |{\psi _{l,m}(r,\theta )}\right \rangle }$ gives the resultant amplitude distribution due to the superposition of the two beams. Bright Stokes singularities are formed when one of the superposing beams is a Gaussian beam. The superposition of phase singular beams in orthogonal polarization states results in dark Stokes singularities. These singularities can be identified using a Stokes index $\sigma _{jk}$, which is defined as $\sigma _{jk}= \frac {1}{2\pi } \oint \nabla \phi _{jk} \cdot dl$, where $\phi _{jk}$ represents the Stokes phase and $dl$ is the line element along a closed path of integration around the singular point. The Stokes index can also be written as $\sigma _{jk}=m-l$, where $m$ and $l$ are the topological charges of the component phase singular beams.
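
The relation $\sigma _{jk}=m-l$ can be checked numerically by evaluating the contour integral on a circle around the beam axis. The sketch below is illustrative only: it assumes the (H, V) basis, for which Eq. (1) gives $S_{23}=S_2+iS_3=2E_x^*E_y$ directly, and it sets both radial profiles to 1 on the chosen circle. The function name and sampling density are hypothetical choices.

```python
import numpy as np

def stokes_index_23(l, m, chi=0.0, gamma=0.0, n_samples=720):
    """Winding number of the Stokes phase phi_23 on a circle around the axis."""
    theta = np.linspace(0.0, 2.0 * np.pi, n_samples, endpoint=False)
    # Eqs. (2)-(4) on a circle of fixed radius; radial profiles assumed equal to 1
    Ex = np.cos(chi + np.pi / 4) * np.exp(1j * l * theta)
    Ey = np.sin(chi + np.pi / 4) * np.exp(1j * (m * theta + 2.0 * gamma))
    S23 = 2.0 * np.conj(Ex) * Ey          # S_2 + i S_3 from Eq. (1)
    phase = np.angle(S23)
    # Phase increments around the closed loop, wrapped into [-pi, pi)
    dphi = np.diff(np.append(phase, phase[0]))
    dphi = (dphi + np.pi) % (2.0 * np.pi) - np.pi
    return int(np.rint(dphi.sum() / (2.0 * np.pi)))

# Mode indices (0, 1), (1, 2) and (2, 3) share the same Stokes index
indices = {(l, m): stokes_index_23(l, m) for (l, m) in [(0, 1), (1, 2), (2, 3), (0, 2), (1, 0)]}
```

Beams with different mode indices but equal $m-l$ return the same value, which is the Stokes index degeneracy discussed in the text.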

Stokes singular beams in which the singularities are present at the center of the beam are represented as points on hybrid-order Poincaré spheres (HyOPS) or higher-order Poincaré spheres (HOPS). These spheres are geometrical constructions where one or both of the poles represent phase singular beams in orthogonal polarization states. Stokes singular beams with the same Stokes index can have different intensity, polarization, and Stokes phase distributions, which results in degenerate Stokes index states [26]. Stokes singular beams generated using the superposition of phase singularities with topological charges $l$ and $m$ such that $m-l=\text{constant}$, and $l\neq 0, m\neq 0$, are polarization degenerate. All the beams represented on the surface of a particular HyOPS/HOPS are Stokes index degenerate [26]. Also, the SSBs on a particular longitude of a hybrid-order Poincaré sphere have similar polarization distributions but different polarization gradients [26,32].

The degenerate polarization singular beams are composed of either different mode indices ($l,m$) or the same mode indices with different relative weights of $l$ and $m$, and therefore have different optical properties. For example, a partially coherent beam is shown to produce different depolarization effects despite having the same Stokes index and polarization distribution [33]. Further, polarization degenerate beams cannot be differentiated using Stokes polarimetry because they have the same polarization distribution. Hence, the identification of SSBs based on their mode indices is important. Figure 1 shows one example where the Stokes singularities of a given index are composed of different mode indices but are polarization degenerate. The component phases and the corresponding polarization states (marked on the phase maps) of which the SSBs are composed are shown on the left side of the figure. The corresponding resultant intensity distributions, degenerate polarization distribution, and the respective Stokes parameter distributions ($S_1, S_2, S_3$) are shown in the middle of the figure. Because the polarization distributions are the same, both beams have the same Stokes parameter distributions. It can be noticed that the spatial size of the beam increases with the topological index. However, the spatial size of the beam cannot be used to determine the topological index of the singular beam. On the right side of the figure, the diffraction intensity distributions produced by the two distinct SSBs are shown; these distinguish the degenerate polarization states.

Fig. 1. An example of polarization degeneracy in Stokes singular beams with Stokes index $\sigma _{23}=1$. Left: Phase distribution of the two superposing vortex beams characterized by topological charges $l$ and $m$ respectively. Orthogonal polarization basis states are marked in the inset. Centre: Corresponding total intensity, degenerate polarization distribution and Stokes parameter distributions. Right: Respective diffraction patterns produced by the two beams after diffraction through a triangular aperture.

2.2 Detection of Stokes singularities

The diffraction of Stokes singular beams through an equilateral triangular aperture lifts the degeneracy associated with mode indices (refer to Fig. 1). However, SSBs generated using the same combination of mode indices, irrespective of their orthogonal polarization states, result in diffraction degenerate states. Thus, to completely lift the degeneracy associated with SSBs, a polarization transformation after diffraction is necessary.

The transmittance function of an equilateral triangular aperture with side length $a$ can be mathematically written as [26]:

$$T(x,y)=\begin{cases} 1, & -\frac{a}{2}\leq x\leq \frac{a}{2} \ \text{and} \ 0 \leq y \leq -\sqrt{3}\left(|x|-\frac{a}{2}\right),\\ 0, & \text{otherwise.} \end{cases} \tag{5}$$

The field $\left |\psi _{l,m} (r, \theta )\right \rangle$ in cartesian coordinate system is written as $\left |\psi _{l,m} (x, y)\right \rangle$ where $x=r\cos \theta$ and $y=r\sin \theta$. The field just after the aperture with aperture function $T(x,y)$ is given by:

$$\left|E_{T}\right\rangle=T(x,y)\left|\psi_{l,m} (x, y)\right\rangle \tag{6}$$

The far-field diffraction pattern is produced at the back focal plane of a lens with focal length $f$ and is given by:

$$\left|\mathcal{F}(u,v)\right\rangle=\frac{B}{i\lambda f} \int_{-\infty}^{\infty} \int_{-\infty}^{\infty} T(x,y)\left|\psi_{l,m}(x,y)\right\rangle \exp\left\{-i\frac{2\pi}{\lambda f}(ux+vy)\right\}\,dx\,dy \tag{7}$$

where $(u,v)$ denotes the coordinates of the Fourier plane, $B$ is a constant, and $\lambda$ is the wavelength of light. The diffraction of the individual superposing beams is calculated independently, and the results are added together to obtain the total diffraction pattern of a Stokes singular beam. Notably, the diffraction of these beams can also be carried out using other apertures, which will result in different diffraction intensity patterns depending on their shape and symmetry. However, the aperture should be chosen such that it can differentiate between positive and negative phase singularities with the same magnitude of topological charge.
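
A minimal numerical sketch of Eqs. (5)–(7) for a single scalar vortex component is given below. It is not the simulation code used in this work: the radial profile, beam waist `w`, grid size, and the placement of the beam axis at the triangle centroid are all assumptions made for illustration, and a 2D FFT stands in for the Fourier integral up to constant factors.

```python
import numpy as np

def triangular_aperture(x, y, a):
    """Transmittance of Eq. (5): 1 inside the equilateral triangle, 0 elsewhere."""
    inside = (np.abs(x) <= a / 2) & (y >= 0) & (y <= -np.sqrt(3) * (np.abs(x) - a / 2))
    return inside.astype(float)

def far_field_intensity(l, a=1.0, w=0.4, n=256, extent=4.0):
    """Normalized far-field intensity of a charge-l vortex behind the aperture."""
    axis = np.linspace(-extent / 2, extent / 2, n, endpoint=False)
    x, y = np.meshgrid(axis, axis)
    yc = a / (2 * np.sqrt(3))             # triangle centroid, assumed beam axis
    r = np.hypot(x, y - yc)
    theta = np.arctan2(y - yc, x)
    # Assumed Laguerre-Gaussian-like profile: (r/w)^|l| exp(-r^2/w^2) exp(i l theta)
    field = (r / w) ** abs(l) * np.exp(-(r / w) ** 2) * np.exp(1j * l * theta)
    aperture = triangular_aperture(x, y, a)
    # Discrete stand-in for the Fourier integral of Eq. (7), constants dropped
    spectrum = np.fft.fftshift(np.fft.fft2(np.fft.ifftshift(aperture * field)))
    intensity = np.abs(spectrum) ** 2
    return intensity / intensity.max()

I_plus = far_field_intensity(+1)
I_minus = far_field_intensity(-1)
```

Under these assumptions, the pattern for charge $-l$ is the point-inverted copy of the pattern for charge $+l$, and because the triangular aperture is not inversion symmetric the two patterns differ, which is what allows the sign of the topological charge to be read from the intensity alone.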

Diffraction is a wave phenomenon that can be used to identify the OAM of light. Since SAM and OAM are coupled in SSBs, diffraction and Stokes polarimetry techniques alone cannot distinguish between distinct SSBs. Therefore, here we report the detection of these beams by combining a diffraction-based Stokes polarimetry method with deep learning techniques. The ability of deep learning-based models to recognize intricate features in intensity images makes them an excellent tool for classifying Stokes singular beams. The total diffraction intensity distribution and all six polarization projections (R, L, H, V, D, and A) are utilized as a set to serve as input channels for training the deep neural network (DNN). Of the infinitely many possible classes of SSBs, we selected a total of 15 classes. These classes include $\phi _{jk}^{01}$, $\phi _{jk}^{02}$, $\phi _{jk}^{10}$, $\phi _{jk}^{12}$, and $\phi _{jk}^{23}$, where $j,k$ run cyclically over 1 to 3 and $j\neq k$. Here, in $\phi _{jk}^{lm}$, the indices $j$ and $k$ determine the orthogonal polarization states, and $l$ and $m$ represent the topological charges of the component vortex states of the Stokes singular beam in which the Stokes singularity is present at the center. In each of the superpositions, other types of Stokes vortices are also present at different locations [9]. The corresponding mode indices for these classes are (0,1), (0,2), (1,0), (1,2), and (2,3). For each pair of mode indices, three pairs of polarization bases (R, L; H, V; D, A) are considered, giving 15 classes.
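
The 15 classes can be enumerated programmatically. The sketch below uses hypothetical names; the ordering (Stokes phase first, then mode indices) follows the class numbering listed in the caption of Fig. 5, and the basis assigned to each Stokes phase follows Section 2.1.

```python
# Stokes phase label -> orthogonal polarization basis (Section 2.1)
STOKES_PHASES = {"12": ("R", "L"), "23": ("H", "V"), "31": ("D", "A")}
# Mode indices (l, m) of the component vortex beams
MODE_INDICES = [(0, 1), (0, 2), (1, 0), (1, 2), (2, 3)]

def build_classes():
    """Return one record per SSB class, numbered in Fig. 5 order."""
    classes = []
    for jk, basis in STOKES_PHASES.items():
        for l, m in MODE_INDICES:
            classes.append({
                "id": len(classes),
                "label": f"phi_{jk}^{l}{m}",
                "basis": basis,
                "l": l,
                "m": m,
                "stokes_index": m - l,   # sigma_jk = m - l
            })
    return classes

classes = build_classes()
```

Grouping the records by `stokes_index` shows at a glance which classes are Stokes index degenerate, for example (0,1), (1,2), and (2,3).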

3. Experimental setup

The experimental setup used to generate and detect SSBs is depicted in Fig. 2. The light from the He-Ne laser (632 nm) is spatially filtered and collimated using lens L1. A $45^\circ$ polarized beam is generated using a combination of a polarizer (P) and a half wave plate (HWP). The beam is launched into a modified Mach-Zehnder type interferometer, where two orthogonally linearly polarized beams in the two arms of the interferometer acquire different topological charges using spiral phase plates (SPPs). To generate a bright Stokes singularity, the SPP was removed from one of the arms of the interferometer. The two beams combine after the beam splitter (BS). The quarter wave plate (QWP) after the BS is used to change the linear basis superposition to a circular basis superposition, and the HWP after the BS is used to change the (H,V) linear basis superposition to a (D,A) linear basis superposition. Further, the beam is allowed to diffract through an equilateral triangular aperture, and the far-field diffraction intensity pattern is recorded using the Stokes camera. The combination of a polarizer and a QWP after the triangular aperture is used to extract the component intensities of the beam, which include the horizontal (H), vertical (V), diagonal (D), anti-diagonal (A), right circular (R), and left circular (L) polarization intensities. The total and polarization component intensities are recorded at three different z-planes for a particular Stokes singular beam. The charges of the spiral phase plates in the two arms of the interferometer are varied according to the Stokes index of the beam to be generated.

Fig. 2. Experimental setup for generation and detection of Stokes singular beams. SF: Spatial filter assembly, L1, L2: Lenses, Iris: amplitude aperture to control the size of the beam, P: Polarizer, H(Q)WP: half(quarter) wave plate, (P)BS: (polarizing) beam splitter, M1, M2: Mirrors, SPP: spiral phase plate, Ap: Triangular aperture, CP: Circular polarizer, SC: Stokes camera, PC: Personal computer.

4. Results and discussion

4.1 Total diffraction pattern and corresponding polarization projections of Stokes singular beams

Simulation and experimental results, which include the total (T) and component diffracted intensities (R, L, H, V, D, A) for different SSBs, are depicted in Fig. 3. These component intensities are projections of the resultant beam onto various polarization states. The topological charges of the superposing phase singular beams that form the Stokes singularity are given on the left side of the figure in each case. Each set contains three rows for superposition in three different polarization bases, namely the circular (R, L), linear (H, V), and diagonal (D, A) bases, from top to bottom. All the polarization component intensities (R, L, H, V, D, and A) are extracted for each basis of superposition (R,L; H,V; and D,A). Stokes singular beams composed of different mode indices produce different diffraction patterns. However, SSBs composed of the same mode indices result in the same total diffraction pattern irrespective of the polarization associated with the component beams, and hence are diffraction degenerate (refer to Fig. 3). To lift the diffraction degeneracy, polarization transformation is utilized.
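
For reference, the seven intensities (T, R, L, H, V, D, A) can be obtained from the complex field components by projecting onto each polarization state. The sketch below is illustrative and uses a hypothetical function name; the assignment of R and L to the two circular projections is a convention assumed here (R is taken as the component for which $S_3 > 0$ in Eq. (1)) and may be swapped in other conventions.

```python
import numpy as np

def polarization_projections(Ex, Ey):
    """Total and component intensities of a field with components Ex, Ey."""
    Ex = np.asarray(Ex, dtype=complex)
    Ey = np.asarray(Ey, dtype=complex)
    I_H = np.abs(Ex) ** 2
    I_V = np.abs(Ey) ** 2
    I_D = np.abs(Ex + Ey) ** 2 / 2
    I_A = np.abs(Ex - Ey) ** 2 / 2
    I_R = np.abs(Ex - 1j * Ey) ** 2 / 2   # assumed handedness convention
    I_L = np.abs(Ex + 1j * Ey) ** 2 / 2
    return {"T": I_H + I_V, "R": I_R, "L": I_L,
            "H": I_H, "V": I_V, "D": I_D, "A": I_A}

# Example on a small random field
rng = np.random.default_rng(0)
Ex = rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4))
Ey = rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4))
proj = polarization_projections(Ex, Ey)
```

Each orthogonal pair sums to the total intensity, and the pairwise differences H−V, D−A, and R−L reproduce $S_1$, $S_2$, and $S_3$ of Eq. (1) under the stated convention.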

Fig. 3. Simulation and experimental results of total diffraction intensity patterns (T) and corresponding polarization projections (R, L, H, V, D, A) for various Stokes singular beams. For each pair of mode indices $(l=l_1,m=l_2)$, superposition in three different polarization bases (R,L; H,V; D,A) is considered.

In the next section, we describe the deep learning procedure that is used for the recognition of SSBs. It includes dataset preparation, selection and optimization of deep neural networks followed by the classification results.

4.2 Deep learning models and strategy

4.2.1 Simulation and experimental datasets

The datasets used in the present study consist of both simulated and experimentally acquired images. Possible experimental fluctuations in amplitude, phase, polarization, beam shift, and aperture shape are taken into account while creating simulation data for the identification of SSBs. These fluctuations may affect the accuracy of the detection techniques. Therefore, it is imperative to include these parameters while simulating the datasets for different classes. Moreover, this also helps in the robust training of deep learning networks. In addition, since deep learning models require a large amount of data for training [34], robust simulation aids in the quick convergence of the network. Heuristically, we simulate 1000 images for each class, resulting in 15000 total images. The experimental data contained a total of 300 images i.e., 20 images for each class. The experimental datasets were acquired under the specifications mentioned in section 3.

4.2.2 Comparison models

We considered five different convolutional neural network (CNN) based deep learning models, namely, SqueezeNet [35], VGG [36], AlexNet [37], DenseNet [38], and ResNet-18 [39], to apply to our datasets. All of these models are pre-trained on ImageNet data. More information about these models is provided in Supplement 1 section 1. The cross-entropy loss is employed, and a stochastic gradient descent optimizer is used for back-propagation. Each $707 \times 101$ image, as shown in Fig. 3, is transformed into a square image of dimension 303$\times$303 by stacking it together with two voids of dimension 101$\times$101, and this square image serves as the input to the neural network. Examples of the resultant input images are depicted in Fig. 4. This particular dimension is chosen in order to preserve the information by symmetry, since the deep learning models resize the input images to 224$\times$224.
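
The rearrangement of a $707 \times 101$ strip into a $303 \times 303$ input can be sketched as follows. The strip holds seven $101 \times 101$ panels, so two blank panels complete a $3 \times 3$ grid. The position of the two voids within the grid is not specified in the text; placing them at the end of the last row is an assumption of this sketch, as is the function name.

```python
import numpy as np

def strip_to_square(strip, tile=101, grid=3):
    """Rearrange a (tile, k*tile) strip into a (grid*tile, grid*tile) image."""
    height, width = strip.shape[:2]
    if height != tile or width % tile != 0:
        raise ValueError("strip must be one tile high and a whole number of tiles wide")
    n_tiles = width // tile
    if n_tiles > grid * grid:
        raise ValueError("too many tiles for the requested grid")
    tiles = [strip[:, i * tile:(i + 1) * tile] for i in range(n_tiles)]
    blank = np.zeros_like(tiles[0])
    tiles += [blank] * (grid * grid - n_tiles)   # voids fill the remaining cells
    rows = [np.hstack(tiles[r * grid:(r + 1) * grid]) for r in range(grid)]
    return np.vstack(rows)

# Seven panels (T, R, L, H, V, D, A) of 101 x 101 pixels each
strip = np.random.default_rng(0).random((101, 707))
square = strip_to_square(strip)
```

A square input keeps the aspect ratio of each panel unchanged when the network resizes the image to 224$\times$224, whereas resizing the original strip directly would compress it strongly along one axis.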

Fig. 4. (a) Comparison of different training strategies: We compare the data distribution of the 3 different training strategies. The scatter plot shown here is for illustration of data distribution. (b) Data partition for training, validation and testing of the neural network under mix training strategy

4.2.3 Training strategy

We employed three different strategies to train and test our models in order to achieve the best classification accuracy, as depicted in Fig. 4. The scatter plot in Fig. 4 illustrates the data distribution in the feature space. The three training and testing strategies are as follows:

(1) Training with only simulation data: We trained and validated all five deep learning models using simulated data and tested them on the experimental data. However, this strategy provides very poor classification performance, i.e., 43.6%. Note that this is the best performance achieved among the five deep learning models and is obtained with AlexNet. This is attributed to the fact that the experimental data have more variability between the images in each class, as illustrated in the scatter plot in Fig. 4. In addition, a slight mismatch between the simulated and experimentally acquired datasets also affects the classification accuracy. This strategy is further discussed in Supplement 1 section 2.
(2) 10-fold strategy on experimental data: We employed a 10-fold strategy [40] to train a ResNet-18 model on experimental data, as shown in Fig. 4. The ResNet-18 model was first pre-trained on the simulated data and then fine-tuned using experimental data with a 10-fold strategy (strategy 2.1 in Fig. 4). This strategy works well and provides an accuracy of 93%. In addition, the ResNet-18 model (pre-trained on ImageNet) was fine-tuned on experimental data alone with a 10-fold strategy (strategy 2.2 in Fig. 4), which provides an accuracy of 92%. Strategy 2 therefore does not exploit the benefit of simulated data, as strategy 2.1 (shown in Fig. 4) improves the final performance by only 1%. We discuss strategy 2 in more detail in Supplement 1 section 3.
(3) Mix training: Finally, we train the deep learning model on mixed simulated and experimental data [31] for generalized and robust training. Mix training provides the best accuracy of 98.67% using the ResNet-18 model. The mix strategy is discussed in detail in the next section.
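
The 10-fold strategy in item (2) can be sketched with the standard library as follows. This is a generic, non-stratified k-fold split and not necessarily the exact procedure of [40]; the function name, the shuffling seed, and the use of all 300 experimental images are assumptions made for illustration.

```python
import random

def k_fold_indices(n_samples, k=10, seed=0):
    """Yield (train_indices, test_indices) for each of the k folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]   # k disjoint, near-equal folds
    for i in range(k):
        test_idx = folds[i]
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train_idx, test_idx

# 300 experimental images -> 10 folds of 30 test images each
splits = list(k_fold_indices(300, k=10))
fold_sizes = [(len(train), len(test)) for train, test in splits]
```

Each image is used for testing exactly once, and the reported accuracy is then typically the average over the ten folds.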

4.2.4 Mix training

For robust training of the neural networks, we performed mixed training with simulated and experimental datasets. The dataset is divided into three parts: the training, validation, and testing sets. The testing set is further divided into two subparts: simulated testing and experimental testing. A total of five different deep learning models are trained separately and optimized for a fixed batch size of 8. The models are trained on the training set, the best hyperparameters are chosen on the validation set, and the performance of each network on the testing set is shown in Table 1. The division of the dataset is shown in Fig. 4(b). A total of 35% (105 images), 15% (45 images), and 50% (150 images) of the experimental images are used for training, validation, and testing, respectively. From the simulated data, 89.7% (13455 images), 0.3% (45 images), and 10% (1500 images) of the images are used for training, validation, and testing, respectively. We chose only 0.3% of the simulated images for validation to ensure that the validation set contains an equal number of experimental and simulated images. Moreover, the ratio of simulated to experimental images in the training dataset is optimized, since strategy 1 confirms that training a model with only simulated images offers poor accuracy on experimentally acquired images. To balance the training dataset, all experimental images of the training set (105 images) are copied 18 times, resulting in 1890 experimental images. This gives an approximately 7:1 ratio of simulated to experimental images in the training set, so that a batch of 8 images contains about one experimental image during training. All the models are trained for 50 epochs. The learning rate is optimized for all models on the validation set and found to be $0.001$. The accuracy is calculated as the percentage of instances correctly classified. The accuracy of each model on all parts of the dataset is given in Table 1.
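
The bookkeeping behind this split can be verified with a few lines of arithmetic. The counts below are taken from the text; the variable names are hypothetical, and the expected number of experimental images per batch assumes uniformly random shuffling of the training set, which is an assumption of this sketch.

```python
# Image counts stated in the text
SIM = {"train": 13455, "val": 45, "test": 1500}   # 15000 simulated images
EXP = {"train": 105, "val": 45, "test": 150}      # 300 experimental images
COPIES = 18        # each experimental training image is replicated 18 times
BATCH_SIZE = 8

exp_train_oversampled = EXP["train"] * COPIES
sim_to_exp_ratio = SIM["train"] / exp_train_oversampled

# Expected experimental images per batch under uniform random shuffling
train_total = SIM["train"] + exp_train_oversampled
exp_per_batch = BATCH_SIZE * exp_train_oversampled / train_total

print(f"oversampled experimental training images: {exp_train_oversampled}")
print(f"simulated : experimental = {sim_to_exp_ratio:.2f} : 1")
print(f"expected experimental images per batch: {exp_per_batch:.2f}")
```

The ratio comes out slightly above 7, so the expected number of experimental images per batch is slightly below one; guaranteeing exactly one per batch would require a dedicated batch sampler rather than plain shuffling.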

Table 1. Model comparison based on the accuracy is given here. The accuracy is rounded to 2 digits. The best model and performance is highlighted in bold and the second best in italics. Sim Testing and Exp Testing represent simulation testing and experimental testing datasets respectively.

4.3 Labelling hybridly polarized beams and assessing the winning models

Of all the models used in the present study, ResNet-18 and DenseNet offer the best training accuracies of 97.81% and 98.36%, respectively (see Table 1). DenseNet performs best on the validation set, followed by ResNet-18: DenseNet offers a validation accuracy of 100%, while ResNet-18 provides a validation accuracy of 97.78%. On the simulation testing dataset, ResNet-18 and DenseNet give classification accuracies of 98.33% and 97.67%, respectively. The other architectures, i.e., SqueezeNet, VGG, and AlexNet, offer accuracies of 85.80%, 95.73%, and 95.00%, respectively. Interestingly, ResNet-18 outperforms all the other models on the experimental testing dataset. The classification accuracy on the experimental images using ResNet-18 is found to be 98.67%.

To further assess the best-performing models, i.e., ResNet-18 and DenseNet, we show their loss curves in Fig. 5. Both the training and validation losses are calculated and shown after each epoch. It is evident from the training loss curves that both models converge within 50 epochs. Further, the confusion matrices of ResNet-18 and DenseNet shown in Fig. 5 give a visual summary of the network predictions. Confusion matrices are shown for both the simulation and experimental testing datasets. Diagonal elements of the confusion matrix show correct predictions, while off-diagonal elements are the instances wrongly classified by the network. For example, only two images are wrongly classified by ResNet-18 in the experimental testing dataset, i.e., one instance of the class $\phi _{12}^{01}$, which is represented as class 0, and one of the class $\phi _{12}^{12}$, which is shown as class 4 in Fig. 5. The wrongly classified images of $\phi _{12}^{01}$ and $\phi _{12}^{12}$ are classified as $\phi _{23}^{12}$ (represented as class 8) by ResNet-18. In contrast, DenseNet wrongly classifies 6 images and hence performs slightly worse than ResNet-18. Nonetheless, the experimental classification accuracies of ResNet-18 and DenseNet are 98.33% and 97.67%, respectively, demonstrating the benefits of using a deep learning approach to assign SSBs to their relevant classes. The efficacy of ResNet-18 can be attributed to its residual connections, which alleviate the performance degradation caused by the vanishing gradient problem in deep neural networks.
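
The link between a confusion matrix and the reported accuracy can be illustrated with a short sketch. It assumes that the 150 experimental test images are spread evenly over the 15 classes (10 per class), which the text does not state explicitly; the rows chosen for the two misclassified images are illustrative, since only the number of off-diagonal entries affects the accuracy.

```python
import numpy as np

def accuracy_percent(confusion):
    """Percentage of correctly classified instances: trace over total count."""
    confusion = np.asarray(confusion)
    return 100.0 * np.trace(confusion) / confusion.sum()

n_classes, per_class = 15, 10              # assumed even split of 150 test images
cm = per_class * np.eye(n_classes, dtype=int)

# Two images misclassified as class 8 (rows are illustrative)
for true_class in (0, 3):
    cm[true_class, true_class] -= 1
    cm[true_class, 8] += 1

resnet_exp_accuracy = accuracy_percent(cm)
print(f"{resnet_exp_accuracy:.2f}%")       # 148 of 150 correct
```

With two errors out of 150 images, the accuracy is 148/150, i.e., 98.67% after rounding to two decimal places.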

Fig. 5. Loss curve and confusion matrices of ResNet-18 and DenseNet. The loss curve is shown after each epoch on the training and validation dataset. The confusion matrix shows the classification accuracy of the network on experimental and simulated testing datasets. Class representation: 0: $\phi _{12}^{01}$, 1: $\phi _{12}^{02}$, 2: $\phi _{12}^{10}$, 3: $\phi _{12}^{12}$, 4: $\phi _{12}^{23}$, 5: $\phi _{23}^{01}$, 6: $\phi _{23}^{02}$, 7: $\phi _{23}^{10}$, 8: $\phi _{23}^{12}$, 9: $\phi _{23}^{23}$, 10: $\phi _{31}^{01}$, 11: $\phi _{31}^{02}$, 12: $\phi _{31}^{10}$, 13: $\phi _{31}^{12}$, 14: $\phi _{31}^{23}$.

A deep learning model requires a substantial amount of data for training. Since acquiring a large amount of experimental data is challenging and requires considerable time and effort, it is imperative to simulate images that exhibit properties similar to those of experimental images. However, it is almost impossible to include all experimental variations in the simulated images. Therefore, our proposed mix training strategy, which combines simulated images with a few experimental images, offers a way to overcome this issue and successfully leverage the performance of CNNs for the classification of Stokes singular beams. Moreover, the superior performance of ResNet-18 suggests that a simpler network with skip connections performs better in the case of Stokes singular beams. The proposed idea can be extended to detect Stokes singular beams of higher topological index. For higher-order Stokes singular beams, hyperparameter tuning of the deep learning models might be required to achieve the best classification accuracy. In addition, because the spatial size of the beam increases with the topological index, the window size of the beam considered for training the model needs to be adjusted accordingly.

Taken together, our work presents important steps towards understanding Stokes singular beams through a unique experimental approach and should help to democratize their applications in super-resolution imaging, optical communication, beam shaping, and potentially new label-free imaging techniques beyond what present label-free optical microscopy can achieve. Our ability to identify the specific class of a hybridly polarized beam can be further scaled up to real-time identification of vortex beams, which can act as a bridge between the beam shaping and microscopy communities to understand and choose the appropriate beams for their specific applications.

5. Conclusion

We demonstrated a generalized experimental and computational framework to study the taxonomy of hybridly polarized beams. The diffraction patterns of Stokes singular beams through an equilateral triangular aperture and their polarization projections are utilized for training a deep neural network. After appropriate training, deep neural networks offer excellent accuracy in classifying and detecting structured beams carrying Stokes singularities. The scheme discussed in this article can lift all the degeneracy associated with SSBs. Further, the detection scheme is intensity-based, and mixed simulation and experimental images are exploited to train the neural network, which reduces the need to acquire thousands of experimental images. We have achieved 97.67% and 98.33% accuracy on unseen simulated and experimental testing datasets, respectively. In addition to the characterization of Stokes singular beams, the present approach can also be applied to other areas such as optical microscopy and optical communication. It can be used to design different label-free techniques with improved resolution.

Funding

European Research Council (804233).

Disclosures

The authors declare no conflicts of interest.

Authors’ contribution: AB, PS, and KA conceived the idea and supervised the work. GA, RR, and PS designed the simulation and the experimental system and planned the experiments. AB, RA, and KA provided inputs to the simulation study. RA, AB, AH, and DKP optimized the computational part and classified the datasets. GA, AB, RR, and RA analysed the results and prepared the figures. GA drafted the initial manuscript and all authors revised the manuscript.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Supplemental document

See Supplement 1 for supporting content.

References

1. P. Coullet, L. Gil, and F. Rocca, “Optical vortices,” Opt. Commun. 73(5), 403–408 (1989).

2. S. Fu, Y. Zhai, J. Zhang, et al., “Universal orbital angular momentum spectrum analyzer for beams,” PhotoniX 1(1), 19 (2020).

3. A. Pryamikov, L. Hadzievski, M. Fedoruk, et al., “Optical vortices in waveguides with discrete and continuous rotational symmetry,” J. Eur. Opt. Soc.-Rapid Publ. 17(1), 23 (2021).

4. O. V. Angelsky, A. Y. Bekshaev, P. P. Maksimyak, et al., “Orbital rotation without orbital angular momentum: mechanical action of the spin part of the internal energy flow in light beams,” Opt. Express 20(4), 3563–3571 (2012).

5. I. Freund, “Polarization singularity indices in Gaussian laser beams,” Opt. Commun. 201(4-6), 251–270 (2002).

6. I. Freund, A. Mokhun, M. Soskin, et al., “Stokes singularity relations,” Opt. Lett. 27(7), 545–547 (2002).

7. Ruchi, P. Senthilkumaran, and S. K. Pal, “Phase singularities to polarization singularities,” Int. J. Opt. 2020, 1–33 (2020).

8. Y. Shen, “Topological bimeronic beams,” Opt. Lett. 46(15), 3737–3740 (2021).

9. G. Arora, Ruchi, and P. Senthilkumaran, “Full Poincaré beam with all the Stokes vortices,” Opt. Lett. 44(22), 5638–5641 (2019).

10. X.-L. Wang, J. Ding, W.-J. Ni, et al., “Generation of arbitrary vector beams with a spatial light modulator and a common path interferometric arrangement,” Opt. Lett. 32(24), 3549–3551 (2007).

11. Y. Gao, Z. Chen, J. Ding, et al., “Single ultra-high-definition spatial light modulator enabling highly efficient generation of fully structured vector beams,” Appl. Opt. 58(24), 6591–6596 (2019).

12. A. Ariyawansa, K. Liang, and T. G. Brown, “Polarization singularities in a stress-engineered optic,” J. Opt. Soc. Am. A 36(3), 312–319 (2019).

13. B. Radhakrishna, G. Kadiri, and G. Raghavan, “Realization of doubly inhomogeneous waveplates for structuring of light beams,” J. Opt. Soc. Am. B 38(6), 1909–1917 (2021).

14. Y. Zhao and J. Wang, “High-base vector beam encoding/decoding for visible-light communications,” Opt. Lett. 40(21), 4843–4846 (2015).

15. J. Wang, “Advances in communications using optical vortices,” Photonics Res. 4(5), B14–B28 (2016).

16. K. Singh, I. Nape, W. T. Buono, et al., “A robust basis for multi-bit optical communication with vectorial light,” Laser Photonics Rev. 17(6), 2200844 (2023).

17. W. Han, W. Cheng, and Q. Zhan, “Flattop focusing with full Poincaré beams under low numerical aperture illumination,” Opt. Lett. 36(9), 1605–1607 (2011).

18. Y. Kozawa, D. Matsunaga, and S. Sato, “Superresolution imaging via superoscillation focusing of a radially polarized beam,” Optica 5(2), 86–92 (2018).

19. C. Samlan, R. R. Suna, D. N. Naik, et al., “Spin-orbit beams for optical chirality measurement,” Appl. Phys. Lett. 112(3), 031101 (2018).

20. B. S. B. Ram, P. Senthilkumaran, and A. Sharma, “Polarization-based spatial filtering for directional and nondirectional edge enhancement using an S-waveplate,” Appl. Opt. 56(11), 3171–3178 (2017).

21. P. Lochab, P. Senthilkumaran, and K. Khare, “Robust laser beam engineering using polarization and angular momentum diversity,” Opt. Express 25(15), 17524–17529 (2017).

22. I. Freund, “Cones, spirals, and Möbius strips, in elliptically polarized light,” Opt. Commun. 249(1-3), 7–22 (2005).

23. D. H. Goldstein, Polarized light (CRC Press, 2011).

24. O. V. Angelsky, I. I. Mokhun, A. I. Mokhun, et al., “Interferometric methods in diagnostics of polarization singularities,” Phys. Rev. E 65(3), 036602 (2002).

25. B. S. B. Ram, A. Sharma, and P. Senthilkumaran, “Diffraction of V-point singularities through triangular apertures,” Opt. Express 25(9), 10270–10275 (2017).

26. G. Arora, S. Deepa, S. N. Khan, et al., “Detection of degenerate Stokes index states,” Sci. Rep. 10(1), 20759 (2020).

27. Y. Shen, X. Fu, and M. Gong, “Truncated triangular diffraction lattices and orbital-angular-momentum detection of vortex SU(2) geometric modes,” Opt. Express 26(20), 25545–25557 (2018).

28. L. E. E. de Araujo and M. E. Anderson, “Measuring vortex charge with a triangular aperture,” Opt. Lett. 36(6), 787–789 (2011).

29. H. Wang, X. Yang, Z. Liu, et al., “Deep-learning-based recognition of multi-singularity structured light,” Nanophotonics 11(4), 779–786 (2022).

30. P. Wang, J. Liu, L. Sheng, et al., “Convolutional neural network-assisted optical orbital angular momentum recognition and communication,” IEEE Access 7, 162025–162035 (2019).

31. T. Giordani, A. Suprano, E. Polino, et al., “Machine learning-based classification of vector vortex beams,” Phys. Rev. Lett. 124(16), 160401 (2020).

32. G. Arora, Ruchi, S. K. Pal, et al., “Full Poincaré beam delineation based on the Stokes vortex ring,” J. Opt. 23(10), 105201 (2021).

33. L. Guo, Y. Chen, X. Liu, et al., “Vortex phase-induced changes of the statistical properties of a partially coherent radially polarized beam,” Opt. Express 24(13), 13714–13728 (2016).

34. M. Z. Alom, T. M. Taha, C. Yakopcic, et al., “A state-of-the-art survey on deep learning theory and architectures,” Electronics 8(3), 292 (2019).

35. F. N. Iandola, S. Han, M. W. Moskewicz, et al., “SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size,” arXiv, arXiv:1602.07360 (2016).

36. K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” arXiv, arXiv:1409.1556 (2014).

37. A. Krizhevsky, I. Sutskever, and G. E. Hinton, “ImageNet classification with deep convolutional neural networks,” Commun. ACM 60(6), 84–90 (2017).

38. G. Huang, Z. Liu, L. Van Der Maaten, et al., “Densely connected convolutional networks,” in Proceedings of the IEEE conference on computer vision and pattern recognition, (2017), pp. 4700–4708.

39. K. He, X. Zhang, S. Ren, et al., “Deep residual learning for image recognition,” in Proceedings of the IEEE conference on computer vision and pattern recognition, (2016), pp. 770–778.

40. T.-T. Wong and P.-Y. Yeh, “Reliable accuracy estimates from k-fold cross validation,” IEEE Trans. Knowl. Data Eng. 32(8), 1586–1594 (2020).

Model	Training accuracy (%)	Validation accuracy (%)	Simulated testing accuracy (%)	Experimental testing accuracy (%)
SqueezeNet [35]	90.72	91.11	85.80	88.00
VGG [36]	95.85	97.78	95.73	93.33
AlexNet [37]	94.71	92.22	95.00	94.00
DenseNet [38]	98.36	100.00	98.33	96.00
ResNet-18 [39]	97.81	97.78	97.67	98.67

Optics Express