Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Feature extraction on Mueller matrix data for detecting nonporous electrospun fibers based on mutual information

Open Access Open Access

Abstract

The surface morphology of electrospun fibers largely determines their application scenarios. Conventional scanning electron microscopy is usually used to observe the microstructure of polymer electrospun fibers, which is time consuming and will cause damage to the samples. In this paper, we use backscattering Mueller polarimetry to classify the microstructural features of materials by statistical learning methods. Before feeding the Mueller matrix (MM) data into the classifier, we use a two-stage feature extraction method to find out representative polarization parameters. First, we filter out the irrelevant MM elements according to their characteristic powers measured by mutual information. Then we use Correlation Explanation (CorEx) method to group interdependent elements and extract parameters that represent their relationships in each group. The extracted parameters are evaluated by the random forest classifier in a wrapper forward feature selection way and the results show the effectiveness in classification performance, which also shows the possibility to detect nonporous electrospun fibers automatically in real time.

© 2020 Optical Society of America under the terms of the OSA Open Access Publishing Agreement

1. Introduction

Electrospun fibers have been used in diverse applications ranging from materials science and technology to life sciences and biomedicine [1,2]. The properties of the fibers mainly depend on the morphologies. for example, nonporous fibers are used as scaffolds or vascular repair materials to imitate extracellular matrix for their regular surfaces and fine diameters [3]. Various factors, e.g. parameters on electrospinning, solutions and surrounding environments, can affect the surface features of the fibers in the production process [4], which increases the difficulty of getting the morphological information just from electrospinning parameters in advance. Researchers usually rely on scanning electron microscopy (SEM) to monitor the morphology of samples. However, SEM is time consuming and may damage the fibers. Therefore, it is necessary to develop non-destructive, fast and effective techniques to characterize different morphologies of fibers.

Polarimetric imaging approach can provide much richer information than intensity images and has demonstrated its potential in material characterization [5,6], atmospheric remote sensing [7] and biomedicine [8,9]. It is non-invasive and sensitive to the morphological changes of materials. Previously, we used a transmission MM microscope to measure the electrospun fibers and distinguish different morphologies with depolarizaion parameter MMD-$\Delta$ [10]. In this paper, we use a backscattering Mueller polarimetric imaging system which has lower resolution and is more adaptive to thicker samples.

There have been consistent efforts to derive new polarization parameters from Mueller matrix (MM) for better description of the samples. MM polar decomposition (MMPD) [11] and MM transformation (MMT) [12,13] have been used to derive new sets of parameters which are explicitly related to the optical and structural properties of the samples. He et al. [14] analyzed the frequency distribution histograms of MM elements and used central moments of them to distinguish different microstructures of tissues. Since the complexity of many materials, it is difficult to distinguish variant structures with certain optical or polarimetric properties. Therefore, many efforts, e.g. [1519], have tried to utilize statistical learning methods to analyze MM data. Others [15,16] built a decision-theoretic framework consisting of several classifiers to identity precancerous uterine cervix tissues. Vaughn et al. [17] applied non-linear support vector machines to classify 47 different materials. Cameron et al. [18] used principal component analysis (PCA) and feature selection method to analyze the scattering coefficient contributions in turbid media. Hoover et al. [19] used PCA and nonlinear components analysis to distinguish among electromagnetic-wave scattering characteristics of materials.

As we know, irrelevant or redundant parameters will cause a negative impact on the classification performance. In this work, we used feature selection and extraction to explore representative polarization parameters before applying the classifier on MM data. In the feature extraction procedure, we pay attention to the correlative information among MM elements. We analyzed the discriminative power of each element considering the dependencies with the target microstructures and the redundancies and complementarities [20,21] among them. Then, we used Correlation Explanation (CorEx) method [22,23] to explore the interdependence among elements. There are three major contributions in this paper.

  • - A low-resolution polarimetric imaging method that distinguishes among backscattering characteristics of electrospun fibers was built, which potentially provides a real-time and dynamic way of monitoring nonporous surface.
  • - We utilized information-theoretic methods to quantitatively evaluate the importance of MM elements in distinguishing different microstructures of electrospun fibers.
  • - We analyzed the interdependent information among MM elements and extracted new parameters representing these relationships. The parameters were evaluated by a random forest classifier in a wrapper forward feature selection way.
The rest of this paper is organized as follows: Section 2 introduces the fiber materials and the backscattering polarization detecting system. Section 3 describes the feature selection and extraction methods based on analysis of MM elements. The experimental results are given in Section 4 and the conclusions are given in Section 5.

2. Materials and polarimetric imaging acquisition system

In the electrospinning procedure, different proportions of Poly-(L-lactic acid) (PLLA) and Poly($\varepsilon$-caprolactone) (PCL) were mixed and dissolved in methylene chloride. The experimental preparation of fibers is detailed in [10]. Figure 1 shows the SEM photographs of fibers with variant morphologies.

 figure: Fig. 1.

Fig. 1. SEM photographs of electrospun fibers with different surface morphologies. (a) Nonporous surface, PLLA/PCL (25:75), Spinning solution (8wt%), N,N-Dimethylformamide (DMF, 6g); (b) Nonporous surface, PLLA/PCL (100:0), Spinning solution (8wt%), DMF (6g); (c) Meshed surface PLLA/PCL (100:0), Spinning solution (8wt%), DMF (4.6g); (d) Porous microspheres (25:75), Spinning solution (4wt%), DMF (4.8g).

Download Full Size | PDF

We adopted a typical backscattering MM measurements with dual rotating retarder configuration [2426] as illustrated in Fig. 2. The polarimeter consists of one light-emitting diode (Source, 633nm, 3w), two lens (L1 and L2, Thorlabs, USA), two rotating quater-wave plates (R1 and R2, Daheng, Optic, China), two polarizers (P1 and P2, extinction ratio > 1000:1) and a CCD camera (Qimaging 32-0122A, 12bit, Canada). The exposure time used for the CCD camera is 200ms. To reduce the influence of surface reflection, we set an oblique angle ($\theta =30^\circ$) between the illumination path and the axis of detection. In each measurement, 30 images with specific incident and output polarization states are obtained by the rotation of two retarders with the angular speed of $\theta _{2}=5\theta _{1}$. The acquisition time of the data for each MM measurement is 90 seconds and the subsequent prediction time is within 1 second on one desk computer. The MM is calculated by using the Fourier coefficients $a_{n}$ and $b_{n}$ as shown in Eq. (1) [27].

$$I=a_{0}+\sum_{n=1}^{12}(a_{n}\cos{2n \theta_{1}}+b_{n}\sin{2n \theta_{1}})$$

 figure: Fig. 2.

Fig. 2. Experimental setup. (a) Intensity distribution from the CCD where the center region (red box) is cropped and analyzed. (b)Mueller matrix backscattering imaging system with two active polarimetric elements: a Polarization State Generator (PSG) and a Polarization State Analyzer (PSA). (c) Photograph of an electrospun fiber sample.

Download Full Size | PDF

The MM images of electrospun fibers with different morphologies are shown in Fig. 3. The non-diagonal matrices indicate that the materials are anisotropic. The variations of three diagonal elements ($M_{22},M_{33}$ and $M_{44}$) show the differences of depolarization abilities. The values of $M_{24},M_{34}, M_{42}$ and $M_{43}$ mean that the samples are birefringent.

 figure: Fig. 3.

Fig. 3. 2D images of MMs of electrospun fibers. (a) Nonporous surface; (b) Meshed surface; (c) Porous microspheres. The color bar is from −0.5 to 0.5 for diagonal elements and from −0.1 to 0.1 for other elements.

Download Full Size | PDF

The individual matrix elements were cropped to $300\times 300$ pixels (approximate $0.6cm\times 0.6cm$), with the center located at the sample region. In addition, all elements were normalized to the first element $M_{11}$ that was excluded in the next data analyzing procedure. To reduce the impact of noise, we filtered all remained 15 images by a $5\times 5$ slide window (sampling with a 5-pixel step) and computed the average of these 25 pixels. We handle data in the pixel level. For a filtered pixel in MM, 15 elements would be stacked as a $15\times 1$ vector.

3. Feature selection and extraction

Mueller matrix (MM) has qualitative information of the microstructures and analyzing the discriminative parameters is useful for revealing the polarization characteristics about samples. In this section, we resort to information measures to select and extract the most representative parameters.

For a given instance set comprising $N$ observations $\{m^{n}\}$, where $m^{n}\in \mathcal {M}$ and $n=1,\ldots ,N$, together with corresponding morphology labels (or target classes) $\{c^{n}\}$ where $c^n\in \mathcal {C}$. We denote $M_{ij}$ as the element (we use "element" $M_{ij}$ rather than standard notation "feature" $X$ according to the context of this work) in MM with $i^{th}$ row and $j^{th}$ colunm index, where $i,j\in \{1,2,3,4\}$ and denote $C$ as the label variable. $c_i$ is considered as the $i^{th}$ label where $i=1,\ldots ,|\mathcal {C}|$ (in this work, $|\mathcal {C}|=2$). The Shannon’s entropy [28] function $H(\cdot )$ and $H(\cdot |\cdot )$ are used to measure the uncertainty of variables with and without condition respectively. For the prior class variable $C$, the information gain, also known as mutual information (MI), from one element $M_{ij}$ is the reduction in the prior uncertainty due to the knowledge of this element as shown in Eq. (2).

$$I(M_{ij},C)=H(C)-H(C|M_{ij})]$$
This gives a way to quantify how much information an element or an element set can provide with respect to the class variable $C$. $I(M_{ij},C)$ is non-negative and is zero if $M_{ij}$ and $C$ are statistically independent. The maximum value happens when $C$ is completely determined by $M_{ij}$, i.e. $H(C|M_{ij})=0$.

MI has been widely used in the feature selection literature. We consider $S=\{M_{ij}\}$ as the selected subset with $k$ items ($|S|=k$). Since the hardness of the estimation for joint probability density $p(S)$ and $p(S,C)$, [29] selects the top $k$ features with high MI score individually, i.e., $\max {\sum _{M_{ij}\in {S}}{I(M_{ij},C)}}$, where $k$ is predefined. However, for highly interdependent elements, the discriminative power would not change much if one of them were removed. To get a nested subset, besides the MI score with respect to the class variable, [3032] consider the small similarity or the low dependency between elements, i.e., $\min {\sum _{M_{ij},M_{i'j'}\in {S}}I(M_{ij},M_{i'j'})}$. However, [20,33] pointed out that dependency does not simply imply redundancy and the interaction between elements may provide new information about $C$, measured as conditional dependency: $\sum _{M_{ij},M_{i'j'}\in {S}}I(M_{ij},M_{i'j'}|C)$. Brown et al. [20] summarized the selection criteria from the literature as a linear combination of these three components in Eq. (3).

$$J(M_{ij})=I(M_{ij},C)-\beta\sum_{M_{i'j'}\in{S}}{I(M_{ij},M_{i'j'})}+\gamma\sum_{M_{i'j'}\in{S}}{I(M_{ij},M_{i'j'}|C)}$$
$J(M_{ij})$ outputs the importance score of $M_{ij}$ and $\beta /\gamma$ is predefined based on certain assumptions. Vergara et al. [21] described the three terms in Eq. (3) as relevance, redundancy and complementarity (synergy). The difference of last two terms is multi-information when $\beta / \gamma = 1$ [34,35]. Some non-linear criterion [36,37] also follow a similar form. Let us consider a case that depolarization ability is one of the main differences among various morphologies of materials. The top 3 elements with MI score (relevance) might be $M_{22}$, $M_{33}$ and $M_{44}$. However, for isotropic material, $M_{22}$ and $M_{33}$ are quite similar and will provide unnecessary duplicated information (redundancy). Meanwhile, the depolarization property that comes from the interaction among these diagonal elements, which means that they are complementary and will provide characteristic information together.

Some feature extraction methods [38,39] based on information theory have been proposed. In this work, we mainly focus on the interdependence among elements. In the selection procedure, we filter out irrelevant elements only considering the first term in Eq. (3). To represent the correlative information among elements, we utilize an unsupervised information-theoretic method, Correlation Explanation (CorEx) [22,23], that groups the highly interdependent elements together and extracts the representative parameters from them, which will also preserve the interaction information in each group. Terminologically, we are trying to find representative variables $Y_{1},Y_{2},\ldots ,Y_{l} \in G$ that can explain the relationships in $S$.

Total correlation (TC) or multivariate mutual information [34,40] is a general measure of non-independence among elements from a variable set $S$ shown as Eq. (4). For $|S|=2$, $TC(S)$ is the expression of mutual information.

$$TC(S)=\sum_{M_{ij}\in{S}}{H(M_{ij})-H(S)}$$
The extent to which $G$ explains the relationships in $S$ can be measured by looking as how much the total correlation is reduced after conditioning on $G$ [22,23] shown as Eq. (5) where $TC(S|G)=\sum _{M_{ij}\in {S}}{H(M_{ij}|G)-H(S|G)}$. The maximum is achieved when $TC(S|G)=0$, in which case that variables in S are independent conditioned on $G$. This indicates that variables $Y_{j}$s in $G$ explain all the relationships in $S$.
$$TC(S;G)=TC(S)-TC(S|G)=\sum_{M_{ij}\in{S}}{I(M_{ij},G)-I(S,G)}$$
CorEx divides the set $S$ into several subgroups and simultaneously searches the representative $Y_{j}$ in each group by optimizing the information-theoretic objective function shown as Eq. (6) where $S_{j}$ is denoted as a subgroup and $l$ is the total group number. $Y_{j}$, the probabilistic function of $S_{j}$, is a discrete random variable and $r$ is the number of the possible values of it. The search strategy and implementations are discussed in [22]. Before feeding the data into CorEx, we need to determine the subgroup number $l$. Here we both consider the interdependence among elements and their polarization properties of MM elements.
$$\max_{S_{j}, p(y_{j}|S_{j})}{\sum_{j=1}^{l}{TC(S_{j};Y_{j})}}\quad s.t. \quad|Y_{j}|=r, S_{j}\cap S_{j'\neq j}=\emptyset$$
Consequently, the feature selection and extraction procedure consists of three steps:
  • Step1: Rank the mutual information between each element and the target class variable $C$. For a given threshold $T_{0}$ denoted as the amount of information, select the top $k$ ones where $k$ is the minimum value that $\frac {\sum _{k}{I(M_{ij},C)}}{\sum _{15}{I(M_{ij},C)}} \geq T_{0}$.
  • Step2: Calculate the interdependence among the selected elements. Given a threshold $T_{1}$, for $M_{ij}$ and $M_{i'j'}$, group them together when $\frac {I(M_{ij},M_{i'j'}|C)}{H(M_{ij},M_{i'j'}|C)} \geq T_{1}$. Then count the group number $l$.
  • Step3: Given the predefined group number $l$, use CorEx to search the representative parameters about correlative level. The parameter $r$ is determined through trail-and-error.

4. Experiment results

Backscattered Mueller imaging were performed on 19 electrospun fiber samples. After being filtered and stacked as described in Section 2, the dataset encompasses 68,400 averaged pixels as instances. We used 10-fold cross validation strategy and the test samples did not overlap with the samples used in the training set.

In this work, we used a feature ranking and selecting strategy as the first step and set $T_{0}=0.85$. Figure 4(a) shows the MIs between elements and the class label $C$, sorted by decreasing values. The curve indicates the percentage of cumulative MI. As we can see, the top seven elements include most of the characteristic information (nearly 86%) which represent the variants of different morphologies.

 figure: Fig. 4.

Fig. 4. Mutual information (MI) scores of individual elements. (a) MI between each element and the class variable $C$, i.e. $I(M_{ij},C)$. The curve represents the cumulative percentage of MI. (b) An example of multi-information between $M_{44}$ and other elements, i.e. $I(M_{44},M_{ij}|C)-I(M_{44},M_{ij})$. Here we set the interaction values with $M_{44}$ itself as zero since the value is the opposite of $I(M_{44},C)$ (non-positive).

Download Full Size | PDF

For given MMs, besides the individual characteristic power of each element, the redundant or synergistic information among them should also be considered. Figure 4(b) shows the multi-information ($I(M_{44},M_{ij}|C)-I(M_{44},M_{ij})$) of $M_{44}$ interacting with other elements. The positive values mean that the kind of interaction information between them are complementary rather than redundancy, especially with other two diagonal elements $M_{22}$ and $M_{33}$. For the top seven elements, the three diagonal elements are related to the depolarization ability while the others are related to birefringence properties ($q_{L}=\sqrt {M_{42}^{2}+M_{43}^2}$ and $r_{L}=\sqrt {M_{24}^2+M_{34}^2}$ are rotation invariant parameters that represent the capability of transforming between linear and circular polarization [41]). We divided the selected elements into two groups and then extracted the parameters which could represent the correlative information among them in each group.

We set $l=2$ in Eq. (6) and fed the seven elements into CorEx model. Figure 5(a) shows the MI results between the extracted parameters, $Y_{1}$ and $Y_{2}$, from each group and the class variable $C$. The results converge as the cardinality $|Y_{j}|=r$ increases. Two dashed or dotted lines paralleling the horizontal axis are the MIs of $M_{22}$ and $M_{34}$ which are the highest individual scores in each group respectively. The greater MI values of extracted parameters may indicate that the correlative information among elements are characteristic. After $r=3$, the separation results were stable where the selected elements were divided into two groups: diagonal set $S_{1}=\{M_{22}, M_{33}, M_{44}\}$ and non-diagonal set $S_{2}=\{M_{24}, M_{34}, M_{42}, M_{43}\}$ as illustrated in Fig. 5(b), which satisfied the multi-information results in Fig. 4(b) to some extend. It should be noted that we only predefined the group number, and CorEx divided the element set according to the conditional total correlation maximization criterion without any other prior knowledge.

 figure: Fig. 5.

Fig. 5. Grouping results from CorEx given the group number $l=2$. (a) The MI between extracted parameters from each group and class variable. The horizontal axis is the number of possible values $r$ in Eq. (6). (b) The schematic diagram of grouping results generating by CorEx. Elements in the bottom row represents the input. $Y_{1}$ and $Y_{2}$ in the higher layers are learned parameters which explain the relationships in each group.

Download Full Size | PDF

In order to demonstrate the effectiveness of our method, we conducted a serious of experiments on two parameter sets: One is the original parameter set; Another is the enhanced set including $\{Y_{1}, Y_{2}\}$. We used the random forest (RF) classifier as an evaluator to do the feature selection in a wrapper forward way. Figure 6 shows the accuracy results with the increasing feature number. The overall performance on the element set with extracted parameters are better than that without them. The classifier picked $Y_{1}$ as the first feature whose individual prediction was at a higher significance level than others in the data. Although the MI score of $Y_{2}$ is much higher, it was selected further back. The multi-information between $Y_{1}$ and $Y_{2}$ was negative, which means that they were redundant and adding $Y_{2}$ would diminish the dependence between $Y_{1}$ and $C$. If we set the first feature as $Y_{2}$ (the second best individual feature according to MI score), $Y_{1}$ would be added in the fourth step and would improve the performance slightly, which might be caused since the dependency between $Y_{1}$ and $C$ is much more important that the feature redundancy.

 figure: Fig. 6.

Fig. 6. The RF classification accuracy with the wrapper forward selection. Three lines denote the results on: (1) All 15 MM elements and two extracted parameters; (2) All 15 MM elements and two extracted parameters with setting the first picked feature as $Y_{2}$; (3) All 15 MM elements.

Download Full Size | PDF

After wrapper selection by RF, 8 parameters were selected from the enhanced set while 6 parameters remained from the original set. The maximum accuracies were 0.859 and 0.842 respectively. Table. 1 shows the overall performance on these two sets before and after wrapper section. The results show that the enhanced parameter set with $\{Y_{1}, Y_{2}\}$ lead to better performance than the original set. Furthermore, we visualized the receiver operating characteristic (ROC) curves for these experiments as illustrated in Fig. 7. The AUCs from the enhanced set are relatively higher.

Tables Icon

Table 1. Performance comparison with and without $\{Y_{1}, Y_{2}\}$

 figure: Fig. 7.

Fig. 7. ROC curves for classification results with RF. (a) Parameter sets before wrapper selection. The AUCs on original and enhanced sets are 0.894 and 0.865 respectively. (b) Parameter sets after wrapper selection. The AUCs on original and enhanced sets are 0.940 and 0.908 respectively.

Download Full Size | PDF

When the trees in RF were being trained, the splitting strategy on every node was based on how much each element or parameter contributes to decreasing the weighted impurity. By averaging the decrease in impurity over trees, we could get the feature importance as illustrated in Fig. 8. The extracted parameters $Y_{1}$ and $Y_{2}$ were in top 3.

 figure: Fig. 8.

Fig. 8. Feature importance of $Y_{1}$, $Y_{2}$ and MM elements.

Download Full Size | PDF

5. Conclusion

In the past decades, many efforts have tried to analyze the Mueller matrix (MM) and have extracted many parameters to describe different polarization properties based on certain assumptions. In this paper, we analyze the correlative information among elements in a statistical view. We used backscattering Mueller polarimetry to detect nonporous morphologies of electro-spun fibers. We selected the most representative elements based on the results of mutual information with target morphology variable individually. Then, we used CorEx method to divide the selected elements into two groups and extracted new parameters which represented the relationships in each group.

The selected elements in two groups are related to the depolarization power and birefringence properties respectively. In the future work, we plan to use Monte Carlo simulations based on the sphere-cylinder scatterers model to analyze the underlying meaning of these parameters. Meanwhile, the correctness of the estimation of entropy from the data is the key to guide the feature selection and extraction procedure. More estimation methods, e.g. network-based methods [42], will be compared to determine the most suitable one for MM data.

Funding

National Natural Science Foundation of China (11974206, 61527826); Science, Technology and Innovation Commission of Shenzhen Municipality (JCYJ20160818143050110, JCYJ20170412170814624).

Disclosures

The authors declare no conflicts of interest.

References

1. A. Haider, S. Haider, and I. K. Kang, “A comprehensive review summarizing the effect of electrospinning parameters and potential applications of nanofibers in biomedical and biotechnology,” Arab. J. Chem. 11(8), 1165–1188 (2018). [CrossRef]  

2. D. H. Reneker and A. L. Yarin, “Electrospinning jets and polymer nanofibers,” Polymer 49(10), 2387–2425 (2008). [CrossRef]  

3. C. Kriegel, A. Arrechi, K. Kit, D. McClements, and J. Weiss, “Fabrication, functionalization, and application of electrospun biopolymer nanofibers,” Crit. Rev. Food Sci. Nutr. 48(8), 775–797 (2008). [CrossRef]  

4. C. L. Casper, J. S. Stephens, N. G. Tassi, D. B. Chase, and J. F. Rabolt, “Controlling surface morphology of electrospun polystyrene fibers: effect of humidity and molecular weight in the electrospinning process,” Macromolecules 37(2), 573–578 (2004). [CrossRef]  

5. F. Snik, J. Craven-Jones, M. Escuti, S. Fineschi, D. Harrington, A. De Martino, D. Mawet, J. Riedi, and J. S. Tyo, “An overview of polarimetric sensing techniques and technology with applications to different research fields,” in Polarization: Measurement, Analysis, and Remote Sensing XI, vol. 9099 (International Society for Optics and Photonics, 2014), p. 90990B.

6. S. Liu, W. Du, X. Chen, H. Jiang, and C. Zhang, “Mueller matrix imaging ellipsometry for nanostructure metrology,” Opt. Express 23(13), 17316–17329 (2015). [CrossRef]  

7. P. Y. Deschamps, F. M. Bréon, M. Leroy, A. Podaire, A. Bricaud, J. C. Buriez, and G. Seze, “The polder mission: Instrument characteristics and scientific objectives,” IEEE Trans. Geosci. Remote Sensing 32(3), 598–615 (1994). [CrossRef]  

8. V. V. Tuchin, “Polarized light interaction with tissues,” J. Biomed. Opt. 21(7), 071114 (2016). [CrossRef]  

9. Y. Dong, J. Qi, H. He, C. He, S. Liu, J. Wu, D. S. Elson, and H. Ma, “Quantitatively characterizing the microstructural features of breast ductal carcinoma tissues in different progression stages by Mueller matrix microscope,” Biomed. Opt. Express 8(8), 3643–3655 (2017). [CrossRef]  

10. X. Li, Y. Zhu, H. Ma, and Y. Sheng, “A polarization method for quickly distinguishing the morphology of electro-spun ultrafine fibers,” Chin. Chem. Lett. 29(8), 1317–1320 (2018). [CrossRef]  

11. M. Swami, S. Manhas, P. Buddhiwant, N. Ghosh, A. Uppal, and P. Gupta, “Polar decomposition of 3× 3 Mueller matrix: a tool for quantitative tissue polarimetry,” Opt. Express 14(20), 9324–9337 (2006). [CrossRef]  

12. H. He, N. Zeng, E. Du, Y. Guo, D. Li, R. Liao, and H. Ma, “A possible quantitative Mueller matrix transformation technique for anisotropic scattering media,” Photonics Lasers Med. 2(2), 129–137 (2013). [CrossRef]  

13. C. He, H. He, J. Chang, Y. Dong, S. Liu, N. Zeng, Y. He, and H. Ma, “Characterizing microstructures of cancerous tissues using multispectral transformed Mueller matrix polarization parameters,” Biomed. Opt. Express 6(8), 2934–2945 (2015). [CrossRef]  

14. C. He, H. He, X. Li, J. Chang, Y. Wang, S. Liu, N. Zeng, Y. He, and H. Ma, “Quantitatively differentiating microstructures of tissues by frequency distributions of Mueller matrix images,” J. Biomed. Opt. 20(10), 105009 (2015). [CrossRef]  

15. C. Heinrich, J. Rehbinder, A. Nazac, B. Teig, A. Pierangelo, and J. Zallat, “Mueller polarimetric imaging of biological tissues: classification in a decision-theoretic framework,” J. Opt. Soc. Am. A 35(12), 2046–2057 (2018). [CrossRef]  

16. J. Rehbinder, H. Haddad, S. Deby, B. Teig, A. Nazac, T. Novikova, A. Pierangelo, and F. Moreau, “Ex vivo Mueller polarimetric imaging of the uterine cervix: a first statistical evaluation,” J. Biomed. Opt. 21(7), 071113 (2016). [CrossRef]  

17. I. J. Vaughn, B. G. Hoover, and J. S. Tyo, “Classification using active polarimetry,” in Polarization: Measurement, Analysis, and Remote Sensing X, vol. 8364 (International Society for Optics and Photonics, 2012), p. 83640S.

18. B. D. Cameron, Y. Li, and A. A. Nezhuvingal, “Determination of optical scattering properties in turbid media using Mueller matrix imaging,” J. Biomed. Opt. 11(5), 054031 (2006). [CrossRef]  

19. B. G. Hoover and J. S. Tyo, “Polarization components analysis for invariant discrimination,” Appl. Opt. 46(34), 8364–8373 (2007). [CrossRef]  

20. G. Brown, A. Pocock, M.-J. Zhao, and M. Luján, “Conditional likelihood maximisation: a unifying framework for information theoretic feature selection,” J. Mach. Learn. Res. 13, 27–66 (2012).

21. J. R. Vergara and P. A. Estévez, “A review of feature selection methods based on mutual information,” Neural Comput. Appl. 24(1), 175–186 (2014). [CrossRef]  

22. G. V. Steeg and A. Galstyan, “Discovering structure in high-dimensional data through correlation explanation,” in Advances in Neural Information Processing Systems, (2014), pp. 577–585.

23. G. V. Steeg and A. Galstyan, “Maximally informative hierarchical representations of high-dimensional data,” in Artificial Intelligence and Statistics, (2015), pp. 1004–1012.

24. D. H. Goldstein, “Mueller matrix dual-rotating retarder polarimeter,” Appl. Opt. 31(31), 6676–6683 (1992). [CrossRef]  

25. O. Arteaga, M. Baldrís, J. Antó, A. Canillas, E. Pascual, and E. Bertran, “Mueller matrix microscope with a dual continuous rotating compensator setup and digital demodulation,” Appl. Opt. 53(10), 2236–2245 (2014). [CrossRef]  

26. X. Chen, W. Du, K. Yuan, J. Chen, H. Jiang, C. Zhang, and S. Liu, “Development of a spectroscopic mueller matrix imaging ellipsometer for nanostructure metrology,” Rev. Sci. Instrum. 87(5), 053707 (2016). [CrossRef]  

27. R. Azzam, “Photopolarimetric measurement of the Mueller matrix by fourier analysis of a single detected signal,” Opt. Lett. 2(6), 148–150 (1978). [CrossRef]  

28. T. M. Cover and J. A. Thomas, Elements of information theory (John Wiley & Sons, 2012), pp. 13–16.

29. D. D. Lewis, “Feature selection and feature extraction for text categorization,” in Proceedings of the workshop on Speech and Natural Language, (Association for Computational Linguistics, 1992), pp. 212–217.

30. H. Peng, F. Long, and C. Ding, “Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy,” IEEE Trans. Pattern Anal. Machine Intell. 27(8), 1226–1238 (2005). [CrossRef]  

31. C. Ding and H. Peng, “Minimum redundancy feature selection from microarray gene expression data,” J. Bioinf. Comput. Biol. 03(02), 185–205 (2005). [CrossRef]  

32. Q. Zou, J. Zeng, L. Cao, and R. Ji, “A novel features ranking metric with application to scalable visual and bioinformatics data classification,” Neurocomputing 173, 346–354 (2016). [CrossRef]  

33. P. E. Meyer, C. Schretter, and G. Bontempi, “Information-theoretic feature selection in microarray data using variable complementarity,” IEEE J. Sel. Top. Signal Process. 2(3), 261–274 (2008). [CrossRef]  

34. E. Schneidman, S. Still, M. J. Berry, and W. Bialek, “Network information and connected correlations,” Phys. Rev. Lett. 91(23), 238701 (2003). [CrossRef]  

35. W. McGill, “Multivariate information transmission,” Trans. IRE Prof. Group Inf. Theory 4(4), 93–111 (1954). [CrossRef]  

36. F. Fleuret, “Fast binary feature selection with conditional mutual information,” J. Mach. learning research 5, 1531–1555 (2004).

37. M. Bennasar, Y. Hicks, and R. Setchi, “Feature selection using joint mutual information maximisation,” Expert. Syst. with Appl. 42(22), 8520–8532 (2015). [CrossRef]  

38. J. M. Leiva-Murillo and A. Artes-Rodriguez, “Maximization of mutual information for supervised linear feature extraction,” IEEE Trans. Neural Netw. 18(5), 1433–1441 (2007). [CrossRef]  

39. A. Jukić and M. Filipović, “Supervised feature extraction for tensor objects based on maximization of mutual information,” Pattern Recognit. Lett. 34(13), 1476–1484 (2013). [CrossRef]  

40. S. Watanabe, “Information theoretical analysis of multivariate correlation,” IBM J. Res. Dev. 4(1), 66–82 (1960). [CrossRef]  

41. P. Li, D. Lv, H. He, and H. Ma, “Separating azimuthal orientation dependence in polarization measurements of anisotropic media,” Opt. Express 26(4), 3791–3800 (2018). [CrossRef]  

42. C. Chung, A. Al-Bashabsheh, H. P. Huang, M. Lim, D. S. H. Tam, and C. Zhao, “Neural entropic estimation: A faster path to mutual information estimation,” arXiv preprint arXiv:1905.12957 (2019).

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (8)

Fig. 1.
Fig. 1. SEM photographs of electrospun fibers with different surface morphologies. (a) Nonporous surface, PLLA/PCL (25:75), Spinning solution (8wt%), N,N-Dimethylformamide (DMF, 6g); (b) Nonporous surface, PLLA/PCL (100:0), Spinning solution (8wt%), DMF (6g); (c) Meshed surface PLLA/PCL (100:0), Spinning solution (8wt%), DMF (4.6g); (d) Porous microspheres (25:75), Spinning solution (4wt%), DMF (4.8g).
Fig. 2.
Fig. 2. Experimental setup. (a) Intensity distribution from the CCD where the center region (red box) is cropped and analyzed. (b)Mueller matrix backscattering imaging system with two active polarimetric elements: a Polarization State Generator (PSG) and a Polarization State Analyzer (PSA). (c) Photograph of an electrospun fiber sample.
Fig. 3.
Fig. 3. 2D images of MMs of electrospun fibers. (a) Nonporous surface; (b) Meshed surface; (c) Porous microspheres. The color bar is from −0.5 to 0.5 for diagonal elements and from −0.1 to 0.1 for other elements.
Fig. 4.
Fig. 4. Mutual information (MI) scores of individual elements. (a) MI between each element and the class variable $C$, i.e. $I(M_{ij},C)$. The curve represents the cumulative percentage of MI. (b) An example of multi-information between $M_{44}$ and other elements, i.e. $I(M_{44},M_{ij}|C)-I(M_{44},M_{ij})$. Here we set the interaction values with $M_{44}$ itself as zero since the value is the opposite of $I(M_{44},C)$ (non-positive).
Fig. 5.
Fig. 5. Grouping results from CorEx given the group number $l=2$. (a) The MI between extracted parameters from each group and class variable. The horizontal axis is the number of possible values $r$ in Eq. (6). (b) The schematic diagram of grouping results generating by CorEx. Elements in the bottom row represents the input. $Y_{1}$ and $Y_{2}$ in the higher layers are learned parameters which explain the relationships in each group.
Fig. 6.
Fig. 6. The RF classification accuracy with the wrapper forward selection. Three lines denote the results on: (1) All 15 MM elements and two extracted parameters; (2) All 15 MM elements and two extracted parameters with setting the first picked feature as $Y_{2}$; (3) All 15 MM elements.
Fig. 7.
Fig. 7. ROC curves for classification results with RF. (a) Parameter sets before wrapper selection. The AUCs on original and enhanced sets are 0.894 and 0.865 respectively. (b) Parameter sets after wrapper selection. The AUCs on original and enhanced sets are 0.940 and 0.908 respectively.
Fig. 8.
Fig. 8. Feature importance of $Y_{1}$, $Y_{2}$ and MM elements.

Tables (1)

Tables Icon

Table 1. Performance comparison with and without { Y 1 , Y 2 }

Equations (6)

Equations on this page are rendered with MathJax. Learn more.

I = a 0 + n = 1 12 ( a n cos 2 n θ 1 + b n sin 2 n θ 1 )
I ( M i j , C ) = H ( C ) H ( C | M i j ) ]
J ( M i j ) = I ( M i j , C ) β M i j S I ( M i j , M i j ) + γ M i j S I ( M i j , M i j | C )
T C ( S ) = M i j S H ( M i j ) H ( S )
T C ( S ; G ) = T C ( S ) T C ( S | G ) = M i j S I ( M i j , G ) I ( S , G )
max S j , p ( y j | S j ) j = 1 l T C ( S j ; Y j ) s . t . | Y j | = r , S j S j j =
Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.