
Blurring kernel extraction and super-resolution image reconstruction based on style generative adversarial networks


Abstract

The point spread function (PSF) is the main index used to evaluate imaging resolution and further improve the quality of an optical image. Its measurement is significant for system development and pattern recognition. However, the precision of current measurement methods is low owing to the complicated modelling process, the pairing of various camera parameters, and disturbances by external factors. In this paper, we propose a method to extract blurring kernels and reconstruct super-resolution images based on style generative adversarial networks (StyleGANs). First, an improved StyleGAN model is introduced and an ideal blurry image generation model based on the StyleGAN is trained to obtain a series of ideal Gaussian light-source images with a regular Airy disk, in which the intensity distribution is closer to the theoretical distribution. Second, the blurring kernels at different depth positions are extracted from the generated Gaussian light-source images to replace the PSF. This allows the blurring property of the optical system to be evaluated and effectively avoids introducing noise through parameter identification or curve fitting in the PSF representation. Finally, the blurring kernels are used to deblur both the blurry images of the Gaussian light source with a single wavelength and the blurry images of microbeads under visible-light illumination at different depths, based on the learnable convolutional half-quadratic splitting and convolutional preconditioned Richardson (LCHQS-CPCR) model. Compared to other image deblurring methods, our proposed method achieves high-resolution image reconstruction with blurring kernels extracted from the generated optical images of the Gaussian light source.

© 2021 Optical Society of America under the terms of the OSA Open Access Publishing Agreement

1. Introduction

Optical microscopy is a powerful tool for observing and analysing the structure and properties of samples at the micro/nano scale. The application of optical microscopy has extended from biology and medicine to materials science, robotics, and other fields [1,2].

With the rapid development of related applications in recent decades, the demand for super-resolution, real-time, non-destructive observation of microscopic sample structures has increased; however, conventional optical microscopes have gradually failed to meet this requirement because of blur imaging resulting from the following factors: 1) Owing to optical diffraction of the source wave, a point on the surface of an object forms an Airy disk on the image plane rather than a focused point, such that the resolution of optical systems has a theoretical limit of approximately half the wavelength used for illumination [3–5]. 2) Owing to the limited depth-of-field (DOF) of a microscope, when the thickness of a sample is larger than the DOF, the parts outside the focal plane are defocused, and these defocused images are superimposed on the image plane, resulting in a blurry superimposed image [6–8].

To reduce the influence of these blur imaging factors and achieve a high-resolution image at the micro/nano scale, the mechanism of light intensity diffusion during microscope imaging must be analysed. In theory, the intensity distribution generated by a point source is called the point spread function (PSF) or blurring kernel, with which a high-resolution image can be reconstructed through deconvolution of the blurry image. Therefore, it is very important to estimate the blurring kernel parameters as accurately as possible for the purpose of reconstructing a high-resolution image. Although many researchers have proposed powerful methods for relieving the effects of out-of-focus light and diffused light in many microscope systems [9–12], precisely measuring the blurring kernels of a practical high-magnification microscope remains a problem because of the complicated imaging process and the varied, dynamic image features at the micro/nano scale.

Therefore, in this paper, we propose a method to extract blurring kernels and reconstruct super-resolution images based on StyleGANs. Our approach is novel in several ways. First, the improved StyleGAN model was trained using an optical image set of a Gaussian light source, and an ideal blurry image generation model based on StyleGANs was obtained. Second, a microscopic image feature analysis system, including an ideal blurry image generation module, a blurring kernel extraction module, and an image deconvolution module, was designed. Through automatic learning of the image features under changing parameters, a series of ideal blurry images that obey the law of blur imaging were generated and the blurring kernel of the optical imaging system was extracted. Finally, a super-resolution image-reconstruction method for images of the single-wavelength Gaussian light source and for large-scale visible-light images is proposed based on the learnable convolutional half-quadratic splitting and convolutional preconditioned Richardson (LCHQS-CPCR) neural network model. Experiments with dynamic samples were conducted, and the results showed that our proposed method captures the blurring property of an optical system and can reconstruct higher-resolution images at the micro/nano scale.

2. Related work

Currently, there are three main approaches to obtaining the PSF that describes the blurring features of an optical imaging system: mathematical, analytical, and experimental methods [13–17]. The mathematical method depends on a fixed formulation of the PSF developed from a physical model of the light propagation route [18,19]. However, for advanced microscopic optical systems, the complexity of the system means that the aberrations cannot be fully considered in the mathematical method, resulting in a bias in the calculated PSFs. The analytical method uses a blind deconvolution process to estimate the PSF parameters of the optical system through many iterations until some optimisation function is satisfied [20,21]. However, the precision of the estimated PSF strongly depends on the form of the optimisation function, and it is difficult to define a general optimisation function for all optical systems. Moreover, the iteration process is time consuming. In experimental methods, the PSF is generally measured using fluorescent microbeads embedded at different heights in optical cement, or fluorescent microbeads fixed on an inclined surface [22,23]. In this case, a single fluorescent microbead can be considered a single-point light source, and the measured PSF is obtained from the intensity distribution in the microbead images. This is challenging because it is difficult to precisely control the position of a microbead in optical cement, which leads to inaccurate measurement results. Moreover, even when microbead fixation is well resolved, human intervention in the optical cement can disrupt the local refractive index, resulting in an inconsistent refractive index in the optical medium and thus in measurement errors.

Further, deep learning is a representation learning method that works directly and learns autonomously from raw data. It is exceptional at processing large-scale, high-dimensional image data and discovering its hidden structure, making it capable of performing various tasks in the field of image analysis. In the last ten years, a series of networks have been applied to image deblurring, including convolutional neural networks (CNNs) and deep CNNs (DnCNNs) [24–27]. Generative adversarial networks (GANs) were first proposed to generate more realistic data by learning [28], followed by a large number of variants, such as Wasserstein GANs (WGANs) [29], deep convolutional GANs (DCGANs) [30], WGANs with gradient penalty (WGAN-GP) [31] and least squares GANs (LSGANs) [32]. Recently, deblur GANs have been used to achieve blind motion deblurring [33]. Compared to the deblur GANs, the super-resolution GANs (SRGANs), enhanced super-resolution GANs (ESRGANs), StyleGANs, and improved StyleGANs have achieved better results for noise removal and image filtering, because these methods adopt a perceptual loss to optimize their frameworks [34–37]. However, they are difficult to apply to the deblurring of optical microscope images, where the blurring property of the images is strongly related to the system parameters, because these techniques generate more realistic data by learning the main properties of noisy images rather than by extracting the blurring property of the microscope system.

In summary, measuring the blurring kernel of an optical system with current methods is complicated and time consuming. Furthermore, the measurement precision is easily influenced by various internal and external factors during the imaging process. StyleGAN-based methods have the potential to achieve high-resolution imaging; however, without extraction of the blurring kernel, which describes the blurring property of a microscope system, it is difficult for them to reconstruct a high-resolution image at the micro/nano scale.

3. Blur imaging and deep learning model

3.1 Basic principles of blur imaging

The blur imaging process of an optical system can be represented by a convolution process denoted as,

$$g(x,y) = f(x,y) \otimes h(x,y)$$
where g is the blurry image on the 2D imaging plane vertical to the optics axis; f is the ideal clear image; h is the PSF which describes the energy distribution property during the defocused imaging process; ⊗ is the convolution operation; (x, y) are the point coordinates on the imaging plane.

In the practical imaging process, the variation of PSF is not limited to the 2D plane perpendicular to the optical axis, but has a three-dimensional spatial variation characteristic containing the direction along the optical axis under the influence of complex factors, such as optical diffraction, defocus and changes in camera parameters. Therefore, the study of PSF is extended to the 3D space, and its spatial convolution process can be written as:

$$g(x,y,z) = f(x,y,z) \otimes h(x,y,z)$$
where h can be well approximated with a Gaussian function, denoted as,
$$h(x,y,z) = A\exp\left( -\frac{x^2}{2\sigma_x^2} - \frac{y^2}{2\sigma_y^2} - \frac{z^2}{2\sigma_z^2} \right)$$
where the standard deviations σx, σy and σz jointly determine the distribution of h in the 3D space.

Normally, the distribution of h is considered to have the same scale of variation in the x and y directions, so that σx = σy = σxy. This means that the blurry image of a source point is a round spot, and the relationship between σxy and the radius of the spot is given by,

$$\sigma_{xy}^2 = \gamma^2 r_h^2$$
where rh denotes the radius of the spot; γ denotes the scaling factor between σxy and rh.

More generally, the PSF can be approximated with other functions, as long as they satisfy the following property:

$$\iint h(x,y,z)\,dx\,dy = 1$$

This property corresponds to a lossless optical system, i.e. an optical system in which all the energy emitted by a source point is transferred to the image plane. Conversely, according to Eq. (2), the clear image can be obtained by deconvolution, which can be described as:

$$f(x,y,z) = [g(x,y,z) \otimes h(x,y,z)]^{-1}$$

From Eq. (6), we can see that when a blurry image of a sample is known, the deconvolution process depends on the accurate acquisition of the PSF. However, as discussed in Section 2, current mathematical, analytical, and experimental methods all have difficulty obtaining an accurate and general PSF for optical microscope imaging. Therefore, deep learning, with its capacity for independent analysis and learning, is introduced in this paper.
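For a concrete reference, the following short sketch illustrates Eqs. (1), (3), and (5) numerically on a single focal plane: a normalised 2D Gaussian blurring kernel is constructed and convolved with a clear image. The 17×17 kernel size matches the kernel size used later in Section 4.2; the function names and the use of SciPy are our own illustration, not part of the original implementation.

```python
import numpy as np
from scipy.signal import fftconvolve

def gaussian_kernel(size=17, sigma_xy=2.0):
    """Normalised 2D Gaussian blurring kernel, i.e. Eq. (3) restricted to one focal plane."""
    ax = np.arange(size) - (size - 1) / 2.0
    xx, yy = np.meshgrid(ax, ax)
    h = np.exp(-(xx**2 + yy**2) / (2.0 * sigma_xy**2))
    return h / h.sum()                       # enforce the lossless condition of Eq. (5)

def blur(f, sigma_xy, size=17):
    """Simulate Eq. (1): the blurry image g is the convolution of the clear image f with h."""
    h = gaussian_kernel(size, sigma_xy)
    return fftconvolve(f, h, mode="same")
```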

3.2 StyleGAN and its optimization

The StyleGAN is a more effective high-resolution image generation method: it is born out of the GAN and inherits the generative and adversarial framework. The main contribution of the StyleGAN is to rebuild the traditional generator, which feeds the latent code directly to the input layer. A comparison between the design of a traditional generator and the StyleGAN generator is shown in Fig. 1, where AdaIN is the adaptive instance normalization, PixelNorm is the pixel normalization layer, FC is a fully-connected layer, Conv is a convolutional kernel, W denotes the intermediate latent space, w denotes an intermediate latent code in W, A denotes a learned affine transform from the space W, and B denotes a noise broadcast operation. First, the style-based generator nonlinearly maps the latent code through an eight-layer fully connected mapping network from the latent space Z to the intermediate latent space W. Then, the AdaIN module performs the affine transformation, and the encoded information created in the nonlinear mapping layer is transmitted to the synthesis network. Finally, the image is generated by the synthesis network under the control of w and AdaIN.

Fig. 1. Comparison between the traditional generator structure (left of the dashed line) and the StyleGAN generator structure (right of the dashed line)

However, most of the images generated by the original StyleGAN model possess characteristic artifacts, which exist both in the final image and in the intermediate feature maps of the generator. To generate high-resolution images free from this kind of disturbance, it is necessary to tackle the problem. The optimised StyleGAN model modifies part of the generator architecture, as shown in Fig. 2, where b denotes the bias and Demod is the weight demodulation operation. Some redundant operations at the beginning of the network are removed, the addition of the bias b and of the Gaussian noise weighted by B is moved outside each “style” block, and the normalisation adjusts only the standard deviation of each feature map, eliminating the mean-value calculation. The new architecture therefore replaces the original AdaIN operation with weight demodulation, applied to the weights associated with each convolutional layer.

Fig. 2. StyleGAN generator structure (left of the dashed line) and its optimization (right of the dashed line)

The flow of weight demodulation is as follows:

  • (1) Weights of the convolutional layers are scaled with:
    $$w'_{ijk} = s_i \cdot w_{ijk}$$
    where w and w′ represent the original and modulated weights, respectively; si is the scale corresponding to the ith input feature map; and j and k enumerate the output feature maps and the spatial footprint of the convolution, respectively.

    When the lower-level feature maps are input into the higher-level style blocks, their operational scales are required to be rescaled by AdaIN for adaptation, which was later replaced by weight demodulation.

  • (2) The standard deviation of the activations after weight modulation is calculated as:
    $$\tau_j = \sqrt{\sum\nolimits_{i,k} \left(w'_{ijk}\right)^2}$$

    This step adjusts the output scales according to the L2 norm of the corresponding weights. Since the aim of the normalization is to restore the output to unit standard deviation, each output feature map j is normalized by multiplying by 1/τj, which is an equivalent substitution.

  • (3) The standard deviation from step (2) is folded into new convolutional-layer weights:
    $$w''_{ijk} = \frac{w'_{ijk}}{\sqrt{\sum\nolimits_{i,k}\left(w'_{ijk}\right)^2 + \varepsilon}}$$
    where ε is a small constant that prevents the denominator from becoming zero and keeps the value numerically stable.

The main operation of weight demodulation moves the scaling parameters into the weights of the convolutional layers, allowing better parallelisation of the computational path. This optimisation enables smooth transfer, inheritance, and continuous refinement of image features between different resolution layers, while the characteristic artifact problem is eliminated.
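As a concrete illustration of steps (1)–(3), the following PyTorch sketch applies weight modulation and demodulation to a batch of feature maps using a grouped convolution. It is a minimal re-implementation for clarity rather than the code used in this work; the tensor shapes and the ε value are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def modulated_conv2d(x, weight, style, eps=1e-8):
    """Weight (de)modulation of Eqs. (7)-(9).
    x:      input feature maps, shape (batch, in_ch, H, W)
    weight: convolution weights, shape (out_ch, in_ch, k, k)
    style:  per-sample scales s_i,  shape (batch, in_ch)
    """
    b, in_ch, H, W = x.shape
    out_ch, _, k, _ = weight.shape

    # Eq. (7): modulate the weights with the style scales s_i.
    w = weight.unsqueeze(0) * style.view(b, 1, in_ch, 1, 1)

    # Eqs. (8)-(9): demodulate so each output feature map returns to unit standard deviation.
    demod = torch.rsqrt((w ** 2).sum(dim=[2, 3, 4]) + eps)      # 1/tau_j, shape (b, out_ch)
    w = w * demod.view(b, out_ch, 1, 1, 1)

    # A grouped convolution applies a different weight set to every sample in the batch.
    x = x.reshape(1, b * in_ch, H, W)
    w = w.reshape(b * out_ch, in_ch, k, k)
    out = F.conv2d(x, w, padding=k // 2, groups=b)
    return out.reshape(b, out_ch, H, W)
```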

4. Super-resolution image reconstruction based on blurring kernel

The proposed blurring kernel extraction and super-resolution image reconstruction method based on the StyleGAN model is composed of three modules: the ideal image generation module, the blurring kernel extraction module, and the clear image reconstruction module. A functional block diagram of the proposed method is shown in Fig. 3. First, the image set of a Gaussian beam is used to train the generator, which then generates the ideal beam images. Second, the blurring kernel extraction module processes the ideal beam images to obtain the blurring kernels of the optical system at different depths. Finally, the obtained blurring kernels are used to reconstruct the high-resolution images through deconvolution.

Fig. 3. Functional block diagram of our method which is composed of three modules.

4.1 Training the image generation model

To obtain the PSF properties of an optical system under different conditions, it is reasonable to capture an image of a point light source and analyse its energy distribution, especially in the central Airy disk. However, owing to various internal and external factors, such as defocus and diffraction, it is difficult to obtain a stable, noiseless image of a light source with a high-magnification microscope. Moreover, the implicit optical features contained in optical light-source image datasets are complex and varied; this is quite different from the structural features of the image datasets usually processed by StyleGANs, such as real human faces and animals. Therefore, directly using an existing image dataset for transfer training of our model would not only reduce the training efficiency, but also cause the generated images to deviate significantly from the physical characteristics. In this study, complete retraining was adopted to obtain a new and more comprehensive training model, which represents the time-varying optical characteristics of the images of a light source.

4.1.1 Pre-processing of data

The optical light-source image dataset used in this study was not a public dataset; instead, the image sequences were collected using a laboratory microscope optical system. Therefore, pre-processing is required to eliminate unqualified images, including (1) images with low resolution; (2) over- or under-exposed images; (3) images with low contrast and low brightness; and (4) images with an incomplete Airy disk or obvious defects. In addition, the StyleGAN network structure requires square input images, so all images were uniformly cropped to 512×512 pixels.

Second, to improve the generalisation ability of the model, two data-augmentation operations, mirror flipping and centre cropping, were applied to each image in the dataset, which increases the size of the dataset.

  • (1) Centre cropping: The centre of each image is cropped so that the centre of the Airy disk coincides with the centre of the image, which ensures the accuracy of the subsequent blurring kernel extraction.
  • (2) Mirror flipping: Mirror flipping the beam images expands the dataset from existing data; because the flipped images retain the optical properties of the originals, it affects neither the extraction of the time-varying optical properties of the Airy disk nor the image quality.

Following the data pre-processing and augmentation, the dataset must be converted to a format readable by StyleGAN, and the dataset-creation commands are run on the Linux server; a minimal sketch of the two augmentation operations is given below.
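The sketch shows one plausible implementation of the centre-cropping and mirror-flipping steps with OpenCV. The way the Airy-disk centre is supplied (here as a pre-detected pixel coordinate) and the function names are our own assumptions for illustration.

```python
import cv2

def augment(image, airy_center, out_size=512):
    """Centre-crop around the Airy-disk centre, resize to 512x512, and mirror-flip."""
    cx, cy = airy_center                                  # pixel (x, y) of the Airy-disk centre
    half = min(cx, cy, image.shape[1] - cx, image.shape[0] - cy)
    crop = image[cy - half:cy + half, cx - half:cx + half]
    crop = cv2.resize(crop, (out_size, out_size), interpolation=cv2.INTER_AREA)
    flipped = cv2.flip(crop, 1)                           # horizontal mirror flip
    return crop, flipped                                  # both images are kept in the dataset
```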

4.1.2 Model training

Training the StyleGAN model is an adversarial process that alternates between the generator and the discriminator until a Nash equilibrium is reached, at which point the discriminator can no longer distinguish real images from generated ones and the generator produces the ideal images. In each adversarial step, a batch of random latent codes is first generated and passed through the generator network G to produce a batch of fake images, and a batch of real images is sampled from the training dataset. The real and fake images are provided to the discriminator network D separately to calculate scores, and the cross-entropy of these scores is computed; combined with a regularisation term, it forms the loss function of the discriminator network D. The loss function of the generator network G considers only its own cross-entropy, combined with the perceptual path length regularisation term. The entire optimisation of the StyleGAN model involves minimising the loss functions of D and G simultaneously through gradient descent. The training process in this study is divided into the following steps (a minimal sketch of one adversarial step is given after the list):

  • (1) Initialize dnnlib and PyTorch.
  • (2) Load the dataset and store the image data serially in a temporary file.
  • (3) Construct the generator network G and discriminator network D.
  • (4) Define the input terms for the training network, including the learning rate, hierarchical detail, batch size, content, and label of the snapshot grids of the image output at the maintenance time point during training.
  • (5) Define the optimizer for the generative and discriminative networks.
  • (6) Define loss functions for each graphic processing unit (GPU), training graphs, trainable variables, training datasets (real images and labels), and register gradients for optimizers.
  • (7) Create graphs for each GPU.
  • (8) Set up the training operations, including reading true images and labels, training generators, training discriminators, generator regularisation term operations, discriminator regularisation term operations, and updating variables.
  • (9) Start the training (iterative) cycle.
  • (10) Complete the training when the scheduled total training time is reached, and save the final model of the training network.
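The following PyTorch sketch condenses steps (8)–(10) into a single adversarial update: softplus (cross-entropy) losses for D and G, plus an R1 gradient penalty on the real images. It assumes that G(z) returns a batch of images and D(x) returns one logit per image; the path-length regularisation of G and the lazy-regularisation schedule of the actual StyleGAN code are omitted for brevity.

```python
import torch
import torch.nn.functional as F

def adversarial_step(G, D, real, opt_G, opt_D, z_dim=512, r1_gamma=10.0):
    """One training step: update D on real/fake images, then update G."""
    b = real.size(0)

    # --- discriminator update ---
    z = torch.randn(b, z_dim, device=real.device)
    fake = G(z).detach()
    real = real.detach().requires_grad_(True)             # needed for the R1 penalty
    d_real, d_fake = D(real), D(fake)
    loss_D = F.softplus(d_fake).mean() + F.softplus(-d_real).mean()
    grad = torch.autograd.grad(d_real.sum(), real, create_graph=True)[0]
    loss_D = loss_D + 0.5 * r1_gamma * grad.pow(2).sum(dim=[1, 2, 3]).mean()
    opt_D.zero_grad(); loss_D.backward(); opt_D.step()

    # --- generator update ---
    z = torch.randn(b, z_dim, device=real.device)
    loss_G = F.softplus(-D(G(z))).mean()
    opt_G.zero_grad(); loss_G.backward(); opt_G.step()
    return loss_D.item(), loss_G.item()
```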

4.2 Blurring kernel extraction module

The blurring kernel extraction module extracts the blurring kernel that causes the microscopic image to be blurry. The extraction process is illustrated in Fig. 4. First, the generated image is converted to greyscale. Next, a square image block is cropped with the midpoint of the Airy disk as its centre. To match the CNN in the image deconvolution module, the resize function of the OpenCV (cv2) library is called to adjust the size of the cropped block to 17×17, which is then used as the blurring kernel.
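A minimal sketch of this procedure follows. Locating the Airy-disk centre at the brightest pixel, the crop half-width, and the unit-sum normalisation of the kernel are our own assumptions, added to make the example self-contained.

```python
import cv2
import numpy as np

def extract_kernel(generated_image, crop_half=64, kernel_size=17):
    """Greyscale -> locate the Airy-disk centre -> crop a square block -> resize to 17x17."""
    grey = cv2.cvtColor(generated_image, cv2.COLOR_BGR2GRAY).astype(np.float32)
    _, _, _, (cx, cy) = cv2.minMaxLoc(grey)               # brightest pixel ~ Airy-disk centre
    block = grey[cy - crop_half:cy + crop_half, cx - crop_half:cx + crop_half]
    kernel = cv2.resize(block, (kernel_size, kernel_size), interpolation=cv2.INTER_AREA)
    return kernel / kernel.sum()                          # normalise so the kernel sums to one
```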

Fig. 4. The blurring kernel extraction process

4.3 Image deconvolution module

Non-blind image deblurring is usually expressed as a linear least-squares problem. Since a blurry image may correspond to many clear images, image deblurring is an ill-posed problem. To solve it, the variational method based on half-quadratic splitting (HQS) uses a penalised least-squares formulation to incorporate prior knowledge into the solution and solves a least-squares problem by iteratively evaluating a proximal operator. In this study, the pre-trained LCHQS-CPCR model was introduced to complete the deconvolution process. It is an end-to-end model; that is, it learns the image prior features and performs image deconvolution and deblurring without separately hand-tuned stages.
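To make the HQS idea concrete, the sketch below implements a classical (non-learned) HQS deconvolution with an anisotropic gradient prior, alternating a closed-form Fourier-domain x-step with a soft-thresholding z-step. The LCHQS-CPCR model replaces the hand-crafted prior and the linear preconditioner of this baseline with learned convolutions; the regularisation weight and penalty schedule here are illustrative.

```python
import numpy as np
from numpy.fft import fft2, ifft2

def psf2otf(psf, shape):
    """Zero-pad the kernel and circularly shift its centre to (0, 0) before the FFT."""
    pad = np.zeros(shape)
    pad[:psf.shape[0], :psf.shape[1]] = psf
    pad = np.roll(pad, (-(psf.shape[0] // 2), -(psf.shape[1] // 2)), axis=(0, 1))
    return fft2(pad)

def hqs_deconv(y, k, lam=0.005, mu=1.0, iters=10):
    """min_x ||k*x - y||^2 + lam*|Dx|_1 solved by half-quadratic splitting."""
    K = psf2otf(k, y.shape)
    Dh = psf2otf(np.array([[1.0, -1.0]]), y.shape)        # horizontal gradient filter
    Dv = psf2otf(np.array([[1.0], [-1.0]]), y.shape)      # vertical gradient filter
    x = y.copy()
    for _ in range(iters):
        # z-step: soft-threshold the image gradients (proximal operator of the L1 prior).
        gh = np.real(ifft2(Dh * fft2(x)))
        gv = np.real(ifft2(Dv * fft2(x)))
        zh = np.sign(gh) * np.maximum(np.abs(gh) - lam / mu, 0.0)
        zv = np.sign(gv) * np.maximum(np.abs(gv) - lam / mu, 0.0)
        # x-step: quadratic subproblem solved in closed form in the Fourier domain.
        num = np.conj(K) * fft2(y) + mu * (np.conj(Dh) * fft2(zh) + np.conj(Dv) * fft2(zv))
        den = np.abs(K) ** 2 + mu * (np.abs(Dh) ** 2 + np.abs(Dv) ** 2)
        x = np.real(ifft2(num / den))
        mu *= 2.0                                         # tighten the penalty each iteration
    return x
```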

During deconvolution, the LCHQS-CPCR model uses the known blurring kernel as a precondition and replaces the traditional linear preconditioner with a CNN, ensuring that the image parameters can be shared effectively. Compared to the classic fast Fourier transform and conjugate gradient descent methods, the LCHQS-CPCR method shows a significant improvement in accuracy and speed. The network structure of the deconvolution module is shown in Fig. 5. First, the network deconvolves the blurry input image through the deconvolution module and then convolves the vertical and horizontal gradients to obtain an image with less noise. Subsequently, the deconvolution is performed on the denoised gradient image to obtain a clear image, whose gradients become the input of the convolutional layer in the next iteration. After repeating these steps three times, a deblurred image is finally obtained.

Fig. 5. Network structure of the proposed image deconvolution module

In this study, we used the LCHQS-CPCR model pre-trained on the BSD500 dataset for transfer learning. The training process can be summarised in two parts, which are briefly described here. First, the exact estimated output of each iteration of the supervised LCHQS-CPCR algorithm is given by Eqs. (10)–(14).

$$L = \left[ \gamma, \sqrt{\mu}\, S \right]$$
$$U = \left[ \beta, \sqrt{\mu}\, \varphi^{\theta}(S \otimes \beta) \right]$$
$$C = \arg\min_C \|\delta - C \otimes L\|_S^2 + \xi \sum_{i = 0}^{n} \|c_i\|_S^2$$
$$\textrm{LCHQS}(\beta, \gamma, S, \theta, \nu) = \beta - \psi^{\nu}(C) \otimes (L \otimes \beta - U)$$
$$S(\theta, \nu) = \sum_{i = 1}^{N} \left\| \alpha^{(i)} - \textrm{LCHQS}\left(\beta^{(i)}, \gamma^{(i)}, S, \theta, \nu\right) \right\|_1$$
where S represents the filter set; δ is the Dirac filter; C is a preconditioning square matrix; ci is the ith of the n elements in C; ξ is the parameter of C; μ is a positive variable that is updated with the iterations; φ and ψ are the parameterized embedding functions; θ and ν are additional parameters required for the learnable convolution operations; N is the total number of iterations; (α(i), β(i), γ(i)) is a triplet to be trained; α denotes the unknown clear image to be solved; β and γ represent the blurry image to be recovered and the known blurring kernel, respectively; $\|\cdot\|_1$ denotes the 1-norm.

Second, the optimisation was performed using the Adam optimizer with predefined parameters, such as the learning rate, number of iterations, and batch size. The network was then further trained by supervising only the final output of the LCHQS-CPCR on the same training dataset, again optimised with the Adam optimizer but without per-layer supervision.

5. Experiment

The experimental platform is an independently built optical microscope imaging system, as shown in Fig. 6. The microscope was from Navitar, with a maximum magnification of 12X and a working distance of 34 mm. The objective was a compound achromatic APO lens with a magnification of 10X, a numerical aperture of 0.28, a resolution of 1 μm, and a DOF of 3.5 μm. The charge-coupled device camera was a Canadian PointGrey 1394B. A three-dimensional nano-positioning piezoelectric ceramic platform (NPBIO300, nPoint) was used. The stage provides three-axis XYZ control with a closed-loop travel of 300 µm, an open-loop travel of 360 µm, a positioning accuracy of 2 nm, a frequency of 200 Hz, and a rectification time of 20 ms.

Fig. 6. Our self-made microscopy optical imaging system.

Because it is difficult to obtain the blurring kernel of an optical system directly, we used a Gaussian beam as the observed sample and extracted the blurring kernels from the images of the beam. Because green lies in the middle of the visible spectrum, a green Gaussian light source with a wavelength of 532 nm was used in this study. The specific software environment and configuration versions are listed in Table 1.

Table 1. Our environmental parameters

5.1 Ideal image generation

First, 1530 images of the Gaussian beam at different depths were collected using our system with a depth step size Δd of 2 μm; Δd = 0 μm means that the object distance equals the ideal object distance. Some of the captured images are shown in Fig. 7, where it can be observed that the intensity distribution in each image is not stable and there is random noise that influences the precision of the blurring kernel.

Fig. 7. Original Gaussian beam images at different depths.

Further, we input the captured images into our image generation model and set the training duration to 1000 kimg, where the parameter kimg counts the training progress in thousands of real images shown to the discriminator. The images generated by our model during the training process are shown in Fig. 8. As observed, as training progressed, the generated images became closer to reality and the image quality improved substantially. Figure 8(a) is the initial image generated by the generator. Figure 8(b) is the image generated when kimg = 80, where only a rough contour exists. Figure 8(c) is the image generated when kimg = 160, where the contour of the diffraction ring begins to take shape and some orange light bands appear at the edges of the images. Figure 8(d) is the image generated when kimg = 240, at which point the StyleGAN has been misled, as some cross-shaped streamers appear in the centres of the generated images. Figure 8(e) is the image generated when kimg = 500, where the StyleGAN has recovered after learning from more correct samples: the shape of the diffraction ring can be clearly observed, and its outline becomes more distinct towards the centre. Figure 8(f) is the image generated when kimg = 1000, where the texture and details of the diffraction ring appear very realistic to the human eye, and the Airy disk becomes regular and clearly visible.

Fig. 8. Generated images during training process of the generator.

The loss curve of the generator network and the discriminator network during the training process is shown in Fig. 9. Figure 9(a) shows that the loss of the discriminator gradually stabilized when kimg = 540. In Fig. 9(b), the loss curve of the generator presented a downward trend and the loss gradually stabilized when kimg = 520.

Fig. 9. Loss curves of the generator network and discriminator network.

To analyse the performance of the trained image generation model, we introduced the Fréchet inception distance (FID), inception score (IS), and kernel inception distance (KID), which are used to evaluate the performance of GANs and their variants. The scores are presented in Table 2. According to Table 2, the initial value of FID before training was 523.3526, and it dropped to 9.0748 after training; the lower FID score indicates that the distribution of the beam images generated by this model approaches that of the ideal Gaussian beam images. The initial value of IS before training was 0.0160, rising to 1.2982 after training; a larger IS indicates better clarity and diversity of the generated results. The value of KID decreased from 0.8377 before training to 0.0072 after training, indicating that the difference between the generated data and the real data was reduced. These scores indicate that the implicit features of the Gaussian beam images are well learned by the pre-trained image generation model, and the values of the given evaluation metrics verify the image quality and diversity of the model.
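For reference, FID compares the Gaussian statistics of Inception-network features of real and generated images; a minimal sketch of the formula, assuming the feature matrices have already been extracted, is shown below. In practice the scores in Table 2 are computed by the GAN evaluation tooling, not by this snippet.

```python
import numpy as np
from scipy import linalg

def frechet_inception_distance(feat_real, feat_fake):
    """FID between two feature sets of shape (N, d):
    ||mu_r - mu_f||^2 + Tr(C_r + C_f - 2 (C_r C_f)^{1/2})."""
    mu_r, mu_f = feat_real.mean(axis=0), feat_fake.mean(axis=0)
    c_r = np.cov(feat_real, rowvar=False)
    c_f = np.cov(feat_fake, rowvar=False)
    covmean, _ = linalg.sqrtm(c_r.dot(c_f), disp=False)   # matrix square root
    covmean = covmean.real                                # drop tiny imaginary residue
    return float(np.sum((mu_r - mu_f) ** 2) + np.trace(c_r + c_f - 2.0 * covmean))
```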

Table 2. Model evaluation results

Finally, after setting the truncation value, which controls the diversity of the generated images, we obtained a number of generated images of the Gaussian beam at each depth. In this study, we set the truncation value to 0.5 and obtained 50 generated images for each depth. We randomly selected one image from the generated sequence at each depth; the images are shown in Fig. 10, where it can be observed that the subtle random noise visible in the original beam images of Fig. 7 has disappeared.

Fig. 10. The generated Gaussian beam images at different depths.

5.2 Extraction of the blurring kernel

In this module, we selected an appropriate image at each depth from the generated images and extracted its blurring kernel, which represents that of the system at this depth position. First, the selected image was converted to greyscale, and the centre of the Airy disk was roughly located through gradual cropping. Finally, the energy distribution of the Airy disk was extracted as the blurring kernel, whose size was 17×17 in this study. The greyscale image is shown in Fig. 11(a), its central part including the Airy disk is shown in Fig. 11(b), and the extracted blurring kernel is shown in Fig. 11(c), where it can be observed that the Airy disk is accurately captured.

Fig. 11. Extraction of the blurring kernel from the generated image.

To further verify that the blurring kernel extracted from the generated image conforms to the laws of optical wave propagation and intensity distribution during imaging, we used the theoretical model described in [26] to calculate the light intensity distribution of the Gaussian beam on the image plane perpendicular to the optical axis, expressed as Eq. (15). Further, we extracted the blurring kernels from the original image at Δd = 0 μm and its corresponding generated image, and calculated their average intensity curves through the centre of the intensity peak; the resulting curves are compared in Fig. 12.

$$I_P \approx \left| \frac{E_0 \eta R^2 (1 + \cos\delta)}{q z_0\, \omega(-(d_0 + \Delta d))} \cos\!\left( \upsilon \frac{\rho^2}{2 z_0} \right) \right|^2 \left\{ \sum_{s = 0}^{\infty} (-1)^s \left( \frac{q}{u} \right)^{1 + 2s} J_{1 + 2s}(u) + \sum_{s = 0}^{\infty} (-1)^s \left( \frac{q}{u} \right)^{2 + 2s} J_{2 + 2s}(u) \right\}$$
$$q = 2R^2 \sqrt{ \left\{ \eta \left[ 1/R(-z_0) + 1/z_0 \right]/2 \right\}^2 + 1/\omega^4(-(d_0 + \Delta d)) }$$
$$\omega(-(d_0 + \Delta d)) = \sqrt{ \omega_0^2 + \left( \lambda (d_0 + \Delta d)/\pi \omega_0 \right)^2 }$$
$$u = \frac{\eta \rho R}{z_0}$$
where IP is the light intensity, d0 is the ideal object distance of the optical system, Δd is the variation of the object distance, R is the radius of the beam waist, z0 is a constant representing the distance from the optical lens to the viewing screen, E0, ω0 and η are parameters of the Gaussian beam, cosδ is a constant related to the beam, λ is the wavelength of the beam, υ is the wavenumber, and ρ is the distance from a point on the image plane to the optical axis. The comparison of the light intensity distribution of the generated image with the actual and theoretical light intensity distributions is shown in Fig. 12.

Fig. 12. Intensity distribution comparison in the generated image and the original image.

As shown in Fig. 12, compared with the light intensity distribution curve of the original image, the intensity distribution curve of our generated image is closer to the theoretical intensity curve; additionally, it is smoother and more symmetric about the optical axis. Because most of the energy of the light intensity distribution is concentrated in the Airy disk, the main peak of the distribution curve was chosen for further analysis, and the root mean square error (RMSE) relative to the theoretical model was calculated. The RMSE between the generated distribution curve and the theoretical curve was 0.071, while the RMSE between the original distribution curve and the theoretical curve was 0.133. This proves that the image generated by our module is closer to the theoretical description of the light intensity distribution: the generated optical image inherits the overall characteristics of the optical images while improving the regularity of the light intensity distribution by removing the random noise present in the original images.
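A minimal sketch of this comparison is given below: it extracts a normalised intensity profile through the peak of a greyscale image and computes the RMSE against a theoretical curve sampled on the same grid. Profile extraction along the peak row and peak normalisation are our simplifying assumptions.

```python
import numpy as np

def peak_profile(grey_image):
    """Intensity profile through the row containing the global intensity peak,
    normalised to unit maximum so that different images are comparable."""
    r, _ = np.unravel_index(np.argmax(grey_image), grey_image.shape)
    profile = grey_image[r, :].astype(np.float64)
    return profile / profile.max()

def rmse(profile, reference):
    """Root mean square error against the theoretical curve of Eqs. (15)-(18)."""
    return float(np.sqrt(np.mean((profile - reference) ** 2)))
```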

5.3 Image deblurring with the extracted blurring kernel

5.3.1 Deblurring the Gaussian beam images

After extracting the blurring kernels at different depths, we used them to reconstruct clear images of the Gaussian beam at the corresponding depths with our clear image reconstruction model. The results and the corresponding blurring kernels are shown in Fig. 13. The quality of the reconstructed Gaussian beam images in Fig. 13, the original images in Fig. 7, and the generated images in Fig. 10 was compared in terms of image entropy.

Fig. 13. Reconstructed images of the Gaussian source light with corresponding kernels at different depths.

The results are listed in Table 3, from which it can be observed that the image entropy values of the generated images at different depths were smaller than those of the corresponding original images, because some random noise was removed from the original images during image generation. In contrast, the image entropy values of the reconstructed images were higher than those of both the generated and the original images. This indicates that our deblurring model, which reconstructs clear images with the blurring kernel, is effective, because the images are reconstructed from the perspective of blurring theory rather than by merely filtering noise.

Table 3. Model evaluation results

5.3.2 Deblurring the microbead images

After obtaining the blurring kernels of the optical system from the images of a single-wavelength Gaussian beam at different depths, we further extended our method to deblur blurry images of samples under visible-light illumination; this is reasonable because green lies in the middle of the visible spectrum. In our experiment, images of dynamic microbeads in solution were collected by the system, where the diameter of the microbeads is 10.08 μm; the images at different depths are shown in Fig. 14. The results of clear image reconstruction based on the LCHQS-CPCR model are shown in Fig. 15, where it can be observed that the outlines of the microbeads have become clearer and the noise between the microbeads has been effectively removed.

Fig. 14. Original microbead images at different depths.

Fig. 15. Clear images after reconstruction using our method.

To evaluate the degree of clarity of the reconstructed images at different depths, we divided each reconstructed image into four equal regions, and calculated the Laplacian value, the average gradient, and the image entropy of each region, as listed in Table 4. The following conclusions can be drawn from the listed results:

  • (1) For the Laplacian value, the maximum improvement obtained after image reconstruction was 47.862 and the lowest was 31.388. The improvement decreases as the depth variation increases; however, there is an obvious improvement at every depth position.
  • (2) For the average gradient, the average improvement was approximately 0.047. Although the improvement did not change substantially with increasing depth variation, the gradient value of the reconstructed images increased at every depth.
  • (3) The average improvement of the image entropy was higher than that of the average gradient; the improvement remained largely unchanged with depth variation.

Table 4. Quality evaluations of the reconstructed images with different factors

In other words, the reconstructed images had a better quality at different depths, regardless of the evaluation index used. This implies that the proposed method is effective in improving the resolution of the optical microscopic images.
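The three clarity metrics are not defined explicitly in the text; the sketch below uses common conventions (variance of the Laplacian response, mean gradient magnitude, and Shannon entropy of the 8-bit grey-level histogram), which should be read as our assumptions rather than the exact definitions used for the tables.

```python
import cv2
import numpy as np

def sharpness_metrics(grey):
    """Laplacian value, average gradient and image entropy of an 8-bit greyscale image."""
    grey = grey.astype(np.float64)
    # Variance of the Laplacian response: larger values indicate sharper edges.
    laplacian = cv2.Laplacian(grey, cv2.CV_64F).var()
    # Average gradient: mean magnitude of the finite-difference gradients.
    gy, gx = np.gradient(grey)
    avg_gradient = np.mean(np.sqrt((gx ** 2 + gy ** 2) / 2.0))
    # Shannon entropy of the grey-level histogram.
    hist, _ = np.histogram(grey, bins=256, range=(0, 256))
    p = hist / hist.sum()
    entropy = -np.sum(p[p > 0] * np.log2(p[p > 0]))
    return laplacian, avg_gradient, entropy
```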

5.4 Comparison and evaluation

To further compare the reconstruction performance of our method with that of conventional reconstruction methods, the inverse filtering method, Wiener filtering, and the Lucy-Richardson (LR) deconvolution algorithm were used to reconstruct the same original blurry images. Because these conventional methods deblur by deconvolving the original image with a PSF, an appropriate PSF must be obtained for them. To estimate these PSFs, we randomly selected a high-resolution image from a series of images captured by our microscopic system at the given depth position and cropped the Airy disk from it. Next, we converted it to a greyscale image and calculated its intensity map. Finally, we fitted the intensity map with a two-dimensional Gaussian function to obtain the PSF at this position. When Δd = 0 μm, the standard deviation of the fitted Gaussian function was 0.63 and the parameter K of the Wiener filter was 0.05. The reconstruction results obtained from these methods are shown in Fig. 16. Figure 16(a) shows the original image, Fig. 16(b) the result of the inverse filter, Fig. 16(c) the Wiener filter reconstruction, Fig. 16(d) the result of the LR deconvolution, and Fig. 16(e) the result of reconstruction using the proposed method.
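The sketch below shows one way to reproduce the three baselines with NumPy and scikit-image, using the fitted Gaussian PSF (σ = 0.63) and K = 0.05 quoted above. The 17×17 PSF support, the small constant guarding the inverse filter, the mapping of K onto scikit-image's balance parameter, and the LR iteration count (the num_iter keyword follows recent scikit-image versions) are our assumptions.

```python
import numpy as np
from numpy.fft import fft2, ifft2
from skimage.restoration import wiener, richardson_lucy

def gaussian_psf(size=17, sigma=0.63):
    """Fitted 2D Gaussian PSF for the depth position at Δd = 0 μm."""
    ax = np.arange(size) - (size - 1) / 2.0
    xx, yy = np.meshgrid(ax, ax)
    psf = np.exp(-(xx ** 2 + yy ** 2) / (2.0 * sigma ** 2))
    return psf / psf.sum()

def conventional_baselines(blurry, psf, K=0.05):
    """Inverse filtering, Wiener filtering, and Lucy-Richardson deconvolution."""
    # Inverse filter: direct division in the Fourier domain (very noise-sensitive).
    pad = np.zeros(blurry.shape)
    pad[:psf.shape[0], :psf.shape[1]] = psf
    pad = np.roll(pad, (-(psf.shape[0] // 2), -(psf.shape[1] // 2)), axis=(0, 1))
    inverse = np.real(ifft2(fft2(blurry) / (fft2(pad) + 1e-6)))
    # Wiener filter; the regularisation weight plays the role of K here.
    wiener_rec = wiener(blurry, psf, balance=K)
    # Lucy-Richardson deconvolution with a fixed number of iterations.
    lr_rec = richardson_lucy(blurry, psf, num_iter=30)
    return inverse, wiener_rec, lr_rec
```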

Fig. 16. Results of different reconstruction methods. (a) Original image. (b) Result of inverse filter. (c) Result of Wiener filter. (d) Result of LR. (e) Result of our method.

From Fig. 16, we can obtain the following conclusions:

  • (1) The inverse filtering reconstruction method is considerably distorted and has a large amount of noise in the reconstruction result; therefore, the reconstructed image is the worst.
  • (2) The reconstruction result of the Wiener filtering is better than that of the inverse filtering: the beads can be observed after adjusting the value of parameter K several times; however, the contrast between the substrate and the beads is low.
  • (3) The reconstruction effect of the LR deconvolution algorithm is much better than that of the previous two algorithms. The edges of the microbeads are clearly visible, but some subtle noise remains in the fine details of the image.
  • (4) The reconstructed image of the proposed method is the clearest, and there is no additional noise in the reconstruction process.

To compare the reconstructed images quantitatively, we calculated the Laplacian value, average gradient value, and image entropy of each reconstructed image. The results are detailed in Table 5, which shows unusually high Laplacian and average gradient values for the inverse filtering. This is because inverse filtering is more sensitive to image noise and requires an accurate noise estimate, which leads to more impurities in the result and a greater impact on the evaluation indexes. For the proposed method, the Laplacian value increased from 103.63 to 161.47, which shows that our method improves the edge definition to a certain extent, and the average gradient and image entropy of the proposed method are better than those of the Wiener filtering and the LR algorithm. Therefore, the experimental results with different factors show that the proposed method achieves better results in the reconstruction of microscopic blurry images than the other methods.

Table 5. Quality evaluation with different indexes

To evaluate the image reconstruction at different depths, we calculated the Laplacian values, the average gradient values, and the image entropy values of all the images reconstructed at different depths; the curves are shown in Fig. 17. Except for the LR algorithm under the Laplacian metric, the proposed method performs better than the other methods on all three indexes. This indicates that the proposed method can effectively deblur optical microscopic images at different depths.

Fig. 17. Clarity evaluation of the reconstructed image using different indexes. (a) Laplacian. (b) Average gradient. (c) Image entropy.

6. Conclusion

In this work, we proposed and developed a method to extract blurring kernels and reconstruct super-resolution images based on StyleGANs by designing a feature analysis system for microscopic images. The first contribution is the training of a StyleGAN-based model that automatically generates a series of ideal Gaussian light-source images with a clear Airy disk at different depths; from these images, the intensity distribution in the Airy disk can be analysed and the blurring kernel of the given optical system can be extracted. The second contribution is a clear image reconstruction model based on LCHQS-CPCR, in which the blurry images of the single-wavelength Gaussian light source are reconstructed with the extracted blurring kernel. Owing to the autonomous learning capability of deep learning, the method avoids redundant analysis and modelling of influencing factors, such as defocusing, diffraction, and object movement, which cause complex disturbances to the imaging results. By learning and simulating the beam images of a light source containing various interference factors, the system generates ideal beam images that suppress random interference, extracts the blurring kernel, and reconstructs the blurry image with super-resolution through deconvolution. We chose images of suspended microbeads in solution under visible-light illumination to test our method, and we achieved high-resolution image reconstruction with the blurring kernels extracted from the generated optical images of the Gaussian light source.

Funding

National Natural Science Foundation of China (61973059).

Disclosures

The authors declare no conflicts of interest.

Data Availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

References

1. S. C. Herath, D. Yue, S. Hui, M. C. Kim, D. A. Wang, Q. Wang, K. V. Vliet, H. Asada, and P. Y. Chen, “Quantification of magnetically induced changes in ECM local apparent stiffness,” Biophys. J. 106(1), 332–341 (2014). [CrossRef]  

2. R. P. J. Barretto, T. H. Ko, and J. C. Jung, “Time-lapse imaging of disease progression in deep brain areas using fluorescence microendoscopy,” Nat. Med. 17(2), 223–228 (2011). [CrossRef]  

3. R. C. Word, J. P. S. Fitzgerald, and R. Konenkamp, “Direct imaging of optical diffraction in photoemission electron microscopy,” Appl. Phys. Lett. 103(2), 0211183 (2013). [CrossRef]  

4. I. Kantor, V. Prakapenka, A. Kantor, P. Dera, A. Kurnosov, S. Sinogeikin, A. Dubrovinskia, and L. Dubrovinsky, “A new diamond anvil cell design for X-ray diffraction and optical measurements,” Rev. Sci. Instrum. 83(12), 125102 (2012). [CrossRef]  

5. H. Oberst, D. Kouznetsov, K. Shimizu, J. Fujita, and F. Shimizu, “Fresnel diffraction mirror for atomic wave,” Phys. Rev. Lett. 94(1), 013203 (2005). [CrossRef]  

6. L. W. Chen, Y. Zhou, R. Zhou, and M. H. Hong, “Microsphere-toward future of optical microscopes,” iScience 23(6), 101211 (2020). [CrossRef]  

7. Y. J. Wei, Z. L. Dong, and C.D. Wu, “Depth measurement using single camera with fixed camera parameters,” IET Comput. Vis. 6(1), 29–39 (2012). [CrossRef]  

8. Y. J Wei, C. D. Wu, and Z.L. Dong, “Global depth reconstruction of nano grid with singly fixed camera,” Sci. China Technol. Sci. 54(4), 1044–1052 (2011). [CrossRef]  

9. P. Sarder and A. Nehorai, “Deconvolution methods for 3-d fluorescence microscopy images,” IEEE Signal Process. Mag. 23(3), 32–45 (2006). [CrossRef]  

10. J. Kim, S. An, S. Ahn, and B. Kim, “Depth-variant deconvolution of 3d widefield fluorescence microscopy using the penalized maximum likelihood estimation method,” Opt. Express 21(23), 27668 (2013). [CrossRef]  

11. N. Patwary and C. Preza, “Image restoration for three-dimensional fluorescence microscopy using an orthonormal basis for efficient representation of depth-variant point-spread functions,” Biomed. Opt. Express 6(10), 3826–3841 (2015). [CrossRef]  

12. C. Roider, R. Heintzmann, R. Piestun, and A. Jesacher, “Deconvolution approach for 3d scanning microscopy with helical phase engineering,” Opt. Express 24(14), 15456–15467 (2016). [CrossRef]  

13. Y. F. Wang, H. J. Zhao, H. Z. Jiang, and X. D. Li, “Defocusing parameter selection strategies based on PSF measurement for square-binary defocusing fringe projection profilometry,” Opt. Express 26(16), 20351–20367 (2018). [CrossRef]  

14. H. Bo, H. Babcock, and X. W. Zhuang, “Breaking the diffraction barrier: super-resolution imaging of cells,” Cell 143(7), 1047–1058 (2010). [CrossRef]  

15. J. Xin, D. Mao, S. Wei, and Q. H. Dai, “Point spread function for diffuser cameras based on wave propagation and projection model,” Opt. Express 27(9), 12748–12761 (2019). [CrossRef]  

16. M. Siemons, C. N. Hulleman, R. Ø. Thorsen, C. S. Smith, and S. Stallinga, “High precision wavefront control in point spread function engineering for single emitter localization,” Opt. Express 26(7), 8397–8416 (2018). [CrossRef]  

17. C. Zhang, R. Zhu, and K. Y. Wong, “Point-spread function manipulation of the swept-source optical coherence tomography through temporal phase modulation,” Opt. Express 26(6), 7270–7280 (2018). [CrossRef]  

18. R. Andra and N. Khonina, “Apodization for improving the two-point resolution of coherent optical systems with defect of focus,” Appl. Phys. B 124(12), 1–9 (2018). [CrossRef]  

19. M. Lehmann, C. Wittpahl, H. B. Zakour, and A. Braun, “Resolution and accuracy of nonlinear regression of point spread function with artificial neural networks,” Opt. Eng. 58(04), 1–12 (2019). [CrossRef]  

20. V. Debarnot, P. Escande, T. Mangeat, and P. Weiss, “Learning low-dimensional models of microscopes,” IEEE Trans. Comput. Imaging 99, 1 (2020). [CrossRef]  

21. X. J. Mao, C. H. Shen, and Y. B. Yang, “Image restoration using very deep convolutional encoder-decoder networks with symmetric skip connections,” in Proc. NIPS, 2016, pp. 2802–2810.

22. A. Diezmann, M. Y. Lee, M. D. Lew, and W. Moerner, “Correcting field-dependent aberrations with nanoscale accuracy in three dimensional single-molecule localization microscopy,” Optica 2(11), 985–993 (2015). [CrossRef]  

23. A. Aristov, B. Lelandais, E. Rensen, and C. Zimmer, “ZOLA-3D allows flexible 3D localization microscopy over an adjustable axial range,” Nat. Commun. 9(1), 2409 (2018). [CrossRef]  

24. V. Jain and H. S. Seung, “Natural image denoising with convolutional networks,” in Proc. NIPS, 2008, pp.1–8.

25. J. Xie, L. Xu, and E. Chen, “Image denoising and inpainting with deep neural networks,” in Proc. NIPS, 2012, pp. 341–349.

26. B. Harold, C. Christian, J. Schuler, and S. Harmeling, “Image denoising with multi-layer perceptrons, part 1: comparison with existing algorithms and with bounds,” Comput Sci 38, 1544 (2012).

27. K. Zhang, W. M. Zuo, Y. J. Chen, D. Y. Meng, and L. Zhang, “Beyond a Gaussian denoiser: residual learning of deep CNN for image denoising,” IEEE Trans. on Image Process. 26(7), 3142–3155 (2017). [CrossRef]  

28. I. J. Goodfellow, J. P. Abadie, and M. Mirza, “Generative adversarial networks,” in Proc. NIPS, 2014, pp.2672–2680.

29. M. Arjovsky, S. Chintala, and L. Bottou, “Wasserstein generative adversarial networks,” in Proc. ICML, 2017, pp.214–223.

30. A. Radford, L. Metz, and S. Chintala, “Unsupervised representation learning with deep convolutional generative adversarial networks,” in Proc. ICLR, 2015, pp.1–16.

31. G. Ishaan, A. Faruk, and M. Arjovsky, “Improved training of wasserstein gans,” in Proc. NIPS, 2017, pp.5767–5777.

32. X. D. Mao, Q. Li, and H. R. Xie, “Least squares generative adversarial networks,” in Proc. ICCV, 2017, pp.2794–2802.

33. K. Orest, V. Budzan, M. Mykhailych, and J. Matas, “DeblurGAN: blind motion deblurring using conditional adversarial networks,” in Proc. CVPR, 2018, pp. 8183–8192.

34. C. Ledig, L. Theis, F. Huszar, and J. Caballero, “Photo-realistic single image super-resolution using a generative adversarial network,” in Proc. CVPR, 2017, pp.4681–4690.

35. X. T. Wang, K. Yu, S. X. Wu, and J. J. Gu, “Esrgan: enhanced super-resolution generative adversarial networks,” in Proc. ECCV, 2019, pp.3810–3814.

36. K. Tero, S. Laine, and T. Aila, “A Style-Based Generator Architecture for Generative Adversarial Networks,” in Proc. CVPR,2019, pp. 4401–4410.

37. K. Tero, S. Laine, and M. Aittala, “Analyzing and Improving the Image Quality of StyleGAN,” in Proc. CVPR, 2020, pp.8110–8119.
