Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Orthonormal vector polynomials in a unit circle, Part II : completing the basis set

Open Access Open Access

Abstract

Zernike polynomials provide a well known, orthogonal set of scalar functions over a circular domain, and are commonly used to represent wavefront phase or surface irregularity. A related set of orthogonal functions is given here which represent vector quantities, such as mapping distortion or wavefront gradient. Previously, we have developed a basis of functions generated from gradients of Zernike polynomials. Here, we complete the basis by adding a complementary set of functions with zero divergence – those which are defined locally as a rotation or curl.

©2008 Optical Society of America

1. Introduction

In a previous paper,1 we developed an orthonormal set of vector polynomials over a unit circle, which we call S⃗ polynomials. These polynomials are great for fitting the slope data taken by a Shack-Hartmann sensor. But since they are gradients of linear combinations of Zernike polynomials, they have zero curl, which means they make an incomplete set of vector polynomials such that an arbitrary continuously differentiable vector function defined over a unit circle cannot be represented by linear combinations of these polynomials. Additional vector polynomials must be added to make a complete set. An optical application that needs a complete set of vector polynomials is to fit the mapping distortions of an interferometric null test. See Reference 2 for an example of mapping distortion in a null test and how to correct it. The lowest modes of mapping distortions include translation, scaling and rotation. Polynomial S⃗ 2 and S⃗ 3 represent x and y translation, respectively, and S⃗ 4 represents scaling. But no S⃗ polynomial represents rotation. The reason is that the rotation vector has non-zero curl, while all S⃗ polynomials have zero curl. In this paper, we derived a complementary set of vector polynomials which have zero divergence and non-zero curl. This new set combined with the zero-curl set S⃗ makes a complete set such that it can represent any continuously differentiable vector polynomials defined over a unit circle.

In Section 2, we review the S⃗ polynomials we derived and reported in a previous paper. We then proceed to derive the complementary T⃗ polynomials in Section 3.

2. The S⃗ polynomials

In a previous paper1, we derived an orthonormal set of vector polynomials over a unit circle. We call this set the S⃗ polynomials. Each S⃗ polynomial is the gradient of a scalar function:

Sj=ϕj=îϕjx+ĵϕjy.

The scalar functions ϕj are linear combination of Zernike polynomials. Following Noll’s notation and numbering scheme,3

ϕj=12n(n+1)Zj,foralljwithn=m,

and

ϕj=14n(n+1)(Zjn+1n1Zj(n=n2,m=m)),foralljwith nm.

Each S⃗ polynomial is then the linear combinations of gradient of Zernike polynomials following (1) and (2). Since gradient of Zernike polynomials can also be represented by Zernike polynomials,3 S⃗ polynomials can be written as linear combinations of Zernike polynomials as well. The first 14 non-trivial S⃗ polynomials are listed in Table 1.

Tables Icon

Table 1. List of the first 14 non-trivial S⃗ polynomials as linear combinations of Zernike polynomials.

If A⃗ and B⃗ are two vector polynomials defined over a unit circle, we define their inner product as

(A,B)=1π(AB)dxdy,

where integration is over unit circle.

S⃗ polynomials are orthonormal, which means

(Si,Sj)=1π((ϕi)(ϕj))dxdy=δij.

3. Derivation of a complementary set of vector polynomials

Any vector field can be written as4

v=ϕ+×P,

where ϕ is a scalar and P⃗ is a vector. The divergence of ν⃗ is then

∇•ν⃗=∇2 ϕ+∇•(∇×P⃗)=∇2 ϕ,

and the curl of ν⃗ is

∇×ν⃗=∇×(∇×P⃗)=∇(∇•P⃗)-∇2 P⃗.

The S⃗ polynomials presented in the previous paper1 were defined as gradients of scalar functions, so have no curl component and P⃗=0. We complete the basis by adding a second set that has zero divergence, therefore zero ϕ, but non zero P⃗, such that

T=×P=[îĵk̂xyzPxPyPz].

This set has to be mutually orthogonal as well.

Like the S⃗ polynomials, T⃗ polynomials are vectors defined in x-y plane only. A convenient choice of P⃗ is vectors along z axis only, i.e. Px=Py=0. We can use a scalar ψ instead to represent P⃗:

P=ψk,̂

where ψ is a function of x and y: ψ=ψ (x, y). It follows that

Ti=×(ψik̂)=îψiyĵψix.

The inner product of two T⃗ polynomials is then

(Ti,Tj)=((ψiy)(ψjy)+(ψix)(ψjx))dxdy
=((ψi)(ψj))dxdy.

We choose a basis of functions {ψi} that we use to generate the T⃗ polynomials to be the same basis as we used to generate the S⃗ polynomials, {ϕi}. By Eq. (4), we know that their choice will create T⃗ polynomials that are mutually orthogonal.

From Eqs. (1) and (8), and with ψi=ϕi, we know that S⃗i(x,y) and T⃗i(x,y) have same magnitude and are orthogonal to each other at any point in a unit circle, therefore (S⃗i,T⃗i)=0. But the sets S⃗ and T⃗ are not fully independent. For all the j with m=n, we can show that

∇×T⃗j=-2 ϕj,

and

2ϕj2Zj(1rr(rr)+1r22θ2)[rn(cosnθsinnθ)]=0,

which means T⃗j has 0 curl and is therefore not linearly independent of S⃗ polynomials. For example, when j=9 or 10, m=n=3: T9=12(îZ6ĵZ5)=S10 and T10=12(îZ5ĵZ6)=S9 .

For any other pair of i and j, (S⃗i,T⃗j)=0.

 figure: Fig. 1.

Fig. 1. Relations between the S⃗ and T⃗ polynomials. The Laplacian vector fields are the overlap between S⃗ and T⃗. The dashed circles and associated solid arrows illustrate the local behaviors of the vectors in different sets after subtracting the local constant vector.

Download Full Size | PDF

The S⃗ and T⃗ polynomials can be thought of as vector fields in a unit circle. In vector calculus, S⃗ is known as irrotational vector fields which have zero curl everywhere, and T⃗ is known as solenoidal vector fields which have zero divergence everywhere. The two types vector fields have some overlap where both divergence and curls are everywhere zero, which is known as Laplacian vector field. The overall relationship between S⃗ and T⃗ vector fields is illustrated in Figure 1. The overlapped area contains terms derived from corresponding scalar ϕ polynomials whose Laplacian is 0. If ϕ represents wavefront, these terms correspond to a wavefront that has zero net curvature at any point in the pupil.

It is useful to compare the different types of functions defined here. The S⃗ functions are generated from gradients, thus have no curl. Since S⃗ functions are 2-d vectors defined in a plane, mathematically, we can express the curl as line integral along a closed path in the plane:

Sdl=0.

The T⃗ functions have no divergence. Again they are 2-d vectors defined in a plane. Mathematically, we express divergence of a 2-d vector as a line integral over a closed path:

Tn̂dl=0,

where is the unit normal vector pointing out of the closed path.

The intersection, which includes both S⃗ and T⃗, is of the form that fits both Eqs. (11) and (12), having both zero divergence and zero curl. Graphical depictions of the local behavior of the functions are included in Fig. 1: dashed circles represent infinitesimal region and solid arrows represent local vectors (after a constant vector is subtracted.)

Table 2 lists expressions for the first 15 T⃗ polynomials. The plots of first 12 non-trivial T⃗ polynomials are shown in Table 3. The complete set of orthogonal vector polynomials that fully spans the circular domain can be written as the combined set of S⃗ polynomials and independent T⃗ polynomials since the Laplacian type functions are included in both sets. Care must be taken to ensure that the common functions are not counted twice.

Tables Icon

Table 2. Analytical expressions of the first 15 T⃗ polynomials.

Tables Icon

Table 3. Plots of the first 12 non-trivial T⃗ polynomials.

4. Summary

We derived a set of vector polynomials defined over a unit circle which complements the set we presented in a previous paper. Each set of the vector polynomials is orthonormal over a unit circle. But there are some overlap between the two sets. We can combine the two sets in such a way that the overlapped subset is counted only once, then we obtain a complete set of vector polynomials defined over a unit circle. The combined set is useful for fitting any continuously differentiable vector functions in a circular domain. In particular, it is useful in fitting the mapping distortions often seen in an interferometric null test. We will explore and report its applications in subsequent papers.

If interested, you can request the MATLAB codes for calculating the S⃗ and T⃗ polynomials from Dr. Chunyu Zhao, czhao@optics.arizona.edu.

References and links

1. C. Zhao and J. H. Burge, “Orthonormal vector polynomials in a unit circle, Part I: basis set derived from gradients of Zernike polynomials,” Opt. Express 15, 18014–18024 (2007). [CrossRef]   [PubMed]  

2. C. Zhao, et al, “Figure measurement of a large optical flat with a Fizeau interferometer and stitching technique,” Proc. SPIE 6293, 62930k, (2006). [CrossRef]  

3. R. J. Noll, “Zernike polynomials and atmospheric turbulence”, J. Opt. Soc. Am. 66, 1976, 207–211. [CrossRef]  

4. H. F. Davis and A. D. Snider, Introduction to Vector Analysis, (Wm. C. Brown Publisher, 1986).

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (1)

Fig. 1.
Fig. 1. Relations between the S⃗ and T⃗ polynomials. The Laplacian vector fields are the overlap between S⃗ and T⃗. The dashed circles and associated solid arrows illustrate the local behaviors of the vectors in different sets after subtracting the local constant vector.

Tables (3)

Tables Icon

Table 1. List of the first 14 non-trivial S⃗ polynomials as linear combinations of Zernike polynomials.

Tables Icon

Table 2. Analytical expressions of the first 15 T⃗ polynomials.

Tables Icon

Table 3. Plots of the first 12 non-trivial T⃗ polynomials.

Equations (14)

Equations on this page are rendered with MathJax. Learn more.

S j = ϕ j = i ̂ ϕ j x + j ̂ ϕ j y .
ϕ j = 1 2 n ( n + 1 ) Z j , for all j with n = m ,
ϕ j = 1 4 n ( n + 1 ) ( Z j n + 1 n 1 Z j ( n = n 2 , m = m ) ) , for all j with n m .
( A , B ) = 1 π ( A B ) dxdy ,
( S i , S j ) = 1 π ( ( ϕ i ) ( ϕ j ) ) dxdy = δ ij .
v = ϕ + × P ,
T = × P = [ i ̂ j ̂ k ̂ x y z P x P y P z ] .
P = ψ k , ̂
T i = × ( ψ i k ̂ ) = i ̂ ψ i y j ̂ ψ i x .
( T i , T j ) = ( ( ψ i y ) ( ψ j y ) + ( ψ i x ) ( ψ j x ) ) dxdy
= ( ( ψ i ) ( ψ j ) ) dxdy .
2 ϕ j 2 Z j ( 1 r r ( r r ) + 1 r 2 2 θ 2 ) [ r n ( cos n θ sin n θ ) ] = 0 ,
S d l = 0 .
T n ̂ d l = 0 ,
Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.