# Optical diffraction in close proximity to plane apertures. III. Modified, self-consistent theory.

The classical theory of diffraction at plane apertures illuminated by normally incident light is modified so that diffraction on the source side of the screen is taken into consideration and the energy transport across the aperture plane is described by continuous functions. The modified field expressions involve the sums and differences of the Rayleigh-Sommerfeld diffraction integrals as descriptors of a bidirectional flow of energy in the near zones on either side of the aperture. The theory is valid for unpolarized fields, and a pragmatic argument is presented that it is applicable to metallic as well as black screens. The modified field expressions are used for numerical near-field computations of the diffraction profiles and transmission coefficients of circular apertures and slits. In the mid zone the modified theory is reduced to the Fresnel approximation, and here the latter may be used with confidence.Key words: bidirectional scalar fields; boundary-value theory; circular apertures; diffraction; Kirchhoff; irradiance; near zone; optics; polarization; Rayleigh; scalar wave functions; slits; Sommerfeld; transmission coefficients.

**********

1. Introduction

This is a continuation of previous papers [1,2] in which the physical significance of the classical Rayleigh-Sommerfeld and Kirchhoff diffraction integrals was assessed and their suitability for computations in the near zone was analyzed. The need for such computations arises, for example, in the evaluation of radiometric diffraction errors, where it is necessary to know the transmission coefficients of the apertures used for the measurements. The computation of these coefficients is a near-zone task even for large aperture-detector distances.

The specific situation considered is a plane aperture A contained in an infinitesimally thin screen S that occupies the xy-plane of a Cartesian coordinate system and is illuminated from the half space z < 0 by a normally incident monochromatic plane wave with irradiance [E.sub.0] and wavelength [lambda]. The resulting optical field is denoted by a scalar wave function,

U(P) = [square root of ([E.sub.0])]u(P), |u(P)| [less than or equal to] 1, (1)

and is expressed in the Rayleigh-Sommerfeld theory in terms of the surface integrals,

[u.sub.RS.sup.(p)] (P) = -[[ik]/[2[eth]]] [[integral].[A]] dQ [[e.sup.ikQP]/[QP]], z > 0, (2a)

[u.sub.RS.sup.(s)] (P) = [1/[2[eth]]] [[integral].[A]] dQ [[partial derivative]/[[partial derivative]z]] ([e.sup.ikQP]/[QP]) = [1/ik] [[[partial derivative][u.sub.RS.sup.(p)]]/[[partial derivative]z]], z > 0 (2b)

where a metallic screen illuminated by p- or s-polarized light is assumed. (1) The corresponding expression in Kirchhoff's theory, which is usually associated with black screens, is

[u.sub.K] (P) = -[1/[4[eth]]] [[integral].[A]] dQ [ik - [[partial derivative]/[[partial derivative]z]]] [[e.sup.ikQP]/[QP]] [equivalent to] [1/2] [[u.sub.RS.sup.(p)] (P) + [u.sub.RS.sup.(s)](P)], z > 0. (2c)

In these equations, A denotes the aperture area, P = (x,y,z) is the point of observation, Q = ([xi],[eta],0) is a point inside the aperture, QP is the distance between these points, dQ is the surface element at Q, k = 2[pi]/[lambda] is the circular wave number, and the time dependence of the field is assumed as [e.sup.-i[omega]t].

Equations (2a,b) were reduced in Ref. [1] to previously unknown single integrals for the respective cases of circular apertures and apertures bounded by straight lines, and these were used for numerical computations of [u.sub.RS.sup.(p)] and [u.sub.RS.sup.(s)] that involved no simplifying assumptions and could be performed for arbitrarily small distances z from these apertures. The numerical results obtained were everywhere finite, free of singularities, and confirmed the well-known prediction that [u.sub.RS.sup.(p)] and [u.sub.RS.sup.(s)] reproduce the boundary values assumed in their derivation ([partial derivative][u.sub.RS.sup.(p)]/[partial derivative]z [right arrow] ik and [u.sub.RS.sup.(s)] [right arrow] 1 as z [right arrow] 0) but not the compatible values ([u.sub.RS.sup.(p)] [right arrow] 1 and [u.sub.RS.sup.(s)]/[partial derivative]z [right arrow] ik) which are implied in the classical postulate that the aperture field is the same as the unperturbed geometrical field incident on the screen. These inconsistencies obscured the differences between the Rayleigh-Sommerfeld and Kirchhoff integrals in the immediate proximity of the screen and made it impossible to assess their physical significance without additional considerations.

This impasse was overcome in Ref. [2] by evaluating Eqs. (2a,b) for the special case of a diffracting half plane and comparing them to the corresponding values, [u.sub.S.sup.(p)] and [u.sub.S.sup.(s)], given by Sommerfeld's rigorous theory of half-plane diffraction [3,4]. The agreement was remarkably good on the positive side of the screen, where the differences ([u.sub.RS.sup.(p,s)] - [u.sub.S.sup.(p,s)]) and their derivatives were negligibly small even at sub-wavelength distances z. Thus, it was decided that the aperture values given by the Rayleigh-Sommerfeld integrals are consistent with Sommerfeld's rigorous theory, so that attempting to improve them would be pointless.

Accordingly, it became apparent that the real problem with the Rayleigh-Sommerfeld and Kirchhoff theories was not their failure to reproduce the assumed boundary values but these boundary conditions themselves. The classical theories involve "inclination factors" which explicitly preclude a backward motion of diffracted light and, thus, any perturbation of the geometrical field on the source side. On the other hand, Sommerfeld's rigorous theory showed that the incident light is modified by diffraction before it reaches the screen, and therefore the notion of an unperturbed incident field is abandoned in this paper by adding a diffraction term to the geometrical field on the source side.

The comparison with Sommerfeld's theory also suggested the need for a further modification of the classical theory. The optical field specified by the rigorous theory is expressed in the form [u.sub.S.sup.(p,s)] = [u.sub.S][+ or -][^.u.sub.S], where [u.sub.S.sup.(p,s)] obey the same boundary conditions as the Rayleigh-Sommerfeld integrals and their components [u.sub.S] and [^.u.sub.S] propagate in the opposite directions of the incident field and its reflection from the screen and are mutually incoherent. In this paper, the Rayleigh-Sommerfeld integrals will likewise be resolved into forward and reverse components defined by

[u.sub.K] = [1/2] ([u.sub.RS.sup.(p)] + [u.sub.RS.sup.(s)]), [^.u.sub.K] = [1/2] ([u.sub.RS.sup.(p)] - [u.sub.RS.sup.(s)]), z > 0, (3a)

where the subscript K is used because the forward wave function on the left-hand side of this equation happens to be the same as the Kirchhoff diffraction integral, Eq. (2c). The effective, time-averaged flow of field energy is then given by the squared moduli of these functions, so that the mutually incoherent, forward and reverse irradiances incident on any given area element dxdy are given by

E = [E.sub.0] | [u.sub.K] |[.sup.2], [^.E] = [E.sub.0] | [^.u.sub.K] |[.sup.2], z > 0. (3b)

Finally, these quantities will be extended into the source space by matching functions so that the overall field is continuously differentiable (2) in the aperture plane and the bidirectional transport of energy through the aperture is also expressed by continuous functions. The modified theory presented in this paper is valid for normally incident light but can easily be adapted for oblique angles of incidence. As will be shown, it becomes indistinguishable from the usual Fresnel approximation in the mid zone z [much greater than] [lambda], and here the latter can be used with confidence.

2. Modified Field Expressions

2.1 Derivations

In addressing the problem of diffraction on the source side of a plane metallic screen illuminated by normally incident parallel light, it is frequently assumed that

[v.sup.(p,s)] = [u.sub.+.sup.(p,s)], z > 0, = [e.sup.ikz] [+ or -] [e.sup.-ikz] [+ or -] [u.sub.-.sup.(p,s)], z < 0, (4a)

where [v.sup.(p,s)] is the total field, [u.sub.[+ or -].sup.(p,s)] denotes the field components due to diffraction, and [e.sup.ikz] [+ or -] [e.sup.-ikz] is the unperturbed geometrical field on the source side. These assumptions appeared first in Rayleigh's papers [5,6] on diffraction by infinitesimally small apertures and show that a continuously differentiable solution for [v.sup.(p,s)] must obey the boundary conditions

[u.sub.-.sup.(p)] = 2 + [u.sub.+.sup.(p)], [[[partial derivative][u.sub.-.sup.(p)]]/[[partial derivative]z]] = [[[partial derivative][u.sub.+.sup.(p)]]/[[partial derivative]z]], z = 0, (4b)

[u.sub.-.sup.(s)] = [u.sub.+.sup.(s)], [[[partial derivative][u.sub.-.sup.(s)]]/[[partial derivative]z]] = 2ik + [[[partial derivative][u.sub.+.sup.(s)]]/[[partial derivative]z]], z = 0. (4c)

These conditions were used by Rayleigh to derive the initial terms of Taylor expansions for [u.sub.[+ or -].sup.(p,s)] for slits and circular apertures with dimensions smaller than the wavelength of light. Additional higher-order terms were calculated by Sommerfeld [4], Bouwkamp [7], and others.

As mentioned above, the Rayleigh-Sommerfeld integrals, Eq. (2a,b), will be retained in this paper by assuming

[u.sub.+.sup.(p,s)] = [u.sub.RS.sup.(p,s)] (x,y,z), z > 0, (5a)

and then the second condition in Eq. (4b) and the first condition in Eq. (4c) will be satisfied by also assuming

[u.sub.-.sup.(p)] = -[u.sub.RS.sup.(p)] (x,y,-z), [u.sub.-.sup.(s)] = [u.sub.RS.sup.(s)](x,y,-z), z < 0. (5b)

However the two remaining conditions in Eqs. (4a,b) are still not satisfied, so that [partial derivative][v.sup.(p)]/[partial derivative]z and [v.sup.(s)] will still be discontinuous in the aperture plane.

This failure of Eqs. (4a) can be attributed to the fact that the Rayleigh-Sommerfeld integrals are composite quantities which can be resolved into the forward and reverse field components [u.sub.K] and [^.u.sub.K] in Eqs. (3a); that is, [u.sub.RS.sup.(p)] = [u.sub.K] + [^.u.sub.K] and [u.sub.RS.sup.(s)] = [u.sub.K] - [^.u.sub.K]. There are no physical reasons why these sums and differences should be continuously differentiable for z = 0, but on the other hand this must be required of [u.sub.K] and [^.u.sub.K] in order to correctly account for a continuous transport of energy through the aperture. To satisfy this requirement, we retain Eq. (5a) but reverse the signs of [e.sup.-ikz] and [u.sub.RS.sup.(s)] in Eqs. (5b), so that

[u.sub.-.sup.(p)] = [u.sub.RS.sup.(p)] (x,y,-z), [u.sub.-.sup.(s)] = -[u.sub.RS.sup.(s)] (x,y,-z), z < 0. (6a)

Hence, by applying Eqs. (3a) and letting v = [1/2] ([v.sup.(p)] + [v.sup.(s)]), [^.v] = [1/2]([v.sup.(p)] - [v.sup.(s)]),

v = [u.sub.K] (x,y,z), z > 0, = [e.sup.ikz] + [^.u.sub.K] (x,y,-z), z < 0, (6b)

[^.v] = [u.sub.K] (x,y,z), z > 0, = -[e.sup.-ikz] + [u.sub.K](x,y,-z), z < 0. (6c)

Now, it may be recalled that the Rayleigh-Sommerfeld integrals obey the boundary conditions assumed in their derivation; that is,

[u.sub.RS.sup.(s)] [equivalent to] ik [[[partial derivative][u.sub.RS.sup.(p)]]/[[partial derivative]z]] = 1 in A, [equivalent to] 0 on S, z [right arrow] 0, (6d)

and hence it follows at once that the scalar field specified by Eqs. (6b,c) is continuously differentiable in the aperture plane. Likewise, the corresponding forward and reverse irradiances,

E = |[u.sub.K] (x,y,z)|[.sup.2], z > 0, = |[e.sup.ikz] + [^.u.sub.K] (x,y,-z)|[.sup.2], z < 0, (7a)

[^.E] = |[^.u.sub.K] (x,y,z)|[.sup.2], z > 0, = |-[e.sup.-ikz] + [u.sub.K] (x,y,-z)|[.sup.2], z < 0, (7b)

are continuously differentiable, thus implying a smooth bidirectional transport of energy through the aperture.

Equations (6b,c) and (7a,b) represent the key findings of this paper. It should be noted that in these expressions the roles of [u.sub.K] and [^.u.sub.K] are reversed on opposite sides of the screen. That is, [^.u.sub.K] appears in the expressions for the forward field quantities [u.sub.K] and E, and vice versa. The general properties of these modified field quantities can readily be predicted from the results reported in Ref. [1]; namely, that the differences between the Rayleigh-Sommerfeld integrals [u.sub.RS.sup.(p)] and [u.sub.RS.sup.(s)] are pronounced only in the immediate proximity of the screen and vanish in the Fresnel approximation. Thus, a bidirectional exchange of energy between the positive and negative sides of the aperture occurs only in the near zone, long as [^.u.sub.K](x,y,|z|) is appreciably different from zero for values of |z| on the order of a few wavelengths. In the Fresnel limit on either side of the aperture plane (|z| [much greater than] [lambda]) the forward and reverse fields are unidirectional, and the forward field is reduced to the standard expressions in terms of Fresnel integrals for slits and Lommel functions for circular apertures [8] for z > 0, and to the unperturbed geometrical field for z < 0. Similarly, the reverse Fresnel field is zero for z > 0 and equal to a Fresnel diffraction pattern superimposed on the reflected geometrical field for z < 0.

2.2 Numerical Examples

2.21 Slits

As an illustration of the behavior of the modified field expression on both sides of the aperture plane, Figs. 1a-c show the forward irradiance profiles [E(x,z) vs x/w] (3) for a slit of width 2 w = 10 [lambda] and for varying distances [+ or -] z from the aperture plane. The numerical values shown in these figures were computed using Eqs. (3a) and (7a) in conjunction with the expressions for [u.sub.RS.sup.(p,s)] derived in Sec. 3.3 of Ref. [1].

Figure 1a shows that, for z = [+ or -] 0.01 [lambda], the modified field irradiances are manifestly continuous inside the aperture (x/w < 1) and that a modulation of the incident field by diffraction also occurs on the opaque portion of the screen (x/w < 1). For z = [+ or -] [lambda], shown in Fig. 1b, the diffraction profile on the positive side of the screen is already significantly altered in that more light is spreading into the shadow, whereas on the negative side the modulation of the field is diminished. Finally, for z = [+ or -] 10 [lambda] as shown in Fig. 1c, the profile on the positive side is similar but not yet equal to the Fresnel approximation (F, shown as a dashed line) and the modulation of the incident field on the negative side is very small. This confirms the expectation that the modified theory affects only the positive and negative near zones in which the Fresnel approximation does not apply. For the slit width assumed here, it is estimated that the Fresnel limit is reached, within 1% or better, for |z| = 100 [lambda].

2.22 Circular Apertures

The numerical data presented in Figs. 2 and 3 illustrate the bidirectionality of the field in the positive and negative near zones of a circular aperture of width 2 w = 10 [lambda]. The data were computed using the mathematical expressions derived in Sec. 3.2 of Ref. [1].

Figures 2a,b show the forward and reverse axial irradiances E(0,z) and [^.E](0,z) for the range -10 [lambda] < z < 10 [lambda]. as given by the closed expressions

E(0, z) = 1 - (1 + [z/W]) cos[k(W - z)] + [1/4](1 + [z/W])[.sup.2], (8a)

[^.E](0,z) = [1/4](1 - [z/W])[.sup.2], W = [square root of ([w.sup.2] - [z.sup.2])] (8b)

which are valid for positive and negative values of z and follow readily from Eqs. (9a,b) of Ref. [1] and Eqs. (7a,b), above. In Fig. 2a, the modification of the incident geometrical field is evidenced by the onset of pronounced oscillations of the forward irradiance E for z < 0. In Fig. 2b, the small but finite values of [^.E] for z < 0 demonstrate again that the energy flow is bidirectional and the reverse field reaches into the positive near zone.

It will be noticed that the reverse axial irradiance [^.E](0,z) in Fig. 2b exhibits no oscillations with respect to z. This is due to the fact, illustrated in Figs. 3a and b, that [^.E](x,z) always has a maximum for x = 0. On the source side, this maximum lies in the reflection shadow and is much smaller than the main diffraction pattern formed in the region x/w > 1.

3. Transmission Coefficients

The transmission coefficient [tau] of a diffracting aperture is defined as the radiant flux transmitted into the positive half space, divided by the radiant flux incident upon it in the limit of geometrical optics. Thus, for a plane aperture of area A and normally incident parallel light of unit irradiance,

[tau] = [1/A] [[integral].[A]] dQ E(Q), (9a)

and for two-dimensional apertures of width 2 w which are centered on the coordinate origin, as discussed in this paper, this is further reduced to

[tau] = [1/2w][w.[integral].[0]]d[xi] |[u.sub.K]([xi],0)|[.sup.2], (9b)

[FIGURE 1 OMITTED]

[FIGURE 2 OMITTED]

where Eq. (3b) was used and the integral over [eta] was evaluated as 2 w. According to Eq. (3a) one finds

[tau] = [1/4w] [w.[integral].[0]]d[xi] |[lim.[z[right arrow]0]][[u.sub.RS.sup.(p)]([xi],z)] + 1|[.sup.2], z > 0. (9c)

Here, the aperture value of [u.sub.RS.sup.(s)] was substituted from Eq. (6d) in order to avoid computational problems that would otherwise arise from singularities for very small values of z. The computation of [u.sub.RS.sup.(p)] involves lesser singularities and could be performed reliably down to z = 0.0003 [lambda]. Trial computations indicated that the limiting value of [tau] defined by Eq. (9c) was reached at the 0.1 % level for z < 0.003 [lambda], and consequently the results presented below were computed for z = 0.001 [lambda].

The numerical results thus obtained for the transmission coefficients of circular apertures and slits are shown in Fig. 4 for the range 0 < kw [less than or equal to] 5 [pi]. In both cases, these transmission coefficients approach the limit [tau] = 0 for kw = 0, and for larger values of kw they exhibit a damped oscillatory behavior. In the case of circular apertures the extremes of [tau] occur near kw = [pi], 2 [pi],..., whereas for slits they are less pronounced and occur near kw = 0.55 [pi], 1.1 [pi],... For large values of kw outside the range shown in Fig. 4 both approach the limit [tau] = 1, and further computations showed that near kw = 100 [pi] the oscillations of [tau] are still on the order of 1 % for circular apertures and less than 0.1 % for slits.

[FIGURE 3 OMITTED]

[FIGURE 4 OMITTED]

These results can be compared to a large number of data that have appeared in the earlier literature for the limiting case of very small apertures, kw [right arrow] 0. In this limit the transmission coefficients shown in Fig. 4 for circular apertures are superficially similar to, but not the same as those computed by Levine and Schwinger [9,10]. In the case of narrow slits the results obtained here do not at all agree with those published by Bouwkamp [7]. These discrepancies will be addressed in a subsequent publication.

4. Concluding Remarks

There are two aspects of the modified theory presented in Sec. 2 that deserve further comments: its failure to account for polarization effects and the appearance of Kirchhoff's integral in the context of a theory in which metallic screens are assumed.

This work was begun in the anticipation that, because of their pseudo-vectorial nature, the Rayleigh-Sommerfeld integrals could be used to analyze the polarization of diffracted light. This anticipation did not materialize. For example, the expressions derived in Ref. [1] for the axial values of [u.sub.RS.sup.(p)] and [u.sub.RS.sup.(s)] pertaining to a circular aperture illuminated by normally incident light differed from each other, although in this case symmetry would dictate the absence of polarization effects. It also seemed odd that the computed values of [u.sub.RS.sup.(p)] and [u.sub.RS.sup.(s)] were consistently different in the near zone but the same in the mid zone, without any indication how the degree of polarization could change during the free-space propagation of light. Thus it appeared that, in spite of the assumption of different boundary conditions for [u.sub.RS.sup.(p)] and [u.sub.RS.sup.(s)], the "polarization" effects predicted by the Rayleigh-Sommerfeld integrals were implausible.

Whereas the analysis of polarization effects was clearly an objective of Rayleigh's work [5,6], there is no indication in Sommerfeld's writings that he had the same goal. In his derivation of Eqs. (2a,b) in Ref. [4], he did not mention polarization at all but stated that separate wave functions and boundary conditions were required to overcome a well-known mathematical inconsistency of Kirchhoff's theory. In his half-plane work [3,4], he took the additional step of expressing the forward and reverse wave functions in terms of the sums and differences of these separate wave functions, thus negating any semblance with a vectorial theory as these expressions would otherwise imply the interference of mutually orthogonal states. Likewise, any association of the Rayleigh-Sommerfeld integrals with polarized light is negated in the present paper by the introduction of Eqs. (3a) in Sec. 1. Accordingly, the appearance of Kirchhoff's integral in the modified theory has no significance apart from the fact that it happens to be the arithmetic mean of [u.sub.RS.sup.(p)] and [u.sub.RS.sup.(s)].

Given the fact that the modified theory no longer pertains to polarized light, a question arises whether it is still limited to metallic screens. It may be observed that the Eqs. (6b) and (7a) for the forward field are in no way altered if the corresponding expressions for the reverse field are simply ignored, as if the screen were "black." Thus, these expressions might also be useful to describe the forward field v produced by a black screen, and similarly it might be possible to describe the diffraction by partially reflecting screens by simply multiplying the reverse field [^.v] by a suitable amplitude reflectance. These ideas are akin to earlier suggestions to define blackness as the absence of reflection [11], but cannot be justified theoretically as a metallic screen was assumed in the first place. However, from a pragmatic point of view it appears that the results obtained in this manner would not be far off, and in this context we recall Sommerfeld's comment [4] that "a slit scratched in a piece of tin foil produces the same diffraction pattern, no matter if it is shiny or has been blackened."

Accepted: September 5, 2004

Available online: http://www.nist.gov/jres

(1) See comments in Sec. 4, below.

(2) That is, continuous with continuous first derivatives. The purpose of the derivations in this paper is to make the overall field continuously differentiable with respect to z. Continuity with respect to x and y is assured as the diffracted components of the field will be expressed in terms of Eqs. (2a-c) and obey the wave equation.

(3) Because of the two-dimensional nature of the diffraction patterns discussed in the remainder of this paper, the y-coordinate will be omitted from here on.

5. References

[1] K. D. Mielenz, J. Res. Natl. Inst. Stand. Technol. 107, 355-362 (2002).

[2] K. D. Mielenz, J. Res. Natl. Inst. Stand. Technol. 108, 57-68 (2003).

[3] A. Sommerfeld, Math. Ann. 47, 317 (1896).

[4] A. Sommerfeld, Optik, Dieterich'sche Verlagsb., Wiesbaden (1950). Transl.: Theory of Optics, Longmans, Green & Co., London etc. (1964).

[5] Lord Rayleigh, Phil. Mag. 43, 259 (1897).

[6] Lord Rayleigh, Proc. Roy. Soc. (A) 89, 194 (1913).

[7] C. J. Bouwkamp, Rep. Progr. Phys. (London) 17, 35 (1953).

[8] K. D. Mielenz, J. Res. Natl. Inst. Stand. Technol. 103, 497-509 (1998).

[9] H. Levine and J. Schwinger, Phys. Rev. 74, 958-974 (1948).

[10] H. Levine and J. Schwinger, Phys. Rev. 75, 1423-1432 (1949).

[11] B. B. Baker and E. T. Copson, The Mathematical Theory of Huygens' Principle, Chelsea Publ. Co., New York (1987), p. 150.

Klaus D. Mielenz

National Institute of Standards and Technology, Gaithersburg, MD 20899-8440

klausm@hereintown.net

About the author: Klaus D. Mielenz is a physicist and retired Chief of the Radiometric Physics Division of the NIST Physics Laboratory. The National Institute of Standards and Technology is an agency of the Technology Administration, U.S. Department of Commerce.

Printer friendly Cite/link Email Feedback | |

Author: | Mielenz, Klaus D. |
---|---|

Publication: | Journal of Research of the National Institute of Standards and Technology |

Date: | Sep 1, 2004 |

Words: | 4149 |

Previous Article: | Evaluation of handheld radionuclide identifiers. |

Next Article: | Development of a high throughput method incorporating traditional analytical devices. |

Topics: |