6: Optics
( \newcommand{\kernel}{\mathrm{null}\,}\)
The bending of light
For the refraction at a surface : nisin(θi)=ntsin(θt) holds where n is the refractive index of the material. Snell’s law is:
n2n1=λ1λ2=v1v2
If Δn≤1, the change in phase of the light is Δφ=0, if Δn>1 then: Δφ=π. The refraction of light in a material is caused by scattering from atoms. This is described by:
n2=1+nee2ε0m∑jfjω20,j−ω2−iδω
where ne is the electron density and fj the oscillator strength, for which ∑jfj=1. From this follows that vg=c/(1+(nee2/2ε0mω2)). From this the equation of Cauchy one can derive: n=a0+a1/λ2. More generally, it is possible to expand n as: n=n∑k=0akλ2k.
For an electromagnetic wave in general: n=√εrμr.
The path, followed by a light ray in material can be found from Fermat’s principle:
δ2∫1dt=δ2∫1n(s)cds=0⇒δ2∫1n(s)ds=0
Paraxial geometrical optics
Lenses
The Gaussian lens formula can be deduced from Fermat’s principle with the approximations cosφ=1 and sinφ=φ. For the refraction at a spherical surface with radius R:
n1v−n2b=n1−n2R
where |v| is the distance of the object and |b| the distance of the image. Applying this twice results in:
1f=(nl−1)(1R2−1R1)
where nl is the refractive index of the lens, f is the focal length and R1 and R2 are the curvature radii of both surfaces. For a double concave lens R1<0, R2>0, for a double convex lens R1>0 and R2<0. Further:
1f=1v−1b
D:=1/f is called the dioptric power of a lens. For a lens with thickness d and diameter D to a good approximation: 1/f=8(n−1)d/D2. For two lenses placed on a line with distance d between them:
1f=1f1+1f2−df1f2
In these equations the following signs are being used for refraction at a spherical surface, as is seen by an incoming light ray:
Quantity | + | - |
---|---|---|
R | Concave surface | Convex surface |
f | Converging lens | Diverging lens |
v | Real object | Virtual object |
b | Virtual image | Real image |
Mirrors
For images formed by mirrors
1f=1v+1b=2R+h22(1R−1v)2
where h is the perpendicular distance from the point the light ray hits the mirror to the optical axis. Spherical aberration can be reduced by not using spherical mirrors. A parabolical mirror has no spherical aberration for light rays parallel with the optical axis and is therefore often used for telescopes. The signs used are:
Quantity | + | - |
---|---|---|
R | Concave mirror | Convex mirror |
f | Concave mirror | Convex mirror |
v | Real object | Virtual object |
b | Real image | Virtual image |
Principal planes
The nodal points N of a lens are defined inFigure 6.1. If the lens is surrounded by the same medium on both sides, the nodal points are the same as the principal points H. The plane ⊥ to the optical axis through the principal points is called the principal plane. If the lens is described by a matrix mij then for the distances h1 and h2 to the boundary of the lens it holds that:
h1=nm11−1m12 , h2=nm22−1m12

Magnification
The linear magnification is defined by: N=−bv
The angular magnification is defined by: Nα=−αsystαnone
where αsys is the size of the retinal image in the optical system and αnone the size of the retinal image outside the system. Further: N⋅Nα=1. For a telescope: N=fobjective/focular holds. The f-number is defined by f/Dobjective.
Matrix methods
A light ray can be described by a vector (nα,y) with α the angle with the optical axis and y the distance to the optical axis. The new position of a light ray interacting with an optical system can be obtained using matrix multiplication:
(n2α2y2)=M(n1α1y1)
where Tr(M)=1. M is a product of elementary matrices. These are:
- Transfer along length l: MR=(10l/n1)
- Refraction at a surface with dioptric power D: MT=(1−D01)
Aberrations
Lenses usually do not give a perfect image. Some causes are:
- Chromatic aberration is caused by the fact that n=n(λ). This can be partially corrected with a compound lens which is composed of several lenses with different indicies of refraction ni(λ). Using N lenses makes it possible to obtain the same f for N wavelengths.
- Spherical aberration is caused by second-order effects which are usually ignored; a spherical surface does not make a perfect lens. Incoming rays far from the optical axis will bend more. Best form lenses can ameliorate spherical aberration.
- Coma is caused by the fact that the principal planes of a lens are only flat near the principal axis. Further from the optical axis they are curved. This curvature can be either positive or negative.
- Astigmatism:for each point of an object not on the optical axis the image is an ellipse because the thickness of the lens is not the same everywhere.
- Field curvature can be corrected by the human eye.
- Distorsion leads to aberration near the edges of the image. This can be corrected with a combination of positive and negative lenses.
Reflection and transmission
If an electromagnetic wave hits a transparent medium part of the wave will reflect at the same angle as the incident angle, and a part will be refracted at an angle according to Snell’s law. It makes a difference whether the →E field of the wave is ⊥ or ∥ w.r.t. the surface. When the coefficients of reflection r and transmission t are defined as:
r∥≡(E0rE0i)∥ , r⊥≡(E0rE0i)⊥ , t∥≡(E0tE0i)∥ , t⊥≡(E0tE0i)⊥
where E0r is the reflected amplitude and E0t the transmitted amplitude. Then the Fresnel equations are:
r∥=tan(θi−θt)tan(θi+θt) , r⊥=sin(θt−θi)sin(θt+θi)
t∥=2sin(θt)cos(θi)sin(θt+θi)cos(θt−θi) , t⊥=2sin(θt)cos(θi)sin(θt+θi)
and the following holds: t⊥−r⊥=1 and t∥+r∥=1. If the coefficient of reflection R and transmission T are defined as (with θi=θr):
R≡IrIi and T≡Itcos(θt)Iicos(θi)
with I=⟨|→S|⟩ it follows that: R+T=1. A special case is r∥=0. This happens if the angle between the reflected and transmitted rays is 90∘. From Snell’s law it then follows: tan(θi)=n. This angle is called Brewster’s angle. The situation with r⊥=0 is not possible.
Polarization
The polarization is defined as:
P=IpIp+Iu=Imax−IminImax+Imin
where the intensity of the polarized light is given by Ip and the intensity of the unpolarized light is given by Iu. Imax and Imin are the maximum and minimum intensities when the light passes a polarizer. If polarized light passes through a polarizer Malus law applies: I(θ)=I(0)cos2(θ) where θ is the angle of the polarizer.
The state of a light ray can be described by the Stokes-parameters: start with 4 filters where each transmits half the intensity. The first is independent of the polarization, the second and third are linear polarizers with the transmission axes horizontal and at +45∘, while the fourth is a circular polarizer which is opaque for L-states. Then S1=2I1, S2=2I2−2I1, S3=2I3−2I1 and S4=2I4−2I1.
The state of a polarized light ray can also be described by the Jones vector:
→E=(E0xeiφxE0yeiφy)
For the horizontal P-state: →E=(1,0), for the vertical P-state →E=(0,1), the R-state is given by →E=12√2(1,−i) and the L-state by →E=12√2(1,i). The change in polarization of a light beam after passage through the optical train can be described as →E2=M⋅→E1. For some types of optical equipment the Jones matrix M is given by:
- Horizontal linear polarizer: M=(1000)
- Vertical linear polarizer: M=(0001)
- Linear polarizer at +45∘ M=12(1111)
- Lineair polarizer at −45∘ M=12(1−1−11)
- 1/4−λ plate, fast axis vertical M=eiπ/4(100−i)
- 1/4−λ plate, fast axis horizontal M=eiπ/4(100i)
- Homogenous circular polarizor right M=12(1i−i1)
- Homogenous circular polarizer left M=12(1−ii1)
Prisms and dispersion
A light ray passing through a prism is refracted twice and aquires a deviation from its original direction δ=θi+θi′+α w.r.t. the incident direction, where α is the apex angle, θi is the angle between the incident angle and a line perpendicular to the surface and θi′ is the angle between the ray leaving the prism and a line perpendicular to the surface. When θi varies there is an angle for which δ becomes minimal. For the refractive index of the prism now:
n=sin(12(δmin+α))sin(12α)
The dispersion of a prism is defined by:
D=dδdλ=dδdndndλ where the first factor depends on the shape and the second on the composition of the prism. For the first factor it follows that:
dδdn=2sin(12α)cos(12(δmin+α))
For visible light usually dn/dλ<0 holds and shorter wavelengths are more strongly bent than longer ones. The refractive index in this region can usually be approximated by Cauchy’s formula.
Diffraction
Fraunhofer diffraction occurs far away from the source(s). The Fraunhofer diffraction of light passing through multiple slits is described by:
I(θ)I0=(sin(u)u)2⋅(sin(Nv)sin(v))2
where u=πbsin(θ)/λ, v=πdsin(θ)/λ. N is the number of slits, b the width of a slit and d the distance between the slits. The maxima in intensity are given by dsin(θ)=kλ.
The diffraction through a spherical aperture with radius a is described by:
I(θ)I0=(J1(kasin(θ))kasin(θ))2
The diffraction pattern of a rectangular aperture at distance R with length a in the x-direction and b in the y-direction is described by:
I(x,y)I0=(sin(α′)α′)2(sin(β′)β′)2
where α′=kax/2R and β′=kby/2R.
When X rays are diffracted at a crystal Bragg's Relation holds for the position of maximum intensity : 2dsin(θ)=nλ where d is the distance between the crystal layers.
Close to the source the Fraunhofer model is invalid because it ignores the angle-dependence of the reflected waves. This is described by the obliquity or inclination factor, which describes the directionality of the secondary emissions: E(θ)=12E0(1+cos(θ)) where θ is the angle w.r.t. the optical axis.
Diffraction limits the resolution of a system. This is the minimum angle Δθmin between two incident rays coming from points far away for which their refraction patterns can be detected separately. For a circular slit: Δθmin=1.22λ/D where D is the diameter of the slit.
For a grating: Δθmin=2λ/(Nacos(θm)) where a is the distance between two peaks and N the number of peaks. The minimum difference between two wavelengths that gives a separated diffraction pattern in a multiple slit geometry is given by Δλ/λ=nN where N is the number of lines and n the order of the pattern.
Special optical effects
- Birefringence and dichroism. →D is not parallel with →E if the polarizability →P of a material is not equal in all directions. There are at least three directions, the principal axes, in which they are parallel. This results in three refractive indices ni which can be used to construct Fresnel’s ellipsoid. In the case n2=n3≠n1, which happens e.g. in trigonal, hexagonal and tetragonal crystals there is one optical axis in the direction of n1. Incident light rays can now be split up in two parts: the ordinary wave is linearly polarized ⊥ the plane through the transmission direction and the optical axis. The extraordinary wave is linearly polarized in the plane through the transmission direction and the optical axis. Dichroism is caused by a differential absorption of the ordinary and extraordinary wave in some materials. Double images occur when the incident ray makes an angle with the optical axis: the extraordinary wave will refract, the ordinary will not.
- Retarders: waveplates and compensators. Incident light will have a phase shift of Δφ=2πd(|n0−ne|)/λ0 if an uniaxial crystal is cut in such a way that the optical axis is parallel with the front and back plane. Here, λ0 is the wavelength in vacuum and n0 and ne the refractive indices for the ordinary and extraordinary wave. For a quarter-wave plate: Δφ=π/2.
- The Kerr-effect: isotropic, transparent materials can become birefringent when placed in an electric field. In that case, the optical axis is parallel to →E. The difference in refractive indecies in the two directions is given by: Δn=λ0KE2, where K is the Kerr constant of the material. If the electrodes have an effective length ℓ and are separated by a distance d, the retardation is given by: Δφ=2πKℓV2/d2, where V is the applied voltage.
- The Pockels or linear electro-optical effect can occur in 20 (from a total of 32) crystal symmetry classes, namely those without a centre of symmetry. These crystals are also piezoelectric: their polarization changes when a pressure is applied and vice versa: →P=pd+ε0χ→E. The retardation in a Pockels cell is Δφ=2πn30r63V/λ0 where r63 is the 6-3 element of the electro-optic tensor.
- The Faraday effect: the polarization of light passing through material with length d and to which a magnetic field is applied in the propagation direction is rotated by an angle β=VBd where V is the Verdet constant.
- Cerenkov radiation arises when a charged particle with vq>vf arrives. The radiation is emitted within a cone with an apex angle α with sin(α)=c/cmedium=c/nvq.
The Fabry-Perot interferometer
For a Fabry-Perot interferometer and in general: T+R+A=1 where T is the transmission factor, R the reflection factor and A the absorption factor. If F is given by F=4R/(1−R)2 it follows for the intensity distribution:
ItIi=[1−A1−R]211+Fsin2(θ)
The term [1+Fsin2(θ)]−1:=A(θ) is called the Airy function.

The width of the peaks at half height is given by γ=4/√F. The finesse F is defined as F=12π√F. The maximum resolution is then given by Δfmin=c/2ndF.