Loading [MathJax]/jax/output/HTML-CSS/jax.js
Skip to main content
Library homepage
 

Text Color

Text Size

 

Margin Size

 

Font Type

Enable Dyslexic Font
Physics LibreTexts

6: Optics

( \newcommand{\kernel}{\mathrm{null}\,}\)

The bending of light

For the refraction at a surface : nisin(θi)=ntsin(θt) holds where n is the refractive index of the material. Snell’s law is:

n2n1=λ1λ2=v1v2

If Δn1, the change in phase of the light is Δφ=0, if Δn>1 then: Δφ=π. The refraction of light in a material is caused by scattering from atoms. This is described by:

n2=1+nee2ε0mjfjω20,jω2iδω

where ne is the electron density and fj the oscillator strength, for which jfj=1. From this follows that vg=c/(1+(nee2/2ε0mω2)). From this the equation of Cauchy one can derive: n=a0+a1/λ2. More generally, it is possible to expand n as: n=nk=0akλ2k.

For an electromagnetic wave in general: n=εrμr.

The path, followed by a light ray in material can be found from Fermat’s principle:

δ21dt=δ21n(s)cds=0δ21n(s)ds=0

Paraxial geometrical optics

Lenses

The Gaussian lens formula can be deduced from Fermat’s principle with the approximations cosφ=1 and sinφ=φ. For the refraction at a spherical surface with radius R:

n1vn2b=n1n2R

where |v| is the distance of the object and |b| the distance of the image. Applying this twice results in:

1f=(nl1)(1R21R1)

where nl is the refractive index of the lens, f is the focal length and R1 and R2 are the curvature radii of both surfaces. For a double concave lens R1<0, R2>0, for a double convex lens R1>0 and R2<0. Further:

1f=1v1b

D:=1/f is called the dioptric power of a lens. For a lens with thickness d and diameter D to a good approximation: 1/f=8(n1)d/D2. For two lenses placed on a line with distance d between them:

1f=1f1+1f2df1f2

In these equations the following signs are being used for refraction at a spherical surface, as is seen by an incoming light ray:

 

Quantity + -
R Concave surface Convex surface
f Converging lens Diverging lens
v Real object Virtual object
b Virtual image Real image

Mirrors

For images formed by mirrors

1f=1v+1b=2R+h22(1R1v)2

where h is the perpendicular distance from the point the light ray hits the mirror to the optical axis. Spherical aberration can be reduced by not using spherical mirrors. A parabolical mirror has no spherical aberration for light rays parallel with the optical axis and is therefore often used for telescopes. The signs used are:

Quantity + -
R Concave mirror Convex mirror
f Concave mirror Convex mirror
v Real object Virtual object
b Real image Virtual image

Principal planes

The nodal points N of a lens are defined inFigure 6.1. If the lens is surrounded by the same medium on both sides, the nodal points are the same as the principal points H. The plane to the optical axis through the principal points is called the principal plane. If the lens is described by a matrix mij then for the distances h1 and h2 to the boundary of the lens it holds that:

h1=nm111m12  ,   h2=nm221m12

fig-ch01_patchfile_01.jpg
Figure 6.1

Magnification

The linear magnification is defined by: N=bv

The angular magnification is defined by: Nα=αsystαnone

where αsys is the size of the retinal image in the optical system and αnone the size of the retinal image outside the system. Further: NNα=1. For a telescope: N=fobjective/focular holds. The f-number is defined by f/Dobjective.

Matrix methods

A light ray can be described by a vector (nα,y) with α the angle with the optical axis and y the distance to the optical axis. The new position of a light ray interacting with an optical system can be obtained using matrix multiplication:

(n2α2y2)=M(n1α1y1)

where Tr(M)=1. M is a product of elementary matrices. These are:

  1. Transfer along length l: MR=(10l/n1)
  2. Refraction at a surface with dioptric power D: MT=(1D01)

Aberrations

Lenses usually do not give a perfect image. Some causes are:

  1. Chromatic aberration is caused by the fact that n=n(λ). This can be partially corrected with a compound lens which is composed of several lenses with different indicies of refraction ni(λ). Using N lenses makes it possible to obtain the same f for N wavelengths.
  2. Spherical aberration is caused by second-order effects which are usually ignored; a spherical surface does not make a perfect lens. Incoming rays far from the optical axis will bend more. Best form lenses can ameliorate spherical aberration.
  3. Coma is caused by the fact that the principal planes of a lens are only flat near the principal axis. Further from the optical axis they are curved. This curvature can be either positive or negative.
  4. Astigmatism:for each point of an object not on the optical axis the image is an ellipse because the thickness of the lens is not the same everywhere.
  5. Field curvature can be corrected by the human eye.
  6. Distorsion leads to aberration near the edges of the image. This can be corrected with a combination of positive and negative lenses.

Reflection and transmission

If an electromagnetic wave hits a transparent medium part of the wave will reflect at the same angle as the incident angle, and a part will be refracted at an angle according to Snell’s law. It makes a difference whether the E field of the wave is or w.r.t. the surface. When the coefficients of reflection r and transmission t are defined as:

r(E0rE0i) ,  r(E0rE0i) ,  t(E0tE0i) ,  t(E0tE0i)

where E0r is the reflected amplitude and E0t the transmitted amplitude. Then the Fresnel equations are:

r=tan(θiθt)tan(θi+θt)   ,   r=sin(θtθi)sin(θt+θi)

t=2sin(θt)cos(θi)sin(θt+θi)cos(θtθi)   ,   t=2sin(θt)cos(θi)sin(θt+θi)

and the following holds: tr=1 and t+r=1. If the coefficient of reflection R and transmission T are defined as (with θi=θr):

RIrIi   and   TItcos(θt)Iicos(θi)

with I=|S| it follows that: R+T=1. A special case is r=0. This happens if the angle between the reflected and transmitted rays is 90. From Snell’s law it then follows: tan(θi)=n. This angle is called Brewster’s angle. The situation with r=0 is not possible.

Polarization

The polarization is defined as:

P=IpIp+Iu=ImaxIminImax+Imin

where the intensity of the polarized light is given by Ip and the intensity of the unpolarized light is given by Iu. Imax and Imin are the maximum and minimum intensities when the light passes a polarizer. If polarized light passes through a polarizer Malus law applies: I(θ)=I(0)cos2(θ) where θ is the angle of the polarizer.

The state of a light ray can be described by the Stokes-parameters: start with 4 filters where each transmits half the intensity. The first is independent of the polarization, the second and third are linear polarizers with the transmission axes horizontal and at +45, while the fourth is a circular polarizer which is opaque for L-states. Then S1=2I1, S2=2I22I1, S3=2I32I1 and S4=2I42I1.

The state of a polarized light ray can also be described by the Jones vector:

E=(E0xeiφxE0yeiφy)

For the horizontal P-state: E=(1,0), for the vertical P-state E=(0,1), the R-state is given by E=122(1,i) and the L-state by E=122(1,i). The change in polarization of a light beam after passage through the optical train can be described as E2=ME1. For some types of optical equipment the Jones matrix M is given by:

  • Horizontal linear polarizer: M=(1000)
  • Vertical linear polarizer: M=(0001)
  • Linear polarizer at +45 M=12(1111)
  • Lineair polarizer at 45 M=12(1111)
  • 1/4λ plate, fast axis vertical M=eiπ/4(100i)
  • 1/4λ plate, fast axis horizontal M=eiπ/4(100i)
  • Homogenous circular polarizor right M=12(1ii1)
  • Homogenous circular polarizer left M=12(1ii1)

Prisms and dispersion

A light ray passing through a prism is refracted twice and aquires a deviation from its original direction δ=θi+θi+α w.r.t. the incident direction, where α is the apex angle, θi is the angle between the incident angle and a line perpendicular to the surface and θi is the angle between the ray leaving the prism and a line perpendicular to the surface. When θi varies there is an angle for which δ becomes minimal. For the refractive index of the prism now:

n=sin(12(δmin+α))sin(12α)

The dispersion of a prism is defined by:

D=dδdλ=dδdndndλ where the first factor depends on the shape and the second on the composition of the prism. For the first factor it follows that:

dδdn=2sin(12α)cos(12(δmin+α))

For visible light usually dn/dλ<0 holds and shorter wavelengths are more strongly bent than longer ones. The refractive index in this region can usually be approximated by Cauchy’s formula.

Diffraction

Fraunhofer diffraction occurs far away from the source(s). The Fraunhofer diffraction of light passing through multiple slits is described by:

I(θ)I0=(sin(u)u)2(sin(Nv)sin(v))2

where u=πbsin(θ)/λ, v=πdsin(θ)/λ. N is the number of slits, b the width of a slit and d the distance between the slits. The maxima in intensity are given by dsin(θ)=kλ.

The diffraction through a spherical aperture with radius a is described by:

I(θ)I0=(J1(kasin(θ))kasin(θ))2

The diffraction pattern of a rectangular aperture at distance R with length a in the x-direction and b in the y-direction is described by:

I(x,y)I0=(sin(α)α)2(sin(β)β)2

where α=kax/2R and β=kby/2R.

When X rays are diffracted at a crystal  Bragg's Relation holds for the position of  maximum intensity : 2dsin(θ)=nλ where d is the distance between the crystal layers.

Close to the source the Fraunhofer model is invalid because it ignores the angle-dependence of the reflected waves. This is described by the obliquity or inclination factor, which describes the directionality of the secondary emissions: E(θ)=12E0(1+cos(θ)) where θ is the angle w.r.t. the optical axis.

Diffraction limits the resolution of a system. This is the minimum angle Δθmin between two incident rays coming from points far away for which their refraction patterns can be detected separately. For a circular slit: Δθmin=1.22λ/D where D is the diameter of the slit.

For a grating: Δθmin=2λ/(Nacos(θm)) where a is the distance between two peaks and N the number of peaks. The minimum difference between two wavelengths that gives a separated diffraction pattern in a multiple slit geometry is given by Δλ/λ=nN where N is the number of lines and n the order of the pattern.

Special optical effects

  • Birefringence and dichroism. D is not parallel with E if the polarizability P of a material is not equal in all directions. There are at least three directions, the principal axes, in which they are parallel. This results in three refractive indices ni which can be used to construct Fresnel’s ellipsoid. In the case n2=n3n1, which happens e.g. in trigonal, hexagonal and tetragonal crystals there is one optical axis in the direction of n1. Incident light rays can now be split up in two parts: the ordinary wave is linearly polarized the plane through the transmission direction and the optical axis. The extraordinary wave is linearly polarized in the plane through the transmission direction and the optical axis. Dichroism is caused by a differential absorption of the ordinary and extraordinary wave in some materials. Double images occur when the incident ray makes an angle with the optical axis: the extraordinary wave will refract, the ordinary will not.
  • Retarders: waveplates and compensators. Incident light will have a phase shift of Δφ=2πd(|n0ne|)/λ0 if an uniaxial crystal is cut in such a way that the optical axis is parallel with the front and back plane. Here, λ0 is the wavelength in vacuum and n0 and ne the refractive indices for the ordinary and extraordinary wave. For a quarter-wave plate: Δφ=π/2.
  • The Kerr-effect: isotropic, transparent materials can become birefringent when placed in an electric field. In that case, the optical axis is parallel to E. The difference in refractive indecies in the two directions is given by: Δn=λ0KE2, where K is the Kerr constant of the material. If the electrodes have an effective length and are separated by a distance d, the retardation is given by: Δφ=2πKV2/d2, where V is the applied voltage.
  • The Pockels or linear electro-optical effect can occur in 20 (from a total of 32) crystal symmetry classes, namely those without a centre of symmetry. These crystals are also piezoelectric: their polarization changes when a pressure is applied and vice versa: P=pd+ε0χE. The retardation in a Pockels cell is Δφ=2πn30r63V/λ0 where r63 is the 6-3 element of the electro-optic tensor.
  • The Faraday effect: the polarization of light passing through material with length d and to which a magnetic field is applied in the propagation direction is rotated by an angle β=VBd where V is the Verdet constant.
  • Cerenkov radiation arises when a charged particle with vq>vf arrives. The radiation is emitted within a cone with an apex angle α with sin(α)=c/cmedium=c/nvq.

The Fabry-Perot interferometer

For a Fabry-Perot interferometer and in general: T+R+A=1 where T is the transmission factor, R the reflection factor and A the absorption factor. If F is given by F=4R/(1R)2 it follows for the intensity distribution:

ItIi=[1A1R]211+Fsin2(θ)

The term [1+Fsin2(θ)]1:=A(θ) is called the Airy function.

clipboard_e4e6a0e142d90df32a1c8e035cc689e07.png
Figure 6.2: Fabry Perot Interferometer

The width of the peaks at half height is given by γ=4/F. The finesse F is defined as F=12πF. The maximum resolution is then given by Δfmin=c/2ndF.


This page titled 6: Optics is shared under a CC BY license and was authored, remixed, and/or curated by Johan Wevers.

Support Center

How can we help?