4.1: Polarized Light and the Stokes Parameters
- Page ID
- 7369
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)Suppose that we wish to characterize a beam of parallel monochromatic light. A description of it should include the following.
* Its wavelength or frequency. Its wavelength depends upon the refractive index of the material in which it is travelling, whereas its frequency does not. Therefore, if the wavelength is given, the medium must be specified. It may not always be realized, but most tables of wavelengths of spectrum lines in the visible region of the spectrum are given for air and not for a vacuum. [Actually for something called “Standard Air” - details of which may be found in http://orca.phys.uvic.ca/~tatum/stellatm/atm7.pdf ] Specifying the frequency rather than the wavelength removes possible ambiguity. Spectroscopists often quote the wavenumber in vacuo, which is the reciprocal of the vacuum wavelength.
* Its flux density in W m−2. This is related to the electric field strength of the electromagnetic wave, in a manner that will be discussed later in the chapter.
* Its state of polarization. In this chapter, polarized light will in general be taken to mean elliptically polarized light, which includes circularly and linearly (plane) polarized light as special cases. The state of polarization can be described by specifying
* the eccentricity of the polarization ellipse
* the orientation of the polarization ellipse
* the chirality (handedness) of the polarization ellipse
* whether the polarization is total or partial, and, if partial, the degree of polarization.
Up to and including Equation (\(\ref{A15}\)) (page 8) we shall assume that the polarization is total. We shall look at partial polarization after that.
Polarized light is generally described by supposing that, at some point in space, the tip of the vector that represents the strength of the electric field describes a Lissajous ellipse (Figure IV.1).
In the drawing the semi major axis a represents the greatest value of the electric field strength, in volts per metre, during a cycle, and the semi minor axis \( b\) represent the least value of the electric field strength during the cycle. If you prefer, you could use symbols such as \( E_{max}\) and \( E_{min}\) instead of \( a\) and \( b\).
In order to describe the ellipse, we need to describe its size, its shape, its orientation and its chirality or handedness (i.e., whether the vector is rotating clockwise or counterclockwise).
The natural way of doing this is to give the length \( a\) of the semi major axis (in volts per metre), the eccentricity of the (\( e\ =\ \sqrt{1-\frac{b^{2}}{a^{2}}}\)) , the angle \( \theta\) that the major axis makes with the horizontal, and perhaps one of the words "clockwise" or "counterclockwise". It will be necessary, however, to make clear whether you, the observer, are looking towards the source of light, or are looking in the direction of travel of the light. Not everyone uses the same convention in this matter, and the onus is on the writer to make clear which convention he or she is using. In this chapter I shall assume that we are looking towards the source of the light. In Figure IV.1, I have drawn the ellipse with \( \frac{b}{a}\ =\ \frac{1}{2}\) (\( e\ =\ \frac{\sqrt{3}}{2}\ =\ 0.8660\)) and \( \theta\ =\ 30^{\circ}\).
[Since I wrote the above paragraph, I received in December 2015 a memorandum from the International Astronomical Union stating that there has long been an IAU convention that position angle is to be reckoned positive in the counterclockwise direction for an observer looking towards the source of light. This is in fact the convention that I use in these notes. The IAU memorandum, however, pointed out that some scientists who investigate the polarization of the Cosmic Background Radiation have been using the opposite convention, and consequently the IAU reiterates its recommendation that all astronomers, including those working on the CBR, use the above convention. This is a good example of what I meant in the previous paragraph. I would emphasize that, even although there is an IAU convention - one which I strongly support - it is incumbent upon YOU, to make certain, if you wish your readers to understand you, to make it unambiguously clear, whenever you write about polarization, as to what convention you are using. And don’t just say “the IAU convention”. Say that angles are reckoned positive if increasing counterclockwise when you are facing towards the source of light. I hope that referees and editors will enforce this!]
We noted above that the flux density of the beam is related to the electric field strength of the electromagnetic wave. In this paragraph and the next we explore this relation. Suppose, for ) example, that the light is plane polarized, and that the maximum value of the electric field is \( \hat{E}\) volts. Its mean square value during a cycle is \( \overline{E\ ^{2}}\ =\ \frac{1}{2}\hat{E}\ ^{2}\). The energy per unit volume is \( \frac{1}{2}\ \epsilon\ \overline{E\ ^{2}}\ =\ \frac{1}{4}\ \epsilon\ \hat{E}\ ^{2}\ \text{J m}^{-3}\), where \( \epsilon\) is the permittivity of the medium in which the radiation is travelling. If it is moving at speed \( v\), the flux density of the beam is \( \frac{1}{4}\ v\epsilon\ \hat{E}\ ^{2}\ \ \text{W m}^{-2}\). The speed of an electromagnetic wave in a medium of permittivity \( \epsilon\) and permeability \( \mu\) is given by \( v\ =\ \frac{1}{\sqrt{\epsilon\mu}}\), so this expression becomes \( \frac{1}{4}\sqrt{\frac{\epsilon}{\mu}}\hat{E}\ ^{2}\ =\ \frac{\hat{E}\ ^{2}}{4Z}\), where \( Z\ =\ \sqrt{\frac{\mu}{\epsilon}}\) is the impedance (in the sense used in electromagnetic theory) of the medium. For most transparent media, \( \mu\) is very close to \( \mu_{0}\). the permeability of free space. This is not the case for the permittivity, which usually ranges from 1 up to a few tens of times \( \epsilon_{0}\). For a vacuum, the impedance has a value of about 377\( \Omega\).
If the light is elliptically polarized, the expression for the flux density will be \( \frac{a^{2}\ +\ b^{2}}{4Z}\), where \( a\) and \( b\) are the electric fields described in earlier paragraphs. That the \( \hat{E}\ ^{2}\) for plane polarized light can be replaced by \( a^{2}\ +\ b^{2}\) for elliptically polarized light should become apparent later while discussing the director circle property of an ellipse.
While these parameters may be the obvious ones to use in describing the state of polarization, the fact is that none of them is directly measurable. What we can measure relatively easily is the intensity of the light when viewed through a polarizing filter oriented at various angles. What we can measure are four parameters known as the Stokes parameters, which we shall describe shortly. We can measure the Stokes parameters, and it will then be our task to determine from these the eccentricity, orientation and chirality of the polarization ellipse, and the degree of polarization.
Before describing them, a word about notation.
The traditional symbols used to describe the Stokes parameters are IQUV. These may seem somewhat haphazard, so some modern authors prefer a more systematic \( S_{1},\ S_{2},\ S_{3},\ S_{4}\) while some prefer \( S_{0},\ S_{1},\ S_{2},\ S_{3}\). If you use the modern \( S\) notation, I would (strongly) recommend \( S_{0},\ S_{1},\ S_{2},\ S_{3}\) over \( S_{1},\ S_{2},\ S_{3},\ S_{4}\). In these notes, however, I shall be old-fashioned and I shall use IQUV , which at least has the advantage of avoiding the ambiguity over the two possible \( S\) notations, and you will not have to worry which version I am using.
In the figure the lines represent the component of the electric field passed by the filter. The lengths of the long organic molecules embedded within the filter are perpendicular to this transmission direction. Light (i.e. an oscillating electromagnetic field) that is oscillating parallel to the lengths of these molecules is strongly absorbed, because of the highly anisotropic polarizability of these molecules.
Perhaps we can measure the intensity of the light after passage through the filter at each of these angles, and also without the filter, and somehow determine from these measurements the shape and orientation of the polarization ellipse.
The Stokes parameters are named after a nineteenth century British physicist, Sir George Stokes, and may be referred to as Stokes's parameters, Stokes' parameters or the Stokes parameters, but not, of course, as Stoke's parameters.
Let us imagine that we have in our hand a flux meter, and that it can measure the flux density, in W m−2 of our parallel beam of monochromatic light. While we would prefer to use the symbol \( F\) for flux density, in fact the flux density of the unobstructed light is the first of the Stokes parameters, for which the traditional symbol is I (and whose modern symbol is \( S_{0}\) or \( S_{1}\), depending on which book you are reading.)
Now let us suppose that we measure the flux density of the light after passage through a polarizing filter oriented at various angles as suggested in Figure IV.2. The second and third Stokes parameters, then, are defined by
\[ \textbf{Q}\ =\ F_{0}\ -\ F_{90} \tag{1}\label{4.1}\]
and
\[ \textbf{U}\ =\ F_{45}\ -\ F_{135} \tag{2}\label{4.2}\]
Unless you are fortunate or rich, it is unlikely that your little flux meter will accurately measure the flux densities in absolute SI units in W m−2. Therefore those of us of more modest means will just have to be content with dimensionless Stokes parameters - measured in units so that the unobstructed flux density is 1. We define the dimensionless Stokes parameters (for which I use a different font) by
\[ Q\ =\ \frac{F_{0}\ -\ F_{90}}{F}\ =\ \bf{\frac{\text{Q}}{\text{I}}} \tag{3}\label{4.3}\]
\[ U\ =\ \frac{F_{45}\ -\ F_{135}}{F}\ =\ \bf{\frac{\text{U}}{\text{I}}} \tag{4}\label{4.4}\]
Thus, for the dimensioned Stokes parameters in W m−2 (which we may not easily be able to measure), I use IQUV. For the dimensionless Stokes parameters, I use \( QUV\). (There is no need for a dimensionless \( I\), because it is 1.)
It is possible to determine the eccentricity \( e\) and the inclination \( \theta\) of the polarization ellipse from \( Q\) and \( U\). Here I give the relations without derivation. I shall give a derivation in an Appendix to this chapter. For the time being, then, here are the relations:
\[ Q\ =\ \frac{e^{2}\cos 2\theta}{2\ -\ e^{2}} \tag{5}\label{4.5}\]
\[ U\ =\ \frac{e^{2}\sin 2\theta}{2\ -\ e^{2}} \tag{6}\label{4.6}\]
Perhaps of more interest are the converses of these:
\[ e^{2}\ =\ \frac{2\sqrt{Q^{2}\ +\ U^{2}}}{1\ +\ \sqrt{Q^{2}\ + \ U^{2}}} \tag{7}\label{4.7}\]
\[ \tan 2\theta\ =\ \frac{U}{Q} \tag{8}\label{4.8}\]
In solving Equation (\( \ref{4.8}\)) for \( \theta\), it is necessary to know the signs of \( U\) and \( Q\) separately, in order to avoid an ambiguity of quadrant. Provision of the \( \arctan 2\) function in a calculator or computer greatly facilitates this.
The table below shows a sample of polarization ellipses for various combinations of \( Q\) and \( U\). For reasons that will become apparent during the derivation of the formulas in the Appendix, all of the ellipses are drawn such that \( a^{2}\ +\ b^{2}\) is the same for each. This ensures that the flux density is the same for each.
Thus far we have dealt with the Stokes parameters I (related to the flux density of the light), and \( Q\) and \( U\) (related to the shape and orientation of the polarization ellipse). Now we have to describe the Stokes parameter V, and how it is related to the chirality (handedness) of the ellipse. In this account, when I use the words “clockwise” and “counterclockwise” I shall assume that we are looking towards the source of light.
If we really want to know the polarity, we need to have a good research grant and to be in possession of a filter that passes only circularly polarized light. A linear polarizer in conjunction with a quarter-wave plate will do it. I shall take it that the filter passes only light that is circularly polarized in the clockwise sense. Suppose the flux density after passage through such a filter is \( F_{C}\). The Stokes V parameter is defined as
\[ \textbf{V}\ =\ 2F_{C}\ -\ I, \tag{9}\label{4.9}\]
or, in dimensionless form,
\[ V\ =\ \frac{2F_{C}}{F}\ -1. \tag{10}\label{4.10}\]
It will be observed that this parameter (like the others) ranges from −1 (if \( F_{C}=0\)) to +1 (if \( F_{C}=1\)), and hence that negative \( V\) implies counterclockwise polarization, and positive \( V\) implies clockwise polarization. We shall also show in the Appendix, that (subject to an important condition - see below), \( V\) is related to the eccentricity by
\[ V^{2}\ =\ \frac{4(1-e^{2})}{(2-e^{2})^{2}}. \tag{11}\label{4.11}\]
This means that \( V=0\) implies \( e=1\), and hence linear polarization (for which there is no chirality). Also, \( V^{2}=1\) implies \( e=1\), and hence circular polarization. Conversely
\[ e^{2}=\frac{2(-1+V^{2}+\sqrt{1-V^{2}})}{V^{2}} \tag{12}\label{4.12}\]
Thus one can determine both the chirality and the eccentricity (but not \( \theta\)) from \( V\) alone. Figure IV.3 shows the relation between \( |V|\) and \( e\).
This redundancy must mean that \( Q\),\( U\) and \( V\) are not independent, and indeed it will be observed from equations (\( \ref{4.5}\)), (\( \ref{4.6}\)) and (\( \ref{4.11}\)) that
\[ Q^{2}\ +\ U^{2}\ +\ V^{2}\ =\ 1. \tag{13}\label{4.13}\]
In terms of the dimensioned Stokes parameters:
\[ \bf{Q^{2}\ +\ U^{2}\ +\ V^{2}\ =\ I^{2}}. \tag{14}\label{4.14}\]
In one of the \( S\) notations, this would conveniently be
\[ S^2_1 + S^2_2 + S^2_3 = S^2_0. \tag{15}\label{4.15}\]
Just before Equation (\( \ref{4.11}\)) we referred to an important condition. Equations (\( \ref{4.11}\)) - (\( \ref{4.15}\)), and Figure IV.3, are valid only for the case of total elliptical polarization. The case of partial polarization is discussed in what follows. The section on partial polarization should not be thought of as a relatively unimportant afterthought, because most sources of polarized light that one comes across are more likely to be partially polarized rather than totally polarized.
Partial Polarization
Until this point we have assumed that we have been concerned with a single coherent wave with one well-defined polarization state. In practice, we rarely see this, and we more often have to deal with partially polarized light. Most of us have a fairly good idea of what is meant by light that is partially plane polarized horizontally. We mean that the light is mostly like this:
but there’s also a little bit of this:
But if that were so with two coherent waves, this would result, if they were in phase, in this:
or if they were not in phase, in this:
In truth, unless we are looking at a coherent light source, such as a laser, partially polarized light might be more like this:
This is partially plane polarized at about an angle of 30º, but it is clearly not totally plane polarized. Partially polarized light can be described as the sum of a totally polarized component plus an unpolarized component. Thus we might describe the situation illustrated above by something like this:
Partially elliptical polarized light might be described by a totally elliptically polarized component, plus an unpolarized component:
If we could somehow separately measure the flux densities of the polarized (p) and unpolarized (u) components, we could define the degree of polarization by
\[ p\ =\ \frac{F_{p}}{F_{p}\ +\ F_{u}} \tag{16}\label{4.16}\]
If we know that the light is partially plane (linearly) polarized, as in Figure IV.5 (rather than elliptically polarized as in Figure IV.6), we can measure this rather easily. Place the polarizing filter in front of the source, and rotate it until the transmitted flux density goes through a maximum, \( F_{\text{max}}\). and then through a further 90º until it goes through a minimum, \( F_{\text{min}}\). This will give you the degree of polarization from.
\[ p\ =\ \frac{F_{\text{max}}\ -\ F_{\text{min}}}{F_{\text{max}}\ +\ F_{\text{min}}}. \tag{17}\label{4.17}\]
and of course it also gives you the polarization angle. This applies, of course, only to light that you know to be partially linearly polarized. It will not do for partially elliptically polarized light.
Recall that
\[ Q\ =\ \frac{F_{0}\ -\ F_{90}}{F}\ =\ \bf{\frac{\text{Q}}{\text{I}}} \tag{3}\label{4.3b}\]
and
\[ U\ =\ \frac{F_{45}\ -\ F_{135}}{F}\ =\ \bf{\frac{\text{U}}{\text{I}}} \tag{4}\label{4.4b}\]
If the source is partially plane polarized, each of the measurements \( F_{0},F_{90},F_{45},F_{135}\) includes a total linear or elliptical component, and an unpolarized component. However, the unpolarized component is the same for each of these four measurements. Consequently Q and U describe the “total” component only. Thus all equations up to and including Equation (\( \ref{4.8}\)), as well as the table illustrating the shape of the ellipse as a function of Q and U, are still valid for the “total” component.
The parameter \( V\), however, was defined in Equations 9 and 10 by
\[ \textbf{V}\ =\ 2F_{C}\ -\ I, \tag{9}\label{4.9b}\]
or, in dimensionless form,
\[ V\ =\ \frac{2F_{C}}{F}\ -1. \tag{10}\label{4.10b}\]
\( F_{C}\) and \( F\) each contain a “total” and an unpolarized component, so that, unlike \( Q\) and \( U\), the “total” component is not separated out.
Recall from Equations (\( \ref{4.13}\)) and (\( \ref{4.14}\)) that \( \bf{I\ =\ \sqrt{Q^{2}\ +\ U^{2}\ +\ V^{2}}}\) and \( Q^{2}\ +\ U^{2}\ +\ V^{2}\ =\ 1\).
These were derived for totally elliptically (which includes linearly) polarized light. For light that is partially polarized, it applies only to the “total” part, so that, for partially polarized light,
\[ \text{p}\ =\ \sqrt{Q^{2}\ +\ U^{2}\ +\ V^{2}}. \tag{18}\label{4.18}\]
From Equations (\( \ref{4.5}\)), (\( \ref{4.6}\)) and (\( \ref{4.18}\)) we determine that
\[ \text{p}\ =\ \sqrt{V^{2}\ +\ \frac{e^{4}}{(2-e^{2})^{2}}} \tag{19}\label{4.19}\]
Thus from the measurements of \( F_{0},F_{90},F_{45},F_{135}\) and their combinations \( IQUV\) we have determined, for partially polarized light, the degree of polarization, and the eccentricity, orientation and chirality of the polarization ellipse.
Equation (\( \ref{4.18}\)) suggests that that the state of polarization of light can be described by a point in \( QUV\) space . This concept is described by the Poincaré sphere:
In this context I have often seen the notation \( 2\psi\) for \( \phi\) and \( 2\chi\) for 90º − \( \theta\). (The \( \theta\) here, of course, is not the same as the \( \theta\) of Figure IV.1.
Let us suppose, to begin with, that we have total polarization, so that \( p=1\). The reader is invited to imagine the shape of the polarization ellipse at any point on the surface of the sphere. Recall in particular that \( V=0\) implies linear polarization, and \( V=\pm1\) implies circular polarization. Thus anywhere around the equator of the Poincaré represents linear polarization, and at the poles we have circular polarization.
Let us look along the meridian of longitude with \( \phi\) = 0 (\( U\)= 0). As we go from the “north pole” to the “south pole”, \( V\) goes from +1 (circular) through 0 (linear) to −1 (circular), and \( Q\) goes from 0 (circular) through 1 (linear) to 0 (circular). It will be useful (essential) to refer to the table on page 5.
The reader is now invited to think about (while referring to the table on page 5) the situation along the meridian with φ = 90º. And then to try other meridians, eventually covering the sphere with ellipses. This is a little beyond my artistic ability, but I found a very good one by Googling for Poincaré sphere. Choose “Images for poincare sphere”. There are some excellent images there. I particularly like the orange-coloured one from University of Arizona. If you click on it, the sphere rotates, and you can see all round the sphere.
APPENDIX
In the article above I described the Stokes parameters, and I related them to the shape, orientation and chirality of the polarization ellipse, as follows (for total polariazation):
\( Q\ =\ \frac{e^{2}\cos2\theta}{2-e^{2}} \qquad U\ =\ \frac{e^{2}\sin2\theta}{2-e^{2}} \qquad V^{2}\ =\ \frac{4(1-e^{2})}{(2-e^{2})^{2}}\)
In this Appendix, I derive these relations.
Before starting, let us remind ourselves of an established property of an ellipse of semi major and semi minor axes \( a\) and \( b\), namely that the locus of the corners of all circumscribing rectangles to an ellipse is a circle, known as the director circle, which is of radius \( \sqrt{a^{2}+b^{2}}\). This is illustrated in Figure A1, in which I have drawn three circumscribing rectangles. The semidiagonals of all the circumscribing rectangles are of the same length, namely \( \sqrt{a^{2}+b^{2}}\). A proof of this theorem is to be found in http://orca.phys.uvic.ca/~tatum/celmechs/celm2.pdf , Section 2.3, or in many books on the properties of the conic sections.
Recall now the meanings of \( a\) and \( b\). They are the semi major and semi minor axes of the ellipse, but they are also the greatest and least values of the electric field during a cycle. Recall also that the energy per unit volume of an electric field is proportional to the square of the electric field strength. When the light is observed direct without the intervention of a polarizing filter, the flux density of the light is proportional, then, to \( a^{2}+b^{2}\). That is to say, the Stokes parameter \( I\) is proportional to the square of the radius of the director circle.
In what follows, we shall have occasion to refer the polarization ellipse to three rectangular coordinate systems.
i. A coordinate system (\( x, y\)), in which the axes of coordinates coincide with the axes of the polarization ellipse.
ii. A coordinate system (\( x_{1}, y_{1}\)), in which the axes of coordinates are horizontal and vertical - or, to more precise, parallel to the transmission axes of the first two filters illustrated in Figure IV.1.
iii. A coordinate system (\( x_{2}, y_{2}\)), in which the axes of coordinates are parallel to the transmission axes of the last two filters illustrated in Figure IV.1.
The ellipse referred to these three coordinate systems is shown in Figures A2, A3, A4. In each of these drawings, I have drawn a circumscribing rectangle and the director circle. The flux density of the radiation is proportional to the square of the rectangle diagonal, which is the same in all three drawings, and is equal to the diameter of the director circle, namely \( 2\sqrt{a^{2}\ +\ b^{2}}\).
I have also indicated the lengths \( a,\ b,\ a_{1},\ b_{1},\ a_{2},\ b_{2}\) in these drawings. These represent the maximum values of the component of the electric field during a cycle in the directions of the six axes. Indeed, the reader might even prefer an alternative notation:
\( a\ =\ \hat{E}_{x}\)
\( b\ =\ \hat{E}_{y}\)
\( a_{1}\ =\ \hat{E}_{x_{1}}\)
\( b_{1}\ =\ \hat{E}_{y_{1}}\)
\( a_{2}\ =\ \hat{E}_{x_{2}}\)
\( b_{2}\ =\ \hat{E}_{y_{2}}\)
The first notation is easier for the analysis of the geometry of the ellipse. The second notation reminds us of the physical meaning of the symbols. Indeed the readings of our flux meter are proportional, successively, to \( \hat{E}_{x}\ ^{2}\ +\ \hat{E}_{y}\ ^{2},\ \hat{E}_{x_{1}}\ ^{2}\ ,\ \hat{E}_{y_{1}}\ ^{2},\ \hat{E}_{x_{2}}\ ^{2}\ ,\ \hat{E}_{y_{2}}\ ^{2},\) or, in the \( a,b\) notation \( a^{2}\ +\ b^{2},\ a_{1}^{2},\ \ b_{1}^{2},\ a_{2}^{2}\ ,\ b_{2}^{2},\). The Stokes parameters \( I,Q,U\) are proportional successively to \( \hat{E}_{x}\ ^{2}\ +\ \hat{E}_{y}\ ^{2},\ \hat{E}_{x_{1}}\ ^{2}\ -\ \hat{E}_{y_{1}}\ ^{2},\ \hat{E}_{x_{2}}\ ^{2}\ -\ \hat{E}_{y_{2}}\ ^{2},\), or in the \( a,b\) notation, \( a^{2}\ +\ b^{2},\ a_{1}^{2}\ -\ b_{1}^{2},\ a_{2}^{2}\ -\ b_{2}^{2}\).
Refer to Figure A2. The equation to the ellipse, referred to this coordinate system, is the familiar
\[ \frac{x^{2}}{a^{2}}\ +\ \frac{y^{2}}{b^{2}}\ =\ 1, \tag{A1}\label{A1}\]
However, I want to express lengths (electric field strengths) in units such that \( a^{2}\ +\ b^{2}\ =1\), and, further, I want to write the equation in terms of the eccentricity \( e\ =\ \sqrt{1-\frac{b^{2}}{a^{2}}}\). In that case, Equation (\( \ref{A1}\)) becomes
\[ fx^{2}\ +\ \text{g}y^{2}\ =\ 1 \tag{A2}\label{A2}\]
where
\[ f\ =\ 2\ -\ e^{2}\ \text{and}\ \text{g}\ =\ \frac{2-e^{2}}{1-e^{2}}. \tag{A3}\label{A3}\]
Now refer to Figure A3. If the major axis of the ellipse makes an angle \( \theta\) with the horizontal, the coordinate systems are related by
\[ \begin{pmatrix}x \\ y \end{pmatrix}\ =\ \begin{pmatrix}c & s \\ -s & c\end{pmatrix}\begin{pmatrix}x_{1} \\ y_{1} \end{pmatrix}, \tag{A4}\label{A4}\]
\( c = \cos \theta\) and \(s = \sin \theta\)
On making use of equations (\(\ref{A2}\)) and (\(\ref{A4}\)), we find that the equation to the ellipse referred to the \((x_1 , y_1)\) coordinate system is
\[(fc^2 + gs^2)x^2_1 - 2(g-f)scx_1y_1 + (fs^2+gc^2)y^2_1 = 1 \tag{A5}\label{A5}\]
We now wish to find \(a_1 = \hat{E}_{x_1}\) and \(b_1 = \hat{E}_{y_1}\), the maximum horizontal and vertical components of the electric field. The length \(a_1\) can be found as follows. The vertical line \( x_1 = a_1\) intersects this ellipse at values of \(y_1\) given by
\[(fs^2 + gc^2)y^2_1 - 2(g-f)sca_1y_1 + (fc^2+gs^2)a^2_1 - 1 = 0 \tag{A6}\label{A6}\]
But the line \( x_1 = a_1\) is to be a vertical tangent to the ellipse, and therefore the quadratic equation (\(\ref{A6}\)) must have two equal real roots, which tells us, after a little algebra, that
\[a^2_1 = \dfrac{fs^2+gc^2}{fg}. \tag{A7}\label{A7}\]
A similar analysis starting with the horizontal line \(y_1 = b_1\) reveals that
\[b^2_1 = \dfrac{fc^2+gs^2}{fg}. \tag{A8}\label{A8}\]
For a check on the correctness of the algebra, it can now be verified that \(a^2_1 + b^2_1= 1\).
The Stokes Q parameter is \(a^2_1 - b^2_1\), and, after some algebra and trigonometric identities, it is found that
\[Q = a^2_1 - b^2_1 = \dfrac{e^2\cos 2 \theta}{2-e^2}, \tag{A9}\label{A9}\]
which is one of the relations that we sought.
Now refer to Figure A3. The \(x_2,y_2\) and \(x,y\) coordinate systems are related by
\[\left(\begin{array}{c}x\\ y\end{array}\right) =\left(\begin{array}{c}C -S\\ S \quad C\end{array}\right)\left(\begin{array}{c}x_2\\ y_2\end{array}\right), \tag{A10}\label{A10}\]
where
\[S = \sin (45 ^{\circ}- \theta) \quad \text{and} \quad C = \cos(45 ^{\circ}- \theta). \tag{A11}\label{A11}\]
On making use of Equations (\(\ref{A2}\)) and (\(\ref{A10}\)), we find that the equation to the ellipse referred to the \((x_2 , y_2)\) coordinate system is
\[(fC^2 + gS^2)x^2_2 + 2(g-f)SCx_2y_2 + (fS^2+gC^2)y^2_2 = 1 \tag{A12}\label{A12}\]
To obtain \(U\), we now proceed in a similar fashion to the analysis of \(Q\). We combine this equation with \( x_2 = a_2 \) and put in the condition that the resulting quadratic equation in \(y_2\) has two equal real roots, to obtain
\[a^2_2 = \dfrac{fS^2 + gC^2}{fg} \tag{A13}\label{A13}\]
Likewise, by combination with \(y_2 = b_2\), we obtain
\[b^2_2 = \dfrac{fC^2 + gS^2}{fg} \tag{A14}\label{A14}\]
The correctness of the algebra can be checked by verifying that \(a^2_2+b^2_2 = 1\). Then \(U\), which is \(a^2_2 - b^2_2\), can be calculated with some algebra and trigonometry, to be
\[U = \dfrac{e^2 \sin 2 \theta}{2-e^2}.\tag{A15}\label{A15}\]
And this is a good time to remind ourselves of equation (\(\ref{A9}\))
In our drawings in this chapter, we have taken \(b = \dfrac{1}{2}a, e= \dfrac{\sqrt{3}}{2}, \theta = 30^{\circ}\) so that \(Q = 0.3, U = 0.5196\).
Now for the chirality or handedness of the radiation. From measurements of \(Q\) and \(U\) we have deduced the eccentricity and orientation of the Lissajous ellipse, but we don’t yet know whether the tip of the E-vector is moving clockwise or counterclockwise (as seen when looking towards the source of light). This is what the Stokes \(V\) parameter is going to tell us.
It is well known that a Lissajous ellipse can be generated as the resultant of two simple harmonic linear oscillations at right angles to each other. In order to understand the V parameter it is necessary to understand that a Lissajous ellipse can also be generated by two circular motions, of different amplitude, and moving in opposite directions. If the semi major and semi minor axes of the Lissajous ellipse are, respectively, \(a\) and \(b\), the radii of the circular components are \(\dfrac{1}{2}(a+b\) and \(\dfrac{1}{2}(a-b\) (see Figure 11).
To measure \(V\) we place in front of the light source a filter that transmits only circularly polarized light. We’ll suppose that it transmits light that is left-handed (counterclockwise) as seen when looking towards the light source. I.e. it will obstruct the smaller circle of Figure A5 and transmit the large circle.
If the fraction of the flux density passed by the filter is \(f\), the Stokes \(V\) parameter is \(2f-1\).
Examples:
If the light is lefthand circularly polarized, the filter will transmit all of the light. That is, \(f=1, V=1\).
If the light is righthand circularly polarized, the filter will transmit none of the light. That is, \(f = 0, V=-1\).
If the light is linearly polarized, the filter will transmit half of the light. (Linearly polarized light can be generated by two equal circles moving in opposite directions.) That is, \(f = \dfrac{1}{2}, V= 0\).
In Figure A5, \(b = \dfrac{1}{2}\). The radius of the small circle (which is obstructed) is \(\dfrac{1}{4}a\) and the radius of the large circle (which is transmitted) is \(\dfrac{3}{4}a\). The flux density of the unfiltered light is proportional to \(a^2 + b^2 = \dfrac{5}{4} a^2\). The flux density of the light that is passed is proportional to \(\dfrac{9}{8}a^2\). (The flux density, we recall, is proportional to the square of the director circle. The radius of the director circle of the large circle is \(\sqrt{(\dfrac{3}{4}a)^2 + (\dfrac{3}{4}a^2)} = \sqrt{(\dfrac{8}{9}a)^2}\). So we have \(f =0.9, V = 0.8\).
If we were to reverse all of the arrows in Figure A5, it would be the larger circle that would be blocked and the small circle passed. The flux density of the light that is passed is then proportional to \(\dfrac{1}{8}a^2\). So we have \(f = 0.1, V = -0.8\).
Thus positive \(V\) means that the tip of the E-vector is moving counterclockwise, and negative \(V \) means that it is rotating clockwise.
In general, the radius of the large circle is \(\dfrac{1}{2} (a+b)\) and the radius of its director circle is \(\dfrac{1}{\sqrt{2}}(a+b)\). If this is the circle that is transmitted, the flux density passed is proportional to \(\dfrac{1}{2} (a+b)^2\).
We have, then, \(f = \dfrac{\dfrac{1}{2}(a+b)^2}{a^2+b^2}, V =\dfrac{2ab}{a^2+b^2}\). This means, incidentally, that \(V\) is proportional to the area of the ellipse. If we take \(a^2 + b^2 = 1\), then \(V = 2ab\). If it is the small circle that is passed, \(f = \dfrac{\dfrac{1}{2}(a-b)^2}{a^2+b^2}, V =\dfrac{-2ab}{a^2+b^2}\).
Since the eccentricity of the ellipse is given by \(e^2 = 1- \dfrac{b^2}{a^2}\), we can express \(V^2\) in terms of the eccentricity, thus
\[V^2 = \dfrac{4(1-e^2)}{(2-e^2)^2}.\tag{A16}\label{A16}\]
This equation is valid for totally polarized light. For partially polarized light, return to the main text.