Skip to main content
Physics LibreTexts

16.3: Transfer of Polarized Radiation

  • Page ID
    141713
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\dsum}{\displaystyle\sum\limits} \)

    \( \newcommand{\dint}{\displaystyle\int\limits} \)

    \( \newcommand{\dlim}{\displaystyle\lim\limits} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \(\newcommand{\longvect}{\overrightarrow}\)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    Even before the advent of classical electromagnetism in 1864 by James Clerk Maxwell, it was generally accepted that light propagated by means of a transverse wave. Any such wave would have a preferred plane of vibration as seen by an observer, and any collection of such beams that exhibited such a preferred plane was said to be polarized. In 1852, George Stokes7 showed that such a collection of light beams could be characterized by four parameters that now bear his name. Normally a heterogeneous collection of light beams will have planes of oscillation that are randomly oriented as seen by an observer and will be called unpolarized. However, should a light beam exhibit some degree of polarization, a measure of that polarization potentially contains a significant amount of information about the nature of the source.

    This property of light has been largely ignored in stellar astrophysics until relatively recently. The probable reason can be found in the assumption that all stars are spherical. Any such object would be completely symmetric about the line of sight from the observer to the center of the apparent disk so if even local regions on the surface emitted 100 percent polarized light, the symmetric orientation of their planes of polarization would yield no net polarization to an observer viewing the integrated light of the disk. However, many astrophysical processes produce polarized light, and numerous stars are not spherical. In particular, eclipsing binary systems present a source of integrated light which is not symmetric about the line of sight. Early work by Chandrasekhar8,9 and Code10 implied that there might be measurable polarization that would place significant constraints on the nature of the source. However, only after the development of very sensitive detectors and the discovery of intrinsic circular polarization in white dwarfs did intrinsic stellar polarization attract any great interest. In the hope that this interest will continue to grow, we will discuss some of the theoretical approaches to the problems posed by the transfer of polarized light.

    a. Representation of a Beam of Polarized Light and the Stokes Parameters

    Imagine a collection of largely uncorrelated electromagnetic waves propagating in the same direction through space. The combined effect of these beams can be obtained by adding the squares of the amplitudes of their oscillating electric and magnetic vectors. If the waves were completely correlated as are those from a laser, the amplitudes themselves would vectorially add directly. Indeed, the representation of the addition of the squares of the amplitudes can be taken as a definition of a completely uncorrelated set of waves. Now consider a situation where the collection of waves interacting with some object that produces a systematic effect on the waves such as a phase shift that depends on the orientation of the wave to the object. A reflection from a mirror produces such an effect. If we introduce a coordinate system to represent this interaction (see Figure 16.2), the net effect of summing components of the amplitudes of the combined waves could produce an ellipse whose orientation changes as it propagates through space. Such a beam is said to be elliptically polarized. If the tip of the combined electric vectors appears to rotate counterclockwise as viewed by the observer, the beam is said to have positive helicity or to be left circularly polarized. However, some authors define the state of positive polarization to be the reverse of this convention, and students should be careful, when reading polarization literature, which is being employed by the author.

    Figure 16.2 shows two beams of light incident on a reflecting surface where a phase shift is introduced between those with electric vectors vibrating parallel to the surface and those vibrating normal to the surface. The combined effect on the two beams is to have initially unpolarized light converted into an elliptically polarized beam.
    Figure 16.2 shows two beams of light incident on a reflecting surface where a phase shift is introduced between those with electric vectors vibrating parallel to the surface and those vibrating normal to the surface. The combined effect on the two beams is to have initially unpolarized light converted into an elliptically polarized beam.

    In 1852, George Stokes7 showed that such a beam of light could be represented by four quantities. Although we now use a somewhat different notation than that of Stokes, the meaning of those parameters is the same. Basically, the four parameters required to describe a polarized beam are the total intensity; the difference between two orthogonal components of the intensity, which is related to the degree of polarization; the orientation of the ellipse, which specifies the plane of polarization; and the degree of ellipticity, which is related to the magnitude of the systematic phase shifts introduced by the interaction that gives rise to the polarized beam. Since these quantities presuppose the existence of the coordinate system for their definition, they can be defined most easily in terms of those coordinates (see Figure 16.2) as \[\begin{aligned}
    I & \equiv I_l+I_r=E_l^2+E_r^2 \\
    Q & \equiv I_l-I_r=E_l^2-E_r^2 \\
    U & \equiv\left(I_l-I_r\right) \operatorname{Tan} 2 \chi=2 E_l E_r \cos \epsilon \\
    V & \equiv\left(I_l-I_r\right) \operatorname{Sec} 2 \chi \operatorname{Tan} 2 \beta=2 E_l E_r \sin \epsilon
    \end{aligned}\label{16.2.1}\]

    where Tanβ is the ratio of the semi-minor to the semi-major axis of the ellipse. The quantities \(\mathrm{E}_l\) and \(\mathrm{E}_r\) are the amplitudes of two orthogonal waves that are phase shifted by ε. The vector superposition of two such waves will produce a single wave with an amplitude that rotates as the wave moves through space. We will take \(0 \leq|\beta| \leq \pi / 2\), and the whether sign of β is positive or negative will determine if the beam is to be called left or right elliptically polarized respectively. If Q = U = V = 0, the light is said to be completely unpolarized, while if V = 0, the beam is linearly polarized with a plane of polarization making an angle \(\chi\) with the \(l\)-axis.

    Since the Stokes parameters contain some information which is purely geometric, we should not be surprised if, for a completely polarized beam, the parameters are not linearly independent and that relationships exist between them involving their geometric representation. Specifically, from equations (16.2.1) we get \[I^2=Q^2+U^2+V^2 \quad \tan 2 \chi=\frac{U}{Q} \quad \sin 2 \beta=\frac{V}{\left(Q^2+U^2+V^2\right)^{1 / 2}}\label{16.2.2}\]

    Since the Stokes parameters already involve combining the squares of the amplitudes of an arbitrary beam, we should be able to add a natural or unpolarized beam directly to the completely polarized beam. For such a beam \(\mathrm{I}_l=\mathrm{I}_r\) in all coordinate systems and Q = U = V = 0, so that \[I^2 \geq Q^2+U^2+V^2\label{16.2.3}\]

    Thus for a collection of light beams, some of which may be polarized, all four Stokes parameters are indeed linearly independent and must be specified to completely characterize the beam.

    It is clear from the definitions of the Stokes parameters that I and V are invariant to a coordinate rotation about the axis of propagation while Q and U are not. If we rotate the \(l-r\) coordinate frame through an angle \(\phi\) in a counterclockwise direction (see Figure 16.2), then \[\begin{aligned}
    & Q^{\prime}=Q \cos 2 \phi-U \sin 2 \phi=I \cos 2 \beta \cos 2(\chi+\phi) \\
    & U^{\prime}=Q \sin 2 \phi+U \cos 2 \phi=I \cos 2 \beta \sin 2(\chi+\phi)
    \end{aligned}\label{16.2.4}\]

    Since I and V are invariant to rotational transformations, we can write the general rotation for the Stokes parameters in matrix-vector notation. If we define an intensity vector to have components \(\overrightarrow{\mathrm{I}}[\mathrm{I}, \mathrm{Q}, \mathrm{U}, \mathrm{V}]\) so that \[\left(\begin{array}{l}
    I \\
    Q \\
    U \\
    V
    \end{array}\right)=\left(\begin{array}{cccc}
    1 & 0 & 0 & 0 \\
    0 & \cos 2 \phi & -\sin 2 \phi & 0 \\
    0 & \sin 2 \phi & \cos 2 \phi & 0 \\
    0 & 0 & 0 & 1
    \end{array}\right)\left(\begin{array}{l}
    I^{\prime} \\
    Q^{\prime} \\
    U^{\prime} \\
    V^{\prime}
    \end{array}\right)\label{16.2.5}\]

    then we can call the matrix the rotation matrix for the Stokes parameters. If we choose a coordinate system so that \(\mathrm{I}_l\) is a maximum, then the degree of linear polarization P is a useful parameter defined by \[P \equiv\left(\frac{Q}{I}\right)_{\max }\label{16.2.6}\]

    where, since the \(l-r\) coordinate frame has been chosen so that the \(l\)-axis lies along this direction of maximum intensity, U = 0.

    It is also useful to linearly decompose the Stokes parameters I and Q into \(\mathrm{I}_l\) and \(\mathrm{I}_r\) so that \[I_l=\frac{1}{2}(I+Q) \quad I_r=\frac{1}{2}(I-Q)\label{16.2.7}\]

    However, neither of these parameters is invariant to a coordinate rotation so that if we denote a vector \(\overrightarrow{\mathrm{I}}\left(\mathrm{I}_{l}, \mathrm{I}_{r}, \mathrm{U}, \mathrm{V}\right)\) whose components are this different set of Stokes parameters, then \[\vec{I}=\mathbf{L}(\phi) \overrightarrow{I^{\prime}}\label{16.2.8}\]

    where the rotation matrix for these Stokes parameters is \[\mathbf{L}(\phi)=\left(\begin{array}{cccc}
    \cos ^2 \phi & \sin ^2 \phi & -\frac{1}{2} \sin 2 \phi & 0 \\
    \sin ^2 \phi & \cos ^2 \phi & \frac{1}{2} \sin 2 \phi & 0 \\
    \sin 2 \phi & -\sin 2 \phi & \cos 2 \phi & 0 \\
    0 & 0 & 0 & 1
    \end{array}\right)\label{16.2.9}\]

    The rotation matrix \(\mathrm{L}(\phi)\) can be obtained from equation (16.2.5) by employing the definitions of I and Q in terms of \(\mathrm{I}_l\) and \(\mathrm{I}_r\) in equation (16.2.7). Here the angle \(\phi\) is taken positively increasing for counterclockwise rotations. While this definition is the opposite of that used by Chandrasekhar1 (p. 26), it is the more common notation for a rotational angle. The form of this rotation matrix is a little different from that used to transform vectors because the quantities being transformed are the squares of vector components rather than the components and their composition law is different from that for vectors alone. However, the transformation matrix does share the common group identities of the normal rotation matrices so that \[\mathbf{L}^{-1}(\phi)=\mathbf{L}(-\phi) \quad \mathbf{L}\left(\phi_1\right) \mathbf{L}\left(\phi_2\right)=\mathbf{L}\left(\phi_1+\phi_2\right)\label{16.2.10}\]

    Finally, there is an additional way to conceptualize the Stokes parameters that many have found useful. In general, a polarized beam of light can be imagined as being composed of two orthogonally oscillating electric vectors bearing a relative phase shift e with respect to one another. The vector sum of these two amplitudes will be a vector whose tip traces out an elliptically helical spiral as the vector propagates through space. We have defined the Stokes parameters in terms of the projection of that spiral onto a plane normal to the direction of propagation. However, we can also view them in terms of the motion of the vector through space. The fact that the vector rotates results from the phase difference between the two waves. If the phase difference were an odd multiple of \(\pi/2\), then the wave would spiral through space and would be said to contain a significant amount of circular polarization. However, if the phase difference e were zero or a multiple of \(\pi\), the tip of the vector would merely oscillate in a plane whose orientation is determined by the ratio of the amplitudes of the electric vectors in the original orthogonal coordinate frame defining the beam, and the beam would be said to be linearly polarized.

    Since the Stokes parameters are intensity-like (i.e. they have they units of the square of the electric field), one way to identify them is to take the outer (tensor) product of the electric vector with itself. Such an electric vector will have components El and Er so that the outer product will be \[\vec{E} \vec{E}=\left(\begin{array}{ll}
    E_l E_l & E_r E_l \\
    E_l E_r & E_r E_r
    \end{array}\right)\label{16.2.11}\]

    Only three of these components are linearly independent, but the component \(\mathrm{E}_r\) itself is made up of two linearly independent components \(\mathrm{E}_r \cos \varepsilon\) and \(\mathrm{E}_r \sin \varepsilon\). Thus the expansion of the term \(\mathrm{E}_l\mathrm{E}_r\) leaves us with four intensity-like components which we can identify with the Stokes parameters so that formally \[\left(\begin{array}{c}
    I_l \\
    I_r \\
    U \\
    V
    \end{array}\right)=\left(\begin{array}{c}
    E_l^2 \\
    E_r^2 \\
    2 E_l E_r \cos \epsilon \\
    2 E_l E_r \sin \epsilon
    \end{array}\right)\label{16.2.12}\]

    Thus taking the outer product of a general electric vector provides an excellent formalism for generating the Stokes parameters for that beam.

    b. Equations of Transfer for the Stokes Parameters

    The intensity-like nature of the Stokes parameters that describe an arbitrarily polarized beam makes them additive. This and the linear nature of the equation of radiative transfer ensures that we can write equations of transfer for each of them. Thus, if the components of \(\vec{\mathrm{I}}\) are the Stokes parameters, we can express this result as a vector equation of radiative transfer that has the familiar form \[\mu \frac{d \vec{I}}{d \tau}=\vec{I}-\vec{S}\label{16.2.13}\]

    As usual, all the problems in the equations of transfer are contained in the source function. If we choose the intensity vector to have components \(\overrightarrow{\mathrm{I}}\left[\mathrm{I}_{l}, \mathrm{I}_{r}, \mathrm{U}, \mathrm{~V}\right]\), then the source function can be written as a vector whose components are \(\mathrm{S}_l\), \(\mathrm{S}_r\), \(\mathrm{S_U}\), and \(\mathrm{S_V}\). Again, as we did with incident radiation, it is convenient to break up the radiation field into the "diffuse" field consisting of photons that have interacted at least once and the direct transmitted field that could originate from any incident radiation. We concentrate initially on the diffuse field since, in the absence of incident radiation, it exhibits axial symmetry about the normal to the atmosphere.

    Nature of the Source Function for Polarized Radiation The source function that appears in equation (16.2.13) is a vector quantity that must account for all contributions to the four Stokes parameters from thermal and scattering processes. Since thermal emission is by nature a random equilibrium process, we expect it to make no contribution to the Stokes parameters U and V for these measure the departure of the radiation field from isotropy but now carry energy. In short, thermal radiation is completely unpolarized. For the same reasons, the thermal radiation will make an equal contribution to both Il and Ir. Therefore the thermal contribution to the vector source function in equation (16.2.13) will have components \[\epsilon_\nu \vec{B}_\nu(T)=\epsilon_\nu\left[\frac{1}{2} B_\nu(T), \frac{1}{2} B_\nu(T), 0,0\right]\label{16.2.14}\]

    where \(\varepsilon_\nu\) is the familiar ratio of pure absorption to total extinction defined in equation (10.1.8). In some instances it is possible for \(\varepsilon_\nu\) to depend on orientation. This is particularly true if an external magnetic field is present. The role played by scattering is considerably more complex.

    Redistribution Phase Function for Polarized Light For the sake of simplicity, let us assume that the scattering is completely coherent so that there is no shift in the frequency of the photon introduced by the scattering process. However, in some instances scattering can result in photons being scattered from the \(l\)-plane into the \(r\)-plane, so that in principle all four Stokes parameters may be affected. In addition, the scattering by the atom or electron is most simply described in the reference frame of the scatterer and most simply involves the angle between the incoming and outgoing photons. But the coordinate frame of the equation of transfer is affixed to the observer. Thus we must express not only the details by which the photons are scattered from one Stokes parameter to another, but also that result in the coordinate frame of the equation of radiative transfer. It is this latter transformation that took the relatively simple form of the Rayleigh phase function and turned it into the more complicated form given by equation (15.3.39).

    Since the scattering of a photon may carry it from one Stokes parameter to another, we can expect the redistribution function to have the form of a matrix where each element describes the probability of the scattering carrying the photon from some given Stokes parameter to another. We have already seen that the tensor outer product provides a vehicle for identifying the Stokes parameters in a beam. Let us use this formalism to investigate the Stokes parameters for a polarized beam of light scattered by an electron. While we generally depict a light beam as being composed of a single electric vector oscillating in a plane, this is not the most general form that a light wave can have.

    Consider a beam of light composed of orthogonal electric vectors \(\vec{E’_l}\) and \(\vec{E’_r}\) which are systematically phase-shifted with respect to each other. Let this beam be incident on an electron so that the wave is scattered in the \(l\)-plane (see Figure 16.3). We are free to pick the \(l\)-plane to be any plane we wish, and the scattering plane is a convenient one. We can also view the scattering event to result from the acceleration of the electron by the incident electric fields \(\vec{E’_l}\) and \(\vec{E’_r}\) producing oscillating dipoles which reradiate the incident energy. Since the photon is reradiated in the \(l\)-plane, we may consider it to be made up of electric vectors \(\vec{E_l}\) and \(\vec{E_r}\) which bear the same relative phase shift to each other as \(\vec{E’_l}\) and \(\vec{E’_r}\). Now consider how each dipole excited by \(\vec{E’_l}\) and \(\vec{E’_r}\) will re-radiate in the scattering plane. In general, electric dipoles will radiate in a plane containing the dipole axis, but the amplitude of that wave will decrease as one approaches the pole, specifically as the sine of the polar angle. Thus the intensity of \(\vec{E_l}\) will bary as \(\cos \gamma\) while \(\vec{E_r}\) will be independent of the scattering (see Figure 16.3). Thus the scattered vector \(\vec{E}\) will have components \(\left[\vec{E}_l^{\prime} \cos \gamma, \vec{E}_r^{\prime}\right]\). Taking the outer product of this vector with itself, we can identify the linearly independent components as \[\vec{E}\vec{E}=\left[E_l^{\prime 2} \cos ^2 \gamma, E_r^{\prime 2},\left(2 E_l^{\prime} E_r^{\prime} \cos \epsilon\right) \cos \gamma,\left(2 E_l^{\prime} E_r^{\prime} \sin \epsilon\right) \cos \gamma\right]\label{16.2.15}\]

    Figure 16.3 shows the electric vectors of two orthogonal waves differing in phase by \(\varepsilon\) incident on an electron. The scattering of the waves takes place in the same plane as the oscillating \(\vec{E’_l}\) wave. The angle through which the waves are scattered is \(\gamma\). Each wave can be viewed as exciting a dipole which radiates in the scattering plane in accordance with dipole radiation for the orientation of the respective dipoles.
    Figure 16.3 shows the electric vectors of two orthogonal waves differing in phase by \(\varepsilon\) incident on an electron. The scattering of the waves takes place in the same plane as the oscillating \(\vec{E’_l}\) wave. The angle through which the waves are scattered is \(\gamma\). Each wave can be viewed as exciting a dipole which radiates in the scattering plane in accordance with dipole radiation for the orientation of the respective dipoles.

    Since the dipoles excited by the incident radiation do not reradiate in a plane orthogonal to the exciting radiation, there can be no mixing of energy from the \(r\)-plane to the \(l\)-plane or vice versa. Thus, in any scattering matrix for Stokes parameters, the four independent components of the beam given by equation (16.2.15) must appear on the diagonal. In addition, one would expect a constant of proportionality determined by the conservation of energy. Specifically, the total energy of the incident beam must equal the total energy of radiation scattered through all allowed scattering angles. Thus, factoring out the definitions of the Stokes parameters from the components given in equation (16.2.15), we can write \[\left(\begin{array}{c}
    I_l \\
    I_r \\
    U \\
    V
    \end{array}\right)=A_0\left(\begin{array}{cccc}
    \cos ^2 \gamma & 0 & 0 & 0 \\
    0 & 1 & 0 & 0 \\
    0 & 0 & \cos \gamma & 0 \\
    0 & 0 & 0 & \cos \gamma
    \end{array}\right)\left(\begin{array}{c}
    \left(E_l^{\prime}\right)^2 \\
    \left(E_r^{\prime}\right)^2 \\
    2 E_l^{\prime} E_r^{\prime} \cos \epsilon \\
    2 E_l^{\prime} E_r^{\prime} \sin \epsilon
    \end{array}\right)\label{16.2.16}\]

    where \(A_0\) is the constant of proportionality described above. By applying the conservation of energy, \(A_0\) can be found to be 3/2. Thus the matrix in equation (16.2.16) represents the transformation of \(\overrightarrow{\mathrm{I}}\left[\mathrm{I}^{\prime}_l, \mathrm{I}^{\prime}, \mathrm{U}^{\prime}, \mathrm{V}^{\prime}\right]\) into \(\overrightarrow{\mathrm{I}}\left[\mathrm{I}_l, \mathrm{I}_r, \mathrm{U}, \mathrm{V}\right]\). This matrix is generally called the Rayleigh phase matrix and is \[\mathbf{R}=\frac{3}{2}\left(\begin{array}{cccc}
    \cos ^2 \gamma & 0 & 0 & 0 \\
    0 & 1 & 0 & 0 \\
    0 & 0 & \cos \gamma & 0 \\
    0 & 0 & 0 & \cos \gamma
    \end{array}\right)\label{16.2.17}\]

    Here \(\gamma\) is the scattering angle between the incoming and outgoing photons. The Stokes parameters in the scattering coordinate system will be given by the vector components of the intensity \(\overrightarrow{\mathrm{I}}\left[\mathrm{I}_l, \mathrm{I}_r, \mathrm{U}, \mathrm{V}\right]\) where \(\mathrm{I}_l\) and \(\mathrm{I}_r\) lie in the scattering plane and perpendicular to it in a plane containing the outgoing photon (see Figure 16.3 and Figure 16.4). The equivalent matrix for scattering of the more common Stokes vector \(\mathrm{\vec{I}^{\prime}\left[I^{\prime}, Q^{\prime}, U^{\prime}, V^{\prime}\right]}\) to \(\mathrm{\vec{I}[I, Q, U, V]}\) can be obtained by transforming equation (16.2.17) in accordance with the definitions given in equation (16.2.7) so that \[R[I, Q, U, V]=\frac{3}{4}\left(\begin{array}{cccc}
    1+\cos ^2 \gamma & -\sin ^2 \gamma & 0 & 0 \\
    -\sin ^2 \gamma & 1+\cos ^2 \gamma & 0 & 0 \\
    0 & 0 & 2 \cos \gamma & 0 \\
    0 & 0 & 0 & 2 \cos \gamma
    \end{array}\right)\label{16.2.18}\]

    To be useful for radiative transfer, these matrices must now expressed in the observer's coordinate frame by multiplying them by the appropriate rotation matrix as given in equation (16.2.9) or (16.2.5).

    Figure 16.4 shows the coordinate frame for a single scattering event and the orientation of the polarized components.
    Figure 16.4 shows the coordinate frame for a single scattering event and the orientation of the polarized components.

    The final transformation to the coordinate system of the equation of transfer is moderately complicated (see Chandrasekhar1, pp.37- 41) and I will simply quote the result for equation (16.2.17), using his notation. \[\mathbf{R}=\frac{3}{2}\left(\begin{array}{cccc}
    (l l)^2 & (r l)^2 & (l l)(r l) & 0 \\
    (l r)^2 & (r r)^2 & (l r)(r r) & 0 \\
    2(l l)(l r) & 2(r r)(r l) & (l l)(r r)+(r l)(l r) & 0 \\
    0 & 0 & 0 & (l l)(r r)-(r l)(l r)
    \end{array}\right)\label{16.2.19}\]

    where \[\begin{aligned}
    & l l=\sin \theta \sin \theta^{\prime}+\cos \theta \cos \theta^{\prime} \cos \left(\phi^{\prime}-\phi\right) \\
    & r l=\cos \theta \sin \left(\phi^{\prime}-\phi\right) \\
    & l r=-\cos \theta \sin \left(\phi^{\prime}-\phi\right) \\
    & r r=\cos \left(\phi^{\prime}-\phi\right)
    \end{aligned}\label{16.2.20}\]

    The equivalent matrix for the describing the scattering of \(\overrightarrow{\mathrm{I}}^{\prime}\left[\mathrm{I}^{\prime}, \mathrm{Q}^{\prime}, \mathrm{U}^{\prime}, \mathrm{V}^{\prime}\right]\) can be obtained by transforming the matrix given in equation (16.2.19) in accordance with the definitions given in equations (16.2.7)(see equation 16.2.37). Thus the scattering matrix can be obtained in the observer's frame for either representation of the Stokes parameters. In cases where the scattering matrix may have a more complicated representation in the frame of the scatterer, it may be easier to accomplish the transformation to the observer's coordinate frame for the orthogonal electric vectors themselves and then reconstruct the appropriate Stokes parameters. Substitution of these admittedly messy functions into equation (16.2.13) means that we can write the vector source function for Rayleigh scattering as \[\vec{S}_\nu=\epsilon_\nu \vec{B}(T)-\left(1-\epsilon_\nu\right) \frac{1}{4 \pi} \int_0^{2 \pi} \int_{-\pi / 2}^{+\pi / 2} \mathbf{R}\left(\theta, \theta^{\prime}, \phi, \phi^{\prime}\right) \vec{I}\left(\theta^{\prime}, \phi^{\prime}\right) \sin \theta^{\prime} d \theta^{\prime} d \phi^{\prime}\label{16.2.21}\]

    where the only angles that appear in the source function are those that define the coordinate frame of the equation of transfer (see Figure 16.5).

    Although the angle \(\phi\) appears in the equation of transfer, the solution for the diffuse radiation field will not depend on it as long as there is no incident radiation. All the solution methods described in Chapter 10 for the equation of transfer may be used to solve the problem. However, equation (16.2.13) represents four simultaneous equations coupled through the scattering matrix.

    More Complicated Phase Functions The scattering of photons by electrons is one of the simplest yet widespread scattering phenomena in astronomy. However, in a number of cases the scattering source results in more complicated phase functions and produces interesting observational tests of stellar structure. Some examples are the scattering of radiation by resonance lines, the scattering from dust grains, and the effects of global magnetic fields on scattering. Even for these more difficult cases, one may proceed in the manner described for classical Thomson scattering to formulate the Stokes phase matrix. In most cases one can characterize the scattering process as involving the excitation of an electron which then reradiates the photon in a radiation pattern of an electric dipole. In some instances, the electron may be bound so that its oscillatory amplitude and orientation are restricted by external forces.

    Figure 16.5 shows the scattering of a photon in the 'observer's' coordinate frame. The Z-axis points in the direction of the normal to the atmosphere and \(\cos\theta=\mu\). The angle \(\phi\) is the azimuthal angle and now is a parameter of the problem.
    Figure 16.5 shows the scattering of a photon in the 'observer's' coordinate frame. The Z-axis points in the direction of the normal to the atmosphere and \(\cos\theta=\mu\). The angle \(\phi\) is the azimuthal angle and now is a parameter of the problem.

    As long as the physics of this restriction is understood, the re-radiation can be characterized and the nature of the scattered wave described. The tensor outer product of that wave will then describe the scattering behavior of the Stokes parameters of the incident wave. As long as one is careful to keep track of the plane of origin of the various scattered electric vector amplitudes as well as the planes in which they are radiated, the contribution of the elements of the tensor outer product of the scattered beam with itself to the Stokes parameters of the scattered beam is unique.

    For example, consider the case of Thomson scattering by an electron in a macroscopic magnetic field. The motion of the electron produced by the incident electric fields will subject the electron to Lorentz forces orthogonal to both the magnetic field and the induced velocity. This will cause the electron to oscillate and generate a second "electric dipole" whose radiation is added to that of the first. This allows energy to be transferred from one plane of polarization to another by an amount that depends on the magnetic field strength and orientation. This will have a profound effect on all the Stokes parameters. Contributions will be made to the scattered Stokes parameters from the interaction of the radiation by the initial electric dipole with itself, the interaction of that radiation with that of the magnetically induced dipole, and finally the self interaction of the magnetically induced dipole. If we represent the scattered electric fields as being composed of a component from the initially driven dipole \(\vec{\mathrm{E}}\) and the magnetically induced dipole \(\vec{\mathrm{M}}\) then we can write the outer product of the scattered electric field with itself as \[\vec{E}\vec{E}=\vec{E}\vec{E}+2 \vec{E} \vec{M}+\vec{M} \vec{M}\label{16.2.22}\]

    Each of these tensor outer products will produce a scattering matrix, and the sum will represent the scattering of the Stokes parameters by electrons in the magnetic field. The first will be identical to the Rayleigh phase matrix, where each element has been reduced by an amount equal to the energy that has been fed into the magnetically induced dipole. The second matrix represents the mixing between the electric field of the initial dipole with that of the magnetically induced dipole. Since these dipoles radiate out of phase with one another by \(\pi/2\), the information contained here will largely involve the U and V Stokes parameters. The last matrix represents the contribution from the magnetically induced dipole with itself and will depend strongly on the relative orientation of the magnetic field. Keeping track of the relative phases of the various components, we calculate the three scattering matrices to be \[\begin{aligned}
    R(\vec{E} \vec{E}) & =\frac{3}{2} \zeta^2\left(\begin{array}{cccc}
    \cos ^2 \gamma & 0 & 0 & 0 \\
    0 & 1 & 0 & 0 \\
    0 & 0 & \cos \gamma & 0 \\
    0 & 0 & 0 & \cos \gamma
    \end{array}\right) \\
    R(\vec{E} \vec{M}) & =\frac{3 \zeta^2 x}{2}\left(\begin{array}{cccc}
    0 & 0 & 0 & +b_2 \cos \gamma \\
    0 & 0 & 0 & a_2 \\
    0 & 0 & 0 & a_1 \\
    2 a_2 \cos \gamma & +2 b_1 & a_1 & 0
    \end{array}\right) \\
    R(\vec{M} \vec{M}) & =\frac{3}{2} x^2 \zeta^2\left(\begin{array}{cccc}
    a_1^2 & b_1^2 & a_1 b_1 & 0 \\
    a_2^2 & 0 & 0 & 0 \\
    2 a_1 a_2 & 0 & +a_2 b_1 & 0 \\
    0 & 0 & 0 & -a_2 b_1
    \end{array}\right)
    \end{aligned}\label{16.2.23}\]

    where \[a_1=\sin \gamma \sin \xi \cos \eta \quad a_2=-\cos \xi \quad b_1=-\sin \gamma \sin \xi \sin \eta\label{16.2.24}\]

    The coordinates of the magnetic field with respect to the scattering plane of the incident photon are given by \(\eta\) and the polar angle \(\xi\), respectively. The strength of the magnetic field is contained in the parameter x so that \[x \equiv \frac{\omega_c}{\omega}=\frac{e B / m_e}{\omega}\label{16.2.25}\]

    If we require that the scattering be conservative, then since energy is required to generate those oscillations, the amplitude of the incident electric vector will be reduced in order to supply the energy to excite them. However, the energy that excites the dipoles must be equal to the energy of the incident beam so that \[\vec{E}^{\prime} \cdot \vec{E}^{\prime}=[\overrightarrow{\mathrm{E}}+x(\overrightarrow{\mathrm{E}} \times \hat{b})] \cdot[\overrightarrow{\mathrm{E}}+x(\overrightarrow{\mathrm{E}} \times \hat{b})]=\overrightarrow{\mathrm{E}}^2\left[1+x^2 \sin ^2(\xi)\right]\label{16.2.26}\]

    where \(\xi\) is the angle between the electric vector and the magnetic field. If we assume that the magnitude of the electric vector is reduced by \(\zeta\), then \[\zeta^2=\left[1+x^2 \sin ^2 \xi\right]^{-1}=\zeta_0^2\left[1-x^2 \zeta_0^2 \cos ^2 \xi\right]^{-1}\label{16.2.27}\]

    where \[\zeta_0^2=\left[1+x^2\right]^{-1}\label{16.2.28}\]

    If we assume that \(x\ll1\), then \(\vec{\mathrm{E}}\) will remain nearly parallel to \(\vec{\mathrm{E}}’\) and we may expand equations (16.2.27), and (16.2.28) to give \[\left.\begin{array}{l}
    \zeta^2 \approx \zeta_0^2\left[1+x^2 \zeta_0^2 \cos ^2 \xi\right] \approx 1 \\
    \zeta_0^2 \approx 1-x^2 \approx 1
    \end{array}\right\}\label{16.2.29}\]

    and we may write \[\vec{E}=\mathbf{R}\left[\vec{E}^{\prime}+x\left(\vec{E}^{\prime} \times \hat{b}\right)\right]\label{16.2.30}\]

    For zero magnetic field, one recovers the Rayleigh phase matrix. As the strength of the field increases, the contribution of the second two matrices increases and normal electron scattering decreases. As the gyro-frequency \(\omega_\mathrm{c}\) is reached, the motion of the electron becomes circular and significant circular polarization is introduced to the beam along the direction of the magnetic field. Since we have ignored the resonance that occurs at the gyro-frequency (that is, \(x=1\), these results are only approximate. For the scattering of natural light, \(\overrightarrow{\mathrm{I}}[½ \mathrm{I}, ½ \mathrm{I}, 0,0]\), the strength of the components from the Rayleigh matrix are reduced while the \(R(\vec{\mathrm{E}},\vec{\mathrm{M}})\) contributes solely to the V stokes parameter, indicating that the beam will be circularly polarized when scattered along the magnetic field.

    This contribution diminishes as the scattering angle moves toward \(\pi/2\). The contribution from the \(R(\vec{M}\vec{M})\) matrix only has nonzero components that transfer energy equally from \(\mathrm{I}_l\) to \(\mathrm{I}_r\) and vice versa (for \(\gamma=\pi\)), but the contribution to \(\mathrm{I}_l\) vanishes as \(\gamma\rightarrow\pi/2\). The fact that U remains zero implies that the scattered radiation for \(\gamma=\pi/2\) is linearly polarized with the plane of polarization perpendicular to the magnetic field.

    As one approaches the gyro-frequency, the transport of radiation becomes increasingly complex. Since the component of the electric field that is parallel to the magnetic field produces no Lorentz force on the electron, the interaction of the two orthogonal components of the generalized electromagnetic wave with the magnetic field are unequal. This anisotropic interaction has its optical counterparts in the phenomena known as Faraday rotation and a complex index of refraction. The phenomena enter the scattering problem as \(x^2\) so that our results are valid only for \(x\ll1\).

    At frequencies less that the gyrofrequency, the scattering situation becomes more complex as the entire energy of the photon may be absorbed by the electron in the form of circular motion. This energy may be lost through collisions before it can be reradiated.

    Again to utilize these matrices for problems involving radiative transfer requires that these matrices be transferred to the observer's frame. However, as they were developed primarily to indicate the method for generating the Stokes phase matrices, we will not develop them further. It seems clear that such techniques are capable of developing phase matrices for problems of considerable complexity.

    c. Solution of the Equations of Radiative Transfer for Polarized Light.

    While direct solution of the integrodifferential equations of radiative transfer may be accomplished by direct means, all the various rays characterizing the four Stokes parameters must be solved together so that the integrals required by the scattering part of the source function may be evaluated simultaneously. To obtain the necessary accuracy for low levels of polarization would require a computing power that would tax even the fastest computers. Even if the axial symmetry of the diffuse field can be used to advantage for some problems, cases involving incident radiation will require involving discrete streams in \(\phi\) as well as \(\theta\).

    There is an approach involving the integral equations for the source function that greatly simplifies the problem. We have seen throughout the book that the development of Schwarzschild-Milne equations for the source function always produced equations that involved moments of the radiation field, usually the mean intensity J. As long as the redistribution function can be expressed as a finite series involving tesseral harmonics of \(\theta\) and \(\phi\), the source function can be expressed in terms of simple functions of \(\theta\) and \(\phi\) which multiply various moments of the radiation field that depend on depth only. As we did in Chapter 9, it will then be possible to generate Schwarzschild-Milne-like equations for these moments that depend on depth only. Indeed, since a moment of a radiation field cannot depend on angle, by definition, this effectively separates the angular dependence of the radiation field from the depth dependence. This is true even for the case of incident radiation. The solution of these equations will then yield the source function at all depths which, together with the classical solution for the equation of transfer, will provide a complete description of the radiation field in all four Stokes parameters. This conceptually simple approach becomes rather algebraically complicated in practice, even for the relatively simple case of Rayleigh scattering.

    The general case of this problem has been worked out by Collins11 and more succinctly by Collins and Buerger12 and we will only quote the results. The source functions for nongray Rayleigh scattering can be written as \[\begin{aligned}
    S_l\left(\tau_\nu, \theta, \phi\right)= & \frac{1}{2} \epsilon_\nu\left(\tau_\nu\right) B_\nu\left(\tau_\nu\right)+\frac{3}{4}\left\{2 X\left(\tau_\nu\right)-\cos ^2 \theta\left[Y\left(\tau_\nu\right)+Z\left(\tau_\nu\right)\right.\right. \\
    & \times(\cos 2 \phi-2 \xi \sin 2 \phi)]+M\left(\tau_\nu\right) \cos \theta \sin \theta(\cos \phi-\zeta \sin 2 \phi\} \\
    S_r\left(\tau_\nu, \theta, \phi\right)= & \frac{1}{2} \epsilon_\nu\left(\tau_\nu\right) B_\nu\left(\tau_\nu\right)+\frac{3}{4}\left[2 X\left(\tau_\nu\right)-Y\left(\tau_\nu\right)+Z\left(\tau_\nu\right)(\cos 2 \phi-2 \xi \sin 2 \phi)\right] \\
    S_U\left(\tau_\nu, \theta, \phi\right)= & -\frac{3}{4}\left[2 Z\left(\tau_\nu\right) \cos \theta(2 \xi \cos 2 \phi+\sin 2 \phi)\right. \\
    & \left.-M\left(\tau_\nu\right) \sin \theta(\sin \phi-\zeta \cos \phi)\right] \\
    S_V\left(\tau_\nu, \theta, \phi\right)= & \frac{3}{8}\left[P\left(\tau_\nu\right) \cos \theta+W\left(\tau_\nu\right) \sin \theta \cos \phi\right]
    \end{aligned}\label{16.2.31}\]

    where the moments of the radiation field \(\mathrm{X}(\tau)\), \(\mathrm{Y}(\tau)\), \(\mathrm{Z}(\tau)\), \(\mathrm{M}(\tau)\), \(\mathrm{P}(\tau)\), and \(\mathrm{W}(\tau)\) depend on depth alone and satisfy these integral equations: \[\begin{aligned}
    X\left(\tau_\nu\right)= & C_X\left(\tau_\nu\right)+\frac{1}{4}\left[\Lambda_1\left(\tau_\nu\right)-\Lambda_3\left(\tau_\nu\right)\right]\left|\epsilon_\nu(t) B_\nu(t)+3 X(t)\right| \\
    & -\frac{3}{8}\left[\Lambda_3\left(\tau_\nu\right)-\Lambda_5\left(\tau_\nu\right)\right]|Y(t)| \\
    Y\left(\tau_\nu\right)= & C_Y\left(\tau_\nu\right)+\frac{1}{4}\left[\Lambda_1\left(\tau_\nu\right)-3 \Lambda_3\left(\tau_\nu\right)\right]\left|\epsilon_\nu(t) B(t)-3 X(t)\right| \\
    & -\frac{3}{8}\left[\Lambda_1\left(\tau_\nu\right)-2 \Lambda_3\left(\tau_\nu\right)+3 \Lambda_5\left(\tau_\nu\right)\right]|Y(t)| \\
    Z\left(\tau_\nu\right)= & C_Z\left(\tau_\nu\right)+\frac{3}{16}\left[\Lambda_1\left(\tau_\nu\right)+2 \Lambda_3\left(\tau_\nu\right)+\Lambda_5\left(\tau_\nu\right)\right]|Z(t)| \\
    M\left(\tau_\nu\right)= & C_M\left(\tau_\nu\right)+\frac{3}{8}\left[\Lambda_1\left(\tau_\nu\right)+\Lambda_3\left(\tau_\nu\right)-2 \Lambda_5\left(\tau_\nu\right)\right]|M(t)| \\
    W\left(\tau_\nu\right)= & C_W\left(\tau_\nu\right)+\frac{3}{8}\left[\Lambda_1\left(\tau_\nu\right)-\Lambda_3\left(\tau_\nu\right)\right]|W(t)| \\
    P\left(\tau_\nu\right)= & C_P\left(\tau_\nu\right)+\frac{3}{4} \Lambda_3\left(\tau_\nu\right)|P(t)|
    \end{aligned}\label{16.2.32}\]

    The \(\Lambda_n\) operator is the same as that defined in Chapter 10 [see equation (10.1.16)] and produces exponential integrals of order \(n\). The constants \(\mathrm{C}_N(\tau)\) depend on the incident radiation field only and are attenuated by the appropriate optical path. It is assumed that the incident radiation encounters the atmosphere along a path defined by \(\phi=0\). The values of these constants for plane-parallel polarized radiation are \[\left(\begin{array}{l}
    C_X\left(\tau_\nu\right) \\
    C_Y\left(\tau_\nu\right) \\
    C_Z\left(\tau_\nu\right) \\
    C_M\left(\tau_\nu\right) \\
    C_W\left(\tau_\nu\right) \\
    C_P\left(\tau_\nu\right)
    \end{array}\right)=\frac{\left[1-\epsilon_\nu\left(\tau_\nu\right)\right] e^{-\tau_\nu / \mu_0}}{4 \pi}\left(\begin{array}{c}
    \left(1-\mu_0^2\right) I_l^0 \\
    \left(2-3 \mu_0^2\right) I_l^0-I_r^0 \\
    I_r^0-\mu_0^2 I_l^0 \\
    -\mu_0\left(1-\mu_0^2\right)^{1 / 2} I_l^0 \\
    \left(1-\mu_0^2\right)^{1 / 2} V^0 \\
    -\mu_0 V^0
    \end{array}\right)\label{16.2.33}\]

    The parameters \(\xi\) and \(\zeta\) also depend on the incident radiation alone and are given by \[\xi \equiv \frac{\mu_0 U^0}{2\left(I_r^0-\mu_0^2 I_l^0\right)} \quad \zeta \equiv \frac{U^0}{2 \mu_0 I_l^0}\label{16.2.34}\]

    These parameters basically represent the direct contribution to the source function from the incident radiation field. If the incident radiation is unpolarized, they are both zero.

    Note that only the first two integral equations for the moments \(\mathrm{X}(\tau)\) and \(\mathrm{Y}(\tau)\) are coupled and therefore must be solved simultaneously. In addition, the net radiative flux flowing through the atmosphere involves only these moments, so that only these two equations must be solved to satisfy radiative equilibrium. This can be shown by substituting the equations for the source functions into the definition for the radiative flux and obtaining \[F_\nu\left(\tau_\nu\right)=4 \Lambda_2\left(\tau_\nu\right)\left|\epsilon_\nu(t) B(t)+3 X(t)-\frac{3}{4} Y(t)\right|-3 \Lambda_4\left(\tau_\nu\right)|Y(t)|\label{16.2.35}\]

    or in differential form \[\frac{d F_\nu\left(\tau_\nu\right)}{d \tau_\nu}=\epsilon_\nu\left(\tau_\nu\right) B_\nu\left[T\left(\tau_\nu\right)\right]-3 X\left(\tau_\nu\right)+Y\left(\tau_\nu\right)\label{16.2.36}\]

    These two expressions are all that is needed to apply the Avrett-Krook perturbation scheme for constructing a model atmosphere. The remaining equations need to be solved only when complete convergence of the atmosphere has been achieved, and then only if the state of polarization for the emergent radiation field is required.

    d. Approximate Formulas for the Degree of Emergent Polarization

    The complexity of equations (16.2.31) and (16.2.32) show that the complete solution of the problem of the transfer of polarized radiation is quite formidable. Clearly approximations that yielded the degree of polarization and its wavelength dependence would be useful. To get such approximation formulas, we plan to formulate them from the Stokes parameters \(\mathrm{I}[\mathrm{I}, \mathrm{Q}, \mathrm{U}, \mathrm{V}]\) rather than \(\mathrm{I}\left[\mathrm{I}_{\mathrm{l}}, \mathrm{I}_{\mathrm{r}}, \mathrm{U}, \mathrm{V}\right]\) since the degree of polarization is just Q/I. We may generate the appropriate scattering matrix from equation (16.2.19) from the relation for the Stokes parameters \(\mathrm{I}[\mathrm{I}, \mathrm{Q}, \mathrm{U}, \mathrm{V}]\) and \(\mathrm{I}\left[\mathrm{I}_{l}, \mathrm{I}_{r}, \mathrm{U}, \mathrm{V}\right]\) given in equation 16.2.7. The resulting matrix for the Stokes Scattering matrix is then

    \[\begin{aligned}
    \mathbf{R}_{I Q U V}=\left[\begin{array}{cc}
    \frac{1}{2}\left[g_{11}^2+g_{12}^2+g_{21}^2+g_{22}^2\right] & \frac{1}{2}\left[g_{11}^2-g_{12}^2+g_{21}^2-g_{22}^2\right] \\
    \frac{1}{2}\left[g_{11}^2+g_{12}^2-g_{21}^2-g_{22}^2\right] & \frac{1}{2}\left[g_{11}^2-g_{12}^2-g_{21}^2+g_{22}^2\right] \\
    {\left[g_{11} g_{21}+g_{12} g_{22}\right]} & {\left[g_{11} g_{21}-g_{12} g_{22}\right]} \\
    0 & 0
    \end{array}\right.\\
    \left.\begin{array}{cc}
    {\left[g_{11} g_{12}+g_{21} g_{22}\right]} & 0 \\
    {\left[g_{11} g_{12}-g_{21} g_{22}\right]} & 0 \\
    {\left[g_{11} g_{22}+g_{21} g_{12}\right]} & 0 \\
    0 & {\left[g_{11} g_{22}-g_{21} g_{12}\right]}
    \end{array}\right]
    \end{aligned}\label{16.2.37}\]

    Here we have replaced the notation of Chandrasekhar with \[g_{11}=l l \quad g_{12}=l r \quad g_{21}=r l \quad g_{22}=r r\label{16.2.38}\]

    To obtain the source function for Thomson-Rayleigh scattering we must carry out the operations implied by equation (16.2.21) for the source function. However, now we are to do it for all four of the Stokes parameters. This means that we shall integrate all elements of the scattering matrix over \(\phi’\) and \(\theta’\). However, we know that the geometry of the plane parallel atmosphere means that the solution can have no \(\phi\)-dependence so that all odd functions of \(\phi’\) must vanish as well as \(\sin2\phi’\). Similarly the resulting source function can have no explicit \(\phi\)-dependence. Also the elements of \(\mathbf{R}[\mathrm{I}, \mathrm{Q}, \mathrm{U}, \mathrm{V}]\) contain a limited number of squares and products of the \(g_{\mathrm{ij}}\mathrm{s}\). If we simply average these quantities over \(\phi’\) we get \[\begin{aligned}
    & \overline{g_{11}^2}=\sin ^2 \theta-\sin ^2 \theta \cos ^2 \theta^{\prime}+\frac{1}{2} \cos ^2 \theta \cos ^2 \theta^{\prime} \\
    & \overline{g_{12}^2}=\frac{1}{2} \cos \theta \quad \overline{g_{21}^2}=\frac{1}{2} \cos \theta^{\prime} \quad \overline{g_{22}^2}=\frac{1}{2} \\
    & \overline{g_{12} g_{21}}=-\frac{1}{2} \cos \theta \cos \theta^{\prime} \\
    & \overline{g_{11} g_{22}}=+\frac{1}{2} \cos \theta \cos \theta^{\prime} \\
    & \overline{g_{12} g_{22}}=\overline{g_{21} g_{22}}=\overline{g_{11} g_{12}}=\overline{g_{11} g_{21}}=0
    \end{aligned}\label{16.2.39}\]

    Integration of these terms over \(\theta’\) will then yield familiar moments of the Stokes parameters such as J and K (Section 9.3). If we let the subscript P stand for the different Stokes parameters, then expressing the product of the \(g_{\mathrm{ij}}\mathrm{s}\) in equation (16.2.39) and the Stokes vector in terms of the moments of the Stokes parameters yields \[\begin{aligned}
    \left\langle I_P g_{11}^2\right\rangle & =J_P-K_P-\left(\cos ^2 \theta\right)\left(J_P-\frac{3 K_P}{2}\right) \\
    \left\langle I_P g_{12}^2\right\rangle & =\frac{1}{2} J_P \cos ^2 \theta \\
    \left\langle I_P g_{21}^2\right\rangle & =\frac{1}{2} K_P \\
    \left\langle I_P g_{22}^2\right\rangle & =\frac{1}{2} J_P \\
    \left\langle I_P g_{12} g_{21}\right\rangle & =-\frac{(\cos \theta) F_P}{4} \\
    \left\langle I_P g_{11} g_{22}\right\rangle & =+\frac{(\cos \theta) F_P}{4}
    \end{aligned}\label{16.2.40}\]

    Substituting these average values into the respective elements of equation (16.2.19) we can calculate the scattering fraction of the source functions for the Stokes parameters. Adding the contribution to the I-source function from thermal emission we can then write the source functions for all the Stokes parameters as \[\begin{aligned}
    S_I\left(\tau_\nu\right)= & \epsilon_\nu B_\nu(t)+\frac{3\left(1-\epsilon_\nu\right)}{8} \\
    & \times\left\{\left[\left(3 J_I-K_I\right)+\left(J_Q-K_Q\right)\right]-\left(\cos ^2 \theta\right)\left[\left(J_I-3 K_I\right)+3\left(J_Q-K_Q\right)\right]\right\} \\
    S_Q\left(\tau_\nu\right)= & \frac{3\left(1-\epsilon_\nu\right)}{8}\left(\sin ^2 \theta\right)\left[\left(J_I-3 K_I\right)+3\left(J_Q-K_Q\right)\right] \\
    S_U\left(\tau_\nu\right)= & 0 \\
    S_V\left(\tau_\nu\right)= & \frac{3\left(1-\epsilon_\nu\right)}{4} F_V \cos \theta
    \end{aligned}\label{16.2.41}\]

    where as before \[\epsilon_\nu\left(\tau_\nu\right) \equiv \frac{\kappa_\nu\left(\tau_\nu\right)}{\kappa_\nu\left(\tau_\nu\right)+\sigma_\nu\left(\tau_\nu\right)}\label{16.2.42}\]

    For an isotropic radiation field such as one could expect deep in the star that \(\mathrm{I_I=3K_I}\) and \(\mathrm{J_Q=K_Q=0}\) since there is no preferred plane in which to measure Q. Under these conditions all fluxes vanish and the source functions become \[\begin{aligned}
    & S_I\left(\tau_\nu\right)=\epsilon_\nu\left(\tau_\nu\right) B_\nu(T)+\left(1-\epsilon_\nu\right) J_\nu\left(\tau_\nu\right) \\
    & S_Q\left(\tau_\nu\right)=S_U\left(\tau_\nu\right)=S_V\left(\tau_\nu\right)=0
    \end{aligned}\label{16.2.43}\]

    which is the expected result for isotropic scattering and is consistent with an isotropic radiation field.

    For anisotropic scattering the source function is no longer independent of θ. However, the θ-dependent terms depend on a collection of moments that measure the departure of the radiation field from isotropy. For a one dimensional beam JQ = KQ, while for an isotropic radiation field JI = 3KI. This is true for both SI and SQ. Indeed, the coefficient of the angular term is the same for both source functions. The fact that SU is zero means that U is zero throughout the atmosphere. This result is not surprising as for U to be non-zero would imply that polarization would have to have a maximum in some plane other than that containing the normal to the atmosphere and the observer (or one perpendicular to it). From symmetry, there can be no such plane and hence U must be zero everywhere. We are now in a position to generate integral equations for quantities that will explicitly determine the variation of the source functions with depth.

    Approximation of SQ in the Upper Atmosphere To begin our analysis of the polarization to be expected from a stellar atmosphere, let us consider the complete source functions for such an atmosphere. These can be written as \[\begin{aligned}
    & S_I\left(\tau_\nu\right)=\epsilon_\nu\left(\tau_\nu\right) B_\nu(T)+X\left(\tau_\nu\right)-Y\left(\tau_\nu\right) \cos ^2 \theta \\
    & S_Q\left(\tau_\nu\right)=Y\left(\tau_\nu\right) \sin ^2 \theta
    \end{aligned}\label{16.2.44}\]

    where \[\begin{aligned}
    & X\left(\tau_\nu\right) \equiv \frac{3}{8}\left[1-\epsilon_\nu\left(\tau_\nu\right)\right]\left\{\left[3 J_I\left(\tau_\nu\right)-K_I\left(\tau_\nu\right)\right]+\left[J_Q\left(\tau_\nu\right)-K_Q\left(\tau_\nu\right)\right]\right\} \\
    & Y\left(\tau_\nu\right) \equiv \frac{3}{8}\left[1-\epsilon_\nu\left(\tau_\nu\right)\right]\left\{\left[J_I\left(\tau_\nu\right)-3 K_I\left(\tau_\nu\right)\right]+3\left[J_Q\left(\tau_\nu\right)-K_Q\left(\tau_\nu\right)\right]\right\}
    \end{aligned}\label{16.2.45}\]

    Let us first consider the pure scattering gray case so that \(\varepsilon_\nu\left(\tau_\nu\right)=0\). Our problem then basically reduces to finding an approximation for \(\mathrm{Y}(\tau_\nu)\). For this part of the discussion, we shall drop the subscript n as radiative equilibrium guarantees constant flux at all frequencies. The quantity (JI-3KI ) occurs in \(\mathrm{Y}(\tau_\nu)\) and normally is the dominant term. However, in the Eddington approximation, this term would be zero. This clearly demonstrates that polarization is a 'second-order' effect and will rely on the departure of the radiation field from isotropy for its existence.

    The entire problem of estimating the polarization then boils down to estimating (J-3K) or the Eddington factor. The Eddington factor was defined in equation (10.4.9) and is a measure of the extent to which the radiation field departs from isotropy. If the radiation field is isotropic then \(f(\tau)=1 / 3\). If the radiation field is strongly forward directed then \(f(\tau)=1\). If the radiation field is somewhat flattened with respect to the forward direction then \(f(\tau)\) can fall below 1/3. Thus we may write the source function for Q as \[S_Q(\tau)=3 \sin ^2 \theta \frac{J_I(\tau)\left[1-3 f_I(\tau)\right]+3 J_Q(\tau)\left[1-f_Q(\tau)\right]}{8}\label{16.2.46}\]

    It is fairly easy to show that if \(\mathrm{J_I}(\tau)\), is given by the gray atmosphere solution \[J_I(\tau)=\frac{3}{4} F_I[\tau+q(\tau)]\label{16.2.47}\]

    where \(\mathrm{q}(\tau)\) is the Hopf function, that \[J_Q(\tau)=C_1 e^{-\alpha \tau}+C_2 e^{-k_1 \tau}\label{16.2.48}\]

    To obtain this result we have simply taken moments of the Q-equation of transfer, assumed \(J_Q(\tau)=1/3\), and used the Chandrasekhar \(n=1\) approximation for \(q(\tau)\). Thus \(k_1\) is the \(n=1\) eigenvalue for the gray equation of transfer (see Table 10.1) and \(\alpha^2=3 / 2\). \(C_1\) and \(C_2\) have opposite signs. Thus \(\mathrm{J_Q}(\tau)\) will indeed vanish rapidly with optical depth. This rapid decline will also extend to \(\mathrm{S_Q}(\tau)\). Thus we will assume that \[Q(0, \mu) \approx S_Q(0, \mu)\label{16.2.49}\]

    This means that \[J_Q(0)=\frac{1}{2} \int_0^1 Q(0, \mu) d \mu=\frac{1}{5} J_I(0) \frac{1-3 f_I(0)}{1+3 f_Q(0) / 5}\label{16.2.50}\]

    This may be substituted back into equation (16.2.46) to yield \[S_Q(0)=\frac{3\left[1-3 f_I(0)\right]}{5\left[1+3 f_Q(0) / 5\right]} J_I(0) \sin ^2 \theta\label{16.2.51}\]

    Approximate Formulas for the Degree of Polarization in a Gray Atmosphere Since we can expect the maximum value of the polarization to occur at the limb (that is, \(\theta=\pi/2\)), both Q(0,0) and I(0,0) will be given by the value of their source functions at the surface. If for I(0,0) we take that to be JI(0), we get for the degree of polarization to be \[P_1(\mu)=\frac{3\left[1-3 f_I(0)\right]}{5\left[1+3 f_Q(0) / 5\right]}\left(1-\mu^2\right)\label{16.2.52}\]

    Just as we calculated \(\mathrm{J_Q}(0)\) in equation (16.2.50) so we can calculate \(\mathrm{K_Q}(0)\) and determine \(f_{\mathrm{Q}}(0)=1/5\). Using the best value for \(f_{\mathrm{I}}(0)\) of 0.4102 [i.e., the 'exact-approximation' of Chandrasekhar], we get that the limb polarization for a gray atmosphere should be 12.35 percent. This is to be compared with Chandrasekhar's (1960) value of 11.713 percent. The accuracy of this result basically stems from the fact that the limb values only require knowledge of the surface values of \(f_{\mathrm{I}}\) and \(f_{\mathrm{Q}}\) so our approximations will give the best values at the limb.

    While the µ-dependence given by equation (16.2.52) will be approximately correct, we can hardly expect it to be accurate. The solution for \(\mu>0\) requires integrating the source function over a range of \(\mu\tau\) which means that we will have to know the source function as a function of \(\tau\). Physically this amounts to including the limb-darkening in the expressions for \(\mathrm{I(0,\mu)}\) and \(\mathrm{Q(0,\mu)}\). Using expressions for the source functions obtained from the same kind of analysis that was used to get \(\mathrm{J_Q(\tau)}\), we get a significant difference in the limb-darkening for I and Q. The increase of the source function for I with depth produces a decrease of the intensity at the limb. The limb is simply a less bright region than the center of the disk. However, the source function for Q declines rapidly as one enters the star so that the region of the star that is the 'brightest' in Q is the limb. Hence Q will suffer 'limb-brightening'. The ratio of these two effects will cause the polarization to drop much faster than \((1-\mu^2)\) as one approaches the center of the disk from the limb. Determining the source functions \(\tau\)-dependence in the same level of approximation that was used to get \(\mathrm{J_Q}(\tau)\) yields \[P_2(\mu)=\frac{3\left\{\left(1-\mu^2\right)\left[1-3 f_I(0)\right] q(0)\right\}\left(1+k_1 \mu\right)}{\left\{5\left[q(0)+\mu\left(1+k_1 q(\infty)\right)+k_1 \mu^2\right]\left[1+3 f_q(0) / 5\right]\right\}}\label{16.2.53}\]

    or substituting in values of the constants for the same order of approximation we get \[P_2(\mu)=-0.107\left(1-\mu^2\right)\left(1+k_1 \mu\right) /\left(1+4.026 \mu+3.416 \mu^2\right)\label{16.2.54}\]

    The value of the polarization for \(\mu=½\) from equation (16.2.52) is \(\mathrm{P_1}(\mu=.5)=9.26\) percent. In contrast, the value given by equation (16.2.54) is \(\mathrm{P_2}(\mu=0.5)=4.1\) percent. This is to be compared with Chandrasekhar's value of 2.252 percent. Thus the limb-darkening is important in determining the center-limb variation of the polarization and it drops much faster than would be anticipated from the simple surface approximation.

    The Wavelength Dependence of Polarization The gray atmosphere, being frequency independent, tells us little about the way in which the polarization can be expected to vary with wavelength. However, it is clear from the approximation formulae for the center-limb variation of the gray polarization, that the polarization is largely determined by the quantity \([1-3f_\mathrm{I}(0)]\). This quantity basically measures the departure from isotropy of the radiation field. In any stellar atmosphere, this will be determined by the gradient of the source function. Since the polarization is largely determined by the atmosphere structure above optical depth unity, we may approximate the source function by a surface term and a gradient so that \[S_I(\tau) \approx S_0(1+b \tau)\label{16.2.55}\]

    Thus, \[\begin{aligned}
    & J_I(0)=\frac{1}{2} \int_0^1 \int_0^{\infty} \frac{S_I(t) e^{-t / \mu} d t}{\mu}=\frac{1}{2} S_I(0)\left(1+\frac{b}{2}\right) \\
    & K_I(0)=\frac{1}{2} \int_0^1 \int_0^{\infty} \frac{\mu^2 S_I(t) e^{-t / \mu} d t}{\mu}=\frac{1}{2} S_I(0)\left(\frac{1}{3}+\frac{b}{4}\right)
    \end{aligned}\label{16.2.56}\]

    This enables us to write the term that measures the isotropy of the radiation field as \[J_I(0)\left[1-3 f_I(0)\right]=\frac{-\frac{1}{2} S_0 b}{4}\label{16.2.57}\]

    Thus replacing the term that measures the anisotropy of the radiation field with \(b\) yields \[P_1(\mu)=-\left(1-\epsilon_\nu\right)\left(\frac{21 b}{250}\right) \frac{\sin ^2 \theta}{1+\mu b}\label{16.2.58}\]

    If for purposes of determining the wavelength dependence of the polarization, we take the source function to be the Planck function, then the normalized gradient \(b_\nu\) becomes \[b_\nu=\left.\frac{1}{B_\nu(0)}\left(\frac{d B_\nu}{d \tau_\nu}\right)\right|_{\tau_\nu=0}=\left.\frac{1}{B_\nu(0)}\left(\frac{d B_\nu}{d T} \frac{d T}{d \tau_\nu}\right)\right|_{\tau_\nu=0}=\left.\frac{1}{B_\nu(0)}\left(\frac{d B_\nu}{d T} \frac{d T}{d \tau_0} \frac{d \tau_0}{d \tau_\nu}\right)\right|_{\tau_\nu=0}\label{16.2.59}\]

    The frequency \(\nu_0\) is the reference frequency at which the temperature gradient is to be evaluated. In this manner, the temperature gradient is the same for all frequencies and the frequency dependence is then determined by the derivative of the Planck function and the term \(\mathrm{d}\tau_0/\mathrm{d}\tau_{\nu}\). This term is simply the ratio of the extinction coefficients at the reference frequency and the frequency of interest. It is this term that is largely responsible for the frequency dependence of the polarization in hot stars.

    If for purposes of approximation we continue to use the Planck function as the source function, we can evaluate much of the expression for \(b_\nu\) explicitly so that \[b_\nu=\beta_\nu \frac{\sigma_0\left(1-\epsilon_\nu\right)}{\sigma_\nu\left(1-\epsilon_0\right)}\label{16.2.60}\]

    where \[\beta_\nu=\frac{h \nu}{k T}\left(1-e^{-h \nu /(k T)}\right)^{-1} \frac{1}{T} \frac{d T}{d \tau_0}\label{16.2.61}\]

    We can write the polarization as \[P_\nu(\mu) \approx \frac{-21\left(1-\mu^2\right)\left(1-\epsilon_\nu\right)^2 \beta_\nu}{250\left(1-\epsilon_0\right)\left(1+b_\nu \mu\right)}\label{16.2.62}\]

    It is clear that for the gray atmosphere \(\epsilon_\nu=\epsilon_0=0\), and the maximum polarization will be achieved where \(\beta_\nu\) is a maximum. By letting \(\alpha=(\mathrm{h_\nu/KT})\), setting \(\mathrm{P_\nu}(0)\) equal to the gray atmosphere result, and using the gray atmosphere temperature distribution, we find that the gray result will occur near \(\alpha=4\). This lies between the maximum of \(\mathrm{B_\nu}\quad(\alpha=2.82)\) and \(\mathrm{B_\lambda}\quad(\alpha=4.97)\) and is virtually identical to the frequency at which \(\nu\mathrm{B_\nu}\quad(\alpha=3.92)\) is a maximum which is practically the same as the frequency for which \([\mathrm{dB\nu/dT}]\quad(\alpha=3.83)\) is a maximum. Thus, to the accuracy of the approximation, the gray polarization will occur at the frequency for which the source function gradient is a maximum. This results from the fact that Q will attain its largest value when \(\mathrm{J_I-3K_I}\) is a maximum. This will occur when the source function gradient is a maximum which, since the radiative flux is driven by that gradient, is also the energy maximum. For larger values of the frequency \(f_\mathrm{I}(0)\) will become larger meaning that the degree of polarization can exceed the gray value, but the magnitude of \(\mathrm{Q_\nu}\) will decreae.

    In the non-gray case, the situation is somewhat more complicated. Here, in addition to the frequency dependence of the source function gradient, the variation of the opacity with frequency strongly influences the value of the polarization. If we assume that the gray result is actually realized for hot stars in the vicinity of the energy maximum, then the polarization in the visible will be roughly given by \[P_\nu(0)=P_{\nu_0}\left[\frac{\left(1-\epsilon_\nu\right)^2}{1-\epsilon_0}\right]\label{16.2.63}\]

    That is, it will drop quadratically with \(\left(1-\varepsilon_\nu\right)\) as one moves into the visible. The change in \(\epsilon_\nu\) with frequency will be largely by the change in \(\kappa_\nu\). To the extent that this is largely due to Hydrogen, we can expect the opacity to vary as \(\nu^{-3}\). Thus, in stars where the dominant extinction is from absorption by hydrogen and the energy maximum is in the far ultraviolet, we can expect the polarization to drop by several powers of ten from its maximum value. In stars where electron scattering is the dominant opacity source, a significant decrease in the degree of polarization will still in the visible as a result of the increase in the pure absorption component of the extinction.

    In the limit of \(\alpha=(\mathrm{h} \nu / \mathrm{kT}) \ll 1\), the source function asymptotically approaches a constant given by the temperature gradient, and the wavelength dependence of the polarization depends only on the opacity effects. In the case where \(\alpha \ll 1\), \(\beta\) is proportional to \(\alpha\) times the normalized temperature gradient and the polarization will continue to rise into the ultraviolet until there is a change in the opacity. Thus we should expect the early type stars to show significant polarization approaching or exceeding the gray value in the vicinity of the Lyman Jump. Little polarization will be in evidence in the Lyman continuum as a result of the marked increase in the absorptive opacity. The polarization will also decline as one moves into the optical part of the spectrum to values typically of the order of 0.01 percent at the limb and will slowly decline throughout the Paschen continuum.

    Everything done so far has presupposed that the dependence of the source function near the surface can be expressed in terms of a first-order Taylor series. In many stars, this is not adequate as there is a very steep plunge in the temperature near the surface. This will require a substantial second derivative of opposite sign from the first derivative to adequately describe the surface behavior of the source function. It is clear from equations (16.2.55) and (16.2.56) that the presence of such a term will increase the value of \(\left(\mathrm{J}_{\mathrm{I}}-3 \mathrm{~K}_{\mathrm{I}}\right)\) and could in principle reverse its sign resulting in a positive degree of polarization (i.e., polarization with the maximum electric vector aligned with the atmospheric normal). Detailed models of gray atmospheres indeed show this and it can be expected any time the second order limb-darkening coefficient to positive and greater than half the magnitude of the first order limb-darkening coefficient.

    e. Implications of the Transfer of Polarization for Stellar Atmospheres.

    It is clear from equations (16.2.33) that in the absence of illumination on the atmosphere, all the \({\mathrm{C}_{N}}’\mathrm{s}\) in the integral equations for the moments [equations (16.2.32)] vanish and all but the first two integral equations become homogeneous. The last two of equations (16.2.18) deal with the propagation of the ellipticity of the polarization, and they make it clear that in the absence of incident elliptically polarized light and sources of such light in the atmosphere, Rayleigh scattering will make no contribution to the Stokes parameter V. The trivial solution V = 0 will prevail throughout the atmosphere, and no amount of circularly polarized light is to be expected. In the absence of incident radiation, the resulting homogeneity of the integral equations for \(\mathrm{Z}(\tau)\) and \(\mathrm{M}(\tau)\) simply means that it is always possible to choose a coordinate system aligned with the plane of the resulting linear polarization so that U = 0 is also true throughout the atmosphere. The only parameter that prevents the integral equations for \(\mathrm{X}(\tau)\) and \(\mathrm{Y}(\tau)\) from becoming homogeneous is the presence of the Planck function tying the radiation field of the gas to the thermal field of the particles.

    Interpretation of the Polarization Moments of Radiation Field From the behavior of the integral equations (16.2.32), it is possible to understand the properties of the Stokes parameters that they represent. For numerical reasons, the scattering fraction \(1-\epsilon(\tau)\) has been absorbed into the definition of the moments. Clearly \(\mathrm{W}(\tau)\) and \(\mathrm{P}(\tau)\) describe the propagation of the Stokes parameter V throughout the atmosphere. Since both U and V are basically geometric, two equations are required to determine the magnitude and quadrant of the angular parameters which they contain. Since the scattering matrix [equation (16.2.19)] is reducible with respect to V (i.e., only the diagonal element describing scatterings from V' into V is nonzero), it is not surprising that these two equations stand alone and can generally be ignored. The two equations for \(\mathrm{Z}(\tau)\) and \(\mathrm{M}(\tau)\) describe the orientation of the plane of linear polarization throughout the atmosphere. Since the orientation of the plane of polarization is also a geometric quantity, no energy is involved in the propagation of \(\mathrm{Z}(\tau)\) and \(\mathrm{M}(\tau)\) so their absence in the equations for radiative equilibrium is explained. The moments \(\mathrm{X}(\tau)\) and \(\mathrm{Y}(\tau)\) describe the actual flow of the energy field associated with the photons. An inspection of the \(\mathrm{X}(\tau)\) equation of equations (16.2.32) shows that \(\mathrm{X}(\tau)\) is very like the mean intensity J. The moment \(\mathrm{Y}(\tau)\) measures the difference between \(\mathrm{I}_l\) and \(\mathrm{I}_r\) and therefore describes the propagation of the degree of linear polarization in the atmosphere. Since both these moments appear in the flux equations, the amount of linear polarization does affect the condition of radiative equilibrium and hence the atmospheric structure. However, in the stellar atmosphere, the effect is not large.

    The reason that there should be any effect at all can be found in the explanation for the existence of scattering lines in stars. Anisotropic scattering redirects the flow of photons from that which would be expected for isotropic scattering. This redirection increases the distance that a photon must travel to escape the atmosphere, thereby increasing the probability that the photon will be absorbed. This effectively increases the extinction coefficient and the opacity of the gas. While the effect is small for Rayleigh scattering, it has been found to play a significant role in the stellar interior when electron scattering is a major source of the opacity. In the stellar atmosphere, the effect on the stellar structure is small because the photon mean free path is comparable to the dimensions of the atmosphere (by definition), so that there is simply not enough space for anisotropic scattering to significantly affect the photon flow.

    However, Rayleigh scattering will always produce linear polarization of the emergent radiation field. Since the Rayleigh phase function correctly describes electron scattering as resonance scattering, we can expect that polarization will be ubiquitous in stellar spectra, even in the absence of incident radiation.

    Effects on the Continuum The early work by Chandrasekhar8,9 on the gray atmosphere showed that one might expect up to 11 percent linear polarization at the limb of a star with a pure scattering atmosphere. As was pointed out earlier, even this rather large polarization would average to zero for observers viewing spherical stars. Hence, Chandrasekhar suggested that the effect be searched for in eclipsing binary systems. Such search led Hiltner13 and Hall14 to independently discover the interstellar polarization. It was not until 1984 that Kemp and colleagues15 verified the existence of the effect in binary systems. However, the measured result was significantly smaller than that anticipated by the gray atmosphere study.

    The reason can be found by considering the effects that tend to destroy intrinsic stellar polarization. The most significant is the near-axial symmetry about the line of sight of most stars. This tends to average any locally generated polarization to much lower values. The second reason results from the sources of the polarization itself. It is obvious that to generate polarization locally on the surface of a star, it is necessary to have opacity sources that produce polarization. In the hot early-type stars (earlier than B3), electron scattering is the dominant source of opacity, while in the late-type stars Rayleigh scattering from molecules is a major source of continuous opacity. Unfortunately, the spectra of these stars are so blanketed by atomic and molecular lines that it is difficult to find a region of the spectrum dominated by the continuous opacity alone. For stars of spectral type between late B and early K, there is a general lack of anisotropic scatterers making a major contribution to the total opacity, so that polarization is generally absent even locally.

    In the early-type stars where anisotropic scattering electrons are abundant, the third reason comes into play and is basically a radiative transfer effect. To understand this rather more subtle effect, consider the following extreme and admittedly contrived examples. Remember that an electron acting as an oscillating dipole will scatter photons which are polarized in a plane perpendicular to the scattering plane. Consider a region of the atmosphere so that the observer sees the last scattering and is therefore in the scattering plane. The distribution of polarized photons that the observer sees will then reflect the distribution of the photons incident on the electrons. Further consider a point near the limb so that photons emerging from deep in the atmosphere will be scattered through 90 degrees and be 100 percent linearly polarized. If the source function gradient is steep, most of the photons scattered toward the observer will indeed come from deep in the atmosphere and exhibit such polarization and the observer will see a beam of light, strongly polarized in a plane tangent to the limb. However, if the source function gradient is shallow or nearly nonexistent, the majority of last scattered photons will originate not from deep within the star, but from positions near the limb. There will then be a surplus of scattered photons, with their scattering plane parallel to the limb and therefore polarized at right angles to the limb. Somewhere between these two extremes there exists a source function gradient which will produce zero net polarization locally at the limb. This argument can be generalized for any point on the surface of the star. Unfortunately, in the early-type stars that abound with anisotropically scattering electrons, the energy maximum lies far in the ultraviolet part of the spectrum. Thus the source function in the optical part of the spectrum is weakly dependent on the temperature and hence has a rather flat gradient. So these stars may exhibit little more than one percent or so of linear polarization in the optical part of the spectrum, even at the limb. The integrated effect, even from a highly distorted star, is minuscule. Such would not be the case for the late-type stars where the energy maximum lies to longer wavelengths than the visible light. These stars have very strong source function gradients in the visible part of the spectrum and should produce large amounts of polarization of the local continuum flux. There remains a class of stars for which significant amounts of polarization may be found in the integrated light. These are the close binaries. Here the source of the polarization is not the star itself, but the scattered light of the companion. Although little has been done in a rigorous fashion to investigate these cases, Kuzma16 has found that measurable polarization should be detected in at least some of these stars. Study of these stars offers the significant advantage that the polarization will be phase dependent and so the effects of interstellar polarization may be removed in an unambiguous manner. Much work remains to be done in this area, but it must be done very carefully. All the effects that determine the degree of polarization of the emergent light depend critically on the temperature distribution of the upper atmosphere. Thus, such studies must include nongray and non-LTE effects. Kuzma16 also included the effect that the incident radiation comes not from a point source, but from a finite solid angle and found that this significantly affected the result. While the task of modeling these stars is formidable, the return for determining the structure of the upper layers of the atmosphere will be great.

    Effects of Polarization on Spectral Lines While that a line formed in pure absorption will be completely unpolarized except for contributions from electron scattering, this is not the case for scattering lines. For resonance lines where an upward transition must be followed quickly by a downward transition, the reemission of the photon is not isotropic, but follows a phase function not unlike the Rayleigh phase function. The scattering process can change the state of polarization in much the same fashion as electron scattering does. Thus, these lines can behave in much the same way as electrons in introducing or modifying the polarization of a beam of light. Since any absorption has a finite probability of being followed immediately by the reemission of a similar photon, any spectral line has the possibility of introducing some polarization into the beam. For example, under normal atmospheric conditions, the formation of \(\mathrm{H\beta}\) results in a scattered photon about 25 percent of the time. For \(\mathrm{H\alpha}\) the fraction can approach 50 percent. In other strong lines such as the Fraunhofer H and K lines of calcium, the fraction is much larger. Thus one should not be surprised if large amounts of local polarization are present in such lines. Unfortunately, the effect will be wavelength dependent.

    The majority of the wavelength dependence arises from the redistribution function by linking one frequency within the line to another. The introduction of a frequency-dependent redistribution function greatly complicates the problem. To handle this problem, it is necessary to express the redistribution function in some analytic form such as those of Hummer (see Section 15.3). Furthermore, one cannot use the angle-averaged forms because the angle dependence is crucial to the transfer of polarized radiation. McKenna17 has developed the formalism from the standpoint of integral equations for the moments of the radiation field for a fairly general class of spectral lines. In the case of coherent scattering, these equations reduce to those presented here. For a general redistribution function, the equations are much more complicated.

    Using the corrected Hummer RIV for the case of the sun, McKenna18 found that weak resonance lines showed a slowly decreasing degree of polarization from the core to the wings. The structure of the line as well as the state of polarization agreed well with observation and reproduced the center-limb variation observed for such lines. For strong resonance lines, a sharp spike in the polarization at the line core was found that rapidly diminished as one moved away from the line core before rising again and then diminishing into the wings as, in the case of the weak lines. Again, this effect is seen in the sun and provides a useful diagnostic tool for the upper atmosphere structure.

    In most stars, except those distorted by rotation, binary systems, and nonradially pulsating stars, symmetry will tend to destroy these effects. But for these other classes of stars, line polarization may provide a useful constraint on their shape and structure. Unfortunately, the correct redistribution function for hydrogen has yet to be worked out. Since the hydrogen lines are readily observed and are often found in situations where the geometry would indicate little symmetry, an analysis of this type extended to these lines would provide an interesting constraint on the physical nature of the system.


    This page titled 16.3: Transfer of Polarized Radiation is shared under a Public Domain license and was authored, remixed, and/or curated by George W. Collins II (Pachart Foundation) via source content that was edited to the style and standards of the LibreTexts platform.