Processing math: 100%
Skip to main content
Library homepage
 

Text Color

Text Size

 

Margin Size

 

Font Type

Enable Dyslexic Font
Physics LibreTexts

5.2: Dirac's Theory of the Electron

( \newcommand{\kernel}{\mathrm{null}\,}\)

The Dirac Hamiltonian

So far, we have been using p2/2m-type Hamiltonians, which are limited to describing non-relativistic particles. In 1928, Paul Dirac formulated a Hamiltonian that can describe electrons moving close to the speed of light, thus successfully combining quantum theory with special relativity. Another triumph of Dirac’s theory is that it accurately predicts the magnetic moment of the electron.

Dirac’s theory begins from the time-dependent Schrödinger wave equation,

itψ(r,t)=ˆHψ(r,t).

Note that the left side has a first-order time derivative. On the right, the Hamiltonian ˆH contains spatial derivatives in the form of momentum operators. We know that time and space derivatives of wavefunctions are related to energy and momentum by

itE,ijpj.

We also know that the energy and momentum of a relativistic particle are related by

E2=m2c4+3j=1p2jc2,

where m is the rest mass and c is the speed of light. Note that E and p appear to the same order in this equation. (Following the usual practice in relativity theory, we use Roman indices j{1,2,3} for the spatial coordinates {x,y,z}.)

Since the left side of the Schrödinger equation (???) has a first-order time derivative, a relativistic Hamiltonian should involve first-order spatial derivatives. So we make the guess

ˆH=α0mc2+3j=1αjˆpjc,

where ˆpji/xj. The mc2 and c factors are placed for later convenience. We now need to determine the dimensionless “coefficients” α0, α1, α2, and α3.

For a wavefunction with definite momentum p and energy E,

ˆHψ=Eψ(α0mc2+3j=1αjpjc)ψ=Eψ.

This is obtained by replacing the ˆpj operators with definite numbers. If ψ is a scalar, this would imply that α0mc2+jαjpjc=E for certain scalar coefficients {α0,,α3}, which does not match the relativistic energy-mass-momentum relation (???).

But we can get things to work if ψ(r,t) is a multi-component wavefunction, rather than a scalar wavefunction, and the α’s are matrices acting on those components via the matrix-vector product operation. In that case,

ˆH=ˆα0mc2+3j=1ˆαjˆpjc,whereˆpjij,

where the hats on {ˆα0,,ˆα3} indicate that they are matrix-valued. Applying the Hamiltonian twice gives

(ˆα0mc2+3j=1ˆαjpjc)2ψ=E2ψ.

This can be satisfied if

(ˆα0mc2+3j=1ˆαjpjc)2=E2ˆI,

where ˆI is the identity matrix. Expanding the square (and taking care of the fact that the ˆαμ matrices need not commute) yields

ˆα20m2c4+j(ˆα0ˆαj+ˆαjˆα0)mc3pj+jjˆαjˆαjpjpj=E2ˆI.

This reduces to Equation (???) if the ˆαμ matrices satisfy

ˆα2μ=ˆIforμ=0,1,2,3,andˆαμˆαν+ˆανˆαμ=0forμν.

(We use Greek symbols for indices ranging over the four spacetime coordinates {0,1,2,3}.) The above can be written more concisely using the anticommutator:

{ˆαμ,ˆαν}=2δμν,forμ,ν=0,1,2,3.

Also, we need the ˆαμ matrices to be Hermitian, so that ˆH is Hermitian.

It turns out that the smallest possible Hermitian matrices that can satisfy Equation (???) are 4×4 matrices. The choice of matrices (or “representation”) is not uniquely determined. One particularly useful choice is called the Dirac representation:

ˆα0=[ˆIˆ0ˆ0ˆI],ˆα1=[ˆ0ˆσ1ˆσ1ˆ0]ˆα2=[ˆ0ˆσ2ˆσ2ˆ0],ˆα3=[ˆ0ˆσ3ˆσ3ˆ0],

where {ˆσ1,ˆσ2,ˆσ3} denote the usual Pauli matrices. Since the ˆαμ’s are 4×4 matrices, it follows that ψ(r) is a four-component field.

Eigenstates of the Dirac Hamiltonian

According to Equation (???), the energy eigenvalues of the Dirac Hamiltonian are

E=±m2c4+jp2jc2.

This is plotted below:

clipboard_ee0b451796da75c30e10c7848ffee7750.png
Figure 5.2.2

The energy spectrum forms two hyperbolic bands. For each p, there are two degenerate positive energy eigenvalues, and two degenerate negative energy eigenvalues, for a total of four eigenvalues (matching the number of wavefunction components). The upper band matches the dispersion relation for a massive relativistic particle, as desired. But what about the negative-energy band? Who ordered that?

It might be possible for us to ignore the existence of the negative-energy states, if we only ever consider an isolated electron; we could just declare the positive-energy states to be the ones we are interested in, and ignore the others. However, the problem becomes hard to dismiss once we let the electron interact with another system, such as the electromagnetic field. Under such circumstances, the availability of negative-energy states extending down to E would destabilize the positive-energy electron states, since the electron can repeatedly hop to states with ever more negative energies by shedding energy (e.g., by emitting photons). This is obviously problematic. However, let us wait for a while (till Section 5.2) to discuss how the stability problem might be resolved.

For now, let us take a closer look at the meaning of the Dirac wavefunction. Its four components represent a four-fold “internal” degree of freedom, distinct from the electron’s ordinary kinematic degrees of freedom. Since there are two energy bands, the assignment of an electron to the upper or lower band (or some superposition thereof) consitutes two degrees of freedom. Each band must then posssess a two-fold degree of freedom (so that 2×2=4), which turns out to be associated with the electron’s spin.

To see explicitly how this works, let us pick a representation for the ˆαμ matrices. The choice of representation determines how the four degrees of freedom are encoded in the individual wavefunction components. We will use the Dirac representation (5.2.13). In this case, it is convenient to divide the components into upper and lower parts,

ψ(r,t)=[ψA(r,t)ψB(r,t)],

where ψA and ψB have two components each. Then, for an eigenstate with energy E and momentum p, applying (5.2.13) to the Dirac equation (???) gives

ψA=1Emc2jˆσjpjψB,ψB=1E+mc2jˆσjpjψA.

Consider the non-relativistic limit, |p|0, for which E approaches either mc2 or mc2. For the upper band (Emc2), the vanishing of the denominator in Equation (5.2.17) tells us that the wavefunction is dominated by ψA. Conversely, for the lower band (Emc2), Equation (5.2.18) tells us that the wavefunction is dominated by ψB. We can thus associate the upper (A) and lower (B) components with the band degree of freedom. Note, however, that this is only an approximate association that holds in the non-relativistic limit! In the relativistic regime, upper-band states can have non-vanishing values in the B components, and vice versa. (There does exist a way to make the upper/lower spinor components correspond rigorously to positive/negative energies, but this requires a more complicated representation than the Dirac representation, for details, see Foldy and Wouthuysen (1950).)

Dirac electrons in an electromagnetic field

To continue pursuing our objective of interpreting the Dirac wavefunction, we must determine how the electron interacts with an electromagnetic field. We introduce electromagnetism by following the same procedure as in the non-relativistic theory (Section 5.1): add eΦ(r,t) as a scalar potential function, and add the vector potential via the substitution

ˆpˆp+eA(ˆr,t).

Applying this recipe to the Dirac Hamiltonian (???) yields

itψ={ˆα0mc2eΦ(r,t)+jˆαj[ij+eAj(r,t)]c}ψ(r,t).

You can check that this has the same gauge symmetry properties as the non-relativistic theory discussed in Section 5.1.

In the Dirac representation (5.2.13), Equation (???) reduces to

itψA=(+mc2eΦ)ψA+jˆσj(ij+eAj)cψBitψB=(mc2eΦ)ψB+jˆσj(ij+eAj)cψA,

where ψA and ψB are the previously-introduced two-component objects corresponding to the upper and lower halves of the Dirac wavefunction.

In the non-relativistic limit, solutions to the above equations can be cast in the form

ψA(r,t)=ΨA(r,t)exp[i(mc2)t]ψB(r,t)=ΨB(r,t)exp[i(mc2)t].

The exponentials on the right side are the exp(iωt) factor corresponding to the rest energy mc2, which dominates the electron’s energy in the non-relativistic limit. (Note that by using +mc2 rather than mc2, we are explicitly referencing the positive-energy band.) If the electron is in an eigenstate with p=0 and there are no electromagnetic fields, ΨA and ΨB would just be constants. Now suppose the electron is non-relativistic but not in a p=0 eigenstate, and the electromagnetic fields are weak but not necessarily vanishing. In that case, ΨA and ΨB are functions that vary with t, but slowly.

Plugging this ansatz into Equations (5.2.22)(5.2.23) gives

itΨA=eΦΨA+jˆσj(ij+eAj)cΨB(it+2mc2+eΦ)ΨB=jˆσj(ij+eAj)cΨA.

On the left side of Equation (5.2.28), the 2mc2 term dominates over the other two, so

ΨB12mcjˆσj(ij+eAj)ΨA.

Plugging this into Equation (5.2.27) yields

itΨA={eΦ+12mjkˆσjˆσk(ij+eAj)(ik+eAk)}ΨA.

Using the identity ˆσjˆσk=δjkˆI+iiεijkσi:

itΨA={eΦ+12m|i+eA|2+i2mijkεijkˆσi(ij+eAj)(ik+eAk)}ΨA.

Look carefully at the last term in the curly brackets. Expanding the square yields

i2mijkεijkˆσi(jkiejAkie[Akj+Ajk]+e2AjAk).

Due to the antisymmetry of εijk, all terms inside the parentheses that are symmetric under j and k cancel out when summed over. The only survivor is the second term, which gives

e2mijkεijkˆσijAk=e2mˆσB(r,t),

where B=×A is the magnetic field. Hence,

itΨA={eΦ+12m|i+eA|2(e2mˆσ)B}ΨA.

This is an exact match for Equation (5.1.20), except that the Hamiltonian has an additional term of the form ˆμˆB. This additional term corresponds to the potential energy of a magnetic dipole of moment μ in a magnetic field B. The Dirac theory therefore predicts the electron’s magnetic dipole moment to be

|μ|=e2m.

Remarkably, this matches the experimentally-observed magnetic dipole moment to about one part in 103. The residual mismatch between Equation (???) and the actual magnetic dipole moment of the electron is understood to arise from quantum fluctuations of the electronic and electromagnetic quantum fields. Using the full theory of quantum electrodynamics, that “anomalous magnetic moment” can also be calculated and matches experiment to around one part in 109, making it one of the most precise theoretical predictions in physics! For details, see Zee (2010).

It is noteworthy that we did not set out to include spin in the theory, yet it arose, seemingly unavoidably, as a by-product of formulating a relativistic theory of the electron. This is a manifestation of the general principle that relativistic quantum theory is more constrained than non-relativistic quantum theory Dyson (1951). Due to the demands imposed by relativistic symmetries, spin is not allowed to be an optional part of the theory of the relativistic electron—it has to be built into the theory at a fundamental level.

Positrons and Dirac Field Theory

As noted in Section 5.2, the stability of the quantum states described by the Dirac equation is threatened by the presence of negative-energy solutions. To get around this problem, Dirac suggested that what we regard as the “vacuum” may actually be a state, called the Dirac sea, in which all negative-energy states are occupied. Since electrons are fermions, the Pauli exclusion principle would then forbid decay into the negative-energy states, stabilizing the positive-energy states.

At first blush, the idea seems ridiculous; how can the vacuum contain an infinite number of particles? However, we shall see that the idea becomes more plausible if the Dirac equation is reinterpreted as a single-particle construction which arises from a more fundamental quantum field theory. The Dirac sea idea is an inherently multi-particle concept, and we know from Chapter 4 that quantum field theory is a natural framework for describing multi-particle quantum states. Let us therefore develop this theory.

Consider again the eigenstates of the single-particle Dirac Hamiltonian with definite momenta and energies. Denote the positive-energy wavefunctions by

ukσeikr(2π)3/2=r|k,+,σ,whereˆH|k,+,σ=ϵkσ|k,+,σ.

The negative-energy wavefunctions are

vkσeikr(2π)3/2=r|k,,σ,whereˆH|k,,σ=ϵkσ|k,,σ.

Note that |k,,σ denotes a negative-energy eigenstate with momentum k, not k. The reason for this notation, which uses different symbols to label the positive-energy and negative-energy states, will become clear later. Each of the ukσ and vkσ terms are four-component objects (spinors), and for any given k, the set

{ukσ,vk,σ|σ=1,2}

forms an orthonormal basis for the four-dimensional spinor space. Thus,

n(unkσ)unkσ=δσσ,n(unkσ)vnkσ=0,etc.

Here we use the notation where unkσ is the n-th component of the ukσ spinor, and likewise for the v’s.

Following the second quantization procedure from Chapter 4, let us introduce a fermionic Fock space HF, as well as a set of creation/annihilation operators:

ˆbkσandˆbkσcreate/annihilate|k,+,σˆdkσandˆdkσcreate/annihilate|k,,σ.

These obey the fermionic anticommutation relations

{ˆbkσ,ˆbkσ}=δ3(kk)δσσ,{ˆdkσ,ˆdkσ}=δ3(kk)δσσ{ˆbkσ,ˆbkσ}={ˆbkσ,ˆdkσ}={ˆdkσ,ˆdkσ}=0,etc.

The Hamiltonian is

ˆH=d3kσϵkσ(ˆbkσˆbkσˆdkσˆdkσ),

and applying the annihilation operators to the vacuum state | gives zero:

ˆbkσ|=ˆdkσ|=0.

When formulating bosonic field theory, we defined a local field annihilation operator that annihilates a particle at a given point r. In the infinite-system limit, this took the form

ˆψ(r)=d3kφk(r)ˆak,

and the orthonormality of the φk wavefunctions implied that [ˆψ(r),ˆψ(r)]=δ3(rr). Similarly, we can use the Dirac Hamiltonian’s eigenfunctions (???)(???) to define

ˆψn(r)=d3k(2π)3/2σ(unkσeikrˆbkσ+vnkσeikrˆdkσ).

Note that there are two terms in the parentheses because the positive-energy and negative-energy states are denoted by differently-labeled annihilation operators. Moreover, since the wavefunctions are four-component spinors, the field operators have a spinor index n. Using the spinor orthonormality conditions (???) and the anticommutation relations (5.2.44), we can show that

{ˆψn(r),ˆψn(r)}=δnnδ3(rr),

with all other anticommutators vanishing. Hence, ˆψn(r) can be regarded as an operator that annihilates a four-component fermion at point r.

Now let us define the operators

ˆckσ=ˆdkσ.

Using these, the fermionic anticommutation relations can be re-written as

{ˆbkσ,ˆbkσ}=δ3(kk)δσσ,{ˆckσ,ˆckσ}=δ3(kk)δσσ{ˆbkσ,ˆbkσ}={ˆbkσ,ˆckσ}={ˆckσ,ˆckσ}=0,etc.

Hence ˆckσ and ˆckσ formally satisfy the criteria to be regarded as creation and annihilation operators. The particle created by ˆckσ is called a positron, and is equivalent to the absence of a d-type particle (i.e., a negative-energy electron).

The Hamiltonian (???) can now be written as

ˆH=d3kσϵkσ(ˆbkσˆbkσ+ˆckσˆckσ)+constant,

which explicitly shows that the positrons have positive energies (i.e., the absence of a negative-energy particle is equivalent to the presence of a positive-energy particle). With further analysis, which we will skip, it can be shown that the positron created by ˆckσ has positive charge e and momentum k. The latter is thanks to the definition adopted in Equation (???); the absence of a momentum k particle is equivalent to the presence of a momentum k particle. As for the field annihilation operator (???), it can be written as

ˆψn(r)=d3k(2π)3/2σ(unkσeikrˆbkσ+vnkσeikrˆckσ).

The c-type annihilation operators do not annihilate |. However, let us define

|=kσˆdkσ|,

which is evidently a formal description of the Dirac sea state. Then

ˆckσ|=ˆdkσkσˆdkσ|=0.

At the end of the day, we can regard the quantum field theory as being defined in terms of b-type and c-type operators, using the anticommutators (5.2.52), the Hamiltonian (???), and the field operator (???), along with the vacuum state |. The elementary particles in this theory are electrons and positrons with strictly positive energies. The single-particle Dirac theory, with its quirky negative-energy states, can then be interpreted as a special construct that maps the quantum field theory into single-particle language. Even though we actually started from the single-particle description, it is the quantum field theory, and its vacuum state |, that is more fundamental.

There are many more details about the Dirac theory that we will not discuss here. One particularly important issue is how the particles transform under Lorentz boosts and other changes in coordinate system. For such details, the reader is referred to Dyson (1951).


This page titled 5.2: Dirac's Theory of the Electron is shared under a CC BY-SA 4.0 license and was authored, remixed, and/or curated by Y. D. Chong via source content that was edited to the style and standards of the LibreTexts platform.

Support Center

How can we help?