5.2: Dirac's Theory of the Electron
( \newcommand{\kernel}{\mathrm{null}\,}\)
The Dirac Hamiltonian
So far, we have been using p2/2m-type Hamiltonians, which are limited to describing non-relativistic particles. In 1928, Paul Dirac formulated a Hamiltonian that can describe electrons moving close to the speed of light, thus successfully combining quantum theory with special relativity. Another triumph of Dirac’s theory is that it accurately predicts the magnetic moment of the electron.
Dirac’s theory begins from the time-dependent Schrödinger wave equation,
iℏ∂tψ(r,t)=ˆHψ(r,t).
Note that the left side has a first-order time derivative. On the right, the Hamiltonian ˆH contains spatial derivatives in the form of momentum operators. We know that time and space derivatives of wavefunctions are related to energy and momentum by
iℏ∂t↔E,−iℏ∂j↔pj.
We also know that the energy and momentum of a relativistic particle are related by
E2=m2c4+3∑j=1p2jc2,
where m is the rest mass and c is the speed of light. Note that E and p appear to the same order in this equation. (Following the usual practice in relativity theory, we use Roman indices j∈{1,2,3} for the spatial coordinates {x,y,z}.)
Since the left side of the Schrödinger equation (???) has a first-order time derivative, a relativistic Hamiltonian should involve first-order spatial derivatives. So we make the guess
ˆH=α0mc2+3∑j=1αjˆpjc,
where ˆpj≡−iℏ∂/∂xj. The mc2 and c factors are placed for later convenience. We now need to determine the dimensionless “coefficients” α0, α1, α2, and α3.
For a wavefunction with definite momentum p and energy E,
ˆHψ=Eψ⇒(α0mc2+3∑j=1αjpjc)ψ=Eψ.
This is obtained by replacing the ˆpj operators with definite numbers. If ψ is a scalar, this would imply that α0mc2+∑jαjpjc=E for certain scalar coefficients {α0,…,α3}, which does not match the relativistic energy-mass-momentum relation (???).
But we can get things to work if ψ(r,t) is a multi-component wavefunction, rather than a scalar wavefunction, and the α’s are matrices acting on those components via the matrix-vector product operation. In that case,
ˆH=ˆα0mc2+3∑j=1ˆαjˆpjc,whereˆpj≡−iℏ∂j,
where the hats on {ˆα0,…,ˆα3} indicate that they are matrix-valued. Applying the Hamiltonian twice gives
(ˆα0mc2+3∑j=1ˆαjpjc)2ψ=E2ψ.
This can be satisfied if
(ˆα0mc2+3∑j=1ˆαjpjc)2=E2ˆI,
where ˆI is the identity matrix. Expanding the square (and taking care of the fact that the ˆαμ matrices need not commute) yields
ˆα20m2c4+∑j(ˆα0ˆαj+ˆαjˆα0)mc3pj+∑jj′ˆαjˆαj′pjpj′=E2ˆI.
This reduces to Equation (???) if the ˆαμ matrices satisfy
ˆα2μ=ˆIforμ=0,1,2,3,andˆαμˆαν+ˆανˆαμ=0forμ≠ν.
(We use Greek symbols for indices ranging over the four spacetime coordinates {0,1,2,3}.) The above can be written more concisely using the anticommutator:
{ˆαμ,ˆαν}=2δμν,forμ,ν=0,1,2,3.
Also, we need the ˆαμ matrices to be Hermitian, so that ˆH is Hermitian.
It turns out that the smallest possible Hermitian matrices that can satisfy Equation (???) are 4×4 matrices. The choice of matrices (or “representation”) is not uniquely determined. One particularly useful choice is called the Dirac representation:
ˆα0=[ˆIˆ0ˆ0−ˆI],ˆα1=[ˆ0ˆσ1ˆσ1ˆ0]ˆα2=[ˆ0ˆσ2ˆσ2ˆ0],ˆα3=[ˆ0ˆσ3ˆσ3ˆ0],
where {ˆσ1,ˆσ2,ˆσ3} denote the usual Pauli matrices. Since the ˆαμ’s are 4×4 matrices, it follows that ψ(r) is a four-component field.
Eigenstates of the Dirac Hamiltonian
According to Equation (???), the energy eigenvalues of the Dirac Hamiltonian are
E=±√m2c4+∑jp2jc2.
This is plotted below:

The energy spectrum forms two hyperbolic bands. For each p, there are two degenerate positive energy eigenvalues, and two degenerate negative energy eigenvalues, for a total of four eigenvalues (matching the number of wavefunction components). The upper band matches the dispersion relation for a massive relativistic particle, as desired. But what about the negative-energy band? Who ordered that?
It might be possible for us to ignore the existence of the negative-energy states, if we only ever consider an isolated electron; we could just declare the positive-energy states to be the ones we are interested in, and ignore the others. However, the problem becomes hard to dismiss once we let the electron interact with another system, such as the electromagnetic field. Under such circumstances, the availability of negative-energy states extending down to E→−∞ would destabilize the positive-energy electron states, since the electron can repeatedly hop to states with ever more negative energies by shedding energy (e.g., by emitting photons). This is obviously problematic. However, let us wait for a while (till Section 5.2) to discuss how the stability problem might be resolved.
For now, let us take a closer look at the meaning of the Dirac wavefunction. Its four components represent a four-fold “internal” degree of freedom, distinct from the electron’s ordinary kinematic degrees of freedom. Since there are two energy bands, the assignment of an electron to the upper or lower band (or some superposition thereof) consitutes two degrees of freedom. Each band must then posssess a two-fold degree of freedom (so that 2×2=4), which turns out to be associated with the electron’s spin.
To see explicitly how this works, let us pick a representation for the ˆαμ matrices. The choice of representation determines how the four degrees of freedom are encoded in the individual wavefunction components. We will use the Dirac representation (5.2.13). In this case, it is convenient to divide the components into upper and lower parts,
ψ(r,t)=[ψA(r,t)ψB(r,t)],
where ψA and ψB have two components each. Then, for an eigenstate with energy E and momentum p, applying (5.2.13) to the Dirac equation (???) gives
ψA=1E−mc2∑jˆσjpjψB,ψB=1E+mc2∑jˆσjpjψA.
Consider the non-relativistic limit, |p|→0, for which E approaches either mc2 or −mc2. For the upper band (E≳mc2), the vanishing of the denominator in Equation (5.2.17) tells us that the wavefunction is dominated by ψA. Conversely, for the lower band (E≲−mc2), Equation (5.2.18) tells us that the wavefunction is dominated by ψB. We can thus associate the upper (A) and lower (B) components with the band degree of freedom. Note, however, that this is only an approximate association that holds in the non-relativistic limit! In the relativistic regime, upper-band states can have non-vanishing values in the B components, and vice versa. (There does exist a way to make the upper/lower spinor components correspond rigorously to positive/negative energies, but this requires a more complicated representation than the Dirac representation, for details, see Foldy and Wouthuysen (1950).)
Dirac electrons in an electromagnetic field
To continue pursuing our objective of interpreting the Dirac wavefunction, we must determine how the electron interacts with an electromagnetic field. We introduce electromagnetism by following the same procedure as in the non-relativistic theory (Section 5.1): add −eΦ(r,t) as a scalar potential function, and add the vector potential via the substitution
ˆp→ˆp+eA(ˆr,t).
Applying this recipe to the Dirac Hamiltonian (???) yields
iℏ∂tψ={ˆα0mc2−eΦ(r,t)+∑jˆαj[−iℏ∂j+eAj(r,t)]c}ψ(r,t).
You can check that this has the same gauge symmetry properties as the non-relativistic theory discussed in Section 5.1.
In the Dirac representation (5.2.13), Equation (???) reduces to
iℏ∂tψA=(+mc2−eΦ)ψA+∑jˆσj(−iℏ∂j+eAj)cψBiℏ∂tψB=(−mc2−eΦ)ψB+∑jˆσj(−iℏ∂j+eAj)cψA,
where ψA and ψB are the previously-introduced two-component objects corresponding to the upper and lower halves of the Dirac wavefunction.
In the non-relativistic limit, solutions to the above equations can be cast in the form
ψA(r,t)=ΨA(r,t)exp[−i(mc2ℏ)t]ψB(r,t)=ΨB(r,t)exp[−i(mc2ℏ)t].
The exponentials on the right side are the exp(−iωt) factor corresponding to the rest energy mc2, which dominates the electron’s energy in the non-relativistic limit. (Note that by using +mc2 rather than −mc2, we are explicitly referencing the positive-energy band.) If the electron is in an eigenstate with p=0 and there are no electromagnetic fields, ΨA and ΨB would just be constants. Now suppose the electron is non-relativistic but not in a p=0 eigenstate, and the electromagnetic fields are weak but not necessarily vanishing. In that case, ΨA and ΨB are functions that vary with t, but slowly.
Plugging this ansatz into Equations (5.2.22)–(5.2.23) gives
iℏ∂tΨA=−eΦΨA+∑jˆσj(−iℏ∂j+eAj)cΨB(iℏ∂t+2mc2+eΦ)ΨB=∑jˆσj(−iℏ∂j+eAj)cΨA.
On the left side of Equation (5.2.28), the 2mc2 term dominates over the other two, so
ΨB≈12mc∑jˆσj(−iℏ∂j+eAj)ΨA.
Plugging this into Equation (5.2.27) yields
iℏ∂tΨA={−eΦ+12m∑jkˆσjˆσk(−iℏ∂j+eAj)(−iℏ∂k+eAk)}ΨA.
Using the identity ˆσjˆσk=δjkˆI+i∑iεijkσi:
iℏ∂tΨA={−eΦ+12m|−iℏ∇+eA|2+i2m∑ijkεijkˆσi(−iℏ∂j+eAj)(−iℏ∂k+eAk)}ΨA.
Look carefully at the last term in the curly brackets. Expanding the square yields
i2m∑ijkεijkˆσi(−∂j∂k−iℏe∂jAk−iℏe[Ak∂j+Aj∂k]+e2AjAk).
Due to the antisymmetry of εijk, all terms inside the parentheses that are symmetric under j and k cancel out when summed over. The only survivor is the second term, which gives
ℏe2m∑ijkεijkˆσi∂jAk=ℏe2mˆσ⋅B(r,t),
where B=∇×A is the magnetic field. Hence,
iℏ∂tΨA={−eΦ+12m|−iℏ∇+eA|2−(−ℏe2mˆσ)⋅B}ΨA.
This is an exact match for Equation (5.1.20), except that the Hamiltonian has an additional term of the form −ˆμ⋅ˆB. This additional term corresponds to the potential energy of a magnetic dipole of moment μ in a magnetic field B. The Dirac theory therefore predicts the electron’s magnetic dipole moment to be
|μ|=ℏe2m.
Remarkably, this matches the experimentally-observed magnetic dipole moment to about one part in 103. The residual mismatch between Equation (???) and the actual magnetic dipole moment of the electron is understood to arise from quantum fluctuations of the electronic and electromagnetic quantum fields. Using the full theory of quantum electrodynamics, that “anomalous magnetic moment” can also be calculated and matches experiment to around one part in 109, making it one of the most precise theoretical predictions in physics! For details, see Zee (2010).
It is noteworthy that we did not set out to include spin in the theory, yet it arose, seemingly unavoidably, as a by-product of formulating a relativistic theory of the electron. This is a manifestation of the general principle that relativistic quantum theory is more constrained than non-relativistic quantum theory Dyson (1951). Due to the demands imposed by relativistic symmetries, spin is not allowed to be an optional part of the theory of the relativistic electron—it has to be built into the theory at a fundamental level.
Positrons and Dirac Field Theory
As noted in Section 5.2, the stability of the quantum states described by the Dirac equation is threatened by the presence of negative-energy solutions. To get around this problem, Dirac suggested that what we regard as the “vacuum” may actually be a state, called the Dirac sea, in which all negative-energy states are occupied. Since electrons are fermions, the Pauli exclusion principle would then forbid decay into the negative-energy states, stabilizing the positive-energy states.
At first blush, the idea seems ridiculous; how can the vacuum contain an infinite number of particles? However, we shall see that the idea becomes more plausible if the Dirac equation is reinterpreted as a single-particle construction which arises from a more fundamental quantum field theory. The Dirac sea idea is an inherently multi-particle concept, and we know from Chapter 4 that quantum field theory is a natural framework for describing multi-particle quantum states. Let us therefore develop this theory.
Consider again the eigenstates of the single-particle Dirac Hamiltonian with definite momenta and energies. Denote the positive-energy wavefunctions by
ukσeik⋅r(2π)3/2=⟨r|k,+,σ⟩,whereˆH|k,+,σ⟩=ϵkσ|k,+,σ⟩.
The negative-energy wavefunctions are
vkσe−ik⋅r(2π)3/2=⟨r|k,−,σ⟩,whereˆH|k,−,σ⟩=−ϵkσ|k,−,σ⟩.
Note that |k,−,σ⟩ denotes a negative-energy eigenstate with momentum −ℏk, not ℏk. The reason for this notation, which uses different symbols to label the positive-energy and negative-energy states, will become clear later. Each of the ukσ and vkσ terms are four-component objects (spinors), and for any given k, the set
{ukσ,vk,σ|σ=1,2}
forms an orthonormal basis for the four-dimensional spinor space. Thus,
∑n(unkσ)∗unkσ′=δσσ′,∑n(unkσ)∗vnkσ′=0,etc.
Here we use the notation where unkσ is the n-th component of the ukσ spinor, and likewise for the v’s.
Following the second quantization procedure from Chapter 4, let us introduce a fermionic Fock space HF, as well as a set of creation/annihilation operators:
ˆb†kσandˆbkσcreate/annihilate|k,+,σ⟩ˆd†kσandˆdkσcreate/annihilate|k,−,σ⟩.
These obey the fermionic anticommutation relations
{ˆbkσ,ˆb†k′σ′}=δ3(k−k′)δσσ′,{ˆdkσ,ˆd†k′σ′}=δ3(k−k′)δσσ′{ˆbkσ,ˆbk′σ′}={ˆbkσ,ˆdk′σ′}={ˆdkσ,ˆdk′σ′}=0,etc.
The Hamiltonian is
ˆH=∫d3k∑σϵkσ(ˆb†kσˆbkσ−ˆd†kσˆdkσ),
and applying the annihilation operators to the vacuum state |∅⟩ gives zero:
ˆbkσ|∅⟩=ˆdkσ|∅⟩=0.
When formulating bosonic field theory, we defined a local field annihilation operator that annihilates a particle at a given point r. In the infinite-system limit, this took the form
ˆψ(r)=∫d3kφk(r)ˆak,
and the orthonormality of the φk wavefunctions implied that [ˆψ(r),ˆψ†(r′)]=δ3(r−r′). Similarly, we can use the Dirac Hamiltonian’s eigenfunctions (???)–(???) to define
ˆψn(r)=∫d3k(2π)3/2∑σ(unkσeik⋅rˆbkσ+vnkσe−ik⋅rˆdkσ).
Note that there are two terms in the parentheses because the positive-energy and negative-energy states are denoted by differently-labeled annihilation operators. Moreover, since the wavefunctions are four-component spinors, the field operators have a spinor index n. Using the spinor orthonormality conditions (???) and the anticommutation relations (5.2.44), we can show that
{ˆψn(r),ˆψ†n′(r′)}=δnn′δ3(r−r′),
with all other anticommutators vanishing. Hence, ˆψn(r) can be regarded as an operator that annihilates a four-component fermion at point r.
Now let us define the operators
ˆckσ=ˆd†kσ.
Using these, the fermionic anticommutation relations can be re-written as
{ˆbkσ,ˆb†k′σ′}=δ3(k−k′)δσσ′,{ˆckσ,ˆc†k′σ′}=δ3(k−k′)δσσ′{ˆbkσ,ˆbk′σ′}={ˆbkσ,ˆck′σ′}={ˆckσ,ˆck′σ′}=0,etc.
Hence ˆc†kσ and ˆckσ formally satisfy the criteria to be regarded as creation and annihilation operators. The particle created by ˆc†kσ is called a positron, and is equivalent to the absence of a d-type particle (i.e., a negative-energy electron).
The Hamiltonian (???) can now be written as
ˆH=∫d3k∑σϵkσ(ˆb†kσˆbkσ+ˆc†kσˆckσ)+constant,
which explicitly shows that the positrons have positive energies (i.e., the absence of a negative-energy particle is equivalent to the presence of a positive-energy particle). With further analysis, which we will skip, it can be shown that the positron created by ˆc†kσ has positive charge e and momentum ℏk. The latter is thanks to the definition adopted in Equation (???); the absence of a momentum −ℏk particle is equivalent to the presence of a momentum ℏk particle. As for the field annihilation operator (???), it can be written as
ˆψn(r)=∫d3k(2π)3/2∑σ(unkσeik⋅rˆbkσ+vnkσe−ik⋅rˆc†kσ).
The c-type annihilation operators do not annihilate |∅⟩. However, let us define
|∅′⟩=∏kσˆd†kσ|∅⟩,
which is evidently a formal description of the Dirac sea state. Then
ˆckσ|∅′⟩=ˆd†kσ∏k′σ′ˆd†k′σ′|∅⟩=0.
At the end of the day, we can regard the quantum field theory as being defined in terms of b-type and c-type operators, using the anticommutators (5.2.52), the Hamiltonian (???), and the field operator (???), along with the vacuum state |∅′⟩. The elementary particles in this theory are electrons and positrons with strictly positive energies. The single-particle Dirac theory, with its quirky negative-energy states, can then be interpreted as a special construct that maps the quantum field theory into single-particle language. Even though we actually started from the single-particle description, it is the quantum field theory, and its vacuum state |∅′⟩, that is more fundamental.
There are many more details about the Dirac theory that we will not discuss here. One particularly important issue is how the particles transform under Lorentz boosts and other changes in coordinate system. For such details, the reader is referred to Dyson (1951).