9.1: Time-Independent Perturbation Theory
( \newcommand{\kernel}{\mathrm{null}\,}\)
If an atom (not necessarily in its ground state) is placed in an external electric field, the energy levels shift, and the wavefunctions are distorted. This is called the Stark effect. The new energy levels and wavefunctions could in principle be found by writing down a complete Hamiltonian, including the external field, and finding the eigenkets. This actually can be done in one case: the hydrogen atom, but even there, if the external field is small compared with the electric field inside the atom (which is billions of volts per meter) it is easier to compute the changes in the energy levels and wavefunctions with a scheme of successive corrections to the zero-field values. This method, termed perturbation theory, is the single most important method of solving problems in quantum mechanics, and is widely used in atomic physics, condensed matter and particle physics.
It should be noted that there are problems which cannot be solved using perturbation theory, even when the perturbation is very weak, although such problems are the exception rather than the rule. One such case is the one-dimensional problem of free particles perturbed by a localized potential of strength λ. As we found earlier in the course, switching on an arbitrarily weak attractive potential causes the k=0 free particle wavefunction to drop below the continuum of plane wave energies and become a localized bound state with binding energy of order λ2. However, changing the sign of λ to give a repulsive potential there is no bound state, the lowest energy plane wave state stays at energy zero. Therefore the energy shift on switching on the perturbation cannot be represented as a power series in λ, the strength of the perturbation. This particular difficulty does not in general occur in three dimensions, where arbitrarily weak potentials do not give bound states—except for certain many-body problems (like the Cooper pair problem) where the exclusion principle reduces the effective dimensionality of the available states.
The Perturbation Series
We begin with a Hamiltonian H0 having known eigenkets and eigenenergies: H0|n0⟩=E0n|n0⟩.
The task is to find how these eigenkets and eigenenergies change if a small term H1 (an external field, for example) is added to the Hamiltonian, so: (H0+H1)|n⟩=En|n⟩.
That is to say, on switching on H1, |n0⟩→|n⟩, E0n→En.
The basic assumption in perturbation theory is that H1 is sufficiently small that the leading corrections are the same order of magnitude as H1 itself, and the true energies can be better and better approximated by a successive series of corrections, each of order H1/H0 compared with the previous one.
The strategy, then, is to expand the true wavefunction and corresponding eigenenergy as series in H1/H0. These series are then fed into (H0+H1)|n⟩=En|n⟩, and terms of the same order of magnitude in H1/H0 on the two sides are set equal. The equations thus generated are solved one by one to give progressively more accurate results.
To make it easier to identify terms of the same order in H1/H0 on the two sides of the equation, it is convenient to introduce a dimensionless parameter λ which always goes with H1, and then expand |n⟩, En as power series in λ, |n⟩=|n0⟩+λ|n1⟩+λ2|n2⟩+…, etc. The ket |nm⟩ multiplied by λm is therefore of order (H1/H0)m.
This λ is purely a bookkeeping device: we will set it equal to 1 when we are through! It’s just there to keep track of the orders of magnitudes of the various terms.
Putting the series expansions for |n⟩, En in (H0+λH1)|n⟩=En|n⟩
we have
(H0+λH1)(|n0⟩+λ|n1⟩+λ2|n2⟩+…)=(E0n+λE1n+λ2E2n+…)(|n0⟩+λ|n1⟩+λ2|n2⟩+…).
We’re now ready to match the two sides term by term in powers of λ.
Zeroth Order of Energy
The zeroth-order term (λ=0), of course, just gives back
H0|n0⟩=E0n|n0⟩
Zeroth Order of Eigenstates
|n0⟩=|n0⟩
First-Order of Energy
Matching the terms linear in λ on both sides: H0|n1⟩+H1|n0⟩=E0n|n1⟩+E1n|n0⟩.
This equation is the key to finding the first-order change in energy E1n. Taking the inner product of both sides with ⟨n0| : ⟨n0|H0|n1⟩+⟨n0|H1|n0⟩=⟨n0|E0n|n1⟩+⟨n0|E1n|n0⟩,
then using ⟨n0|H0=⟨n0|E0n, and ⟨n0|n0⟩=1, we find E1n=⟨n0|H1|n0⟩.
Taking now λ=1, we have established that the first-order change in the energy of a state resulting from adding a perturbing term H1 to the Hamiltonian is just the expectation value of H1 in that state.
For example, we can estimate the ground state energy of the helium atom by treating the electrostatic repulsion between the electrons as a perturbation. The zeroth-order ground state has the two (opposite spin) electrons in the ground state hydrogen-atom wavefunction (scaled for the doubling of nuclear charge). The first-order energy correction E10 is then given by computing the expectation value ⟨e2/r12⟩ for this ground state wavefunction.
First-Order of Eigenstates
The general expression for the first-order change in the wavefunction is found by taking the inner product of the first-order equation with the bra ⟨m0|, m≠n, ⟨m0|H0|n1⟩+⟨m0|H1|n0⟩=⟨m0|E0n|n1⟩+⟨m0|E1n|n0⟩.
The last term is zero, since ⟨m0|n0⟩=0, and in the first term ⟨m0|H0=⟨m0|E0m, so ⟨m0|n1⟩=⟨m0|H1|n0⟩E0n−E0m
and therefore the wavefunction correct to first order is:
|n⟩=|n0⟩+|n1⟩=|n0⟩+∑m≠n|m0⟩⟨m0|H1|n0⟩E0n−E0m.
Second-Order of Energy
To find the second-order correction to the energy, it is necessary to match the second-order terms in
(H0+λH1)(|n0⟩+λ|n1⟩+λ2|n2⟩+…)=(E0n+λE1n+λ2E2n+…)(|n0⟩+λ|n1⟩+λ2|n2⟩+…)
giving:
H0|n2⟩+H1|n1⟩=E0n|n2⟩+E1n|n1⟩+E2n|n0⟩.
Taking the inner product with ⟨n0| yields:
⟨n0|H0|n2⟩+⟨n0|H1|n1⟩=E0n⟨n0|n2⟩+E1n⟨n0|n1⟩+E2n⟨n0|n0⟩.
The leading terms on the two sides cancel as before. What about the term E1n⟨n0|n1⟩? Since |n⟩=|n0⟩+|n1⟩, and both |n⟩, |n0⟩ are normalized, ⟨n0|n1⟩+⟨n1|n0⟩=0 in leading order—that is to say, ⟨n0|n1⟩ is pure imaginary. That just means that if to this order |n⟩ has a component parallel to |n0⟩, that component has a small pure imaginary amplitude, and |n⟩ can be written (to this order) as |n⟩=eiα|n0⟩+kets⊥|n0⟩, with α small. But the phase factor can be eliminated by redefining the phase of |n⟩, and with that redefinition |n1⟩ has no component in the |n0⟩ direction, we can therefore drop the term E1n⟨n0|n1⟩.
So the second-order correction to the energy is:
E2n=⟨n0|H1|n1⟩=⟨n0|H1∑m≠n|m0⟩⟨m0|H1|n0⟩E0n−E0m=∑m≠n|⟨m0|H1|n0⟩|2E0n−E0m.
Selection Rules
Perturbation theory involves evaluating matrix elements of operators. Very often, many of the matrix elements in a sum are zero—obvious tests are parity and the Wigner-Eckart theorem. These are examples of selection rules: tests to find if a matrix element may be nonzero.
The Quadratic Stark Effect
When a hydrogen atom in its ground state is placed in an electric field, the electron cloud and the proton are pulled different ways, an electric dipole forms, and the overall energy is lowered. The perturbing Hamiltonian from the electric field is
H1=e Ez=eErcosθ
where E is the electric field strength, the field is in the z- direction, the electron charge e is negative.
We shall denote the unperturbed eigenenergies of the hydrogen atom by En=Enlm=−1/n2, so in particular we denote the ground state energy by E1.
The first-order correction to the ground state energy E11=⟨100|eEz|100⟩, where
|100⟩≡ψ100(r)=(1πa30)1/2e−r/a0.
This first-order term is zero since there are equal contributions from positive and negative z.
The second-order term is E21=∑n≠1;l,m|⟨nlm|eEz|100⟩|2E1−En
where we are now using |nlm⟩ to denote the unperturbed hydrogen atom wavefunctions, and here the En=−1/n2 (in Rydbergs) are the unperturbed energies.
Most of the terms in this infinite series are zero—the selection rules help get rid of them as follows: since eEz is the m=0 component of a spherical vector and |100⟩ is a zero angular momentum state, it follows from the Wigner-Eckart theorem that ⟨nlm| can only be ⟨n10|. This reduces the second-order sum over states to: E21=∑n≠1|⟨n10|eEz|100⟩|2E1−En.
This is still not easy to evaluate, but an upper bound can be found by observing that |E1−En|≥|E1−E2|, so |E20|<1E2−E1∑n≠1|⟨n10|eEz|100⟩|2=1E2−E1∑n≠1;l,m⟨100|eEz|nlm⟩⟨nlm|eEz|100⟩
where we have temporarily restored the full sum over n,l,m that is, we’ve put back all the zero terms. The reason for this seeming backward step is that, having taken the energy-difference denominator outside the sum, we can even include |100⟩ in the |nlm⟩ sum (it’s another zero term) and in fact we can even include the plane-wave (ionized) states as well as the bound states, since the plane waves all have energy greater than zero. At this point, the ∑n,l,m sum becomes a sum over all states, and therefore just becomes the unit operator, ∑n,l,m|nlm⟩⟨nlm|=I,
so |E21|<1E2−E1⟨100|(eEz)2|100⟩.
For the ground state hydrogen wavefunction, ⟨100|z2|100⟩=a20,E1=−e2/2a0, E2=E1/4, so |E21|<1(34e2/2a0)(eE)2a20=83E2a30.
Furthermore, since all the terms in the series for E21 are negative, the first term sets a lower bound on |E21|:
|E21|>|⟨210|eEz|100⟩|2E1−E2.
This can be evaluated in straightforward fashion to find |E21|>0.55×83E2a30.
So, even though we have not actually evaluated the second-order correction to the energy explicitly, we have it bracketed between two values, the lower one being more than half the upper one. Other ingenious methods have been developed (see Shankar or Sakurai) to find that the true answer is |E21|=94E2a30, but in fact the whole problem can be solved exactly using parabolic coordinates.
Degenerate Perturbation Theory: Distorted 2-D Harmonic Oscillator
The above analysis works fine as long as the successive terms in the perturbation theory form a convergent series. A necessary condition is that the matrix elements of the perturbing Hamiltonian must be smaller than the corresponding energy level differences of the original Hamiltonian. If H0 has different states with the same energy, in other words degenerate energy levels, and the perturbation has nonzero matrix elements between these degenerate levels, then obviously the theory breaks down. To see just how it breaks down, and how to fix it, we consider the two-dimensional simple harmonic oscillator:
H0=p2x+p2y2m+12mω2(x2+y2).
Recall that for the one-dimensional simple harmonic oscillator the ground state wavefunction is
|0⟩=(mωπℏ)1/4e−mωx2/2ℏ=(mωπℏ)1/4e−ξ2/2withξ=√mωℏx,and|1⟩=(mωπℏ)1/4√2ξe−ξ2/2.
The two-dimensional oscillator is simply a product of two one-dimensional oscillators, so, writing η=√mωℏy, the ground state is |0⟩=(mωπℏ)1/2e−(ξ2+η2)/2, and the two (degenerate) next states, energy ℏω above the ground state, are
|1,0⟩=(mωπℏ)1/2√2ξe−(ξ2+η2)/2,|0,1⟩=(mωπℏ)1/2√2ηe−(ξ2+η2)/2.
Suppose now we add a small perturbation
H1=αmω2xy,
with α a small parameter.
Notice that
⟨0|H1|0⟩=⟨1,0|H1|1,0⟩=⟨0,1|H1|0,1⟩=0
so according to naïve perturbation theory, there is no first-order correction to the energies of these states.
However, on going to second-order in the energy correction, the theory breaks down. The matrix element ⟨1,0|H1|0,1⟩ is nonzero, but the two states |0,1⟩, |1,0⟩ have the same energy! This gives an infinite term in the series for E2n.
Yet we know that a small term of this type will not wreck a two-dimensional simple harmonic oscillator, so what is wrong with our approach? It is helpful to plot the original harmonic oscillator potential 12mω2(x2+y2) together with the perturbing potential αmω2xy. The first of course has circular symmetry, the second has axes in the directions x=±y, climbing most steeply from the origin along x=y, falling most rapidly in the directions x=−y. If we combine the two potentials into a single quadratic form, the original circles of constant potential become ellipses, with their axes aligned along x=±y.
The problem arises even in the classical two-dimensional oscillator: picture a ball rolling backwards and forwards in a smooth saucer, a circular bowl. Now imagine the saucer is made slightly elliptical. The ball will still roll backwards and forwards through the center if it is released along one of the axes of the ellipse, although with different periods, as the axes differ in steepness. However, if it is released at a point off the axes, it will describe a complex path resolvable into components in the two axis directions having different periods.
For the quantum oscillator as for the classical one, as soon as the perturbation is introduced, the eigenkets are in the direction of the new elliptic axes. This is a large change from the original x and y axes, and definitely not proportional to the small parameter α. But the original unperturbed problem had circular symmetry, and there was no particular reason to choose the x and y axes as we did. If we had instead chosen as our original axes the lines x=±y, the kets would not have undergone large changes on switching on the perturbation.
The resolution of the problem is now clear: before switching on the perturbation, choose a set of basis kets in a degenerate subspace such that the perturbation is diagonal in that subspace.
In fact, for the simple harmonic oscillator example above, the problem can be solved exactly: 12mω2(x2+y2)+αmω2xy=12mω2[(1+α)(x+y√2)2+(1−α)(x−y√2)2]
and it is clear that, despite the results of naïve first-order theory, there is indeed a first order shift in the energy levels, ℏω→ℏω√1±α≈ℏω(1±α/2).
The Linear Stark Effect
The hydrogen atom, like the two-dimensional harmonic oscillator discussed above, has a nondegenerate ground state but degeneracy in its lowest excited states. Specifically, there are four n=2 states, all having energy -1/4 Ryd : ψ200(r)=(132πa30)1/2(2−ra0)e−r/2a0,ψ210(r,θ,ϕ)=(132πa30)1/2(ra0)e−r/2a0cosθ,ψ21±1(r,θ,ϕ)=(132πa30)1/2(ra0)e−r/2a0sinθe±iϕ.
Perturbing this system with an electric field in the z- direction, H1=eEz=eErcosθ, note first that naïve perturbation theory predicts no first-order shift in any of these energy levels. However, to second order, there is a nonzero matrix element between two degenerate levels ⟨200|H1|210⟩. All the other matrix elements between these basis kets in the four-dimensional degenerate subspace are zero, so the only diagonalization necessary is within thetwo-dimensional degenerate subspace spanned by |200⟩, |210⟩, where H1=(0ΔΔ0)
with
Δ=⟨200|H1|210⟩=eE(132πa30)∫∞0(2−ra0)(rcosθa0)2e−r/a0r2drsinθdθdϕ=−3eEa0.
Diagonalizing H1 within this subspace, then, the new basis states are (|200⟩±|210⟩)/√2 with energy shifts ±Δ, linear in the perturbing electric field.
The states |21±1⟩ are not changed by the presence of the field to this approximation, so the complete energy map of the n=2 states in the electric field has two states at the original energy of -1/4Ryd, one state moved up from that energy by Δ, and one down by Δ.
Notice that the new eigenstates (|200⟩±|210⟩)/√2 are not eigenstates of the parity operator—a sketch of their wavefunctions reveals that in fact they have nonvanishing electric dipole moment →μ, indeed this is the reason for the energy shift, ±Δ=∓3eEa0=∓→μ⋅→E.