7.2: Spin
( \newcommand{\kernel}{\mathrm{null}\,}\)
For orbital angular momentum we found that 2l+1 must be an integer, and moreover the spatial properties of the wave function force l to be an integer as well. However, we can also construct states with half-integer l, but this must then be an internal degree of freedom. This is called spin angular momentum, or spin for short. We will show later in the course that the spin observable is interpreted as an intrinsic magnetic moment of a system.
To describe spin, we switch from L to S, which is no longer related to r and p. The commutation relations between the components Si are the same as for Li,
[Si,Sj]=iℏϵijkSk
so S and L obey the same algebra. The commutation relations between S and L, r, and p vanish:
[Si,Lj]=[Si,rj]=[Si,pj]=0.
Therefore, spin generates a whole new vector space, since it commutes with observables that themselves do not commute (like rj and pj), and it is independent of the spatial degrees of freedom.
Since the commutation relations for S (its algebra) are the same as for L, we can immediately copy the algebraic structure of the eigenstates and eigenvalues:
Sz|s,ms⟩=msℏ|s,ms⟩, with s=0,12,1,32,2,…S2|s,ms⟩=s(s+1)ℏ2|s,ms⟩.
When s=12, the system has two levels (a qubit) with spin eigenstates |12,12⟩ and |12,−12⟩. We often write ms=+12=↑ (“up”) and ms=−12=↓ (“down”), which finds its origin in the measurement outcomes of electron spin in a Stern-Gerlach apparatus.
Now that we have introduced a whole new vector space related to spin, how do we write the wave function of a particle with spin? Without spin, the wave function is a normal single-valued function ψ(r,t)=⟨r∣ψ(t)⟩ of space and time coordinates. Now we have to add the spin degree of freedom. For each spin (↑ or ↓ when s=12), we have a wave function ψ↑(r,t) for the particle with spin up, and ψ↓(r,t) for the particle with spin down. We can write this as a vector:
ψ(r,t)=(ψ↑(r,t)ψ↓(r,t)).
The spin degree of freedom generates a vector space, after all. The vector ψ is called a spinor.
Expectation values are evaluated in the usual way, but now we have to sum over the spin degree of freedom, as well as integrate over space. For example, the probability of finding a particle with spin up in a region Ω of space is given by
p(↑,Ω)=∑ms=↑,↓∫Ωdrδms,↑|ψms(r,t)|2=∫Ωdr|ψ↑(r,t)|2,
and the expectation value of finding a particle with any spin in a region Ω of space is given by
p(Ω)=∑ms=↑,↓∫Ωdr|ψms(r,t)|2.
The normalization of the spinor is such that
∑ms=↑,↓∫Vdr|ψms(r,t)|2=1,
where V is the entire space available to the particle (this may be the entire universe, or the volume of a box with impenetrable walls, etc.).
If spin is represented by (2s+1)-dimensional spinors (vectors), then spin transformations (operators) are represented by (2s+1)×(2s+1) matrices. In the two-dimensional case, we have by construction:
Sz|↑⟩=ℏ2(10) and Sz|↓⟩=−ℏ2(01),
which means that the matrix representation of Sz is given by
Sz=ℏ2(100−1).
Next, the ladder operators act according to
S+|↑⟩=0,S+|↓⟩=ℏ|↑⟩,S−|↑⟩=ℏ|↓⟩,S−|↓⟩=0,
which leads to the matrix representation
S+=ℏ(0100) and S−=ℏ(0010).
From S±=Sx±iSy we can then deduce that
Sx=ℏ2(0110) and Sy=ℏ2(0−ii0).
We often define Si≡12ℏσi, where σi are the so-called Pauli matrices. Previously, we have called these matrices X, Y, and Z. The commutation relations of the Pauli matrices are
[σi,σj]=2iϵijkσk or [σi2,σj2]=iϵijkσk2.
Other important properties of the Pauli matrices are
{σi,σj}≡σiσj+σjσi=2δijI (anti-commutator).
They are both Hermitian and unitary, and the square of the Pauli matrices is the identity: σ2i=I. Moreover, they obey an “orthogonality” relation
12Tr(σiσj)=δij.
The proof of this statement is as follows:
σiσj=σiσj+σjσi−σjσi={σi,σj}−σjσi=2δIij−σjσi.
Taking the trace then yields
Tr(2δijI−σjσi)=Tr(σiσj)=Tr(σjσi),
or (using Tr(I)=2)
2Tr(σjσi)=4δij,
which proves Eq. (7.38). If we define σ0≡I, we can extend this proof to the four-dimensional case
12Tr(σμσv)=δμv
with μ,v=0,1,2,3.

We can then write any 2×2 matrix as a sum over the two-dimensional Pauli operators:
A=∑μaμσμ,
since
\(\frac{1}{2} \operatorname{Tr}\left(A \sigma_{v}\right)=\frac{1}{2} \operatorname{Tr}\left(\sum_{\mu} a_{\mu} \sigma_{\mu} \sigma_{v}\right)=\frac{1}{2} \sum_{\mu} a_{\mu} \operatorname{Tr}\left(\sigma_{\mu} \sigma_{v}\right)=\sum_{\mu} a_{\mu} \delta_{\mu v}=a_{v}\tag{7.44}\]
The Pauli matrices and the identity matrix form a basis for the 2×2 matrices, and we can write
A=a0I+a⋅σ=(a0+azax−iayax+iaya0−az)
where we used the notation σ=(σx,σy,σz).