After this particular (and hopefully inspiring) example, let us discuss the general relation between the Dirac formalism and experiment in more detail. The expectation value of an observable over any statistical ensemble (not necessarily coherent) may be always calculated using the general statistical rule (1.37). For the particular case of a coherent superposition (118), we can combine that rule with Eq. (120) and the second of Eqs. (118): ⟨A⟩=∑jAjWj=∑jα∗jAjαj=∑j⟨α∣aj⟩Aj⟨aj∣α⟩≡⟨α|(∑j|aj⟩Aj⟨aj|)|α⟩.
Now using Eq. (59) for the particular case of the eigenstate basis {a}, for which Eq. (98) is valid, we arrive at a very simple and important formula 26 ⟨A⟩=⟨α|ˆA|α⟩.
This is a clear analog of the wave-mechanics formula (1.23) - and as we will see soon, may be used to derive it. A big advantage of Eq. (125) is that it does not explicitly involve the eigenvector set of the corresponding operator, and allows the calculation to be performed in any convenient basis. 27
For example, let us consider an arbitrary coherent state α of spin- 1/2,28 and calculate the expectation values of its components. The calculations are easiest in the z-basis because we know the matrix elements of the spin operator components in that basis. Representing the ket- and bra-vectors of the given state as linear superpositions of the corresponding vectors of the basis states ↑ and ↓, |α⟩=α↑|↑⟩+α↓|↓⟩,⟨α|=⟨↑|α∗↑+⟨↓|α∗↓.
and plugging these expressions to Eq. (125) written for the observable Sz, we get
⟨Sz⟩=(⟨↑|α∗↑+⟨↓|α∗↓)ˆSz(α↑|↑⟩+α↓|↓⟩)=α↑α∗↑⟨↑|ˆSz|↑⟩+α↓α∗↓⟨↓|ˆSz|↓⟩+α↑α∗↓⟨↓|ˆSz|↑⟩+α↓α∗↑⟨↑|ˆSz|↓⟩
Now there are two equivalent ways (both very simple) to calculate the long brackets in this expression. The first one is to represent each of them in the matrix form in the z-basis, in which the braand ket-vectors of states ↑ and ↓ are the matrix-rows (1,0) and (0,1), or similar matrix-columns - the exercise highly recommended to the reader. Another (perhaps more elegant) way is to use the general Eq. (59), in the z-basis, together with the spin-1/2-specific Eqs. (116a) and (105) to write ˆSx=ℏ2(|↑⟩⟨↓|+|↓⟩⟨↑|),ˆSy=−iℏ2(|↑⟩⟨↓|−|↓⟩⟨↑|),ˆSz=ℏ2(|↑⟩⟨↑|−|↓⟩⟨↓|)
For our particular calculation, we may plug the last of these expressions into Eq. (127), and use the orthonormality conditions (38): ⟨↑∣↑⟩=⟨↓∣↓⟩=1,⟨↑∣↓⟩=⟨↓∣↑⟩=0.
Both approaches give (of course) the same result: ⟨Sz⟩=ℏ2(α↑α∗↑−α↓α∗↓).
This particular result might be also obtained using Eq. (120) for the probabilities W↑=α↑α↑∗ and W↓=α↓α↓∗, namely: ⟨Sz⟩=W↑(+ℏ2)+W↓(−ℏ2)=α↑α∗↑(+ℏ2)+α↓α∗↓(−ℏ2).
The formal way (127), based on using Eq. (125), has, however, an advantage of being applicable, without any change, to finding the observables whose operators are not diagonal in the z-basis, as well. In particular, absolutely similar calculations give ⟨Sx⟩=α↑α∗↑⟨↑|ˆSx|↑⟩+α↓α∗↓⟨↓|ˆSx|↓⟩+α↑α∗↓⟨↓|ˆSx|↑⟩+α↓α∗↑⟨↑|ˆSx|↓⟩=ℏ2(α↑α∗↓+α↓α∗↑),⟨Sy⟩=α↑α∗↑⟨↑|ˆSy|↑⟩+α↓α∗↓⟨↓|ˆSy|↓⟩+α↑α∗↓⟨↓|ˆSy|↑⟩+α↓α∗↑⟨↑|ˆSy|↓⟩=iℏ2(α↑α∗↓−α↓α∗↑),
Let us have a good look at a particular spin state, for example the spin-up state ↑. According to Eq. (126), in this state α↑=1 and α↓=0, so that Eqs. (130)-(133) yield: ⟨Sz⟩=ℏ2,⟨Sx⟩=⟨Sy⟩=0.
Now let us use the same Eq. (125) to calculate the spin component uncertainties. According to Eqs. (105) and (116)-(117), the operator of each spin component squared is equal to (ℏ/2)2ˆI, so that the general Eq. (1.33) yields (δSz)2=⟨S2z⟩−⟨Sz⟩2=⟨↑|ˆS2z|↑⟩−(ℏ2)2=(ℏ2)2⟨↑|ˆI|↑⟩−(ℏ2)2=0(δSx)2=⟨S2x⟩−⟨Sx⟩2=⟨↑|ˆS2x|↑⟩−0=(ℏ2)2⟨↑|ˆI|↑⟩=(ℏ2)2(δSy)2=⟨S2y⟩−⟨Sy⟩2=⟨↑|ˆS2y|↑⟩−0=(ℏ2)2⟨↑|ˆI|↑⟩=(ℏ2)2
While Eqs. (134) and (135a) are compatible with the classical notion of the angular momentum of magnitude ℏ/2 being directed exactly along the z-axis, this correspondence should not be overstretched, because such classical picture cannot explain Eqs. (135b) and (135c). The best (but still imprecise!) classical image I can offer is the spin vector S oriented, on average, in the z-direction, but still having its x - and y-components strongly "wobbling" (fluctuating) about their zero average values.
It is straightforward to verify that in the x-polarized and y-polarized states the situation is similar, with the corresponding change of axis indices. Thus, in neither of these states all three spin components have definite values. Let me show that this is not just an occasional fact, but reflects the perhaps most profound property of quantum mechanics, the uncertainty relations. For that, let us consider two measurable observables, A and B, of the same quantum system. There are two possibilities here. If the operators corresponding to these observables commute, [ˆA,ˆB]=0
then all matrix elements of the commutator in any orthogonal basis (in particular, in the basis of eigenstates aj of the operator ˆA ) have to equal zero: ⟨aj|[ˆA,ˆB]|aj′⟩≡⟨aj|ˆAˆB|aj′⟩−⟨aj|ˆBˆA|aj′⟩=0.
In the first bracket of the middle expression, let us act by the (Hermitian!) operator ˆA on the bra-vector, while in the second one, on the ket-vector. According to Eq. (68), such action turns the operators into the corresponding eigenvalues, which may be taken out of the long brackets, so that we get Aj⟨aj|ˆB|aj′⟩−Aj′⟨aj|ˆB|aj′⟩≡(Aj−Aj′)⟨aj|ˆB|aj′⟩=0.
This means that if all eigenstates of operator ˆA are non-degenerate (i.e. Aj≠Aj, if j≠j ’), the matrix of operator ˆB has to be diagonal in the basis {a}, i.e., the eigenstate sets of the operators ˆA and ˆB coincide. Such pairs of observables (and their operators) that share their eigenstates, are called compatible. For example, in the wave mechanics of a particle, its momentum (1.26) and kinetic energy (1.27) are compatible, sharing their eigenfunctions (1.29). Now we see that this is not occasional, because each Cartesian component of the kinetic energy is proportional to the square of the corresponding component of the momentum, and any operator commutes with an arbitrary integer power of itself: [ˆA,ˆAn]≡[ˆA,ˆAˆA…ˆA⏟n]=ˆAˆAˆA…ˆA⏟n−ˆAˆA…ˆA⏟nˆA=0.
Now, what if operators ˆA and ˆB do not commute? Then the following general uncertainty relation is valid: δAδB≥12|⟨[ˆA,ˆB]⟩|,
where all expectation values are for the same but arbitrary state of the system. The proof of Eq. (140) may be divided into two steps, the first one proving the so-called Schwartz inequality for any two possible states, say α and β:29 ⟨α∣α⟩⟨β∣β⟩≥|⟨α∣β⟩|2
Its proof may be readily achieved by applying the postulate (16) - that the norm of any legitimate state of the system cannot be negative - to the state with the following ket-vector: |δ⟩≡|α⟩−⟨β∣α⟩⟨β∣β⟩|β⟩
where α and β are possible, non-null states of the system, so that the denominator in Eq. (142) is not equal to zero. For this case, Eq. (16) gives (⟨α|−⟨α∣β⟩⟨β∣β⟩⟨β|)(|α⟩−⟨β∣α⟩⟨β∣β⟩|β⟩)≥0
Opening the parentheses, we get ⟨α∣α⟩−⟨α∣β⟩⟨β∣β⟩⟨β∣α⟩−⟨β∣α⟩⟨β∣β⟩⟨α∣β⟩+⟨α∣β⟩⟨β∣α⟩⟨β∣β⟩2⟨β∣β⟩≥0
After the cancellation of one inner product ⟨β∣β⟩ in the numerator and the denominator of the last term, it cancels with the 2nd (or the 3rd ) term. What remains is the Schwartz inequality (141).
Now let us apply this inequality to states |α⟩≡ˆ˜A|γ⟩ and |β⟩≡ˆ˜B|γ⟩,
where, in both relations, γ is the same (but otherwise arbitrary) possible state of the system, and the deviation operators are defined similarly to the deviations of the observables (see Sec. 1.2): ˆ˜A≡ˆA−⟨A⟩,˜ˆB≡ˆB−⟨B⟩.
With this substitution, and taking into account again that the observable operators ˆA and ˆB are Hermitian, Eq. (141) yields ⟨γ|ˆ˜A2|γ⟩⟨γ|ˆ˜B2|γ⟩≥|⟨γ|ˆ˜Aˆ˜B|γ⟩|2.
Since the state γ is arbitrary, we may use Eq. (125) to rewrite this relation as an operator inequality:
δAδB≥|⟨ˆ˜Aˆ˜B⟩|.
Actually, this is already an uncertainty relation, even "better" (stronger) than its standard form (140); moreover, it is more convenient in some cases. To prove Eq. (140), we need a couple of more steps. First, let us notice that the operator product participating in Eq. (148) may be recast as ˆ˜Aˆ˜B=12{ˆ˜A,ˆ˜B}−i2ˆC, where ˆC≡i[ˆ˜A,ˆ˜B].
Any anticommutator of Hermitian operators, including that in Eq. (149), is a Hermitian operator, and its eigenvalues are purely real, so that its expectation value (in any state) is also purely real. On the other hand, the commutator part of Eq. (149) is just ˆC≡i[ˆ˜A,ˆ˜B]≡i(ˆA−⟨A⟩)(ˆB−⟨B⟩)−i(ˆB−⟨B⟩)(ˆA−⟨A⟩)≡i(ˆAˆB−ˆBˆA)≡i[ˆA,ˆB].
Second, according to Eqs. (52) and (65), the Hermitian conjugate of any product of the Hermitian operators ˆA and ˆB is just the product of these operators swapped. Using the fact, we may write ˆC†=(i[ˆA,ˆB])†=−i(ˆAˆB)†+i(ˆBˆA)†=−iˆBˆA+iˆAˆB=i[ˆA,ˆB]=ˆC,
so that the operator ˆC is also Hermitian, i.e. its eigenvalues are also real, and thus its expectation value is purely real as well. As a result, the square of the expectation value of the operator product (149) may be represented as ⟨ˆ˜Aˆ˜B⟩2=⟨12{ˆ˜A,ˆ˜B}⟩2+⟨12ˆC⟩2.
Since the first term on the right-hand side of this equality cannot be negative, we may write ⟨ˆ˜Aˆ˜B⟩2≥⟨12ˆC⟩2=⟨i2[ˆA,ˆB]⟩2,
and hence continue Eq. (148) as δAδB≥|⟨ˆ˜Aˆ˜B⟩|≥12|⟨[ˆA,ˆB]⟩|,
thus proving Eq. (140).
For the particular case of operators ˆx and ˆpx (or a similar pair of operators for another Cartesian coordinate), we may readily combine Eq. (140) with Eq. (2.14b) and to prove the original Heisenberg’s uncertainty relation (2.13). For the spin-1/2 operators defined by Eq. (116)-(117), it is very simple (and highly recommended to the reader) to show that [ˆσj,ˆσj′]=2iεjj′j′′ˆσj′′, i.e. [ˆSj,ˆSj′]=iεjj′j′ℏˆSj′′,
where εijj " is the Levi-Civita permutation symbol - see, e.g., MA Eq. (13.2). As a result, the uncertainty relations (140) for all Cartesian components of spin- −12 systems are similar, for example δSxδSy≥ℏ2|⟨Sz⟩|, etc.
In particular, as we already know, in the ↑ state the right-hand side of this relation equals (ℏ/2)2 >0, so that neither of the uncertainties δSx,δSy can equal zero. As a reminder, our direct calculation earlier in this section has shown that each of these uncertainties is equal to ℏ/2, i.e. their product is equal to the lowest value allowed by the uncertainty relation (156) - just as the Gaussian wave packets (2.16) provide the lowest possible value of the product δxδpx, allowed by the Heisenberg relation (2.13).
26 This equality reveals the full beauty of Dirac’s notation. Indeed, initially in this chapter the quantummechanical brackets just reminded the angular brackets used for the statistical averaging. Now we see that in this particular (but most important) case, the angular brackets of these two types may be indeed equal to each other!
27 Note also that Eq. (120) may be rewritten in a form similar to Eq. (125): Wj=⟨α|ˆΛj|α⟩, where ˆΛj is the operator (42) of the state’s projection upon the jth eigenstate aj.
28 For clarity, the noun "spin- 1/2 " is used, here and below, to denote the spin degree of freedom of a spin- 1/2 particle, independent of its orbital motion.
29 This inequality is the quantum-mechanical analog of the usual vector algebra’s result α2β2≥|α⋅β|2.