5.3: Operators and Observables
- Page ID
- 94124
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)Quantum State Information
Something we discussed only obliquely in an earlier section is the idea of a quantum state and the information contained within it. There are some very strange features associated with the concept of a quantum state. High among those it the fact that it is non-local. We are used to the classical notion that the mass, charge, and other features of a particle are located at the particle – we can literally point to the point to the position in space where these quantities can be found. But now we have to accept the fact that even the location of the particle itself is not something that is well-defined. The wave function of a particle exists everywhere in space at the same time, and it isn't until it interacts with a measuring device that its location is defined. To emphasize this point: It isn't that the particle is somewhere but we just don't know where (like the result of a coin flip still concealed by someone's hand), it actually not located anywhere until it is observed.
All of this mysterious mumbo-jumbo might tempt us to throw up our hands in despair that we can't do any of the predictive science that we've become accustomed to in classical physics, but the quantum state of a particle does contain useful information about it. Indeed, the theory claims that the quantum state contains all of the accessible information about the particle. Much of it is probabilistic, but this is still useful. We have already discussed a bit about how to extract this information from the quantum state – we take averages using the probability density. We have slightly oversimplified this process, but we will correct that now.
Expectation Value Computation
The key to pulling information from the quantum state is calculating expectation values. Even when we want to compute uncertainties, to do this we need to be able to compute averages. So far, we have seen that the method for doing this is to multiply the quantities measured by their associated probability densities, and integrate over all the possible values. For example, if we wish to calculate the average position:
\[\left<x\right>=\int\limits_{-\infty}^{+\infty}\mathcal P\left(x\right)~x~dx=\int\limits_{-\infty}^{+\infty}\left|\Psi\right|^2~x~dx=\int\limits_{-\infty}^{+\infty}\Psi^*\Psi~x~dx\]
If we instead which to calculate the average momentum, we can Fourier-transform the position wave function to get the momentum version, and use it in the integral along with a \(k\) and \(dk\) in place of \(x\) and \(dx\) (this will give an average wave number, which can then be multiplied by \(\hbar\) to get an average momentum). Notice that in the momentum case we can't use the usual \(\mathcal P\left(x\right)\), because the momentum values are not a function of \(x\). Lucky that we have the Fourier transform! But what if there are other observable quantities for which we wish to compute an average (energy, angular momentum, etc.)?
Quantum mechanics provides an alternate means that is totally equivalent to the one above for position and momentum, without the need for a Fourier transform, and which works for other quantities. What is more, this process for computing averages embodies the idea that measurements affect the very quantum state they seek to measure. We'll start with a basic description for how this works...
We begin with two things: The quantum state we are working with, and the observable whose the expectation value we wish to compute. We throw these both into a machine, which in turn spits out the expectation value:
Figure 5.3.1 – Expectation Value Machine
Well, this is just fine, but of course we need to peek behind the curtain to see precisely how this expectation machine functions. It works in a few steps:
- It invents an operator that belongs to the given observable. It is one of the postulates of quantum theory that every quantity that can be measured and is stored in a quantum state has an associated operator.
- The operator "acts upon" the state, changing it into a new state. This is the part of the process where the observation of a physical property of a particle alters the state of the particle being observed.
- The "overlap integral" of new state and the original state is computed. As we have said before, the integral of the product of two functions is like a dot product (e.g. odd functions are "orthogonal" to even functions, as their overlap integral is zero). So this overlap integral gives us a sense of how far the wave function has been altered from its original one.
Figure 5.3.2 – Machine Inner Workings
It's probably not immediately clear how this process gives us the average value we wish to compute, so let's look a bit closer.
The Position and Momentum Operators
Let's look first at the simple case of \(\left<x\right>\). In this case, the "new state" has a wave function for position that is simply the product of \(x\) and the previous wave function:
\[\Psi_{new}=x~\Psi~~~\Rightarrow~~~\left<x\right>=\int\limits_{-\infty}^{+\infty}\Psi^*\Psi_{new}dx=\int\limits_{-\infty}^{+\infty}\Psi^*\left(x~\Psi\right)dx=\int\limits_{-\infty}^{+\infty}x~\left|\Psi\right|^2dx=\int\limits_{-\infty}^{+\infty}x~\mathcal P\left(x\right)dx\]
If we now wish to do the same with momentum, it is not clear how the momentum operator creates a new quantum state from the old one, when we describe that quantum state in terms of position. We do know how it changes the quantum state when it is described in terms of momentum (or wave number) – it works the same way as \(x\) did:
\[\Phi_{new}=p~\Phi~~~\Rightarrow~~~\left<p\right>=\int\limits_{-\infty}^{+\infty}\Phi^*\Phi_{new}dk=\int\limits_{-\infty}^{+\infty}\Phi^*\left(p~\Phi\right)dk=\int\limits_{-\infty}^{+\infty}p~\left|\Phi\right|^2dk=\int\limits_{-\infty}^{+\infty}p~\mathcal P\left(k\right)dk\]
But now we are interested in how the momentum affects the quantum state when the wave function is viewed in terms of position. To do this, we turn to our "translation" device - the Fourier transform. Noting that \(\Phi_{new} = p\Phi\), we can do an inverse Fourier transform to get both the original wave function \(\Psi\) and the newly-altered function \(\Psi_{new}\):
\[\Psi=\int\limits_{-\infty}^{+\infty}\Phi~ e^{ikx}dk~,~~~~~ \Psi_{new}=\int\limits_{-\infty}^{+\infty}\Phi_{new}~e^{ikx}dk=\int\limits_{-\infty}^{+\infty}p~\Phi ~e^{ikx}dk=\int\limits_{-\infty}^{+\infty}\hbar k~\Phi ~e^{ikx}dk\]
Now we seek some operation we can perform on \(\Psi\) that can give us \(\Psi_{new}\). Without further ado, we declare that if we act on \(\Psi\) with the operation \(-i\hbar\frac{d}{dx}\), that will do the trick. The wave function \(\Phi\) is only a function of \(k\) (not \(x\)), so:
\[\Psi_{new}=-i\hbar\frac{d}{dx}\Psi = -i\hbar\frac{d}{dx}\int\limits_{-\infty}^{+\infty}\Phi ~e^{ikx}dk= -i\hbar\int\limits_{-\infty}^{+\infty}\Phi \frac{d}{dx}e^{ikx}dk=\int\limits_{-\infty}^{+\infty}\hbar k~\Phi ~e^{ikx}dk\]
To summarize, the \(x\)-direction momentum operator for use on wave functions expressed in terms of position is (when we eventually go beyond 1-dimension, this will become a partial derivative):
\[\widehat p_x = -i\hbar\frac{d}{dx}\]
The little "hat" above the \(p\) is a reminder to us that we are talking about an operator that changes a quantum state, and not just the value of momentum. What this operator actually is depends upon the type of wave function it is acting on. That is:
\[\widehat p_x~\psi\left(x\right) = -i\hbar\frac{d}{dx}\psi\left(x\right)~,~~~~~\widehat p_x~\phi\left(k\right) = \hbar k~\phi\left(k\right)\]
Similarly, the operator \(\widehat x\) is just the function \(f\left(x\right)=x\) when acting on a wave function expressed in terms of position, and will involve a derivative when acting on a wave function expressed in terms of wave number (it is left as an exercise o the reader to determine the operator \(\widehat x\) that acts on \(\phi\left(k\right)\)).
Going back to the original discussion of computing expectation values, we see that we have:
\[\left<p\right>=\int\limits_{-\infty}^{+\infty}\Psi^*\left(\widehat p~\Psi\right)dx=\int\limits_{-\infty}^{+\infty}\Psi^*\left(-i\hbar\frac{d}{dx}\Psi\right)dx\]
Building More Operators
We can build new operators for other physical observables from \(\widehat x\) and \(\widehat p\). Most notable among these is the kinetic energy operator:
\[\widehat{KE}=\frac{\widehat p^2}{2m}=\frac{1}{2m}\left(-i\hbar\frac{d}{dx}\right)\left(-i\hbar\frac{d}{dx}\right)=-\frac{\hbar^2}{2m}\frac{d^2}{dx^2}\]
This looks familiar! It is precisely what acts on the wave function in the Schrödinger equation, which we already said accounts for the particle's kinetic energy. Now we see the Schrödinger equation in a whole new light – as an equation that relates the effects of operators. The potential \(V\left(x\right)\) is just a function of \(x\), so it is an operator formed from \(\widehat x\). Together, the operators \(\widehat{KE}\) and \(\widehat {V\left(x\right)}\) account for the total energy, and as a shorthand we sometimes use:
\[\widehat H \equiv \widehat{KE}+\widehat {V\left(x\right)}\]
This "total energy operator" is commonly referred to as the Hamiltonian. Note that Schrödinger's equation states that the Hamiltonian's actions in on the wave function expressed in terms of position are equivalent to another operator's actions. The other operator (sometimes called the "total energy operator") is what we see on the right hand side of the Schrödinger equation:
\[\widehat E \equiv i\hbar\frac{\partial}{\partial t}\]
Uncertainty Principle
We have already seen that measurements of position and momentum are "incompatible" in that the measurement of one affects the measurement of the other – the more we take care to precisely one of them, the less able we are to measure the other. This comes through very clearly with this idea that operators change quantum states into new states. We would expect that the alteration of the state by one of these two operators will have an effect on the measurement of the expectation for the other, and it does. Suppose, for example, that for whatever reason, we wish to know the expectation value of the product of the position and momentum, \(xp\). We follow our "expectation machine" method, but since we now have two operators, we have to do them in sequence – first change the quantum state by one of the operators, and then by the other. If we use the momentum operator first, we get:
\[\Psi_{new}=\widehat x~\left(\widehat p~\Psi\right)=x~\left(-i\hbar\frac{d}{dx}\Psi\right)=-i\hbar x~\frac{d\Psi}{dx}\]
But if we perform the operation in the other order, we get a different result:
\[\Psi_{new}=\widehat p~\left(\widehat x~\Psi\right)=-i\hbar\frac{d}{dx}\left(x\Psi\right)=-i\hbar \left(\Psi+x~\frac{d\Psi}{dx}\right)\]
This effect of two operators "tripping over each other" is directly related to an uncertainty principle between those two operators – changing the state by one of them affects the measurement of the other. When two operators do not acheive the same result when performed in either order, we say that that do not commute with each other. Note that any function of \(\widehat x\) (like \(\widehat V\left(x\right)\)) will commute with any other function of \(\widehat x\), and any function of \(\widehat p\) (like \(\widehat{KE}\)) will commute with any other function of \(\widehat p\). So measuring the momentum will not have an effect on measuring the kinetic energy.