Skip to main content
Physics LibreTexts

5.1: The Difference Between Relativistic and Non-Relativistic Quantum Mechanics

  • Page ID
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)

    One of the key points in particles physics is that special relativity plays a key role. As you all know, in ordinary quantum mechanics we ignore relativity. Of course people attempted to generate equations for relativistic theories soon after Schrödinger wrote down his equation. There are two such equations, one called the Klein-Gordon and the other one called the Dirac equation.

    The structure of the ordinary Schrödinger equation of a free particle (no potential) suggests what to do. We can write this equation as

    \[\hat H\psi = \frac{1}{2m}{\vec{p}}^2 \psi = i \hbar \frac{\partial}{\partial t} \psi.\]

    This is clearly a statement of the non-relativistic energy-momentum relation, \(E=\dfrac{1}{2} m v^2\), since a time derivative on a plane wave brings down a factor energy. Remember, however, that \({\vec{p}}\) as an operator also contains derivatives, \[{\vec{p}} = \frac{\hbar}{i} {\vec{\nabla}}.\] A natural extension would to use the relativistic energy expression,

    \[\hat H\psi = \sqrt{m^2c^4+{\vec{p}}^2c^2}\; \psi = i \hbar \frac{\partial}{\partial t} \psi. \label{EQ3}\]

    This is a nonsensical equation, unless we specify how to take the square root of the operator. The first attempt to circumvent this problem, by Klein and Gordon, was to take the square of Equation \ref{EQ3} to generate the Klein-Gordon Equation (Equation \ref{KG}):

    Klein-Gordon Equation

    \[\left({m^2c^4+{\vec{p}}^2c^2}\right) \psi = - \hbar^2 \frac{\partial^2}{\partial t^2}\psi . \label{KG}\]

    This is an excellent equation for spin-less particles or spin one particles (bosons), but not to describe fermions (half-integer spin), since there is no information about spin is in this equation. This needs careful consideration, since spin must be an intrinsic part of a relativistic equation!

    Dirac realized that there was a way to define the square root of the operator. The trick he used was to define four matrices \(\alpha\), \(\beta\) that each have the property that their square is one, and that they anticommute,

    \[\begin{align} {2} \alpha_i \alpha_i &= I, &\quad \\[4pt] \beta \beta &= I \\[4pt] \alpha_i \beta +\beta \alpha_i &= 0 \\[4pt] \alpha_i \alpha_j +\alpha_j \alpha_i &= 0\quad\text{$i \neq j$}.\end{align}\]

    This then leads to an equation that is linear in the momenta – and very well behaved:

    Dirac Equation

    \[(\beta m c^2 + c {\vec{\alpha}}\cdot {\vec{p}})\Psi = i \hbar \frac{\partial}{\partial t} \Psi\]

    Note that the minimum dimension for the matrices in which we can satisfy all conditions is \(4\), and thus \(\Psi\) is a four-vector! This is closely related to the fact that these particles have spin.

    Let us investigate this equation a bit further. One of the possible forms of \(\alpha_i\) and \(\beta\) is

    \[\alpha_i = \begin{pmatrix} 0 & \sigma_i\\\sigma_i & 0 \end{pmatrix},\quad \beta = \begin{pmatrix} I & 0\\ 0 & -I \end{pmatrix},\]

    where \(\sigma_i\) are the two-by-two Pauli spin matrices

    \[\sigma_1 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix},\quad \sigma_2 = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix},\quad \sigma_3 = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}.\]

    (These matrices satisfy some very interesting relations. For instance

    \[\sigma_1 \sigma_2 = i\sigma_3,\quad \sigma_2 \sigma_1 = -i\sigma_3,\quad \sigma_2 \sigma_3 = i \sigma_1,\] etc. Furthermore \(\sigma_i^2=1\).)

    Once we know the matrices, we can try to study a plane-wave solution (i.e., free particle):

    \[\Psi({\vec{x}},t) = u({\vec{p}}) e^{i({\vec{p}} \cdot {\vec{x}} - Et)/\hbar}.\]

    (Note that the exponent is a “Lorentz scalar”, it is independent of the Lorentz frame!).

    If substitute this solution we find that \(u({\vec{p}})\) satisfies the eigenvalue equation

    \[\begin{pmatrix} m c^2 & 0 & p_3c & p_1c-i p_2 c\\ 0 & m c^2 & p_1c+ip_2c & -p_3c \\ p_3c & p_1c -ip_2c & -m c^2 & 0 \\ p_1c+ip_2c & -p_3c & 0 & - mc^2 \end{pmatrix} \begin{pmatrix} u_1 \\ u_2 \\ u_3 \\ u_4 \end{pmatrix} = E \begin{pmatrix} u_1 \\ u_2 \\ u_3 \\ u_4 \end{pmatrix}.\]

    The eigenvalue problem can be solved easily, and we find the eigenvalue equation

    \[(m^2c^4 +p^2 c^2 -E^2)^2 = 0\]

    which has the solutions \(E=\pm \sqrt{m^2c^4+p^2 c^2}\). The eigenvectors for the positive eigenvalues are

    \[\begin{pmatrix} 1 \\ 0 \\ p_3c/(E+mc^2) \\ ( p_1c-ip_2c)/(E+mc^2) \end{pmatrix} \text{, and } \begin{pmatrix} 0 \\ 1 \\ ( p_1c+ip_2c)/(E+mc^2)\\-p_3c/(E+mc^2) \end{pmatrix},\]

    with similar expressions for the two eigenvectors for the negative energy solutions. In the limit of small momentum the positive-energy eigenvectors become

    \[\begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \end{pmatrix}\text{, and } \begin{pmatrix} 0 \\ 1 \\ 0 \\ 0 \end{pmatrix},\]

    and seem to denote a particle with spin up and down. We shall show that the other two solutions are related to the occurrence of anti-particles (positrons).

    Just as photons are the best way to analyze (decompose) the electro-magnetic field, electrons and positrons are the natural way way to decompose the Dirac field that is the general solution of the Dirac equation. This analysis of a solution in terms of the particles it contains is called (incorrectly, for historical reasons) “second quantisation”, and just means that there is a natural basis in which we can say there is a state at energy \(E\), which is either full or empty. This could more correctly be referred to as the “occupation number representation” which should be familiar from condensed matter physics. This helps us to see how a particle can be described by these wave equations. There is a remaining problem, however!

    This page titled 5.1: The Difference Between Relativistic and Non-Relativistic Quantum Mechanics is shared under a CC BY-NC-SA 2.0 license and was authored, remixed, and/or curated by Niels Walet via source content that was edited to the style and standards of the LibreTexts platform.