Skip to main content
Physics LibreTexts

1: A geometrical theory of spacetime

Chapter 1. A geometrical theory of spacetime

“I always get a slight brain-shiver, now [that] space and time appear conglomerated together in a gray, miserable chaos.” -- Sommerfeld

This is a book about general relativity, at a level that is meant to be accessible to advanced undergraduates.

This is mainly a book about general relativity, not special relativity. I've heard the sentiment expressed that books on special relativity generally do a lousy job on special relativity, compared to books on general relativity. This is undoubtedly true, for someone who already has already learned special relativity --- but wants to unlearn the parts that are completely wrong in the broader context of general relativity. For someone who has not already learned special relativity, I strongly recommend mastering it first, from a book such as Taylor and Wheeler's Spacetime Physics.

In the back of this book I've included excerpts from three papers by Einstein --- two on special relativity and one on general relativity. They can be read before, after, or along with this book. There are footnotes in the papers and in the main text linking their content with each other.

I should reveal at the outset that I am not a professional relativist. My field of research was nonrelativistic nuclear physics until I became a community college physics instructor. I can only hope that my pedagogical experience will compensate to some extent for my shallow background, and that readers who find mistakes will be kind enough to let me know about them using the contact information provided at


\[E=MC^2 \tag{Love}\]

1.1 Time and causality

Updating Plato's allegory of the cave, imagine two super-intelligent twins, Alice and Betty. They're raised entirely by a robotic tutor on a sealed space station, with no access to the outside world. The robot, in accord with the latest fad in education, is programmed to encourage them to build up a picture of all the laws of physics based on their own experiments, without a textbook to tell them the right answers. Putting yourself in the twins' shoes, imagine giving up all your preconceived ideas about space and time, which may turn out according to relativity to be completely wrong, or perhaps only approximations that are valid under certain circumstances.

Causality is one thing the twins will notice. Certain events result in other events, forming a network of cause and effect. One general rule they infer from their observations is that there is an unambiguously defined notion of betweenness: if Alice observes that event 1 causes event 2, and then 2 causes 3, Betty always agrees that 2 lies between 1 and 3 in the chain of causality. They find that this agreement holds regardless of whether one twin is standing on her head (i.e., it's invariant under rotation), and regardless of whether one twin is sitting on the couch while the other is zooming around the living room in circles on her nuclear fusion scooter (i.e., it's also invariant with respect to different states of motion).

You may have heard that relativity is a theory that can be interpreted using non-Euclidean geometry. The invariance of betweenness is a basic geometrical property that is shared by both Euclidean and non-Euclidean geometry. We say that they are both ordered geometries. With this geometrical interpretation in mind, it will be useful to think of events not as actual notable occurrences but merely as an ambient sprinkling of points at which things could happen. For example, if Alice and Betty are eating dinner, Alice could choose to throw her mashed potatoes at Betty. Even if she refrains, there was the potential for a causal linkage between her dinner and Betty's forehead.

Betweenness is very weak. Alice and Betty may also make a number of conjectures that would say much more about causality. For example: (i) that the universe's entire network of causality is connected, rather than being broken up into separate parts; (ii) that the events are globally ordered, so that for any two events 1 and 2, either 1 could cause 2 or 2 could cause 1, but not both; (iii) not only are the events ordered, but the ordering can be modeled by sorting the events out along a line, the time axis, and assigning a number t, time, to each event. To see what these conjectures would entail, let's discuss a few examples that may draw on knowledge from outside Alice and Betty's experiences.

Example: According to the Big Bang theory, it seems likely that the network is connected, since all events would presumably connect back to the Big Bang. On the other hand, if (i) were false we might have no way of finding out, because the lack of causal connections would make it impossible for us to detect the existence of the other universes represented by the other parts disconnected from our own universe.

Example: If we had a time machine,1 we could violate (ii), but this brings up paradoxes, like the possibility of killing one's own grandmother when she was a baby, and in any case nobody knows how to build a time machine.

Example: There are nevertheless strong reasons for believing that (ii) is false. For example, if we drop Alice into one black hole, and Betty into another, they will never be able to communicate again, and therefore there is no way to have any cause and effect relationship between Alice's events and Betty's.2

Since (iii) implies (ii), we suspect that (iii) is false as well. But Alice and Betty build clocks, and these clocks are remarkably successful at describing cause-and-effect relationships within the confines of the quarters in which they've lived their lives: events with higher clock readings never cause events with lower clock readings. They announce to their robot tutor that they've discovered a universal thing called time, which explains all causal relationships, and which their experiments show flows at the same rate everywhere within their quarters.

“Ah,” the tutor sighs, his metallic voice trailing off.

“I know that `ah', Tutorbot,” Betty says. “Come on, can't you just tell us what we did wrong?”

“You know that my pedagogical programming doesn't allow that.”

“Oh, sometimes I just want to strangle whoever came up with those stupid educational theories,” Alice says.

The twins go on strike, protesting that the time theory works perfectly in every experiment they've been able to imagine. Tutorbot gets on the commlink with his masters and has a long, inaudible argument, which, judging from the hand gestures, the twins imagine to be quite heated. He announces that he's gotten approval for a field trip for one of the twins, on the condition that she remain in a sealed environment the whole time so as to maintain the conditions of the educational experiment.

“Who gets to go?” Alice asks.

“Betty,” Tutorbot replies, “because of the mashed potatoes.”

“But I refrained!” Alice says, stamping her foot.

“Only one time out of the last six that I served them.”

The next day, Betty, smiling smugly, climbs aboard the sealed spaceship carrying a duffel bag filled with a large collection of clocks for the trip. Each clock has a duplicate left behind with Alice. The clock design that they're proudest of consists of a tube with two mirrors at the ends. A flash of light bounces back and forth between the ends, with each round trip counting as one “tick,” one unit of time. The twins are convinced that this one will run at a constant rate no matter what, since it has no moving parts that could be affected by the vibrations and accelerations of the journey.

Betty's field trip is dull. She doesn't get to see any of the outside world. In fact, the only way she can tell she's not still at home is that she sometimes feels strong sensations of acceleration. (She's grown up in zero gravity, so the pressing sensation is novel to her.) She's out of communication with Alice, and all she has to do during the long voyage is to tend to her clocks. As a crude check, she verifies that the light clock seems to be running at its normal rate, judged against her own pulse. The pendulum clock gets out of synch with the light clock during the accelerations, but that doesn't surprise her, because it's a mechanical clock with moving parts. All of the nonmechanical clocks seem to agree quite well. She gets hungry for breakfast, lunch, and dinner at the usual times.

When Betty gets home, Alice asks, “Well?”

“Great trip, too bad you couldn't come. I met some cute boys, went out dancing, ...”

“You did not. What about the clocks?”

“They all checked out fine. See, Tutorbot? The time theory still holds up.”

“That was an anticlimax,” Alice says. “I'm going back to bed now.”

“Bed?” Betty exclaims. “It's three in the afternoon.”

The twins now discover that although all of Alice's clocks agree among themselves, and similarly for all of Betty's (except for the ones that were obviously disrupted by mechanical stresses), Alice's and Betty's clocks disagree with one another. A week has passed for Alice, but only a couple of days for Betty.

1.2 Experimental tests of the nature of time


a / The clock took up two seats, and two tickets were bought for it under the name of “Mr. Clock.”

1.2.1 The Hafele-Keating experiment

In 1971, J.C. Hafele and R.E. Keating3 of the U.S. Naval Observatory brought atomic clocks aboard commercial airliners and went around the world, once from east to west and once from west to east. (The clocks had their own tickets, and occupied their own seats.) As in the parable of Alice and Betty, Hafele and Keating observed that there was a discrepancy between the times measured by the traveling clocks and the times measured by similar clocks that stayed at the lab in Washington. The result was that the east-going clock lost an amount of time Δ tE=-59±10 ns, while the west-going one gained Δ tW=+273±7 ns. This establishes that time is not universal and absolute.

Nevertheless, causality was preserved. The nanosecond-scale effects observed were small compared to the three-day lengths of the plane trips. There was no opportunity for paradoxical situations such as, for example, a scenario in which the east-going experimenter arrived back in Washington before he left and then proceeded to convince himself not to take the trip.

Hafele and Keating were testing specific quantitative predictions of relativity, and they verified them to within their experiment's error bars. At this point in the book, we aren't in possession of enough relativity to be able to make such calculations, but, like Alice and Betty, we can inspect the empirical results for clues as to how time works.

The opposite signs of the two results suggests that the rate at which time flows depends on the motion of the observer. The east-going clock was moving in the same direction as the earth's rotation, so its velocity relative to the earth's center was greater than that of the ones that remained in Washington, while the west-going clock's velocity was correspondingly reduced.4 The signs of the Δ t's show that moving clocks were slower.

On the other hand, the asymmetry of the results, with |Δ tE| ≠ |Δ tW|, implies that there was a second effect involved, simply due to the planes' being up in the air. Relativity predicts that time's rate of flow also changes with height in a gravitational field. The deeper reasons for such an effect are given in section 1.5.6 on page 34.

Although Hafele and Keating's measurements were on the ragged edge of the state of the art in 1971, technology has now progressed to the point where such effects have everyday consequences. The satellites of the Global Positioning System (GPS) orbit at a speed of 1.9× 103 m/s, an order of magnitude faster than a commercial jet. Their altitude of 20,000 km is also much greater than that of an aircraft. For both these reasons, the relativistic effect on time is stronger than in the Hafele-Keating experiment. The atomic clocks aboard the satellites are tuned to a frequency of 10.22999999543 MHz, which is perceived on the ground as 10.23 MHz. (This frequency shift will be calculated in example 12 on page 59.)

1.2.2 Muons

Although the Hafele-Keating experiment is impressively direct, it was not the first verification of relativistic effects on time, it did not completely separate the kinematic and gravitational effects, and the effect was small. An early experiment demonstrating a large and purely kinematic effect was performed in 1941 by Rossi and Hall, who detected cosmic-ray muons at the summit and base of Mount Washington in New Hampshire. The muon has a mean lifetime of 2.2 μs, and the time of flight between the top and bottom of the mountain (about 2 km for muons arriving along a vertical path) at nearly the speed of light was about 7 μs, so in the absence of relativistic effects, the flux at the bottom of the mountain should have been smaller than the flux at the top by about an order of magnitude. The observed ratio was much smaller, indicating that the “clock” constituted by nuclear decay processes was dramatically slowed down by the motion of the muons.

1.2.3 Gravitational red-shifts

The first experiment that isolated the gravitational effect on time was a 1925 measurement by W.S. Adams of the spectrum of light emitted from the surface of the white dwarf star Sirius B. The gravitational field at the surface of Sirius B is 4× 105 g, and the gravitational potential is about 3,000 times greater than at the Earth's surface. The emission lines of hydrogen were red-shifted, i.e., reduced in frequency, and this effect was interpreted as a slowing of time at the surface of Sirius relative to the surface of the Earth. Historically, the mass and radius of Sirius were not known with better than order of magnitude precision in 1925, so this observation did not constitute a good quantitative test.

The first such experiment to be carried out under controlled conditions, by Pound and Rebka in 1959, is analyzed quantitatively in example 7 on page 129.


b / Gravity Probe A.

The first high-precision experiment of this kind was Gravity Probe A, a 1976 experiment5 in which a space probe was launched vertically from Wallops Island, Virginia, at less than escape velocity, to an altitude of 10,000 km, after which it fell back to earth and crashed down in the Atlantic Ocean. The probe carried a hydrogen maser clock which was used to control the frequency of a radio signal. The radio signal was received on the ground, the nonrelativistic Doppler shift was subtracted out, and the residual blueshift was interpreted as the gravitational effect effect on time, matching the relativistic prediction to an accuracy of 0.01%.

1.3 Non-simultaneity and the maximum speed of cause and effect

We've seen that time flows at different rates for different observers. Suppose that Alice and Betty repeat their Hafele-Keating-style experiment, but this time they are allowed to communicate during the trip. Once Betty's ship completes its initial acceleration away from Betty, she cruises at constant speed, and each girl has her own equally valid inertial frame of reference. Each twin considers herself to be at rest, and says that the other is the one who is moving. Each one says that the other's clock is the one that is slow. If they could pull out their phones and communicate instantaneously, with no time lag for the propagation of the signals, they could resolve the controversy. Alice could ask Betty, “What time does your clock read right now?” and get an immediate answer back.

By the symmetry of their frames of reference, however, it seems that Alice and Betty should not be able to resolve the controversy during Betty's trip. If they could, then they could release two radar beacons that would permanently establish two inertial frames of reference, A and B, such that time flowed, say, more slowly in B than in A. This would violate the principle that motion is relative, and that all inertial frames of reference are equally valid. The best that they can do is to compare clocks once Betty returns, and verify that the net result of the trip was to make Betty's clock run more slowly on the average.

Alice and Betty can never satisfy their curiosity about exactly when during Betty's voyage the discrepancies accumulated or at what rate. This is information that they can never obtain, but they could obtain it if they had a system for communicating instantaneously. We conclude that instantaneous communication is impossible. There must be some maximum speed at which signals can propagate --- or, more generally, a maximum speed at which cause and effect can propagate --- and this speed must for example be greater than or equal to the speed at which radio waves propagate. It is also evident from these considerations that simultaneity itself cannot be a meaningful concept in relativity.

1.4 Ordered geometry


a / Axioms O2 (left) and O3 (right).


b / Stephen Hawking (1942-).

Let's try to put what we've learned into a general geometrical context.

Euclid's familiar geometry of two-dimensional space has the following axioms,6 which are expressed in terms of operations that can be carried out with a compass and unmarked straightedge:

  • E1 Two points determine a line.
  • E2 Line segments can be extended.
  • E3 A unique circle can be constructed given any point as its center and any line segment as its radius.
  • E4 All right angles are equal to one another.
  • E5 Parallel postulate: Given a line and a point not on the line, no more than one line can be drawn through the point and parallel to the given line.7

The modern style in mathematics is to consider this type of axiomatic system as a self-contained sandbox, with the axioms, and any theorems proved from them, being true or false only in relation to one another. Euclid and his contemporaries, however, believed them to be empirical facts about physical reality. For example, they considered the fifth postulate to be less obvious than the first four, because in order to verify physically that two lines were parallel, one would theoretically have to extend them to an infinite distance and make sure that they never crossed. In the first 28 theorems of the Elements, Euclid restricts himself entirely to propositions that can be proved based on the more secure first four postulates. The more general geometry defined by omitting the parallel postulate is known as absolute geometry.

What kind of geometry is likely to be applicable to general relativity? We can see immediately that Euclidean geometry, or even absolute geometry, would be far too specialized. We have in mind the description of events that are points in both space and time. Confining ourselves for ease of visualization to one dimension worth of space, we can certainly construct a plane described by coordinates (t,x), but imposing Euclid's postulates on this plane results in physical nonsense. Space and time are physically distinguishable from one another. But postulates 3 and 4 describe a geometry in which distances measured along non-parallel axes are comparable, and figures may be freely rotated without affecting the truth or falsehood of statements about them; this is only appropriate for a physical description of different spacelike directions, as in an (x,y) plane whose two axes are indistinguishable.

We need to throw most of the specialized apparatus of Euclidean geometry overboard. Once we've stripped our geometry to a bare minimum, then we can go back and build up a different set of equipment that will be better suited to relativity.

The stripped-down geometry we want is called ordered geometry, and was developed by Moritz Pasch around 1882. As suggested by the parable of Alice and Betty, ordered geometry does not have any global, all-encompassing system of measurement. When Betty goes on her trip, she traces out a particular path through the space of events, and Alice, staying at home, traces another. Although events play out in cause-and-effect order along each of these paths, we do not expect to be able to measure times along paths A and B and have them come out the same. This is how ordered geometry works: points can be put in a definite order along any particular line, but not along different lines. Of the four primitive concepts used in Euclid's E1-E5 --- point, line, circle, and angle --- only the non-metrical notions of point (i.e., event) and line are relevant in ordered geometry. In a geometry without measurement, there is no concept of measuring distance (hence no compasses or circles), or of measuring angles. The notation [ABC] indicates that event B lies on a line segment joining A and C, and is strictly between them.

The axioms of ordered geometry are as follows:8

  • O1 Two events determine a line.
  • O2 Line segments can be extended: given A and B, there is at least one event such that [ABC] is true.
  • O3 Lines don't wrap around: if [ABC] is true, then [BCA] is false.
  • O4 Betweenness: For any three distinct events A, B, and C lying on the same line, we can determine whether or not B is between A and C (and by statement 3, this ordering is unique except for a possible over-all reversal to form [CBA]).

O1-O2 express the same ideas as Euclid's E1-E2. Not all lines in the system will correspond physically to chains of causality; we could have a line segment that describes a snapshot of a steel chain, and O3-O4 then say that the order of the links is well defined. But O3 and O4 also have clear physical significance for lines describing causality. O3 forbids time travel paradoxes, like going back in time and killing our own grandmother as a child; figure a illustrates why a violation of O3 is referred to as a closed timelike curve. O4 says that events are guaranteed to have a well-defined cause-and-effect order only if they lie on the same line. This is completely different from the attitude expressed in Newton's famous statement: “Absolute, true and mathematical time, of itself, and from its own nature flows equably without regard to anything external ...”

If you're dismayed by the austerity of a system of geometry without any notion of measurement, you may be more appalled to learn that even a system as weak as ordered geometry makes some statements that are too strong to be completely correct as a foundation for relativity. For example, if an observer falls into a black hole, at some point he will reach a central point of infinite density, called a singularity. At this point, his chain of cause and effect terminates, violating O2. It is also an open question whether O3's prohibition on time-loops actually holds in general relativity; this is Stephen Hawking's playfully named chronology protection conjecture. We'll also see that in general relativity O1 is almost always true, but there are exceptions.

1.5 The equivalence principle


a / The cannonball and the musketball have identical parabolic world-lines. On this type of space-time plot, space is conventionally shown on the horizontal axis, so the tower has to be depicted on its side.


b / A piece of string held taut on a globe forms a geodesic from Mexico City to London. Although it appears curved, it is the analog of a straight line in the non-Euclidean geometry confined to the surface of the Earth. Similarly, the world-lines of figure a appear curved, but they are the analogs of straight lines in the non-Euclidean geometry used to describe gravitational fields in general relativity.

1.5.1 Proportionality of inertial and gravitational mass

What physical interpretation should we give to the “lines” described in ordered geometry? Galileo described an experiment (which he may or may not have actually performed) in which he simultaneously dropped a cannonball and a musket ball from a tall tower. The two objects hit the ground simultaneously, disproving Aristotle's assertion that objects fell at a speed proportional to their weights. On a graph of spacetime with x and t axes, the curves traced by the two objects, called their world-lines, are identical parabolas. (The paths of the balls through x-y-z space are straight, not curved.) One way of explaining this observation is that what we call “mass” is really two separate things, which happen to be equal. Inertial mass, which appears in Newton's a=F/m, describes how difficult it is to accelerate an object. Gravitational mass describes the strength with which gravity acts. The cannonball has a hundred times more gravitational mass than the musket ball, so the force of gravity acting on it is a hundred times greater. But its inertial mass is also precisely a hundred times greater, so the two effects cancel out, and it falls with the same acceleration. This is a special property of the gravitational force. Electrical forces, for example, do not behave this way. The force that an object experiences in an electric field is proportional to its charge, which is unrelated to its inertial mass, so different charges placed in the same electric field will in general have different motions.

1.5.2 Geometrical treatment of gravity

Einstein realized that this special property of the gravitational force made it possible to describe gravity in purely geometrical terms. We define the world-lines of small9 objects acted on by gravity to be the lines described by the axioms of the geometry. Since we normally think of the “lines” described by Euclidean geometry and its kin as straight lines, this amounts to a redefinition of what it means for a line to be straight. By analogy, imagine stretching a piece of string taut across a globe, as we might do in order to plan an airplane flight or aim a directional radio antenna. The string may not appear straight as viewed from the three-dimensional Euclidean space in which the globe is embedded, but it is as straight as possible in the sense that it is the path followed by a radio wave,10 or by an airplane pilot who keeps her wings level and her rudder straight. The world-“line” of an object acted on by nongravitational forces is not considered to be a straight “line” in the sense of O1-O4. When necessary, one eliminates this ambiguity in the overloaded term “line” by referring to the lines of O1-O4 as geodesics. The world-line of a low-mass object acted on only by gravity is one type of geodesic.11

We can now see the deep physical importance of statement O1, that two events determine a line. To predict the trajectory of a golf ball, we need to have some initial data. For example, we could measure event A when the ball breaks contact with the club, and event B an infinitesimal time after A.12 This pair of observations can be thought of as fixing the ball's initial position and velocity, which should be enough to predict a unique world-line for the ball, since relativity is a deterministic theory. With this interpretation, we can also see why it is not necessarily a disaster for the theory if O1 fails sometimes. For example, event A could mark the launching of two satellites into circular orbits from the same place on the Earth, heading in opposite directions, and B could be their subsequent collision on the opposite side of the planet. Although this violates O1, it doesn't violate determinism. Determinism only requires the validity of O1 for events infinitesimally close together. Even for randomly chosen events far apart, the probability that they will violate O1 is zero.


c / Loránd Eötvös (1848-1919).


d / If the geodesics defined by an airplane and a radio wave differ from one another, then it is not possible to treat both problems exactly using the same geometrical theory. In general relativity, this would be analogous to a violation of the equivalence principle. General relativity's validity as a purely geometrical theory of gravity requires that the equivalence principle be exactly satisfied in all cases.


e / An Eötvös experiment. Top: simplified version. Bottom: realistic version by Braginskii and Panov. (Drawing after Braginskii and Panov.)

1.5.3 Eötvös experiments

Einstein's entire system breaks down if there is any violation, no matter how small, of the proportionality between inertial and gravitational mass, and it therefore becomes very interesting to search experimentally for such a violation. For example, we might wonder whether neutrons and protons had slightly different ratios of gravitational and inertial mass, which in a Galileo-style experiment would cause a small difference between the acceleration of a lead weight, with a large neutron-to-proton ratio, and a wooden one, which consists of light elements with nearly equal numbers of neutrons and protons. The first high-precision experiments of this type were performed by Eötvös around the turn of the twentieth century, and they verified the equivalence of inertial and gravitational mass to within about one part in 108. These are generically referred to as Eötvös experiments.

Figure e shows a strategy for doing Eötvös experiments that allowed a test to about one part in 1012. The top panel is a simplified version. The platform is balanced, so the gravitational masses of the two objects are observed to be equal. The objects are made of different substances. If the equivalence of inertial and gravitational mass fails to hold for these two substances, then the force of gravity on each mass will not be exact proportion to its inertia, and the platform will experience a slight torque as the earth spins. The bottom panel shows a more realistic drawing of an experiment by Braginskii and Panov.13 The whole thing was encased in a tall vacuum tube, which was placed in a sealed basement whose temperature was controlled to within 0.02°\textup{C}. The total mass of the platinum and aluminum test masses, plus the tungsten wire and the balance arms, was only 4.4 g. To detect tiny motions, a laser beam was bounced off of a mirror attached to the wire. There was so little friction that the balance would have taken on the order of several years to calm down completely after being put in place; to stop these vibrations, static electrical forces were applied through the two circular plates to provide very gentle twists on the ellipsoidal mass between them.

Equivalence of gravitational fields and accelerations

One consequence of the Eötvös experiments' null results is that it is not possible to tell the difference between an acceleration and a gravitational field. At certain times during Betty's field trip, she feels herself pressed against her seat, and she interprets this as evidence that she's in a space vessel that is undergoing violent accelerations and decelerations. But it's equally possible that Tutorbot has simply arranged for her capsule to be hung from a rope and dangled into the gravitational field of a planet. Suppose that the first explanation is correct. The capsule is initially at rest in outer space, where there is no gravity. Betty can release a pencil and a lead ball in the air inside the cabin, and they will stay in place. The capsule then accelerates, and to Betty, who has adopted a frame of reference tied to its deck, ceiling and walls, it appears that the pencil and the ball fall to the deck. They are guaranteed to stay side by side until they hit the deckplates, because in fact they aren't accelerating; they simply appear to accelerate, when in reality it's the deckplates that are coming up and hitting them. But now consider the second explanation, that the capsule has been dipped into a gravitational field. The ball and the pencil will still fall side by side to the floor, because they have the same ratio of gravitational to inertial mass.


g / An artificial horizon.


h / Bars of upsidasium are kept in special warehouses, bolted to the ground. Copyright Jay Ward Productions, used under U.S. fair use excpetion to copyright law.


j / Two local Lorentz frames.


k / One planet rotates about its axis and the other does not. As discussed in more detail on p. 115, Einstein believed that general relativity was even more radically egalitarian about frames of reference than it really is. He thought that if the planets were alone in an otherwise empty universe, there would be no way to tell which planet was really rotating and which was not, so that B's equatorial bulge would have to disappear. There would be no way to tell which planet's surface was a Lorentz frame.


l / Chiao's paradox: a charged particle and a neutral particle are in orbit around the earth. Will the charged particle radiate, violating the equivalence principle?

1.5.4 The equivalence principle

This leads to one way of stating a central principle of relativity known as the equivalence principle: Accelerations and gravitational fields are equivalent. There is no experiment that can distinguish one from the other.14

To see what a radical departure this is, we need to compare with the completely different picture presented by Newtonian physics and special relativity. Newton's law of inertia states that “Every object perseveres in its state of rest, or of uniform motion in a straight line, unless it is compelled to change that state by forces impressed thereon.”15 Newton's intention here was to clearly state a contradiction of Aristotelian physics, in which objects were supposed to naturally stop moving and come to rest in the absence of a force. For Aristotle, “at rest” meant at rest relative to the Earth, which represented a special frame of reference. But if motion doesn't naturally stop of its own accord, then there is no longer any way to single out one frame of reference, such as the one tied to the Earth, as being special. An equally good frame of reference is a car driving in a straight line down the interstate at constant speed. The earth and the car both represent valid inertial frames of reference, in which Newton's law of inertia is valid. On the other hand, there are other, noninertial frames of reference, in which the law of inertia is violated. For example, if the car decelerates suddenly, then it appears to the people in the car as if their bodies are being jerked forward, even though there is no physical object that could be exerting any type of forward force on them. This distinction between inertial and noninertial frames of reference was carried over by Einstein into his theory of special relativity, published in 1905.

But by the time he published the general theory in 1915, Einstein had realized that this distinction between inertial and noninertial frames of reference was fundamentally suspect. How do we know that a particular frame of reference is inertial? One way is to verify that its motion relative to some other inertial frame, such as the Earth's, is in a straight line and at constant speed. But how does the whole thing get started? We need to bootstrap the process with at least one frame of reference to act as our standard. We can look for a frame in which the law of inertia is valid, but now we run into another difficulty. To verify that the law of inertia holds, we have to check that an observer tied to that frame doesn't see objects accelerating for no reason. The trouble here is that by the equivalence principle, there is no way to determine whether the object is accelerating “for no reason” or because of a gravitational force. Betty, for example, cannot tell by any local measurement (i.e., any measurement carried out within the capsule) whether she is in an inertial or a noninertial frame.


f / Wouldn't it be nice if we could define the meaning of a Newtonian inertial frame of reference? Newton makes it sound easy: to define an inertial frame, just find some object that is not accelerating because it is not being acted on by any external forces. But what object would we use? The earth? The “fixed stars?” Our galaxy? Our supercluster of galaxies? All of these are accelerating --- relative to something.

We could hope to resolve the ambiguity by making non-local measurements instead. For example, if Betty had been allowed to look out a porthole, she could have tried to tell whether her capsule was accelerating relative to the stars. Even this possibility ends up not being satisfactory. The stars in our galaxy are moving in circular orbits around the galaxy. On an even larger scale, the universe is expanding in the aftermath of the Big Bang. It spent about the first half of its history decelerating due to gravitational attraction, but the expansion is now observed to be accelerating, apparently due to a poorly understood phenomenon referred to by the catch-all term “dark energy.” In general, there is no distant background of physical objects in the universe that is not accelerating.

Lorentz frames

The conclusion is that we need to abandon the entire distinction between Newton-style inertial and noninertial frames of reference. The best that we can do is to single out certain frames of reference defined by the motion of objects that are not subject to any nongravitational forces. A falling rock defines such a frame of reference. In this frame, the rock is at rest, and the ground is accelerating. The rock's world-line is a straight line of constant x=0 and varying t. Such a free-falling frame of reference is called a Lorentz frame. The frame of reference defined by a rock sitting on a table is an inertial frame of reference according to the Newtonian view, but it is not a Lorentz frame.

In Newtonian physics, inertial frames are preferable because they make motion simple: objects with no forces acting on them move along straight world-lines. Similarly, Lorentz frames occupy a privileged position in general relativity because they make motion simple: objects move along “straight” world-lines if they have no nongravitational forces acting on them.

Example 1: The artificial horizon

The pilot of an airplane cannot always easily tell which way is up. The horizon may not be level simply because the ground has an actual slope, and in any case the horizon may not be visible if the weather is foggy. One might imagine that the problem could be solved simply by hanging a pendulum and observing which way it pointed, but by the equivalence principle the pendulum cannot tell the difference between a gravitational field and an acceleration of the aircraft relative to the ground --- nor can any other accelerometer, such as the pilot's inner ear. For example, when the plane is turning to the right, accelerometers will be tricked into believing that “down” is down and to the left. To get around this problem, airplanes use a device called an artificial horizon, which is essentially a gyroscope. The gyroscope has to be initialized when the plane is known to be oriented in a horizontal plane. No gyroscope is perfect, so over time it will drift. For this reason the instrument also contains an accelerometer, and the gyroscope is automatically restored to agreement with the accelerometer, with a time-constant of several minutes. If the plane is flown in circles for several minutes, the artificial horizon will be fooled into indicating that the wrong direction is vertical.

Example 2: No antigravity

This whole chain of reasoning was predicated on the null results of Eötvös experiments. In the Rocky and Bullwinkle cartoons, there is a non-Eötvösian substance called upsidasium, which falls up instead of down. Its ratio of gravitational to inertial mass is apparently negative. If such a substance could be found, it would falsify the equivalence principle. Cf. example 10, p. 283.

Operational definition of a Lorentz frame

We can define a Lorentz frame in operational terms using an idealized variation (figure i) on a device actually built by Harold Waage at Princeton as a lecture demonstration to be used by his partner in crime John Wheeler. Build a sealed chamber whose contents are isolated from all nongravitational forces. Of the four known forces of nature, the three we need to exclude are the strong nuclear force, the weak nuclear force, and the electromagnetic force. The strong nuclear force has a range of only about 1 fm (10-15 m), so to exclude it we merely need to make the chamber thicker than that, and also surround it with enough paraffin wax to keep out any neutrons that happen to be flying by. The weak nuclear force also has a short range, and although shielding against neutrinos is a practical impossibility, their influence on the apparatus inside will be negligible. To shield against electromagnetic forces, we surround the chamber with a Faraday cage and a solid sheet of mu-metal. Finally, we make sure that the chamber is not being touched by any surrounding matter, so that short-range residual electrical forces (sticky forces, chemical bonds, etc.) are excluded. That is, the chamber cannot be supported; it is free-falling.


i / The spherical chamber, shown in a cutaway view, has layers of shielding to exclude all known nongravitational forces. Once the chamber has been calibrated by marking the three dashed-line trajectories under free-fall conditions, an observer inside the chamber can always tell whether she is in a Lorentz frame.

Crucially, the shielding does not exclude gravitational forces. There is in fact no known way of shielding against gravitational effects such as the attraction of other masses (example 10, p. 283) or the propagation of gravitational waves (ch. 9). Because the shielding is spherical, it exerts no gravitational force of its own on the apparatus inside.

Inside, an observer carries out an initial calibration by firing bullets along three Cartesian axes and tracing their paths, which she defines to be linear.

We've gone to elaborate lengths to show that we can really determine, without reference to any external reference frame, that the chamber is not being acted on by any nongravitational forces, so that we know it is free-falling. In addition, we also want the observer to be able to tell whether the chamber is rotating. She could look out through a porthole at the stars, but that would be missing the whole point, which is to show that without reference to any other object, we can determine whether a particular frame is a Lorentz frame. One way to do this would be to watch for precession of a gyroscope. Or, without having to resort to additional apparatus, the observer can check whether the paths traced by the bullets change when she changes the muzzle velocity. If they do, then she infers that there are velocity-dependent Coriolis forces, so she must be rotating. She can then use flywheels to get rid of the rotation, and redo the calibration.

After the initial calibration, she can always tell whether or not she is in a Lorentz frame. She simply has to fire the bullets, and see whether or not they follow the precalibrated paths. For example, she can detect that the frame has become non-Lorentzian if the chamber is rotated, allowed to rest on the ground, or accelerated by a rocket engine.

It may seem that the detailed construction of this elaborate thought-experiment does nothing more than confirm something obvious. It is worth pointing out, then, that we don't really know whether it works or not. It works in general relativity, but there are other theories of gravity, such as Brans-Dicke gravity (p. 322), that are also consistent with all known observations, but in which the apparatus in figure i doesn't work. Two of the assumptions made above fail in this theory: gravitational shielding effects exist, and Coriolis effects become undetectable if there is not enough other matter nearby.

Locality of Lorentz frames

It would be convenient if we could define a single Lorentz frame that would cover the entire universe, but we can't. In figure j, two girls simultaneously drop down from tree branches --- one in Los Angeles and one in Mumbai. The girl free-falling in Los Angeles defines a Lorentz frame, and in that frame, other objects falling nearby will also have straight world-lines. But in the LA girl's frame of reference, the girl falling in Mumbai does not have a straight world-line: she is accelerating up toward the LA girl with an acceleration of about 2g.

A second way of stating the equivalence principle is that it is always possible to define a local Lorentz frame in a particular neighborhood of spacetime.16 It is not possible to do so on a universal basis.

The locality of Lorentz frames can be understood in the analogy of the string stretched across the globe. We don't notice the curvature of the Earth's surface in everyday life because the radius of curvature is thousands of kilometers. On a map of LA, we don't notice any curvature, nor do we detect it on a map of Mumbai, but it is not possible to make a flat map that includes both LA and Mumbai without seeing severe distortions.


The meanings of words evolve over time, and since relativity is now a century old, there has been some confusing semantic drift in its nomenclature. This applies both to “inertial frame” and to “special relativity.”

Early formulations of general relativity never refer to “inertial frames,” “Lorentz frames,” or anything else of that flavor. The very first topic in Einstein's original systematic presentation of the theory17 is an example (figure k) involving two planets, the purpose of which is to convince the reader that all frames of reference are created equal, and that any attempt to make some of them into second-class citizens is invidious. Other treatments of general relativity from the same era follow Einstein's lead.18 The trouble is that this example is more a statement of Einstein's aspirations for his theory than an accurate depiction of the physics that it actually implies. General relativity really does allow an unambiguous distinction to be made between Lorentz frames and non-Lorentz frames, as described on p. 26. Einstein's statement should have been weaker: the laws of physics (such as the Einstein field equation, p. 263) are the same in all frames (Lorentz or non-Lorentz). This is different from the situation in Newtonian mechanics and special relativity, where the laws of physics take on their simplest form only in Newton-inertial frames.

Because Einstein didn't want to make distinctions between frames, we ended up being saddled with inconvenient terminology for them. The least verbally awkward choice is to hijack the term “inertial,” redefining it from its Newtonian meaning. We then say that the Earth's surface is not an inertial frame, in the context of general relativity, whereas in the Newtonian context it is an inertial frame to a very good approximation. This usage is fairly standard,19 but would have made Newton confused and Einstein unhappy. If we follow this usage, then we may sometimes have to say “Newtonian-inertial” or “Einstein-inertial.” A more awkward, but also more precise, term is “Lorentz frame,” as used in this book; this seems to be widely understood.20

The distinction between special and general relativity has undergone a similar shift over the decades. Einstein originally defined the distinction in terms of the admissibility of accelerated frames of reference. This, however, puts us in the absurd position of saying that special relativity, which is supposed to be a generalization of Newtonian mechanics, cannot handle accelerated frames of reference in the same way that Newtonian mechanics can. In fact both Newtonian mechanics and special relativity treat Newtonian-noninertial frames of reference in the same way: by modifying the laws of physics so that they do not take on their most simple form (e.g., violating Newton's third law), while retaining the ability to change coordinates back to a preferred frame in which the simpler laws apply. It was realized fairly early on21 that the important distinction was between special relativity as a theory of flat spacetime, and general relativity as a theory that described gravity in terms of curved spacetime. All relativists writing since about 1950 seem to be in agreement on this more modern redefinition of the terms.22

In an accelerating frame, the equivalence principle tells us that measurements will come out the same as if there were a gravitational field. But if the spacetime is flat, describing it in an accelerating frame doesn't make it curved. (Curvature is a physical property of spacetime, and cannot be changed from zero to nonzero simply by a choice of coordinates.) Thus relativity allows us to have gravitational fields in flat space --- but only for certain special configurations like this one. Special relativity is capable of operating just fine in this context. For example, Chung et al.23 did a high-precision test of special relativity in 2009 using a matter interferometer in a vertical plane, specifically in order to test whether there was any violation of special relativity in a uniform gravitational field. Their experiment is interpreted purely as a test of special relativity, not general relativity.

Chiao's paradox

The remainder of this subsection deals with the subtle question of whether and how the equivalence principle can be applied to charged particles. You may wish to skip it on a first reading. The short answer is that using the equivalence principle to make conclusions about charged particles is like the attempts by slaveholders and abolitionists in the 19th century U.S. to support their positions based on the Bible: you can probably prove whichever conclusion was the one you set out to prove.

The equivalence principle is not a single, simple, mathematically well defined statement.24 As an example of an ambiguity that is still somewhat controversial, 90 years after Einstein first proposed the principle, consider the question of whether or not it applies to charged particles. Raymond Chiao25 proposes the following thought experiment, which I'll refer to as Chiao's paradox. Let a neutral particle and a charged particle be set, side by side, in orbit around the earth. Assume (unrealistically) that the space around the earth has no electric or magnetic field. If the equivalence principle applies regardless of charge, then these two particles must go on orbiting amicably, side by side. But then we have a violation of conservation of energy, since the charged particle, which is accelerating, will radiate electromagnetic waves (with very low frequency and amplitude). It seems as though the particle's orbit must decay.

The resolution of the paradox, as demonstrated by hairy calculations26 is interesting because it exemplifies the local nature of the equivalence principle. When a charged particle moves through a gravitational field, in general it is possible for the particle to experience a reaction from its own electromagnetic fields. This might seem impossible, since an observer in a frame momentarily at rest with respect to the particle sees the radiation fly off in all directions at the speed of light. But there are in fact several different mechanisms by which a charged particle can be reunited with its long-lost electromagnetic offspring. An example (not directly related to Chiao's scenario) is the following.

Bring a laser very close to a black hole, but not so close that it has strayed inside the event horizon, which is the spherical point of no return from within which nothing can escape. Example 15 on page 65 gives a plausibility argument based on Newtonian physics that the radius27 of the event horizon should be something like rH=GM/c2, and section 6.3.2 on page 224 derives the relativistically correct factor of 2 in front, so that rH=2GM/c2. It turns out that at r=(3/2)RH, a ray of light can have a circular orbit around the black hole. Since this is greater than RH, we can, at least in theory, hold the laser stationary at this value of r using a powerful rocket engine. If we point the laser in the azimuthal direction, its own beam will come back and hit it.

Since matter can experience a back-reaction from its own electromagnetic radiation, it becomes plausible how the paradox can be resolved. The equivalence principle holds locally, i.e., within a small patch of space and time. If Chiao's charged and neutral particle are released side by side, then they will obey the equivalence principle for at least a certain amount of time --- and “for at least a certain amount of time” is all we should expect, since the principle is local. But after a while, the charged particle will start to experience a back-reaction from its own electromagnetic fields, and this causes its orbit to decay, satisfying conservation of energy. Since Chiao's particles are orbiting the earth, and the earth is not a black hole, the mechanism clearly can't be as simple as the one described above, but Gr\o{}n and Næss show that there are similar mechanisms that can apply here, e.g., scattering of light waves by the nonuniform gravitational field.

It is worth keeping in mind the DeWitts' caution that “The questions answered by this investigation are of conceptual interest only, since the forces involved are far too small to be detected experimentally” (see problem 8, p. 39).


m / 1. A photon is emitted upward from the floor of the elevator. The elevator accelerates upward. 2. By the time the photon is detected at the ceiling, the elevator has changed its velocity, so the photon is detected with a Doppler shift.


n / An electromagnetic wave strikes an ohmic surface. The wave's electric field excites an oscillating current density J. The wave's magnetic field then acts on these currents, producing a force in the direction of the wave's propagation. This is a pre-relativistic argument that light must possess inertia. The first experimental confirmation of this prediction is shown in figure o. See Nichols and Hull, "The pressure due to radiation," Phys. Rev. (Series I) 17 (1903) 26.


o / A simplified drawing of the 1903 experiment by Nichols and Hull that verified the predicted momentum of light waves. Two circular mirrors were hung from a fine quartz fiber, inside an evacuated bell jar. A 150 mW beam of light was shone on one of the mirrors for 6 s, producing a tiny rotation, which was measurable by an optical lever (not shown). The force was within 0.6% of the theoretically predicted value of 0.001 μN. For comparison, a short clipping of a single human hair weighs ∼ 1 μN.

1.5.5 Gravitational red-shifts

Starting on page 15, we saw experimental evidence that the rate of flow of time changes with height in a gravitational field. We can now see that this is required by the equivalence principle.

By the equivalence principle, there is no way to tell the difference between experimental results obtained in an accelerating laboratory and those found in a laboratory immersed in a gravitational field.28 In a laboratory accelerating upward, a photon emitted from the floor and would be Doppler-shifted toward lower frequencies when observed at the ceiling, because of the change in the receiver's velocity during the photon's time of flight. The effect is given by Δ E/Ef/f=ay/c2, where a is the lab's acceleration, y is the height from floor to ceiling, and c is the speed of light.

Self-check: Verify this statement.

By the equivalence principle, we find that when such an experiment is done in a gravitational field g, there should be a gravitational effect on the energy of a photon equal to Δ E/E=gy/c2. Since the quantity gy is the gravitational potential (gravitational energy per unit mass), the photon's fractional loss of energy is the same as the (Newtonian) loss of energy experienced by a material object of mass m and initial kinetic energy mc2.

The interpretation is as follows. Classical electromagnetism requires that electromagnetic waves have inertia. For example, if a plane wave strikes an ohmic surface, as in figure n, the wave's electric field excites oscillating currents in the surface. These currents then experience a magnetic force from the wave's magnetic field, and application of the right-hand rule shows that the resulting force is in the direction of propagation of the wave. Thus the light wave acts as if it has momentum. The equivalence principle says that whatever has inertia must also participate in gravitational interactions. Therefore light waves must have weight, and must lose energy when they rise through a gravitational field.

Self-check: Verify the application of the right-hand rule described above.

Further interpretation:

  • The quantity mc2 is famous, even among people who don't know what m and c stand for. This is the first hint of where it comes from. The full story is given in section 4.2.2.
  • The relation p=E/c between the energy and momentum of a light wave follows directly from Maxwell's equations, by the argument above; however, we will see in section 4.2.2 that according to relativity this relation must hold for any massless particle
  • What we have found agrees with Niels Bohr's correspondence principle, which states that when a new physical theory, such as relativity, replaces an older one, such as Newtonian physics, the new theory must agree with the old one under the experimental conditions in which the old theory had been verified by experiments. The gravitational mass of a beam of light with energy E is E/c2, and since c is a big number, it is not surprising that the weight of light rays had never been detected before Einstein trying to detect it.
  • This book describes one particular theory of gravity, Einstein's theory of general relativity. There are other theories of gravity, and some of these, such as the Brans-Dicke theory, do just as well as general relativity in agreeing with the presently available experimental data. Our prediction of gravitational Doppler shifts of light only depended on the equivalence principle, which is one ingredient of general relativity. Experimental tests of this prediction only test the equivalence principle; they do not allow us to distinguish between one theory of gravity and another if both theories incorporate the equivalence principle.
  • If an object such as a radio transmitter or an atom in an excited state emits an electromagnetic wave with a frequency f, then the object can be considered to be a type of clock. We can therefore interpret the gravitational red-shift as a gravitational time dilation: a difference in the rate at which time itself flows, depending on the gravitational potential. This is consistent with the empirical results presented in section 1.2.1, p. 15.
Example 3: Chiao's paradox revisited

The equivalence principle says that electromagnetic waves have gravitational mass as well as inertial mass, so it seems clear that the same must hold for static fields. In Chiao's paradox (p. 39), the orbiting charged particle has an electric field that extends out to infinity. When we measure the mass of a charged particle such as an electron, there is no way to separate the mass of this field from a more localized contribution. The electric field “falls” through the gravitational field, and the equivalence principle, which is local, cannot guarantee that all parts of the field rotate uniformly about the earth, even in distant parts of the universe. The electric field pattern becomes distorted, and this distortion causes a radiation reaction which back-reacts on the particle, causing its orbit to decay.


p / The Pound-Rebka experiment.


q / Emission of 14 keV gamma-rays by 57Fe. The parent nucleus 57Co absorbs an electron and undergoes a weak-force decay process that converts it into 57Fe, in an excited state. With 85% probability, this state decays to a state just above the ground state, with an excitation energy of 14 keV and a half-life of 10-7 s. This state finally decays, either by gamma emission or emission of an internal conversion electron, to the ground state.


r / Top: A graph of velocity versus time for the source. The velocity has both a constant component and an oscillating one with a frequency of 10-50 Hz. The constant component vo was used as a way of determining the calibration of frequency shift as a function of count rates. Data were acquired during the quarter-cycle periods of maximum oscillatory velocity, 1 and 2. Bottom: Count rates as a function of velocity, for vo=0 and v1≠ 0. The dashed curve and black circles represent the count rates that would have been observed if there were no gravitational effect. The gravitational effect shifts the resonance curve to one side (solid curve), resulting in an asymmetry of the count rates (open circles). The shift, and the resulting asymmetry, are greatly exaggerated for readability; in reality, the gravitational effect was 500 times smaller than the width of the resonance curve.


s / Pound and Rebka at the top and bottom of the tower.

1.5.6 The Pound-Rebka experiment

The 1959 Pound-Rebka experiment at Harvard29 was one of the first high-precision, relativistic tests of the equivalence principle to be carried out under controlled conditions, and in this section we will discuss it in detail.

When y is on the order of magnitude of the height of a building, the value of Δ E/E=gy/c2 is ∼ 10-14, so an extremely high-precision experiment is necessary in order to detect a gravitational red-shift. A number of other effects are big enough to obscure it entirely, and must somehow be eliminated or compensated for. These are listed below, along with their orders of magnitude in the experimental design finally settled on by Pound and Rebka.

(1) Classical Doppler broadening due to temperature. Thermal motion causes Doppler shifts of emitted photons, corresponding to the random component of the emitting atom’s velocity vector along the direction of emission.

<math xmlns=""> <mrow><mo lspace="0" rspace="0.278em">∼</mo><msup><mn>10</mn><mrow><mo lspace="0" rspace="0">-</mo><mn>6</mn></mrow></msup></mrow> </math>

(2) The recoil Doppler shift. When an atom emits a photon with energy<math xmlns=""> <mi>E</mi> </math> and momentum<math xmlns=""> <mrow><mi>p</mi><mo lspace="0.278em" rspace="0.278em">=</mo><mi>E</mi><mo lspace="0" rspace="0" stretchy="false">/</mo><mi>c</mi></mrow> </math>, conservation of momentum requires that the atom recoil with momentum<math xmlns=""> <mrow><mi>p</mi><mo lspace="0.278em" rspace="0.278em">=</mo><mo lspace="0" rspace="0">-</mo><mi>E</mi><mo lspace="0" rspace="0" stretchy="false">/</mo><mi>c</mi></mrow> </math> and energy<math xmlns=""> <mrow><msup><mi>p</mi><mn>2</mn></msup><mo lspace="0" rspace="0" stretchy="false">/</mo><mn>2</mn><mi>m</mi></mrow> </math>. This causes a downward Doppler shift of the energy of the emitted photon. A similar effect occurs on absorption, doubling the problem.

<math xmlns=""> <mrow><mo lspace="0" rspace="0.278em">∼</mo><msup><mn>10</mn><mrow><mo lspace="0" rspace="0">-</mo><mn>12</mn></mrow></msup></mrow> </math>

(3) Natural line width. The Heisenberg uncertainty principle says that a state with a half-life<math xmlns=""> <mi>τ</mi> </math> must have an uncertainty in its energy of at least<math xmlns=""> <mrow><mo lspace="0" rspace="0.278em">∼</mo><mi>h</mi><mo lspace="0" rspace="0" stretchy="false">/</mo><mi>τ</mi></mrow> </math>, where<math xmlns=""> <mi>h</mi> </math> is Planck’s constant.

<math xmlns=""> <mrow><mo lspace="0" rspace="0.278em">∼</mo><msup><mn>10</mn><mrow><mo lspace="0" rspace="0">-</mo><mn>12</mn></mrow></msup></mrow> </math>

(4) Special-relativistic Doppler shift due to temperature. Section ?? presented experimental evidence that time flows at a different rate depending on the motion of the observer. Therefore the thermal motion of an atom emitting a photon has an effect on the frequency of the photon, even if the atom’s motion is not along the line of emission. The equations needed in order to calculate this effect will not be derived until section ??; a quantitative estimate is given in example ?? on page ??. For now, we only need to know that this leads to a temperature-dependence in the average frequency of emission, in addition to the broadening of the bell curve described by effect (1) above.

<math xmlns=""> <mrow><mo lspace="0" rspace="0.278em">∼</mo><msup><mn>10</mn><mrow><mo lspace="0" rspace="0">-</mo><mn>14</mn></mrow></msup></mrow> </math> per degree C

The most straightforward way to mitigate effect (1) is to use photons emitted from a solid. At first glance this would seem like a bad idea, since electrons in a solid emit a continuous spectrum of light, not a discrete spectrum like the ones emitted by gases; this is because we have N electrons, where N is on the order of Avogadro's number, all interacting strongly with one another, so by the correspondence principle the discrete quantum-mechanical behavior must be averaged out. But the protons and neutrons within one nucleus do not interact much at all with those in other nuclei, so the photons emitted by a nucleus do have a discrete spectrum. The energy scale of nuclear excitations is in the keV or MeV range, so these photons are x-rays or gamma-rays. Furthermore, the time-scale of the random vibrations of a nucleus in a solid are extremely short. For a velocity on the order of 100 m/s, and vibrations with an amplitude of ∼ 10-10 m, the time is about 10-12 s. In many cases, this is much shorter than the half-life of the excited nuclear state emitting the gamma-ray, and therefore the Doppler shift averages out to nearly zero.

Effect (2) is still much bigger than the 10-14 size of the effect to be measured. It can be avoided by exploiting the Mössbauer effect, in which a nucleus in a solid substance at low temperature emits or absorbs a gamma-ray photon, but with significant probability the recoil is taken up not by the individual nucleus but by a vibration of the atomic lattice as a whole. Since the recoil energy varies as p2/2m, the large mass of the lattice leads to a very small dissipation of energy into the recoiling lattice. Thus if a photon is emitted and absorbed by identical nuclei in a solid, and for both emission and absorption the recoil momentum is taken up by the lattice as a whole, then there is a negligible energy shift. One must pick an isotope that emits photons with energies of about 10-100 keV. X-rays with energies lower than about 10 keV tend to be absorbed strongly by matter and are difficult to detect, whereas for gamma-ray energies >rsim 100 keV the Mössbauer effect is not sufficient to eliminate the recoil effect completely enough.

If the Mössbauer effect is carried out in a horizontal plane, resonant absorption occurs. When the source and absorber are aligned vertically, p, gravitational frequency shifts should cause a mismatch, destroying the resonance. One can move the source at a small velocity (typically a few mm/s) in order to add a Doppler shift onto the frequency; by determining the velocity that compensates for the gravitational effect, one can determine how big the gravitational effect is.

The typical half-life for deexcitation of a nucleus by emission of a gamma-ray with energy E is in the nanosecond range. To measure an gravitational effect at the 10-14 level, one would like to have a natural line width, (3), with Δ E/E ≤sssim 10-14, which would require a half-life of >rsim 10 μs. In practice, Pound and Rebka found that other effects, such as (4) and electron-nucleus interactions that depended on the preparation of the sample, tended to put nuclei in one sample “out of tune” with those in another sample at the 10-13-10-12 level, so that resonance could not be achieved unless the natural line width gave Δ E/E >rsim 10-12. As a result, they settled on an experiment in which 14 keV gammas were emitted by 57Fe nuclei (figure q) at the top of a 22-meter tower, and absorbed by 57Fe nuclei at the bottom. The 100-ns half-life of the excited state leads to Δ E/E ∼ 10-12. This is 500 times greater than the gravitational effect to be measured, so, as described in more detail below, the experiment depended on high-precision measurements of small up-and-down shifts of the bell-shaped resonance curve.

The absorbers were seven iron films isotopically enhanced in 57Fe, applied directly to the faces of seven sodium-iodide scintillation detectors (bottom of figure p). When a gamma-ray impinges on the absorbers, a number of different things can happen, of which we can get away with considering only the following: (a) the gamma-ray is resonantly absorbed in one of the 57Fe absorbers, after which the excited nucleus decays by re-emission of another such photon (or a conversion electron), in a random direction; (b) the gamma-ray passes through the absorber and then produces ionization directly in the sodium iodide crystal. In case b, the gamma-ray is detected. In case a, there is a 50% probability that the re-emitted photon will come out in the upward direction, so that it cannot be detected. Thus when the conditions are right for resonance, a reduction in count rate is expected. The Mössbauer effect never occurs with 100% probability; in this experiment, about a third of the gammas incident on the absorbers were resonantly absorbed.

The choice of y=22 m was dictated mainly by systematic errors. The experiment was limited by the strength of the gamma-ray source. For a source of a fixed strength, the count rate in the detector at a distance y would be proportional to y-2, leading to statistical errors proportional to <math xmlns=""> <mrow><mn>1</mn><mo lspace="0" rspace="0" stretchy="false">/</mo><msqrt><mtext>count rate</mtext></msqrt><mo lspace="0.278em" rspace="0.278em">∝</mo><mi>y</mi></mrow> </math>. Since the effect to be measured is also proportional to y, the signal-to-noise ratio was independent of y. However, systematic effects such as (4) were easier to monitor and account for when y was fairly large. A lab building at Harvard happened to have a 22-meter tower, which was used for the experiment. To reduce the absorption of the gammas in the 22 meters of air, a long, cylindrical mylar bag full of helium gas was placed in the shaft, p.

The resonance was a bell-shaped curve with a minimum at the natural frequency of emission. Since the curve was at a minimum, where its derivative was zero, the sensitivity of the count rate to the gravitational shift would have been nearly zero if the source had been stationary. Therefore it was necessary to vibrate the source up and down, so that the emitted photons would be Doppler shifted onto the shoulders of the resonance curve, where the slope of the curve was large. The resulting asymmetry in count rates is shown in figure r. A further effort to cancel out possible systematic effects was made by frequently swapping the source and absorber between the top and bottom of the tower.

For y=22.6 m, the equivalence principle predicts a fractional frequency shift due to gravity of 2.46×10-15. Pound and Rebka measured the shift to be (2.56± 0.25)×10-15. The results were in statistical agreement with theory, and verified the predicted size of the effect to a precision of 10%.

Homework Problems

\begin{homeworkforcelabel}{acceleration-of-gravity}{1}{}{1} In classical mechanics, one hears the term “the acceleration of gravity,” which doesn't literally make sense, since it is objects that accelerate. Explain why this term's usefulness is dependent on the equivalence principle. \end{homeworkforcelabel}

\begin{homeworkforcelabel}{space-probe-redshift}{1}{}{2} The New Horizons space probe communicates with the earth using microwaves with a frequency of about 10 GHz. Estimate the sizes of the following frequency shifts in this signal, when the probe flies by Pluto in 2015, at a velocity of ∼10 A.U./year: (a) the Doppler shift due to the probe's velocity; (b) the Doppler shift due to the Earth's orbital velocity; (c) the gravitational Doppler shift. \end{homeworkforcelabel}

\begin{homeworkforcelabel}{ordered-geom-finite-models}{1}{}{3} Euclid's axioms E1-E5 (p. 18) do not suffice to prove that there are an infinite number of points in the plane, and therefore they need to be supplemented by an extra axiom that states this (unless one finds the nonstandard realizations with finitely many points to be interesting enough to study for their own sake). Prove that the axioms of ordered geometry O1-O4 on p. 19 do not have this problem. (solution in the pdf version of the book) \end{homeworkforcelabel}

\begin{homeworkforcelabel}{have-spacesuit}{1}{}{4} In the science fiction novel Have Spacesuit --- Will Travel, by Robert Heinlein, Kip, a high school student, answers a radio distress call, encounters a flying saucer, and is knocked out and kidnapped by aliens. When he wakes up, he finds himself in a locked cell with a young girl named Peewee. Peewee claims they're aboard an accelerating spaceship. “If this was a spaceship,” Kip thinks. “The floor felt as solid as concrete and motionless.”

The equivalence principle can be stated in a variety of ways. On p. 21, I stated it as (1) gravitational and inertial mass are always proportional to one another. An alternative formulation (p. 32) is (2) that Kip has no way, by experiments or observarions inside his sealed prison cell, to determine whether he's in an accelerating spaceship or on the surface of a planet, experiencing its gravitational field.

(a) Show that any violation of statement 1 also leads to a violation of statement 2. (b) If we'd intended to construct a geometrical theory of gravity roughly along the lines of axioms O1-O4 on p. 19, which axiom is violated in this scenario? (solution in the pdf version of the book) \end{homeworkforcelabel}

\begin{homeworkforcelabel}{tossed-clock}{1}{}{5} Clock A sits on a desk. Clock B is tossed up in the air from the same height as the desk and then comes back down. Compare the elapsed times. \hwhint{hint:\currenthwlabel} (solution in the pdf version of the book) \end{homeworkforcelabel}

\begin{homeworkforcelabel}{clock-at-center-of-earth}{1}{}{6} (a) Find the difference in rate between a clock at the center of the earth and a clock at the south pole. (b) When an antenna on earth receives a radio signal from a space probe that is in a hyperbolic orbit in the outer solar system, the signal will show both a kinematic red-shift and a gravitational blueshift. Compare the orders of magnitude of these two effects. (solution in the pdf version of the book) \end{homeworkforcelabel}

\begin{homeworkforcelabel}{ep-charge}{1}{}{7} Consider the following physical situations: (1) a charged object lies on a desk on the planet earth; (2) a charged object orbits the earth; (3) a charged object is released above the earth's surface and dropped straight down; (4) a charged object is subjected to a constant acceleration by a rocket engine in outer space. In each case, we want to know whether the charge radiates. Analyze the physics in each case (a) based on conservation of energy; (b) by determining whether the object's motion is inertial in the sense intended by Isaac Newton; (c) using the most straightforward interpretation of the equivalence principle (i.e., not worrying about the issues discussed on p. that surround the ambiguous definition of locality). (solution in the pdf version of the book) \end{homeworkforcelabel}

\begin{homeworkforcelabel}{dewitt-estimate}{1}{}{8} Consider the physical situation depicted in figure l, p. 30. Let ag be the gravitational acceleration and ar the acceleration of the charged particle due to radiation. Then ar/ag measures the violation of the equivalence principle. The goal of this problem is to make an order-of-magnitude estimate of this ratio in the case of a neutron and a proton in low earth orbit.
(a) Let m the mass of each particle, and q the charge of the charged particle. Without doing a full calculation like the ones by the DeWitts and Gr\o{}n and Næss, use general ideas about the frequency-scaling of radiation (see section 9.2.5, p. 340) to find the proportionality that gives the dependence of ar/ag on q, m, and any convenient parameters of the orbit.
(b) Based on considerations of units, insert the necessary universal constants into your answer from part a.
(c) The result from part b will still be off by some unitless factor, but we expect this to be of order unity. Under this assumption, make an order-of-magnitude estimate of the violation of the equivalence principle in the case of a neutron and a proton in low earth orbit.
(solution in the pdf version of the book) \end{homeworkforcelabel}

(c) 1998-2013 Benjamin Crowell, licensed under the Creative Commons Attribution-ShareAlike license. Photo credits are given at the end of the Adobe Acrobat version.

[1] The possibility of having time come back again to the same point is often referred to by physicists as a closed timelike curve (CTC). Kip Thorne, in his popularization Black Holes and Time Warps, recalls experiencing some anxiety after publishing a paper with “Time Machines” in the title, and later being embarrassed when a later paper on the topic was picked up by the National Enquirer with the headline PHYSICISTS PROVE TIME MACHINES EXIST. “CTC” is safer because nobody but physicists know what it means.
[2] This point is revisited in section 6.1.
[3] Hafele and Keating, Science, 177 (1972), 168
[4] These differences in velocity are not simply something that can be eliminated by choosing a different frame of reference, because the clocks' motion isn't in a straight line. The clocks back in Washington, for example, have a certain acceleration toward the earth's axis, which is different from the accelerations experienced by the traveling clocks.
[5] Vessot at al., Physical Review Letters 45 (1980) 2081
[6] These axioms are summarized for quick reference in the back of the book on page .
[7] This is a form known as Playfair's axiom, rather than the version of the postulate originally given by Euclid.
[8] The axioms are summarized for convenient reference in the back of the book on page . This is meant to be an informal, readable summary of the system, pitched to the same level of looseness as Euclid's E1-E5. Modern mathematicians have found that systems like these actually need quite a bit more technical machinery to be perfectly rigorous, so if you look up an axiomatization of ordered geometry, or a modern axiomatization of Euclidean geometry, you'll typically find a much more lengthy list of axioms than the ones presented here. The axioms I'm omitting take care of details like making sure that there are more than two points in the universe, and that curves can't cut through one another without intersecting. The classic, beautifully written book on these topics is H.S.M. Coxeter's Introduction to Geometry, which is “introductory” in the sense that it's the kind of book a college math major might use in a first upper-division course in geometry.
[9] The reason for the restriction to small objects is essentially gravitational radiation. The object should also be electrically neutral, and neither the object nor the surrounding spacetime should contain any exotic forms of negative energy. This is discussed in more detail on p. 280. See also problem 1 on p. 344.
[10] Radio waves in the HF band tend to be trapped between the ground and the ionosphere, causing them to curve over the horizon, allowing long-distance communication.
[11] For more justification of this statement, see ch. 9, problem 1, on page 344.
[12] Regarding infinitesimals, see p. 93.
[13] V.B. Braginskii and V.I. Panov, Soviet Physics JETP 34, 463 (1972).
[14] This statement of the equivalence principle is summarized, along with some other forms of it to be encountered later, in the back of the book on page .
[15] paraphrased from a translation by Motte, 1729
[16] This statement of the equivalence principle is summarized, along with some other forms of it, in the back of the book on page .
[17] Einstein, “The Foundation of the General Theory of Relativity,” 1916. An excerpt is given on p. 360.
[18] Two that I believe were relatively influential are Born's 1920 Einstein's Theory of Relativity and Eddington's 1924 The Mathematical Theory of Relativity. Born follows Einstein's “Foundation” paper slavishly. Eddington seems only to mention inertial frames in a few places where the context is Newtonian.
[19] Misner, Thorne, and Wheeler, Gravitation, 1973, p. 18
[20] ibid, p. 19
[21] Eddington, op. cit.
[22] Misner, Thorne, and Wheeler, op. cit., pp.163-164. Penrose, The Road to Reality, 2004, p. 422. Taylor and Wheeler, Spacetime Physics, 1992, p. 132. Schutz, A First Course in General Relativity, 2009, pp. 3, 141. Hobson, General Relativity: An Introduction for Physicists, 2005, sec. 1.14.
[24] A good recent discussion of this is “Theory of gravitation theories: a no-progress report,” Sotiriou, Faraoni, and Liberati,
[26] The first detailed calculation appears to have been by Cécile and Bryce DeWitt, “Falling Charges,” Physics 1 (1964) 3. This paper is unfortunately very difficult to obtain now. A more recent treatment by Gr\o{}n and Næss is accessible at A full exposition of the techniques is given by Poisson, “The Motion of Point Particles in Curved Spacetime,”
[27] Because relativity describes gravitational fields in terms of curvature of spacetime, the Euclidean relationship between the radius and circumference of a circle fails here. The r coordinate should be understood here not as the radius measured from the center but as the circumference divided by 2π.
[28] Problem 4 on p. 38 verifies, in one specific example, that this way of stating the equivalence principle is implied by the one on p. 21.
[29] Phys. Rev. Lett. 4 (1960) 337