This is a modern-English version of The A B C of Relativity, originally written by Russell, Bertrand. It has been thoroughly updated, including changes to sentence structure, words, spelling, and grammar—to ensure clarity for contemporary readers, while preserving the original spirit and nuance. If you click on a paragraph, you will see the original text that we modified, and you can toggle between the two versions.

Scroll to the bottom of this page and you will find a free ePUB download link for this book.


HARPER’S MODERN SCIENCE SERIES

HARPER'S MODERN SCIENCE SERIES


THE A B C OF RELATIVITY

BY

BY

BERTRAND RUSSELL

BERTRAND RUSSELL

AUTHOR OF
“THE PRINCIPLES OF MATHEMATICS”
“PROPOSED ROADS TO FREEDOM”
AND “WHY MEN FIGHT”

AUTHOR OF
“THE PRINCIPLES OF MATHEMATICS”
“PROPOSED ROADS TO FREEDOM”
AND “WHY MEN FIGHT”

PUBLISHERS

PUBLISHERS

HARPER & BROTHERS
NEW YORK AND LONDON

HARPER & BROTHERS
NEW YORK AND LONDON

THE A B C OF RELATIVITY

THE A B C OF RELATIVITY

Copyright, 1925, by Harper & Brothers
Printed in the United States of America

Copyright, 1925, by Harper & Brothers
Printed in the USA


Contents

**Contents**

CHAPTER PAGE
I. TOUCH AND SIGHT: THE EARTH AND THE HEAVENS    1
II. WHAT HAPPENS AND WHAT IS OBSERVED 14
III. THE VELOCITY OF LIGHT 28
IV. CLOCKS AND FOOT RULES 43
V. SPACE-TIME 58
VI. THE SPECIAL THEORY OF RELATIVITY 71
VII. INTERVALS IN SPACE-TIME 91
VIII. EINSTEIN’S LAW OF GRAVITATION 111
IX. PROOFS OF EINSTEIN’S LAW OF GRAVITATION 131
X. MASS, MOMENTUM, ENERGY AND ACTION 144
XI. IS THE UNIVERSE FINITE? 163
XII. CONVENTIONS AND NATURAL LAWS 177
XIII. THE ABOLITION OF “FORCE” 192
XIV. WHAT IS MATTER? 206
XV. PHILOSOPHICAL CONSEQUENCES 219

THE A B C OF RELATIVITY

The Basics of Relativity


[Pg 1]

[Pg 1]

CHAPTER ONE:
TOUCH AND SIGHT:
THE EARTH AND THE HEAVENS

Everybody knows that Einstein has done something astonishing, but very few people know exactly what it is that he has done. It is generally recognized that he has revolutionized our conception of the physical world, but his new conceptions are wrapped up in mathematical technicalities. It is true that there are innumerable popular accounts of the theory of relativity, but they generally cease to be intelligible just at the point where they begin to say something important. The authors are hardly to blame for this. Many of the new ideas can be expressed in non-mathematical language, but they are none the less difficult on that account. What is demanded is a change in our imaginative picture of the [Pg 2] world—a picture which has been handed down from remote, perhaps pre-human, ancestors, and has been learned by each one of us in early childhood. A change in our imagination is always difficult, especially when we are no longer young. The same sort of change was demanded by Copernicus, when he taught that the earth is not stationary and the heavens do not revolve about it once a day. To us now there is no difficulty in this idea, because we learned it before our mental habits had become fixed. Einstein’s ideas, similarly, will seem easy to a generation which has grown up with them; but for our generation a certain effort of imaginative reconstruction is unavoidable.

Everybody knows that Einstein has achieved something remarkable, but very few people understand exactly what it is. It's widely acknowledged that he has transformed our understanding of the physical world, but his new ideas are tied up in complicated math. While there are countless popular explanations of the theory of relativity, they often become confusing right when they start to discuss something significant. The authors aren't entirely at fault for this. Many of the new concepts can be explained without math, but they still remain challenging for that reason. What’s needed is a shift in our mental image of the [Pg 2] world—a perspective that has been passed down from ancient, possibly pre-human, ancestors and learned by each of us in early childhood. Changing our imagination is always tough, especially as we get older. A similar shift was required by Copernicus, who showed that the earth isn't fixed in place and the heavens don’t revolve around it daily. To us now, this concept is easy because we learned it before our thinking patterns were set. Einstein's ideas will probably seem straightforward to a generation that has grown up with them; however, for our generation, a certain effort in reimagining is unavoidable.

In exploring the surface of the earth, we make use of all our senses, more particularly of the senses of touch and sight. In measuring lengths, parts of the human body are employed in pre-scientific ages: a “foot,” a “cubit,” a “span” are defined in this way. For longer distances, we think of the time it takes to walk from one place to another. We gradually learn to judge distances roughly by the eye, but we rely upon touch for accuracy. Moreover it is touch that gives us our sense of “reality.” Some things cannot be touched: rainbows, reflections in looking-glasses, and so on. These things [Pg 3] puzzle children, whose metaphysical speculations are arrested by the information that what is in the looking glass is not “real.” Macbeth’s dagger was unreal because it was not “sensible to feeling as to sight.” Not only our geometry and physics, but our whole conception of what exists outside us, is based upon the sense of touch. We carry this even into our metaphors: a good speech is “solid,“ a bad speech is “gas,” because we feel that a gas is not quite “real.”

In exploring the surface of the earth, we use all our senses, especially touch and sight. In the pre-scientific ages, we measured lengths using parts of the human body: a “foot,” a “cubit,” a “span” were defined this way. For longer distances, we think about how long it takes to walk from one place to another. We gradually learn to estimate distances roughly with our eyes, but we rely on touch for precision. Moreover, touch gives us our sense of “reality.” Some things can’t be touched: rainbows, reflections in mirrors, and so on. These things confuse children, whose philosophical questions are halted by the idea that what’s in the mirror isn’t “real.” Macbeth’s dagger was unreal because it couldn’t be “felt in the same way as it is seen.” Not only our geometry and physics, but our entire understanding of what exists outside us is rooted in the sense of touch. We even carry this into our metaphors: a good speech is “solid,” and a bad speech is “gas,” because we feel that gas isn’t quite “real.” [Pg 3]

In studying the heavens, we are debarred from all senses except sight. We cannot touch the sun, or travel to it; we cannot walk round the moon, or apply a foot rule to the Pleiades. Nevertheless, astronomers have unhesitatingly applied the geometry and physics which they found serviceable on the surface of the earth, and which they had based upon touch and travel. In doing so, they brought down trouble on their heads, which it has been left for Einstein to clear up. It has turned out that much of what we learned from the sense of touch was unscientific prejudice, which must be rejected if we are to have a true picture of the world.

In studying the universe, we can only rely on our sight. We can’t touch the sun or travel to it; we can’t walk around the moon or measure the Pleiades with a ruler. Still, astronomers confidently used the geometry and physics that worked on Earth, which were based on touch and movement. In doing so, they created confusion that Einstein had to address. It turned out that much of what we learned through touch was just unscientific bias that needs to be discarded if we want an accurate view of the world.

An illustration may help us to understand how much is impossible to the astronomer as compared to the man who is interested in things on [Pg 4] the surface of the earth. Let us suppose that a drug is administered to you which makes you temporarily unconscious, and that when you wake you have lost your memory but not your reasoning powers. Let us suppose further that while you were unconscious you were carried into a balloon, which, when you come to, is sailing with the wind in a dark night—the night of the fifth of November if you are in England, or of the fourth of July if you are in America. You can see fireworks which are being sent off from the ground, from trains, and from aeroplanes traveling in all directions, but you cannot see the ground or the trains or the aeroplanes be cause of the darkness. What sort of picture of the world will you form? You will think that nothing is permanent: there are only brief flashes of light, which, during their short existence, travel through the void in the most various and bizarre curves. You cannot touch these flashes of light, you can only see them. Obviously your geometry and your physics and your metaphysics will be quite different from those of ordinary mortals. If an ordinary mortal [Pg 5] is with you in the balloon, you will find his speech unintelligible. But if Einstein is with you, you will understand him more easily than the ordinary mortal would, because you will be free from a host of preconceptions which prevent most people from understanding him.

An illustration might help us grasp how much is beyond the reach of the astronomer compared to someone focused on what happens on the surface of the Earth. Imagine you’re given a drug that makes you temporarily unconscious, and when you wake up, you’ve lost your memory but not your ability to reason. Now, let’s say while you were out, you were taken up in a balloon, which is now floating with the wind on a dark night—the night of November 5th if you’re in England, or July 4th if you’re in America. You can see fireworks being set off from the ground, by trains, and from airplanes traveling in all directions, but you can’t see the ground, the trains, or the airplanes because of the darkness. What kind of picture of the world will you create? You’ll believe that nothing is permanent: there are just brief flashes of light that, during their short existence, move through the void in various and strange curves. You can’t touch these flashes of light; you can only see them. Clearly, your understanding of geometry, physics, and metaphysics will be vastly different from that of ordinary people. If an ordinary person is with you in the balloon, you’ll find their speech confusing. But if Einstein is with you, you’ll grasp him more easily than the ordinary person would, because you’ll be free from the many preconceived notions that make it hard for most people to understand him.

The theory of relativity depends, to a considerable extent, upon getting rid of notions which are useful in ordinary life but not to our drugged balloonist. Circumstances on the surface of the earth, for various more or less accidental reasons, suggest conceptions which turn out to be inaccurate, although they have come to seem like necessities of thought. The most important of these circumstances is that most objects on the earth’s surface are fairly persistent and nearly stationary from a terrestrial point of view. If this were not the case, the idea of going a journey would not seem so definite as it does. If you want to travel from King’s Cross to Edinburgh, you know that you will find King’s Cross where it always has been, that the railway line will take the course that it did when you last made the journey, and that Waverley Station in Edinburgh will not have walked up to the Castle. You therefore say and think that you have traveled to [Pg 6] Edinburgh, not that Edinburgh has traveled to you, though the latter statement would be just as accurate. The success of this common sense point of view depends upon a number of things which are really of the nature of luck. Suppose all the houses in London were perpetually moving about, like a swarm of bees; suppose railways moved and changed their shapes like avalanches; and finally suppose that material objects were perpetually being formed and dissolved like clouds. There is nothing impossible in these suppositions: something like them must have been verified when the earth was hotter than it is now. But obviously what we call a journey to Edinburgh would have no meaning in such a world. You would begin, no doubt, by asking the taxi-driver: “Where is King’s Cross this morning?“ At the station you would have to ask a similar question about Edinburgh, but the booking-office clerk would reply: “What part of Edinburgh do you mean, Sir? Prince’s Street has gone to Glasgow, the Castle has moved up into the Highlands, and Waverley Station is under water in the middle of the Firth of Forth.” And on the journey the stations would not be staying quiet, but some [Pg 7] would be travelling north, some south, some east or west, perhaps much faster than the train. Under these conditions you could not say where you were at any moment. Indeed the whole notion that one is always in some definite “place” is due to the fortunate immovability of most of the large objects on the earth’s surface. The idea of “place” is only a rough practical approximation: there is nothing logically necessary about it, and it cannot be made precise.

The theory of relativity largely relies on letting go of ideas that are handy in everyday life but not applicable to our drugged balloonist. The conditions on the Earth's surface, for various accidental reasons, lead us to concepts that turn out to be wrong, even though they seem essential to our thinking. The most crucial of these conditions is that most objects on the Earth's surface are pretty stable and nearly stationary from a local viewpoint. If this weren't true, the concept of taking a trip wouldn't feel as concrete as it does. If you're traveling from King’s Cross to Edinburgh, you know that King’s Cross will be where it always has been, the railway will follow the same path as it did on your last trip, and Waverley Station in Edinburgh hasn't moved up to the Castle. So you say you've traveled to Edinburgh, not that Edinburgh has come to you, although the latter statement is just as true. This common-sense perspective relies on several factors that are essentially a matter of luck. Imagine if all the buildings in London were constantly moving around like a swarm of bees; imagine if railways shifted and transformed like avalanches; and finally, think about if material objects were endlessly appearing and disappearing like clouds. None of these scenarios are impossible: something similar must have happened when the Earth was hotter than it is now. But clearly, the idea of a journey to Edinburgh would be meaningless in such a world. You would likely start by asking the taxi driver, “Where is King’s Cross this morning?” At the station, you'd need to ask a similar question about Edinburgh, but the ticket clerk would respond, “Which part of Edinburgh are you referring to, Sir? Prince’s Street has moved to Glasgow, the Castle has shifted up into the Highlands, and Waverley Station is underwater in the middle of the Firth of Forth.” During the trip, the stations wouldn't stay still, with some heading north, some south, some east or west, perhaps much faster than the train. In such conditions, you couldn't pinpoint where you were at any moment. In fact, the entire idea that one is always in some specific “place” stems from the fortunate stability of most large objects on the Earth's surface. The concept of “place” is merely a rough practical estimate: there's nothing logically necessary about it, and it can't be precisely defined.

If we were not much larger than an electron, we should not have this impression of stability, which is only due to the grossness of our senses. King’s Cross, which to us looks solid, would be too vast to be conceived except by a few eccentric mathematicians. The bits of it that we could see would consist of little tiny points of matter, never coming into contact with each other, but perpetually whizzing round each other in an inconceivably rapid ballet-dance. The world of our experience would be quite as mad as the one in which the different parts of Edinburgh go for walks in different directions. If—to take the opposite extreme—you were as large as the sun and lived as long, with a corresponding slowness of perception, you would again find a [Pg 8] higgledy-piggledy universe without permanence—stars and planets would come and go like morning mists, and nothing would remain in a fixed position relatively to anything else. The notion of comparative stability which forms part of our ordinary outlook is thus due to the fact that we are about the size we are, and live on a planet of which the surface is no longer very hot. If this were not the case, we should not find pre-relativity physics intellectually satisfying. Indeed, we should never have invented such theories. We should have had to arrive at relativity physics at one bound, or remain ignorant of scientific laws. It is fortunate for us that we were not faced with this alternative, since it is almost inconceivable that one man could have done the work of Euclid, Galileo, Newton, and Einstein. Yet without such an incredible genius physics could hardly have been discovered in a world where the universal flux was obvious to non-scientific observation.

If we weren’t much larger than an electron, we wouldn’t have this sense of stability, which is really just a result of our senses being coarse. King’s Cross, which seems solid to us, would be way too big to be understood by anyone but a few strange mathematicians. The parts we could see would consist of tiny bits of matter, never touching each other but constantly zooming around in an unimaginably fast dance. Our experience of the world would be as chaotic as if different parts of Edinburgh were walking in different directions. On the other hand, if you were as large as the sun and lived for that long, with perceptions moving at a much slower pace, you would again find a disorganized universe without permanence—stars and planets would appear and disappear like morning fog, with nothing staying in a fixed position relative to anything else. The idea of relative stability that shapes our normal view of the world comes from the fact that we are about our current size and live on a planet that’s no longer extremely hot. If that weren’t true, we wouldn’t find pre-relativity physics intellectually satisfying. In fact, we likely wouldn’t have even come up with those theories. We would have had to jump straight to relativity physics or remain unaware of scientific laws. It’s lucky for us that we weren’t stuck with that choice, as it’s hard to imagine that one person could have done the work of Euclid, Galileo, Newton, and Einstein. Yet without such incredible genius, physics might hardly have been discovered in a world where constant change was clear to everyone, even those who weren’t scientists.

In astronomy, although the sun, moon, and stars continue to exist year after year, yet in other respects the world we have to deal with is very different from that of everyday life. As already observed, we depend exclusively on sight: the heavenly bodies cannot be touched, [Pg 9] heard, smelt or tasted. Everything in the heavens is moving relatively to everything else. The earth is going round the sun, the sun is moving, very much faster than an express train, towards a point in the constellation “Hercules,” the “fixed” stars are scurrying hither and thither like a lot of frightened hens. There are no well-marked places in the sky, like King’s Cross and Edinburgh. When you travel from place to place on the earth, you say the train moves and not the stations, because the stations preserve their topographical relations to each other and the surrounding country. But in astronomy it is arbitrary which you call the train and which the station: the question is to be decided purely by convenience and as a matter of convention.

In astronomy, even though the sun, moon, and stars keep showing up year after year, the world we’re dealing with is actually quite different from our everyday experience. As mentioned earlier, we rely solely on sight: we can’t touch, hear, smell, or taste the celestial bodies. Everything in the sky is moving in relation to everything else. The earth is orbiting the sun, the sun is racing, much faster than a high-speed train, toward a point in the constellation “Hercules,” and the so-called "fixed" stars are darting around like a bunch of startled chickens. There aren’t any clearly defined locations in the sky, like King’s Cross or Edinburgh. When you travel from one place to another on earth, you say the train is moving, not the stations, because the stations maintain their geographical positions relative to each other and the surrounding land. But in astronomy, it’s arbitrary which one you call the train and which one the station: it’s all based on convenience and convention. [Pg 9]

In this respect, it is interesting to contrast Einstein and Copernicus. Before Copernicus, people thought that the earth stood still and the heavens revolved about it once a day. Copernicus taught that “really” the earth rotates once a day, and the daily revolution of sun and stars is only “apparent.” Galileo and Newton endorsed this view, and many things were thought to prove it—for example, the flattening of the [Pg 10] earth at the poles, and the fact that bodies are heavier there than at the equator. But in the modern theory the question between Copernicus and his predecessors is merely one of convenience; all motion is relative, and there is no difference between the two statements: “the earth rotates once a day” and “the heavens revolve about the earth once a day.” The two mean exactly the same thing, just as it means the same thing if I say that a certain length is six feet or two yards. Astronomy is easier if we take the sun as fixed than if we take the earth, just as accounts are easier in a decimal coinage. But to say more for Copernicus is to assume absolute motion, which is a fiction. All motion is relative, and it is a mere convention to take one body as at rest. All such conventions are equally legitimate, though not all are equally convenient.

In this regard, it’s interesting to compare Einstein and Copernicus. Before Copernicus, people believed that the Earth was stationary and that the heavens revolved around it once a day. Copernicus explained that, in reality, the Earth rotates once a day, and the daily movement of the sun and stars is only “apparent.” Galileo and Newton supported this idea, and various observations were thought to validate it—for instance, the Earth's flattening at the poles and the fact that objects weigh more there than at the equator. However, in modern theory, the debate between Copernicus and his predecessors is simply a matter of convenience; all motion is relative, and there’s no difference between the statements: “the Earth rotates once a day” and “the heavens revolve around the Earth once a day.” They convey exactly the same meaning, just like saying a certain length is six feet or two yards. Astronomy is simpler when we consider the sun as fixed rather than the Earth, much like how accounting is easier in a decimal system. But to advocate more for Copernicus implies assuming absolute motion, which is a fabrication. All motion is relative, and it’s just a convention to designate one body as at rest. All such conventions are equally valid, though not all are equally practical.

There is another matter of great importance, in which astronomy differs from terrestrial physics because of its exclusive dependence upon sight. Both popular thought and old-fashioned physics used the notion of “force,” which seemed intelligible because it was associated with familiar sensations. When we are walking, we have sensations connected with our muscles which we do not have when we are sitting still. In the days before mechanical traction, although people could [Pg 11] travel by sitting in their carriages, they could see the horses exerting themselves and evidently putting out “force” in the same way as human beings do. Everybody knew from experience what it is to push or pull, or to be pushed or pulled. These very familiar facts made “force” seem a natural basis for dynamics. But Newton’s law of gravitation introduced a difficulty. The force between two billiard balls appeared intelligible, because we know what it feels like to bump into another person; but the force between the earth and the sun, which are ninety-three million miles apart, was mysterious. Newton himself regarded this “action at a distance” as impossible, and believed that there was some hitherto undiscovered mechanism by which the sun’s influence was transmitted to the planets. However, no such mechanism was discovered, and gravitation remained a puzzle. The fact is that the whole conception of “force” is a mistake. The sun does not exert any force on the planets; in Einstein’s law of gravitation, the planet only pays attention to what it finds in its own neighborhood. The way in which this works will be explained in a later chapter; for the present [Pg 12] we are only concerned with the necessity of abandoning the notion of “force,” which was due to misleading conceptions derived from the sense of touch.

There’s another important point where astronomy differs from Earth-based physics because it totally relies on sight. Both common understanding and traditional physics used the idea of “force,” which seemed reasonable since it was tied to familiar sensations. When we walk, we feel sensations in our muscles that we don’t feel when we’re sitting still. Before mechanical transportation, even though people could travel by sitting in their carriages, they could see the horses working hard and clearly exerting “force” just like humans do. Everyone knew from experience what it felt like to push or pull, or to be pushed or pulled. These well-known facts made “force” feel like a natural foundation for dynamics. But Newton’s law of gravitation created a problem. The force between two billiard balls is easy to understand because we know what it feels like to bump into someone; however, the force between the Earth and the Sun, which are ninety-three million miles apart, was a mystery. Newton himself thought this “action at a distance” was impossible and believed there must be some undiscovered mechanism that transmitted the Sun’s influence to the planets. Yet, no such mechanism was found, and gravitation remained puzzling. The truth is that the entire idea of “force” is flawed. The Sun doesn’t exert any force on the planets; in Einstein’s law of gravitation, a planet only reacts to what it finds nearby. How this works will be explained in a later chapter; for now, we only need to focus on the need to let go of the idea of “force,” which stemmed from misleading perceptions based on our sense of touch.

As physics has advanced, it has appeared more and more that sight is less misleading than touch as a source of fundamental notions about matter. The apparent simplicity in the collision of billiard balls is quite illusory. As a matter of fact, the two billiard balls never touch at all; what really happens is inconceivably complicated, but is more analogous to what happens when a comet penetrates the solar system and goes away again than to what common sense supposes to happen.

As physics has advanced, it has become increasingly clear that sight is less misleading than touch when it comes to understanding the basics of matter. The simplicity of billiard balls colliding is quite deceptive. In reality, the two billiard balls never actually touch; what really occurs is incredibly complicated and is more similar to what happens when a comet enters the solar system and then leaves again than to what we typically think happens.

Most of what we have said hitherto was already recognized by physicists before Einstein invented the theory of relativity. “Force” was known to be merely a mathematical fiction, and it was generally held that motion is a merely relative phenomenon—that is to say, when two bodies are changing their relative position, we cannot say that one is moving while the other is at rest, since the occurrence is merely a change in their relation to each other. But a great labor was required in order to bring the actual procedure of physics into harmony with [Pg 13] these new convictions. Newton believed in force and in absolute space and time; he embodied these beliefs in his technical methods, and his methods remained those of later physicists. Einstein invented a new technique, free from Newton’s assumptions. But in order to do so he had to change fundamentally the old ideas of space and time, which had been unchallenged from time immemorial. This is what makes both the difficulty and the interest of his theory. But before explaining it there are some preliminaries which are indispensable. These will occupy the next two chapters.

Most of what we’ve discussed so far was already known by physicists before Einstein developed the theory of relativity. “Force” was understood to be just a mathematical concept, and it was generally accepted that motion is solely a relative phenomenon—that is to say, when two objects are changing their relative positions, we can't say that one is moving while the other is at rest, since it’s really just a change in their relationship to each other. However, a lot of effort was needed to align the actual practices of physics with these new beliefs. Newton believed in force and in absolute space and time; he integrated these beliefs into his technical methods, and his methods were used by later physicists. Einstein created a new approach that was free from Newton’s assumptions. But to do this, he had to fundamentally change the traditional concepts of space and time, which had been accepted for ages. This is what makes his theory both challenging and intriguing. Before explaining it, though, there are some necessary preliminaries. These will be covered in the next two chapters.


[Pg 14]

[Pg 14]

CHAPTER II:
WHAT HAPPENS AND
WHAT IS OBSERVED

A certain type of superior person is fond of asserting that “everything is relative.” This is, of course, nonsense, because, if everything were relative, there would be nothing for it to be relative to. However, without falling into metaphysical absurdities it is possible to maintain that everything in the physical world is relative to an observer. This view, true or not, is not that adopted by the “theory of relativity.” Perhaps the name is unfortunate; certainly it has led philosophers and uneducated people into confusions. They imagine that the new theory proves everything in the physical world to be relative, whereas, on the contrary, it is wholly concerned to exclude what is relative and arrive at a statement of physical laws that shall in no way depend upon the circumstances of the observer. It is true that these circumstances have been found to have more effect [Pg 15] upon what appears to the observer than they were formerly thought to have, but at the same time Einstein showed how to discount this effect completely. This was the source of almost everything that is surprising in his theory.

A specific type of superior person likes to claim that “everything is relative.” This is, of course, nonsense, because if everything were relative, there would be nothing for it to be relative to. However, without getting lost in metaphysical nonsense, it’s possible to argue that everything in the physical world is relative to an observer. This view, whether true or not, is not the one taken by the “theory of relativity.” Perhaps the name is unfortunate; it has certainly led philosophers and less knowledgeable people into confusion. They think the new theory proves that everything in the physical world is relative, but on the contrary, it is entirely focused on excluding what is relative and arriving at physical laws that do not depend on the observer’s circumstances. It is true that these circumstances have been found to influence what the observer sees more than was previously thought, but at the same time, Einstein demonstrated how to completely account for this effect. This was the source of nearly everything surprising in his theory. [Pg 15]

When two observers perceive what is regarded as one occurrence, there are certain similarities, and also certain differences, between their perceptions. The differences are obscured by the requirements of daily life, because from a business point of view they are as a rule unimportant. But both psychology and physics, from their different angles, are compelled to emphasize the respects in which one man’s perception of a given occurrence differs from another man’s. Some of these differences are due to differences in the brains or minds of the observers, some to differences in their sense organs, some to differences of physical situation: these three kinds may be called respectively psychological, physiological, and physical. A remark made in a language we know will be heard, whereas an equally loud remark in an unknown language may pass entirely unnoticed. Of two men in the Alps, one will perceive the beauty of the scenery while the other will notice the waterfalls with a view to obtaining power from them. Such [Pg 16] differences are psychological. The difference between a long-sighted and a short-sighted man, or between a deaf man and a man who hears well, are physiological. Neither of these kinds concerns us, and I have mentioned them only in order to exclude them. The kind that concerns us is the purely physical kind. Physical differences between two observers will be preserved when the observers are replaced by cameras or phonographs, and can be reproduced on the movies or the gramophone. If two men both listen to a third man speaking, and one of them is nearer to the speaker than the other is, the nearer one will hear louder and slightly earlier sounds than are heard by the other. If two men both watch a tree falling, they see it from different angles. Both these differences would be shown equally by recording instruments: they are in no way due to idiosyncrasies in the observers, but are part of the ordinary course of physical nature as we experience it.

When two observers witness what is considered a single event, there are certain similarities as well as differences between their perceptions. The differences often get overlooked in daily life because they are usually unimportant from a practical standpoint. However, both psychology and physics, from their unique perspectives, must highlight how one person’s perception of an event can differ from another’s. Some of these differences arise from variations in the brains or minds of the observers, some from differences in their sensory organs, and some from their physical circumstances. We can categorize these three types as psychological, physiological, and physical. For example, a comment made in a familiar language will be noticed, while a similarly loud comment in an unfamiliar language may go completely unnoticed. Among two men in the Alps, one might appreciate the beauty of the landscape, while the other focuses on the waterfalls for potential power generation. These types of differences are psychological. Differences between a farsighted and a nearsighted person, or between a deaf individual and someone with normal hearing, are physiological. We’re not concerned with these types, and I mention them only to rule them out. What we are focused on is the purely physical type. The physical differences between two observers remain consistent even when they are replaced by cameras or phonographs, and can be captured in films or recordings. If two men listen to a third man speaking and one is closer to the speaker, the closer observer will hear the sound louder and slightly sooner than the other. Similarly, if two men are watching a tree fall, they will see it from different angles. Both of these differences would be equally captured by recording devices; they are not due to the individual quirks of the observers but are simply a part of the normal workings of the physical world as we experience it.

The physicist, like the plain man, believes that his perceptions give him knowledge about what is really occurring in the physical world, and not only about his private experiences. Professionally, he regards [Pg 17] the physical world as “real,” not merely as something which human beings dream. An eclipse of the sun, for instance, can be observed by any person who is suitably situated, and is also observed by the photographic plates that are exposed for the purpose. The physicist is persuaded that something has really happened over and above the experiences of those who have looked at the sun or at photographs of it. I have emphasized this point, which might seem a trifle obvious, because some people imagine that Einstein has made a difference in this respect. In fact he has made none.

The physicist, like the average person, believes that his observations provide him with knowledge about what is truly happening in the physical world, not just about his personal experiences. Professionally, he sees the physical world as “real,” rather than merely a product of human imagination. An eclipse of the sun, for example, can be witnessed by anyone in the right location and is also captured by photographic plates set up for that purpose. The physicist is convinced that something has genuinely occurred beyond the experiences of those who have looked at the sun or its photographs. I have highlighted this point, which may seem somewhat obvious, because some people think that Einstein has changed this perspective. In reality, he has not. [Pg 17]

But if the physicist is justified in this belief that a number of people can observe the “same” physical occurrence, then clearly the physicist must be concerned with those features which the occurrence has in common for all observers, for the others cannot be regarded as belonging to the occurrence itself. At least, the physicist must confine himself to the features which are common to all “equally good” observers. The observer who uses a microscope or a telescope is preferred to one who does not, because he sees all that the latter sees and more too. A sensitive photographic plate may “see” still more, and is then preferred to any eye. But such things as differences of [Pg 18] perspective, or differences of apparent size due to difference of distance, are obviously not attributable to the object; they belong solely to the point of view of the spectator. Common sense eliminates these in judging of objects; physics has to carry the same process much further, but the principle is the same.

But if the physicist believes that multiple people can observe the “same” physical event, then it's clear that the physicist must focus on the features that the event has in common for all observers, as the others can't be considered part of the event itself. At the very least, the physicist should limit themselves to the features that are shared by all “equally good” observers. An observer using a microscope or a telescope is favored over one who doesn’t, because they can see everything the latter sees and more. A sensitive photographic plate might "see" even more and is thus preferred to any eye. However, things like differences in perspective or variations in apparent size due to distance are clearly not due to the object; they belong solely to the observer's point of view. Common sense removes these when assessing objects; physics has to take this process much further, but the principle remains the same.

I want to make it clear that I am not concerned with anything that can be called inaccuracy. I am concerned with genuine physical differences between occurrences each of which is a correct record of a certain event, from its own point of view. When a man fires a gun, people who are not quite close to him see the flash before they hear the report. This is not due to any defect in their senses, but to the fact that sound travels more slowly than light. Light travels so fast that, from the point of view of phenomena on the surface of the earth, it may be regarded as instantaneous. Anything that we can see on the earth happens practically at the moment when we see it. In a second, light travels 300,000 kilometers (about 186,000 miles). It travels from the sun to the earth in about eight minutes, and from the stars to us in anything from three to a thousand years. But of course we [Pg 19] cannot place a clock in the sun, and send out a flash of light from it at 12 noon, Greenwich Mean Time, and have it received at Greenwich at 12.08 p.m. Our methods of estimating the speed of light have to be more or less indirect. The only direct method would be that which we apply to sound when we use an echo. We could send a flash to a mirror, and observe how long it took for the reflection to reach us; this would give the time of the double journey to the mirror and back. On the earth, however, the time would be so short that a great deal of theoretical physics has to be utilized if this method is to be employed—more even than is required for the employment of astronomical data.

I want to make it clear that I'm not worried about anything that can be called inaccurate. I'm focused on real physical differences between events, each of which accurately records a specific occurrence from its own perspective. When a person fires a gun, people who aren't too close to him see the flash before they hear the sound. This isn't because of any issue with their senses, but because sound travels slower than light. Light moves so fast that, from the perspective of things happening on the Earth's surface, it can be considered instantaneous. Anything we can see on Earth happens almost at the moment we see it. In one second, light travels 300,000 kilometers (about 186,000 miles). It takes about eight minutes for light to travel from the sun to the Earth, and from the stars to us, it can take anywhere from three to a thousand years. But of course, we can't set a clock on the sun, send out a flash of light from it at 12 noon, Greenwich Mean Time, and have it arrive in Greenwich at 12:08 p.m. Our methods for measuring the speed of light have to be somewhat indirect. The only direct method would be similar to how we use an echo for sound. We could send a flash to a mirror and time how long it takes for the reflection to return to us; this would give us the time for the round trip to the mirror and back. However, on Earth, the time would be so brief that a lot of theoretical physics has to come into play for this method to be effective—more so than what's needed for using astronomical data. [Pg 19]

The problem of allowing for the spectator’s point of view, we may be told, is one of which physics has at all times been fully aware; indeed it has dominated astronomy ever since the time of Copernicus. This is true. But principles are often acknowledged long before their full consequences are drawn. Much of traditional physics is incompatible [Pg 20] with the principle, in spite of the fact that it was acknowledged theoretically by all physicists.

The issue of considering the spectator's perspective has always been recognized in physics; in fact, it has influenced astronomy since the days of Copernicus. This is accurate. However, principles are often recognized long before their complete implications are understood. Much of traditional physics conflicts with this principle, even though it has been acknowledged theoretically by all physicists. [Pg 20]

There existed a set of rules which caused uneasiness to the philosophically minded, but were accepted by physicists because they worked in practice. Locke had distinguished “secondary” qualities—colors, noises, tastes, smells, etc.—as subjective, while allowing “primary” qualities—shapes and positions and sizes—to be genuine properties of physical objects. The physicist’s rules were such as would follow from this doctrine. Colors and noises were allowed to be subjective, but due to waves proceeding with a definite velocity—that of light or sound as the case may be—from their source to the eye or ear of the percipient. Apparent shapes vary according to the laws of perspective, but these laws are simple and make it easy to infer the “real” shapes from several visual apparent shapes; moreover, the “real” shapes can be ascertained by touch in the case of bodies in our neighborhood. The objective time of a physical occurrence can be inferred from the time when we perceive it by allowing for the velocity of transmission—of light or sound or nerve currents according to [Pg 21] circumstances. This was the view adopted by physicists in practice, whatever qualms they may have had in unprofessional moments.

There were a set of rules that made philosophers uneasy, but physicists accepted them because they worked in real life. Locke had identified “secondary” qualities—like colors, sounds, tastes, and smells—as subjective, while he considered “primary” qualities—such as shapes, positions, and sizes—as genuine properties of physical objects. The physicist's rules followed this idea. Colors and sounds were seen as subjective, but they come from waves traveling at a definite speed—whether that’s the speed of light or sound—from their source to the eye or ear of the observer. The shapes we see can change based on perspective, but these rules are straightforward, making it easy to deduce the “real” shapes from various visible shapes; additionally, we can determine the “real” shapes by touch for nearby objects. The objective timing of a physical event can be inferred from when we perceive it by accounting for the transmission speed—whether light, sound, or nerve impulses—depending on the situation. This perspective was adopted by physicists in practice, no matter any reservations they might have had in casual moments.

This view worked well enough until physicists became concerned with much greater velocities than those that are common on the surface of the earth. An express train travels about a mile in a minute; the planets travel a few miles in a second. Comets, when they are near the sun, travel much faster, and behave somewhat oddly; but they were puzzling in various ways. Practically, the planets were the most swiftly moving bodies to which dynamics could be adequately applied. With radio-activity a new range of observations became possible. Individual electrons can be observed, emanating from radium with a velocity not far short of that of light. The behavior of bodies moving with these enormous speeds is not what the old theories would lead us to expect. For one thing, mass seems to increase with speed in a perfectly definite manner. When an electron is moving very fast, a bigger force is required to have a given effect upon it than when it is moving slowly. Then reasons were found for thinking that the size [Pg 22] of a body is affected by its motion—for example, if you take a cube and move it very fast, it gets shorter in the direction of its motion, from the point of view of a person who is not moving with it, though from its own point of view (i.e. for an observer traveling with it) it remains just as it was. What was still more astonishing was the discovery that lapse of time depends on motion; that is to say, two perfectly accurate clocks, one of which is moving very fast relatively to the other, will not continue to show the same time if they come together again after a journey. It follows that what we discover by means of clocks and foot rules, which used to be regarded as the acme of impersonal science, is really in part dependent upon our private circumstances, i.e. upon the way in which we are moving relatively to the bodies measured.

This view worked fairly well until physicists started dealing with much higher speeds than those typical on the surface of the Earth. An express train moves about a mile in a minute; the planets travel a few miles in a second. Comets, when they're close to the sun, travel even faster and behave strangely, leaving scientists puzzled in various ways. Practically, the planets were the fastest objects to which dynamics could be effectively applied. With radioactivity, a new range of observations became possible. Individual electrons can be seen coming from radium at speeds close to that of light. The behavior of objects moving at these incredible speeds doesn't match what old theories would suggest. For one, an object's mass seems to increase with speed in a specific way. When an electron is moving very quickly, it requires a larger force to produce a certain effect compared to when it’s moving slowly. Moreover, it was discovered that the size of an object is influenced by its motion—for instance, if you take a cube and move it very fast, it appears shorter in the direction of motion to someone who isn't moving with it, while from its own perspective (i.e., for someone traveling with it), it remains unchanged. Even more surprising was the finding that the passage of time is affected by motion; that is, two perfectly accurate clocks, one of which is moving very fast compared to the other, won't show the same time when they come back together after a journey. This implies that what we determine using clocks and measuring tools, which were once considered the pinnacle of objective science, is actually partly based on our individual circumstances, i.e., the way we're moving in relation to the objects being measured.

This shows that we have to draw a different line from that which is customary in distinguishing between what belongs to the observer and what belongs to the occurrence which he is observing. If a man is wearing blue spectacles he knows that the blue look of everything is due to his spectacles, and does not belong to what he is observing. But if he observes two flashes of lightning, and notes the interval [Pg 23] of time between his observations; if he knows where the flashes took place, and allows, in each case, for the time the light took to reach him—in that case, if his chronometer is accurate, he naturally thinks that he has discovered the actual interval of time between the two flashes, and not something merely personal to himself. He is confirmed in this view by the fact that all other careful observers to whom he has access agree with his estimates. This, however, is only due to the fact that all these observers are on the earth, and share its motion. Even two observers in aeroplanes moving in opposite directions would have at the most a relative velocity of 400 miles an hour, which is very little in comparison with 186,000 miles a second (the velocity of light). If an electron shot out from a piece of radium with a velocity of 170,000 miles a second could observe the time between the two flashes, it would arrive at a quite different estimate, after making full allowance for the velocity of light. How do you know this? the reader may ask. You are not an electron, you cannot move at these terrific speeds, no man of science has ever made the observations which would prove the truth of your assertion. Nevertheless, as we shall see [Pg 24] in the sequel, there is good ground for the assertion—ground, first of all, in experiment, and—what is remarkable—ground in reasonings which could have been made at any time, but were not made until experiments had shown that the old reasonings must be wrong.

This shows that we need to draw a different line than what is usual when separating what belongs to the observer and what belongs to what they are observing. If someone is wearing blue glasses, they understand that the blue tint of everything is due to the glasses, not to the observation itself. But if they see two flashes of lightning and note the time between them; if they know where the flashes occurred and account for the time it took for the light to reach them—in that case, if their stopwatch is accurate, they naturally think they have discovered the actual time interval between the two flashes, rather than something just personal to them. They are supported in this belief by the fact that all other careful observers they can reach agree with their estimates. However, this is only because all these observers are on Earth and share its motion. Even two observers in airplanes flying in opposite directions would have at most a relative speed of 400 miles an hour, which is tiny compared to 186,000 miles a second (the speed of light). If an electron shot out from a piece of radium at a speed of 170,000 miles a second could observe the time between the two flashes, it would come to a completely different conclusion after accounting for the speed of light. How can you be sure of this? the reader might ask. You’re not an electron; you can’t move at such incredible speeds; no scientist has ever made the observations needed to confirm your claim. Still, as we will see later, there is solid reasoning behind this claim—first from experiments, and—remarkably—from reasoning that could have been done at any time, but only emerged after experiments showed that the old reasoning was incorrect.

There is a general principle to which the theory of relativity appeals, which turns out to be more powerful than anybody would suppose. If you know that one man is twice as rich as another, this fact must appear equally whether you estimate the wealth of both in pounds or dollars or francs or any other currency. The numbers representing their fortunes will be changed, but one number will always be double the other. The same sort of thing, in more complicated forms, reappears in physics. Since all motion is relative, you may take any body you like as your standard body of reference, and estimate all other motions with reference to that one. If you are in a train and walking to the dining-car, you naturally, for the moment, treat the train as fixed and estimate your motion by relation to it. But when you think of the journey you are making, you think of the earth as fixed, and say you [Pg 25] are moving at the rate of sixty miles an hour. An astronomer who is concerned with the solar system takes the sun as fixed, and regards you as rotating and revolving; in comparison with this motion, that of the train is so slow that it hardly counts. An astronomer who is interested in the stellar universe may add the motion of the sun relatively to the average of the stars. You cannot say that one of these ways of estimating your motion is more correct than another; each is perfectly correct as soon as the reference body is assigned. Now just as you can estimate a man’s fortune in different currencies without altering its relations to the fortunes of other men, so you can estimate a body’s motion by means of different reference bodies without altering its relations to other motions. And as physics is entirely concerned with relations, it must be possible to express all the laws of physics by referring all motions to any given body as the standard.

There’s a basic principle that the theory of relativity relies on, which turns out to be more powerful than anyone would think. If you know one person is twice as rich as another, this fact must hold true no matter if you measure their wealth in pounds, dollars, francs, or any other currency. The numbers representing their wealth will change, but one will always be double the other. A similar concept, in more complex forms, appears in physics. Since all motion is relative, you can choose any object as your reference point and measure all other movements in relation to it. If you’re on a train and walking to the dining car, you naturally consider the train as fixed for that moment and measure your movement against it. But when you think about the journey you're on, you consider the Earth as fixed and say you're moving at sixty miles an hour. An astronomer focused on the solar system treats the sun as fixed and sees you as rotating and revolving; compared to this motion, the motion of the train seems so slow that it barely matters. An astronomer interested in the wider universe might also take the sun's motion relative to an average of the stars into account. You can't say that one of these ways of measuring your motion is more accurate than another; each is completely correct once the reference object is established. Just as you can assess a person's wealth in different currencies without changing its relationship to the wealth of others, you can measure an object's motion using different reference points without altering its relationship to other movements. And since physics is all about relationships, it's possible to express all the laws of physics by relating all motions to any given object as a standard.

We may put the matter in another way. Physics is intended to give information about what really occurs in the physical world, and not only about the private perceptions of separate observers. Physics must, therefore, be concerned with those features which a physical [Pg 26] process has in common for all observers, since such features alone can be regarded as belonging to the physical occurrence itself. This requires that the laws of phenomena should be the same whether the phenomena are described as they appear to one observer or as they appear to another. This single principle is the generating motive of the whole theory of relativity.

We can look at this differently. Physics aims to provide information about what actually happens in the physical world, rather than just the individual perceptions of different observers. Therefore, physics must focus on the aspects that a physical process shares for all observers because those aspects can be considered part of the physical event itself. This means that the laws of phenomena should be consistent regardless of whether they are described from one observer's perspective or another's. This fundamental principle is the driving force behind the entire theory of relativity. [Pg 26]

Now what we have hitherto regarded as the spatial and temporal properties of physical occurrences are found to be in large part dependent upon the observer; only a residue can be attributed to the occurrences in themselves, and only this residue can be involved in the formulation of any physical law which is to have an à priori chance of being true. Einstein found ready to his hand an instrument of pure mathematics, called the theory of tensors, which enabled him to discover laws expressed in terms of the objective residue and agreeing approximately with the old laws. Where Einstein’s laws differed from the old ones, they have hitherto proved more in accord with observation.

Now what we've previously seen as the spatial and temporal properties of physical events are largely dependent on the observer; only a small part can be attributed to the events themselves, and only this part can be used in formulating any physical law that might have a chance of being true from the start. Einstein had a powerful tool at his disposal, known as the theory of tensors, which allowed him to discover laws expressed in terms of the objective residue that closely aligned with the old laws. Where Einstein's laws deviated from the old ones, they've so far proven to be more consistent with observation.

If there were no reality in the physical world, but only a number of dreams dreamed by different people, we should not expect to find any [Pg 27] laws connecting the dreams of one man with the dreams of another. It is the close connection between the perceptions of one man and the (roughly) simultaneous perceptions of another that makes us believe in a common external origin of the different related perceptions. Physics accounts both for the likenesses and for the differences between different people’s perceptions of what we call the “same” occurrence. But in order to do this it is first necessary for the physicist to find out just what are the likenesses. They are not quite those traditionally assumed, because neither space nor time separately can be taken as strictly objective. What is objective is a kind of mixture of the two called “space-time.” To explain this is not easy, but the attempt must be made; it will be begun in the next chapter.

If there weren't any real physical reality, just a bunch of dreams from different people, we shouldn't expect to see any connections between one person's dreams and another's. It's the strong link between what one person perceives and the (approximately) simultaneous perceptions of someone else that leads us to believe in a shared external source for these related perceptions. Physics explains both the similarities and the differences in how different people perceive what we call the "same" event. However, to do this, physicists first need to identify what those similarities are. They aren't exactly what we've traditionally thought, because neither space nor time can be understood as strictly objective on their own. What's actually objective is a blend of the two, referred to as "space-time." Explaining this isn't simple, but we need to try; we'll start in the next chapter.


[Pg 28]

[Pg 28]

CHAPTER III:
THE VELOCITY OF LIGHT

Most of the curious things in the theory of relativity are connected with the velocity of light. If the reader is to grasp the reasons for such a serious theoretical reconstruction, he must have some idea of the facts which made the old system break down.

Most of the interesting aspects of the theory of relativity are related to the speed of light. To understand the reasons behind this significant theoretical change, the reader needs to be aware of the facts that caused the old system to fail.

The fact that light is transmitted with a definite velocity was first established by astronomical observations. Jupiter’s moons are sometimes eclipsed by Jupiter, and it is easy to calculate the times when this ought to occur. It was found that when Jupiter was unusually near the earth an eclipse of one of his moons would be observed a few minutes earlier than was expected; and when Jupiter was unusually remote, a few minutes later than was expected. It was found that these deviations could all be accounted for by assuming that light has a certain velocity, so that what we observe to be happening in Jupiter really happened a little while ago—longer ago when Jupiter is distant than [Pg 29] when it is near. Just the same velocity of light was found to account for similar facts in regard to other parts of the solar system. It was therefore accepted that light in vacuo always travels at a certain constant rate, almost exactly 300,000 kilometers a second. (A kilometer is about five-eighths of a mile.) When it became established that light consists of waves, this velocity was that of propagation of waves in the ether—at least they used to be in the ether, but now the ether has grown somewhat shadowy, though the waves remain. This same velocity is that of the waves used in wireless telegraphy (which are like light waves, only longer) and in X-rays (which are like light waves, only shorter). It is generally held nowadays to be the velocity with which gravitation is propagated, though Eddington considers this not yet certain. (It used to be thought that gravitation was propagated instantaneously, but this view is now abandoned.)

The idea that light travels at a specific speed was first proven through astronomical observations. Jupiter’s moons sometimes get eclipsed by Jupiter, making it easy to predict when these events should happen. It was discovered that when Jupiter was particularly close to Earth, an eclipse of one of its moons would be seen a few minutes earlier than expected; conversely, when Jupiter was far away, the eclipse would happen a few minutes later than anticipated. These differences could be explained by the fact that light has a certain speed, meaning what we see happening with Jupiter actually occurred a bit earlier—longer ago when Jupiter is far away compared to when it is near. The same speed of light also explained similar phenomena in other parts of the solar system. Thus, it became accepted that light in vacuo always travels at a constant speed, nearly 300,000 kilometers per second. (A kilometer is approximately five-eighths of a mile.) Once it was established that light consists of waves, this speed was understood as the rate at which waves propagate in the ether—although the ether concept has faded somewhat, the waves still exist. This same speed applies to the waves used in wireless telegraphy (which are similar to light waves but longer) and in X-rays (which are similar to light waves but shorter). Nowadays, it is generally believed this is also the speed at which gravity propagates, although Eddington thinks this is still uncertain. (Previously, it was thought that gravity propagated instantly, but that view has since been discarded.)

So far, all is plain sailing. But as it became possible to make more accurate measurements, difficulties began to accumulate. The waves were supposed to be in the ether, and therefore their velocity ought to be relative to the ether. Now since the ether (if it exists) clearly offers no resistance to the motions of the heavenly bodies, it would [Pg 30] seem natural to suppose that it does not share their motion. If the earth had to push a lot of ether before it, in the sort of way that a steamer pushes water before it, one would expect a resistance on the part of the ether analogous to that offered by the water to the steamer. Therefore the general view was that the ether could pass through bodies without difficulty, like air through a coarse sieve, only more so. If this were the case, then the earth in its orbit must have a velocity relative to the ether. If, at some point of its orbit, it happened to be moving exactly with the ether, it must at other points be moving through it all the faster. If you go for a circular walk on a windy day, you must be walking against the wind part of the way, whatever wind may be blowing; the principle in this case is the same. It follows that, if you choose two days six months apart, when the earth in its orbit is moving in exactly opposite directions, it must be moving against an ether wind on at least one of these days.

So far, everything has gone smoothly. But as we started getting better measurements, problems began to pile up. The waves were thought to be in the ether, so their speed should be relative to it. Since the ether (if it exists) clearly doesn't resist the motion of celestial bodies, it seems reasonable to assume it doesn’t move with them. If the Earth had to push a lot of ether ahead, like a steamboat pushes water, we would expect some resistance from the ether similar to what water offers to the steamboat. So, the general belief was that ether could flow through objects easily, like air through a coarse sieve, only even more so. If this is true, then the Earth in its orbit must have a speed relative to the ether. If, at some point in its orbit, it happens to move exactly with the ether, then at other points, it must be moving through it at a faster rate. If you take a circular walk on a windy day, you must be walking against the wind for part of the time, no matter which way the wind is blowing; the principle is the same here. This means that, if you pick two days six months apart, when the Earth in its orbit is moving in exactly opposite directions, it must be moving against an ether wind on at least one of those days.

Now if there is an ether wind, it is clear that, relatively to an observer on the earth, light signals will seem to travel faster with [Pg 31] the wind than across it, and faster across it than against it. This is what Michelson and Morley set themselves to test by their famous experiment. They sent out light signals in two directions at right angles; each was reflected from a mirror, and came back to the place from which both had been sent out. Now anybody can verify, either by trial or by a little arithmetic, that it takes longer to row a given distance on a river upstream and then back again, than it takes to row the same distance across the stream and back again. Therefore, if there were an ether wind, one of the two light signals, which consist of waves in the ether, ought to have traveled to the mirror and back at a slower average rate than the other. Michelson and Morley tried the experiment, they tried it in various positions, they tried it again later. Their apparatus was quite accurate enough to have detected the expected difference of speed or even a much smaller difference, if it had existed, but not the smallest difference could be observed. The result was a surprise to them as to everybody else; but careful repetitions made doubt impossible. The experiment was first made as long ago as 1881, and was repeated with more elaboration in 1887. But [Pg 32] it was many years before it could be rightly interpreted.

Now, if there is an ether wind, it's clear that, for someone observing from Earth, light signals will seem to travel faster with the wind than across it, and faster across it than against it. This is what Michelson and Morley set out to test with their famous experiment. They sent out light signals in two directions at right angles; each was reflected from a mirror and returned to the original point. Anyone can verify, either by testing or by a bit of math, that it takes longer to row a certain distance upstream on a river and then back down than to row the same distance across the stream and back. So, if there were an ether wind, one of the two light signals, which are waves in the ether, should have taken longer on average to travel to the mirror and back compared to the other. Michelson and Morley conducted the experiment, tried it in different positions, and repeated it later. Their equipment was accurate enough to detect the expected difference in speed or even a smaller difference if it existed, but no difference could be observed at all. The outcome surprised them, just as it surprised everyone else; however, careful repetitions made doubt impossible. The experiment was first conducted as early as 1881 and was repeated with more detail in 1887. But it took many years before it could be correctly interpreted.

The supposition that the earth carries the neighboring ether with it in its motion was found to be impossible, for a number of reasons. Consequently a logical deadlock seemed to have arisen, from which at first physicists sought to extricate themselves by very arbitrary hypotheses. The most important of these was that of Fitzgerald, developed by Lorentz, and known as the Fitzgerald contraction hypothesis.

The idea that the earth moves through the surrounding ether was proven to be impossible for various reasons. As a result, physicists faced a logical impasse, and initially tried to resolve it with some very arbitrary theories. The most significant of these was Fitzgerald's theory, which Lorentz expanded upon, and it's known as the Fitzgerald contraction hypothesis.

According to this hypothesis, when a body is in motion it becomes shortened in the direction of motion by a certain proportion depending upon its velocity. The amount of the contraction was to be just enough to account for the negative result of the Michelson-Morley experiment. The journey up stream and down again was to have been really a shorter journey than the one across the stream, and was to have been just so much shorter as would enable the slower light wave to traverse it in the same time. Of course the shortening could never be detected by measurement, because our measuring rods would share it. A foot rule placed in the line of the earth’s motion would be shorter than the [Pg 33] same foot rule placed at right angles to the earth’s motion. This point of view resembles nothing so much as the White Knight’s “plan to dye my whiskers green, and always use so large a fan that they could not be seen.” The odd thing was that the plan worked well enough. Later on, when Einstein propounded his special theory of relativity (1905), it was found that the theory was in a certain sense correct, but only in a certain sense. That is to say, the supposed contraction is not a physical fact, but a result of certain conventions of measurement which, when once the right point of view has been found, are seen to be such as we are almost compelled to adopt. But I do not wish yet to set forth Einstein’s solution of the puzzle. For the present, it is the nature of the puzzle itself that I want to make clear.

According to this idea, when an object is moving, it gets shorter in the direction it's moving by a certain amount based on its speed. The amount of contraction was supposed to be just enough to explain the negative outcome of the Michelson-Morley experiment. The journey upstream and back was meant to be actually a shorter distance than the one across the stream, and it would be just short enough to allow the slower light wave to cover it in the same amount of time. Of course, the contraction could never be noticed in measurements because our measuring tools would experience it too. A foot ruler positioned along the direction of the Earth’s movement would be shorter than the same foot ruler placed perpendicular to that motion. This idea is quite similar to the White Knight’s “plan to dye my whiskers green and always use such a big fan that they couldn’t be seen.” The strange part was that the plan actually worked reasonably well. Later, when Einstein introduced his special theory of relativity (1905), it was recognized that the theory was correct in some ways, but only in a specific sense. In other words, the assumed contraction isn’t a physical reality but rather a consequence of certain measurement conventions that, once the right perspective is adopted, are seen as almost unavoidable. However, I don’t want to dive into Einstein’s solution to the riddle yet. For now, I want to clarify the nature of the riddle itself.

On the face of it, and apart from hypotheses ad hoc, the Michelson-Morley experiment (in conjunction with others) showed that, relatively to the earth, the velocity of light is the same in all directions, and that this is equally true at all times of the year, although the direction of the earth’s motion is always changing as [Pg 34] it goes round the sun. Moreover, it appeared that this is not a peculiarity of the earth, but is true of all bodies: if a light signal is sent out from a body, that body will remain at the center of the waves as they travel outwards, no matter how it may be moving—at least, that will be the view of observers moving with the body. This was the plain and natural meaning of the experiments, and Einstein succeeded in inventing a theory which accepted it. But at first it was thought logically impossible to accept this plain and natural meaning.

On the surface, and aside from specific hypotheses, the Michelson-Morley experiment (along with others) demonstrated that, relative to the Earth, the speed of light is consistent in all directions, and this holds true at all times of the year, even though the Earth's motion continually changes as it orbits the sun. Furthermore, it seemed that this isn't just a characteristic of the Earth, but applies to all objects: if a light signal is emitted from a moving object, that object will stay at the center of the waves as they spread outward, regardless of its movement—at least, that's how observers moving with the object would perceive it. This was the clear and straightforward interpretation of the experiments, and Einstein managed to develop a theory that embraced it. However, initially, it was believed to be logically impossible to accept this clear and straightforward interpretation.

A few illustrations will make it clear how very odd the facts are. When a shell is fired, it moves faster than sound: the people at whom it is fired first see the flash, then (if they are lucky) see the shell go by, and last of all hear the report. It is clear that if you could put a scientific observer on the shell, he would never hear the report, as the shell would burst and kill him before the sound had overtaken him. But if sound worked on the same principles as light, our observer would hear everything just as if he were at rest. In that case, if a screen, suitable for producing echoes, were attached to the shell and traveling with it, say a hundred yards in front of it, our observer would hear [Pg 35] the echo of the report from the screen after just the same interval of time as if he and the shell were at rest. This, of course, is an experiment which cannot be performed, but others which can be performed will show the difference. We might find some place on a railway where there is an echo from a place further along the railway—say a place where the railway goes into a tunnel—and when a train is traveling along the railway, let a man on the bank fire a gun. If the train is traveling towards the echo, the passengers will hear the echo sooner than the man on the bank; if it is traveling in the opposite direction, they will hear it later. But these are not quite the circumstances of the Michelson-Morley experiment. The mirrors in that experiment correspond to the echo, and the mirrors are moving with the earth, so that echo ought to move with the train. Let us suppose that the shot is fired from the guard’s van, and the echo comes from a screen on the engine. We will suppose the distance from the guard’s van to the engine to be the distance that sound can travel in a second (about one-fifth of a mile), and the speed of the train to be one-twelfth of the speed [Pg 36] of sound (about sixty miles an hour). We now have an experiment which can be performed by the people in the train. If the train were at rest, the guard would hear the echo in two seconds; as it is, he will hear it in 2 and ²/₁₄₃ seconds. From this difference, if he knows the velocity of sound, he can calculate the velocity of the train, even if it is a foggy night so that he cannot see the banks. But if sound behaved like light, he would hear the echo in two seconds however fast the train might be traveling.

A few examples will clarify just how strange the facts are. When a shell is fired, it travels faster than sound: the people it’s aimed at first see the flash, then (if they're lucky) see the shell go by, and finally hear the sound. It's clear that if you could place a scientific observer on the shell, they would never hear the sound, since the shell would explode and kill them before the sound reached them. However, if sound worked the same way as light, our observer would hear everything as if they were standing still. In that case, if a screen designed to create echoes was attached to the shell and was traveling with it, say a hundred yards in front of it, our observer would hear the echo of the sound from the screen after the same amount of time as if both they and the shell were stationary. This is, of course, an experiment that can't be conducted, but there are others that can demonstrate the difference. We could find a spot on a railway where there’s an echo further along the tracks—like where the railway goes into a tunnel—and when a train is moving along the tracks, a man on the side can fire a gun. If the train is moving toward the echo, the passengers will hear the echo sooner than the man on the side; if it's moving away, they will hear it later. However, these aren’t exactly the conditions of the Michelson-Morley experiment. The mirrors in that experiment act like the echo, and because the mirrors are moving with the Earth, the echo should move with the train. Let’s assume the shot is fired from the guard’s van, and the echo comes from a screen on the engine. We’ll say the distance from the guard's van to the engine is the distance that sound can travel in a second (about one-fifth of a mile), and the speed of the train is one-twelfth of the speed of sound (about sixty miles an hour). Now we have an experiment that can be conducted by the people on the train. If the train were stopped, the guard would hear the echo in two seconds; as it is, he will hear it in 2 and ²/₁₄₃ seconds. From this difference, if he knows the speed of sound, he can calculate the train's speed, even on a foggy night when he can’t see the sides. But if sound behaved like light, he would hear the echo in two seconds no matter how fast the train was moving.

Various other illustrations will help to show how extraordinary—from the point of view of tradition and common sense—are the facts about the velocity of light. Every one knows that if you are on an escalator you reach the top sooner if you walk up than if you stand still. But if the escalator moved with the velocity of light (which it does not do even in New York), you would reach the top at exactly the same moment whether you walked up or stood still. Again: if you are walking along a road at the rate of four miles an hour, and a motor-car passes you going in the same direction at the rate of forty miles an hour, if you and the motor-car both keep going the distance between you after an [Pg 37] hour will be thirty-six miles. But if the motor-car met you, going in the opposite direction, the distance after an hour would be forty-four miles. Now if the motor-car were traveling with the velocity of light, it would make no difference whether it met or passed you: in either case, it would, after a second, be 186,000 miles from you. It would also be 186,000 miles from any other motor-car which happened to be passing or meeting you less rapidly at the previous second. This seems impossible: how can the car be at the same distance from a number of different points along the road?

Various other examples will illustrate how extraordinary the facts about the speed of light are, given our traditional views and common sense. Everyone knows that if you're on an escalator, you get to the top faster if you walk up than if you just stand there. However, if the escalator moved at the speed of light (which it certainly doesn't in New York), you would reach the top at the same time whether you walked or stood still. Similarly, if you're walking down a road at four miles an hour and a car passes you in the same direction at forty miles an hour, the distance between you after an hour will be thirty-six miles. But if the car is coming toward you from the opposite direction, after an hour, the distance would be forty-four miles. Now, if the car were traveling at the speed of light, it wouldn't matter if it passed or met you; in either case, after just a second, it would be 186,000 miles away from you. It would also be 186,000 miles away from any other car that happened to pass or meet you just a second before. This seems impossible: how can the car be the same distance from multiple points along the road? [Pg 37]

Let us take another illustration. When a fly touches the surface of a stagnant pool, it causes ripples which move outwards in widening circles. The center of the circle at any moment is the point of the pool touched by the fly. If the fly moves about over the surface of the pool, it does not remain at the center of the ripples. But if the ripples were waves of light, and the fly were a skilled physicist, it would find that it always remained at the center of the ripples, however it might move. Meanwhile a skilled physicist sitting beside the pool would judge, as in the case of ordinary ripples, that the center [Pg 38] was not the fly, but the point of the pool touched by the fly. And if another fly had touched the water at the same spot at the same moment, it also would find that it remained at the center of the ripples, even if it separated itself widely from the first fly. This is exactly analogous to the Michelson-Morley experiment. The pool corresponds to the ether; the fly corresponds to the earth; the contact of the fly and the pool corresponds to the light signal which Messrs. Michelson and Morley send out; and the ripples correspond to the light waves.

Let’s consider another example. When a fly lands on the surface of a still pond, it creates ripples that spread out in expanding circles. The center of the circle at any given moment is where the fly touched the water. If the fly moves around on the surface, it doesn’t stay at the center of the ripples. But if the ripples were light waves and the fly was a skilled physicist, it would discover that it always remained at the center of the ripples, no matter how it moved. Meanwhile, a skilled physicist sitting next to the pond would conclude, just like with regular ripples, that the center was not the fly but the point where the fly touched the water. If another fly landed in the same spot at the same time, it would also find itself at the center of the ripples, even if it moved far away from the first fly. This is exactly similar to the Michelson-Morley experiment. The pond represents the ether; the fly represents the earth; the fly’s contact with the pond represents the light signal that Michelson and Morley sent out; and the ripples represent the light waves.

Such a state of affairs seems, at first sight, quite impossible. It is no wonder that, although the Michelson-Morley experiment was made in 1881, it was not rightly interpreted until 1905. Let us see what, exactly, we have been saying. Take the man walking along a road and passed by a motor-car. Suppose there are a number of people at the same point of the road, some walking, some in motor-cars; suppose they are going at varying rates, some in one direction and some in another. I say that if, at this moment, a light flash is sent out from the place where they all are, the light waves will be 186,000 miles from each [Pg 39] one of them after a second by his watch, although the travelers will not any longer be all in the same place. At the end of a second by your watch it will be 186,000 miles from you, and it will also be 186,000 miles from a person who met you when it was sent out, but was moving in the opposite direction, after a second by his watch—assuming both to be perfect watches. How can this be?

This situation seems completely impossible at first glance. It's not surprising that even though the Michelson-Morley experiment took place in 1881, it wasn't properly understood until 1905. Let's clarify what we've been discussing. Imagine a person walking down a road and being passed by a car. Picture several people at the same spot on the road, some walking and some in cars, all moving at different speeds, with some going in one direction and others in another. I say that if, at that moment, a light flash is sent out from where they all are, the light waves will be 186,000 miles from each of them after one second by their watches, even though the travelers won't all be in the same place anymore. After one second on your watch, the light will be 186,000 miles away from you, and it will also be 186,000 miles away from someone who was next to you when it was sent but is now moving in the opposite direction, after one second on their watch—assuming both watches are perfectly accurate. How can this happen?

There is only one way of explaining such facts, and that is, to assume that watches and clocks are affected by motion. I do not mean that they are affected in ways that could be remedied by greater accuracy in construction; I mean something much more fundamental. I mean that, if you say an hour has elapsed between two events, and if you base this assertion upon ideally careful measurements with ideally accurate chronometers, another equally precise person, who has been moving rapidly relatively to you, may judge that the time was more or less than an hour. You cannot say that one is right and the other wrong, any more than you could if one used a clock showing Greenwich time and another a clock showing New York time. How this comes about, I shall explain in the next chapter. [Pg 40]

There’s only one way to explain these facts, and that’s by assuming that watches and clocks are influenced by motion. I’m not saying they’re affected in ways that could be fixed by making them more accurate; I’m talking about something much more fundamental. If you say an hour has passed between two events, and you base this statement on perfectly careful measurements with extremely accurate clocks, another equally precise person who’s been moving quickly relative to you might say that the time was either more or less than an hour. You can’t claim that one person is right and the other wrong, just like you wouldn’t if one used a clock showing Greenwich time and the other a clock showing New York time. I’ll explain how this happens in the next chapter. [Pg 40]

There are other curious things about the velocity of light. One is, that no material body can ever travel as fast as light, however great may be the force to which it is exposed, and however long the force may act. An illustration may help to make this clear. At exhibitions one sometimes sees a series of moving platforms, going round and round in a circle. The outside one goes at four miles an hour; the next goes four miles an hour faster than the first; and so on. You can step across from each to the next; until you find yourself going at a tremendous pace. Now you might think that, if the first platform does four miles an hour, and the second does four miles an hour relatively to the first, then the second does eight miles an hour relatively to the ground. This is an error; it does a little less, though so little less that not even the most careful measurements could detect the difference. I want to make quite clear what it is that I mean. I will suppose that, in the morning, when the apparatus is just about to start, three men with ideally accurate chronometers stand in a row, one on the ground, one on the first platform, and one on the second. The [Pg 41] first platform moves at the rate of four miles an hour with respect to the ground. Four miles an hour is 352 feet in a minute. The man on the ground, after a minute by his watch, notes the place on the ground opposite the man on the first platform, who has been standing still while the platform carried him along. The man on the ground measures the distance on the ground from himself to the point opposite the man on the first platform, and finds it is 352 feet. The man on the first platform, after a minute by his watch, notes the point on his platform opposite to the man on the second platform. The man on the first platform measures the distance from himself to the point opposite the man on the second platform; it is again 352 feet. Problem: how far will the man on the ground judge that the man on the second platform has traveled in a minute? That is to say, if the man on the ground, after a minute by his watch, notes the place on the ground opposite the man on the second platform, how far will this be from the man on the ground? You would say, twice 352 feet, that is to say, 704 feet. But in fact it will be a little less, though so little less as to be inappreciable. The discrepancy is owing to the fact that the two watches do not keep perfect time, in spite of the fact that each is [Pg 42] accurate from its owner’s point of view. If you had a long series of such moving platforms, each moving four miles an hour relatively to the one before it, you would never reach a point where the last was moving with the velocity of light relatively to the ground, not even if you had millions of them. The discrepancy, which is very small for small velocities, becomes greater as the velocity increases, and makes the velocity of light an unattainable limit. How all this happens, is the next topic with which we must deal.

There are other interesting things about the speed of light. One is that no object can ever move as fast as light, no matter how powerful the force acting on it or how long that force is applied. An example might help clarify this. At fairs, you sometimes see a series of moving platforms that go around in a circle. The outer one moves at four miles per hour; the next one moves four miles per hour faster than the first, and so on. You can step from one to the next and find yourself going really fast. Now you might think that if the first platform goes four miles per hour, and the second platform goes four miles per hour relative to the first, then the second platform is going eight miles per hour relative to the ground. This is incorrect; it actually moves slightly less than that, but so little less that even the most careful measurements couldn't detect the difference. Let me clarify what I mean. I'll assume that, in the morning, when the setup is about to start, three men with perfectly accurate stopwatches stand in a line: one on the ground, one on the first platform, and one on the second. The first platform moves at four miles per hour with respect to the ground. Four miles per hour is 352 feet in a minute. After one minute according to his watch, the man on the ground notes the spot on the ground opposite the man on the first platform, who has been standing still while the platform moved under him. The man on the ground measures the distance from himself to the point opposite the man on the first platform and finds it is 352 feet. After one minute according to his watch, the man on the first platform notes the point on his platform opposite the man on the second platform. He measures the distance from himself to the point opposite the man on the second platform; it is again 352 feet. The question is: how far does the man on the ground think the man on the second platform has traveled in one minute? That is, when the man on the ground notes the spot opposite the second platform after a minute, how far will that be from him? You might say, twice 352 feet, which is 704 feet. But in reality, it will be a bit less, though so slightly less that it’s barely noticeable. The difference occurs because the two watches don’t keep perfect time, even though each is accurate from its owner's perspective. If you had a long series of such moving platforms, each moving four miles an hour relative to the one before it, you would never reach a point where the last platform is moving at the speed of light relative to the ground, not even if you had millions of them. The difference, which is very small for low speeds, becomes larger as the speed increases, making the speed of light an unreachable limit. How all this works is the next topic we need to cover.

Note. The negative result of the Michelson-Morley experiment has recently been called in question by Professor Dayton C. Miller, as a result of observations by what is said to be an improved method. His claim is set forth by Professor Silberstein in Nature, May 23, 1925, and discussed unfavorably by Eddington in the issue of June 6. The matter is sub judice, but it seems highly questionable whether the results bear out the interpretation which is put upon them.

Note. The negative result of the Michelson-Morley experiment has recently been challenged by Professor Dayton C. Miller, based on observations using what is claimed to be an improved method. His assertion is presented by Professor Silberstein in Nature, May 23, 1925, and critiqued by Eddington in the issue from June 6. The matter is sub judice, but it seems very questionable whether the results support the interpretation that has been assigned to them.


[Pg 43]

[Pg 43]

CHAPTER IV:
CLOCKS AND FOOT RULES

Until the advent of the special theory of relativity, no one had thought that there could be any ambiguity in the statement that two events in different places happened at the same time. It might be admitted that, if the places were very far apart, there might be difficulty in finding out for certain whether the events were simultaneous, but every one thought the meaning of the question perfectly definite. It turned out, however, that this was a mistake. Two events in distant places may appear simultaneous to one observer who has taken all due precautions to insure accuracy (and, in particular, has allowed for the velocity of light), while another equally careful observer may judge that the first event preceded the second, and still another may judge that the second preceded the first. This would happen if the three observers were all moving rapidly relatively to each other. It would not be the case that one of them would be right and the other two wrong: they would all be equally right. The time order of events is in part dependent upon the observer; it is not always and altogether an intrinsic relation between the events themselves. Einstein has shown, not only that this view accounts for the phenomena, but also that it is the one which ought to have resulted from careful reasoning based upon the old data. In actual fact, however, no one noticed the logical basis of the theory of relativity until the odd results of experiment had given a jog to people’s reasoning powers.

Until the arrival of the special theory of relativity, no one considered that there could be any uncertainty in the claim that two events in different locations occurred at the same time. It could be acknowledged that if the locations were very far apart, there might be challenges in confirming whether the events were truly simultaneous, but everyone thought the question's meaning was perfectly clear. However, this turned out to be a misconception. Two events in distant locations may seem simultaneous to one observer who has taken all necessary precautions to ensure accuracy (especially accounting for the speed of light), while another equally careful observer might conclude that the first event happened before the second, and yet another might determine that the second event occurred before the first. This situation would arise if the three observers were all moving quickly relative to one another. It wouldn't mean that one of them was correct and the other two were wrong; they would all be equally correct. The chronological order of events partly depends on the observer; it is not always a straightforward relationship between the events themselves. Einstein demonstrated that this perspective not only explains the phenomena but also should have emerged from careful reasoning based on previous data. In reality, however, no one recognized the logical foundation of the theory of relativity until the unusual results of experiments prompted people to rethink their reasoning.

[Pg 44] How should we naturally decide whether two events in different places were simultaneous? One would naturally say: they are simultaneous if they are seen simultaneously by a person who is exactly half-way between them. (There is no difficulty about the simultaneity of two events in the same place, such, for example, as seeing a light and hearing a noise.) Suppose two flashes of lightning fall in two different places, say Greenwich Observatory and Kew Observatory. Suppose that St. Paul’s is half-way between them, and that the flashes appear simultaneous to an observer on the dome of St. Paul’s. In that [Pg 45] case, a man at Kew will see the Kew flash first, and a man at Greenwich will see the Greenwich flash first, because of the time taken by light to travel over the intervening distance. But all three, if they are ideally accurate observers, will judge that the two flashes were simultaneous, because they will make the necessary allowance for the time of transmission of the light. (I am assuming a degree of accuracy far beyond human powers.) Thus, so far as observers on the earth are concerned, the definition of simultaneity will work well enough, so long as we are dealing with events on the surface of the earth. It gives results which are consistent with each other, and can be used for terrestrial physics in all problems in which we can ignore the fact that the earth moves.

[Pg 44] How should we naturally determine if two events happening in different locations occur at the same time? One would naturally say: they are simultaneous if a person standing exactly halfway between them sees them both at the same time. (There’s no issue with the simultaneity of two events in the same place, such as seeing a light and hearing a sound.) Let’s say two flashes of lightning strike at two different spots, like Greenwich Observatory and Kew Observatory. If St. Paul’s is halfway between them, and the flashes appear simultaneous to someone on the dome of St. Paul’s, then a person at Kew will see the Kew flash first, and a person at Greenwich will see the Greenwich flash first, due to the time it takes for light to travel the distance in between. However, if all three are perfectly accurate observers, they will conclude that the two flashes were simultaneous, as they will account for the time it takes for the light to travel. (I’m assuming a level of accuracy that far exceeds human capability.) Therefore, as far as observers on Earth are concerned, this definition of simultaneity works well enough when we are talking about events on the planet's surface. It produces consistent outcomes and can be applied to terrestrial physics in all situations where we can ignore the Earth's movement. [Pg 45]

But our definition is no longer so satisfactory when we have two sets of observers in rapid motion relatively to each other. Suppose we see what would happen if we substitute sound for light, and defined two occurrences as simultaneous when they are heard simultaneously by a man half-way between them. This alters nothing in the principle, but makes the matter easier owing to the much slower velocity of sound. [Pg 46] Let us suppose that on a foggy night two men belonging to a gang of brigands shoot the guard and engine driver of a train. The guard is at the end of the train; the brigands are on the line, and shoot their victims at close quarters. An old gentleman who is exactly in the middle of the train hears the two shots simultaneously. You would say, therefore, that the two shots were simultaneous. But a station master who is exactly half-way between the two brigands hears the shot which kills the guard first. An Australian millionaire uncle of the guard and the engine driver (who are cousins) has left his whole fortune to the guard, or, should he die first, to the engine driver. Vast sums are involved in the question of which died first. The case goes to the House of Lords, and the lawyers on both sides, having been educated at Oxford, are agreed that either the old gentleman or the station master must have been mistaken. In fact, both may perfectly well be right. The train travels away from the shot at the guard, and towards the shot at the engine driver; therefore the noise of the shot at the guard has farther to go before reaching the old gentleman than the shot at the engine driver has. Therefore if the old gentleman is right in saying [Pg 47] that he heard the two reports simultaneously, the station master must be right in saying that he heard the shot at the guard first.

But our definition doesn’t really hold up when we have two groups of observers moving quickly relative to each other. Imagine if we swapped sound for light and defined two events as simultaneous if they're heard at the same time by a person positioned halfway between them. This doesn’t change the principle but simplifies things because sound travels much slower. [Pg 46] Let’s say on a foggy night, two men from a gang of outlaws shoot the guard and the engine driver of a train. The guard is at the back of the train; the outlaws are on the tracks and shoot their targets up close. An elderly gentleman sitting exactly in the middle of the train hears the two shots at the same time. You would then conclude that the two shots were simultaneous. However, a station master located halfway between the two outlaws hears the shot that kills the guard first. An Australian millionaire uncle of the guard and the engine driver (who are cousins) has left his entire fortune to the guard, or, if the guard dies first, to the engine driver. A lot of money is at stake regarding who died first. The case goes to the House of Lords, and the lawyers on both sides, having been educated at Oxford, agree that either the old gentleman or the station master must have been mistaken. In reality, both can be completely correct. The train moves away from the shot at the guard and toward the shot at the engine driver; therefore, the sound from the shot that hit the guard has a farther distance to travel before it reaches the old gentleman than the sound from the shot that hit the engine driver. Thus, if the old gentleman is correct in claiming that he heard the two shots simultaneously, then the station master must be correct in saying that he heard the shot at the guard first. [Pg 47]

We, who live on the earth, would naturally, in such a case, prefer the view of simultaneity obtained from a person at rest on the earth to the view of a person traveling in a train. But in theoretical physics no such parochial prejudices are permissible. A physicist on a comet, if there were one, would have just as good a right to his view of simultaneity as an earthly physicist has to his, but the results would differ, in just the same sort of way as in our illustration of the train and the shots. The train is not any more “really” in motion than the earth; there is no “really” about it. You might imagine a rabbit and a hippopotamus arguing as to whether man is “really” a large animal; each would think his own point of view the natural one, and the other a pure flight of fancy. There is just as little substance in an argument as to whether the earth or the train is “really” in motion. And, therefore, when we are defining simultaneity between distant events, we have no right to pick and choose among different [Pg 48] bodies to be used in defining the point half-way between the events. All bodies have an equal right to be chosen. But if, for one body, the two events are simultaneous according to the definition, there will be other bodies for which the first precedes the second, and still others for which the second precedes the first. We cannot therefore say unambiguously that two events in distant places are simultaneous. Such a statement only acquires a definite meaning in relation to a definite observer. It belongs to the subjective part of our observation of physical phenomena, not to the objective part which is to enter into physical laws.

We, who live on Earth, would naturally prefer the perspective of simultaneity from someone at rest on the planet over that of someone traveling in a train. However, in theoretical physics, no such narrow-minded biases are acceptable. A physicist on a comet, if there were one, would have just as valid a perspective on simultaneity as an Earth-based physicist, but the outcomes would differ in the same way as our example of the train and the shots. The train is not "really" in motion any more than the Earth is; there’s no "really" about it. You could imagine a rabbit and a hippopotamus arguing about whether humans are "really" large animals; each would consider their own perspective the natural one and the other a mere fantasy. There's just as little substance in an argument over whether the Earth or the train is "really" moving. Therefore, when defining simultaneity between distant events, we have no right to choose one specific body over others to establish the midpoint between the events. All bodies have an equal right to be selected. However, if for one body the two events are simultaneous according to our definition, there will be other bodies for which the first event comes before the second and still others for which the second event comes before the first. We cannot, therefore, say definitively that two events in different locations are simultaneous. Such a statement only gains a clear meaning in relation to a specific observer. It pertains to the subjective aspect of our observation of physical phenomena, not to the objective part that should be incorporated into physical laws. [Pg 48]

This question of time in different places is perhaps, for the imagination, the most difficult aspect of the theory of relativity. We are accustomed to the idea that everything can be dated. Historians make use of the fact that there was an eclipse of the sun visible in China on August 29 in the year 776 B. C.[1] No doubt astronomers could tell the exact hour and minute when the [Pg 49] eclipse began to be total at any given spot in North China. And it seems obvious that we can speak of the positions of the planets at a given instant. The Newtonian theory enables us to calculate the distance between the earth and (say) Jupiter at a given time by the Greenwich clocks; this enables us to know how long light takes at that time to travel from Jupiter to the earth—say half an hour; this enables us to infer that half an hour ago Jupiter was where we see it now. All this seems obvious. But in fact it only works in practice because the relative velocities of the planets are very small compared with the velocity of light. When we judge that an event on the earth and an event on Jupiter have happened at the same time—for example, that Jupiter eclipsed one of his moons when the Greenwich clocks showed twelve midnight—a person moving rapidly relatively to the earth would judge differently, assuming that both he and we had made the proper allowance for the velocity of light. And naturally the disagreement about simultaneity involves a disagreement about periods of time. If we judged that two events on Jupiter were separated by [Pg 50] twenty-four hours, another person might judge that they were separated by a longer time, if he were moving rapidly relatively to Jupiter and the earth.

This issue of time in different places is probably the most challenging part of the theory of relativity for our imagination. We’re used to thinking that everything can be dated. Historians rely on the fact that there was a solar eclipse visible in China on August 29, 776 B.C.[1]. No doubt astronomers could pinpoint the exact hour and minute when the eclipse became total at any given location in North China. It seems obvious that we can talk about the positions of the planets at a specific moment. The Newtonian theory lets us calculate the distance between the Earth and, say, Jupiter at a particular time using Greenwich clocks; this allows us to determine how long light takes to travel from Jupiter to Earth at that time—about half an hour; this means we can infer that half an hour ago, Jupiter was where we see it now. All of this seems straightforward. However, it really only works because the relative speeds of the planets are very small compared to the speed of light. When we conclude that an event on Earth and an event on Jupiter occurred simultaneously—for instance, that Jupiter eclipsed one of its moons when the Greenwich clocks showed midnight—a person moving quickly relative to Earth would have a different judgment, provided that both of us accounted for the speed of light. Naturally, differing views on simultaneity lead to disagreements about time intervals. If we determined that two events on Jupiter were 24 hours apart, another person might think they were separated by a longer time if he were moving rapidly relative to both Jupiter and Earth.

The universal cosmic time which used to be taken for granted is thus no longer admissible. For each body, there is a definite time order for the events in its neighborhood; this may be called the “proper” time for that body. Our own experience is governed by the proper time for our own body. As we all remain very nearly stationary on the earth, the proper times of different human beings agree, and can be lumped together as terrestrial time. But this is only the time appropriate to large bodies on the earth. For Beta-particles in laboratories, quite different times would be wanted; it is because we insist upon using our own time that these particles seem to increase in mass with rapid motion. From their own point of view, their mass remains constant, and it is we who suddenly grow thin or corpulent. The history of a physicist as observed by a Beta-particle would resemble Gulliver’s travels.

The universal cosmic time that used to be taken for granted is no longer acceptable. For each object, there is a specific timeline for the events happening around it; this is referred to as the “proper” time for that object. Our own experiences are based on the proper time for our bodies. Since we all stay mostly still on the earth, the proper times of different people align and can be grouped together as terrestrial time. However, this is only the time relevant to large objects on earth. For Beta particles in labs, entirely different times would be needed; it's because we insist on using our own time that these particles appear to gain mass with fast motion. From their perspective, their mass stays the same, and it's us who suddenly seem to lose or gain weight. The life story of a physicist as seen by a Beta particle would be like Gulliver’s travels.

The question now arises: what really is measured by a clock? When we speak of a clock in the theory of relativity, we do not mean only clocks made by human hands: we mean anything which goes through some [Pg 51] regular periodic performance. The earth is a clock, because it rotates once in every twenty-three hours and fifty-six minutes. An atom is a clock, because the electrons go round the nucleus a certain number of times in a second; its properties as a clock are exhibited to us in its spectrum, which is due to light waves of various frequencies. The world is full of periodic occurrences, and fundamental mechanisms, such as atoms, show an extraordinary similarity in different parts of the universe. Any one of these periodic occurrences may be used for measuring time; the only advantage of humanly manufactured clocks is that they are specially easy to observe. One question is: If cosmic time is abandoned, what is really measured by a clock in the wide sense that we have just given to the term?

The question now arises: what does a clock actually measure? When we talk about a clock in the theory of relativity, we’re not just referring to human-made clocks; we mean anything that has a regular, repetitive cycle. The Earth is a clock because it completes a rotation every twenty-three hours and fifty-six minutes. An atom is also a clock, as the electrons orbit the nucleus a specific number of times per second; its clock-like properties are revealed in its spectrum, which is made up of light waves of various frequencies. The universe is full of periodic events, and fundamental structures like atoms show remarkable similarities across different areas of the cosmos. Any of these periodic events can be used to measure time; the only real benefit of human-made clocks is that they are particularly easy to observe. One question to consider is: if we discard cosmic time, what is actually measured by a clock in the broad sense we’ve just discussed? [Pg 51]

Each clock gives a correct measure of its own “proper” time, which, as we shall see presently, is an important physical quantity. But it does not give an accurate measure of any physical quantity connected with events on bodies that are moving rapidly in relation to it. It gives one datum towards the discovery of a physical quantity connected [Pg 52] with such events, but another datum is required, and this has to be derived from measurement of distances in space. Distances in space, like periods of time, are in general not objective physical facts, but partly dependent upon the observer. How this comes about must now be explained.

Each clock accurately tracks its own “proper” time, which, as we'll discuss shortly, is a significant physical quantity. However, it doesn’t provide an exact measure of any physical quantity related to events on objects that are moving quickly relative to it. It offers one piece of information toward finding a physical quantity linked to such events, but a second piece is needed, which has to come from measuring distances in space. Distances in space, like periods of time, are generally not objective physical facts; they are partly influenced by the observer. We must now explain how this happens. [Pg 52]

First of all, we have to think of the distance between two events, not between two bodies. This follows at once from what we have found as regards time. If two bodies are moving relatively to each other—and this is really always the case—the distance between them will be continually changing, so that we can only speak of the distance between them at a given time. If you are in a train traveling towards Edinburgh, we can speak of your distance from Edinburgh at a given time. But, as we said, different observers will judge differently as to what is the “same” time for an event in the train and an event in Edinburgh. This makes the measurement of distances relative, in just the same way as the measurement of times has been found to be relative. We commonly think that there are two separate kinds of interval between two events, an interval in space and an interval in time: between your [Pg 53] departure from London and your arrival in Edinburgh, there are 400 miles and ten hours. We have already seen that another observer will judge the time differently; it is even more obvious that he will judge the distance differently. An observer in the sun will think the motion of the train quite trivial, and will judge that you have traveled the distance traveled by the earth in its orbit and its diurnal rotation. On the other hand, a flea in the railway carriage will judge that you have not moved at all in space, but have afforded him a period of pleasure which he will measure by his “proper” time, not by Greenwich Observatory. It cannot be said that you or the sun dweller or the flea are mistaken: each is equally justified, and is only wrong if he ascribes an objective validity to his subjective measures. The distance in space between two events is, therefore, not in itself a physical fact. But, as we shall see, there is a physical fact which can be inferred from the distance in time together with the distance in space. This is what is called the “interval” in space-time.

First of all, we need to consider the distance between two events, rather than between two objects. This follows directly from our findings about time. If two objects are moving relative to each other—and they always are—the distance between them will constantly change, so we can only refer to the distance between them at a specific moment. If you're on a train heading toward Edinburgh, we can talk about your distance from Edinburgh at a certain time. But, as we mentioned, different observers will perceive what the "same" time is for an event on the train and an event in Edinburgh differently. This makes measuring distances relative, just like measuring times is relative. We usually think there are two distinct kinds of intervals between two events: a space interval and a time interval. Between your departure from London and your arrival in Edinburgh, there are 400 miles and ten hours. We've already established that another observer will perceive the time differently; it's even clearer that they will perceive the distance differently. An observer on the sun will consider the train's motion quite minor and will conclude that you've traveled the distance covered by the Earth in its orbit and its daily rotation. On the other hand, a flea in the train carriage will think you haven't moved at all in space but provided it a nice time, measured by its own "proper" time, not by Greenwich Mean Time. It's not accurate to say that you, the sun observer, or the flea are mistaken; each is justified in their view and is only wrong if they attribute an objective reality to their subjective measurements. The distance in space between two events is, therefore, not a physical fact in itself. However, as we will see, there's a physical fact that can be inferred from the time distance together with the space distance. This is known as the "interval" in space-time.

Taking any two events in the universe, there are two different possibilities as to the relation between them. It may be physically [Pg 54] possible for a body to travel so as to be present at both events, or it may not. This depends upon the fact that no body can travel as fast as light. Suppose, for example, that it were possible to send out a flash of light from the earth and have it reflected back from the moon. The time between the sending of the flash and the return of the reflection would be about two and a half seconds. No body could travel so fast as to be present on the earth during any part of those two and a half seconds and also present on the moon at the moment of the arrival of the flash, because in order to do so the body would have to travel faster than light. But theoretically a body could be present on the earth at any time before or after those two and a half seconds and also present on the moon at the time when the flash arrived. When it is physically impossible for a body to travel so as to be present at both events, we shall say that the interval[2] between the two events is “space-like”; when it is physically possible for a body to be present at both events, we shall say that the interval between the two events is “time-like.” When the interval is [Pg 55] “space-like,” it is possible for a body to move in such a way that an observer on the body will judge the two events to be simultaneous. In that case, the “interval” between the two events is what such an observer will judge to be the distance in space between them. When the interval is “time-like,” a body can be present at both events; in that case, the “interval” between the two events is what an observer on the body will judge to be the time between them, that is to say, it is his “proper” time between the two events. There is a limiting case between the two, when the two events are parts of one light flash—or, as we might say, when the one event is the seeing of the other. In that case, the interval between the two events is zero.

Considering any two events in the universe, there are two possible relationships between them. It might be physically possible for an object to be present at both events, or it might not be. This is due to the fact that no object can travel as fast as light. For instance, imagine sending a flash of light from Earth to the moon and having it reflected back. The time it takes for the flash to travel and return would be about two and a half seconds. No object could move fast enough to be on Earth during any part of those two and a half seconds and also be on the moon when the flash arrives because it would need to exceed the speed of light. However, theoretically, an object could be on Earth at any point before or after those two and a half seconds and also be on the moon when the flash arrives. When it is physically impossible for an object to be present at both events, we will refer to the interval between the two events as “space-like”; when it is physically possible for an object to be present at both events, we will call the interval “time-like.” When the interval is “space-like,” it is possible for an object to move in such a way that an observer on that object perceives the two events as simultaneous. In this case, the “interval” between the two events is what that observer would consider the distance in space between them. When the interval is “time-like,” an object can be present at both events; in this case, the “interval” is what an observer on that object would perceive as the time between them, which is his “proper” time between the two events. There is a boundary case between the two, where the two events are parts of one light flash—or, as we might put it, when one event is the observation of the other. In that case, the interval between the two events is zero.

There are thus three cases. (1) It may be possible for a ray of light to be present at both events; this happens whenever one of them is the seeing of the other. In this case the interval between the two events is zero. (2) It may happen that no body can travel from one event to the other, because in order to do so it would have to travel faster than light. In that case, it is always physically possible for a body to travel in such a way that an observer on the body would judge the two events to be simultaneous. The interval is what he would judge to [Pg 56] be the distance in space between the two events. Such an interval is called “space-like.” (3) It may be physically possible for a body to travel so as to be present at both events; in that case, the interval between them is what an observer on such a body will judge to be the time between them. Such an interval is called “time-like.”

There are three scenarios. (1) A ray of light might be present at both events; this occurs whenever one event is the observation of the other. In this situation, the interval between the two events is zero. (2) It’s possible that no object can travel from one event to the other, because doing so would require traveling faster than light. In this case, it's always physically possible for an object to move in such a way that someone on the object would consider the two events to be simultaneous. The interval is what they would perceive as the distance in space between the two events. This type of interval is called “space-like.” (3) It may be physically possible for an object to travel in a way that allows it to be present at both events; in that case, the interval between them is what an observer on that object would perceive as the time between them. This type of interval is called “time-like.”

The interval between two events is a physical fact about them, not dependent upon the particular circumstances of the observer.

The gap between two events is a physical reality about them, not reliant on the specific situation of the observer.

There are two forms of the theory of relativity, the special and the general. The former is in general only approximate, but is exact at great distances from gravitating matter. When the special theory can be applied, the interval can be calculated when we know the distance in space and the distance in time between the two events, estimated by any observer. If the distance in space is greater than the distance that [Pg 57] light would have traveled in the time, the separation is space-like. Then the following construction gives the interval between the two events: Draw a line AB as long as the distance that light would travel in the time; round A describe a circle whose radius is the distance in space between the two events; through B draw BC perpendicular to AB, meeting the circle in C. Then BC is the length of the interval between the two events.

There are two types of the theory of relativity: special and general. The special theory is usually only an approximation, but it's exact at large distances from any gravitational matter. When the special theory applies, we can calculate the interval when we know the spatial distance and the time distance between two events, as estimated by any observer. If the spatial distance is greater than the distance that light would travel in that time, the separation is considered space-like. Then the following construction determines the interval between the two events: Draw a line AB that represents the distance light would travel in the given time; around A, draw a circle with a radius equal to the spatial distance between the two events; through B, draw BC perpendicular to AB, which intersects the circle at C. Then BC is the length of the interval between the two events.

When the distance is time-like, use the same figure, but let AC be now the distance that light would travel in the time, while AB is the distance in space between the two events. The interval between them is now the time that light would take to travel the distance BC.

When the distance is time-like, use the same figure, but let AC be the distance that light would travel in that time, while AB is the distance in space between the two events. The interval between them is now the time it would take light to travel the distance BC.

Although AB and AC are different for different observers, BC is the same length for all observers, subject to corrections made by the general theory. It represents the one interval in “space-time” which replaces the two intervals in space and time of the older physics. So far, this notion of interval may appear somewhat mysterious, but as we proceed it will grow less so, and its reason in the nature of things will gradually emerge.

Although AB and AC vary for different observers, BC is the same length for everyone, with adjustments made by the general theory. It represents the single interval in “space-time” that replaces the two separate intervals in space and time from the earlier physics. At this point, the concept of interval might seem a bit mysterious, but as we move forward, it will become clearer, and its underlying reasons in the nature of things will gradually become apparent.


[Pg 58]

[Pg 58]

CHAPTER V:
SPACE-TIME

Everybody who has ever heard of relativity knows the phrase “space-time,” and knows that the correct thing is to use this phrase when formerly we should have said “space and time.” But very few people who are not mathematicians have any clear idea of what is meant by this change of phraseology. Before dealing further with the special theory of relativity, I want to try to convey to the reader what is involved in the new phrase “space-time,” because that is, from a philosophical and imaginative point of view, perhaps the most important of all the novelties that Einstein has introduced.

Everyone who has heard of relativity knows the term “space-time” and understands that it’s appropriate to use this term instead of saying “space and time.” However, very few people who aren’t mathematicians have a clear understanding of what this change in terminology means. Before diving deeper into the special theory of relativity, I want to explain what the new term “space-time” entails, as it’s possibly the most significant of all the new concepts that Einstein has brought to light from a philosophical and imaginative perspective.

Suppose you wish to say where and when some event has occurred—say an explosion on an airship—you will have to mention four quantities, say the latitude and longitude, the height above the ground, and the time. According to the traditional view, the first three of these give the position in space, while the fourth gives the position in [Pg 59] time. The three quantities that give the position in space may be assigned in all sorts of ways. You might, for instance, take the plane of the equator, the plane of the meridian of Greenwich, and the plane of the ninetieth meridian, and say how far the airship was from each of these planes; these three distances would be what are called “Cartesian co-ordinates,” after Descartes. You might take any other three planes all at right angles to each other, and you would still have Cartesian co-ordinates. Or you might take the distance from London to a point vertically below the airship, the direction of this distance (northeast, west-southwest, or whatever it might be), and the height of the airship above the ground. There are an infinite number of such ways of fixing the position in space, all equally legitimate; the choice between them is merely one of convenience.

If you want to indicate where and when an event happened—like an explosion on an airship—you'll need to reference four things: the latitude and longitude, the height above the ground, and the time. According to the traditional view, the first three provide the location in space, while the fourth provides the location in [Pg 59] time. The first three quantities that define the position in space can be specified in various ways. For example, you might use the plane of the equator, the plane of the Greenwich meridian, and the plane of the ninetieth meridian, and specify how far the airship was from each of these planes; these three distances would be known as “Cartesian coordinates,” named after Descartes. You could choose any other three planes that are all at right angles to one another, and you would still have Cartesian coordinates. Alternatively, you might determine the distance from London to a point directly below the airship, the direction of this distance (northeast, west-southwest, or whatever it might be), and the height of the airship above the ground. There are countless ways to define the position in space, all equally valid; the choice among them is simply a matter of convenience.

When people said that space had three dimensions, they meant just this: that three quantities were necessary in order to specify the position of a point in space, but that the method of assigning these quantities was wholly arbitrary.

When people said that space has three dimensions, they meant exactly this: that three values are needed to define the position of a point in space, but how those values are assigned is completely arbitrary.

With regard to time, the matter was thought to be quite different. The only arbitrary elements in the reckoning of time were the unit, and [Pg 60] the point of time from which the reckoning started. One could reckon in Greenwich time, or in Paris time, or in New York time; that made a difference as to the point of departure. One could reckon in seconds, minutes, hours, days, or years; that was a difference of unit. Both these were obvious and trivial matters. There was nothing corresponding to the liberty of choice as to the method of fixing position in space. And, in particular, it was thought that the method of fixing position in space and the method of fixing position in time could be made wholly independent of each other. For these reasons, people regarded time and space as quite distinct.

When it comes to time, people believed it was a different story. The only factors that were arbitrary in measuring time were the unit and the starting point of the measurement. You could use Greenwich time, Paris time, or New York time; that changed where you began. You could measure in seconds, minutes, hours, days, or years; that changed the unit. Both of these were obvious and trivial. There was nothing similar to the choice of method when it came to determining location in space. Moreover, it was believed that the methods for determining location in space and time could operate completely independently of each other. For these reasons, people saw time and space as separate and distinct. [Pg 60]

The theory of relativity has changed this. There are now a number of different ways of fixing position in time, which do not differ merely as to the unit and the starting point. Indeed, as we have seen, if one event is simultaneous with another in one reckoning, it will precede it in another, and follow it in a third. Moreover, the space and time reckonings are no longer independent of each other. If you alter the way of reckoning position in space, you may also alter the time [Pg 61] interval between two events. If you alter the way of reckoning time, you may also alter the distance in space between two events. Thus space and time are no longer independent, any more than the three dimensions of space are. We still need four quantities to determine the position of an event, but we cannot, as before, divide off one of the four as quite independent of the other three.

The theory of relativity has changed this. There are now several ways to define a position in time, which don’t just differ by the unit or starting point. In fact, as we've seen, if one event is simultaneous with another in one system, it may come before it in another and after it in a third. Additionally, the measurements of space and time are no longer independent of each other. If you change how you measure position in space, you might also change the time interval between two events. If you change how you measure time, you might also change the distance in space between two events. Thus, space and time are no longer separate from each other, just like the three dimensions of space aren't. We still need four quantities to determine the position of an event, but we can’t treat one of them as completely independent of the other three anymore. [Pg 61]

It is not quite true to say that there is no longer any distinction between time and space. As we have seen, there are time-like intervals and space-like intervals. But the distinction is of a different sort from that which was formerly assumed. There is no longer a universal time which can be applied without ambiguity to any part of the universe; there are only the various “proper” times of the various bodies in the universe, which agree approximately for two bodies which are not in rapid relative motion, but never agree exactly except for two bodies which are at rest relatively to each other.

It's not entirely accurate to say that there's no longer a difference between time and space. As we've seen, there are intervals that feel more like time and others that feel more like space. However, the distinction is not what it used to be. There's no longer a universal time that applies clearly to every part of the universe; instead, there are the different "proper" times of the various bodies in the universe. These times are roughly the same for two bodies that aren't moving quickly relative to each other, but they only match exactly when the two bodies are at rest with respect to each other.

The picture of the world which is required for this new state of affairs is as follows: Suppose an event E occurs to me, and simultaneously a flash of light goes out from me in all directions. [Pg 62] Anything that happens to any body after the light from the flash has reached it is definitely after the event E in any system of reckoning time. Any event anywhere which I could have seen before the event E occurred to me is definitely before the event E in any system of reckoning time. But any event which happened in the intervening time is not definitely either before or after the event E. To make the matter definite: suppose I could observe a person in Sirius, and he could observe me. Anything which he does, and which I see before the event E occurs to me, is definitely before E; anything he does after he has seen the event E is definitely after E. But anything that he does before he sees the event E, but so that I see it after the event E has happened, is not definitely before or after E. Since light takes many years to travel from Sirius to the earth, this gives a period of twice as many years in Sirius which may be called “contemporary” with E, since these years are not definitely before or after E.

The picture of the world needed for this new situation is as follows: Imagine an event E happens to me, and at the same time, a flash of light radiates from me in all directions. [Pg 62] Anything that happens to anyone after the light from the flash reaches them is definitely after event E in any timekeeping system. Any event that could have been observed before event E occurred to me is definitely before event E in any timekeeping system. However, any event that happens in the time in between is not definitively before or after event E. To clarify: if I can observe someone on Sirius and he can observe me, then anything he does that I see before event E occurs is definitely before E; anything he does after he sees event E is definitely after E. But anything he does before he sees event E, and that I see after event E has happened, is not definitely before or after E. Since light takes many years to travel from Sirius to Earth, this creates a period of twice as many years on Sirius that can be called “contemporary” with E, since those years are not definitively before or after E.

Dr. A. A. Robb, in his Theory of Time and Space, suggests a point of view which may or may not be philosophically fundamental, but is at any rate a help in understanding the state of affairs we [Pg 63] have been describing. He maintains that one event can only be said to be definitely before another if it can influence that other in some way. Now influences spread from a center at varying rates. Newspapers exercise an influence emanating from London at an average rate of about twenty miles an hour—rather more for long distances. Anything a man does because of what he reads in the newspaper is clearly subsequent to the printing of the newspaper. Sounds travel much faster: it would be possible to arrange a series of loud speakers along the main roads, and have newspapers shouted from each to the next. But telegraphing is quicker, and wireless telegraphy travels with the velocity of light, so that nothing quicker can ever be hoped for. Now what a man does in consequence of receiving a wireless message he does after the message was sent; the meaning here is quite independent of conventions as to the measurement of time. But anything that he does while the message is on its way cannot be influenced by the sending of the message, and cannot influence the sender until some little time after he sent the message. That is to say, if two bodies are widely separated, neither can influence the other except after a certain lapse of time; what happens before that time has elapsed [Pg 64] cannot affect the distant body. Suppose, for instance, that some notable event happens on the sun: there is a period of sixteen minutes on the earth during which no event on the earth can have influenced or been influenced by the said notable event on the sun. This gives a substantial ground for regarding that period of sixteen minutes on the earth as neither before nor after the event on the sun.

Dr. A. A. Robb, in his Theory of Time and Space, presents a perspective that may or may not be fundamentally philosophical, but it helps clarify the situation we've been discussing. He argues that one event can only be considered definitely before another if it has the potential to influence that other event in some way. Influences radiate from a source at different speeds. For example, newspapers have an influence that spreads from London at an average speed of about twenty miles an hour, and even faster for long distances. Any action a person takes as a result of reading a newspaper clearly occurs after the newspaper is printed. Sounds travel much faster; it's feasible to set up a series of loudspeakers along main roads to shout newspapers from one to the next. But telegraphy is quicker, and wireless telegraphy moves at the speed of light, so nothing faster is possible. Now, anything a person does in response to receiving a wireless message happens after the message was sent; this interpretation is independent of how we measure time. However, anything a person does while the message is en route cannot be influenced by the sending of the message and cannot affect the sender until some time has passed after the message was sent. In other words, if two objects are far apart, neither can influence the other until some time passes; what happens before that time has elapsed [Pg 64] cannot impact the distant object. For instance, if a significant event occurs on the sun, there is a sixteen-minute interval on Earth during which no event on Earth can have influenced or been influenced by that notable event on the sun. This provides a solid basis for considering that sixteen-minute period on Earth as neither before nor after the event on the sun.

The paradoxes of the special theory of relativity are only paradoxes because we are unaccustomed to the point of view, and in the habit of taking things for granted when we have no right to do so. This is especially true as regards the measurement of lengths. In daily life, our way of measuring lengths is to apply a foot rule or some other measure. At the moment when the foot rule is applied, it is at rest relatively to the body which is being measured. Consequently the length that we arrive at by measurement is the “proper” length, that is to say, the length as estimated by an observer who shares the motion of the body. We never, in ordinary life, have to tackle the problem of [Pg 65] measuring a body which is in continual motion. And even if we did, the velocities of visible bodies on the earth are so small relatively to the earth that the anomalies dealt with by the theory of relativity would not appear. But in astronomy, or in the investigation of atomic structure, we are faced with problems which cannot be tackled in this way. Not being Joshua, we cannot make the sun stand still while we measure it; if we are to estimate its size, we must do so while it is in motion relatively to us. And similarly if you want to estimate the size of an electron, you have to do so while it is in rapid motion, because it never stands still for a moment. This is the sort of problem with which the theory of relativity is concerned. Measurement with a foot rule, when it is possible, gives always the same result, because it gives the “proper” length of a body. But when this method is not possible, we find that curious things happen, particularly if the body to be measured is moving very fast relatively to the observer. A figure like the one at the end of the previous chapter will help us to understand the state of affairs. [Pg 66]

The paradoxes of the special theory of relativity only seem like paradoxes because we aren't used to that perspective and tend to take things for granted when we shouldn't. This is especially true when it comes to measuring lengths. In everyday life, we measure lengths using a foot ruler or some other tool. When we use the ruler, it’s stationary relative to the object being measured. As a result, the length we obtain is the “proper” length, meaning it’s the measurement made by an observer who is moving along with the object. We never really have to deal with measuring something that’s constantly in motion in our daily lives. Even if we did, the speeds of visible objects on Earth are so slow compared to it that the oddities described by relativity wouldn’t show up. But in fields like astronomy or atomic research, we encounter situations that can’t be handled this way. Unlike Joshua, we can’t make the sun stop moving while we measure it; if we want to determine its size, we have to do it while it’s in motion relative to us. Similarly, to measure the size of an electron, we must consider it while it’s moving quickly because it never stays still for a moment. This is the type of issue that the theory of relativity addresses. Using a foot ruler, when it’s possible, always gives the same result because it provides the “proper” length of an object. However, when this approach isn’t possible, we discover that strange things happen, especially if the object being measured is moving very fast relative to the observer. A diagram like the one at the end of the previous chapter will help clarify the situation.

Let us suppose that the body on which we wish to measure lengths is moving relatively to ourselves, and that in one second it moves the distance OM. Let us draw a circle round O whose radius is the distance that light travels in a second. Through M draw MP perpendicular to OM, meeting the circle in P. Thus OP is the distance that light travels in a second. The ratio of OP to OM is the ratio of the velocity of light to the velocity of the body. The ratio of OP to MP is the ratio in which apparent lengths are altered by the motion. That is to say, if the observer judges that two points in the line of motion on the moving body are at a distance from each other represented by MP, a person moving with the body would judge that they were at a distance represented (on the same scale) by OP. Distances on the moving body at right angles to the line of motion are not affected by the motion. The whole thing is reciprocal; that is to say, if an observer moving with the body were to measure lengths on the previous observer’s body, they would be altered in just the same proportion. When two bodies are moving relatively to each other, lengths on either [Pg 67] appear shorter to the other than to themselves. This is the Fitzgerald contraction, which was first invented to account for the result of the Michelson-Morley experiment. But it now emerges naturally from the fact that the two observers do not make the same judgment of simultaneity.

Let’s say the body we want to measure lengths on is moving relative to us, and in one second it covers the distance OM. Let’s draw a circle draw a circle around O whose radius is the distance that light travels in a second. From M, draw MP perpendicular to OM, intersecting the circle at P. So, OP represents the distance that light travels in one second. The ratio of OP to OM is the ratio of the speed of light to the speed of the body. The ratio of OP to MP is how motion changes perceived lengths. In other words, if the observer thinks that two points along the line of motion on the moving body are separated by a distance represented by MP, a person moving with the body would perceive the distance as OP (on the same scale). Distances on the moving body that are perpendicular to the direction of motion aren’t affected by this motion. It’s reciprocal; if an observer moving with the body measures lengths on the previous observer’s body, they would appear altered by the same proportion. When two bodies are moving relative to each other, lengths on either one appear shorter to the other than to themselves. This is the Fitzgerald contraction, originally created to explain the results of the Michelson-Morley experiment. Now it naturally arises from the fact that the two observers don’t agree on simultaneous events.

The way in which simultaneity comes in is this: We say that two points on a body are a foot apart when we can simultaneously apply one end of a foot rule to the one and the other end to the other. If, now, two people disagree about simultaneity, and the body is in motion, they will obviously get different results from their measurements. Thus the trouble about time is at the bottom of the trouble about distance.

The way simultaneity works is like this: We say that two points on an object are a foot apart when we can simultaneously place one end of a foot ruler on one point and the other end on the other point. If two people have different opinions about simultaneity and the object is moving, they will clearly come up with different measurements. Therefore, the issues with time are fundamentally connected to the issues with distance.

The ratio of OP to MP is the essential thing in all these matters. Times and lengths and masses are all altered in this proportion when the body concerned is in motion relatively to the observer. It will be seen that, if OM is very much smaller than OP, that is to say, if the body is moving very much more slowly than light, MP and OP are very nearly equal, so that the alterations produced by the motion are very small. But if OM is [Pg 68] nearly as large as OP, that is to say, if the body is moving nearly as fast as light, MP becomes very small compared to OP, and the effects become very great. The apparent increase of mass in swiftly moving particles had been observed, and the right formula had been found, before Einstein invented his special theory of relativity. In fact, Lorentz had arrived at the formulæ called the “Lorentz transformation,” which embody the whole mathematical essence of the special theory of relativity. But it was Einstein who showed that the whole thing was what we ought to have expected, and not a set of makeshift devices to account for surprising experimental results. Nevertheless, it must not be forgotten that experimental results were the original motive of the whole theory, and have remained the ground for undertaking the tremendous logical reconstruction involved in Einstein’s theories.

The ratio of OP to MP is the key factor in all these discussions. Times, lengths, and masses all change in this proportion when the body in question is moving relative to the observer. It will be clear that if OM is much smaller than OP, meaning the body is moving much slower than light, MP and OP are almost equal, so the changes caused by the motion are minimal. But if OM is nearly as large as OP, meaning the body is moving nearly as fast as light, MP becomes very small compared to OP, and the effects become significant. The apparent increase in mass of fast-moving particles had been observed, and the correct formula had been discovered before Einstein introduced his special theory of relativity. In fact, Lorentz had developed the formulas known as the “Lorentz transformation,” which capture the entire mathematical essence of the special theory of relativity. However, it was Einstein who demonstrated that this was exactly what we should have expected, rather than a collection of temporary solutions to unexpected experimental results. Still, it should not be overlooked that experimental results were the original motivation for the entire theory and have continued to be the basis for the substantial logical reconstruction involved in Einstein’s theories.

We may now recapitulate the reasons which have made it necessary to substitute “space-time” for space and time. The old separation of space and time rested upon the belief that there was no ambiguity in saying that two events in distant places happened at the same time; [Pg 69] consequently it was thought that we could describe the topography of the universe at a given instant in purely spatial terms. But now that simultaneity has become relative to a particular observer, this is no longer possible. What is, for one observer, a description of the state of the world at a given instant, is, for another observer, a series of events at various different times, whose relations are not merely spatial but also temporal. For the same reason, we are concerned with events, rather than with bodies. In the old theory, it was possible to consider a number of bodies all at the same instant, and since the time was the same for all of them it could be ignored. But now we cannot do that if we are to obtain an objective account of physical occurrences. We must mention the date at which a body is to be considered, and thus we arrive at an “event,” that is to say, something which happens at a given time. When we know the time and place of an event in one observer’s system of reckoning, we can calculate its time and place according to another observer. But we must know the time as well as the place, because we can no longer ask what is its place for the new observer at the “same” time as for the old observer. There is [Pg 70] no such thing as the “same” time for different observers, unless they are at rest relatively to each other. We need four measurements to fix a position, and four measurements fix the position of an event in space-time, not merely of a body in space. Three measurements are not enough to fix any position. That is the essence of what is meant by the substitution of space-time for space and time.

We can now summarize the reasons why we need to replace “space-time” with just space and time. The previous division between space and time was based on the assumption that there was no confusion in claiming that two events happening in different places occurred at the same time; [Pg 69] therefore, it was believed we could describe the layout of the universe at any given moment purely in spatial terms. However, since simultaneity has become relative to a specific observer, this is no longer feasible. What one observer perceives as a description of the world at a certain moment is viewed by another observer as a series of events at different times, where the connections are not only spatial but also temporal. For this reason, we focus on events rather than bodies. In the old theory, it was possible to consider several bodies all at the same moment, and since the time was the same for all of them, it could be disregarded. But now, if we want to obtain an objective account of physical occurrences, we cannot do that. We must specify the date when a body is being considered, leading us to the concept of an “event,” meaning something that occurs at a specific time. Once we know the time and place of an event according to one observer’s system, we can figure out its time and place in another observer's system. But we need to know both time and place, as we can no longer simply ask what its location is for the new observer at the “same” time as for the old observer. There is [Pg 70] no such thing as the “same” time for different observers unless they are at rest relative to one another. We need four measurements to determine a position, and four measurements pinpoint the position of an event in space-time, not merely of a body in space. Three measurements are insufficient to establish any position. That is the core meaning behind substituting space-time for space and time.


[Pg 71]

[Pg 71]

CHAPTER VI:
THE SPECIAL THEORY
OF RELATIVITY

The special theory of relativity arose as a way of accounting for the facts of electromagnetism. We have here a somewhat curious history. In the eighteenth and early nineteenth centuries the theory of electricity was wholly dominated by the Newtonian analogy. Two electric charges attract each other if they are of different kinds, one positive and one negative, but repel each other if they are of the same kind; in each case, the force varies as the inverse square of the distance, as in the case of gravitation. This force was conceived as an action at a distance, until Faraday, by a number of remarkable experiments, demonstrated the effect of the intervening medium. Faraday was no mathematician; Clerk Maxwell first gave a mathematical form to the results suggested by Faraday’s experiments. Moreover Clerk Maxwell gave grounds for thinking that light is an electromagnetic phenomenon, [Pg 72] consisting of electromagnetic waves. The medium for the transmission of electromagnetic effects could therefore be taken to be the ether, which had long been assumed for the transmission of light. The correctness of Maxwell’s theory of light was proved by the experiments of Hertz in manufacturing electromagnetic waves; these experiments afforded the basis for wireless telegraphy. So far, we have a record of triumphant progress, in which theory and experiment alternately assume the leading role. At the time of Hertz’s experiments, the ether seemed securely established, and in just as strong a position as any other scientific hypothesis not capable of direct verification. But a new set of facts began to be discovered, and gradually the whole picture was changed.

The special theory of relativity came about as a way to explain the principles of electromagnetism. The history here is quite interesting. In the 1700s and early 1800s, the theory of electricity was completely influenced by Newtonian principles. Two electric charges attract each other if they are different types, one being positive and the other negative, but they repel each other if they are the same type; in both cases, the force changes inversely with the square of the distance, similar to gravity. This force was seen as an action at a distance until Faraday, through several impressive experiments, showed the role of the medium between them. Faraday wasn’t a mathematician; it was Clerk Maxwell who first provided a mathematical framework for the results of Faraday’s experiments. Additionally, Clerk Maxwell suggested that light is an electromagnetic phenomenon, consisting of electromagnetic waves. Therefore, the medium for transmitting electromagnetic effects could be considered as the ether, which had long been thought to be necessary for the transmission of light. The validity of Maxwell’s theory of light was supported by Hertz's experiments in creating electromagnetic waves; these experiments laid the groundwork for wireless telegraphy. Up to this point, we have an account of successful progress, where theory and experiment take turns leading the way. When Hertz conducted his experiments, the ether appeared to be firmly established and at least as reliable as any other scientific theory that couldn’t be directly tested. However, a new set of discoveries began to emerge, and gradually, the entire picture changed.

The movement which culminated with Hertz was a movement for making everything continuous. The ether was continuous, the waves in it were continuous, and it was hoped that matter would be found to consist of some continuous structure in the ether. Then came the discovery of the electron, a small finite unit of negative electricity, and the proton, a small finite unit of positive electricity. The most modern view is that electricity is never found except in the form of [Pg 73] electrons and protons; all electrons have the same amount of negative electricity, and all protons have an exactly equal and opposite amount of positive electricity. It appeared that an electric current, which had been thought of as a continuous phenomenon, consists of electrons traveling one way and positive ions traveling the other way; it is no more strictly continuous than the stream of people going up and down an escalator. Then came the discovery of quanta, which seems to show a fundamental discontinuity in all such natural processes as can be measured with sufficient precision. Thus physics has had to digest new facts and face new problems.

The movement that concluded with Hertz aimed to make everything continuous. The ether was seen as continuous, the waves within it were continuous, and there was hope that matter would be found to consist of some continuous structure in the ether. Then, the electron was discovered—a tiny, discrete unit of negative electricity—and the proton, a tiny, discrete unit of positive electricity. The most current understanding is that electricity only exists in forms of electrons and protons; all electrons carry the same amount of negative electricity, and all protons have an exactly equal and opposite amount of positive electricity. It turned out that electric current, which had been considered a continuous phenomenon, consists of electrons moving in one direction and positive ions moving in the opposite direction; it's not really any more continuous than a stream of people going up and down an escalator. Then came the discovery of quanta, indicating a fundamental discontinuity in all measurable natural processes. So, physics has had to absorb new facts and confront new challenges.

But the problems raised by the electron and the quantum are not those that the theory of relativity can solve, at any rate at present; as yet, it throws no light upon the discontinuities which exist in nature. The problems solved by the special theory of relativity are typified by the Michelson-Morley experiment. Assuming the correctness of Maxwell’s theory of electromagnetism, there should have been certain discoverable effects of motion through the ether; in fact, there were none. Then [Pg 74] there was the observed fact that a body in very rapid motion appears to increase its mass; the increase is in the ratio of OP to MP in the figure in the preceding chapter. Facts of this sort gradually accumulated, until it became imperative to find some theory which would account for them all.

But the issues raised by electrons and quantum mechanics can't be addressed by the theory of relativity, at least not for now; it still doesn't explain the discontinuities found in nature. The problems tackled by the special theory of relativity are exemplified by the Michelson-Morley experiment. If Maxwell’s theory of electromagnetism is correct, there should have been observable effects of motion through the ether; in reality, there were none. Then, there’s the fact that a body moving at very high speeds seems to increase its mass; this increase is in the ratio of OP to MP in the figure in the preceding chapter. Facts like these gradually piled up, making it essential to develop a theory that could explain them all.

Maxwell’s theory reduced itself to certain equations, known as “Maxwell’s equations.” Through all the revolutions which physics has undergone in the last fifty years, these equations have remained standing; indeed they have continually grown in importance as well as in certainty—for Maxwell’s arguments in their favor were so shaky that the correctness of his results must almost be ascribed to intuition. Now these equations were, of course, obtained from experiments in terrestrial laboratories, but there was a tacit assumption that the motion of the earth through the ether could be ignored. In certain cases, such as the Michelson-Morley experiment, this ought not to have been possible without measurable error; but it turned out to be always possible. Physicists were faced with the odd difficulty that Maxwell’s equations were more accurate than they should be. A very similar difficulty was explained by Galileo at the very beginning of modern [Pg 75] physics. Most people think that if you let a weight drop it will fall vertically. But if you try the experiment in the cabin of a moving ship, the weight falls, in relation to the cabin, just as if the ship were at rest; for instance, if it starts from the middle of the ceiling it will drop onto the middle of the floor. That is to say, from the point of view of an observer on the shore it does not fall vertically, since it shares the motion of the ship. So long as the ship’s motion is steady, everything goes on inside the ship as if the ship were not moving. Galileo explained how this happens, to the great indignation of the disciples of Aristotle. In orthodox physics, which is derived from Galileo, a uniform motion in a straight line has no discoverable effects. This was, in its day, as astonishing a form of relativity as that of Einstein is to us. Einstein, in the special theory of relativity, set to work to show how electromagnetic phenomena could be unaffected by uniform motion through the ether if there be an ether. This was a more difficult problem, which could not be solved by merely adhering to the principles of Galileo.

Maxwell’s theory came down to a set of equations known as “Maxwell’s equations.” Throughout the revolutions in physics over the last fifty years, these equations have not only persisted but have also increased in significance and reliability—though Maxwell’s reasoning for them was rather weak, so the accuracy of his results could almost be attributed to intuition. These equations were, of course, derived from experiments conducted in earthly labs, but there was an unspoken assumption that Earth's movement through the ether could be overlooked. In some instances, such as the Michelson-Morley experiment, this shouldn't have been feasible without noticeable error, but it turned out to always be possible. Physicists encountered the strange problem that Maxwell’s equations were more precise than expected. A similar issue was explained by Galileo at the very dawn of modern physics. Most people believe that if you drop a weight, it will fall straight down. But if you conduct the experiment in the cabin of a moving boat, the weight falls, in relation to the cabin, just like it would if the boat were still; for instance, if it’s dropped from the middle of the ceiling, it will land in the middle of the floor. This means that from the perspective of someone on the shore, it doesn’t fall straight down since it carries the boat's motion. As long as the boat’s movement is consistent, everything inside it behaves as if it were stationary. Galileo clarified this, much to the dismay of Aristotle’s followers. In traditional physics, which stems from Galileo, constant motion in a straight line has no observable effects. This was, in its time, as surprising a notion of relativity as Einstein’s theories are to us today. Einstein, in his special theory of relativity, aimed to demonstrate how electromagnetic phenomena could remain unaffected by uniform motion through the ether if such an ether existed. This was a tougher challenge that couldn’t be resolved simply by sticking to Galileo’s principles.

The really difficult effort required for solving this problem was in [Pg 76] regard to time. It was necessary to introduce the notion of “proper” time which we have already considered, and to abandon the old belief in one universal time. The quantitative laws of electromagnetic phenomena are expressed in Maxwell’s equations, and these equations are found to be true for any observer, however he may be moving.[3] It is a straight-forward mathematical problem to find out what differences there must be between the measures applied by one observer and the measures applied by another, if, in spite of their relative motion, they are to find the same equations verified. The answer is contained in the “Lorentz transformation,” found as a formula by Lorentz, but interpreted and made intelligible by Einstein.

The real challenge in solving this problem was related to time. We had to introduce the idea of “proper” time, which we've already discussed, and let go of the old belief in a single universal time. The quantitative laws of electromagnetic phenomena are captured in Maxwell’s equations, and these equations hold true for any observer, no matter how they're moving. It’s a straightforward mathematical challenge to determine the differences that must exist between the measurements of one observer and those of another, so that despite their relative motion, they can validate the same equations. The solution is found in the “Lorentz transformation,” formulated by Lorentz but clarified and made understandable by Einstein.

The Lorentz transformation tells us what estimate of distances and periods of time will be made by an observer whose relative motion is known, when we are given those of another observer. We may suppose that you are in a train on a railway which travels due east. You have been traveling for a time which, by the clocks at the station from which you started, is t. At a distance x from your starting point, as measured by the people on the line, an event occurs at this [Pg 77] moment—say the line is struck by lightning. You have been traveling all the time with a uniform velocity v. The question is: How far from you will you judge that this event has taken place, and how long after you started will it be by your watch, assuming that your watch is correct from the point of view of an observer on the train?

The Lorentz transformation explains how distances and time intervals will be perceived by an observer moving relative to another observer whose measurements are known. Imagine you’re on a train heading due east. You’ve been traveling for a duration, according to the clocks at the station where you began, of t. At a distance x from your starting location, as seen by the people along the train tracks, an event happens at this moment—let’s say lightning strikes the track. You’ve been moving at a constant speed of v. The question is: How far away from you do you think this event happened, and how much time has passed according to your watch since you started, assuming your watch is accurate from the perspective of an observer on the train? [Pg 77]

Our solution of this problem has to satisfy certain conditions. It has to bring out the result that the velocity of light is the same for all observers, however they may be moving. And it has to make physical phenomena—in particular, those of electromagnetism—obey the same laws for different observers, however they may find their measures of distances and times affected by their motion. And it has to make all such effects on measurement reciprocal. That is to say, if you are in a train and your motion affects your estimate of distances outside the train, there must be an exactly similar change in the estimate which people outside the train make of distances inside it. These conditions are sufficient to determine the solution of the problem, but the [Pg 78] method of obtaining the solution cannot be explained without more mathematics than is possible in the present work.

Our solution to this problem has to meet certain criteria. It needs to show that the speed of light is the same for all observers, regardless of how they are moving. It must also ensure that physical phenomena—specifically, those related to electromagnetism—follow the same laws for different observers, no matter how their measurements of distances and times are influenced by their motion. Additionally, it has to ensure that all such measurement effects are reciprocal. This means that if you’re on a train and your motion impacts your perception of distances outside the train, there must be an exactly similar change in the perception that people outside the train have of distances inside it. These conditions are enough to determine the solution to the problem, but the method to obtain the solution requires more mathematics than can be covered in this work. [Pg 78]

Before dealing with the matter in general terms, let us take an example. Let us suppose that you are in a train on a long straight railway, and that you are traveling at three-fifths of the velocity of light. Suppose that you measure the length of your train, and find that it is a hundred yards. Suppose that the people who catch a glimpse of you as you pass succeed, by skilful scientific methods, in taking observations which enable them to calculate the length of your train. If they do their work correctly, they will find that it is eighty yards long. Everything in the train will seem to them shorter in the direction of the train than it does to you. Dinner plates, which you see as ordinary circular plates, will look to the outsider as if they were oval: they will seem only four-fifths as broad in the direction in which the train is moving as in the direction of the breadth of the train. And all this is reciprocal. Suppose you see out of the window a man carrying a fishing rod which, by his measurement, is fifteen feet long. If he is holding it upright, you will see it as he does; so you [Pg 79] will if he is holding it horizontally at right angles to the railway. But if he is pointing it along the railway, it will seem to you to be only twelve feet long. All lengths in the direction of motion are diminished by twenty per cent, both for those who look into the train from outside and for those who look out of the train from inside.

Before we discuss the topic in broader terms, let’s consider an example. Imagine you’re in a train on a long, straight track, traveling at three-fifths the speed of light. You measure the length of your train and find it to be one hundred yards long. People who catch a glimpse of you as you pass by manage, using advanced scientific methods, to calculate the length of your train. If they do it right, they’ll find it’s only eighty yards long. Everything inside the train will appear shorter to them in the direction of travel than it does to you. Dinner plates, which you see as regular circular plates, will look to the outside observer as if they’re oval; they will appear to be only four-fifths as wide in the direction the train is moving compared to the direction that is perpendicular to it. And this effect is mutual. If you see a man outside the window holding a fishing rod that measures fifteen feet long by his standards, you’ll perceive it as he does if he’s holding it upright or horizontally at a right angle to the railway. However, if he’s pointing it along the railway, it will seem to you to be only twelve feet long. All lengths in the direction of motion appear reduced by twenty percent, both for those outside looking into the train and for those inside looking out.

But the effects in regard to time are even more strange. This matter has been explained with almost ideal lucidity by Eddington in Space, Time and Gravitation. He supposes an aviator traveling, relatively to the earth, at a speed of 161,000 miles a second, and he says:

But the effects related to time are even stranger. Eddington explains this matter with almost perfect clarity in Space, Time and Gravitation. He imagines a pilot flying, relative to the Earth, at a speed of 161,000 miles per second, and he states:

“If we observed the aviator carefully we should infer that he was unusually slow in his movements; and events in the conveyance moving with him would be similarly retarded—as though time had forgotten to go on. His cigar lasts twice as long as one of ours. I said ‘infer’ deliberately; we should see a still more extravagant slowing down of time; but that is easily explained, because the aviator is rapidly increasing his distance from us and the light impressions take longer and longer to reach us. The more moderate retardation referred to remains after we have allowed for the time of transmission of [Pg 80] light. But here again reciprocity comes in, because in the aviator’s opinion it is we who are traveling at 161,000 miles a second past him; and when he has made all allowances, he finds that it is we who are sluggish. Our cigar lasts twice as long as his.”

“If we carefully observe the pilot, we would conclude that he moves unusually slowly; and the events happening around him would feel similarly sluggish—as if time has forgotten to keep moving. His cigar lasts twice as long as ours. I used the word ‘conclude’ on purpose; we would actually perceive an even more dramatic slowing down of time; but that’s easy to explain since the pilot is quickly putting distance between us, and the light signals take longer and longer to reach us. The more moderate delay we notice remains after we account for the time it takes for light to travel. But once again, this goes both ways, because from the pilot’s perspective, it’s us who are traveling at 161,000 miles per second past him; and when he makes his adjustments, he finds that it’s us who are slow. Our cigar lasts twice as long as his.”

What a situation for envy! Each man thinks that the other’s cigar lasts twice as long as his own. It may, however, be some consolation to reflect that the other man’s visits to the dentist also last twice as long.

What a situation to be jealous of! Each guy believes that the other’s cigar lasts twice as long as his own. However, it might be a bit comforting to think that the other guy’s trips to the dentist also take twice as long.

This question of time is rather intricate, owing to the fact that events which one man judges to be simultaneous another considers to be separated by a lapse of time. In order to try to make clear how time is affected, I shall revert to our railway train traveling due east at a rate three-fifths of that of light. For the sake of illustration, I assume that the earth is large and flat, instead of small and round.

This question of time is quite complex because what one person sees as happening at the same time, another sees as happening with an interval in between. To clarify how time is influenced, I'll go back to our train traveling east at a speed that's three-fifths the speed of light. For the sake of this example, let's assume the earth is large and flat, rather than small and round.

If we take events which happen at a fixed point on the earth, and ask ourselves how long after the beginning of the journey they will seem to be to the traveler, the answer is that there will be that retardation that Eddington speaks of, which means in this case that what seems an [Pg 81] hour in the life of the stationary person is judged to be an hour and a quarter by the man who observes him from the train. Reciprocally, what seems an hour in the life of the person in the train is judged by the man observing him from outside to be an hour and a quarter. Each makes periods of time observed in the life of the other a quarter as long again as they are to the person who lives through them. The proportion is the same in regard to times as in regard to lengths.

If we look at events happening at a specific location on Earth and consider how long they seem to a traveler after starting their journey, the answer includes the delay that Eddington mentions. This means that what appears to be an hour for the stationary person is perceived as an hour and a quarter by the traveler observing from the train. Conversely, what seems to be an hour for the person in the train is viewed by the observer outside as an hour and a quarter. Each person sees the time periods in the other's life as being a quarter longer than they are for the person experiencing them. The ratio is the same for time as it is for distance. [Pg 81]

But when, instead of comparing events at the same place on the earth, we compare events at widely separated places, the results are still more odd. Let us now take all the events along the railway which, from the point of view of a person who is stationary on the earth, happen at a given instant, say the instant when the observer in the train passes the stationary person. Of these events, those which occur at points towards which the train is moving will seem to the traveler to have already happened, while those which occur at points behind the train will, for him, be still in the future. When I say that events in the forward direction will seem to have already happened, I am saying something not strictly accurate, because he will not yet have [Pg 82] seen them; but when he does see them, he will, after allowing for the velocity of light, come to the conclusion that they must have happened before the moment in question. An event which happens in the forward direction along the railway, and which the stationary observer judges to be now (or rather, will judge to have been now when he comes to know of it), if it occurs at a distance along the line which light could travel in a second, will be judged by the traveler to have occurred three-quarters of a second ago. If it occurs at a distance from the two observers which the man on the earth judges that light could travel in a year, the traveler will judge (when he comes to know of it) that it occurred nine months earlier than the moment when he passed the earth dweller. And generally, he will ante-date events in the forward direction along the railway by three-quarters of the time that it would take light to travel from them to the man on the earth whom he is just passing, and who holds that these events are happening now—or rather, will hold that they happened now when the light from them reaches him. Events happening on the railway behind the train will be post-dated by an exactly equal amount. [Pg 83]

But when we compare events happening in far-apart locations instead of the same spot on Earth, the results become even stranger. Let’s look at all the events happening along the railway from the perspective of someone standing still on the ground, at a specific moment—let’s say the moment when the observer in the train passes by the stationary person. For the traveler, the events occurring at the points the train is heading toward will seem to have already happened, while those at points behind the train will feel like they are still in the future. When I say that the forward events will seem to have already happened, it's not completely accurate, because he hasn’t seen them yet; but when he does see them, he will, accounting for the speed of light, reason that they must have happened before that specific moment. An event that occurs ahead along the railway, which the stationary observer believes is happening now (or rather, will think happened when he learns about it), if it takes place at a distance light can cover in a second, will be considered by the traveler to have occurred three-quarters of a second ago. If it happens at a distance that the person on Earth estimates light could travel in a year, the traveler will assume (when he learns about it) that it happened nine months before he passed the Earth dweller. Generally, he will date events in the forward direction along the railway as occurring three-quarters of the time it would take light to travel from them to the person on Earth he's just passed, who thinks those events are happening now—or will think they happened now when the light from them reaches him. Events occurring on the railway behind the train will be dated similarly, but with the exact same delay.

We have thus a two-fold correction to make in the date of an event when we pass from the terrestrial observer to the traveler. We must first take five-fourths of the time as estimated by the earth dweller, and then subtract three-fourths of the time that it would take light to travel from the event in question to the earth dweller.

We have a two-part adjustment to make in the date of an event when we switch from the view of someone on the ground to that of a traveler. First, we need to take one and a quarter times the duration estimated by the person on Earth, and then subtract three-quarters of the time it takes for light to travel from the event in question to the person on Earth.

Take some event in a distant part of the universe, which becomes visible to the earth dweller and the traveler just as they pass each other. The earth dweller, if he knows how far off the event occurred, can judge how long ago it occurred, since he knows the speed of light. If the event occurred in the direction towards which the traveler is moving, the traveler will infer that it happened twice as long ago as the earth dweller thinks. But if it occurred in the direction from which he has come, he will argue that it happened only half as long ago as the earth dweller thinks. If the traveler moves at a different speed, these proportions will be different.

Take an event happening in a far-off part of the universe that becomes visible to the person on Earth and the traveler just as they pass each other. The person on Earth, if they know how far away the event took place, can figure out how long ago it happened, since they understand the speed of light. If the event occurred in the direction the traveler is heading, the traveler will conclude that it happened twice as long ago as the Earth dweller thinks. But if it happened in the direction the traveler came from, they will argue that it happened only half as long ago as the Earth dweller believes. If the traveler moves at a different speed, these ratios will change.

Suppose now that (as sometimes occurs) two new stars have suddenly flared up, and have just become visible to the traveler and to the earth dweller whom he is passing. Let one of them be in the direction towards which the train is traveling, the other in the direction from [Pg 84] which it has come. Suppose that the earth dweller is able, in some way, to estimate the distance of the two stars, and to infer that light takes fifty years to reach him from the one in the direction towards which the traveler is moving, and one hundred years to reach him from the other. He will then argue that the explosion which produced the new star in the forward direction occurred fifty years ago, while the explosion which produced the other new star occurred a hundred years ago. The traveler will exactly reverse these figures: he will infer that the forward explosion occurred a hundred years ago, and the backward one fifty years ago. I assume that both argue correctly on correct physical data. In fact, both are right, unless they imagine that the other must be wrong. It should be noted that both will have the same estimate of the velocity of light, because their estimates of the distances of the two new stars will vary in exactly the same proportion as their estimates of the times since the explosions. Indeed, one of the main motives of this whole theory is to secure that the velocity of light shall be the same for all observers, however they may be moving. This fact, established by experiment, was incompatible [Pg 85] with the old theories, and made it absolutely necessary to admit something startling. The theory of relativity is just as little startling as is compatible with the facts. Indeed, after a time, it ceases to seem startling at all.

Suppose now that, as sometimes happens, two new stars have suddenly appeared and have just become visible to the traveler and the person on Earth that he is passing. Let one star be in the direction the train is traveling, and the other in the direction from which it has come. Imagine that the Earth dweller is somehow able to estimate the distances to the two stars and infers that it takes fifty years for light to reach him from the one in the direction the traveler is moving, and one hundred years for light from the other. He would then conclude that the explosion that created the new star in front happened fifty years ago, while the explosion that created the other new star occurred a hundred years ago. The traveler will completely reverse these figures: he will deduce that the explosion in front happened a hundred years ago and the one behind happened fifty years ago. I assume that both are reasoning correctly based on accurate physical data. In fact, both are correct unless they believe the other must be mistaken. It’s important to note that both will have the same estimate of the speed of light because their estimates of the distances to the two new stars will change in exactly the same way as their estimates of the time since the explosions. Indeed, one of the main goals of this theory is to ensure that the speed of light is the same for all observers, no matter how they are moving. This fact, confirmed by experiments, was incompatible with the old theories and made it absolutely necessary to accept something surprising. The theory of relativity is no more surprising than what is consistent with the facts. In fact, after a while, it stops seeming surprising at all.

There is another feature of very great importance in the theory we have been considering, and that is that, although distances and times vary for different observers, we can derive from them the quantity called “interval,” which is the same for all observers. The “interval,” in the special theory of relativity, is obtained as follows: Take the square of the distance between two events, and the square of the distance traveled by light in the time between the two events; subtract the lesser of these from the greater, and the result is defined as the square of the interval between the events. The interval is the same for all observers, and represents a genuine physical relation between the two events, which the time and the distance do not. We have already given a geometrical construction for the interval at the end of Chapter IV; this gives the same result as the above rule. The interval is “time-like” when the time between the events is longer than [Pg 86] light would take to travel from the place of the one to the place of the other; in the contrary case it is “space-like.” When the time between the two events is exactly equal to the time taken by light to travel from one to the other, the interval is zero; the two events are then situated on parts of one light ray, unless no light happens to be passing that way.

There’s another crucial aspect of the theory we’ve been discussing, which is that while distances and times can differ for different observers, we can derive a value called “interval,” which is the same for everyone. The “interval” in the special theory of relativity is obtained this way: Take the square of the distance between two events, and the square of the distance light travels during the time between those events; subtract the smaller from the larger, and the result is defined as the square of the interval between the events. The interval is consistent for all observers and represents a true physical relationship between the two events, unlike time and distance. We’ve already provided a geometrical construction for the interval at the end of Chapter IV; this gives the same outcome as the method described above. The interval is “time-like” when the time between the events is longer than the time light would need to travel from one to the other; otherwise, it is “space-like.” When the time between the two events exactly matches the time it takes light to travel from one to the other, the interval is zero; the two events are then located on parts of the same light ray, unless no light happens to be traveling that way.

When we come to the general theory of relativity, we shall have to generalize the notion of interval. The more deeply we penetrate into the structure of the world, the more important this concept becomes; we are tempted to say that it is the reality of which distances and periods of time are confused representations. The theory of relativity has altered our view of the fundamental structure of the world; that is the source both of its difficulty and of its importance.

When we get to the general theory of relativity, we’ll need to broaden our understanding of the idea of interval. The deeper we explore the structure of the universe, the more crucial this concept becomes; we might say it represents a reality where distances and time intervals are imperfect reflections. The theory of relativity has changed how we see the fundamental structure of the universe; that’s what makes it both challenging and significant.

The remainder of this chapter may be omitted by readers who have not even the most elementary acquaintance with geometry or algebra. But for the benefit of those whose education has not been entirely neglected, I will add a few explanations of the general formula of which I have hitherto given only particular examples. The general formula in question is the “Lorentz transformation,” which tells, when [Pg 87] one body is moving in a given manner relatively to another, how to infer the measures of lengths and times appropriate to the one body from those appropriate to the other. Before giving the algebraical formulæ, I will give a geometrical construction. As before, we will suppose that there are two observers, whom we will call O and O′, one of whom is stationary on the earth while the other is traveling at a uniform speed along a straight railway. At the beginning of the time considered, the two observers were at the same point of the railway, but now they are separated by a certain distance. A flash of lightning strikes a point X on the railway, and O judges that at the moment when the flash takes place the observer in the train has reached the point O′. The problem is: how far will O′ judge that he is from the flash, and how long after the beginning of the journey (when he was at O) will he judge that the flash took place? We are supposed to know O′s estimates, and we want to calculate those of O′. [Pg 88]

The rest of this chapter can be skipped by readers who don't have even a basic understanding of geometry or algebra. However, for those whose education hasn't been completely neglected, I'll provide a few explanations of the general formula I've previously given only specific examples of. The general formula is the “Lorentz transformation,” which explains how to determine the measurements of lengths and times for one object that is moving relative to another. Before presenting the algebraic formulas, I'll show a geometric construction. As before, we'll assume there are two observers, whom we'll call O and O′. One is stationary on the ground while the other travels at a constant speed along a straight railway. At the start of the time period we're considering, the two observers were at the same point on the railway, but now they are separated by a certain distance. A lightning strike hits a point X on the railway, and O believes that at the moment of the flash, the observer on the train has reached point O′. The question is: how far does O′ think he is from the flash, and how long after the journey began (when he was at O) does he think the flash occurred? We need to know O′s estimates and want to calculate those of O′.

In the time that, according to O, has elapsed since the beginning of the journey, let OC be the distance that light would have traveled along the railway. Describe a circle about O, with OC as radius, and through O′ draw a perpendicular to the railway, meeting the circle in D. On OD take a point Y such that OY is equal to OX (X is the point of the railway where the lightning strikes). Draw YM perpendicular to the railway, and OS perpendicular to OD. Let YM and OS meet in S. Also let DO′ produced and OS produced meet in R. Through X and C draw perpendiculars to [Pg 89] the railway meeting OS produced in Q and Z respectively. Then RQ (as measured by O) is the distance at which O′ will believe himself to be from the flash, not O′X as it would be according to the old view. And whereas O thinks that, in the time from the beginning of the journey to the flash, light would travel a distance OC, O′ thinks that the time elapsed is that required for light to travel the distance SZ (as measured by O). The interval as measured by O is got by subtracting the square on OX from the square on OC; the interval as measured by O′ is got by subtracting the square on RQ from the square on SZ. A little very elementary geometry shows that these are equal.

In the time that has passed since the start of the journey, let OC be the distance that light would have traveled along the railway. Draw a circle around O, using OC as the radius, and from O′, draw a line perpendicular to the railway that intersects the circle at D. On OD, take a point Y such that OY is equal to OX (where X is the point on the railway where the lightning strikes). Draw YM perpendicular to the railway, and OS perpendicular to OD. Let YM and OS intersect at S. Also, let the extended line DO′ and the extended line OS meet at R. From X and C, draw lines perpendicular to the railway that meet the extended OS at Q and Z, respectively. Then RQ (as measured by O) is the distance at which O′ will believe he is from the flash, rather than O′X as it would have been in the old view. While O thinks that, from the start of the journey to the flash, light would travel a distance OC, O′ thinks that the time elapsed is that required for light to travel the distance SZ (as measured by O). The interval as measured by O is found by subtracting the square of OX from the square of OC; the interval as measured by O′ is found by subtracting the square of RQ from the square of SZ. Some basic geometry shows that these are equal.

The algebraical formulæ embodied in the above construction are as follows: From the point of view of O, let an event occur at a distance x along the railway, and at a time t after the beginning of the journey (when O′ was at O). From the point of view of O′, let the same event occur at a distance x′ along the railway, and at a time t′ after the beginning of the journey. Let c be the velocity of light, and v the velocity of O′ relative to O. Put [Pg 90]

The algebraic formulas found in the above construction are as follows: From O's perspective, let an event take place at a distance x along the railway, and at a time t after the journey starts (when O′ was at O). From O′'s perspective, let the same event occur at a distance x′ along the railway, and at a time t′ after the journey begins. Let c be the speed of light, and v be the speed of O′ relative to O. Put [Pg 90]

    c
β = ————
    (c² - v²)

Then

Then

x′ = β(x - vt)

x′ = β(x - vt)

      vx
t′ = β  t - —
      c²

This is the Lorentz transformation, from which everything in this chapter can be deduced.

This is the Lorentz transformation, from which everything in this chapter can be figured out.


[Pg 91]

[Pg 91]

CHAPTER VII:
INTERVALS IN SPACE-TIME

The special theory of relativity, which we have been considering hitherto, solved completely a certain definite problem: to account for the experimental fact that, when two bodies are in uniform relative motion, all the laws of physics, both those of ordinary dynamics and those connected with electricity and magnetism, are exactly the same for the two bodies. “Uniform” motion, here, means motion in a straight line with constant velocity. But although one problem was solved by the special theory, another was immediately suggested: what if the motion of the two bodies is not uniform? Suppose, for instance, that one is the earth while the other is a falling stone. The stone has an accelerated motion: it is continually falling faster and faster. Nothing in the special theory enables us to say that the laws of physical phenomena will be the same for an observer on the stone as for one on the earth. This is [Pg 92] particularly awkward, as the earth itself is, in an extended sense, a falling body: It has at every moment an acceleration[4] towards the sun, which makes it go round the sun instead of moving in a straight line. As our knowledge of physics is derived from experiments on the earth, we cannot rest satisfied with a theory in which the observer is supposed to have no acceleration. The general theory of relativity removes this restriction, and allows the observer to be moving in any way, straight or crooked, uniformly or with an acceleration. In the course of removing the restriction, Einstein was led to his new law of gravitation, which we shall consider presently. The work was extraordinarily difficult, and occupied him for ten years. The special theory dates from 1905, the general theory from 1915.

The special theory of relativity that we've discussed so far completely solved a specific problem: it explained the experimental observation that when two bodies are in uniform relative motion, all the laws of physics, including those of regular dynamics and those linked to electricity and magnetism, are exactly the same for both bodies. “Uniform” motion here means moving in a straight line at a constant speed. However, while one problem was resolved by the special theory, another immediately arose: what if the motion of the two bodies isn't uniform? For example, consider the Earth and a stone that is falling. The stone is in accelerated motion, continually falling faster and faster. Nothing in the special theory tells us that the laws of physical phenomena will be the same for an observer on the stone as for one on the Earth. This is [Pg 92] particularly problematic because the Earth itself is, in a broader sense, a falling body: it has an acceleration[4]towards the sun, which causes it to orbit the sun instead of moving in a straight line. Since our understanding of physics is based on experiments conducted on Earth, we can't be satisfied with a theory where the observer is considered to have no acceleration. The general theory of relativity lifts this restriction and allows the observer to move in any manner, whether straight or curved, uniformly or with acceleration. In developing this theory, Einstein was led to his new law of gravitation, which we will explore shortly. The work was incredibly challenging and took him ten years to complete. The special theory was established in 1905, while the general theory emerged in 1915.

It is obvious from experiences with which we are all familiar that an accelerated motion is much more difficult to deal with than a uniform one. When you are in a train which is traveling steadily, the motion is not noticeable so long as you do not look out of the window; but when the brakes are applied suddenly you are precipitated forwards, [Pg 93] and you become aware that something is happening without having to notice anything outside the train. Similarly in a lift everything seems ordinary while it is moving steadily, but at starting and stopping, when its motion is accelerated, you have odd sensations in the pit of the stomach. (We call a motion “accelerated” when it is getting slower as well as when it is getting quicker; when it is getting slower the acceleration is negative.) The same thing applies to dropping a weight in the cabin of a ship. So long as the ship is moving uniformly, the weight will behave, relatively to the cabin, just as if the ship were at rest: if it starts from the middle of the ceiling, it will hit the middle of the floor. But if there is an acceleration everything is changed. If the boat is increasing its speed very rapidly, the weight will seem to an observer in the cabin to fall in a curve directed towards the stern; if the speed is being rapidly diminished, the curve will be directed towards the bow. All these facts are familiar, and they led Galileo and Newton to regard an accelerated motion as something radically different, in its own nature, from a uniform motion. But this distinction could only be maintained by regarding motion as absolute, not relative. If all motion is relative, [Pg 94] the earth is accelerated relatively to the lift just as truly as the lift relatively to the earth. Yet the people on the ground have no sensations in the pits of their stomachs when the lift starts to go up. This illustrates the difficulty of our problem. In fact, though few physicists in modern times have believed in absolute motion, the technique of mathematical physics still embodied Newton’s belief in it, and a revolution in method was required to obtain a technique free from this assumption. This revolution was accomplished in Einstein’s general theory of relativity.

It’s clear from experiences that we all know about that accelerated motion is much harder to handle than uniform motion. When you’re on a train that’s moving steadily, you won’t notice the motion as long as you don’t look out the window; but when the brakes are suddenly applied, you’re thrown forward, and you realize something is happening without needing to see anything outside the train. The same goes for an elevator; everything seems normal while it's moving steadily, but when it starts or stops and the speed changes, you get a funny feeling in your stomach. (We call motion “accelerated” whether it’s speeding up or slowing down; when it slows down, the acceleration is negative.) This also applies to dropping a weight inside a ship’s cabin. As long as the ship is moving steadily, the weight will act inside the cabin as if the ship is at rest: if it drops from the middle of the ceiling, it will land in the middle of the floor. But if there’s acceleration, everything changes. If the boat speeds up quickly, the weight will appear to fall in a curve toward the back; if the speed is decreasing rapidly, the curve will point toward the front. All these examples are well-known, and they led Galileo and Newton to see accelerated motion as fundamentally different from uniform motion. However, this distinction could only be made by considering motion as absolute, not relative. If all motion is relative, then the Earth is accelerating relative to the elevator just as the elevator is relative to the Earth. Yet, people on the ground don’t feel any funny sensations in their stomachs when the elevator begins to go up. This highlights the challenge we face. In fact, while few modern physicists believe in absolute motion, the methods of mathematical physics still reflected Newton’s belief in it, and a shift in approach was needed to create a technique free from this assumption. This shift was achieved with Einstein’s general theory of relativity.

It is somewhat optional where we begin in explaining the new ideas which Einstein introduced, but perhaps we shall do best by taking the conception of “interval.” This conception, as it appears in the special theory of relativity, is already a generalization of the traditional notion of distance in space and time; but it is necessary to generalize it still further. However, it is necessary first to explain a certain amount of history, and for this purpose we must go back as far as Pythagoras.

It’s somewhat up to us where to start in explaining the new ideas that Einstein introduced, but maybe it’s best to begin with the concept of “interval.” This concept, as it is laid out in the special theory of relativity, is already an expansion of the traditional idea of distance in space and time; however, it needs to be expanded even more. Before that, though, we need to cover some historical context, and for that purpose, we have to go back as far as Pythagoras.

Pythagoras, like many of the greatest characters in history, perhaps [Pg 95] never existed: he is a semi-mythical character, who combined mathematics and priestcraft in uncertain proportions. I shall, however, assume that he existed, and that he discovered the theorem attributed to him. He was roughly a contemporary of Confucius and Buddha; he founded a religious sect, which thought it wicked to eat beans, and a school of mathematicians, who took a particular interest in right-angled triangles. The theorem of Pythagoras (the forty-seventh proposition of Euclid) states that the sum of the squares on the two shorter sides of a right-angled triangle is equal to the square on the side opposite the right angle. No proposition in the whole of mathematics has had such a distinguished history. We all learned to “prove” it in youth. It is true that the “proof” proved nothing, and that the only way to prove it is by experiment. It is also the case that the proposition is not quite true—it is only approximately true. But everything in geometry, and subsequently in physics, has been derived from it by successive generalizations. The latest of these generalizations is the general theory of relativity.

Pythagoras, like many of the greatest figures in history, may never have actually existed: he is more of a semi-mythical figure who mixed math and religious practices in unclear ways. Still, I'll assume he did exist and discovered the theorem linked to him. He lived around the same time as Confucius and Buddha; he started a religious sect that believed it was wrong to eat beans and a group of mathematicians who were especially interested in right-angled triangles. The Pythagorean theorem (the forty-seventh proposition of Euclid) states that the sum of the squares of the two shorter sides of a right-angled triangle equals the square of the side opposite the right angle. No other proposition in all of mathematics has such a notable history. We all learned to "prove" it when we were young. The truth is, the "proof" didn't really prove anything, and the only real way to verify it is through experimentation. It's also worth noting that the proposition isn't completely true—it’s only approximately true. But everything in geometry, and later in physics, has been derived from it through successive generalizations. The most recent of these generalizations is the general theory of relativity.

The theorem of Pythagoras was itself, in all probability, a [Pg 96] generalization of an Egyptian rule of thumb. In Egypt, it had been known for ages that a triangle whose sides are 3, 4, and 5 units of length is a right-angled triangle; the Egyptians used this knowledge practically in measuring their fields. Now if the sides of a triangle are 3, 4, and 5 inches, the squares on these sides will contain respectively 9, 16, and 25 square inches; and 9 and 16 added together make 25. Three times three is written “3²”; four times four, “4²”; five times five, “5².” So that we have

The Pythagorean theorem was likely a generalization of a rule that the Egyptians had known for a long time. In Egypt, they understood that a triangle with sides measuring 3, 4, and 5 units is a right triangle; they used this knowledge practically for measuring their fields. If the sides of a triangle are 3, 4, and 5 inches, the squares of these sides would be 9, 16, and 25 square inches, respectively; and when you add 9 and 16 together, you get 25. Three times three is written as “3²”; four times four is “4²”; five times five is “5².” So we have

3² + 4² = 5².

3² + 4² = 5².

It is supposed that Pythagoras noticed this fact, after he had learned from the Egyptians that a triangle whose sides are 3, 4 and 5 has a right angle. He found that this could be generalized, and so arrived at his famous theorem: In a right-angled triangle, the square on the side opposite the right angle is equal to the sum of the squares on the other two sides.

It is thought that Pythagoras observed this after he learned from the Egyptians that a triangle with sides measuring 3, 4, and 5 forms a right angle. He realized that this could be generalized and came up with his famous theorem: In a right-angled triangle, the square of the length of the side opposite the right angle is equal to the sum of the squares of the lengths of the other two sides.

Similarly in three dimensions: if you take a right-angled solid block, [Pg 97] the square on the diagonal (the dotted line in the figure) is equal to the sum of the squares on the three sides.

Similarly in three dimensions: if you take a right-angled solid block, [Pg 97] the square on the diagonal (the dotted line in the figure) is equal to the sum of the squares on the three sides.

This is as far as the ancients got in this matter.

This is as far as the ancients got with this issue.

The next step of importance is due to Descartes, who made the theorem of Pythagoras the basis of his method of analytical geometry. Suppose you wish to map out systematically all the places on a plain—we will suppose it small enough to make it possible to ignore the fact that the earth is round. We will suppose that you live in the middle of the plain. One of the simplest ways of describing the position of a place is to say: starting from my house, go first such and such a distance east, then such and such a distance north (or it may be west in the first case, and south in the second). This tells you exactly where the place is. In the rectangular cities of America, it is the natural method to adopt: in New York you will be told to go so many blocks east (or west) and then so many blocks north (or south). The distance you have to go east is called x, and the distance you have to [Pg 98] go north is called y. (If you have to go west, x is negative; if you have to go south, y is negative.) Let O be your starting point (the “origin”); let OM be the distance you go east, and MP the distance you go north. How far are you from home in a direct line when you reach P? The theorem of Pythagoras gives the answer. The square on OP is the sum of the squares on OM and MP. If OM is four miles, and MP is three miles, OP is 5 miles. If OM is 12 miles and MP is 5 miles, OP is 13 miles, because 12² + 5² = 13². So that if you adopt Descartes’ method of mapping, the theorem of Pythagoras is essential in giving you the distance from place to place. In three dimensions the thing is exactly analogous. Suppose that, instead of wanting merely to fix positions on the plain, you want to fix stations for captive balloons above it, you will then have to add a third quantity, the height at which the balloon is to be. If you call the height z, and if r is the direct distance from O to the balloon, you will have

The next important step comes from Descartes, who used the Pythagorean theorem as the foundation for his method of analytical geometry. Imagine you want to systematically map out all the locations on a flat surface—we’ll assume it’s small enough to ignore that the Earth is round. Let’s say you live in the center of this surface. One of the simplest ways to describe a location is to say: starting from my house, go a certain distance east, and then a certain distance north (or it might be west in the first case and south in the second). This gives you an exact position. In the grid-like cities of America, this is the typical approach: in New York, you’d be told to go a certain number of blocks east (or west) and then a certain number of blocks north (or south). The distance you go east is called x, and the distance you go north is called y. (If you go west, x is negative; if you go south, y is negative.) Let O be your starting point (the “origin”); let OM be the distance you go east, and MP the distance you go north. How far are you from home in a straight line when you reach P? The Pythagorean theorem provides the answer. The square of OP equals the sum of the squares of OM and MP. If OM is four miles and MP is three miles, OP is 5 miles. If OM is 12 miles and MP is 5 miles, OP is 13 miles, because 12² + 5² = 13². So, if you use Descartes’ method of mapping, the Pythagorean theorem is crucial for determining the distance from one place to another. In three dimensions, it’s exactly the same. Suppose that instead of just wanting to identify positions on the flat surface, you want to set up locations for balloons floating above it; you’ll need to add a third measurement, which is the height at which the balloon will be. If you call the height z, and if r represents the direct distance from O to the balloon, you will have

r² = x² + y² + z²,

r² = x² + y² + z²,

and from this you can calculate r when you know x, y, and z. For example, if you can get to the balloon by [Pg 99] going 12 miles east, 4 miles north, and then 3 miles up, your distance from the balloon in a straight line is 13 miles, because

and from this you can calculate r when you know x, y, and z. For example, if you can reach the balloon by [Pg 99] traveling 12 miles east, 4 miles north, and then 3 miles up, your straight-line distance from the balloon is 13 miles, because

  • 12 × 12 = 144,
  • 4 × 4 = 16,
  • 3 × 3 = 9,
  • 144 + 16 + 9 = 169 = 13 × 13.

But now suppose that, instead of taking a small piece of the earth’s surface which can be regarded as flat, you consider making a map of the world. An accurate map of the world on flat paper is impossible. A globe can be accurate, in the sense that everything is produced to scale, but a flat map cannot be. I am not talking of practical difficulties, I am talking of a theoretical impossibility. For example: the northern halves of the meridian of Greenwich and the ninetieth meridian of west longitude, together with the piece of the equator between them, make a triangle whose sides are all equal and whose angles are all right angles. On a flat surface, a triangle of that sort would be impossible. On the other hand, it is possible to make a square on a flat surface, but on a sphere it is impossible. Suppose you try on the earth: walk 100 miles west, then 100 miles north, then 100 miles east, then 100 miles south. You might think this would make a square, but it wouldn’t, because you would not at the end have come back to [Pg 100] your starting point. If you have time, you may convince yourself of this by experiment. If not, you can easily see that it must be so. When you are nearer the pole, 100 miles takes you through more longitude than when you are nearer the equator, so that in doing your 100 miles east (if you are in the northern hemisphere) you get to a point further east than that from which you started. As you walk due south after this, you remain further east than your starting point, and end up at a different place from that in which you began. Suppose, to take another illustration, that you start on the equator 4,000 miles east of the Greenwich meridian; you travel till you reach the meridian, then you travel northwards along it for 4,000 miles, through Greenwich and up to the neighborhood of the Shetland Islands; then you travel eastward for 4,000 miles, and then 4,000 miles south. This will take you to the equator at a point 4,000 miles further east than the point from which you started.

But now imagine that, instead of taking a small flat section of the earth's surface, you're trying to make a map of the entire world. An accurate flat map of the world is impossible. A globe can be accurate since everything is represented to scale, but a flat map cannot achieve that. I’m not talking about practical issues; I’m discussing a theoretical impossibility. For instance, the northern halves of the Greenwich meridian and the 90th meridian of west longitude, along with the segment of the equator between them, form a triangle where all sides are equal and all angles are right angles. Such a triangle couldn't exist on a flat surface. Conversely, you can draw a square on a flat surface, but it's impossible to do so on a sphere. Now, let’s say you try this on the Earth: you walk 100 miles west, then 100 miles north, then 100 miles east, and finally 100 miles south. You might think this would create a square, but it wouldn’t, because you wouldn't end up back where you started. If you have time, you could prove this through experimentation, but it’s also clear that it must be true. When you're closer to the pole, walking 100 miles covers more longitude than it does when you’re closer to the equator, so when you walk 100 miles east (if you’re in the Northern Hemisphere), you end up further east than your starting point. As you walk directly south after that, you remain further east than where you began and finish at a different location. To give another example, imagine you start on the equator 4,000 miles east of the Greenwich meridian; you walk until you reach the meridian, then go north along it for 4,000 miles, passing through Greenwich and up to the area near the Shetland Islands; next, you travel east for 4,000 miles, and finally 4,000 miles south. This route will bring you back to the equator at a point that is 4,000 miles further east than where you began.

In a sense, what we have just been saying is not quite fair, because, except on the equator, traveling due east is not the shortest route from a place to another place due east of it. A ship traveling (say) [Pg 101] from New York to Lisbon, which is nearly due east, will start by going a certain distance northward. It will sail on a “great circle,” that is to say, a circle whose centre is the centre of the earth. This is the nearest approach to a straight line that can be drawn on the surface of the earth. Meridians of longitude are great circles, and so is the equator, but the other parallels of latitude are not. We ought, therefore, to have supposed that, when you reach the Shetland Islands, you travel 4,000 miles, not due east, but along a great circle which lands you at a point due east of the Shetland Islands. This, however, only reinforces our conclusion: you will end at a point even further east of your starting point than before.

In a way, what we’ve just discussed isn’t entirely accurate because, except at the equator, traveling directly east isn’t the shortest route from one place to another directly east of it. A ship traveling (for example) [Pg 101] from New York to Lisbon, which is almost due east, will begin by going a certain distance northward. It will sail along a “great circle,” which is a circle whose center is at the center of the earth. This represents the closest thing to a straight line that can be drawn on the earth's surface. Meridians of longitude are great circles, and so is the equator, but other parallels of latitude are not. Therefore, we should have considered that when you reach the Shetland Islands, you actually travel 4,000 miles not directly east, but along a great circle that leads you to a point east of the Shetland Islands. However, this only strengthens our conclusion: you will end up even further east of your starting point than before.

What are the differences between the geometry on a sphere and the geometry on a plane? If you make a triangle on the earth, whose sides are great circles, you will not find that the angles of the triangle add up to two right angles: they will add up to rather more. The amount by which they exceed two right angles is proportional to the size of the triangle. On a small triangle such as you could make with strings on your lawn, or even on a triangle formed by three ships which can [Pg 102] just see each other, the angles will add up to so little more than two right angles that you will not be able to detect the difference. But if you take the triangle made by the equator, the Greenwich meridian, and the ninetieth meridian, the angles add up to three right angles. And you can get triangles in which the angles add up to anything up to six right angles. All this you could discover by measurements on the surface of the earth, without having to take account of anything in the rest of space.

What are the differences between the geometry on a sphere and the geometry on a flat surface? If you create a triangle on Earth, where the sides are great circles, you won't find that the angles of the triangle add up to two right angles; they will add up to a bit more. The amount by which they exceed two right angles is proportional to the size of the triangle. For a small triangle you could create with strings on your lawn, or even a triangle formed by three ships that can just see each other, the angles will add up to only slightly more than two right angles, so you probably won't notice the difference. But if you consider the triangle formed by the equator, the Greenwich meridian, and the ninetieth meridian, the angles add up to three right angles. You can also create triangles where the angles add up to as much as six right angles. You could discover all this by measuring on the surface of the Earth, without needing to consider anything else in space.

The theorem of Pythagoras also will fail for distances on a sphere. From the point of view of a traveler bound to the earth, the distance between two places is their great circle distance, that is to say, the shortest journey that a man can make without leaving the surface of the earth. Now suppose you take three bits of great circles which make a triangle, and suppose one of them is at right angles to another—to be definite, let one be the equator and one a bit of the meridian of Greenwich going northward from the equator. Suppose you go 3,000 miles along the equator, and then 4,000 miles due north; how far will you be from your starting point, estimating the distance along a great circle? [Pg 103] If you were on a plane, your distance would be 5,000 miles, as we saw before. In fact, however, your great circle distance will be considerably less than this. In a right-angled triangle on a sphere, the square on the side opposite the right angle is less than the sum of the squares on the other two sides.

The Pythagorean theorem doesn't hold true for distances on a sphere. When you're on Earth, the distance between two locations is measured along the great circle, which is the shortest path you can take without leaving the planet's surface. Now imagine drawing three segments of great circles to form a triangle, with one segment perpendicular to another. For example, let one segment be the equator and the other a part of the Greenwich meridian heading north from the equator. If you travel 3,000 miles along the equator and then 4,000 miles straight north, how far are you from your starting point if you measure along a great circle? [Pg 103] If you were on a flat plane, the distance would be 5,000 miles, as previously discussed. However, the actual great circle distance will be significantly shorter. In a right-angled triangle on a sphere, the square of the length of the side opposite the right angle is less than the sum of the squares of the other two sides.

These differences between the geometry on a sphere and the geometry on a plane are intrinsic differences; that is to say, they enable you to find out whether the surface on which you live is like a plane or like a sphere, without requiring that you should take account of anything outside the surface. Such considerations led to the next step of importance in our subject, which was taken by Gauss, who flourished a hundred years ago. He studied the theory of surfaces, and showed how to develop it by means of measurements on the surfaces themselves, without going outside them. In order to fix the position of a point in space, we need three measurements; but in order to fix the position of a point on a surface we need only two—for example, a point on the earth’s surface is fixed when we know its latitude and longitude. [Pg 104]

These differences between the geometry on a sphere and the geometry on a flat surface are essential differences; in other words, they allow you to determine whether the surface you’re on behaves like a flat plane or like a sphere, without needing to consider anything outside that surface. These ideas led to the next significant development in our field, introduced by Gauss, who was active about a hundred years ago. He explored the theory of surfaces and demonstrated how to evolve that theory based solely on measurements taken from the surfaces themselves, without any need to look beyond them. To pinpoint a location in space, we need three measurements; however, to identify a position on a surface, only two are necessary—for instance, a location on the earth’s surface is defined when we know its latitude and longitude. [Pg 104]

Now Gauss found that, whatever system of measurement you adopt, and whatever the nature of the surface, there is always a way of calculating the distance between two not very distant points of the surface, when you know the quantities which fix their positions. The formula for the distance is a generalization of the formula of Pythagoras: it tells you the square of the distance in terms of the squares of the differences between the measure quantities which fix the points, and also the product of these two quantities. When you know this formula, you can discover all the intrinsic properties of the surface, that is to say, all those which do not depend upon its relations to points outside the surface. You can discover, for example, whether the angles of a triangle add up to two right angles, or more, or less, or more in some cases and less in others.

Now Gauss discovered that, no matter what measurement system you use, and regardless of the surface's nature, there’s always a way to calculate the distance between two nearby points on that surface, as long as you know the values that define their positions. The formula for distance is a generalization of Pythagoras's theorem: it shows the square of the distance in relation to the squares of the differences between the values that define the points and also the product of these two values. Once you understand this formula, you can identify all the intrinsic properties of the surface, meaning those that don't rely on its relationship to points outside of it. For instance, you can determine whether the angles of a triangle add up to two right angles, more, or less, or even if they vary in some cases and not in others.

But when we speak of a “triangle,” we must explain what we mean, because on most surfaces there are no straight lines. On a sphere, we shall replace straight lines by great circles, which are the nearest possible approach to straight lines. In general, we shall take, instead of straight lines, the lines that give the shortest route on [Pg 105] the surface from place to place. Such lines are called “geodesics.” On the earth, the geodesics are great circles. In general, they are the shortest way of traveling from point to point if you are unable to leave the surface. They take the place of straight lines in the intrinsic geometry of a surface. When we inquire whether the angles of a triangle add up to two right angles or not, we mean to speak of a triangle whose sides are geodesics. And when we speak of the distance between two points, we mean the distance along a geodesic.

But when we talk about a “triangle,” we need to clarify what we mean, because on most surfaces, there aren't any straight lines. On a sphere, we’ll replace straight lines with great circles, which are the closest thing to straight lines. Generally, instead of straight lines, we’ll use the paths that provide the shortest route on [Pg 105] the surface from one place to another. These paths are called “geodesics.” On Earth, geodesics are great circles. In general, they represent the shortest way to travel from point to point if you can't leave the surface. They take the place of straight lines in the intrinsic geometry of a surface. When we ask whether the angles of a triangle add up to two right angles, we’re referring to a triangle whose sides are geodesics. And when we talk about the distance between two points, we mean the distance along a geodesic.

The next step in our generalizing process is rather difficult: it is the transition to non-Euclidean geometry. We live in a world in which space has three dimensions, and our empirical knowledge of space is based upon measurement of small distances and of angles. (When I speak of small distances, I mean distances that are small compared to those in astronomy; all distances on the earth are small in this sense.) It was formerly thought that we could be sure à priori that space is Euclidean—for instance, that the angles of a triangle add up to two right angles. But it came to be recognized that we could not prove this by reasoning; if it was to be known, it must be known as the result of [Pg 106] measurements. Before Einstein, it was thought that measurements confirm Euclidean geometry within the limits of exactitude attainable; now this is no longer thought. It is still true that we can, by what may be called a natural artifice, cause Euclidean geometry to seem true throughout a small region, such as the earth; but in explaining gravitation Einstein is led to the view that over large regions where there is matter we cannot regard space as Euclidean. The reasons for this will concern us later. What concerns us now is the way in which non-Euclidean geometry results from a generalization of the work of Gauss.

The next step in our generalizing process is quite challenging: it involves the shift to non-Euclidean geometry. We live in a world with three dimensions, and our understanding of space comes from measuring short distances and angles. (When I say short distances, I mean distances that are small compared to those in astronomy; all distances on Earth are relatively small in this sense.) It was once believed that we could be certain à priori that space is Euclidean—for example, that the angles of a triangle add up to two right angles. However, it became clear that we couldn't prove this through reasoning; if it was to be known, it had to be understood as a result of [Pg 106] measurements. Before Einstein, the belief was that measurements confirmed Euclidean geometry within the limits of achievable precision; that view has changed. It's still true that, through what might be called a natural artifice, we can make Euclidean geometry seem true over a small area, like Earth; but in explaining gravitation, Einstein comes to see that in large areas where matter exists, we cannot treat space as Euclidean. The reasons for this will be addressed later. What we need to focus on now is how non-Euclidean geometry arises from a generalization of Gauss's work.

There is no reason why we should not have the same circumstances in three-dimensional space as we have, for example, on the surface of a sphere. It might happen that the angles of a triangle would always add up to more than two right angles, and that the excess would be proportional to the size of the triangle. It might happen that the distance between two points would be given by a formula analogous to what we have on the surface of a sphere, but involving three quantities instead of two. Whether this does happen or not, can only [Pg 107] be discovered by actual measurements. There are an infinite number of such possibilities.

There’s no reason we can’t have the same conditions in three-dimensional space as we do, for example, on the surface of a sphere. It could turn out that the angles of a triangle always add up to more than two right angles, and that the extra amount relates to the size of the triangle. It’s also possible that the distance between two points could be described by a formula similar to the one we use on a sphere, but with three variables instead of two. Whether this is actually the case can only be determined through real measurements. There are endless possibilities like this. [Pg 107]

This line of argument was developed by Riemann, in his dissertation “On the hypotheses which underlie geometry” (1854), which applied Gauss’s work on surfaces to different kinds of three-dimensional spaces. He showed that all the essential characteristics of a kind of space could be deduced from the formula for small distances. He assumed that, from the small distances in three given directions which would together carry you from one point to another not far from it, the distances between the two points could be calculated. For instance, if you know that you can get from one point to another by first moving a certain distance east, then a certain distance north, and finally a certain distance straight up in the air, you are to be able to calculate the distance from the one point to the other. And the rule for the calculation is to be an extension of the theorem of Pythagoras, in the sense that you arrive at the square of the required distance by adding together multiples of the squares of the component distances, together possibly with multiples of their products. From certain characteristics in the formula, you can tell what sort of [Pg 108] space you have to deal with. These characteristics do not depend upon the particular method you have adopted for determining the positions of points.

This line of reasoning was developed by Riemann in his dissertation “On the Hypotheses That Underlie Geometry” (1854), which applied Gauss’s work on surfaces to various types of three-dimensional spaces. He demonstrated that all the essential features of a type of space could be derived from the formula for small distances. He assumed that, based on the small distances in three given directions that would together take you from one point to another nearby, the distance between the two points could be calculated. For example, if you know that you can reach another point by first moving a certain distance east, then a certain distance north, and finally a certain distance straight up, you should be able to calculate the distance from one point to the other. The calculation rule would extend the Pythagorean theorem, meaning you arrive at the square of the required distance by adding together multiples of the squares of the individual distances, possibly along with multiples of their products. From specific characteristics in the formula, you can determine what kind of space you are dealing with. These characteristics do not depend on the particular method you used to establish the positions of points. [Pg 108]

In order to arrive at what we want for the theory of relativity, we now have one more generalization to make: we have to substitute the “interval” between events for the distance between points. This takes us to space-time. We have already seen that, in the special theory of relativity, the square of the interval is found by subtracting the square of the distance between the events from the square of the distance that light would travel in the time between them. In the general theory, we do not assume this special form of interval, except at a great distance from matter. Elsewhere, we assume to begin with a general form, like that which Riemann used for distances. Moreover, like Riemann, Einstein only assumes his formula for neighboring events, that is to say, events which have only a small interval between them. What goes beyond these initial assumptions depends upon observation of the actual motion of bodies, in ways which we shall explain in later chapters. [Pg 109]

To get to what we want for the theory of relativity, we need to make one more generalization: we have to replace the “interval” between events with the distance between points. This brings us to space-time. We've already seen that in the special theory of relativity, the square of the interval is calculated by subtracting the square of the distance between the events from the square of the distance that light would travel in the time between them. In the general theory, we don’t assume this specific form of interval, except at a great distance from matter. Instead, we start with a general form similar to what Riemann used for distances. Additionally, like Riemann, Einstein only applies his formula to neighboring events, meaning events with just a small interval between them. What goes beyond these initial assumptions relies on the observation of the actual motion of bodies, which we will explain in later chapters. [Pg 109]

We may now sum up and re-state the process we have been describing. In three dimensions, the position of a point relatively to a fixed point (the “origin”) can be determined by assigning three quantities (“co-ordinates”). For example, the position of a balloon relatively to your house is fixed if you know that you will reach it by going first a given distance due east, then another given distance due north, then a third given distance straight up. When, as in this case, the three co-ordinates are three distances all at right angles to each other, which, taken successively, transport you from the origin to the point in question, the square of the direct distance to the point in question is got by adding up the squares of the three co-ordinates. In all cases, whether in Euclidean or in non-Euclidean spaces, it is got by adding multiples of the squares and products of the co-ordinates according to an assignable rule. The co-ordinates may be any quantities which fix the position of a point, provided that neighboring points must have neighboring quantities for their co-ordinates. In the general theory of relativity, we add a fourth co-ordinate to give the time, and our formula gives “interval” instead of spatial distance; moreover we [Pg 110] assume the accuracy of our formula for small distances only. We assume further that, at great distances from matter, the formula approximates more and more closely to the formula for interval which is used in the special theory.

We can now summarize and restate the process we've been discussing. In three dimensions, the position of a point relative to a fixed point (the “origin”) can be determined by assigning three values (“coordinates”). For example, the position of a balloon relative to your house is fixed if you know you'll reach it by traveling a certain distance due east, then a certain distance due north, and finally a certain distance straight up. When, as in this case, the three coordinates are three distances that are all at right angles to each other, which, taken one after the other, take you from the origin to the point in question, the square of the direct distance to that point can be found by adding the squares of the three coordinates. In all cases, whether in Euclidean or non-Euclidean spaces, this is obtained by adding multiples of the squares and products of the coordinates according to a specific rule. The coordinates can be any values that fix the position of a point, as long as neighboring points have neighboring values for their coordinates. In the general theory of relativity, we add a fourth coordinate to represent time, and our formula gives “interval” instead of spatial distance; moreover, we assume the accuracy of our formula is valid only for small distances. We also assume that, at large distances from matter, the formula gets closer and closer to the interval formula used in the special theory. [Pg 110]

We are now at last in a position to tackle Einstein’s theory of gravitation.

We are finally in a position to take on Einstein’s theory of gravity.


[Pg 111]

[Pg 111]

CHAPTER VIII:
EINSTEIN’S LAW OF GRAVITATION

Before tackling Einstein’s new law, it is as well to convince ourselves, on logical grounds, that Newton’s law of gravitation cannot be quite right.

Before diving into Einstein’s new theory, it’s important to convince ourselves, using logic, that Newton’s law of gravitation can’t be completely accurate.

Newton said that between any two particles of matter there is a force which is proportional to the product of their masses and inversely proportional to the square of their distance. That is to say, ignoring for the present the question of mass, if there is a certain attraction when the particles are a mile apart, there will be a quarter as much attraction when they are two miles apart, a ninth as much when they are three miles apart, and so on: the attraction diminishes much faster than the distance increases. Now, of course, Newton, when he spoke of the distance, meant the distance at a given time: He thought there could be no ambiguity about time. But we have seen that this was a mistake. What one observer judges to be the same moment on the [Pg 112] earth and the sun, another will judge to be two different moments. “Distance at a given moment” is therefore a subjective conception, which can hardly enter into a cosmic law. Of course, we could make our law unambiguous by saying that we are going to estimate times as they are estimated by Greenwich Observatory. But we can hardly believe that the accidental circumstances of the earth deserve to be taken so seriously. And the estimate of distance, also, will vary for different observers. We cannot, therefore, allow that Newton’s form of the law of gravitation can be quite correct, since it will give different results according to which of many equally legitimate conventions we adopt. This is as absurd as it would be if the question whether one man had murdered another were to depend upon whether they were described by their Christian names or their surnames. It is obvious that physical laws must be the same whether distances are measured in miles or in kilometers, and we are concerned with what is essentially only an extension of the same principle.

Newton stated that between any two particles of matter, there is a force that is proportional to the product of their masses and inversely proportional to the square of their distance. In simpler terms, ignoring the mass for now, if there is a certain attraction when the particles are a mile apart, the attraction will be a quarter as strong when they are two miles apart, a ninth as strong when they are three miles apart, and so on: the attraction decreases much faster than the distance increases. Naturally, when Newton discussed distance, he meant the distance at a specific time: he believed there could be no confusion about time. However, we have seen that this was a mistake. What one observer considers to be the same moment on Earth and the sun, another may see as two different moments. Thus, "distance at a given moment" is a subjective concept that is difficult to include in a cosmic law. We could clarify our law by saying we will use the time measurements from Greenwich Observatory. But it seems unreasonable to take the random factors of the Earth so seriously. Additionally, the estimate of distance will vary for different observers. Therefore, we can’t accept that Newton’s version of the law of gravitation is completely correct, as it would yield different results based on which of the many valid conventions we choose. This is as ridiculous as if the question of whether one person had murdered another depended on whether they were referred to by their first names or last names. It is clear that physical laws must remain the same whether distances are measured in miles or kilometers, and we are essentially discussing an extension of the same principle.

Our measurements are conventional to an even greater extent than [Pg 113] is admitted by the special theory of relativity. Moreover, every measurement is a physical process carried out with physical material; the result is certainly an experimental datum, but may not be susceptible of the simple interpretation which we ordinarily assign to it. We are, therefore, not going to assume to begin with that we know how to measure anything. We assume that there is a certain physical quantity, called “interval,” which is a relation between two events that are not widely separated; but we do not assume in advance that we know how to measure it, beyond taking it for granted that it is given by some generalization of the theorem of Pythagoras such as we spoke of in the preceding chapter.

Our measurements are even more conventional than what the special theory of relativity acknowledges. Furthermore, every measurement is a physical process involving physical materials; the result is definitely an experimental fact, but it might not be simply interpreted in the way we usually think. Therefore, we won't assume from the start that we know how to measure anything. We assume there is a specific physical quantity called “interval,” which relates to two events that aren’t too far apart; however, we won’t assume from the outset that we understand how to measure it, aside from the notion that it is derived from some generalization of the Pythagorean theorem, as we discussed in the previous chapter.

We do assume, however, that events have an order, and that this order is four-dimensional. We assume, that is to say, that we know what we mean by saying that a certain event is nearer to another than to a third, so that before making accurate measurements we can speak of the “neighborhood” of an event; and we assume that, in order to assign the position of an event in space-time, four quantities (co-ordinates) are necessary—e.g. in our former case of an explosion on an [Pg 114] airship, latitude, longitude, altitude and time. But we assume nothing about the way in which these co-ordinates are assigned, except that neighboring co-ordinates are assigned to neighboring events.

We do assume, however, that events have an order, and that this order is four-dimensional. We believe, in other words, that we understand what it means to say that one event is closer to another than it is to a third, so that before making precise measurements we can talk about the “neighborhood” of an event; and we assume that to define the position of an event in space-time, four quantities (coordinates) are necessary—e.g. in our earlier example of an explosion on an airship, latitude, longitude, altitude, and time. But we don’t assume anything about how these coordinates are assigned, other than that neighboring coordinates are linked to neighboring events.

The way in which these numbers, called co-ordinates, are to be assigned is neither wholly arbitrary nor a result of careful measurement—it lies in an intermediate region. While you are making any continuous journey, your co-ordinates must never alter by sudden jumps. In America one finds that the houses between (say) Fourteenth Street and Fifteenth Street are likely to have numbers between 1400 and 1500, while those between Fifteenth Street and Sixteenth Street have numbers between 1500 and 1600, even if the 1400’s were not used up. This would not do for our purposes, because there is a sudden jump when we pass from one block to the next. Or again we might assign the time co-ordinate in the following way: take the time that elapses between two successive births of people called Smith; an event occurring between the births of the 3000th and the 3001st Smith known to history shall have a co-ordinate lying between 3000 and 3001; the fractional part of its co-ordinate [Pg 115] shall be the fraction of a year that has elapsed since the birth of the 3000th Smith. (Obviously there could never be as much as a year between two successive additions to the Smith family.) This way of assigning the time co-ordinate is perfectly definite, but it is not admissible for our purposes, because there will be sudden jumps between events just before the birth of a Smith and events just after, so that in a continuous journey your time co-ordinate will not change continuously. It is assumed that, independently of measurement, we know what a continuous journey is. And when your position in space-time changes continuously, each of your four co-ordinates must change continuously. One, two, or three of them may not change at all; but whatever change does occur must be smooth, without sudden jumps. This explains what is not allowable in assigning co-ordinates.

The way these numbers, called coordinates, are assigned is neither completely random nor just based on careful measurements—it falls somewhere in between. When you’re on a continuous journey, your coordinates should never change suddenly. In America, houses between (for example) Fourteenth Street and Fifteenth Street usually have numbers between 1400 and 1500, while those between Fifteenth Street and Sixteenth Street have numbers between 1500 and 1600, even if the 1400s haven’t been completely used up. This wouldn’t work for our needs because there’s a sudden jump when moving from one block to the next. Alternatively, we could assign the time coordinate like this: measure the time between two successive births of people named Smith; an event happening between the births of the 3000th and 3001st Smith in history would have a coordinate between 3000 and 3001; the fractional part of that coordinate would be the fraction of a year that has passed since the birth of the 3000th Smith. (Clearly, there can never be a whole year between two successive births in the Smith family.) This method of setting the time coordinate is very precise, but it wouldn’t work for our needs because there would be sudden jumps between events just before a Smith is born and those just after, meaning that during a continuous journey, your time coordinate wouldn’t change smoothly. It’s assumed that we understand what a continuous journey is, regardless of measurements. When your position in space-time changes smoothly, each of your four coordinates must also change smoothly. One, two, or three of them might not change at all; but any change that does happen must be gradual, with no sudden jumps. This clarifies what is not acceptable when assigning coordinates.

To explain all the changes that are legitimate in your co-ordinates, suppose you take a large piece of soft india-rubber. While it is in an unstretched condition, measure little squares on it, each one-tenth of an inch each way. Put in little tiny pins at the corners of the squares. We can take as two of the co-ordinates of one of these pins [Pg 116] the number of pins passed in going to the right from a given pin until we come just below the pin in question, and then the number of pins we pass on the way up to this pin. In the figure, let O be the pin we start from and P the pin to which we are going to assign co-ordinates. P is in the fifth column and the third row, so its co-ordinates in the plane of the india-rubber are to be 5 and 3.

To explain all the changes that are valid in your coordinates, imagine taking a large piece of soft rubber. While it’s in an unstretched state, measure small squares on it, each one-tenth of an inch on each side. Place tiny pins at the corners of the squares. We can use the number of pins moved to the right from a given pin until we reach just below the pin in question as one of the coordinates, and then the number of pins we move up to this pin as the other coordinate. In the diagram, let O be the pin we start from and P be the pin we are assigning coordinates to. P is in the fifth column and the third row, so its coordinates on the rubber plane are 5 and 3.

 

Fig. 1.

Fig. 1.

Fig. 2.

Fig. 2.

Now take the india-rubber and stretch it and twist it as much as you like. Let the pins now be in the shape they have in Fig. 2. The divisions now no longer represent distances according to our usual notions, but they will still do just as well as co-ordinates. We may still take P as having the co-ordinates 5 and 3 in the plane of the india-rubber; and we may still regard the india-rubber as being in a plane, even if we have twisted it out of what we should ordinarily [Pg 117] call a plane. Such continuous distortions do not matter.

Now take the rubber and stretch it and twist it as much as you want. Let the pins now be in the shape they are in Fig. 2. The divisions no longer represent distances according to our usual ideas, but they will still work just as well as coordinates. We can still take P as having the coordinates 5 and 3 in the plane of the rubber; and we can still think of the rubber as being in a plane, even if we’ve twisted it out of what we would usually call a plane. Such continuous distortions don’t matter. [Pg 117]

To take another illustration: instead of using a steel measuring rod to fix our co-ordinates, let us use a live eel, which is wriggling all the time. The distance from the tail to the head of the eel is to count as one from the point of view of co-ordinates, whatever shape the creature may be assuming at the moment. The eel is continuous, and its wriggles are continuous, so it may be taken as our unit of distance in assigning co-ordinates. Beyond the requirement of continuity, the method of assigning co-ordinates is purely conventional, and therefore a live eel is just as good as a steel rod.

To illustrate further: instead of using a steel measuring rod to set our coordinates, let’s use a live eel that’s constantly wriggling. The distance from the tail to the head of the eel counts as one unit from the perspective of coordinates, no matter what shape the creature is taking at the moment. The eel is continuous, and its movements are also continuous, so we can use it as our unit of distance for setting coordinates. Aside from the need for continuity, the method of setting coordinates is completely conventional, which means a live eel is just as good as a steel rod.

We are apt to think that, for really careful measurements, it is better to use a steel rod than a live eel. This is a mistake: not because the eel tells us what the steel rod was thought to tell, but because the steel rod really tells no more than the eel obviously does. The point is, not that eels are really rigid, but that steel rods really wriggle. To an observer in just one possible state of motion, the eel would appear rigid, while the steel rod would seem to wriggle just [Pg 118] as the eel does to us. For everybody moving differently both from this observer and ourselves, both the eel and the rod would seem to wriggle. And there is no saying that one observer is right and another wrong. In such matters, what is seen does not belong solely to the physical process observed, but also to the standpoint of the observer. Measurements of distances and times do not directly reveal properties of the things measured, but relations of the things to the measurer. What observation can tell us about the physical world is therefore more abstract than we have hitherto believed.

We often think that for really accurate measurements, it's better to use a steel rod than a live eel. This is a mistake: not because the eel provides the same information that we expect from the steel rod, but because the steel rod actually doesn't provide any more information than the eel obviously does. The point is not that eels are completely rigid, but that steel rods actually move in a way that looks like they're wriggling. To an observer in just one specific state of motion, the eel would look rigid, while the steel rod would seem to wiggle just like the eel does for us. For everyone moving differently from both this observer and ourselves, both the eel and the rod would appear to wriggle. There's no way to say that one observer is right and another is wrong. In these matters, what is seen is not just about the physical process being observed, but also about the perspective of the observer. Measurements of distances and times don’t directly show the properties of what’s being measured, but rather the relationship between those things and the person measuring them. What observation can teach us about the physical world is therefore more abstract than we have previously thought.

It is important to realize that geometry, as taught in schools since Greek times, ceases to exist as a separate science, and becomes merged in physics. The two fundamental notions in elementary geometry were the straight line and the circle. What appears to you as a straight road, whose parts all exist now, may appear to another observer to be like the flight of a rocket, some kind of curve whose parts come into existence successively. The circle depends upon measurement of distances, since it consists of all the points at a given distance from its center. And measurement of distances, as we have seen, is [Pg 119] a subjective affair, depending upon the way in which the observer is moving. The failure of the circle to have objective validity was demonstrated by the Michelson-Morley experiment, and is thus, in a sense, the starting point of the whole theory of relativity. Rigid bodies, which we need for measurement, are only rigid for certain observers; for others, they will be constantly changing all their dimensions. It is only our obstinately earth-bound imagination that makes us suppose a geometry separate from physics to be possible.

It's important to understand that geometry, as it's been taught in schools since Greek times, no longer exists as a separate discipline and instead merges with physics. The two basic concepts in elementary geometry are the straight line and the circle. What looks like a straight road to you, where all parts are present now, might seem to another observer like the path of a rocket, a kind of curve where parts appear one after another. The circle relies on measuring distances, as it includes all points at a specific distance from its center. And as we've noted, measuring distances is a subjective process, depending on the movement of the observer. The fact that the circle lacks objective validity was shown by the Michelson-Morley experiment, which is a key point in the theory of relativity. Rigid bodies, which we use for measurement, are only considered rigid by certain observers; for others, they will be constantly changing in size. It's only our stubbornly earth-bound imagination that leads us to think a geometry separate from physics is possible.

That is why we do not trouble to give physical significance to our co-ordinates from the start. Formerly, the co-ordinates used in physics were supposed to be carefully measured distances; now we realize that this care at the start is thrown away. It is at a later stage that care is required. Our co-ordinates now are hardly more than a systematic way of cataloguing events. But mathematics provides, in the method of tensors, such an immensely powerful technique that we can use co-ordinates assigned in this apparently careless way just as effectively as if we had applied the whole apparatus of minutely accurate measurement in arriving at them. The advantage of being [Pg 120] haphazard at the start is that we avoid making surreptitious physical assumptions, which we can hardly help making, if we suppose that our co-ordinates have initially some particular physical significance.

That's why we don't bother assigning physical meaning to our coordinates right from the beginning. In the past, coordinates used in physics were thought to be carefully measured distances; now we understand that this initial precision isn't necessary. Care is needed later on. Our coordinates now are basically just a systematic way of organizing events. However, mathematics allows us to use tensors, a super powerful method, meaning we can work with coordinates assigned in this seemingly careless manner just as effectively as if we had done precise measurements to define them. The benefit of being messy at the start is that we avoid making hidden physical assumptions, which are difficult to avoid if we believe our coordinates initially have some specific physical meaning. [Pg 120]

We assume that, if two events are close together (but not necessarily otherwise), there is an interval between them which can be calculated from the differences between their co-ordinates by some such formula as we considered in the preceding chapter. That is to say, we take the squares and products of the differences of co-ordinates, we multiply them by suitable amounts (which in general will vary from place to place), and we add the results together. The sum obtained is the square of the interval. We do not assume in advance that we know the amounts by which the squares and products must be multiplied; this is going to be discovered by observing physical phenomena. We know, however, certain things. We know that the old Newtonian physics is very nearly accurate when our co-ordinates have been chosen in a certain way. We know that the special theory of relativity is still more nearly accurate for suitable co-ordinates. From such facts we can [Pg 121] infer certain things about our new co-ordinates, which, in a logical deduction, appear as postulates of the new theory.

We assume that if two events are close together (but not necessarily in any other way), there is a measurable interval between them that can be calculated from the differences in their coordinates using a formula like the one we discussed in the previous chapter. In other words, we take the squares and products of the differences in coordinates, multiply them by suitable amounts (which generally vary by location), and then add the results together. The total we get is the square of the interval. We don’t assume we already know the amounts by which the squares and products should be multiplied; we will figure this out by observing physical phenomena. However, we do know certain things. We know that the old Newtonian physics is almost accurate when our coordinates are chosen in a specific way. We also know that the special theory of relativity is even more accurate for appropriate coordinates. From these observations, we can infer certain things about our new coordinates, which, through logical reasoning, emerge as postulates of the new theory. [Pg 121]

As such postulates we take:

As such, we assume:

1. That every body travels in a geodesic in space-time, except in so far as electromagnetic forces act upon it.

1. That every body moves along a geodesic in space-time, except to the extent that electromagnetic forces influence it.

2. That a light ray travels so that the interval between two parts of it is zero.

2. That a light ray travels in a way that the distance between two parts of it is zero.

3. That at a great distance from gravitating matter, we can transform our co-ordinates by mathematical manipulation so that the interval shall be what it is in the special theory of relativity; and that this is approximately true wherever gravitation is not very powerful.

3. That when we are far away from gravitational forces, we can change our coordinates through mathematical techniques so that the distance corresponds to what it is in the special theory of relativity; and that this is roughly accurate when gravity isn't very strong.

Each of these postulates requires some explanation.

Each of these assumptions needs some explanation.

We saw that a geodesic on a surface is the shortest line that can be drawn on the surface from one point to another; for example, on the earth the geodesics are great circles. When we come to space-time, the mathematics is the same, but the verbal explanations have to be rather different. In the general theory of relativity, it is only neighboring events that have a definite interval, independently of [Pg 122] the route by which we travel from one to the other. The interval between distant events depends upon the route pursued, and has to be calculated by dividing the route into a number of little bits and adding up the intervals for the various little bits. If the interval is space-like, a body cannot travel from one event to the other; therefore when we are considering the way bodies move, we are confined to time-like intervals. The interval between neighboring events, when it is time-like, will appear as the time between them for an observer who travels from the one event to the other. And so the whole interval between two events will be judged by a person who travels from one to the other to be what his clocks show to be the time that he has taken on the journey. For some routes this time will be longer, for others shorter; the more slowly the man travels, the longer he will think he has been on the journey. This must not be taken as a platitude. I am not saying that if you travel from London to Edinburgh you will take longer if you travel more slowly. I am saying something much more odd. I am saying that if you leave London at 10 a.m. and arrive in Edinburgh at 6.30 p.m. Greenwich time, the more slowly you [Pg 123] travel the longer you will take—if the time is judged by your watch. This is a very different statement. From the point of view of a person on the earth, your journey takes eight and a half hours. But if you had been a ray of light traveling round the solar system, starting from London at 10 a.m., reflected from Jupiter to Saturn, and so on, until at last you were reflected back to Edinburgh and arrived there at 6.30 p.m., you would judge that the journey had taken you exactly no time. And if you had gone by any circuitous route, which enabled you to arrive in time by traveling fast, the longer your route the less time you would judge that you had taken; the diminution of time would be continual as your speed approached that of light. Now I say that when a body travels, if it is left to itself, it chooses the route which makes the time between two stages of the journey as long as possible; if it had traveled from one event to another by any other route, the time, as measured by its own clocks, would have been shorter. This is a way of saying that bodies left to themselves do their journeys as slowly as they can; it is a sort of law of cosmic laziness. Its mathematical expression is that they travel in geodesics, in which the total interval between any two events on the journey is [Pg 124] greater than by any alternative route. (The fact that it is greater, not less, is due to the fact that the sort of interval we are considering is more analogous to time than to distance.) For example, if a person could leave the earth and travel about for a time and then return, the time between his departure and return would be less by his clocks than by those on the earth: the earth, in its journey round the sun, chooses the route which makes the time of any bit of its course by its clocks longer than the time as judged by clocks which move by a different route. This is what is meant by saying that bodies left to themselves move in geodesics in space-time.

We found that a geodesic on a surface is the shortest path you can draw from one point to another; for instance, on the Earth, the geodesics are great circles. When we move to space-time, the math stays the same, but the explanations need to be quite different. In the general theory of relativity, only neighboring events have a specific interval, regardless of how you travel between them. The interval between distant events depends on the path taken and needs to be calculated by breaking the route into small segments and adding up the intervals for those segments. If the interval is space-like, a body can't move from one event to the other; therefore, when considering how bodies move, we focus on time-like intervals. The interval between neighboring time-like events will appear to an observer traveling from one to the other as the time it takes for that journey. Thus, the entire interval between two events will be perceived by someone traveling the route as the time shown on their clocks for the trip. For some paths, this time will be longer, and for others, shorter; the slower someone travels, the longer they'll feel they’ve been traveling. This shouldn't be seen as a simple point. I'm not saying that if you travel from London to Edinburgh, it will take longer if you go slower. I'm making a much stranger claim. I'm saying that if you leave London at 10 a.m. and arrive in Edinburgh at 6:30 p.m. Greenwich time, the slower you travel, the longer you'll think you took—if you're judging by your own watch. This is a very different claim. From the perspective of someone on Earth, your journey takes eight and a half hours. But if you were a ray of light traveling around the solar system, leaving London at 10 a.m., bouncing off Jupiter to Saturn, and so on, until finally being reflected back to Edinburgh by 6:30 p.m., you would see that the journey took you absolutely no time. If you took any winding route that allowed you to arrive on time while traveling fast, the longer your path, the less time you would feel you took; the reduction in time would continue as your speed approached that of light. Now I say that when a body moves, if left to its own devices, it chooses the path that makes the time between two stages of the journey as long as possible; if it had taken any other route, the time measured by its own clocks would be shorter. This means that bodies left alone do their journeys as slowly as they can; it’s a sort of cosmic laziness law. Its mathematical form is that they travel in geodesics, where the total interval between any two events on the journey is greater than any alternative route. (The fact that it is greater, not less, is because the type of interval we're considering is more similar to time than to distance.) For example, if a person could leave Earth, travel for a while, and then return, the time as measured by their clocks would be less than that measured by clocks on Earth: Earth, in its orbit around the sun, chooses the route that makes the time of any segment of its path longer, according to its clocks, compared to clocks moving along a different route. This is what we mean when we say that bodies left alone move in geodesics in space-time.

We assume that the body considered is not acted upon by electromagnetic forces. We are concerned at present with the law of gravitation, not with the effects of electromagnetism. These effects have been brought into the framework of the general theory of relativity by Weyl,[5] but for the present we will ignore his work. The planets, in any case, are not subject, as wholes, to appreciable electromagnetic forces; it is only gravitation that has to be considered in accounting for their motions, with which we are concerned in this chapter. [Pg 125]

We assume that the body we're discussing is not influenced by electromagnetic forces. Right now, we're focusing on the law of gravitation, not on the effects of electromagnetism. Weyl has included these effects in the general theory of relativity, but for now, we'll ignore his work. The planets, as whole entities, aren't significantly affected by electromagnetic forces; we only need to consider gravitation to account for their movements, which is what this chapter is about. [Pg 125]

Our second postulate, that a light ray travels so that the interval between two parts of it is zero, has the advantage that it does not have to be stated only for small distances. If each little bit of interval is zero, the sum of them all is zero, and so even distant parts of the same light ray have a zero interval. The course of a light ray is also a geodesic according to the definition. Thus we now have two empirical ways of discovering what are the geodesics in space-time, namely light rays and bodies moving freely. Among freely-moving bodies are included all which are not subject to constraints or to electromagnetic forces, that is to say, the sun, stars, planets and satellites, and also falling bodies on the earth, at least when they are falling in a vacuum. When you are standing on the earth, you are subject to electromagnetic forces: the electrons and protons in the neighborhood of your feet exert a repulsion on your feet which is just enough to overcome the earth’s gravitation. This is what prevents you from falling through the earth, which, solid as it looks, is mostly empty space. [Pg 126]

Our second idea, that a light ray moves in a way that the distance between two points on it is zero, has the benefit of not needing to be limited to small distances. If every little bit of distance is zero, then adding them all together results in zero, which means that even distant points on the same light ray have a zero distance. A light ray's path is also a geodesic according to the definition. Thus, we now have two practical methods for determining what the geodesics are in space-time: light rays and freely moving bodies. Freely moving bodies include all those not affected by constraints or electromagnetic forces, such as the sun, stars, planets, and satellites, as well as falling objects on Earth, at least when they fall in a vacuum. When you stand on Earth, you experience electromagnetic forces: the electrons and protons close to your feet push against them just enough to counteract Earth's gravity. This is what keeps you from falling through the Earth, which, despite appearing solid, is mostly empty space. [Pg 126]

The third postulate, which relates the general to the special theory, is very useful. It is not necessary for the application of the special theory to a limited region that there should be no gravitation in the region; it is enough if the intensity of gravitation is practically the same throughout the region. This enables us to apply the special theory within any small region. How small it will have to be, depends upon the neighborhood. On the surface of the earth, it would have to be small enough for the curvature of the earth to be negligible. In the spaces between the planets, it need only be small enough for the attraction of the sun and the planets to be sensibly constant throughout the region. In the spaces between the stars it might be enormous—say half the distance from one star to the next—without introducing measurable inaccuracies.

The third principle, which connects the general and special theories, is very useful. For the special theory to be applied in a limited area, it's not necessary for there to be no gravity; it’s enough that the strength of gravity is nearly the same all over that area. This allows us to use the special theory in any small region. How small it needs to be depends on the surroundings. On the surface of the Earth, it has to be small enough that the curvature of the Earth is insignificant. In the spaces between the planets, it only needs to be small enough that the gravitational pull from the sun and the planets is effectively constant throughout the area. In the spaces between the stars, it could be quite large—like half the distance from one star to the next—without causing measurable errors.

At a great distance from gravitating matter, we can so choose our co-ordinates as to obtain a Euclidean space; this is really only another way of saying that the special theory of relativity applies. In the neighborhood of matter, although we can make our space Euclidean in any small region, we cannot do so throughout any region within [Pg 127] which gravitation varies sensibly—at least, if we do, we shall have to abandon the view that bodies move in geodesics. In the neighborhood of a piece of matter, there is, as it were, a hill in space-time; this hill grows steeper and steeper as it gets nearer the top, like the neck of a champagne bottle. It ends in a sheer precipice. Now by the law of cosmic laziness which we mentioned earlier, a body coming into the neighborhood of the hill will not attempt to go straight over the top, but will go round. This is the essence of Einstein’s view of gravitation. What a body does, it does because of the nature of space-time in its own neighborhood, not because of some mysterious force emanating from a distant body.

At a great distance from any matter, we can choose our coordinates in such a way that we achieve a Euclidean space; this really just means that the special theory of relativity is applicable. Close to matter, even though we can make our space Euclidean in small areas, we can't do this across any larger area where gravity changes noticeably—at least, if we try, we would have to give up the idea that objects move along geodesics. Near a mass, there's kind of a hill in space-time; this hill becomes steeper as you approach the top, similar to the neck of a champagne bottle. It ends in a sheer drop. Now, according to the law of cosmic laziness that we talked about earlier, an object coming near the hill won't try to go straight over the top, but will instead go around it. This captures Einstein's view of gravity. What an object does is determined by the nature of space-time in its vicinity, not because of some mysterious force coming from a distant object.

An analogy will serve to make the point clear. Suppose that on a dark night a number of men with lanterns were walking in various directions across a huge plain, and suppose that in one part of the plain there was a hill with a flaring beacon on the top. Our hill is to be such as we have described, growing steeper as it goes up, and ending in a precipice. I shall suppose that there are villages dotted about the plain, and the men with lanterns are walking to and from these various [Pg 128] villages. Paths have been made showing the easiest way from any one village to any other. These paths will all be more or less curved, to avoid going too far up the hill; they will be more sharply curved when they pass near the top of the hill than when they keep some way off from it. Now suppose that you are observing all this, as best you can, from a place high up in a balloon, so that you cannot see the ground, but only the lanterns and the beacon. You will not know that there is a hill, or that the beacon is at the top of it. You will see that people turn out of the straight course when they approach the beacon, and that the nearer they come the more they turn aside. You will naturally attribute this to an effect of the beacon; you may think that it is very hot and people are afraid of getting burnt. But if you wait for daylight you will see the hill, and you will find that the beacon merely marks the top of the hill and does not influence the people with lanterns in any way.

An analogy will help clarify the point. Imagine it's a dark night and several men with lanterns are walking in different directions across a vast plain. There's a hill in one area with a bright beacon at the top. This hill is steep, ending in a cliff. Let's also assume there are villages scattered throughout the plain, and the men with lanterns are traveling to and from these villages. Paths have been created showing the easiest routes from one village to another. These paths will all be somewhat curved to avoid climbing too much up the hill; they will curve more sharply as they get closer to the top compared to when they're farther away. Now, imagine you're observing all this from a balloon high up in the air, unable to see the ground but only the lanterns and the beacon. You won't recognize that there's a hill or that the beacon is on top of it. You’ll notice that people deviate from their straight path as they approach the beacon, and the closer they get, the more they veer off. You might assume this is because of the beacon's influence; perhaps you think it's very hot and people are worried about getting burned. But if you wait for daylight, you'll see the hill and realize that the beacon simply marks the hilltop and doesn’t affect the people with lanterns at all.

Now in this analogy the beacon corresponds to the sun, the people with lanterns correspond to the planets and comets, the paths correspond to their orbits, and the coming of daylight corresponds to the coming [Pg 129] of Einstein. Einstein says that the sun is at the top of a hill, only the hill is in space-time, not in space. (I advise the reader not to try to picture this, because it is impossible.) Each body, at each moment, adopts the easiest course open to it, but owing to the hill the easiest course is not a straight line. Each little bit of matter is at the top of its own little hill, like the cock on his own dung-heap. What we call a big bit of matter is a bit which is at the top of a big hill. The hill is what we know about; the bit of matter at the top is assumed for convenience. Perhaps there is really no need to assume it, and we could do with the hill alone, for we can never get to the top of any one else’s hill, any more than the pugnacious cock can fight the peculiarly irritating bird that he sees in the looking glass.

In this analogy, the beacon represents the sun, the people with lanterns represent the planets and comets, the paths represent their orbits, and the arrival of daylight represents the arrival of Einstein. Einstein argues that the sun is situated at the top of a hill, but this hill exists in space-time, not in space. (I suggest not trying to visualize this, as it's impossible.) Each body, at every moment, takes the easiest path available, but because of the hill, the easiest path isn't a straight line. Each small piece of matter is at the top of its own little hill, just like a rooster on its own pile of dung. What we consider a large piece of matter is one that sits atop a big hill. The hill represents what we know; the piece of matter at the top is something we assume for the sake of convenience. Maybe we don't really need to assume it, and we could manage with just the hill, since we can never reach the top of someone else’s hill, just as the aggressive rooster cannot fight the annoyingly familiar bird it sees in the mirror. [Pg 129]

I have given only a qualitative description of Einstein’s law of gravitation; to give its exact quantitative formulation is impossible without more mathematics than I am permitting myself. The most interesting point about it is that it makes the law no longer the result of action at a distance: the sun exerts no force on the planets whatever. Just as geometry has become physics, so, in a sense, physics [Pg 130] has become geometry. The law of gravitation has become the geometrical law that every body pursues the easiest course from place to place, but this course is affected by the hills and valleys that are encountered on the road.

I have provided just a general overview of Einstein’s law of gravitation; giving its precise mathematical expression is impossible without more math than I plan to cover. The most interesting aspect is that it removes the idea of action at a distance: the sun doesn’t exert any force on the planets at all. Just like geometry has evolved into physics, in a way, physics has transformed into geometry. The law of gravitation is now seen as the geometric principle that every body takes the easiest path from one point to another, but this path is influenced by the hills and valleys encountered along the way. [Pg 130]


[Pg 131]

[Pg 131]

CHAPTER IX:
PROOFS OF EINSTEIN’S
LAW OF GRAVITATION

The reasons for accepting Einstein’s law of gravitation rather than Newton’s are partly empirical, partly logical. We will begin with the former.

The reasons for accepting Einstein’s law of gravitation instead of Newton’s are partly based on observation and partly on logic. We'll start with the observation-based reasons.

Einstein’s law of gravitation gives very nearly the same results as Newton’s, when applied to the calculation of the orbits of the planets and their satellites. If it did not, it could not be true, since the consequences deduced from Newton’s law have been found to be almost exactly verified by observation. When, in 1915, Einstein first published his new law, there was only one empirical fact to which he could point to show that his theory was better than Newton’s. This was what is called the “motion of the perihelion of Mercury.”

Einstein’s law of gravitation gives results that are very close to Newton’s when calculating the orbits of the planets and their moons. If it didn’t, it couldn’t be true, since the outcomes derived from Newton’s law have been confirmed almost exactly by observations. When Einstein first published his new law in 1915, there was only one empirical fact he could use to demonstrate that his theory was superior to Newton’s. This was the so-called “motion of the perihelion of Mercury.”

The planet Mercury, like the other planets, moves round the sun in an ellipse, with the sun in one of the foci. At some points of its [Pg 132] orbit it is nearer to the sun than at other points. The point where it is nearest to the sun is called its “perihelion.” Now it was found by observation that, from one occasion when Mercury is nearest to the sun until the next, Mercury does not go exactly once round the sun, but a little bit more. The discrepancy is very small; it amounts to an angle of forty-two seconds in a century. That is to say, in each year the planet has to move rather less than half a second of angle after it has finished a complete revolution from the last perihelion before it reaches the next perihelion. This very minute discrepancy from Newtonian theory had puzzled astronomers. There was a calculated effect due to perturbations caused by the other planets, but this small discrepancy was the residue after allowing for these perturbations. Einstein’s theory accounted for this residue, as well as for its absence in the case of the other planets. (In them it exists, but is too small to be observed.) This was, at first, his only empirical advantage over Newton.

The planet Mercury, like the other planets, orbits the sun in an ellipse, with the sun located at one of the foci. At certain points in its orbit, Mercury is closer to the sun than at others. The point where it is closest to the sun is known as its “perihelion.” Observations showed that from one time Mercury is closest to the sun until the next, it doesn’t complete exactly one full orbit, but rather a tiny bit more. This discrepancy is very small; it amounts to an angle of forty-two seconds over a century. In other words, each year the planet has to move just under half a second of angle after completing a full revolution since the last perihelion before reaching the next perihelion. This tiny difference from Newtonian theory puzzled astronomers. There was a calculated effect from perturbations caused by other planets, but this small discrepancy remained after accounting for those perturbations. Einstein’s theory explained this residue, as well as why it doesn't occur with the other planets. (In those cases, it exists but is too small to be detected.) Initially, this was his only experimental advantage over Newton.

His second success was more sensational. According to orthodox opinion, light in a vacuum ought always to travel in straight lines. Not being composed of material particles, it ought to be unaffected [Pg 133] by gravitation. However, it was possible, without any serious breach with old ideas, to admit that, in passing near the sun, light might be deflected out of the straight path as much as if it were composed of material particles. Einstein, however, maintained, as a deduction from his law of gravitation, that light would be deflected twice as much as this. That is to say, if the light of a star passed very near the sun, Einstein maintained that the ray from the star would be turned through an angle of just under one and three-quarters seconds. His opponents were willing to concede half of this amount. Now it is not every day that a star almost in line with the sun can be seen. This is only possible during a total eclipse, and not always then, because there may be no bright stars in the right position. Eddington points out that, from this point of view, the best day of the year is May 29, because then there are a number of bright stars close to the sun. It happened by incredible good fortune that there was a total eclipse of the sun on May 29, 1919—the first year after the armistice. Two British expeditions photographed the stars near the sun during the eclipse, and the results confirmed Einstein’s prediction. Some astronomers [Pg 134] who remained doubtful whether sufficient precautions had been taken to insure accuracy were convinced when their own observations in a subsequent eclipse gave exactly the same result. Einstein’s estimate of the amount of the deflection of light by gravitation is therefore now universally accepted.

His second success was even more remarkable. Traditionally, it was believed that light in a vacuum should always travel in straight lines. Since light isn’t made up of material particles, it should be unaffected by gravity. However, it was acceptable, without straying too far from established ideas, to suggest that light might bend slightly when passing close to the sun, just as if it were made of material particles. Einstein, however, argued based on his law of gravitation that light would be deflected twice as much. This means that if the light from a star passed very close to the sun, Einstein claimed the ray would be bent by just under one and three-quarters seconds. His opponents agreed to half of that amount. It’s not common to see a star that nearly aligns with the sun; this only happens during a total eclipse, and even then, there may not be any bright stars in the right position. Eddington pointed out that the best day of the year for this is May 29, as several bright stars are near the sun then. Remarkably, there was a total solar eclipse on May 29, 1919—the first year after the armistice. Two British expeditions captured images of the stars near the sun during the eclipse, and the results supported Einstein’s prediction. Some astronomers, who were still uncertain if the methods ensured accuracy, became convinced when their observations during a later eclipse produced exactly the same outcome. Einstein’s estimate of how much gravity bends light is now widely accepted.

The third experimental test is on the whole favorable to Einstein, though the quantities concerned are so small that it is only just possible to measure them, and the result is therefore not decisive. But successive investigations have made it more and more probable that the small effect predicted by Einstein really occurs. Before explaining the effect in question, a few preliminary explanations are necessary. The spectrum of an element consists of certain lines of various shades of light, separated by a prism, and emitted by the element when it glows. They are the same (to a very close approximation) whether the element is in the earth or the sun or a star. Each line is of some definite shade of color, with some definite wave length. Longer wave lengths are towards the red end of the spectrum, shorter ones towards the violet end. When the source of light is moving towards you, the apparent wave [Pg 135] lengths grow shorter, just as waves at sea come quicker when you are traveling against the wind. When the source of light is moving away from you, the apparent wave lengths grow longer, for the same reason. This enables us to know whether the stars are moving towards us or away from us. If they are moving towards us, all the lines in the spectrum of an element are moved a little toward violet; if away from us, toward red. You may notice the analogous effect in sound any day. If you are in a station and an express comes through whistling, the note of the whistle seems much more shrill while the train is approaching you than when it has passed. Probably many people think the note has “really” changed, but in fact the change in what you hear is only due to the fact that the train was first approaching and then receding. To people in the train, there was no change of note. This is not the effect with which Einstein is concerned. The distance of the sun from the earth does not change much; for our present purposes, we may regard it as constant. Einstein deduces from his law of gravitation that any periodic process which takes place in an atom in the sun (whose [Pg 136] gravitation is very intense) must, as measured by our clocks, take place at a slightly slower rate than it would in a similar atom on the earth. The “interval” involved will be the same in the sun and on the earth, but the same interval in different regions does not correspond to exactly the same time; this is due to the “hilly” character of space-time which constitutes gravitation. Consequently any given line in the spectrum ought, when the light comes from the sun, to seem to us a little nearer the red end of the spectrum than if the light came from a source on the earth. The effect to be expected is very small—so small that there is still some slight uncertainty as to whether it exists or not. But it now seems highly probable that it exists.

The third experimental test generally supports Einstein, although the quantities involved are so tiny that they can barely be measured, making the results not entirely conclusive. However, ongoing research has increasingly suggested that the small effect Einstein predicted actually occurs. Before diving into the specific effect, a few initial explanations are needed. The spectrum of an element consists of specific lines of different shades of light, separated by a prism and emitted by the element when it glows. These lines are nearly identical whether the element is on Earth, in the sun, or in a star. Each line corresponds to a specific shade of color with a particular wavelength. Longer wavelengths are found at the red end of the spectrum, while shorter ones are at the violet end. When the light source is moving towards you, the apparent wavelengths become shorter, just like how waves at sea come closer when you're traveling against the wind. If the light source is moving away, the apparent wavelengths get longer for the same reason. This lets us determine whether stars are moving towards us or away from us. If they're approaching, all the lines in an element's spectrum shift slightly towards violet; if they're moving away, they shift towards red. You can notice a similar effect with sound any day. If you're at a station and an express train passes by whistling, the note seems much sharper as the train approaches you than after it has passed. Many people might think the note has truly changed, but the difference in what you hear is just because the train was first getting closer and then moving away. For people on the train, there is no change in the note. This is not the effect Einstein is addressing. The distance from the sun to the Earth doesn't change much; for our purposes, we can treat it as constant. Einstein infers from his law of gravitation that any regular process happening in an atom in the sun (where gravity is very strong) must, as measured by our clocks, occur at a slightly slower rate than it would in a similar atom on Earth. The “interval” involved will be the same on the sun and Earth, but the same interval in different locations does not correspond to exactly the same time; this is due to the “hilly” nature of space-time that makes up gravitation. Therefore, any specific line in the spectrum from the sun should appear a bit closer to the red end compared to light from a source on Earth. The expected effect is very small—so small that there is still some uncertainty about its existence. However, it now seems highly likely that it does exist.

No other measurable differences between the consequences of Einstein’s law and those of Newton’s have hitherto been discovered. But the above experimental tests are quite sufficient to convince astronomers that, where Newton and Einstein differ as to the motions of the heavenly bodies, it is Einstein’s law that gives the right results. Even if the empirical grounds in favor of Einstein stood alone, they would be conclusive. Whether his law represents the exact truth or not, it is [Pg 137] certainly more nearly exact than Newton’s, though the inaccuracies in Newton’s were all exceedingly minute.

No other measurable differences between the outcomes of Einstein’s law and Newton’s have been found so far. However, the experimental tests mentioned above are enough to persuade astronomers that, wherever Newton and Einstein disagree on the movements of celestial bodies, it’s Einstein’s law that provides the correct results. Even if the evidence supporting Einstein stood by itself, it would be definitive. Whether his law represents the exact truth or not, it is definitely closer to being accurate than Newton’s, even though the inaccuracies in Newton’s were all very slight. [Pg 137]

But the considerations which originally led Einstein to his law were not of this detailed kind. Even the consequence about the perihelion of Mercury, which could be verified at once from previous observations, could only be deduced after the theory was complete, and could not form any part of the original grounds for inventing such a theory. These grounds were of a more abstract logical character. I do not mean that they were not based upon observed facts, and I do not mean that they were à priori fantasies such as philosophers indulged in formerly. What I mean is that they were derived from certain general characteristics of physical experience, which showed that Newton must be wrong and that something like Einstein’s law must be substituted.

But the reasons that initially led Einstein to his law weren't this detailed. Even the finding about Mercury's orbit, which could be confirmed right away from earlier observations, could only be deduced once the theory was fully developed, and couldn’t be part of the original reasons for creating such a theory. These reasons were more abstract and logical. I'm not saying they weren't based on observed facts, nor that they were just random ideas like philosophers used to come up with. What I mean is that they came from certain general traits of physical experience that indicated that Newton had to be wrong and that something like Einstein's law needed to take its place.

The arguments in favor of the relativity of motion are, as we saw in earlier chapters, quite conclusive. In daily life, when we say that something moves, we mean that it moves relatively to the earth. In dealing with the motions of the planets, we consider them as moving [Pg 138] relatively to the sun, or to the center of mass of the solar system. When we say that the solar system itself is moving, we mean that it is moving relatively to the stars. There is no physical occurrence which can be called “absolute motion.” Consequently the laws of physics must be concerned with relative motions, since these are the only kind that occur.

The arguments supporting the relativity of motion are, as we discussed in earlier chapters, pretty convincing. In our everyday lives, when we say something is moving, we mean it's moving in relation to the earth. When looking at the movements of the planets, we think of them as moving relative to the sun or the center of mass of the solar system. When we claim that the solar system itself is moving, we mean it's moving in relation to the stars. There’s no physical event that can be called "absolute motion." Therefore, the laws of physics must deal with relative motions, since those are the only types that exist. [Pg 138]

We now take the relativity of motion in conjunction with the experimental fact that the velocity of light is the same relatively to one body as relatively to another, however the two may be moving. This leads us to the relativity of distances and times. This in turn shows that there is no objective physical fact which can be called “the distance between two bodies at a given time,” since the time and the distance will both depend on the observer. Therefore Newton’s law of gravitation is logically untenable, since it makes use of “distance at a given time.”

We now consider the relativity of motion along with the experimental fact that the speed of light is constant, regardless of the motion of the observer or the source. This brings us to the relativity of distances and times. Consequently, it shows that there’s no objective physical fact that can be referred to as “the distance between two bodies at a specific time,” because both time and distance depend on the observer. Thus, Newton’s law of gravitation is not logically sound, since it relies on “distance at a given time.”

This shows that we cannot rest content with Newton, but it does not show what we are to put in his place. Here several considerations enter in. We have in the first place what is called “the equality of gravitational and inertial mass.” What this means is as follows: [Pg 139] When you apply a given force[6] to a heavy body, you do not give it as much acceleration as you would to a light body. What is called the “inertial” mass of a body is measured by the amount of force required to produce a given acceleration. At a given point of the earth’s surface, the “mass” is proportional to the “weight.” What is measured by scales is rather the mass than the weight: the weight is defined as the force with which the earth attracts the body. Now this force is greater at the poles than at the equator, because at the equator the rotation of the earth produces a “centrifugal force” which partially counteracts gravitation. The force of the earth’s attraction is also greater on the surface of the earth than it is at a great height or at the bottom of a very deep mine. None of these variations are shown by scales, because they affect the weights used just as much as the body weighed; but they are shown if we use a spring balance. The mass does not vary in the course of these changes of weight. [Pg 140]

This shows that we can't be satisfied with Newton's theories, but it doesn't tell us what to replace them with. Several factors come into play here. First, there's what we call “the equality of gravitational and inertial mass.” This means that when you apply a specific force to a heavy object, it doesn't accelerate as much as it would if it were a lighter object. The “inertial” mass of an object is determined by the amount of force needed to produce a certain acceleration. At any specific spot on the Earth's surface, the “mass” is proportional to the “weight.” What we measure on scales is more accurately the mass than the weight: weight is defined as the force with which the Earth pulls on an object. This force is stronger at the poles than at the equator because the Earth's rotation generates a “centrifugal force” that partially counteracts gravity. The Earth's attraction is also stronger at the surface than it is at higher altitudes or at the bottom of a deep mine. None of these differences are shown by scales because they affect both the weights used and the object being weighed equally; however, they can be detected using a spring balance. The mass remains constant despite these weight changes. [Pg 140]

The “gravitational” mass is differently defined. It is capable of two meanings. We may mean (1), the way a body responds in a situation where gravitation has a known intensity, for example, on the surface of the earth, or on the surface of the sun; or (2), the intensity of the gravitational force produced by the body, as, for example, the sun produces stronger gravitational forces than the earth does. Newton says that the force of gravitation between two bodies is proportional to the product of their masses. Now let us consider the attraction of different bodies to one and the same body, say the sun. Then different bodies are attracted by forces which are proportional to their masses, and which, therefore, produce exactly the same acceleration in all of them. Thus if we mean “gravitational mass” in sense (1), that is to say, the way a body responds to gravitation, we find that “the equality of inertial and gravitational mass,” which sounds formidable, reduces to this: that in a given gravitational situation, all bodies behave exactly alike. As regards the surface of the earth, this was one of the first discoveries of Galileo. Aristotle thought that heavy bodies fall faster than light ones; Galileo showed that this is not the case, [Pg 141] when the resistance of the air is eliminated. In a vacuum, a feather falls as fast as a lump of lead. As regards the planets, it was Newton who established the corresponding facts. At a given distance from the sun, a comet, which has a very small mass, experiences exactly the same acceleration towards the sun as a planet experiences at the same distance. Thus the way in which gravitation affects a body depends only upon where the body is, and in no degree upon the nature of the body. This suggests that the gravitational effect is a characteristic of the locality, which is what Einstein makes it.

The "gravitational" mass has a different definition. It can mean two things. We might refer to (1) how a body reacts in an environment where gravity has a known strength, such as on the surface of the Earth or the Sun; or (2) the strength of the gravitational force generated by the body, like how the Sun exerts a stronger gravitational pull than the Earth. Newton states that the gravitational force between two bodies is proportional to the product of their masses. Now, let’s look at how different bodies are attracted to the same object, say the Sun. Various bodies are pulled by forces that are proportional to their masses, which accordingly produce the same acceleration in all of them. So, if we interpret "gravitational mass" in the first sense, meaning how a body reacts to gravity, we discover that “the equality of inertial and gravitational mass,” which sounds impressive, actually means that in a given gravitational situation, all bodies behave the same. Regarding the surface of the Earth, this was one of Galileo's first discoveries. Aristotle believed that heavier objects fall faster than lighter ones; Galileo proved this isn’t true when air resistance is removed. In a vacuum, a feather falls at the same rate as a lump of lead. For the planets, it was Newton who established similar facts. At a specific distance from the Sun, a comet with a very small mass experiences the same acceleration towards the Sun as a planet does at that distance. Therefore, how gravity affects a body only depends on its location and not at all on the kind of body it is. This implies that the gravitational effect is a property of the location, which is how Einstein interprets it. [Pg 141]

As for the gravitational mass in sense (2), i.e., the intensity of the force produced by a body, this is no longer exactly proportional to its inertial mass. The question involves some rather complicated mathematics, and I shall not go into it.[7]

As for gravitational mass in sense (2), i.e., the strength of the force created by a body, this is no longer exactly proportional to its inertial mass. The issue involves some pretty complicated math, and I won’t dive into it. [7]

We have another indication as to what sort of thing the law of gravitation must be, if it is to be a characteristic of a neighborhood, as we have seen reason to suppose that it is. It must [Pg 142] be expressed in some law which is unchanged when we adopt a different kind of co-ordinates. We saw that we must not, to begin with, regard our co-ordinates as having any physical significance: they are merely systematic ways of naming different parts of space-time. Being conventional, they cannot enter into physical laws. That means to say that, if we have expressed a law correctly in terms of one set of co-ordinates, it must be expressed by the same formula in terms of another set of co-ordinates. Or, more exactly, it must be possible to find a formula which expresses the law, and which is unchanged however we change the co-ordinates. It is the business of the theory of tensors to deal with such formulæ. And the theory of tensors shows that there is one formula which obviously suggests itself as being possibly the law of gravitation. When this possibility is examined, it is found to give the right results; it is here that the empirical confirmations come in. But if Einstein’s law had not been found to agree with experience, we could not have gone back to Newton’s law. We should have been compelled by logic to seek some law expressed in terms of “tensors,” and therefore independent of our choice of co-ordinates. [Pg 143] It is impossible without mathematics to explain the theory of tensors; the non-mathematician must be content to know that it is the technical method by which we eliminate the conventional element from our measurements and laws, and thus arrive at physical laws which are independent of the observer’s point of view. Of this method, Einstein’s law of gravitation is the most splendid example.

We have another clue about what the law of gravitation must be, if it truly represents a characteristic of a neighborhood, which we've reason to believe. It must be expressed in a way that stays true when we switch to a different set of coordinates. We noted that we shouldn't see our coordinates as having any physical meaning; they are just systematic ways of labeling different parts of space-time. Since they are conventional, they can't play a role in physical laws. This means that if we’ve accurately expressed a law using one set of coordinates, it must be stated using the same formula in another set. More precisely, there should be a formula that conveys the law, and remains consistent no matter how we alter the coordinates. The theory of tensors is responsible for handling such formulas. According to the theory of tensors, there’s one formula that clearly stands out as possibly being the law of gravitation. When this possibility is investigated, it shows the correct results; this is where the empirical confirmations come in. However, if Einstein’s law hadn't been found to match up with reality, we wouldn't have been able to revert back to Newton’s law. We would have been logically compelled to look for a law represented in terms of “tensors,” thus making it independent of our choice of coordinates. [Pg 142] It's impossible to explain the theory of tensors without math; those who aren't mathematicians must accept that it’s the technical method we use to remove the conventional elements from our measurements and laws, resulting in physical laws independent of the observer’s perspective. Einstein’s law of gravitation is the most impressive example of this method. [Pg 143]


[Pg 144]

[Pg 144]

CHAPTER X:
MASS, MOMENTUM, ENERGY
AND ACTION

The pursuit of quantitative precision is as arduous as it is important. Physical measurements are made with extraordinary exactitude; if they were made less carefully, such minute discrepancies as form the experimental data for the theory of relativity could never be revealed. Mathematical physics, before the coming of relativity, used a set of conceptions which were supposed to be as precise as physical measurements, but it has turned out that they were logically defective, and that this defectiveness showed itself in very small deviations from expectations based upon calculation. In this chapter I want to show how the fundamental ideas of pre-relativity physics are affected, and what modifications they have had to undergo.

The quest for precise measurements is both challenging and crucial. Physical measurements are taken with remarkable accuracy; if they weren't done so meticulously, the tiny discrepancies that form the experimental data for the theory of relativity would never come to light. Before relativity emerged, mathematical physics relied on a set of concepts that were thought to be as accurate as physical measurements, but it turns out they had logical flaws, and these flaws manifested as very slight differences from the expectations set by calculations. In this chapter, I want to demonstrate how the fundamental ideas of physics before relativity have been impacted and what changes they have had to undergo.

We have already had occasion to speak of mass. For purposes of [Pg 145] daily life, mass is much the same as weight; the usual measures of weight—ounces, grams, etc.—are really measures of mass. But as soon as we begin to make accurate measurements, we are compelled to distinguish between mass and weight. Two different methods of weighing are in common use, one, that of scales, the other that of the spring balance. When you go a journey and your luggage is weighed, it is not put on scales, but on a spring; the weight depresses the spring a certain amount, and the result is indicated by a needle on a dial. The same principle is used in automatic machines for finding your weight. The spring balance shows weight, but scales show mass. So long as you stay in one part of the world, the difference does not matter; but if you test two weighing machines of different kinds in a number of different places, you will find, if they are accurate, that their results do not always agree. Scales will give the same result anywhere, but a spring balance will not. That is to say, if you have a lump of lead weighing ten pounds by the scales, it will also weigh ten pounds by scales in any other part of the world. But if it weighs ten pounds by a spring balance in London, it will weigh more at the North Pole, less at the equator, less high up in an aeroplane, and less at the [Pg 146] bottom of a coal mine, if it is weighed in all those places on the same spring balance. The fact is that the two instruments measure quite different quantities. The scales measure what may be called (apart from refinements which will concern us presently) “quantity of matter.” There is the same “quantity of matter” in a pound of feathers as in a pound of lead. Standard “weights,” which are really standard “masses,” will measure the amount of mass in any substance put into the opposite scales. But “weight” is a properly due to the earth’s gravitation: It is the amount of the force by which the earth attracts a body. This force varies from place to place. In the first place, anywhere outside the earth the attraction varies inversely as the square of the distance from the center of the earth; it is therefore less at great heights. In the second place, when you go down a coal mine, part of the earth is above you, and attracts matter upwards instead of downwards, so that the net attraction downwards is less than on the surface of the earth. In the third place, owing to the rotation of the earth, there is what is called a “centrifugal force,” which acts against gravitation. This is greatest at the equator, because there the rotation of the [Pg 147] earth involves the fastest motion; at the poles it does not exist, because they are on the axis of rotation. For all these reasons, the force with which a given body is attracted to the earth is measureably different at different places. It is this force that is measured by a spring balance; that is why a spring balance gives different results in different places. In the case of scales, the standard “weights” are altered just as much as the body to be weighed, so that the result is the same everywhere; but the result is the “mass,” not the “weight.” A standard “weight” has the same mass everywhere, but not the same “weight”; it is in fact a unit of mass, not of weight. For theoretical purposes, mass, which is almost invariable for a given body, is much more important than weight, which varies according to circumstances. Mass may be regarded, to begin with, as “quantity of matter”; we shall see that this view is not strictly correct, but it will serve as a starting point for subsequent refinements.

We've already talked about mass. In everyday life, mass is pretty much the same as weight; the common weight measurements—ounces, grams, etc.—are actually measures of mass. However, when we start doing precise measurements, we need to differentiate between mass and weight. Two common methods of weighing are scales and spring balances. When you go on a trip and your luggage is weighed, it's placed on a spring, not on scales; the weight compresses the spring by a certain amount, and the result is shown by a needle on a dial. The same principle is used in automatic machines to find your weight. The spring balance shows weight, but scales show mass. As long as you stay in one part of the world, the difference isn’t a big deal; but if you test different types of weighing machines in a variety of locations, you’ll find that, if they’re accurate, their results don’t always match. Scales will give the same result anywhere, but a spring balance will not. For example, if you have a piece of lead weighing ten pounds on the scales, it will weigh ten pounds anywhere else using the scales. However, if it weighs ten pounds on a spring balance in London, it will weigh more at the North Pole, less at the equator, less at high altitude in an airplane, and less at the bottom of a coal mine, if measured on the same spring balance. The truth is that the two instruments measure completely different things. Scales measure what can be called (aside from some details we’ll cover later) "quantity of matter." There’s the same "quantity of matter" in a pound of feathers as in a pound of lead. Standard “weights,” which are really standard “masses,” will measure the amount of mass in any substance placed in the opposite scales. But “weight” is strictly related to the earth’s gravity: it’s the force with which the earth pulls on a body. This force changes from one place to another. First of all, anywhere outside the earth, the force varies inversely with the square of the distance from the center of the earth, meaning it's less at higher elevations. Secondly, when you go down a coal mine, part of the earth is above you, pulling matter upward instead of downward, so the net downward pull is less than on the earth’s surface. Thirdly, due to the earth’s rotation, there is a "centrifugal force" that acts against gravity. This force is strongest at the equator, where the earth spins the fastest; at the poles, it doesn’t exist because they are on the axis of rotation. For all these reasons, the force that attracts a given body to the earth varies measurably in different locations. This force is what a spring balance measures, which is why a spring balance gives different results in different places. With scales, the standard “weights” change just like the body being weighed, so the result is the same everywhere; but the result indicates “mass,” not “weight.” A standard “weight” has the same mass everywhere, but not the same “weight”; it’s actually a unit of mass, not weight. For theoretical purposes, mass, which is nearly consistent for a specific body, is much more significant than weight, which fluctuates based on circumstances. Mass can initially be viewed as “quantity of matter”; we’ll see that this perspective isn’t completely accurate, but it will serve as a good starting point for further details.

For theoretical purposes, a mass is defined as being determined by the amount of force required to produce a given acceleration: The more massive a body is, the greater will be the force required to alter its [Pg 148] velocity by a given amount in a given time. It takes a more powerful engine to make a long train attain a speed of ten miles an hour at the end of the first half-minute, than it does to make a short train do so. Or we may have circumstances where the force is the same for a number of different bodies; in that case, if we can measure the accelerations produced in them, we can tell the ratios of their masses: the greater the mass, the smaller the acceleration. We may take, in illustration of this method, an example which is important in connection with relativity. Radio-active bodies emit beta-particles (electrons) with enormous velocities. We can observe their path by making them travel through water vapor and form a cloud as they go. We can at the same time subject them to known electric and magnetic forces, and observe how much they are bent out of a straight line by these forces. This makes it possible to compare their masses. It is found that the faster they travel, the greater is their mass, as measured by the stationary observer; the increase is greatest as applied to their mass as measured by the effect of a force in the line of motion. In regard to forces at right angles to the line of motion, there is a change of mass with [Pg 149] velocity in the same proportion as the changes of length and time. It is known otherwise that, apart from the effect of motion, all electrons have the same mass.

For theoretical purposes, mass is defined by the amount of force needed to produce a specific acceleration: The more massive an object is, the more force it takes to change its velocity by a certain amount in a set time. It requires a more powerful engine to get a long train to reach a speed of ten miles per hour in the first half-minute than it does for a short train. There might be situations where the force is the same for several different objects; in that case, if we can measure the acceleration produced in them, we can determine their mass ratios: the larger the mass, the smaller the acceleration. To illustrate this method, we can use an example relevant to relativity. Radioactive materials emit beta particles (electrons) at extremely high speeds. We can track their path by having them move through water vapor and create a cloud as they do so. At the same time, we can apply known electric and magnetic forces to them and observe how much they veer off from a straight line due to these forces. This allows us to compare their masses. It's found that the faster they go, the greater their mass appears to a stationary observer; the increase is most significant in relation to their mass as measured by the effect of a force moving in the same direction. When it comes to forces that are perpendicular to the direction of motion, mass changes with velocity in the same way that length and time do. It's also known that, aside from the effects of motion, all electrons have the same mass.

All this was known before the theory of relativity was invented, but it showed that the traditional conception of mass had not quite the definiteness that had been ascribed to it. Mass used to be regarded as “quantity of matter,” and supposed to be quite invariable. Now mass was found to be relative to the observer, like length and time, and to be altered by motion in exactly the same proportion. However, this could be remedied. We could take the “proper mass,” the mass as measured by an observer who shares the motion of the body. This was easily inferred from the measured mass, by taking the same proportion as in the case of lengths and times.

All of this was known before the theory of relativity was developed, but it demonstrated that the traditional understanding of mass wasn’t as clear-cut as it was thought to be. Mass used to be seen as “amount of matter” and believed to be completely constant. Now, mass was found to be relative to the observer, just like length and time, and it changes in the same way with movement. However, this issue could be resolved. We could use the “proper mass,” which is the mass measured by someone who is moving with the object. This could be easily calculated from the measured mass by using the same ratio as with lengths and times.

But there is a more curious fact, and that is, that after we have made this correction we still have not obtained a quantity which is at all times exactly the same for the same body. When a body absorbs energy—for example, by growing hotter—its “proper mass” increases slightly. The increase is very slight, since it is measured [Pg 150] by dividing the increase of energy by the square of the velocity of light. On the other hand, when a body parts with energy it loses mass. The most notable case of this is that four hydrogen atoms can come together to make one helium atom, but a helium atom has rather less than four times the mass of one hydrogen atom.

But there's a more interesting point, which is that even after we make this correction, we still don't have a quantity that is consistently the same for the same body. When a body takes in energy—like when it gets hotter—its “proper mass” increases slightly. This increase is very small, as it is calculated by dividing the increase in energy by the square of the speed of light. Conversely, when a body loses energy, it loses mass. The most notable example of this is when four hydrogen atoms combine to form one helium atom, but a helium atom has slightly less mass than four hydrogen atoms combined. [Pg 150]

We have thus two kinds of mass, neither of which quite fulfils the old ideal. The mass as measured by an observer who is in motion relative to the body in question is a relative quantity, and has no physical significance as a property of the body. The “proper mass” is a genuine property of the body, not dependent upon the observer; but it, also, is not strictly constant. As we shall see shortly, the notion of mass becomes absorbed into the notion of energy; it represents, so to speak, the energy which the body expends internally, as opposed to that which it displays to the outer world.

We now have two types of mass, neither of which fully meets the old standard. The mass measured by an observer who is moving relative to the object in question is a relative quantity and doesn’t have any real physical meaning as a characteristic of the object. The “proper mass” is a true characteristic of the object, independent of the observer; however, it is also not strictly constant. As we will see shortly, the idea of mass becomes intertwined with the idea of energy; it represents, in a sense, the energy that the object uses internally, as opposed to the energy it shows to the outside world.

Conservation of mass, conservation of momentum, and conservation of energy were the great principles of classical mechanics. Let us next consider conservation of momentum.

Conservation of mass, conservation of momentum, and conservation of energy were the fundamental principles of classical mechanics. Next, let's take a look at conservation of momentum.

The momentum of a body in a given direction is its velocity in that direction multiplied by its mass. Thus a heavy body moving slowly may [Pg 151] have the same momentum as a light body moving fast. When a number of bodies interact in any way, for instance by collisions, or by mutual gravitation, so long as no outside influences come in, the total momentum of all the bodies in any direction remains unchanged. This law remains true in the theory of relativity. For different observers, the mass will be different, but so will the velocity; these two differences neutralize each other, and it turns out that the principle still remains true.

The momentum of an object moving in a specific direction is its velocity in that direction times its mass. So, a heavy object moving slowly can have the same momentum as a light object moving quickly. When multiple objects interact, like in collisions or through gravitational attraction, as long as there are no external influences, the total momentum of all the objects in any direction stays the same. This law holds in the theory of relativity as well. For different observers, the mass will differ, but so will the velocity; these two differences balance each other out, proving that the principle still stands. [Pg 151]

The momentum of a body is different in different directions. The ordinary way of measuring it is to take the velocity in a given direction (as measured by the observer) and multiply it by the mass (as measured by the observer). Now the velocity in a given direction is the distance traveled in that direction in unit time. Suppose we take instead the distance traveled in that direction while the body moves through unit “interval.” (In ordinary cases, this is only a very slight change, because, for velocities considerably less than that of light, interval is very nearly equal to lapse of time.) And suppose that [Pg 152] instead of the mass as measured by the observer we take the proper mass. These two changes increase the velocity and diminish the mass, both in, the same proportion. Thus the momentum remains the same, but the quantities that vary according to the observer have been replaced by quantities which are fixed independently of the observer—with the exception of the distance traveled by the body in the given direction.

The momentum of an object varies in different directions. The usual way to measure it is to take the velocity in a specific direction (as seen by the observer) and multiply it by the mass (as determined by the observer). The velocity in a given direction represents the distance covered in that direction over a unit of time. Instead, let’s consider the distance traveled in that direction while the object moves through a unit “interval.” (Typically, this is only a slight change, because for speeds significantly less than that of light, the interval is almost equal to the passage of time.) Now, instead of using the mass measured by the observer, let’s use the proper mass. These adjustments increase the velocity and decrease the mass, both by the same proportion. Therefore, the momentum stays the same, but the quantities that vary depending on the observer have been replaced by fixed quantities—except for the distance covered by the object in the specified direction. [Pg 152]

When we substitute space-time for time, we find that the measured mass (as opposed to the proper mass) is a quantity of the same kind as the momentum in a given direction; it might be called the momentum in the time direction. The measured mass is obtained by multiplying the invariant mass by the time traversed in traveling through unit interval; the momentum is obtained by multiplying the same invariant mass by the distance traversed (in the given direction) in traveling through unit interval. From a space-time point of view, these naturally belong together.

When we replace space-time with just time, we see that measured mass (unlike proper mass) is similar to momentum in a specific direction; we could refer to it as momentum in the time direction. The measured mass is calculated by multiplying the invariant mass by the time taken to travel through a unit interval; the momentum is calculated by multiplying the same invariant mass by the distance traveled (in that direction) through a unit interval. From a space-time perspective, these concepts naturally go together.

Although the measured mass of a body depends upon the way the observer is moving relatively to the body, it is none the less a very important [Pg 153] quantity. For any given observer, the measured mass of the whole physical universe is constant.[8] The proper mass of all the bodies in the world is not necessarily the same at one time as at another, so that in this respect the measured mass has an advantage. The conservation of measured mass is the same thing as the conservation of energy. This may seem surprising, since at first sight mass and energy are very different things. But it has turned out that energy is the same thing as measured mass. To explain how this comes about is not easy; nevertheless we will make the attempt.

Although the mass of an object depends on the observer's movement in relation to it, it's still a very important quantity. For any specific observer, the mass of the entire physical universe remains constant. The actual mass of all objects in the world may not be the same at one time as it is at another, which is where the measured mass has an advantage. The conservation of measured mass is equivalent to the conservation of energy. This might seem surprising at first since mass and energy appear to be very different things. However, it turns out that energy is essentially the same as measured mass. Explaining how this works isn’t easy, but we’ll give it a try. [Pg 153] [8]

In popular talk, “mass” and “energy” do not mean at all the same thing. We associate “mass” with the idea of a fat man in a chair, very slow to move, while “energy” suggests a thin person full of hustle and “pep.” Popular talk associates “mass” and “inertia,” but its view of inertia is one-sided: it includes slowness in beginning to move, but not slowness in stopping, which is equally involved. All these terms have technical meanings in physics, which are only more or less analogous [Pg 154] to the meanings of the terms in popular talk. For the present, we are concerned with the technical meaning of “energy.”

In everyday conversations, “mass” and “energy” don't mean the same thing at all. We picture “mass” as a heavy guy sitting in a chair, really slow to move, while “energy” brings to mind a lean person brimming with enthusiasm and activity. People usually connect “mass” with “inertia,” but this view of inertia is limited: it covers being slow to start moving, but doesn’t account for being slow to stop, which is just as important. All these terms have specific meanings in physics that are only somewhat similar to how we use them in casual language. Right now, we’re focusing on the technical definition of “energy.” [Pg 154]

Throughout the latter half of the nineteenth century, a great deal was made of the “conservation of energy,” or the “persistence of force,” as Herbert Spencer preferred to call it. This principle was not easy to state in a simple way, because of the different forms of energy; but the essential point was that energy is never created or destroyed, though it can be transformed from one kind into another. The principle acquired its position through Joule’s discovery of “the mechanical equivalent of heat,” which showed that there was a constant proportion between the work required to produce a given amount of heat and the work required to raise a given weight through a given height: in fact, the same sort of work could be utilized for either purpose according to the mechanism. When heat was found to consist in motion of molecules, it was seen to be natural that it should be analogous to other forms of energy. Broadly speaking, by the help of a certain amount of theory, all forms of energy were reduced to two, which were called respectively “kinetic” and “potential.” These were defined as follows: [Pg 155]

Throughout the latter half of the nineteenth century, a lot of focus was put on the “conservation of energy,” or “persistence of force,” as Herbert Spencer liked to call it. This principle wasn’t easy to explain simply due to the various forms of energy; the key idea was that energy is never created or destroyed, but it can change from one type to another. The principle gained recognition through Joule’s discovery of “the mechanical equivalent of heat,” which demonstrated that there was a consistent relationship between the work needed to create a certain amount of heat and the work needed to lift a specific weight to a certain height: in fact, the same kind of work could be used for either purpose depending on the mechanism. When heat was discovered to be the result of molecular motion, it became clear that it should be comparable to other forms of energy. Generally speaking, with the aid of some theoretical framework, all forms of energy were categorized into two types, referred to as “kinetic” and “potential.” These were defined as follows: [Pg 155]

The kinetic energy of a particle is half the mass multiplied by the square of the velocity. The kinetic energy of a number of particles is the sum of the kinetic energies of the separate particles.

The kinetic energy of a particle is half the mass times the square of the velocity. The kinetic energy of multiple particles is the total of the kinetic energies of each individual particle.

The potential energy is more difficult to define. It represents any state of strain, which can only be preserved by the application of force. To take the easiest case: If a weight is lifted to a height and kept suspended, it has potential energy, because, if left to itself, it will fall. Its potential energy is equal to the kinetic energy which it would acquire in falling through the same distance through which it was lifted. Similarly when a comet goes round the sun in a very eccentric orbit, it moves much faster when it is near the sun than when it is far from it, so that its kinetic energy is much greater when it is near the sun. On the other hand, its potential energy is greatest when it is farthest from the sun, because it is then like the stone which has been lifted to a height. The sum of the kinetic and potential energies of the comet is constant, unless it suffers collisions or loses matter by forming a tail. We can determine accurately the change of [Pg 156] potential energy in passing from one position to another, but the total amount of it is to a certain extent arbitrary, since we can fix the zero level where we like. For example, the potential energy of our stone may be taken to be the kinetic energy it would acquire in falling to the surface of the earth, or what it would acquire in falling down a well to the center of the earth, or any assigned lesser distance. It does not matter which we take, so long as we stick to our decision. We are concerned with a profit-and-loss account, which is unaffected by the amount of the assets with which we start.

Potential energy is harder to define. It represents any state of strain, which can only be maintained by applying force. To illustrate with a simple example: if a weight is lifted to a height and held up, it has potential energy because, if released, it will fall. Its potential energy equals the kinetic energy it would gain while falling through the same distance that it was lifted. Similarly, when a comet orbits the sun in a highly eccentric path, it moves much faster when it's close to the sun than when it's farther away, so its kinetic energy is significantly greater when it's near the sun. Conversely, its potential energy is highest when it's farthest from the sun, resembling the stone that has been lifted to a height. The total of the kinetic and potential energies of the comet remains constant unless it collides with something or loses matter by forming a tail. We can accurately determine the change in potential energy as it moves from one position to another, but the total amount is somewhat arbitrary since we can define the zero level wherever we choose. For instance, we can consider the potential energy of our stone as the kinetic energy it would gain if it fell to the earth's surface, or the energy it would gain falling down a well to the earth's center, or any other shorter distance. It doesn't matter which reference point we choose, as long as we remain consistent. We're dealing with a profit-and-loss account that remains unaffected by the initial amount of assets. [Pg 156]

Both the kinetic and the potential energies of a given set of bodies will be different for different observers. In classical dynamics, the kinetic energy differed according to the state of motion of the observer, but only by a constant amount; the potential energy did not differ at all. Consequently, for each observer, the total energy was constant—assuming always that the observers concerned were moving in straight lines with uniform velocities, or, if not, were able to refer their motions to bodies which were so moving. But in relativity dynamics the matter becomes more complicated. We cannot profitably [Pg 157] adapt the idea of potential energy to the theory of relativity, and therefore the conservation of energy, in a strict sense, cannot be maintained. But we obtain a property, closely analogous to conservation, which applies to kinetic energy alone. As Eddington puts it: the kinetic energy is not always strictly conserved, and the classical theory therefore introduces a supplementary quantity, the potential energy, so that the sum of the two is strictly conserved. The relativity treatment, on the other hand, discovers another formula, analogous to the one expressing conservation, which holds always for the kinetic energy. “The relativity treatment adheres to the physical quantity and modifies the law; the classical treatment adheres to the law and modifies the physical quantity.” The new formula, he continues, may be spoken of “as the law of conservation of energy and momentum, because, though it is not formally a law of conservation, it expresses exactly the phenomena which classical mechanics attributes to conservation.”[9] It is only in this modified and less rigorous sense that the conservation of energy remains true.

Both the kinetic and potential energies of a specific group of bodies will vary for different observers. In classical dynamics, the kinetic energy changed depending on the observer's state of motion, but only by a fixed amount; the potential energy remained the same. As a result, for each observer, the total energy was constant—assuming that the observers were moving in straight lines at constant speeds or, if not, were able to relate their movements to bodies that were. However, in relativity dynamics, things get more complex. We can’t effectively adapt the concept of potential energy to relativity, so the conservation of energy, in a strict sense, can't be upheld. Instead, we gain a property that's similar to conservation that applies only to kinetic energy. As Eddington explains: kinetic energy isn't always strictly conserved, which leads classical theory to introduce an additional quantity, potential energy, so that their sum is strictly conserved. On the other hand, relativity uncovers another formula similar to conservation that always holds for kinetic energy. “The relativity treatment focuses on the physical quantity and adjusts the law; the classical treatment focuses on the law and adjusts the physical quantity.” He goes on to say that this new formula can be referred to “as the law of conservation of energy and momentum because, although it isn't formally a law of conservation, it accurately describes the phenomena that classical mechanics attributes to conservation.”[9] It is only in this modified and less strict sense that the conservation of energy remains valid.

What is meant by “conservation” in practice is not exactly what it [Pg 158] means in theory. In theory we say that a quantity is conserved when the amount of it in the world is the same at any one time as at any other. But in practice we cannot survey the whole world, so we have to mean something more manageable. We mean that, taking any given region, if the amount of the quantity in the region has changed, it is because some of the quantity has passed across the boundary of the region. If there were no births and deaths, population would be conserved; in that case the population of a country could only change by emigration or immigration, that is to say, by passing across the boundaries. We might be unable to take an accurate census of China or Central Africa, and, therefore, we might not be able to ascertain the total population of the world. But we should be justified in assuming it to be constant if, wherever statistics were possible, the population never changed except through people crossing the frontiers. In fact, of course, population is not conserved. A physiologist of my acquaintance once put four mice into a thermos. Some hours later, when he went to take them out, there were eleven of them. But mass is not subject to these fluctuations: [Pg 159] the mass of the eleven mice at the end of the time was no greater than the mass of the four at the beginning.

What we mean by “conservation” in practice isn’t quite what it means in theory. In theory, we say that a quantity is conserved when its amount in the world is the same at any one time as at any other time. But in practice, we can’t survey the entire world, so we have to refer to something more manageable. We mean that within a specific region, if the amount of that quantity has changed, it’s because some of it has crossed the boundary of the region. If there were no births and deaths, the population would be conserved; in that case, a country’s population could only change through emigration or immigration, meaning by crossing the borders. We might struggle to take an accurate census of China or Central Africa, and therefore, we might not be able to determine the total population of the world. However, we would be justified in assuming it to be constant if, wherever statistics were available, the population only changed due to people crossing the borders. In reality, though, population isn’t conserved. A physiologist I know once put four mice into a thermos. A few hours later, when he went to take them out, there were eleven. But mass doesn’t experience these fluctuations: the mass of the eleven mice at the end of that time was no greater than the mass of the four at the beginning.

This brings us back to the problem for the sake of which we have been discussing energy. We stated that, in relativity theory, measured mass and energy are regarded as the same thing, and we undertook to explain why. It is now time to embark upon this explanation. But here, as at the end of Chapter VI, the totally unmathematical reader will do well to skip, and begin again at the following paragraph.

This brings us back to the issue we've been discussing regarding energy. We mentioned that in relativity theory, measured mass and energy are considered the same thing, and we will explain why. Now it's time to start that explanation. However, as noted at the end of Chapter VI, readers who are not comfortable with math may want to skip ahead and start again at the next paragraph.

Let us take the velocity of light as the unit of velocity; this is always convenient in relativity theory. Let m be the proper mass of a particle, v its velocity relative to the observer. Then its measured mass will be

Let’s use the speed of light as our unit of velocity; this is always handy in relativity theory. Let m be the rest mass of a particle and v its speed relative to the observer. Then its measured mass will be

m
———
√(1 - )

while its kinetic energy, according to the usual formula, will be

while its kinetic energy, according to the usual formula, will be

½ mv²

½ mv²

As we saw before, energy only occurs in a profit-and-loss account, so[Pg 160] that we can add any constant quantity to it that we like. We may therefore take the energy to be

As we saw earlier, energy only exists in a profit-and-loss account, so[Pg 160] we can add any constant quantity to it that we want. Therefore, we can consider the energy to be

m + ½ mv²

m + ½ mv²

Now if  v  is a small fraction of the velocity of light,

Now if  v  is a small fraction of the speed of light,

m + ½ mv²

m + ½ mv²

is almost exactly equal to

is nearly identical to

m
———
√(1 - )

Consequently, for velocities such as large bodies have, the energy and the measured mass turn out to be indistinguishable within the limits of accuracy attainable. In fact, it is better to alter our definition of energy, and take it to be

Consequently, for speeds that large bodies reach, energy and measured mass end up being indistinguishable within the limits of achievable accuracy. In fact, it makes more sense to change our definition of energy and consider it to be

m
———
√(1 - )

because this is the quantity for which the law analogous to conservation holds. And when the velocity is very great, it gives a better measure of energy than the traditional formula. The traditional formula must therefore be regarded as an approximation, of which the new formula gives the exact version. In this way, energy and measured mass become identified.

because this is the amount for which the law similar to conservation applies. And when the speed is really high, it provides a more accurate measure of energy than the traditional formula. The traditional formula should therefore be seen as an approximation, while the new formula offers the precise version. In this way, energy and measured mass become the same.

I come now to the notion of “action,” which is less familiar to the general public than energy, but has become more important in [Pg 161] relativity physics, as well as in the theory of quanta.[10] (The quantum is a small amount of action.) The word “action” is used to denote energy multiplied by time. That is to say, if there is one unit of energy in a system, it will exert one unit of action in a second, 100 units of action in 100 seconds, and so on; a system which has 100 units of energy will exert 100 units of action in a second, and 10,000 in 100 seconds, and so on. “Action” is thus, in a loose sense, a measure of how much has been accomplished: it is increased both by displaying more energy and by working for a longer time. Since energy is the same thing as measured mass, we may also take action to be measured mass multiplied by time. In classical mechanics, the “density” of matter in any region is the mass divided by the volume; that is to say, if you know the density in a small region, you discover the total amount of matter by multiplying the density by the volume of the small region. In relativity mechanics, we always want to substitute space-time for space; therefore a “region” must no longer be taken to [Pg 162] be merely a volume, but a volume lasting for a time; a small region will be a small volume lasting for a small time. It follows that, given the density, a small region in the new sense contains, not a small mass merely, but a small mass multiplied by a small time, that is to say, a small amount of “action.” This explains why it is to be expected that “action” will prove of fundamental importance in relativity mechanics. And so in fact it is.

I now turn to the concept of “action,” which is less well-known to the general public than energy, but has become increasingly significant in relativity physics and in the theory of quanta. [Pg 161] (A quantum is a small amount of action.) The term “action” refers to energy multiplied by time. In other words, if there is one unit of energy in a system, it will produce one unit of action in a second, 100 units of action in 100 seconds, and so on; a system with 100 units of energy will generate 100 units of action in a second and 10,000 in 100 seconds, etc. “Action” is thus, in a broad sense, a measure of what has been achieved: it increases both by applying more energy and by working longer. Since energy equals measured mass, we can also consider action to be measured mass multiplied by time. In classical mechanics, the “density” of matter in any region is the mass divided by the volume; so, if you know the density in a small area, you can find the total amount of matter by multiplying the density by the volume of that area. In relativity mechanics, we replace space with space-time; therefore, a “region” should no longer just be seen as a volume, but as a volume over time; a small region will be a small volume over a small duration. This means that, given the density, a small region in this new understanding contains not just a small mass, but a small mass multiplied by a small time, which is to say, a small amount of “action.” This explains why it is expected that “action” will be fundamentally important in relativity mechanics. And so it is. [Pg 162]

All the laws of dynamics have been put together into one principle, called “The Principle of Least Action.” This states that, in passing from one state to another, a body chooses a route involving less action than any slightly different route—again a law of cosmic laziness. The principle is subject to certain limitations, which have been pointed out by Eddington,[11] but it remains one of the most comprehensive ways of stating the purely formal part of mechanics. The fact that the quantum is a unit of action seems to show that action is also fundamental in the empirical structure of the world. But at present there is no bridge connecting the quantum with the theory of relativity.

All the laws of dynamics have been combined into one principle, known as “The Principle of Least Action.” This states that when moving from one state to another, a body chooses a path that involves less action than any slightly different path—essentially a law of cosmic laziness. The principle does have some limitations, which Eddington has noted, but it remains one of the most comprehensive ways to express the purely formal aspect of mechanics. The fact that the quantum is a unit of action suggests that action is also fundamental to the empirical structure of the world. However, currently, there is no connection between quantum mechanics and the theory of relativity.


[Pg 163]

[Pg 163]

CHAPTER XI:
IS THE UNIVERSE FINITE?

We have been dealing hitherto with matters that must be regarded as acquired scientific results—not that they will never be found to need improvement, but that further progress must be built upon them, as Einstein is built upon Newton. Science does not aim at establishing immutable truths and eternal dogmas: its aim is to approach truth by successive approximations, without claiming that at any stage final and complete accuracy has been achieved. There is a difference, however, between results which are pretty certainly in the line of advance, and speculations which may or may not prove to be well founded. Some very interesting speculations are connected with the theory of relativity, and we shall consider certain of them. But it must not be supposed that we are dealing with theories having the same solidity as those with which we have been concerned hitherto.

We have been discussing issues that should be seen as established scientific results—not that they won’t ever need improvement, but that further advancements must build on them, just like Einstein built on Newton. Science doesn’t aim to establish unchangeable truths or eternal dogmas; its goal is to get closer to the truth through ongoing refinements, without claiming that any point reached is the ultimate and complete accuracy. However, there’s a distinction between results that are quite certainly on the right track and speculations that might or might not be accurate. Some intriguing speculations relate to the theory of relativity, and we will examine a few of them. However, it shouldn’t be assumed that we’re discussing theories that have the same level of certainty as those we’ve previously focused on.

One of the most fascinating of the speculations to which I have been [Pg 164] alluding is the suggestion that the universe may be of finite extent. Two somewhat different finite universes have been constructed, one by Einstein, the other by De Sitter. Before considering their differences, we will discuss what they have in common.

One of the most interesting ideas I’ve mentioned is the suggestion that the universe might be finite. Two somewhat different finite universes have been created, one by Einstein and the other by De Sitter. Before we look at their differences, let’s talk about what they have in common. [Pg 164]

There are, to begin with, certain reasons for thinking that the total amount of matter in the universe is limited. If this were not the case, the gravitational effects of enormously distant matter would make the kind of world in which we live impossible. We must therefore suppose that there is some definite number of electrons and protons in the world: theoretically, a complete census would be possible. These are all contained within a certain finite region; whatever space lies outside that region is, so to speak, waste, like unfurnished rooms in a house too large for its inhabitants. This seems futile, but in former days no one knew of any alternative possibility. It was obviously impossible to conceive of an edge to space, and therefore, it was thought, space must be infinite.

There are, to start with, several reasons to believe that the total amount of matter in the universe is limited. If it weren’t, the gravitational effects of incredibly distant matter would make the kind of world we live in impossible. We must, therefore, assume that there is a definite number of electrons and protons in the universe: theoretically, a complete count would be possible. All of these are contained within a specific finite region; whatever space exists outside that region is, so to speak, wasted, like empty rooms in a house that’s too big for its residents. This may seem pointless, but in the past, no one knew of any alternative. It was clearly impossible to imagine an edge to space, and thus, it was believed that space must be infinite.

Non-Euclidean geometry, however, showed other possibilities. The surface of a sphere has no boundary, yet it is not infinite. In [Pg 165] traveling round the earth, we never reach “the edge of the world,” and yet the earth is not infinite. The surface of the earth is contained in three-dimensional space, but there is no reason in logic why three-dimensional space should not be constructed on an analogous plan. What we imagine to be straight lines going on for ever will then be like great circles on a sphere: they will ultimately return to their starting point. There will not be in the universe anything straighter than these great circles; the Euclidean straight line may remain as a beautiful dream, but not as a possibility in the actual world. In particular, light rays in empty space will travel in what are really great circles. If we could make measurements with sufficient accuracy, we should be able to infer this state of affairs even from a small part of space, because the sum of the angles of a triangle would always be greater than two right angles, and the excess would be proportional to the size of the triangle. The suggestion we have to consider is the suggestion that our universe may be spherical in this sense.

Non-Euclidean geometry, however, revealed other possibilities. The surface of a sphere has no boundaries, yet it isn’t infinite. In traveling around the Earth, we never reach “the edge of the world,” but the Earth isn’t infinite. The surface of the Earth exists within three-dimensional space, but there’s no logical reason why three-dimensional space shouldn't be set up similarly. What we think of as straight lines stretching on forever will be like great circles on a sphere: they’ll eventually loop back to where they started. There won’t be anything straighter in the universe than these great circles; the Euclidean straight line may remain a beautiful ideal but not a reality in the actual world. Specifically, light rays in empty space will move in what are effectively great circles. If we could measure with enough precision, we could deduce this from even a small portion of space because the sum of the angles in a triangle would always be greater than two right angles, and the excess would relate to the size of the triangle. The idea we need to consider is that our universe might be spherical in this way.

The reader must not confuse this suggestion with the non-Euclidean character of space upon which the new law of gravitation depends. The [Pg 166] latter is concerned with small regions such as the solar system. The departures from flatness which it notices are like hills and valleys on the surface of the earth, local irregularities, not characteristics of the whole. We are now concerned with the possible curvature of the universe as a whole, not with the occasional ups and downs due to the sun and the stars. It is suggested that on the average, and in regions remote from matter, the universe is not quite flat, but has a slight curvature, analogous, in three dimensions, to the curvature of a sphere in two dimensions.

The reader should not confuse this suggestion with the non-Euclidean nature of space that the new law of gravitation relies on. The latter focuses on small areas like the solar system. The deviations from flatness it observes are similar to hills and valleys on the Earth's surface—local irregularities, not traits of the entire structure. We are currently looking at the potential curvature of the universe as a whole, rather than the occasional ups and downs caused by the sun and the stars. It’s suggested that, on average, and in areas far from matter, the universe isn’t completely flat, but has a slight curvature, similar, in three dimensions, to the curvature of a sphere in two dimensions.

It is important to realize, in the first place, that there is not the slightest reason à priori why this should not be the case. People unaccustomed to non-Euclidean geometry may feel that, even if such a thing be logically possible, the world simply cannot be so odd as all that. We all have a tendency to think that the world must conform to our prejudices. The opposite view involves some effort of thought, and most people would die sooner than think—in fact, they do so. But the fact that a spherical universe seems odd to people who have been brought up on Euclidean prejudices is no evidence that it is impossible. There is no law of nature to the [Pg 167] effect that what is taught at school must be true. We cannot therefore dismiss the hypothesis of a spherical universe as in any degree less worthy of examination than any other. We have to ask ourselves the same two questions as we should in any other case, namely: (1) Are the facts consistent with this hypothesis? (2) Is this hypothesis the only one with which the facts are consistent?

It’s important to understand, first of all, that there’s absolutely no reason à priori why this shouldn’t be the case. People who aren’t familiar with non-Euclidean geometry might feel that, even if such a thing is logically possible, the world just cannot be that strange. We all tend to believe that the world must fit our biases. The opposite perspective requires some effort to think about, and most people would rather avoid thinking—actually, they do. However, the fact that a spherical universe seems strange to those raised with Euclidean assumptions doesn’t prove it’s impossible. There’s no law of nature that says what we learn in school has to be true. We can’t therefore dismiss the idea of a spherical universe as any less deserving of consideration than any other. We need to ask ourselves the same two questions we would in any other situation: (1) Are the facts consistent with this hypothesis? (2) Is this hypothesis the only one that aligns with the facts?

With regard to the first question, the answer is undoubtedly in the affirmative. All the known facts are perfectly consistent with the hypothesis of a spherical universe. A very slight modification of the law of gravitation—a modification suggested by Einstein himself—leads to a spherical space, without producing any measurable differences in a small region such as the solar system. The known stars are all within a certain distance from us. There is nothing whatever in the stellar universe as we know it to show that space must be infinite. There can therefore be no doubt whatever that, so far as our present knowledge goes, the hypothesis of a finite universe may be true.

Regarding the first question, the answer is definitely yes. All the facts we know align perfectly with the idea of a spherical universe. A very minor change to the law of gravity—one suggested by Einstein himself—results in a spherical space, without causing any noticeable differences in a small area like the solar system. The stars we know of are all within a certain distance from us. There’s nothing in the stellar universe as we currently understand it to indicate that space must be infinite. Therefore, there’s no doubt that, based on what we know right now, the idea of a finite universe may be true.

But when we ask whether the hypothesis of a finite universe [Pg 168] must be true, the answer is different. It is obvious, on general grounds, that we cannot, from what we know, draw conclusive inferences as to the totality of things. A very slight change in the Newtonian formula for gravitation would prevent masses beyond the limits of the visible universe from having appreciable effects if they existed, and would therefore destroy our reason for supposing that they do not exist. All arguments as to regions which are too distant to be observed depend upon extending to them the laws which hold in our part of the world, and upon assuming that there is not, in these laws, some inaccuracy which is inappreciable for observable distances, but fatal to inferences in which very much greater distances are involved. We cannot, therefore, say that the universe must be finite. We can say that it may be, and we can even say a little more than this. We can say that a finite universe fits in better with the laws that hold in the part we know, and that awkward adjustments of the laws have to be made in order to allow the universe to be infinite. From the point of view of choosing the best framework into which to fit what we know—best, I mean, from a logico-æsthetic point of view—there is no [Pg 169] doubt that the hypothesis of a finite universe is preferable. This, I think, is the extent of what can be said in its favor.

But when we ask whether the idea of a finite universe [Pg 168] must be true, the answer changes. It's clear, on a general level, that we can't draw definitive conclusions about the totality of things based on what we currently know. A small adjustment in the Newtonian formula for gravity could mean that masses beyond the visible universe wouldn't have any noticeable effects if they existed, which would eliminate our reason to assume they don't exist. All arguments about regions that are too far away to be observed rely on applying the laws that work in our part of the universe and on the assumption that there's no inaccuracy in these laws that is negligible for observable distances but critical for much greater distances. Therefore, we can't definitively state that the universe must be finite. We can say that it might be, and we can even say a bit more than that. We can say that a finite universe aligns better with the laws we understand, and that making the laws fit an infinite universe requires awkward adjustments. From the perspective of choosing the best framework for what we know—best in terms of logical and aesthetic considerations—there's no [Pg 169] doubt that the idea of a finite universe is more preferable. I think this is about the limit of what can be argued in its favor.

Let us now see what the two finite universes are like. The difference between them is that in Einstein’s world it is only space that is queer, whereas in De Sitter’s time is queer too. Consequently Einstein’s world is less puzzling, and we will describe it first.

Let’s now take a look at what the two finite universes are like. The difference between them is that in Einstein’s universe, only space is strange, whereas in De Sitter’s, time is also strange. As a result, Einstein’s universe is less confusing, so we’ll describe it first.

In Einstein’s world, light travels round the whole universe in a time which is supposed to be something like a thousand million years. The odd thing is that all the rays of light which start (say) from the sun will meet again, after their enormous journey, in the place where the sun was when they started. The case is exactly analogous to that of a number of travelers who set out from London to go round the world in great circles, all traveling at the same rate in different aeroplanes. One starts due north, passes the North Pole, then the South Pole, and finally comes home. Another starts due south, reaches the South Pole first and then the North Pole. Another starts westward, but he must not continue to travel due west, because then he would not be traveling on [Pg 170] a great circle. Another starts eastward, and so on. They all meet in the antipodes of London, and then they all meet again in London. Now if instead of aeronauts going round the earth you take rays of light going round the universe, the same sort of thing happens: they all meet first at the antipodes of their starting point, and then meet again at their starting point. That means to say that a person who is near the antipodes of the place where the sun was about five hundred million years ago will see what is apparently a body as bright as the sun then was (except for the small amount of light that has been stopped on the way by opaque bodies), and having the same shape and size. And a person who is near where the sun was a thousand million years ago will see what is apparently a body just like what the sun was a thousand million years ago. And the same applies to the antipodes of the sun fifteen hundred million years ago, and to the place of the sun two thousand million years ago, and so on. This series only ends when it carries us back to a time before the sun existed.

In Einstein’s universe, light travels around the entire universe in a time estimated to be around a billion years. The strange part is that all the rays of light that leave (let's say) the sun will eventually converge again at the point where the sun was when they began their journey. This is similar to a group of travelers leaving London to circle the globe along great circles, all flying at the same speed in different airplanes. One traveler heads due north, reaches the North Pole, then the South Pole, and finally returns home. Another heads due south, visits the South Pole first and then the North Pole. One starts westward, but they can’t keep going directly west, or they wouldn't be traveling along a great circle. Another goes eastward, and so on. They all meet at the antipodes of London, and then they all come back together in London. Now, if you replace the flyers circling the Earth with rays of light traveling around the universe, the same thing happens: they first meet at the antipodes of their starting point and then meet again at their starting point. This means that someone located near the antipodes of where the sun was about five hundred million years ago will see what appears to be a body as bright as the sun was then (minus the small amount of light absorbed by obstacles along the way), with the same shape and size. Similarly, a person near where the sun was a billion years ago will see what looks like the sun from a billion years back. This continues for the antipodes of the sun one and a half billion years ago, and two billion years ago, and so on. This sequence only ends when it goes back to a time before the sun even existed.

But all these suns are only ghosts; that is to say, you could pass through them without experiencing resistance, and they do not exert [Pg 171] gravitation. They are, in fact, like images in a mirror: they exist only for the sense of sight, not for any other sense. It is rather disturbing to reflect that, if this theory is true, any number of the objects we see in the heavens may be merely ghosts. They are like ghosts in their habit of revisiting the scenes of their past life. Suppose a star had exploded at a certain place, as stars sometimes will. Every thousand million years its ghost would return to the scene of the disaster and explode again in the same place. There is, however, considerable doubt whether rays of light could perform the journey with sufficient accuracy to produce a clear image. Some would be stopped by matter on the way, some would be turned out of the straight course by passing near heavy bodies, as in the eclipse observations described in Chapter IX, and for one reason or another their return would not be punctual and exact.

But all these suns are just illusions; in other words, you could move through them without feeling any resistance, and they don't have any gravitational pull. They are essentially like images in a mirror: they exist only for our sight, not for any other sense. It's somewhat unsettling to think that, if this theory is correct, many of the things we see in the sky could be nothing but illusions. They're like spirits that keep coming back to the places they used to be. Imagine a star exploded in a specific spot, as stars sometimes do. Every billion years, its ghost would come back to that disaster site and explode there again. However, there's a fair bit of skepticism about whether light rays could travel that distance accurately enough to create a clear image. Some would be blocked by matter along the way, while others might be diverted from their path by passing near massive objects, as seen in the eclipse observations mentioned in Chapter IX, causing their return to be neither timely nor precise. [Pg 171]

There are various reasons for doubting whether Einstein’s universe can be quite right.[12] Some of these are rather complicated. But there is one objection which is easily appreciated: in Einstein’s theory, [Pg 172] absolute space and time re-enter by another door. The ghostly sun is formed in the “place” where it was a thousand million years ago. Both the “place” and the period of time are in a sense absolute. We saw as early as Chapter I that “place” is a vague and popular notion, incapable of scientific precision. It seems hardly worth while to go through such a vast intellectual labor if the errors we set out to correct are to reappear at the end.

There are several reasons to question whether Einstein's universe is completely accurate.[12] Some of these are quite complex. However, there is one objection that's easy to understand: in Einstein’s theory, [Pg 172] absolute space and time come back in a different way. The distant sun is located in the “place” it occupied a billion years ago. Both the “place” and the time period are, in a way, absolute. We saw in Chapter I that “place” is a vague and commonly used term that lacks scientific precision. It seems hardly worth the extensive intellectual effort if the mistakes we aimed to fix are just going to come back at the end.

De Sitter’s world is even odder than Einstein’s, because time goes mad as well as space. I despair of explaining, in non-mathematical language, the particular form of lunacy with which time is afflicted, but some of its manifestations can be described. An observer in this world, if he observes a number of clocks, each of which is perfectly accurate from its own point of view, will think that distant clocks are going slow as compared with those in his neighborhood. They will seem to go slower and slower, until, at a distance of one quarter of the circumference of the universe, they will seem to have stopped altogether. That region will seem to our observer a sort of lotus [Pg 173] land, where nothing is ever done. He will not be able to have any cognizance of things farther off, because no light waves can get across the boundary. Not that there is any real boundary: the people who live in what our observer takes to be lotus land live just as bustling a life as he does, but get the impression that he is eternally standing still. As a matter of fact, you would never become aware of the lotus land, because it would take an infinite time for light to travel from it to you. You could become aware of places just short of it, but it would remain itself always just beyond your ken. There will not be the ghostly suns of Einstein’s world, because light cannot travel so far.

De Sitter’s world is even stranger than Einstein’s, because time goes wonky along with space. I struggle to explain, in simple terms, the specific kind of craziness that time experiences, but I can describe some of its effects. An observer in this world, if they check out several clocks, each perfectly accurate from its own perspective, will think that distant clocks are ticking slower compared to those nearby. They will seem to slow down more and more, until, at a distance equal to one quarter of the universe's circumference, they will appear to have completely stopped. That area will seem to our observer like a sort of lotus land, where nothing ever happens. They won’t be able to perceive anything further away, because no light waves can cross that boundary. Not that there’s an actual boundary: the people living in what our observer sees as lotus land are living just as busy a life as he is, but they get the impression that he is forever standing still. In reality, you would never be aware of lotus land, because it would take infinite time for light to travel from there to you. You could become aware of places just short of it, but it would always remain just out of reach. There won't be the ghostly suns of Einstein’s world, because light can't travel that far.

One of the oddest things about this state of affairs is that empirical evidence for or against it is possible, and that there is actually some slight evidence in its favor. If all “clocks” are slowed down at a great distance from the observer, this will apply to the periodic motions of atoms, and therefore to the light which they emit. Consequently all rays of light emitted by distant objects ought, when they reach us, to look rather more red or less violet than when they started. This can be tested by the spectroscope. We can compare a [Pg 174] known line, as it appears in the spectrum of a spiral nebula, with the same line as it appears in a terrestrial laboratory. We find, as a matter of fact, that in a large majority of spiral nebulæ there is a considerable displacement of spectral lines towards the red. The spiral nebulæ are the most distant objects we can see: Eddington states that their distances “may perhaps be of the order of a million light-years.” (A light-year is the distance light travels in a year.) The usual interpretation of a shifting of spectral lines towards the red is that it is a “Doppler effect,” due to the fact that the source of light is moving away from us. But one would expect to find the nebulæ just as often moving towards us as moving away from us, if nothing operated but the law of chances. If the world is such as De Sitter says it is, the spectral lines of the spiral nebulæ will be displaced towards the red owing to the slowing down of distant clocks, even if in fact they are not moving away from us. This, for what it is worth, is an argument in favor of De Sitter.

One of the strangest things about this situation is that it's possible to find evidence for or against it, and there is actually some slight evidence supporting it. If all "clocks" are slowed down at a great distance from the observer, this will also apply to the periodic motions of atoms, and therefore to the light they emit. As a result, all rays of light emitted by distant objects should, when they reach us, appear a bit more red or less violet than when they started. This can be tested using a spectroscope. We can compare a [Pg 174] known line, as it appears in the spectrum of a spiral nebula, with the same line as it appears in a lab on Earth. We find that in a large majority of spiral nebulae, there is a significant shift of spectral lines towards the red. The spiral nebulae are the most distant objects we can see: Eddington states that their distances "may perhaps be of the order of a million light-years." (A light-year is the distance light travels in a year.) The common interpretation of a shift in spectral lines towards red is the "Doppler effect," meaning that the light source is moving away from us. However, one would expect to find the nebulae moving towards us just as often as moving away, if only chance played a role. If the universe is as De Sitter describes, the spectral lines of the spiral nebulae will be shifted towards red due to the slowing down of distant clocks, even if they are not actually moving away from us. This, for what it's worth, is an argument in favor of De Sitter.

The same facts afford another argument in favor of De Sitter, for another reason. If, at a given moment, a body is at rest relatively to [Pg 175] the observer, and at a distance from him, it will (in the absence of counteracting causes) not remain at rest from his point of view, but will begin to move away from him, and will continue to move away faster and faster; the further it is from him, the more its retreat will be accelerated. For bodies which are not too distant from each other, gravitation may overcome this tendency; but as this tendency increases with the distance, while gravitation diminishes, we should expect to find very distant bodies receding from us if De Sitter’s theory is right. Thus we have two reasons for the displacement of spectral lines in spiral nebulæ: one, the slowing down of time; the other, the movement away from us which we should expect at distances too great for gravitation to be sensible. However, it cannot be said that the argument, on either ground, is very strong. Eddington gives a list of forty-one spiral nebulæ, of which five have their spectral lines shifted towards the violet, not towards the red. Thus the material is neither very copious nor quite harmonious.

The same facts provide another argument in support of De Sitter for a different reason. If, at a certain moment, an object is at rest relative to the observer and at a distance from him, it will (assuming no opposing forces) not stay at rest from his perspective, but will start to move away from him, and will continue to move away faster and faster; the farther it is from him, the more its retreat will accelerate. For objects that are not too far apart, gravity may counteract this tendency; however, as this tendency increases with distance while gravity decreases, we should expect to see very distant objects moving away from us if De Sitter’s theory is correct. So we have two reasons for the shift of spectral lines in spiral nebulae: one, the slowing down of time; the other, the movement away from us that we would expect at distances where gravity is negligible. Yet, it can't be said that either argument is very strong. Eddington lists forty-one spiral nebulae, of which five have their spectral lines shifted toward the violet, not toward the red. Thus, the evidence is neither very abundant nor entirely consistent.

Einstein’s and De Sitter’s hypotheses do not exhaust the possibilities of a finite world: they are merely the two simplest forms of such a [Pg 176] world. There are arguments against each, and it hardly seems probable that either is quite true. But it does seem probable that something more or less analogous is true. If the universe is finite, it is theoretically conceivable that there should be a complete inventory of it. We may be coming to the end of what physics can do in the way of stretching the imagination and systematizing the world. The period since Galileo has been essentially the period of physics, as the age of the Greeks was the period of geometry. It may be that physics will lose its attractions through success: if the fundamental laws of physics come to be fully known, adventurous and inquiring intellects will turn to other fields. This may alter profoundly the whole texture of human life, since our present absorption in machinery and industrialism is the reflection in the practical world of the theorist’s interest in physical laws. But such speculations are even more rash than those of De Sitter, and I do not wish to lay any stress upon them.

Einstein’s and De Sitter’s theories don’t cover all the possibilities of a finite universe; they’re just the two simplest versions of such a universe. There are counterarguments to both, and it seems unlikely that either is completely accurate. However, it does seem likely that something roughly similar is true. If the universe is finite, it’s theoretically possible to have a complete inventory of it. We might be reaching the limits of what physics can achieve in terms of expanding our imagination and organizing our understanding of the world. Since Galileo, we’ve essentially been in the age of physics, much like how the Greeks were in the age of geometry. It’s possible that physics could become less appealing as we achieve more: if we fully understand the fundamental laws of physics, curious minds will shift their focus to other areas. This could radically change the fabric of human life, as our current fascination with machinery and industrialism reflects the theorist’s interest in physical laws. But such speculations are even bolder than those of De Sitter, and I don’t want to emphasize them. [Pg 176]


[Pg 177]

[Pg 177]

CHAPTER XII:
CONVENTIONS AND NATURAL LAWS

One of the most difficult matters in all controversy is to distinguish disputes about words from disputes about facts: it ought not to be difficult, but in practice it is. This is quite as true in physics as in other subjects. In the seventeenth century there was a terrific debate as to what “force” is; to us now, it was obviously a debate as to how the word “force” should be defined, but at the time it was thought to be much more. One of the purposes of the method of tensors, which is employed in the mathematics of relativity, is to eliminate what is purely verbal (in an extended sense) in physical laws. It is of course obvious that what depends on the choice of co-ordinates is “verbal” in the sense concerned. A man punting walks along the boat, but keeps a constant position with reference to the river bed so long as he does not pick up his pole. The Lilliputians might debate endlessly whether he is walking [Pg 178] or standing still: the debate would be as to words, not as to facts. If we choose co-ordinates fixed relatively to the boat, he is walking; if we choose co-ordinates fixed relatively to the river bed, he is standing still. We want to express physical laws in such a way that it shall be obvious when we are expressing the same law by reference to two different systems of co-ordinates, so that we shall not be misled into supposing we have different laws when we only have one law in different words. This is accomplished by the method of tensors. Some laws which seem plausible in one language cannot be translated into another; these are impossible as laws of nature. The laws that can be translated into any co-ordinate language have certain characteristics: this is a substantial help in looking for such laws of nature as the theory of relativity can admit to be possible. Combined with what we know of the actual motions of bodies, it enables us to decide what must be the correct expression of the law of gravitation: logic and experience combine in equal proportions in obtaining this expression.

One of the toughest issues in any debate is figuring out the difference between arguments about words and arguments about facts. It shouldn’t be hard, but in reality, it is. This holds true in physics just as much as in other fields. Back in the seventeenth century, there was a huge debate about what “force” really meant; to us now, it seems like a discussion about how to define the word “force,” but at the time, it was believed to be something much deeper. One goal of the tensor method used in the mathematics of relativity is to eliminate what is purely verbal (in a broader sense) in physical laws. It's clear that anything depending on the choice of coordinates is “verbal” in that context. A person punting moves along the boat but remains in the same position relative to the riverbed as long as he doesn't pick up his pole. The Lilliputians could argue forever about whether he’s walking or standing still: their debate would be about words, not facts. If we use coordinates fixed to the boat, he’s walking; if we use coordinates fixed to the riverbed, he’s standing still. We want to frame physical laws in a way that makes it clear when we’re expressing the same law through two different coordinate systems, so we won’t be misled into thinking we have different laws when we’re just using different terms for one law. The tensor method achieves this. Some laws that make sense in one language can’t be translated into another; these can’t be considered laws of nature. The laws that can be translated into any coordinate language have certain characteristics: this greatly assists us in searching for the natural laws that the theory of relativity allows to be possible. Combined with what we know about the actual motions of bodies, it helps us determine what the correct expression of the law of gravitation should be: logic and experience work together equally to arrive at this expression.

But the problem of arriving at genuine laws of nature is not to be solved by the method of tensors alone; a good, deal of careful thought [Pg 179] is wanted in addition. Some of this has been done, especially by Eddington; much remains to be done.

But figuring out true laws of nature can't be solved by just using tensors; it requires a lot of careful thinking too. Some of this has been accomplished, especially by Eddington; but there's still a lot left to do. [Pg 179]

To take a simple illustration: Suppose, as in the hypothesis of the Fitzgerald contraction, that lengths in one direction were shorter than in another. Let us assume that a foot rule pointing north is only half as long as the same foot rule pointing east, and that this is equally true of all other bodies. Does such an hypothesis have any meaning? If you have a fishing rod fifteen feet long when it is pointing west, and you then turn it to the north, it will still measure fifteen feet, because your foot rule will have shrunk too. It won’t “look” any shorter, because your eye will have been affected in the same way. If you are to find out the change, it cannot be by ordinary measurement; it must be by some such method as the Michelson-Morley experiment, in which the velocity of light is used to measure lengths. Then you still have to decide whether it is simpler to suppose a change of length or a change in the velocity of light. The experimental fact would be that light takes longer to traverse what your foot rule declares to [Pg 180] be a given distance in one direction than in another—or, as in the Michelson-Morley experiment, that it ought to take longer but doesn’t. You can adjust your measures to such a fact in various ways; in any way you choose to adopt, there will be an element of convention. This element of convention survives in the laws that you arrive at after you have made your decision as to measures, and often it takes subtle and elusive forms. To eliminate the element of convention is, in fact, extraordinarily difficult; the more the subject is studied, the greater the difficulty is seen to be.

To give a simple example: Imagine, similar to the idea of the Fitzgerald contraction, that measurements in one direction are shorter than in another. Let's say a measuring stick pointing north is only half as long as the same stick pointing east, and this holds true for all other objects as well. Does such an idea even make sense? If you have a fishing rod that's fifteen feet long when it’s facing west, and then you turn it north, it will still measure fifteen feet because your measuring stick has shrunk too. It won’t “look” any shorter because your eye will perceive it the same way. To determine the change, you can’t rely on regular measurement; it has to be done using something like the Michelson-Morley experiment, which uses the speed of light to measure lengths. Then you need to decide whether it makes more sense to think of a change in length or a change in the speed of light. The experimental finding would be that light takes longer to cover what your measuring stick states is a certain distance in one direction compared to another—or, as shown in the Michelson-Morley experiment, it should take longer but doesn’t. You can adjust your measurements in different ways; whichever method you choose, there will be a level of convention involved. This element of convention persists in the laws you establish after deciding on your measurements, and it often takes subtle and elusive forms. Removing the element of convention is actually incredibly challenging; the deeper you study the topic, the more evident the difficulty becomes.

A more important example is the question of the size and shape of the electron. We find experimentally that all electrons are the same size, and that they are symmetrical in all directions. How far is this a genuine fact ascertained by experiment, and how far is it a result of our conventions of measurement? We have here a number of different comparisons to make: (1) between different directions in regard to one electron at one time; (2) in regard to one electron at different times; (3) in regard to two electrons at the same time. We can then arrive at the comparison of two electrons at different times, by combining [Pg 181] (2) and (3). We may dismiss any hypothesis which would affect all electrons equally; for example, it would be useless to suppose that in one region of space-time they were all larger than in another. Such a change would affect our measuring appliances just as much as the things measured, and would therefore produce no discoverable phenomena. This is as much as to say that it would be no change at all. But the fact that two electrons have the same mass, for instance, cannot be regarded as purely conventional. Given sufficient minuteness and accuracy, we could compare the effects of two different electrons upon a third; if they were equal under like circumstances, we should be able to infer equality in a not purely conventional sense. The question of the symmetry of the forces exerted by an electron—i.e., that these forces depend only upon the distance from the electron, and not upon the direction—is more complicated. Eddington finally comes to the conclusion that this, too, is a matter of convention. The argument is difficult and I have not fully understood it; but I feel some hesitation in accepting it as valid.

A more important example is the question of the size and shape of the electron. We find from experiments that all electrons are the same size and symmetrical in all directions. How much of this is a genuine fact determined by experiment, and how much is just a result of how we measure things? We have several comparisons to consider: (1) different directions for one electron at a time; (2) one electron at different times; (3) two electrons at the same time. We can then make the comparison of two electrons at different times by combining (2) and (3). We can rule out any hypothesis that would affect all electrons equally; for instance, it would be pointless to suggest that they were all larger in one area of space-time than in another. Such a change would impact our measuring devices just as much as the objects being measured, resulting in no noticeable phenomena. This means there would be no change at all. However, the fact that two electrons have the same mass, for example, can't be seen as purely conventional. With enough precision and accuracy, we could compare how two different electrons affect a third one; if they were equal under the same conditions, we could conclude equality in a way that isn't purely conventional. The question of the symmetry of the forces from an electron—meaning these forces depend only on the distance from the electron and not on the direction—is more complex. Eddington ultimately concludes that this is also a matter of convention. The argument is challenging, and I don't fully understand it, but I have some hesitation in accepting it as valid.

Eddington describes the process concerned in the more advanced portions of the theory of relativity as “world-building.” The structure to be [Pg 182] built is the physical world as we know it; the economical architect tries to construct it with the smallest possible amount of material. This is a question for logic and mathematics. The greater our technical skill in these two subjects, the more real building we shall do, and the less we shall be content with mere heaps of stones. But before we can use in our building the stones that nature provides, we have to hew them into the right shapes: this is all part of the process of budding. In order that this may be possible, the raw material must have some structure (which we may conceive as analogous to the grain in timber), but almost any structure will do. By successive mathematical refinements, we whittle away our initial requirements until they amount to very little. Given this necessary minimum of structure in the raw material, we find that we can construct from it a mathematical expression which will have the properties that are needed for describing the world we perceive—in particular, the properties of conservation which are characteristic of momentum and energy (or mass). Our raw material consisted merely of events; but when we find [Pg 183] that we can build out of it something which, as measured, will seem to be never created or destroyed, it seems not surprising that we should come to believe in “bodies.” These are really mere mathematical constructions out of events, but owing to their permanence they are practically important, and our senses (which were presumably developed by biological needs) are adapted for noticing them, rather than the crude continuum of events which is theoretically more fundamental. From this point of view, it is astonishing how little of the real world is revealed by physical science: our knowledge is limited, not only by the conventional element, but also by the selectiveness of our perceptual apparatus.

Eddington refers to the advanced aspects of the theory of relativity as “world-building.” The goal is to create the physical world as we understand it; the efficient architect aims to do this using the least amount of material possible. This is largely a matter of logic and mathematics. The more skilled we become in these areas, the more substantial our constructions will be, and the less we’ll settle for just piles of stones. However, before we can use the natural stones available to us, we need to shape them correctly: this is all part of the building process. For this to work, the raw material must have some structure (which we can think of as similar to the grain in wood), but almost any structure will suffice. Through a series of mathematical refinements, we reduce our initial needs until they become minimal. With this essential minimum of structure in our raw material, we are able to create a mathematical expression that possesses the necessary properties for describing the world we observe—specifically, the conservation properties integral to momentum and energy (or mass). Our raw material comprised only events; yet when we discover we can construct something from it that, when measured, appears to be neither created nor destroyed, it’s not surprising we come to believe in “bodies.” These are essentially just mathematical constructs derived from events, but due to their stability, they hold practical significance, and our senses (which likely evolved from biological needs) are more attuned to noticing them rather than the raw continuum of events that is theoretically more fundamental. From this perspective, it’s remarkable how little of the actual world is revealed by physical science: our understanding is constrained not only by conventional factors but also by the limits of our perceptual abilities.

We assume that there is an “interval” between two events, in the sense explained in Chapter VII, but we no longer assume that we can unambiguously compare the length of an interval in one region with the length of an interval in another. It is assumed by Weyl, who introduced this limitation, that we can compare a number of small intervals which all start from the same point; also that, in a very small journey, our measuring rod will not alter its length much, so that there will [Pg 184] only be a small error if we compare lengths in neighboring places by the usual methods. Weyl found that, by diminishing our assumptions as to interval in this way, it was possible to bring electromagnetism and gravitation into one system. The mathematics of Weyl’s theory is complicated, and I shall not attempt to explain it. For the present, I am concerned with a different consequence of his theory. If lengths in different regions cannot be compared directly, there is an element of convention in the indirect comparisons which we actually make. This element will be at first unrecognized, but will be such as to simplify to the utmost the expression of the laws of nature. In particular, conditions of symmetry may be entirely created by conventions as to measurement, and there is no reason to suppose that they represent any property of the real world. The law of gravitation itself, according to Eddington, may be regarded as expressing conventions of measurement. “The conventions of measurement,” he says, “introduce an isotropy[13] and homogeneity into measured space which need not originally have any counterpart in the relation-structure which is being surveyed. This isotropy and homogeneity is exactly expressed by Einstein’s law of gravitation.”[14]

We assume that there is an “interval” between two events, as explained in Chapter VII, but we no longer believe that we can clearly compare the length of an interval in one area with the length of an interval in another. Weyl, who first introduced this limitation, assumes that we can compare several small intervals that all start from the same point; he also assumes that during a very short journey, our measuring tool won’t change its length significantly, so there will be only a small error when we compare lengths in nearby locations using standard methods. Weyl discovered that by reducing our assumptions about intervals in this way, it became possible to combine electromagnetism and gravitation into a unified system. The math behind Weyl’s theory is complex, and I won’t try to explain it. For now, I want to focus on another implication of his theory. If we can’t directly compare lengths in different areas, there’s an element of convention in the indirect comparisons we actually make. This element may initially go unrecognized, but it will simplify the expression of the laws of nature as much as possible. Specifically, conditions of symmetry might be entirely created by the conventions we use for measurement, and there's no reason to think they reflect any aspect of the actual world. According to Eddington, the law of gravitation itself can be seen as representing measurement conventions. “The conventions of measurement,” he states, “introduce an isotropy[13] and homogeneity into measured space that need not originally have any counterpart in the relation structure being surveyed. This isotropy and homogeneity is precisely expressed by Einstein’s law of gravitation.”[14]

[Pg 185] The limitations of knowledge introduced by the selectiveness of our perceptual apparatus may be illustrated by the indestructibility of matter. This has been gradually discovered by experiment, and seemed a well-founded empirical law of nature. Now it turns out that, from our original space-time continuum, we can construct a mathematical expression which will have properties causing it to appear indestructible. The statement that matter is indestructible then ceases to be a proposition of physics, and becomes instead a proposition of linguistics and psychology. As a proposition of linguistics: “Matter” is the name of the mathematical expression in question. As a proposition of psychology: Our senses are such that we notice what is roughly the mathematical expression in question, and we are led nearer and nearer to it as we refine upon our crude perceptions by scientific observation. This is much less than physicists used to think they knew about matter.

[Pg 185] The limits of knowledge created by the selectivity of our perception can be illustrated by the indestructibility of matter. This concept has been gradually revealed through experimentation and seemed like a solid empirical law of nature. Now, it appears that we can create a mathematical expression from our original space-time continuum that has properties making it seem indestructible. The assertion that matter is indestructible shifts from a statement of physics to one of linguistics and psychology. As a linguistic proposition: “Matter” refers to the mathematical expression in question. As a psychological proposition: Our senses are such that we recognize what is essentially the mathematical expression in question, and we get closer to it as we refine our basic perceptions through scientific observation. This understanding is far less than what physicists once believed they knew about matter.

The reader may say: What then is left of physics? What do we really [Pg 186] know about the world of matter? Here we may distinguish three departments of physics. There is first what is included within the theory of relativity, generalized as widely as possible. Next, there are laws which cannot be brought within the scope of relativity. Thirdly, there is what may be called geography. Let us consider each of these in turn.

The reader might ask: So, what’s left of physics? What do we actually know about the world of matter? Here, we can identify three areas of physics. First, there's everything that falls under the theory of relativity, as broadly defined as possible. Next, there are laws that don’t fit into the framework of relativity. Third, there’s what we can call geography. Let’s look at each of these in detail.

The theory of relativity, apart from convention, tells us that the events in the universe have a four-dimensional order, and that, between any two events which are near together in this order, there is a relation called “interval,” which is capable of being measured if suitable precautions are taken. We make also an assumption as to what happens when a little measuring rod is carried round a closed circuit in a certain manner; the consequences of this assumption are such as to make it highly probable that it is true. Beyond this, there is little in the theory of relativity that can be regarded as physical laws. There is a great deal of mathematics, showing that certain mathematically-constructed quantities must behave like the things we perceive; and there is a suggestion of a bridge between psychology and [Pg 187] physics in the theory that these mathematically-constructed quantities are what our senses are adapted for perceiving. But neither of these things is physics in the strict sense.

The theory of relativity, aside from established norms, explains that events in the universe have a four-dimensional structure, and that between any two events that are close together in this structure, there is a relationship called “interval,” which can be measured if the right precautions are taken. We also assume what happens when a small measuring rod is moved around a closed loop in a specific way; the implications of this assumption make it very likely to be true. Beyond this, there isn't much in the theory of relativity that can be seen as physical laws. There's a lot of mathematics that demonstrates that certain mathematically-derived quantities must behave like the things we experience; and there's an idea suggesting a connection between psychology and [Pg 187] physics in the idea that these mathematically-derived quantities are what our senses are designed to perceive. But neither of these aspects qualifies as physics in the strict sense.

The part of physics which cannot, at present, be brought within the scope of relativity is large and important. There is nothing in relativity to show why there should be electrons and protons; relativity cannot give any reason why matter should exist in little lumps. With this goes the whole theory of the structure of the atom. The theory of quanta also is quite outside the scope of relativity. Relativity is, in a sense, the most extreme application of what may be called next-to-next methods. Gravitation is no longer regarded as due to the effect of the sun upon a planet, but as expressing characteristics of the region in which the planet happens to be. Distance, which used to be thought to have a definite meaning however far apart two points might be, is now only definite for neighboring points. The distance between widely separated places depends upon the route chosen. We may, it is true, define the distance as the geodesic distance, but that can only be estimated by adding up little [Pg 188] bits, that is to say, by the method we use in estimating the length of a curve. What applies to distance applies equally to the straight line. There is nothing in the actual world having exactly the properties that straight lines were supposed to have; the nearest approach is the track of a light ray. Straight lines have to be replaced by geodesics, which are defined by what they do at each point, not all at once, like Euclidean straight lines. Measurement, in Weyl’s theory, suffers the same fate. We can only use a measuring rod to give lengths in one place: when we move it to another region, there is no knowing how it will alter. We do assume, however, that, if it alters, it alters bit by bit, gradually, continuously, and not by sudden jumps. Perhaps this assumption is unjustified. It belongs to the general outlook of relativity, which is that of continuity. No doubt it is owing to this outlook that relativity is unable to account for the discontinuities in physics, such as quanta, electrons and protons. Perhaps relativity will conquer these domains when it learns to dispense with the assumption of continuity.

The part of physics that can't currently be explained by relativity is both extensive and significant. Relativity offers no explanation for the existence of electrons and protons; it doesn't explain why matter exists in small chunks. This also encompasses the entire theory of atomic structure. The theory of quanta is similarly outside the realm of relativity. In a way, relativity represents the most extreme application of what might be called next-to-next methods. Gravitation is no longer seen as the result of the sun's influence on a planet, but rather as reflecting characteristics of the area where the planet is located. Distance, which used to have a clear definition regardless of how far apart two points were, is now only definite for nearby points. The distance between far-apart locations depends on the path taken. We can define the distance as the geodesic distance, but that can only be calculated by summing up small parts, just like we do when measuring the length of a curve. The same principle applies to straight lines. There’s nothing in the real world that matches the exact properties that straight lines were thought to possess; the closest example is the path of a light ray. Straight lines have to be replaced by geodesics, which are defined by their behavior at each point rather than all at once, like Euclidean straight lines. Measurement in Weyl’s theory faces the same issue. We can only use a ruler to measure lengths in one spot: when we move it to another area, we can't predict how it will change. However, we do assume that if it does change, it does so gradually and continuously, not in sudden jumps. Perhaps this assumption is unwarranted. It aligns with the general perspective of relativity, which emphasizes continuity. This viewpoint is likely why relativity struggles to account for discontinuities in physics, such as quanta, electrons, and protons. It’s possible that relativity will eventually address these areas once it learns to let go of the continuity assumption.

Finally we come to geography, in which I include history. The separation of history from geography rests upon the separation of time [Pg 189] from space; when we amalgamate the two in space-time, we need one word to describe the combination of geography and history. For the sake of simplicity, I shall use the one word geography in this extended sense.

Finally, we arrive at geography, which also includes history. The distinction between history and geography is based on the division of time from space; when we combine the two in space-time, we need a single term to describe the blend of geography and history. To keep things simple, I will use the term geography in this broader sense. [Pg 189]

Geography, in this sense, includes everything that, as a matter of crude fact, distinguishes one part of space-time from another. One part is occupied by the sun, one by the earth; the intermediate regions contain light waves, but no matter (apart from a very little here and there). There is a certain degree of theoretical connection between different geographical facts; to establish this is the purpose of physical laws. It is thought that a sufficient knowledge of the geographical facts of the solar system throughout any finite time, however short, would enable an ideally competent physicist to predict the future of the solar system so long as it remained remote from other stars. We are already in a position to calculate the large facts about the solar system backwards and forwards for vast periods of time. But in all such calculations we need a basis of crude fact. The facts are interconnected, but facts can only be inferred from other facts, not [Pg 190] from general laws alone. Thus the facts of geography have a certain independent status in physics. No amount of physical laws will enable us to infer a physical fact unless we know other facts as data for our inference. And here when I speak of “facts” I am thinking of particular facts of geography, in the extended sense in which I am using the term.

Geography, in this context, includes everything that distinguishes one part of space-time from another. One area is occupied by the sun, another by the earth; the space in between holds light waves but little matter (except for a tiny bit here and there). There's a theoretical connection between different geographical facts; the goal of physical laws is to establish this connection. It's believed that having enough knowledge about the geographical facts of the solar system over any finite period, no matter how brief, would allow an ideally skilled physicist to predict the solar system’s future as long as it stays far from other stars. We can currently calculate the major facts of the solar system both backward and forward over long periods. However, all these calculations require a basis of crude facts. The facts are interconnected, but facts can only be inferred from other facts, not from general laws alone. Therefore, the facts of geography hold a certain independent status within physics. No amount of physical laws will allow us to deduce a physical fact unless we have other facts to use as data for our deduction. And here, when I mention “facts,” I am referring to specific geographical facts in the broad sense in which I'm using the term.

In the theory of relativity, we are concerned with structure, not with the material of which the structure is composed. In geography, on the other hand, the material is relevant. If there is to be any difference between one place and another, there must either be differences between the material in one place and that in another, or places where there is material and places where there is none. The former of these alternatives seems the more satisfactory. We might try to say: There are electrons and protons, and the rest is empty. But in the “empty” regions there are light waves, so that we cannot say nothing happens in them. Some people maintain that the light waves take place in the ether, others are content to say simply that they take place; but in any case events are occurring where there are light waves. That is all that we can really say for the places where [Pg 191] there is matter, since matter has turned out to be a mathematical construction built out of events. We may say, therefore, that there are events everywhere in space-time, but they must be of a somewhat different kind according as we are dealing with a region where there is an electron or proton or with the sort of region we should ordinarily call empty. But as to the intrinsic nature of these events we can know nothing, except when they happen to be events in our own lives. Our own perceptions and feelings must be part of the crude material of events which physics arranges into a pattern—or rather, which physics finds to be arranged in a pattern. As regards events which do not form part of our own lives, physics tells us the pattern of them, but is quite unable to tell us what they are like in themselves. Nor does it seem possible that this should be discovered by any other method.

In the theory of relativity, we're focused on structure, not on what the structure is made of. In geography, however, the material matters. If there’s going to be any difference between one place and another, there has to be a difference in the material at one location compared to another, or some places must have material while others do not. The first option seems more satisfactory. We might say: There are electrons and protons, and the rest is empty. But in the “empty” areas, there are light waves, so we can't say that nothing happens there. Some argue that light waves exist in an ether, while others simply say they exist; in any case, events are happening where there are light waves. That’s about all we can really say for places with matter since matter has turned out to be a mathematical construct built from events. So, we can say that there are events everywhere in space-time, but they must be of a slightly different nature depending on whether we're talking about a region with an electron or proton or what we typically consider empty space. However, regarding the intrinsic nature of these events, we can’t know anything, except when they are part of our own lives. Our perceptions and feelings must be part of the raw material of events that physics organizes into a pattern—or rather, which physics finds to be organized in a pattern. When it comes to events that are separate from our lives, physics can describe the pattern they form, but it cannot tell us what they are like in themselves. It also doesn’t seem possible that any other method could uncover this.


[Pg 192]

[Pg 192]

CHAPTER XIII:
THE ABOLITION OF “FORCE”

In the Newtonian system, bodies under the action of no forces move in straight lines with uniform velocity; when bodies do not move in this way, their change of motion is ascribed to a “force.” Some forces seem intelligible to our imagination: those exerted by a rope or string, by bodies colliding, or by any kind of obvious pushing or pulling. As explained in an earlier chapter, our apparent imaginative understanding of these processes is quite fallacious; all that it really means is that past experience enables us to foresee more or less what is going to happen without the need of mathematical calculations. But the “forces” involved in gravitation and in the less familiar forms of electrical action do not seem in this way “natural” to our imagination. It seems odd that the earth can float in the void: the natural thing to suppose is that it must fall. That is why it has to be supported on an elephant, and the [Pg 193] elephant on a tortoise, according to some early speculators. The Newtonian theory, in addition to action at a distance, introduced two other imaginative novelties. The first was, that gravitation is not always and essentially directed what we should call “downwards,” i.e., towards the center of the earth. The second was, that a body going round and round in a circle with uniform velocity is not “moving uniformly” in the sense in which that phrase is applied to the motion of bodies under no forces, but is perpetually being turned out of the straight course towards the center of the circle, which requires a force pulling it in that direction. Hence Newton arrived at the view that the planets are attracted to the sun by a force, which is called gravitation.

In the Newtonian system, objects that experience no forces move in straight lines at a constant speed; when objects do not move this way, their change in motion is attributed to a “force.” Some forces are easy to understand: those exerted by a rope or string, by colliding objects, or by any obvious pushing or pulling. As mentioned in an earlier chapter, our seemingly intuitive grasp of these processes is actually misleading; it simply means that past experiences help us predict what will likely happen without needing mathematical calculations. However, the “forces” involved in gravity and in the less familiar types of electrical interactions don't feel “natural” to our intuition. It seems strange for the earth to float in empty space; we tend to think that it should fall. That’s why some early thinkers suggested it’s supported by an elephant, and the elephant is on a tortoise. The Newtonian theory introduced two other imaginative ideas alongside action at a distance. The first is that gravity isn’t always directed towards what we would call “down,” i.e. towards the center of the earth. The second is that an object moving in a circle at a constant speed isn’t “moving uniformly” in the same way as objects that aren’t acted on by forces; instead, it’s constantly being pulled away from a straight path towards the center of the circle, which requires a force to pull it in that direction. Therefore, Newton concluded that the planets are drawn to the sun by a force known as gravitation.

This whole point of view, as we have seen, is superseded by relativity. There are no longer such things as “straight lines” in the old geometrical sense. There are “straightest lines,” or geodesics, but these involve time as well as space. A light ray passing through the solar system does not describe the same orbit as a comet, from a geometrical point of view; nevertheless each moves in a geodesic. The whole imaginative picture is changed. A poet might say that water [Pg 194] runs down hill because it is attracted to the sea, but a physicist or an ordinary mortal would say that it moves as it does, at each point, because of the nature of the ground at that point, without regard to what lies ahead of it. Just as the sea does not cause the water to run towards it, so the sun does not cause the planets to move round it. The planets move round the sun because that is the easiest thing to do—in the technical sense of “least action.” It is the easiest thing to do because of the nature of the region in which they are, not because of an influence emanating from the sun.

This whole perspective, as we’ve seen, is replaced by relativity. There are no longer “straight lines” in the old geometrical sense. There are “straightest lines,” or geodesics, but these involve time as well as space. A light ray traveling through the solar system doesn’t follow the same path as a comet from a geometric standpoint; however, both move along a geodesic. The entire imaginative picture shifts. A poet might say that water runs downhill because it's drawn to the sea, but a physicist or an everyday person would say it moves the way it does at each point due to the characteristics of the ground at that point, without considering what’s ahead of it. Just as the sea doesn’t cause the water to flow toward it, the sun doesn’t cause the planets to orbit around it. The planets orbit the sun because that’s the easiest way to move—in the technical sense of “least action.” It’s the easiest option because of the nature of the space they’re in, not due to an influence coming from the sun.

The supposed necessity of attributing gravitation to a “force” attracting the planets towards the sun has arisen from the determination to preserve Euclidean geometry at all costs. If we suppose that our space is Euclidean, when in fact it is not, we shall have to call in physics to rectify the errors of our geometry. We shall find bodies not moving in what we insist upon regarding as straight lines, and we shall demand a cause for this behavior. Eddington has stated this matter with admirable lucidity. He supposes a physicist who has assumed the formula for interval which is used in the special [Pg 195] theory of relativity—a formula which still supposes that the observer’s space is Euclidean. He continues:

The supposed need to attribute gravity to a "force" pulling the planets towards the sun comes from the insistence on keeping Euclidean geometry at all costs. If we assume our space is Euclidean when it really isn’t, we’ll end up needing physics to fix the mistakes in our geometry. We will see objects not moving in what we insist on considering straight lines, and we will look for a reason for this behavior. Eddington has explained this very clearly. He imagines a physicist who has accepted the interval formula used in the special theory of relativity—a formula that still assumes the observer’s space is Euclidean. He goes on: [Pg 195]

Since intervals can be compared by experimental methods, he ought soon to discover that his (formula for the interval) cannot be reconciled with observational results, and so realize his mistake. But the mind does not so readily get rid of an obsession. It is more likely that our observer will continue in his opinion, and attribute the discrepancy of the observations to some influence which is present and affects the behavior of his test-bodies. He will, so to speak, introduce a supernatural agency which he can blame for the consequences of his mistake.... The name given to any agency which causes deviation from uniform motion in a straight line is force according to the Newtonian definition of force. Hence the agency invoked through our observer’s mistake is described as a “field of force.”... A field of force represents the discrepancy between the naturalgeometry of a co-ordinate system and the abstractgeometry arbitrarily ascribed to it.[15]

Since intervals can be compared through experimental methods, he should soon realize that his formula for the interval doesn’t match with observations, and therefore acknowledge his mistake. However, the mind doesn't easily let go of an obsession. It's more likely that our observer will stick to his belief and attribute the differences in observations to some influence that is present and affecting the behavior of his test subjects. In a way, he will introduce a supernatural force that he can blame for the outcomes of his error. The term used for any influence that causes deviation from uniform motion in a straight line is force, according to the Newtonian definition of force. Thus, the influence brought up by our observer’s error is described as a “field of force.”... A field of force represents the discrepancy between the natural geometry of a coordinate system and the abstract geometry arbitrarily assigned to it.[15]

If people were to learn to conceive the world in the new way, without the old notion of “force,” it would alter not only their physical imagination, but probably also their morals and politics. The latter [Pg 196] effect would be quite illogical, but is none the less probable on that account. In Newton’s theory of the solar system, the sun seems like a monarch whose behests the planets have to obey. In Einstein’s world there is more individualism and less government than in Newton’s. There is also far less hustle: we have seen that laziness is the fundamental law of Einstein’s universe. The word “dynamic” has come to mean, in newspaper language, “energetic and forceful”; but if it meant “illustrating the principles of dynamics,” it ought to be applied to the people in hot climates who sit under banana trees waiting for the fruit to drop into their mouths. I hope that journalists, in future, when they speak of a “dynamic personality,” will mean a person who does what is least trouble at the moment, without thinking of remote consequences. If I can contribute to this result, I shall not have written in vain.

If people could learn to view the world in a new way, without the outdated idea of “force,” it would change not only their physical imagination but likely their morals and politics as well. That latter effect might seem illogical, but it's still likely. In Newton’s theory of the solar system, the sun appears as a monarch that the planets have to obey. In Einstein’s universe, there’s more individuality and less control than in Newton’s. There’s also a lot less hustle: we’ve seen that laziness is the fundamental rule of Einstein’s universe. The term “dynamic” has come to mean “energetic and forceful” in the news, but if it truly meant “illustrating the principles of dynamics,” it should apply to people in warm climates who sit under banana trees waiting for the fruit to fall into their mouths. I hope that in the future, when journalists talk about a “dynamic personality,” they will mean someone who does what is easiest at the moment, without worrying about the long-term consequences. If I can help make this happen, I won’t have written in vain.

It has been customary for people to draw arguments from the laws of nature as to what we ought to do. Such arguments seem to me a mistake: to imitate nature may be merely slavish. But if nature, as portrayed by Einstein, is to be our model, it would seem that the anarchists will [Pg 197] have the best of the argument. The physical universe is orderly, not because there is a central government, but because every body minds its own business. No two particles of matter ever come into contact; when they get too close, they both move off. If a man were had up for knocking another man down, he would be scientifically correct in pleading that he had never touched him. What happened was that there was a hill in space-time in the region of the other man’s nose, and it fell down the hill.

It's common for people to base arguments on the laws of nature about what we should do. I think that's a mistake: mimicking nature can just be mindless imitation. However, if we consider nature as described by Einstein to be our guide, it looks like anarchists might have the upper hand in this debate. The physical universe is organized not because there’s a central authority, but because each entity takes care of its own affairs. No two particles of matter ever actually touch; when they get too close, they move away from each other. If someone were charged for knocking another person down, they’d be technically correct in arguing that they never physically made contact. What really happened was that there was a bump in space-time near the other person’s nose, and they rolled down that bump. [Pg 197]

The abolition of “force” seems to be connected with the substitution of sight for touch as the source of physical ideas, as explained in Chapter I. When an image in a looking glass moves, we do not think that something has pushed it. In places where there are two large mirrors opposite to each other, you may see innumerable reflections of the same object. Suppose a gentleman in a top-hat is standing between the mirrors, there may be twenty or thirty top-hats in the reflections. Suppose now somebody comes and knocks off the gentleman’s hat with a stick: all the other twenty or thirty top-hats will tumble down at the same moment. We think that a force is needed to knock off the “real” [Pg 198] top-hat, but we think the remaining twenty or thirty tumble off, so to speak, of themselves, or out of a mere passion for imitation. Let us try to think out this matter a little more seriously.

The removal of “force” seems to relate to replacing touch with sight as the source of physical ideas, as described in Chapter I. When an image in a mirror moves, we don't assume that something pushed it. In places with two large mirrors facing each other, you can see countless reflections of the same object. If a man in a top hat stands between the mirrors, you might see twenty or thirty top hats in the reflections. Now, if someone comes along and knocks off the man’s hat with a stick, all the other twenty or thirty top hats will fall at the same time. We believe a force is needed to knock off the “real” top hat, but we think the other twenty or thirty hats fall on their own, or out of a simple desire to imitate. Let’s consider this issue a bit more seriously. [Pg 198]

Obviously something happens when an image in a looking glass moves. From the point of view of sight, the event seems just as real as if it were not in a mirror. But nothing has happened from the point of view of touch or hearing. When the “real” top-hat falls, it makes a noise; the twenty or thirty reflections fall without a sound. If it falls on your toe, you feel it; but we believe that the twenty or thirty people in the mirrors feel nothing, though top-hats fall on their toes too. But all this is equally true of the astronomical world. It makes no noise, because sound cannot travel across a vacuum. So far as we know, it causes no “feelings,” because there is no one on the spot to “feel” it. The astronomical world, therefore, seems hardly more “real” or “solid” than the world in the looking glass, and has just as little need of “force” to make it move.

Obviously, something happens when an image in a mirror moves. From the perspective of sight, the event feels just as real as if it weren’t in a mirror. But nothing has changed from the perspective of touch or hearing. When the “real” top hat falls, it makes a noise; the twenty or thirty reflections fall silently. If it falls on your toe, you feel it; but we believe that the twenty or thirty people in the mirrors feel nothing, even though top hats fall on their toes too. But all of this is equally true for the astronomical world. It makes no sound because sound can’t travel through a vacuum. As far as we know, it causes no “feelings,” because there is no one around to “feel” it. The astronomical world, therefore, seems hardly more “real” or “solid” than the world in the mirror and needs just as little “force” to make it move.

The reader may feel that I am indulging in idle sophistry. “After all,” he may say, “the image in the mirror is the reflection of something [Pg 199] solid, and the top-hat in the mirror only falls off because of the force applied to the real top-hat. The top-hat in the mirror cannot indulge in behavior of its own; it has to copy the real one. This shows how different the image is from the sun and the planets, because they are not obliged to be perpetually imitating a prototype. So you had better give up pretending that an image is just as real as one of the heavenly bodies.”

The reader might think I'm just engaging in pointless arguments. "After all," he might say, "the image in the mirror reflects something solid, and the top hat in the mirror only falls off because of the force applied to the real top hat. The top hat in the mirror can’t act on its own; it has to mimic the real one. This highlights how different the image is from the sun and the planets, since they aren't bound to constantly imitate something else. So you might as well stop pretending that an image is just as real as one of the heavenly bodies." [Pg 199]

There is, of course, some truth in this; the point is to discover exactly what truth. In the first place, images are not “imaginary.” When you see an image, certain perfectly real light waves reach your eye; and if you hang a cloth over the mirror, these light waves cease to exist. There is, however, a purely optical difference between an “image” and a “real” thing. The optical difference is bound up with this question of imitation. When you hang a cloth over the mirror, it makes no difference to the “real” object; but when you move the “real” object away, the image vanishes also. This makes us say that the light rays which make the image are only reflected at the surface of the mirror, and do not really come from a point behind it, but from [Pg 200] the “real” object. We have here an example of a general principle of great importance. Most of the events in the world are not isolated occurrences, but members of groups of more or less similar events, which are such that each group is connected in an assignable manner with a certain small region of space-time. This is the case with the light rays which make us see both the object and its reflection in the mirror: they all emanate from the object as a center. If you put an opaque globe round the object at a certain distance, the object and its reflection are invisible at any point outside the globe. We have seen that gravitation, although no longer regarded as an action at a distance, is still connected with a center: there is, so to speak, a hill symmetrically arranged about its summit, and the summit is the place where we conceive the body to be which is connected with the gravitational field we are considering. For simplicity, common sense lumps together all the events which form one group in the above sense. When two people see the same object, two different events occur, but they are events belonging to one group and connected with the same center. Just the same applies when two people (as we say) hear the [Pg 201] same noise. And so the reflection in a mirror is less “real” than the object reflected, even from an optical point of view, because light rays do not spread in all directions from the place where the image seems to be, but only in directions in front of the mirror, and only so long as the object reflected remains in position. This illustrates the usefulness of grouping connected events about a center in the way we have been considering.

There’s definitely some truth to this; the key is to figure out exactly what that truth is. First off, images aren’t “imaginary.” When you see an image, real light waves hit your eye; and if you cover the mirror with a cloth, those light waves disappear. However, there is an optical difference between an “image” and something “real.” This optical difference is related to the idea of imitation. When you cover the mirror, it doesn't affect the “real” object; but when you move the “real” object away, the image disappears too. This leads us to say that the light rays that create the image are only reflected off the surface of the mirror, and don’t actually come from a point behind it, but from the “real” object. [Pg 200] Here we have an example of a general principle that’s very important. Most events in the world aren’t isolated incidents; they’re part of groups of similar happenings, with each group linked to a specific small area of space-time. This applies to the light rays that allow us to see both the object and its reflection in the mirror: they all radiate from the object as a center. If you place an opaque globe around the object at a certain distance, neither the object nor its reflection can be seen from outside the globe. We’ve observed that gravitation, even though it’s no longer viewed as an action at a distance, is still associated with a center: there’s, in a way, a hill symmetrically shaped around its peak, and the peak is where we imagine the body that’s linked to the gravitational field we’re considering. For simplicity, common sense groups all the related events in this way. When two people see the same object, two distinct events happen, but they are events that belong to the same group and are linked to the same center. The same goes for when two people (as we say) hear the [Pg 201] same noise. Thus, the reflection in a mirror is less “real” than the object being reflected, even from an optical standpoint, because light rays don’t spread in all directions from where the image appears to be, but only in front of the mirror, and only as long as the reflected object stays in place. This shows how useful it is to group related events around a center as we've been discussing.

When we examine the changes in such a group of objects, we find that they are of two kinds: there are those which affect only some member of the group, and those which make connected alterations in all the members of the group. If you put a candle in front of a mirror, and then hang black cloth over the mirror, you alter only the reflection of the candle as seen from various places. If you shut your eyes, you alter its appearance to you, but not its appearance elsewhere. If you put a red globe round it at a distance of a foot, you alter its appearance at any distance greater than a foot, but not at any distance less than a foot. In all these cases, you do not regard the candle itself as having changed; in fact, in all of them, you find that [Pg 202] there are groups of changes connected with a different center or with a number of different centers. When you shut your eyes, for instance, your eyes, not the candle, look different to any other observer: the center of the changes that occur is in your eyes. But when you blow out the candle, its appearance everywhere is changed; in this case you say that the change has happened to the candle. The changes that happen to an object are those that affect the whole group of events which center about the object. All this is only an interpretation of common sense, and an attempt to explain what we mean by saying that the image of the candle in the mirror is less “real” than the candle. There is no connected group of events situated all round the place where the image seems to be, and changes in the image center about the candle, not about a point behind the mirror. This gives a perfectly verifiable meaning to the statement that the image is “only” a reflection. And at the same time it enables us to regard the heavenly bodies, although we can only see and not touch them, as more “real” than an image in a looking glass.

When we look at the changes in a group of objects, we see that there are two types: those that only affect some members of the group and those that cause related changes in all the members. For example, if you put a candle in front of a mirror and then cover the mirror with black cloth, you only change how the candle's reflection appears from different angles. If you close your eyes, the candle looks different to you, but its appearance remains the same for everyone else. If you place a red globe around it a foot away, you change how it looks from any distance greater than a foot, but not from any distance less than a foot. In all these examples, you wouldn’t say the candle itself has changed; instead, you find that there are groups of changes linked to a different center or several different centers. For example, when you close your eyes, your eyes, not the candle, appear different to anyone else observing: the center of the changes is in your eyes. However, when you blow out the candle, its appearance changes everywhere; in this case, you say the change has happened to the candle. The changes that occur to an object are those that influence the whole group of events surrounding it. This is just a way to interpret common sense and to explain what we mean when we say the image of the candle in the mirror is less “real” than the candle itself. There is no connected group of events happening around where the image appears, and changes in the image are tied to the candle, not to a point behind the mirror. This gives a clear meaning to the idea that the image is “only” a reflection. At the same time, it allows us to consider celestial bodies, even though we can only see and not touch them, as more “real” than an image in a mirror.

We can now begin to interpret the common sense notion of one body [Pg 203] having an “effect” upon another, which we must do if we are really to understand what is meant by the abolition of “force.” Suppose you come into a dark room and switch on the electric light: the appearance of everything in the room is changed. Since everything in the room is visible because it reflects the electric light, this case is really analogous to that of the image in the mirror; the electric light is the center from which all the changes emanate. In this case, the “effect” is explained by what we have already said. The more important case is when the effect is a movement. Suppose you let loose a tiger in the middle of a Bank Holiday crowd: they would all move, and the tiger would be the center of their various movements. A person who could see the people but not the tiger would infer that there was something repulsive at that point. We say in this case that the tiger has an effect upon the people, and we might describe the tiger’s action upon them as of the nature of a repulsive force. We know, however, that they fly because of something which happens to them, not merely because the tiger is where he is. They fly because they can see and hear him, that is to say, because certain waves reach their eyes and [Pg 204] ears. If these waves could be made to reach them without there being any tiger, they would fly just as fast, because the neighborhood would seem to them just as unpleasant.

We can now start to understand the everyday idea of one body having an “effect” on another, which we need to clarify if we truly want to grasp what’s meant by the elimination of “force.” Imagine you walk into a dark room and turn on the electric light: everything in the room looks different. Since everything is visible because it reflects the electric light, this situation is similar to seeing an image in a mirror; the electric light is the source from which all the changes come. In this case, the “effect” is explained by what we’ve already discussed. A more significant example is when the effect is movement. Imagine you set a tiger loose in the middle of a crowded Bank Holiday; everyone would react and move, with the tiger being the focus of their movements. Someone who could see the crowd but not the tiger would assume there’s something alarming at that spot. We say that the tiger has an effect on the people, and we might describe the tiger’s influence on them as a kind of repulsive force. However, we know they flee because of something happening to them, not just because the tiger is present. They run away because they can see and hear him; in other words, because specific waves are reaching their eyes and ears. If those waves could reach them without any tiger present, they would flee just as quickly, because the area would still seem just as threatening.

Let us now apply similar considerations to the sun’s gravitation. The “force” exerted by the sun only differs from that exerted by the tiger in being attractive instead of repulsive. Instead of acting through waves of light or sound, the sun acquires its apparent power through the fact that there are modifications of space-time all round the sun. Like the noise of the tiger, they are more intense near their source; as we travel away they grow less and less. To say that the sun “causes” these modifications of space-time is to add nothing to our knowledge. What we know is that the modifications proceed according to a certain rule, and that they are grouped symmetrically about the sun as center. The language of cause and effect adds only a number of quite irrelevant imaginings, connected with will, muscular tension, and such matters. What we can more or less ascertain is merely the formula according to which space-time is modified by the presence of gravitating matter. [Pg 205] More correctly: we can ascertain what kind of space-time is the presence of gravitating matter. When space-time is not accurately Euclidean in a certain region, but has a non-Euclidean character which grows more and more marked as we approach a certain center, and when, further, the departure from Euclid obeys a certain law, we describe this state of affairs briefly by saying that there is gravitating matter at the center. But this is only a compendious account of what we know. What we know is about the places where the gravitating matter is not, not about the place where it is. The language of cause and effect (of which “force” is a particular case) is thus merely a convenient shorthand for certain purposes; it does not represent anything that is genuinely to be found in the physical world.

Let's now consider the sun's gravity. The "force" that the sun exerts is similar to that of a tiger, but it's attractive rather than repulsive. Instead of acting through waves of light or sound, the sun's power comes from the way it changes space-time around it. Just like the sound of a tiger, it's stronger near the source and weaker as you move away. Saying that the sun "causes" these changes in space-time doesn’t really add to our understanding. What we know is that these changes follow specific rules and are symmetrically arranged around the sun as the center. The idea of cause and effect only introduces a lot of irrelevant concepts related to will, muscle tension, and such. What we can somewhat determine is the formula that dictates how space-time is altered by the presence of gravitating matter. [Pg 205] More accurately, we can identify what kind of space-time *is* generated by gravitating matter. When space-time isn't exactly Euclidean in a certain area but takes on a non-Euclidean character that becomes more pronounced as we approach a particular center, and when this deviation from Euclidean space follows a specific law, we summarize this situation by stating that there is gravitating matter at the center. However, this is just a concise summary of what we know. What we know involves the locations where gravitating matter is *not*, rather than where it is. Therefore, the language of cause and effect (with "force" being a specific case) is just a convenient shorthand for certain purposes; it doesn't accurately reflect anything commonly found in the physical world.

And how about matter? Is matter also no more than a convenient shorthand? This question, however, being a large one, demands a separate chapter.

And what about matter? Is matter also just a useful shortcut? This question is quite big, so it needs its own chapter.


[Pg 206]

[Pg 206]

CHAPTER XIV:
WHAT IS MATTER?

The question “What is matter?” is of the kind that is asked by metaphysicians, and answered in vast books of incredible obscurity. But I am not asking the question as metaphysician: I am asking it as a person who wants to find out what is the moral of modern physics, and more especially of the theory of relativity. It is obvious from what we have learned of that theory that matter cannot be conceived quite as it used to be. I think we can now say more or less what the new conception must be.

The question "What is matter?" is one that metaphysicians ask, often answered in lengthy books full of confusing language. However, I'm not asking this question as a metaphysician; I'm asking it as someone who wants to understand the implications of modern physics, especially the theory of relativity. From what we've learned about that theory, it's clear that matter can't be understood in the old way anymore. I believe we can now outline what this new understanding should look like.

There were two traditional conceptions of matter, both of which have had advocates ever since scientific speculation began. There were the atomists, who thought that matter consisted of tiny lumps which could never be divided; these were supposed to hit each other and then bounce off in various ways. After Newton, they were no longer supposed actually to come into contact with each other, but to attract and [Pg 207] repel each other, and move in orbits round each other. Then there were those who thought that there is matter of some kind everywhere, and that a true vacuum is impossible. Descartes held this view, and attributed the motions of the planets to vortices in the ether. The Newtonian theory of gravitation caused the view that there is matter everywhere to fall into discredit, the more so as light was thought by Newton and his disciples to be due to actual particles traveling from the source of the light. But when this view of light was disproved, and it was shown that light consisted of waves, the ether was revived so that there should be something to undulate. The ether became still more respectable when it was found to play the same part in electromagnetic phenomena as in the propagation of light. It was even hoped that atoms might turn out to be a mode of motion of the ether. At this stage, the atomic view of matter was, on the whole, getting the worst of it.

There were two traditional views of matter, both of which have had supporters since the beginning of scientific inquiry. There were the atomists, who believed that matter was made up of tiny, indivisible particles that would collide and then bounce off each other in different ways. After Newton, it was believed that these particles didn’t actually touch, but rather attracted and repelled one another and moved in orbits. On the other hand, there were those who thought that some form of matter existed everywhere, making true vacuums impossible. Descartes held this belief and suggested that the movements of the planets were caused by vortices in the ether. The Newtonian gravity theory discredited the idea of universal matter, especially since Newton and his followers thought that light was made up of actual particles traveling from the source. However, when this particle theory of light was proven wrong and it was shown that light was made up of waves, the ether concept was revived to provide something to oscillate. The ether gained even more credibility when it was found to have a similar role in electromagnetic phenomena as it did in light propagation. There was even hope that atoms might be revealed as a form of motion within the ether. At this point, the atomic view of matter was generally losing ground. [Pg 207]

Leaving relativity aside for the moment, modern physics has provided proof of the atomic structure of ordinary matter, while not disproving the arguments in favor of the ether, to which no such structure is attributed. The result was a sort of compromise between the two views, [Pg 208] the one applying to what was called “gross” matter, the other to the ether. There can be no doubt about electrons and protons, though, as we shall see shortly, they need not be conceived as atoms were conceived traditionally. As for the ether, its status is very curious: many physicists still maintain that, without it, the propagation of light and other electromagnetic waves would be inconceivable, but except in this way it is difficult to see what purpose it serves. The truth is, I think, that relativity demands the abandonment of the old conception of “matter,” which is infected by the metaphysics associated with “substance,” and represents a point of view not really necessary in dealing with phenomena. This is what we must now investigate.

Leaving relativity aside for now, modern physics has proven the atomic structure of ordinary matter, while not disproving the arguments for the ether, to which no such structure is assigned. The outcome was a sort of compromise between the two perspectives, one applying to what was called “gross” matter and the other to the ether. There is no doubt about electrons and protons, though, as we will see soon, they don't need to be thought of in the same way atoms were traditionally understood. As for the ether, its status is quite strange: many physicists still argue that without it, the transmission of light and other electromagnetic waves would be unimaginable, but aside from this argument, it's hard to see what purpose it serves. The truth is, I believe, that relativity requires us to let go of the old idea of “matter,” which is influenced by the metaphysics linked to “substance,” and represents a viewpoint that isn’t really necessary when examining phenomena. This is what we must now explore. [Pg 208]

In the old view, a piece of matter was something which survived all through time, while never being at more than one place at a given time. This way of looking at things is obviously connected with the complete separation of space and time in which people formerly believed. When we substitute space-time for space and time, we shall naturally expect to derive the physical world from constituents which are as limited in [Pg 209] time as in space. Such constituents are what we call “events.” An event does not persist and move, like the traditional piece of matter; it merely exists for its little moment and then ceases. A piece of matter will thus be resolved into a series of events. Just as, in the old view, an extended body was composed of a number of particles, so, now, each particle, being extended in time, must be regarded as composed of what we may call “event-particles.” The whole series of these events makes up the whole history of the particle, and the particle is regarded as being its history, not some metaphysical entity to which the events happen. This view is rendered necessary by the fact that relativity compels us to place time and space more on a level than they were in the older physics.

In the old perspective, a piece of matter was something that lasted over time, while never being in more than one place at any given moment. This way of thinking is clearly linked to the complete separation of space and time that people used to believe in. When we replace space and time with space-time, we naturally expect to derive the physical world from components that are as limited in time as they are in space. These components are what we call “events.” An event doesn’t last and move like a traditional piece of matter; it simply exists for its brief moment and then disappears. Therefore, a piece of matter can be broken down into a series of events. Just as, in the old view, an extended body was made up of several particles, now each particle, being extended in time, must be seen as made up of what we can call “event-particles.” The entire series of these events forms the complete history of the particle, and the particle is seen as being its history, not some metaphysical entity to which the events occur. This view is necessary because relativity pushes us to position time and space more equally than they were in earlier physics. [Pg 209]

This abstract requirement must be brought into relation with the known facts of the physical world. Now what are the known facts? Let us take it as conceded that light consists of waves traveling with the received velocity. We then know a great deal about what goes on in the parts of space-time where there is no matter; we know, that is to say, that there are periodic occurrences (light waves) obeying certain [Pg 210] laws. These light waves start from atoms, and the modern theory of the structure of the atoms enables us to know a great deal about the circumstances under which they start, and the reasons which determine their wave lengths. We can find out not only how one light wave travels, but how its source moves relatively to ourselves. But when I say this I am assuming that we can recognise a source of light as the same at two slightly different times. This is, however, the very thing which had to be investigated.

This abstract requirement needs to be connected with the known facts of the physical world. So, what are those known facts? Let's assume that light consists of waves traveling at the established speed. We then have a solid understanding of what happens in parts of space-time where there is no matter; that is, we know that there are periodic occurrences (light waves) following specific [Pg 210] laws. These light waves originate from atoms, and modern atomic structure theory allows us to understand a lot about the conditions under which they are emitted and the reasons that determine their wavelengths. We can determine not only how one light wave travels but also how its source moves in relation to us. However, when I say this, I am assuming we can identify a source of light as the same at two slightly different times. This is, in fact, the very thing that needed to be explored.

We saw, in the preceding chapter, how a group of connected events can be formed, all related to each other by a law, and all ranged about a center in space-time. Such a group of events will be the arrival, at various places, of the light waves emitted by a brief flash of light. We do not need to suppose that anything particular is happening at the center; certainly we do not need to suppose that we know what is happening there. What we know is that, as a matter of geometry, the group of events in question are ranged about a center, like widening ripples on a pool when a fly has touched it. We can hypothetically invent an occurrence which is to have happened at the center, and set [Pg 211] forth the laws by which the consequent disturbance is transmitted. This hypothetical occurrence will then appear to common sense as the “cause” of the disturbance. It will also count as one event in the biography of the particle of matter which is supposed to occupy the center of the disturbance.

We saw in the previous chapter how a connected group of events can be formed, all related by a law and all arranged around a center in space-time. This group of events will be the arrival of light waves emitted by a brief flash of light at various locations. We don’t need to assume that anything specific is happening at the center; in fact, we don’t need to know what is happening there. What we do know is that, in terms of geometry, the group of events is arranged around a center, like ripples spreading out on a pond when a fly lands on it. We can hypothetically create an event that is supposed to have happened at the center and outline the laws by which the resulting disturbance is transmitted. This hypothetical event will then seem to common sense as the “cause” of the disturbance. It will also count as one event in the history of the particle of matter that is believed to occupy the center of the disturbance. [Pg 211]

Now we find not only that one light wave travels outward from a center according to a certain law, but also that, in general, it is followed by other closely similar light waves. The sun, for example, does not change its appearance suddenly; even if a cloud passes across it during a high wind, the transition is gradual, though swift. In this way a group of occurrences connected with a center at one point of space-time is brought into relation with other very similar groups whose centers are at neighboring points of space-time. For each of these other groups common sense invents similar hypothetical occurrences to occupy their centers, and says that all these hypothetical occurrences are part of one history; that is to say, it invents a hypothetical “particle” to which the hypothetical occurrences are to have occurred. It is only by [Pg 212] this double use of hypothesis, perfectly unnecessary in each case, that we arrive at anything that can be called “matter” in the old sense of the word.

Now we see that one light wave radiates outward from a center following a specific pattern, and usually, it’s accompanied by other similar light waves. For instance, the sun doesn’t suddenly change its look; even when a cloud moves across it in a strong wind, the shift is smooth, though quick. In this way, a series of events linked to one center in space-time connects with other similar series whose centers are at nearby points in space-time. For each of these other series, common sense creates similar hypothetical events to fill their centers and claims that all these hypothetical events belong to one story; essentially, it invents a hypothetical “particle” that these hypothetical events are said to have happened to. It’s only through this dual use of hypothesis, which is completely unnecessary in each instance, that we reach anything that can be described as “matter” in the traditional sense. [Pg 212]

If we are to avoid unnecessary hypotheses, we shall say that an electron at a given moment is the various disturbances in the surrounding medium which, in ordinary language, would be said to be “caused” by it. But we shall not take these disturbances at what is, for us, the moment in question, since that would make them depend upon the observer; we shall instead travel outward from the electron with the velocity of light, and take the disturbance we find in each place as we reach it. The closely similar set of disturbances, with very nearly the same center, which is found existing slightly earlier or slightly later, will be defined as being the electron at a slightly earlier or slightly later moment. In this way, we preserve all the laws of physics, without having recourse to unnecessary hypotheses or inferred entities, and we remain in harmony with the general principle of economy which has enabled the theory of relativity to clear away so much useless lumber.

If we want to avoid making unnecessary assumptions, we can say that an electron at any given moment represents the various disturbances in the surrounding medium that, in everyday language, would be called the “effects” it causes. However, we won't consider these disturbances as they appear at what we define as the current moment, since that would make them dependent on the observer. Instead, we'll move outward from the electron at the speed of light and note the disturbance we encounter in each location as we arrive. The similar set of disturbances that exist just a bit earlier or later, with nearly the same center, will be defined as being the electron at a slightly earlier or slightly later moment. This way, we maintain all the laws of physics without resorting to unnecessary assumptions or inferred entities, while also aligning with the overarching principle of economy that has helped the theory of relativity eliminate so much unnecessary complexity.

Common sense imagines that when it sees a table it sees a table. This is a gross delusion. When common sense sees a table, certain light [Pg 213] waves reach its eyes, and these are of a sort which, in its previous experience, has been associated with certain sensations of touch, as well as with other people’s testimony that they also saw the table. But none of this ever brought us to the table itself. The light waves caused occurrences in our eyes, and these caused occurrences in the optic nerve, and these in turn caused occurrences in the brain. Any one of these, happening without the usual preliminaries, would have caused us to have the sensations we call “seeing the table,” even if there had been no table. (Of course, if matter in general is to be interpreted as a group of occurrences, this must apply also to the eye, the optic nerve, and the brain.) As to the sense of touch when we press the table with our fingers, that is an electric disturbance in the electrons and protons of our finger tips, produced, according to modern physics, by the proximity of the electrons and protons in the table. If the same disturbance in our finger tips arose in any other way, we should have the same sensations, in spite of there being no table. The testimony of others is obviously a second-hand affair. A witness in a law court, [Pg 214] if asked whether he had seen some occurrence, would not be allowed to reply that he believed so because of the testimony of others to that effect. In any case, testimony consists of sound waves and demands psychological as well as physical interpretation; its connection with the object is therefore very indirect. For all these reasons, when we say that a man “sees a table,” we use a highly abbreviated form of expression, concealing complicated and difficult inferences, the validity of which may well be open to question.

Common sense thinks that when it sees a table, it really sees a table. This is a major misconception. When common sense sees a table, certain light waves hit its eyes, and these waves are linked to past experiences associated with certain feelings of touch, as well as with the testimony of others who claim they also saw the table. But none of this actually brings us to the table itself. The light waves trigger reactions in our eyes, which then send signals through the optic nerve, and those signals activate our brain. Any one of these processes, happening without the usual steps, would lead us to have the sensations we call “seeing the table,” even if there were no table there. (Of course, if we interpret matter as a series of occurrences, this also applies to the eye, the optic nerve, and the brain.) As for our sense of touch when we press the table with our fingers, that is an electrical disturbance in the electrons and protons of our fingertips, caused, according to modern physics, by the proximity of the electrons and protons in the table. If that same disturbance in our fingertips happened in any other way, we would have the same sensations, even if there were no table. The testimony of others is clearly secondary. A witness in a courtroom, if asked whether they saw something happen, wouldn’t be allowed to answer that they believe so based on what others said. In any case, testimony consists of sound waves and requires both psychological and physical interpretation; its link to the object is therefore very indirect. For all these reasons, when we say that someone “sees a table,” we’re using a shorthand expression that hides complicated and challenging inferences, the validity of which might be questionable.

But we are in danger of becoming entangled in psychological questions, which we must avoid if we can. Let us therefore return to the purely physical point of view.

But we risk getting caught up in psychological questions, which we should avoid if possible. So, let's go back to a purely physical perspective.

What I wish to suggest may be put as follows. Everything that occurs elsewhere, owing to the existence of an electron, can be explored experimentally, at least in theory, unless it occurs in certain concealed ways. But what occurs within the electron (if anything occurs there) it is absolutely impossible to know: there is no conceivable apparatus by which we could obtain even a glimpse of it. An electron is known by its “effects.” But the word “effects” belongs to a view of [Pg 215] causation which will not fit modern physics, and in particular will not fit relativity. All that we have a right to say is that certain groups of occurrences happen together, that is to say, in neighboring parts of space-time. A given observer will regard one member of the group as earlier than the other, but another observer may judge the time order differently. And even when the time order is the same for all observers, all that we really have is a connection between the two events, which works equally backwards and forwards. It is not true that the past determines the future in some sense other than that in which the future determines the past: the apparent difference is only due to our ignorance, because we know less about the future than about the past. This is a mere accident: there might be beings who would remember the future and have to infer the past. The feelings of such beings in these matters would be the exact opposite of our own, but no more fallacious.

What I want to suggest can be summed up like this. Everything that happens elsewhere because of an electron can be studied experimentally, at least in theory, unless it happens in specific hidden ways. But what happens inside the electron (if anything does) is completely unknowable: there’s no conceivable device that could give us even a hint of it. We understand an electron through its “effects.” However, the term “effects” comes from a perspective on causation that doesn’t align with modern physics, especially relativity. All we can really say is that certain events occur together, meaning in nearby areas of space-time. One observer may see one event as happening before another, while a different observer might perceive the order differently. Even when everyone agrees on the time order, all we really have is a connection between the two events, which works both forwards and backwards. It isn’t accurate to say that the past determines the future in any way other than how the future determines the past; the seeming difference is just due to our limited understanding, as we know less about the future than the past. This is merely coincidental: there could be beings who remember the future and have to deduce the past. Their experiences in these matters would be the exact opposite of ours, but no less valid.

The moral of this is that, if an electron is only known by its “effects,” there is no reason to suppose that anything exists except the “effects.” In so far as these “effects” consist of light waves [Pg 216] and other electromagnetic disturbances, we may say that what is called “empty space” consists of regions where these disturbances are propagated freely. Every such disturbance, we find, has a center, and when we get very near the center (though still at a finite distance from it) we find that the law of propagation of the disturbance ceases to be valid. This region within which the law does not hold is called “matter”; it will be an electron or proton according to circumstances. The region so defined is found to move relatively to other such regions, and its movements follow the known laws of dynamics. So far, this theory provides for electromagnetic phenomena and the motions of matter; and it does so without assuming that “matter” is anything but systems of electromagnetic phenomena. In order to carry out the theory fully, it would no doubt be necessary to introduce many complications. But it seems fairly clear that all the facts and laws of physics can be interpreted without assuming that “matter” is anything more than groups of events, each event being of the sort which we should naturally regard as “caused” by the matter in question. This does not [Pg 217] involve any change in the symbols or formulæ of physics: it is merely a question of interpretation of the symbols.

The takeaway here is that if we only know an electron through its “effects,” there’s no reason to believe that anything exists apart from those “effects.” As these “effects” involve light waves and other electromagnetic disturbances, we can say that what we call “empty space” is made up of areas where these disturbances move freely. Each disturbance has a center, and when we get very close to this center (even if we’re still at a finite distance), we discover that the propagation laws of the disturbance no longer apply. This area where the laws break down is termed “matter”; it will be an electron or a proton depending on the situation. This defined area is found to move in relation to other similar areas, and its movements adhere to the known laws of dynamics. So far, this theory accounts for electromagnetic phenomena and the behavior of matter without assuming that “matter” is anything other than systems of electromagnetic happenings. To fully develop the theory, it would likely need several complications. However, it seems quite clear that all the facts and laws of physics can be understood without suggesting that “matter” is anything beyond groups of events, with each event being something we would naturally consider to be “caused” by the relevant matter. This doesn’t require any changes to the symbols or formulas of physics; it’s simply a matter of how we interpret the symbols.

This latitude in interpretation is a characteristic of mathematical physics. What we know is certain very abstract logical relations, which we express in mathematical formulæ; we know also that, at certain points, we arrive at results which are capable of being tested experimentally. Take, for example, the eclipse observations by which Einstein’s theory as to the bending of light was established. The actual observation consisted in the careful measurement of certain distances on certain photographic plates. The formulæ which were to be verified were concerned with the course of light in passing near the sun. Although the part of these formulæ which gives the observed result must always be interpreted in the same way, the other part of them may be capable of a great variety of interpretations. The formulæ giving the motions of the planets are almost exactly the same in Einstein’s theory as in Newton’s, but the meaning of the formulæ is quite different. It may be said generally that, in the mathematical treatment of nature, we can be far more certain that our formulæ are [Pg 218] approximately correct than we can be as to the correctness of this or that interpretation of them. And so in the case with which this chapter is concerned: the question as to the nature of an electron or a proton is by no means answered when we know all that mathematical physics has to say as to the laws of its motion and the laws of its interaction with the environment. A definite and conclusive answer to our question is not possible just because a variety of answers are compatible with the truth of mathematical physics. Nevertheless some answers are preferable to others, because some have a greater probability in their favor. We have been seeking, in this chapter, to define matter so that there must be such a thing if the formulæ of physics are true. If we had made our definition such as to secure that a particle of matter should be what one thinks of as substantial, a hard, definite lump, we should not have been sure that any such thing exists. That is why our definition, though it may seem complicated, is preferable from the point of view of logical economy and scientific caution.

This flexibility in interpretation is a hallmark of mathematical physics. What we know are certain abstract logical relationships that we express in mathematical formulas; we also know that, at certain points, we achieve results that can be tested experimentally. Take, for instance, the eclipse observations that established Einstein’s theory about the bending of light. The actual observation involved carefully measuring certain distances on specific photographic plates. The formulas that needed verification were related to the path of light passing near the sun. Although the part of these formulas that gives the observed result must always be interpreted in the same way, the other part can have a wide range of interpretations. The formulas describing the motions of the planets are almost identical in Einstein’s theory and Newton’s, but the meaning of the formulas is quite different. Generally speaking, in the mathematical treatment of nature, we can be much more confident that our formulas are approximately correct than we can be about the accuracy of this or that interpretation of them. This is also true for the issue this chapter is focused on: knowing all that mathematical physics has to say about the laws of an electron's or proton's motion and interaction with the environment does not definitively answer the question about their nature. A clear and conclusive answer to our question is unattainable because multiple answers can be consistent with the truth of mathematical physics. Nevertheless, some answers are more favorable than others because they have a higher likelihood of being correct. In this chapter, we have aimed to define matter in such a way that it must exist if the formulas of physics are true. If we defined it to ensure that a particle of matter is what we consider substantial—a hard, solid object—we would not be sure that such a thing exists. That’s why our definition, even though it may seem complicated, is preferable for logical clarity and scientific caution.


[Pg 219]

[Pg 219]

CHAPTER XV:
PHILOSOPHICAL CONSEQUENCES

The philosophical consequences of relativity are neither so great nor so startling as is sometimes thought. It throws very little light on time-honored controversies, such as that between realism and idealism. Some people think that it supports Kant’s view that space and time are “subjective” and are “forms of intuition.” I think such people have been misled by the way in which writers on relativity speak of “the observer.” It is natural to suppose that the observer is a human being, or at least a mind; but he is just as likely to be a photographic plate or a clock. That is to say, the odd results as to the difference between one “point of view” and another are concerned with “point of view” in a sense applicable to physical instruments just as much as to people with [Pg 220] perceptions. The “subjectivity” concerned in the theory of relativity is a physical subjectivity, which would exist equally if there were no such things as minds or senses in the world.

The philosophical consequences of relativity aren't as significant or surprising as people often think. It doesn't shed much light on old debates, like the one between realism and idealism. Some believe it backs up Kant’s idea that space and time are “subjective” and are “forms of intuition.” I think these people are misled by how writers on relativity refer to “the observer.” It’s easy to assume that the observer is a human or at least a mind; however, it could just as easily be a photographic plate or a clock. In other words, the strange results regarding the differences between one “point of view” and another relate to “point of view” in a way that applies to physical instruments just as much as to people with [Pg 220] perceptions. The “subjectivity” involved in the theory of relativity is a physical subjectivity, which would exist even if there were no minds or senses in the world.

Moreover, it is a strictly limited subjectivity. The theory does not say that everything is relative; on the contrary, it gives a technique for distinguishing what is relative from what belongs to a physical occurrence in its own right. If we are going to say that the theory supports Kant about space and time, we shall have to say that it refutes him about space-time. In my view, neither statement is correct. I see no reason why, on such issues, philosophers should not all stick to the views they previously held. There were no conclusive arguments on either side before, and there are none now; to hold either view shows a dogmatic rather than a scientific temper.

Moreover, it is a strictly limited perspective. The theory doesn’t claim that everything is relative; instead, it provides a method for distinguishing what is relative from what exists as a physical occurrence in its own right. If we're going to argue that the theory supports Kant's ideas about space and time, we'll also need to claim that it contradicts him regarding space-time. In my opinion, neither claim is accurate. I see no reason why, on these topics, philosophers shouldn't stick to their previous beliefs. There were no decisive arguments on either side before, and there are none now; holding either view reflects a dogmatic attitude rather than a scientific one.

Nevertheless, when the ideas involved in Einstein’s work have become familiar, as they will when they are taught in schools, certain changes in our habits of thought are likely to result, and to have great importance in the long run.

Nevertheless, once the concepts from Einstein’s work become well-known, as they will when taught in schools, certain changes in our ways of thinking are likely to happen, and they will be significant in the long run.

One thing which emerges is that physics tells us much less about the physical world than we thought it did. Almost all the “great principles” of traditional physics turn out to be like the “great [Pg 221] law” that there are always three feet to a yard; others turn out to be downright false. The conservation of mass may serve to illustrate both these misfortunes to which a “law” is liable. Mass used to be defined as “quantity of matter,” and as far as experiment showed it was never increased or diminished. But with the greater accuracy of modern measurements, curious things were found to happen. In the first place, the mass as measured was found to increase with the velocity; this kind of mass was found to be really the same thing as energy. This kind of mass is not constant for a given body, but the total amount of it in the universe is conserved, or at least obeys a law very closely analogous to conservation. This law itself, however, is to be regarded as a truism, of the nature of the “law” that there are three feet to a yard; it results from our methods of measurement, and does not express a genuine property of matter. The other kind of mass, which we may call “proper mass,” is that which is found to be the mass by an observer moving with the body. This is the ordinary terrestrial case, where the body we are weighing is not flying through the air. The “proper [Pg 222] mass” of a body is very nearly constant, but not quite, and the total amount of “proper mass” in the world is not quite constant. One would suppose that if you have four one-pound weights, and you put them all together into the scales, they will together weigh four pounds. This is a fond delusion: they weigh rather less, though not enough less to be discovered by even the most careful measurements. In the case of four hydrogen atoms, however, when they are put together to make one helium atom, the defect is noticeable; the helium atom weighs measurably less than four separate hydrogen atoms.

One thing that stands out is that physics actually tells us much less about the physical world than we used to think. Almost all the “great principles” of traditional physics are like the “great law” stating that there are always three feet in a yard; others are just plain false. The conservation of mass can illustrate both these issues that a “law” can have. Mass used to be defined as “quantity of matter,” and according to experiments, it was never increased or decreased. However, with modern measurements getting more precise, interesting things started to happen. First, the mass that was measured increased with speed; this kind of mass turned out to be the same as energy. This type of mass isn't constant for a specific body, but the total amount of it in the universe is conserved, or at least follows a law that closely resembles conservation. This law should be seen as a truism, similar to the “law” that there are three feet in a yard; it comes from how we measure things and doesn't reflect a true property of matter. The other type of mass, which we can call “proper mass,” is what an observer moving with the body measures. This is the typical terrestrial scenario, where the object we’re weighing isn’t flying through the air. The “proper mass” of an object is almost constant, but not entirely, and the total amount of “proper mass” in the world isn’t completely constant either. You might think that if you have four one-pound weights and put them all on the scale, they would weigh four pounds. This is a nice illusion: they actually weigh a bit less, though not enough less to be detected by even the most precise measurements. In the case of four hydrogen atoms, though, when they combine to form one helium atom, the shortfall is noticeable; the helium atom weighs measurably less than four separate hydrogen atoms.

Broadly speaking, traditional physics has collapsed into two portions, truisms and geography. There are, however, newer portions of physics, such as the theory of quanta, which do not come under this head, but appear to give genuine knowledge of laws reached by experiment.

In general terms, traditional physics has split into two parts: basic truths and locations. However, there are newer areas of physics, like quantum theory, that don’t fit into this category but seem to provide real insights into laws discovered through experimentation.

The world which the theory of relativity presents to our imagination is not so much a world of “things” in “motion” as a world of events. It is true that there are still electrons and protons which persist, but these (as we saw in the preceding chapter) are really to be conceived as strings of connected events, like the successive notes of a song. It is events that are the stuff of [Pg 223] relativity physics. Between two events which are not too remote from each other there is, in the general theory as in the special theory, a measurable relation called “interval,” which appears to be the physical reality of which lapse of time and distance in space are two more or less confused representations. Between two distant events, there is not any one definite interval. But there is one way of moving from one event to another which makes the sum of all the little intervals along the route greater than by any other route. This route is called a “geodesic,” and it is the route which a body will choose if left to itself.

The world that the theory of relativity shows us isn’t just a world of “things” in “motion,” but a world of events. While electrons and protons still exist, they should be viewed as strings of connected events, similar to the successive notes of a song, as we discussed in the previous chapter. It’s events that make up the essence of [Pg 223] relativity physics. Between two events that aren’t too far apart, there’s a measurable relationship called “interval,” which represents the physical reality of how time passes and distances in space are often blurry concepts. For two distant events, there isn’t just one clear interval. However, there is one specific way to get from one event to another that makes the total of all the small intervals along that path greater than any other route. This path is known as a “geodesic,” and it’s the path a body will naturally take if left on its own.

The whole of relativity physics is a much more step-by-step matter than the physics and geometry of former days. Euclid’s straight lines have to be replaced by light rays, which do not quite come up to Euclid’s standard of straightness when they pass near the sun or any other very heavy body. The sum of the angles of a triangle is still thought to be two right angles in very remote regions of empty space, but not where there is matter in the neighborhood. We, who cannot leave the earth, are incapable of reaching a place where Euclid is true. Propositions [Pg 224] which used to be proved by reasoning have now become either conventions, or merely approximate truths verified by observation.

The entire field of relativity is much more step-by-step than the physics and geometry of earlier times. Euclid’s straight lines must be substituted with light rays, which don’t quite meet Euclid’s definition of straightness when they travel near the sun or any other massive body. The total of the angles in a triangle is still considered to be two right angles in very distant areas of empty space, but not where matter is present. We, who cannot leave Earth, are unable to reach a place where Euclid’s principles hold true. Propositions that used to be proved through reasoning have now become either conventions or just approximate truths confirmed by observation. [Pg 224]

It is a curious fact—of which relativity is not the only illustration—that, as reasoning improves, its claims to the power of proving facts grow less and less. Logic used to be thought to teach us how to draw inferences; now, it teaches us rather how not to draw inferences. Animals and children are terribly prone to inference: a horse is surprised beyond measure if you take an unusual turning. When men began to reason, they tried to justify the inferences that they had drawn unthinkingly in earlier days. A great deal of bad philosophy and bad science resulted from this propensity. “Great principles,” such as the “uniformity of nature,” the “law of universal causation,” and so on, are attempts to bolster up our belief that what has often happened before will happen again, which is no better founded than the horse’s belief that you will take the turning you usually take. It is not altogether easy to see what is to replace these pseudo-principles in the practice of science; but perhaps the theory of relativity gives us a glimpse of the kind of thing we may expect. Causation, in the [Pg 225] old sense, no longer has a place in theoretical physics. There is, of course, something else which takes its place, but the substitute appears to have a better empirical foundation than the old principle which it has superseded.

It’s an interesting fact—of which relativity is just one example—that, as reasoning gets better, its ability to prove facts seems to lessen. People used to think logic taught us how to make inferences; now, it teaches us more about how not to make them. Animals and children are really quick to jump to conclusions: for instance, a horse is completely taken aback if you take an unexpected turn. When humans started to reason, they tried to justify the conclusions they had made without thinking in the past. This tendency led to a lot of flawed philosophy and poor science. "Great principles," like the "uniformity of nature" and the "law of universal causation," are attempts to support our belief that what has happened frequently before will occur again, which is no more solid than the horse thinking you’ll take the same turn you usually do. It’s not easy to see what will replace these fake principles in scientific practice; however, the theory of relativity might give us a hint of what to expect. Causation, in the traditional sense, no longer fits into theoretical physics. Of course, something else has taken its place, but the new concept seems to be based on better empirical evidence than the old principle it has replaced. [Pg 225]

The collapse of the notion of one all-embracing time, in which all events throughout the universe can be dated, must in the long run affect our views as to cause and effect, evolution, and many other matters. For instance, the question whether, on the whole, there is progress in the universe, may depend upon our choice of a measure of time. If we choose one out of a number of equally good clocks, we may find that the universe is progressing as fast as the most optimistic American thinks it is; if we choose another equally good clock, we may find that the universe is going from bad to worse as fast as the most melancholy Slav could imagine. Thus optimism and pessimism are neither true nor false, but depend upon the choice of clocks.

The breakdown of the idea of a single, all-encompassing time in which all events in the universe can be dated will eventually influence our views on cause and effect, evolution, and many other issues. For example, whether or not there's overall progress in the universe might depend on how we measure time. If we pick one of several equally valid clocks, we might conclude that the universe is progressing as quickly as the most hopeful American believes; if we choose a different equally valid clock, we might see the universe worsening just as fast as the most pessimistic Slav could imagine. So, optimism and pessimism are neither right nor wrong; they depend on which clock we choose.

The effect of this upon a certain type of emotion is devastating. The poet speaks of

The effect of this on a specific kind of emotion is devastating. The poet talks about

One far-off divine event
To which the whole creation moves.

[Pg 226] But if the event is sufficiently far off, and the creation moves sufficiently quickly, some parts will judge that the event has already happened, while others will judge that it is still in the future. This spoils the poetry. The second line ought to be:

[Pg 226] But if the event is far enough away and the creation moves fast enough, some parts will believe that the event has already happened, while others will think it's still to come. This messes with the poetry. The second line should be:

To which some parts of the creation move,
while others move away from it.

But this won’t do. I suggest that an emotion which can be destroyed by a little mathematics is neither very genuine nor very valuable. But this line of argument would lead to a criticism of the Victorian Age, which lies outside my theme.

But this won't cut it. I propose that an emotion that can be wiped out by some basic math is neither very real nor very meaningful. However, this line of reasoning would lead to a critique of the Victorian Age, which is outside the scope of my topic.

What we know about the physical world, I repeat, is much more abstract, than was formerly supposed. Between bodies there are occurrences, such as light waves; of the laws of these occurrences, we know something—just so much as can be expressed in mathematical formulæ—but of their nature we know nothing. Of the bodies themselves, as we saw in the preceding chapter, we know so little that we cannot even be sure that they are anything: they may be merely groups of events in other places, those events which we should [Pg 227] naturally regard as their effects. We naturally interpret the world pictorially; that is to say, we imagine that what goes on is more or less like what we see. But in fact this likeness can only extend to certain formal logical properties expressing structure, so that all we can know is certain general characteristics of its changes. Perhaps an illustration may make the matter clear. Between a piece of orchestral music as played, and the same piece of music as printed in the score, there is a certain resemblance, which may be described as a resemblance in structure. The resemblance is of such a sort that, when you know the rules, you can infer the music from the score or the score from the music. But suppose you had been stone deaf from birth, but had lived among musical people. You could understand, if you had learned to speak and to do lip-reading, that the musical scores represented something quite different from themselves in intrinsic quality, though similar in structure.[16] The value of music would be completely unimaginable to you, but you could infer all its mathematical characteristics, since [Pg 228] they are the same as those of the score. Now our knowledge of nature is something like this. We can read the scores, and infer just so much as our stone-deaf person could have inferred about music. But we have not the advantages which he derived from association with musical people. We cannot know whether the music represented by the scores is beautiful or hideous; perhaps, in the last analysis, we cannot be quite sure that the scores represent anything but themselves. But this is a doubt which the physicist, in his professional capacity, cannot permit himself to entertain.

What we know about the physical world, I want to emphasize, is much more abstract than we used to think. There are events happening between objects, like light waves; we understand some of the laws governing these events—just enough to put it into mathematical formulas—but we know nothing about their nature. As we saw in the previous chapter, we have so little knowledge of the bodies themselves that we can’t even be sure they exist: they might just be clusters of events happening elsewhere, those events that we would normally consider their effects. We tend to picture the world in a straightforward way; that means we think that what occurs is somewhat like what we see. However, this similarity can only go so far as certain formal logical properties that express structure, so all we can really know are some general traits of its changes. Perhaps an example can clarify this. Between a piece of orchestral music as performed and the same piece as written in the score, there’s a certain similarity, which can be described as a similarity in structure. The resemblance is such that, when you understand the rules, you can deduce the music from the score or the score from the music. But imagine you’ve been completely deaf from birth, yet you’ve lived among musical people. You could understand, if you had learned to speak and read lips, that the musical scores represented something quite different from themselves in intrinsic quality, even though they’re similar in structure. The value of music would be completely unimaginable to you, but you could infer all of its mathematical characteristics, as they are the same as those of the score. Our knowledge of nature is somewhat like this. We can read the scores and infer as much as our deaf person could have inferred about music. However, we don't have the advantages they would have gained from being around musical people. We can’t know if the music represented by the scores is beautiful or awful; maybe, in the end, we can’t be completely sure that the scores represent anything other than themselves. But this is a doubt that a physicist, in their professional role, cannot allow themselves to consider.

Assuming the utmost that can be claimed for physics, it does not tell us what it is that changes, or what are its various states; it only tells us such things as that changes follow each other periodically, or spread with a certain speed. Even now we are probably not at the end of the process of stripping away what is merely imagination, in order to reach the core of true scientific knowledge. The theory of relativity has accomplished a very great deal in this respect, and in doing so has taken us nearer and nearer to bare structure, which is the mathematician’s goal—not because it is the only thing in which he [Pg 229] is interested as a human being, but because it is the only thing that he can express in mathematical formulæ. But far as we have traveled in the direction of abstraction, it may be that we shall have to travel further still.

Assuming the most we can claim for physics, it doesn't tell us what actually changes or what its various states are; it only tells us things like changes happen periodically or spread at a certain speed. Even now, we're probably still in the process of stripping away mere imagination to reach the core of true scientific knowledge. The theory of relativity has achieved a lot in this regard, bringing us closer to the bare structure, which is what mathematicians aim for—not because it’s the only thing that interests them as people, but because it's the only thing they can express using mathematical formulas. But even with how far we've come in terms of abstraction, we might still need to go even further.

In the preceding chapter, I suggested what may be called a minimum definition of matter, that is to say, one in which matter has, so to speak, as little “substance” as is compatible with the truth of physics. In adopting a definition of this kind, we are playing for safety: our tenuous matter will exist, even if something more beefy also exists. We tried to make our definition of matter, like Isabella’s gruel in Jane Austen, “thin, but not too thin.” We shall, however, fall into error if we assert positively that matter is nothing more than this. Leibniz thought that a piece of matter is really a colony of souls. There is nothing to show that he was wrong, though there is also nothing to show that he was right: we know no more about it either way than we do about the flora and fauna of Mars.

In the previous chapter, I proposed a basic definition of matter, meaning one where matter has, so to speak, the least amount of “substance” necessary to align with the principles of physics. By adopting a definition like this, we’re playing it safe: our minimal matter will exist, even if something more substantial also exists. We aimed to make our definition of matter, like Isabella’s gruel in Jane Austen, “thin, but not too thin.” However, we would be mistaken if we claim that matter is just this. Leibniz believed that a piece of matter is essentially a colony of souls. There’s no evidence that he was wrong, but there’s also no evidence that he was right: we know just as little about it as we do about the plants and animals on Mars.

To the non-mathematical mind, the abstract character of our physical knowledge may seem unsatisfactory. From an artistic or imaginative [Pg 230] point of view, it is perhaps regrettable, but from a practical point of view it is of no consequence. Abstraction, difficult as it is, is the source of practical power. A financier, whose dealings with the world are more abstract than those of any other “practical” man, is also more powerful than any other practical man. He can deal in wheat or cotton without needing ever to have seen either: all he needs to know is whether they will go up or down. This is abstract mathematical knowledge, at least as compared to the knowledge of the agriculturist. Similarly the physicist, who knows nothing of matter except certain laws of its movements, nevertheless knows enough to enable him to manipulate it. After working through whole strings of equations, in which the symbols stand for things whose intrinsic nature can never be known to us, he arrives at last at a result which can be interpreted in terms of our own perceptions, and utilized to bring about desired effects in our own lives. What we know about matter, abstract and schematic as it is, is enough, in principle, to tell us the rules according to which it produces perceptions and feelings in ourselves; and it is upon these rules that the practical uses of physics depend. [Pg 231]

To someone who isn't into math, the abstract nature of our physical knowledge might seem unsatisfying. From an artistic or imaginative perspective, this could be seen as unfortunate, but when it comes to practical matters, it doesn’t really matter. Abstraction, as challenging as it is, is the foundation of practical power. A financier, whose interactions with the world are more abstract than those of any other "practical" person, holds more power than anyone else in that realm. They can trade in wheat or cotton without ever having seen either; all they need to know is whether prices will rise or fall. This represents abstract mathematical knowledge, at least when compared to that of a farmer. Similarly, a physicist, who understands nothing about matter except for certain laws governing its movements, still knows enough to manipulate it. After working through complex equations where the symbols represent things we can never fully comprehend, they finally arrive at results that can be connected to our own perceptions and used to achieve desired outcomes in our lives. What we understand about matter, while abstract and schematic, is sufficient, in principle, to explain the rules by which it creates perceptions and feelings within us; and it is these rules that the practical applications of physics rely on.

The final conclusion is that we know very little, and yet it is astonishing that we know so much, and still more astonishing that so little knowledge can give us so much power.

The final conclusion is that we know very little, but it's amazing that we know so much, and even more amazing that such little knowledge can give us so much power.

THE END

THE END

Footnotes:

References:

[1] A contemporary Chinese ode, after giving the day of the year correctly, proceeds:

[1] A modern Chinese poem, after accurately stating the date, continues:

“For the moon to be eclipsed
Is but an ordinary matter.
Now that the sun has been eclipsed,
How bad it is.”

[2] I shall define “interval” in a moment.

__A_TAG_PLACEHOLDER_0__ I’ll explain “interval” soon.

[3] So long as he has no considerable acceleration. The treatment of acceleration belongs to the general theory of relativity.

[3] So long as he has no significant acceleration. The topic of acceleration is covered in the general theory of relativity.

[4] This does not mean that its velocity is increasing, but that it is changing its direction. The only sort of motion which is called “unaccelerated” is motion with uniform velocity in a straight line.

[4] This doesn’t mean that its speed is increasing, but that it is changing direction. The only type of motion referred to as “unaccelerated” is motion with a constant speed in a straight line.

[5] See his Space, Time, Matter, Methuen, 1922.

[5] See his Space, Time, Matter, Methuen, 1922.

[6] Although “force” is no longer to be regarded as one of the fundamental concepts of dynamics, but only as a convenient way of speaking, it can still be employed, like “sunrise” and “sunset,” provided we realize what we mean. Often it would require very roundabout expressions to avoid the term “force.”

[6] Although "force" isn’t seen as one of the core concepts of dynamics anymore, but rather just a useful term, we can still use it, like "sunrise" and "sunset," as long as we understand what we’re talking about. Often, it would take some convoluted phrasing to steer clear of the word "force."

[7] See Eddington, The Mathematical Theory of Relativity, Cambridge University Press, 2d edition, p. 128.

[7] See Eddington, The Mathematical Theory of Relativity, Cambridge University Press, 2nd edition, p. 128.

[8] This is subject to the explanations given below as regards conservation of energy.

[8] This is based on the explanations provided below about energy conservation.

[9] Mathematical Theory of Relativity, p. 135.

[9] Mathematical Theory of Relativity, p. 135.

[10] On this subject, see the present author’s A.B.C. of Atoms, chaps. VI and XIII.

[10] For more on this topic, check out the author's A.B.C. of Atoms, chapters VI and XIII.

[11] Op. cit. § 60.

__A_TAG_PLACEHOLDER_0__ Same source § 60.

[12] See Eddington, Space, Time and Gravitation, p. 162ff.

[12] See Eddington, Space, Time and Gravitation, p. 162ff.

[13] “Isotropy” means being similar in all directions—e.g., that a foot rule is as long when it points north as when it points east.

[13] “Isotropy” means being the same in every direction—e.g., that a ruler is just as long when it faces north as it is when it faces east.

[14] Mathematical Theory of Relativity, p. 238.

[14] Mathematical Theory of Relativity, p. 238.

[15] Mathematical Theory of Relativity, pp. 37-38. Italics in the original.

[15] Mathematical Theory of Relativity, pp. 37-38. Italics in the original.

[16] For the definition of “structure,” see the present author’s Introduction to Mathematical Philosophy.

[16] For the definition of “structure,” check out the current author's Introduction to Mathematical Philosophy.

Transcriber’s Notes:

Transcriber’s Notes:


The cover image was created by the transcriber, and is in the public domain.

The cover image was made by the transcriber and is in the public domain.

The illustrations have been moved so that they do not break up paragraphs and so that they are next to the text they illustrate.

The illustrations have been repositioned so they don't disrupt paragraphs and are placed next to the text they depict.

Typographical and punctuation errors have been silently corrected.

Typographical and punctuation errors have been quietly fixed.


Download ePUB

If you like this ebook, consider a donation!