This is a modern-English version of Einstein's Theories of Relativity and Gravitation: A selection of material from the essays submitted in the competition for the Eugene Higgins prize of $5,000, originally written by Bird, J. Malcolm (James Malcolm).
It has been thoroughly updated, including changes to sentence structure, words, spelling,
and grammar—to ensure clarity for contemporary readers, while preserving the original spirit and nuance. If
you click on a paragraph, you will see the original text that we modified, and you can toggle between the two versions.
Scroll to the bottom of this page and you will find a free ePUB download link for this book.


Dr. Albert Einstein,
Originator of the Special and General Theories of Relativity
Dr. Albert Einstein,
Creator of the Special and General Theories of Relativity

SCIENTIFIC AMERICAN PUBLISHING CO.,
MUNN & CO.
1921
Copyright 1921
by
Scientific American Publishing Company
Copyright 1921
by
Scientific American Publishing Co.
All rights reserved
All rights reserved
Great Britain copyright secured
UK copyright secured
The right of translation is reserved in all languages, including the Scandinavian
The right to translate is reserved in all languages, including Scandinavian.
Swedish rights secured by Thall and Carlsson, Stockholm [iii]
Swedish rights secured by Thall and Carlsson, Stockholm [__A_TAG_PLACEHOLDER_0__]
PREFACE
The obstacles which the layman finds to understanding Einstein’s relativity theories lie not so much in the inherent difficulty of these theories themselves as in the difficulty of preparing the mind for their reception. The theory is no more difficult than any scientific development of comparable depth; it is not so difficult as some of these. But it is a fact that for a decent understanding of it, a large background of scientific knowledge and scientific habit of thought is essential. The bulk of the writers who have attempted to explain Einstein to the general reader have not realized the great gulf which lies between the mental processes of the trained mathematician and those of the man in the street. They have not perceived that the lay reader must be personally conducted for a long distance from the vestibule of the temple of science before he comes to Einstein, and that he cannot by any possibility make this journey unaided. The result has been to pitchfork the reader into the intricacies of the subject without adequate preparation.
The challenges that everyday people face in understanding Einstein's theories of relativity come less from the theories themselves being inherently difficult and more from the struggle to prepare one's mind to grasp them. The theories aren't any harder than other scientific developments of similar complexity; in fact, some are harder. However, to truly understand them, a solid foundation of scientific knowledge and a scientific way of thinking are crucial. Many writers who have tried to explain Einstein to the general public haven't recognized the significant gap between the way trained mathematicians think and the way regular people think. They haven't understood that the average reader needs to be guided through a long process before reaching Einstein, and that it’s impossible for them to make this journey alone. As a result, readers are often thrown into the complexities of the subject without adequate preparation.
The present volume avoids this mistake with the utmost care. It avoids it, in fact, with such deliberation as to make it in order to say a word in explanation of what will at first glance seem an extraordinary arrangement of material. It was to be expected, doubtless, that this book would open with a brief statement of the genesis and the outcome of [iv]the Einstein Prize Essay Contest for the $5,000 prize offered by Mr. Eugene Higgins. It was doubtless to be expected that, after this had been dismissed, the winning essay would be given the post of honor in advance of all other material bearing actually on the Einstein theories. When the reader observes that this has not been done, he will by all means expect a word of explanation; and it is mainly for the purpose of giving this that we make these introductory remarks.
The current volume carefully avoids this mistake. In fact, it does so so intentionally that we need to explain what might initially seem like an unusual arrangement of material. It’s understandable that this book would start with a brief overview of how the Einstein Prize Essay Contest came about and its outcome for the $5,000 prize offered by Mr. Eugene Higgins. It’s also likely expected that, after this introduction, the winning essay would take the lead as the main focus before all other content related to Einstein’s theories. When readers notice that this hasn't happened, they will surely want an explanation; and it’s mainly to provide that clarity that we are making these introductory comments.
The essays submitted in the contest, and in particular the comments of a few disappointed readers upon Mr. Bolton’s prize essay, make quite plain what might have been anticipated—that in the small compass of 3,000 words it is not possible both to prepare the reader’s mind for a discussion of Relativity and to give a discussion that shall be adequate. Mr. Bolton himself, in replying to a protest that he had not done all this, has used the word “miracle”—we think it a well-advised one. No miracle was expected as a result of the contest, and none has been achieved. But in awarding the prize, the Judges had to decide whether it was the best preliminary exposition or the best discussion that was wanted. They decided, and rightly we believe, that the award should go to an actual statement of what the Einstein theories are and what they do, rather than to a mere introduction, however well conceived and well executed the latter might be. Nevertheless, we should be closing our eyes to a very obvious fact if we did not recognize that, without something in the way of preparation, the general reader is not going to pursue Mr. Bolton’s [v]essay, or any other essay on this subject, with profit. It is in order the more forcefully to hold out inducements to him to subject himself to this preparation that we place at the head of the book the chapters designed to give it to him.
The essays submitted for the contest, especially the feedback from a few disappointed readers about Mr. Bolton’s prize-winning essay, show clearly what we could have expected—that in just 3,000 words, it's impossible to both prepare the reader for a discussion about Relativity and provide an adequate discussion at the same time. Mr. Bolton, in response to criticism that he hadn’t achieved this, used the word “miracle”—which we think is an apt choice. No miracle was anticipated from the contest, and none occurred. However, in deciding who should win the prize, the Judges needed to choose whether they were looking for the best introductory explanation or the best discussion. They determined, and we believe correctly, that the prize should go to a clear explanation of what Einstein's theories are and what they accomplish, rather than to a mere introduction, no matter how well thought out or executed it may be. Still, we would be ignoring a very obvious truth if we didn’t acknowledge that without some preparation, the average reader isn’t going to engage with Mr. Bolton’s [__A_TAG_PLACEHOLDER_0__]essay or any other essay on this topic profitably. To encourage the reader to prepare themselves, we have included the chapters at the beginning of the book that are meant to assist in that preparation.
Chapter II. is intended so to bring the mind of the reader into contact with certain philosophical problems presented to us by our experiences with the external world and our efforts to learn the facts about it, that he may approach the subject of relativity with an appreciation of the place it occupies as a phase of human thought and a pillar of the scientific structure. Until the reader is aware of the existence of these problems and the directions taken by the efforts, successful and unsuccessful, to unravel them, he is not equipped to comprehend the doctrine of relativity at all; he is in much the same case as a child whose education had reached only the primer stage, if asked to read the masterpieces of literature. He lacks not alone the vocabulary, but equally the mental background on which the vocabulary is based.
Chapter II aims to connect the reader with certain philosophical questions that arise from our experiences with the outside world and our attempts to understand it. This way, the reader can approach the topic of relativity with an understanding of its significance as part of human thought and a foundation of scientific understanding. Until the reader acknowledges these problems and the various efforts, successful and unsuccessful, to solve them, they won't be ready to grasp the concept of relativity at all. They're much like a child whose education has only reached the beginning level if asked to read great works of literature. They not only lack the vocabulary but also the mental framework that the vocabulary relies on.
It will be noted that in this and the chapters immediately following it, the Editor has supplied material freely. The obvious interpretation is that satisfactory material covering the desired ground was not found in any of the essays; for we are sure the scope and number of the credited excerpts will make it clear that all contributions were adequately scrutinized in search of available passages. This “inadequacy” of the competing essays has been severely commented upon by several correspondents, and the inference drawn that as a whole the offerings [vi]were not up to the mark. Such a viewpoint is wholly unjust to the contestants. The essays which paid serious attention to the business of paving the way to relativity necessarily did so at the expense of completeness in the later paragraphs where specific explanation of the Einstein theories was in order. Mr. Law, whose essay was by all means the best of those that gave much space to introductory remarks, found himself left with only 600 words in which to tell what it was that he had been introducing. The majority of the contestants appear to have faced the same question as to subject matter which the Judges faced, and to have reached the same decision. They accordingly devoted their attention toward the prize, rather than toward the production of an essay that would best supplement that of the winner. It is for this very reason that, in these preliminary chapters, so large a proportion of the material has had to be supplied by the Editor; and this very circumstance is a tribute to the good judgment of the competitors, rather than ground for criticism of their work.
It should be noted that in this chapter and the ones that follow, the Editor has provided material extensively. The obvious conclusion is that satisfactory content covering the necessary topics wasn't found in any of the essays; we are confident that the range and number of the credited excerpts will show that all contributions were thoroughly reviewed for available passages. This "inadequacy" of the competing essays has been heavily criticized by several correspondents, leading to the assumption that overall, the submissions [__A_TAG_PLACEHOLDER_0__]were lacking. Such a perspective is completely unfair to the contestants. The essays that focused seriously on paving the way for relativity inevitably did so at the expense of thoroughness in the later sections where specific explanations of Einstein's theories were required. Mr. Law, whose essay was undoubtedly the best among those that included extensive introductory comments, found himself with only 600 words left to explain what he had been introducing. Most of the contestants seem to have faced the same topic-related dilemma as the Judges did and reached the same conclusion. They therefore focused their efforts on competing for the prize, rather than on creating an essay that would best complement the winner's work. This is precisely why so much material had to be supplied by the Editor in these early chapters; this situation actually reflects the sound judgment of the competitors, rather than serving as a basis for criticizing their efforts.
The general introduction of Chapter II. out of the way, Chapters III. and IV. take up the business of leading the reader into the actual subject of relativity. The subject is here developed in what may be called the historical order—the order in which it took form in Einstein’s own mind. Both in and outside the contest of which this book is the outcome, a majority of those who have written on relativity have followed this order, which is indeed a very natural one and one well calculated to give to the rather surprising assumptions of relativity [vii]a reasonableness which they might well appear to the lay mind to lack if laid down more arbitrarily. In these two chapters no effort is made to carry the argument beyond the formulation of the Special Principle of the relativity of uniform motion, but this principle is developed in considerably more detail than would be the case if it were left entirely to the competing essayists. The reason for this is again that we are dealing with a phase of the subject which is of subordinate importance so far as a complete statement of the General Theory of Relativity is concerned, but which is of the greatest significance in connection with the effort of the layman to acquire the proper preliminary orientation toward the larger subject.
The general introduction of Chapter II out of the way, Chapters III and IV focus on guiding the reader into the actual topic of relativity. The subject is developed in what might be called a historical order—the way it formed in Einstein’s own mind. Both within and outside the context of this book, most writers on relativity have followed this sequence, which is quite natural and effectively lends a sense of reasonableness to the surprising assumptions of relativity, assumptions that might seem arbitrary and harder to grasp for those unfamiliar with the topic. In these two chapters, no effort is made to extend the argument beyond the formulation of the Special Principle of the relativity of uniform motion, but this principle is explored in much greater detail than it would be if left solely to competing essayists. The reason for this is that we are addressing a phase of the subject that is of secondary importance regarding a complete explanation of the General Theory of Relativity, yet it holds significant value for helping the layperson gain the right preliminary understanding of the broader topic.
Chapter V. goes back again to general ground. Among the ideas which the competing essayists were forced to introduce into their text on a liberal scale is that of non-Euclidean geometry. The entire formulation of the General Theory of Relativity is in fact an exercise in this. The essayists—good, bad and indifferent alike—were quite unanimous in their decision that this was one thing which the reader would have to assume the responsibility of acquiring for himself. Certainly they were justified in this; for the Editor has been able to explain what non-Euclidean geometry is only by using up considerably more space than the contestants had for an entire essay. No effort has been made to set forth any of the details of any of the various non-Euclidean geometries; it has simply been the aim to draw the dividing line between Euclidean and non-Euclidean, and to make the existence of the latter appear [viii]reasonable, so that when the essayists come to talk about it the reader will not feel hopelessly at sea. In other words, this is another case of providing the mental background, but on such a scale that it has seemed necessary to give a separate chapter to it.
Chapter V goes back to general topics. One of the ideas that the competing essayists had to include in their texts extensively is non-Euclidean geometry. The entire formulation of the General Theory of Relativity is essentially an exercise in this concept. All the essayists—good, bad, and mediocre—agreed that this was something the reader would need to take the initiative to understand for themselves. They were definitely right about that; the Editor has found that explaining non-Euclidean geometry requires much more space than the contestants had for an entire essay. No attempt has been made to detail the specifics of the various non-Euclidean geometries; the focus has simply been on drawing the line between Euclidean and non-Euclidean geometry and making the existence of the latter seem reasonable, so that when the essayists discuss it, the reader won’t feel completely lost. In other words, this is another instance of providing the necessary mental groundwork, but it was deemed important enough to warrant a separate chapter.
Chapter VI. completes the preliminary course in the fundamentals of relativity by tying up together the findings of Chapter V. and those of Chapters III. and IV. It represents more or less of a last-minute change of plan; for while it had been the Editor’s intent from the beginning to place the material of Chapters II.–V. in its present position, his preliminary impression would have been that the work of the present Chapter VI. would be adequately done by the essayists themselves. His reading of the essays, however, convinced him that it had not so been done—that with the possible exception of Mr. Francis, the essayists did not make either a serious or a successful effort to show the organic connection between the Special Theory of Relativity and the Minkowski space-time structure, or the utter futility of trying to reconcile ourselves to the results of the former without employing the ideas of the latter. So Chapter VI. was supplied to make good this deficiency, and to complete the mental equipment which the reader requires for his battle with the General Theory.
Chapter VI wraps up the introductory course on the basics of relativity by connecting the findings from Chapter V with those from Chapters III and IV. This represents a bit of a last-minute change of plans; while the Editor had originally intended for the material from Chapters II–V to be in its current place, his initial thought was that the contributors could cover what Chapter VI entails. However, after reviewing the essays, he realized that this wasn't achieved—except possibly for Mr. Francis, the authors did not make a serious or effective attempt to demonstrate the link between the Special Theory of Relativity and the Minkowski space-time structure, nor the complete futility of attempting to come to terms with the results of the former without utilizing the ideas from the latter. Therefore, Chapter VI was added to fill this gap and equip the reader with the knowledge needed for tackling the General Theory.
In laying down a set of general principles to govern the award of the prize, one of the first things considered by the Judges was the relative importance of the Special and the General Theories. It was their opinion that no essay could possibly qualify for the prize which did not very distinctly [ix]give to the General Theory the center of the stage; and that in fact discussion of the Special Theory was pertinent only so long as it contributed, in proportion to the space assigned it, to the attack upon the main subject. The same principle has been employed in selecting essays for complete or substantially complete reproduction in this volume. Writers who dealt with the Special Theory in any other sense than as a preliminary step toward the General Theory have been relegated to the introductory chapters, where such excerpts from their work have been used as were found usable. The distinction of publication under name and title is reserved for those who wrote consistently and specifically upon the larger subject—with the one exception of Dr. Russell, whose exposition of the Special Theory is so far the best of those submitted and at the same time so distinctive that we have concluded it will appear to better advantage by itself than as a part of Chapters III. and IV.
In establishing guidelines for awarding the prize, one of the first things the Judges looked at was the importance of the Special and General Theories. They believed that no essay could qualify for the prize unless it prominently featured the General Theory, and that the discussion of the Special Theory was relevant only as it related to the main topic. This same approach has been used to select essays for complete or nearly complete reproduction in this volume. Writers who addressed the Special Theory in any way other than as a preliminary step toward the General Theory have been placed in the introductory chapters, where excerpts from their work have been included as appropriate. The distinction of publication under name and title is reserved for those who wrote thoughtfully and specifically about the broader topic—with one exception for Dr. Russell, whose explanation of the Special Theory is the best submitted and is so unique that we believe it will be better presented on its own rather than as part of Chapters III. and IV.
Following after Mr. Bolton’s essay we have tried to arrange the various contributions, not at all in any order of merit, but in the order that will make connected reading of the book most nearly possible and profitable. Each essay should be made easier of reading by the examination of those preceding it; at the same time each, by the choice of ground covered and by the emphasis on points not brought out sharply by its predecessors, should throw new light upon these predecessors.
Following Mr. Bolton's essay, we've tried to organize the various contributions, not by any rank or importance, but in a way that allows for a more connected and beneficial reading of the book. Each essay should be easier to read thanks to the examination of those that came before it; at the same time, each one, through its choice of topics and emphasis on points not highlighted by earlier essays, should provide new insights into those earlier works.
The reader will find that no two of the essays given thus in full duplicate or even come close to duplicating one another. They have of course been [x]selected with this in view; each represents the best of several essays of substantially the same character. Not all of them require comment here, but concerning some of them a word may well be said.
The reader will notice that no two of the essays provided here are identical or even very similar to each other. They have, of course, been [__A_TAG_PLACEHOLDER_0__]chosen with this in mind; each one represents the best of several essays of a similar nature. Not all of them need further explanation, but it’s worth mentioning a few of them.
Mr. Francis, we believe, has succeeded in packing more substance into his 3,000 words than any other competitor. Mr. Elliot has come closer than anybody else to really explaining relativity in terms familiar to everybody, without asking the reader to enlarge his vocabulary and with a minimum demand so far as enlarging his mental outlook is concerned. Were it not for certain conspicuous defects, his essay would probably have taken the prize. In justice to the Judges, we should state that we have taken the liberty of eliminating Mr. Elliot’s concluding paragraph, which was the most objectionable feature of his essay.
Mr. Francis has managed to pack more substance into his 3,000 words than any other competitor. Mr. Elliot has come closer than anyone else to really explaining relativity in terms that everyone can understand, without requiring the reader to expand their vocabulary and with minimal demand in terms of broadening their mental perspective. If it weren't for some noticeable flaws, his essay would likely have won the prize. To be fair to the Judges, we should mention that we've chosen to remove Mr. Elliot’s final paragraph, which was the most problematic part of his essay.
Dr. Dushman chose for his title the one which we adopted for this book. It became necessary, therefore, for us to find a new title for his essay; aside from this instance, the main titles appearing at the heads of the various complete essays are those of the authors. The subtitles have in practically every instance been supplied editorially.
Dr. Dushman chose the title we used for this book. So, we needed to come up with a new title for his essay; except for this case, the main titles at the beginning of the different essays are the authors' own. The subtitles have mostly been added by the editors.
Dr. Pickering submitted two essays, one written from the viewpoint of the physicist, the other from that of the astronomer. To make each complete, he naturally found it necessary to duplicate between them certain introductory and general material. We have run the two essays together into a single narrative, with the elimination of this duplicated material; aside from this blue-penciling no alteration has been made in Dr. Pickering’s text. This text however [xi]served as the basis of blue-penciling that of several other contestants, as indicated in the foot notes.
Dr. Pickering submitted two essays, one from the perspective of a physicist and the other from an astronomer's viewpoint. To make each essay complete, he found it necessary to repeat some introductory and general content in both. We have combined the two essays into a single narrative, removing the duplicated material; apart from this editing, no changes have been made to Dr. Pickering's text. This text, however, [__A_TAG_PLACEHOLDER_0__]served as the foundation for editing the texts of several other contestants, as noted in the footnotes.
For the reader who is qualified or who can qualify to understand it, Dr. Murnaghan’s essay is perhaps the most illuminating of all. Even the reader who does not understand it all will realize that its author brings to the subject a freshness of viewpoint and an originality of treatment which are rather lacking in some of the published essays, and which it will readily be understood were conspicuously lacking in a good many of the unpublished ones. Dr. Murnaghan of all the competitors has come closest to making a contribution to science as well as to the semi-popular literature of science.
For readers who are capable of understanding it, Dr. Murnaghan's essay is likely the most enlightening of all. Even those who might not grasp everything will see that the author offers a fresh perspective and a unique approach that are often missing in some of the published essays, and it's clear that these qualities were significantly absent in many of the unpublished ones. Among all the competitors, Dr. Murnaghan has come the closest to making a meaningful contribution both to science and to the general literature on science.
In the composite chapters, the brackets followed by reference numbers have been used as the most practicable means of identifying the various individual contributions. We believe that this part of the text can be read without allowing the frequent occurrence of these symbols to distract the eye. As to the references themselves, the asterisk marks the contributions of the Editor. The numbers are those attached to the essays in order of and at the time of their receipt; it has been more convenient to use these than to assign consecutive numbers to the quoted essays. The several numbers identify passages from the essays of the following contestants:
In the combined chapters, the brackets followed by reference numbers have been used as the easiest way to identify the various individual contributions. We believe that this section of the text can be read without letting the frequent appearance of these symbols distract you. Regarding the references themselves, the asterisk indicates the contributions of the Editor. The numbers correspond to the essays in the order and at the time they were received; it has been more convenient to use these than to give consecutive numbers to the quoted essays. The different numbers identify passages from the essays of the following contestants:
10: | Frederick W. Shurlock, Derby, England. |
18: | L. L. Whyte, Cambridge, England. |
24: | Prof. Moritz Schlick, University of Rostock, Germany. |
30: | C. E. Rose, M.E., Little Rock, Ark. |
33: | H. Gartelmann, Bremen, Germany. |
35: | Prof. Joseph S. Ames, Johns Hopkins University, Baltimore. |
47: | James O. G. Gibbons, East Orange. N. J. |
82: | Charles H. Burr, Philadelphia. |
101: | L. F. H. de Miffonis. B.A., C.E., Ottawa, Canada. |
102: | Charles A. Brunn, Kansas City.[__A_TAG_PLACEHOLDER_0__] |
106: | J. Elias Fries, Fellow A.I.E.E., Birmingham, Ala. |
114: | Dean W. P. Graham, Syracuse University, Syracuse, N. Y. |
115: | Rev. George Thomas Manley, London. |
116: | Prof. J. A. Schouten, Delft, Netherlands. |
121: | Elwyn F. Burrill, Berkeley, Cal. |
125: | Dorothy Burr, Bryn Mawr, Pa. |
130: | C. W. Kanolt, Bureau of Standards, Washington. |
135: | Robert Stevenson, New York. |
139: | Leopold Schorsch, New York. |
141: | Dr. M. C. Mott-Smith, Los Angeles, Calif. |
147: | Edward A. Clarke, Columbus, O. |
149: | Edward A. Partridge, Philadelphia. |
150: | Col. John Millis, U. S. A., Chicago. |
152: | George F. Marsteller, Detroit. |
156: | D. B. Hall, Cincinnati. |
165: | Francis Farquhar, York, Pa. |
178: | Dr. George de Bothezat, Dayton, O. |
179: | Professor A. E. Caswell, University of Oregon, Eugene, Ore. |
182: | C. E. Dimick, New London, Conn. |
186: | Earl R. Evans, Washington, D. C. |
188: | Norman E. Gilbert, Dartmouth College, Hanover, N. H. |
192: | A. d’Abro. New York. |
194: | L. M. Alexander, Cincinnati. |
197: | Kenneth W. Reed, East Cleveland, O. |
198: | Prof. E. N. da C. Andrade, Ordnance College, Woolwich, England. |
216: | Professor Andrew H. Patterson, University of North Carolina, Chapel Hill, N. C. |
220: | Prof. Arthur Gordon Webster, Clark College, Worcester, Mass. |
221: | Walter van B. Roberts, Princeton University, N. J. |
223: | Paul M. Batchelder, Austin, Tex. |
227: | Prof. R. W. Wood, Johns Hopkins University, Baltimore. |
229: | E. P. Fairbairn, M.C., B.Sc., Glasgow. |
231: | R. F. Deimel, Hoboken, N. J. |
232: | Lieut. W. Mark Angus, U. S. N., Philadelphia. |
235: | Edward Adams Richardson, Kansas City. |
263: | Prof. William Benjamin Smith, Tulane University, New Orleans. |
264: | James Rice, University of London, London. |
267: | William Hemmenway Pratt, Lynn, Mass. |
272: | R. Bruce Lindsay, New Bedford, Mass. |
283: | Frank E. Law, Montclair, N. J. |
In addition to the specific credit given by these references for specifically quoted passages, the Editor feels that he ought to acknowledge his general indebtedness to the competing essayists, collectively, for the many ideas which he has taken away from their text to clothe in his own words. This does not mean that the Editor has undertaken generally to improve upon the language of the competitors, but merely that the reading of all their essays has given him many ideas of such complex origin that he could not assign credit if he would. [xiii]
In addition to the specific credit given by these references for quoted passages, the Editor feels he should acknowledge his general indebtedness to the competing essayists for the many ideas he has taken from their text and rephrased in his own words. This doesn’t imply that the Editor has generally improved on the language of his competitors, but rather that reading all their essays has provided him with numerous ideas of such complex origin that he couldn’t assign credit even if he wanted to. [__A_TAG_PLACEHOLDER_0__]
Table of Contents
I.—The Einstein $5,000 Prize: How the Contest Came to be Held, and Some of the Details of Its Conduct. By the Editor 1
I.—The Einstein $5,000 Prize: How the Contest Happened and Some Details About How It Was Run. By the Editor 1
II.—The World—And Us: An Introductory Discussion of the Philosophy of Relativity, and of the Mechanism of our Contact with Time and Space. By various contributors and the Editor 19
II.—The World—And Us: An Introductory Discussion of the Philosophy of Relativity and How We Connect with Time and Space. By various contributors and the Editor 19
III.—The Relativity of Uniform Motion: Classical Ideas on the Subject; the Ether and the Apparent Possibility of Absolute Motion; the Michelson-Morley Experiment and the Final Negation of this possibility. By various contributors and the Editor 47
III.—The Relativity of Uniform Motion: Classical Ideas on the Topic; the Ether and the Apparent Possibility of Absolute Motion; the Michelson-Morley Experiment and the Final Rejection of this possibility. By various contributors and the Editor 47
IV.—The Special Theory of Relativity: What Einstein’s Study of Uniform Motion Tells Us About Time and Space and the Nature of the External Reality. By various contributors and the Editor 76
IV.—The Special Theory of Relativity: What Einstein’s Study of Uniform Motion Reveals About Time, Space, and the Nature of External Reality. By various contributors and the Editor 76
V.—That Parallel Postulate: Modern Geometric Methods; the Dividing Line Between Euclidean and Non-Euclidean; and the Significance of the Latter. By the Editor 111
V.—That Parallel Postulate: Modern Geometric Methods; the Divide Between Euclidean and Non-Euclidean; and the Importance of the Latter. By the Editor 111
VI.—The Space-Time Continuum: Minkowski’s World of Events, and the Way in Which It Fits Into Einstein’s Structure. By the Editor and a few contributors 141
VI.—The Space-Time Continuum: Minkowski’s World of Events, and How It Fits Into Einstein’s Framework. By the Editor and a few contributors 141
VII.—Relativity: The Winning Essay in the Contest for the Eugene Higgins $5,000 Prize. By Lyndon Bolton, British Patent Office, London 169
VII.—Relativity: The Winning Essay in the Contest for the Eugene Higgins $5,000 Prize. By Lyndon Bolton, British Patent Office, London 169
VIII.—The New Concepts of Time and Space: The Essay in Behalf of Which the Greatest Number of Dissenting Opinions Have Been Recorded. By Montgomery Francis, New York 181
VIII.—The New Concepts of Time and Space: The Essay for Which the Most Dissenting Opinions Have Been Recorded. By Montgomery Francis, New York 181
IX.—The Principle of Relativity: A Statement of What it is All About, in Ideas of One Syllable. By Hugh Elliot, Chislehurst, Kent, England 195
IX.—The Principle of Relativity: A Summary of What It's All About, in Simple Words. By Hugh Elliot, Chislehurst, Kent, England 195
X.—Space, Time and Gravitation: An Outline of Einstein’s Theory of General Relativity. By W. de Sitter, University of Leyden 206
X.—Space, Time and Gravitation: An Overview of Einstein’s Theory of General Relativity. By W. de Sitter, University of Leyden 206
XI.—The Principle of General Relativity: How Einstein, to a Degree Never Before Equalled, Isolates the External Reality from the Observer’s Contribution. By E. T. Bell, University of Seattle 218 [xiv]
XI.—The Principle of General Relativity: How Einstein, more than anyone else, separates external reality from the observer’s influence. By E. T. Bell, University of Seattle 218 [__A_TAG_PLACEHOLDER_0__]
XII.—Force Vs. Geometry: How Einstein Has Substituted the Second for the First in Connection with the Cause of Gravitation. By Saul Dushman, Schenectady 230
XII.—Force Vs. Geometry: How Einstein Replaced the First with the Second Regarding the Cause of Gravitation. By Saul Dushman, Schenectady 230
XIII.—An Introduction to Relativity: A Treatment in which the Mathematical Connections of Einstein’s Work are Brought Out More Strongly and More Successfully than Usual in a Popular Explanation. By Harold T. Davis, University of Wisconsin 240
XIII.—An Introduction to Relativity: A Treatment that Highlights the Mathematical Links in Einstein’s Work More Clearly and Effectively than Typical Popular Explanations. By Harold T. Davis, University of Wisconsin 240
XIV.—New Concepts for Old: What the World Looks Like After Einstein Has Had His Way with It. By John G. McHardy, Commander R. N., London 251
XIV.—New Concepts for Old: What the World Looks Like After Einstein Has Changed It. By John G. McHardy, Commander R. N., London 251
XV.—The New World: A Universe in Which Geometry Takes the Place of Physics, and Curvature that of Force. By George Frederick Hemens, M.C., B.Sc., London 265
XV.—The New World: A Universe Where Geometry Replaces Physics, and Curvature Replaces Force. By George Frederick Hemens, M.C., B.Sc., London 265
XVI.—The Quest of the Absolute: Modern Developments in Theoretical Physics, and the Climax Supplied by Einstein. By Dr. Francis D. Murnaghan, Johns Hopkins University, Baltimore 276
XVI.—The Quest of the Absolute: Recent Advances in Theoretical Physics and the Peak Contributed by Einstein. By Dr. Francis D. Murnaghan, Johns Hopkins University, Baltimore 276
XVII.—The Physical Side of Relativity: The Immediate Contacts between Einstein’s Theories and Current Physics and Astronomy. By Professor William H. Pickering, Harvard College Observatory, Mandeville, Jamaica 287
XVII.—The Physical Side of Relativity: The Direct Connections between Einstein’s Theories and Modern Physics and Astronomy. By Professor William H. Pickering, Harvard College Observatory, Mandeville, Jamaica 287
XVIII.—The Practical Significance of Relativity: The Best Discussion of the Special Theory Among All the Competing Essays. By Prof. Henry Norris Russell, Princeton University 306
XVIII.—The Practical Significance of Relativity: The Best Discussion of the Special Theory Among All the Competing Essays. By Prof. Henry Norris Russell, Princeton University 306
XIX.—Einstein’s Theory of Relativity: A Simple Explanation of His Postulates and Their Consequences. By T. Royds, Kodaikanal Observatory, India 318
XIX.—Einstein’s Theory of Relativity: A Simple Explanation of His Postulates and Their Consequences. By T. Royds, Kodaikanal Observatory, India 318
XX.—Einstein’s Theory of Gravitation: The Discussion of the General Theory and Its Most Important Application, from the Essay by Prof. W. F. G. Swann, University of Minnesota, Minneapolis 327
XX.—Einstein’s Theory of Gravitation: The Discussion of the General Theory and Its Most Important Application, from the Essay by Prof. W. F. G. Swann, University of Minnesota, Minneapolis 327
XXI.—The Equivalence Hypothesis: The Discussion of This, With Its Difficulties and the Manner in Which Einstein Has Resolved Them, from the Essay by Prof. E. N. da C. Andrade, Ordnance College, Woolwich, England 334
XXI.—The Equivalence Hypothesis: The Discussion of This, With Its Difficulties and the Manner in Which Einstein Has Resolved Them, from the Essay by Prof. E. N. da C. Andrade, Ordnance College, Woolwich, England 334
XXII.—The General Theory: Fragments of Particular Merit on This Phase of the Subject. By Various Contributors 338
XXII.—The General Theory: Highlights of Note on This Topic. By Various Authors 338
Table of Contents
[1]
[__A_TAG_PLACEHOLDER_0__]
I.
THE EINSTEIN $5,000 PRIZE
How the Contest Came to be Held, and Some of the Details of its Conduct
How the Contest Was Held and Some of the Details of Its Management
In January, 1909, an anonymous donor who was interested in the spread of correct scientific ideas offered through the Scientific American a prize of $500 for the best essay explaining, in simple non-technical language, that paradise of mathematicians and bugaboo of plain ordinary folk—the fourth dimension. Many essays were submitted in this competition, and in addition to that of the winner some twenty were adjudged worthy of ultimate publication. It was felt that the competition had added distinctly to the popular understanding of this significant subject; that it had done much to clear up popular misconception of just what the mathematician means when he talks of four or even more dimensions; and that it had therefore been as successful as it was unusual in character.
In January 1909, an anonymous donor interested in the spread of accurate scientific ideas offered a $500 prize through Scientific American for the best essay that explained, in plain simple language, that realm of mathematicians and the scary concept for everyday people—the fourth dimension. Many essays were submitted for this competition, and in addition to the winning entry, about twenty others were deemed worthy of publication. It was believed that the competition significantly improved popular understanding of this important topic; it helped clarify what mathematicians mean when they talk about four or more dimensions, making it as successful as it was unique.
In November, 1919, the world was startled by the announcement from London that examination of the photographs taken during the total solar eclipse of May 29th had been concluded, and that predictions based upon the Einstein theories of relativity had been verified. In the reaction from the long surfeit of war news an item of this sort was a thoroughly journalistic one. Long cable dispatches [2]were carried in the news columns all over the world; Einstein and his theories were given a prominent place on the front pages day after day; leading scientists in great number were called upon to tell the public through the reportorial medium just what the excitement was all about, just in what way the classical scientific structure had been overthrown.
In November 1919, the world was shocked by the announcement from London that the analysis of the photographs taken during the total solar eclipse on May 29th was complete and that predictions based on Einstein's theories of relativity had been confirmed. In a break from the usual war news, this kind of story was very much a journalistic highlight. Long cable reports [__A_TAG_PLACEHOLDER_0__] were featured in news columns around the world; Einstein and his theories consistently occupied prominent spots on front pages; numerous leading scientists were called upon to explain to the public through the media what the excitement was all about and how the classical scientific framework had been challenged.
Instead of being a mere nine days’ wonder, the Einstein theories held their place in the public mind. The more serious periodicals devoted space to them. First and last, a very notable group of scientific men attempted to explain to the general reader the scope and content of Einstein’s system. These efforts, well considered as they were, could be no more than partially successful on account of the very radical character of the revisions which the relativity doctrine demands in our fundamental concepts. Such revisions cannot be made in a day; the average person has not the viewpoint of the mathematician which permits a sudden and complete exchange of one set of fundamentals for another. But the whole subject had caught the popular attention so strongly, that even complete initial failure to discover what it was all about did not discourage the general reader from pursuing the matter with determination to come to some understanding of what had happened to Newton and Newtonian mechanics.
Instead of being just a fleeting craze, Einstein's theories remained prominent in the public consciousness. Serious publications dedicated space to them. A notable group of scientists tried to explain the scope and content of Einstein's system to everyday readers. Despite their well-thought-out efforts, they could only achieve partial success due to the radical changes the relativity concept requires in our fundamental ideas. Such shifts can't happen overnight; the average person lacks the mathematician's perspective that allows for a swift and complete replacement of one set of fundamentals with another. However, the topic captured public interest so strongly that even initial confusion about what it was all about didn't deter the average reader from trying to understand what had happened to Newton and Newtonian mechanics.
The Donor and the Award
In May, 1920, Mr. Eugene Higgins, an American citizen long resident in Paris, a liberal patron of the arts and sciences, and a lifelong friend of the Scientific [3]American and its proprietors, suggested that the success of the Fourth Dimension Prize Contest of 1910 had been so great that it might be desirable to offer another prize in similar fashion for the best popular essay on the Einstein theories. He stated that if in the opinion of the Scientific American these theories were of sufficient importance, and the probability of getting a good number of meritorious essays were sufficiently great, and the public need and desire for enlightenment were sufficiently present, he would feel inclined to offer such a prize, leaving the conduct of the contest to the Scientific American as in the former event. It was the judgment of the editors of the Scientific American that all these provisos should be met with an affirmative, and that Mr. Higgins accordingly could with propriety be encouraged to offer the prize.
In May 1920, Mr. Eugene Higgins, an American who had lived in Paris for a long time, a supportive patron of the arts and sciences, and a longtime friend of the Scientific American and its owners, suggested that because the Fourth Dimension Prize Contest of 1910 had been so successful, it would be a good idea to offer another prize for the best popular essay on Einstein's theories. He mentioned that if the Scientific American believed these theories were important enough, and if there was a good chance of receiving several quality essays, along with a genuine public interest in understanding them, he would be inclined to offer such a prize, letting the Scientific American manage the contest as before. The editors of the Scientific American concluded that all these conditions were likely to be met positively, and thus, Mr. Higgins could rightfully be encouraged to offer the prize.
In his preliminary letter Mr. Higgins had suggested that in view of the apparent greater importance of the subject to be proposed for discussion by the contestants of 1920, the prize offered should probably be more liberal than in the former instance. This view met with the approval of the editors as well; but they were totally unprepared for the receipt, late in June, of a cablegram from Mr. Higgins stating that he had decided to go ahead with the matter, and that he was forwarding a draft for $5,000 to represent the amount of the prize. Such a sum, exceeding any award open to a professional man with the single exception of the Nobel Prize, for which he cannot specifically compete, fairly took the breath of the Editors, and made it immediately [4]clear that the contest would attract the widest attention, and that it should score the most conspicuous success. It also made it clear that the handling of the contest would be a more serious matter than had been anticipated.
In his initial letter, Mr. Higgins suggested that since the topic for discussion by the contestants of 1920 seemed to be more significant, the prize offered should probably be more generous than before. The editors agreed with this idea, but they were completely unprepared for the cablegram they received from Mr. Higgins in late June, stating that he had decided to move forward with the plan and was sending a draft for $5,000 to represent the prize amount. This amount, greater than any award available to a professional, except for the Nobel Prize, for which he cannot specifically compete, took the editors by surprise and immediately made it clear that the contest would draw significant attention and should be a major success. It also underscored that organizing the contest would be more serious than they had expected.
In spite of the fact that it would not for some time be possible to announce the identity of the Judges, it was felt that the prospective contestants should have every opportunity for extensive preparation; so the contest was announced, and the rules governing it printed as far as they could be determined on such short shrift, in the Scientific American for July 10, 1920. Several points of ambiguity had to be cleared up after this initial publication. In particular, it had been Mr. Higgins’ suggestion that in the very probable event of the Judges’ inability to agree upon the winning essay, the prize might, at their discretion, be divided between the contributors of the best two essays. This condition was actually printed in the first announcement, but the Post Office Department insisted upon its withdrawal, on the ground that with it in force the contestant would not know whether he were competing for $5,000 or for $2,500, and that this would introduce the “element of chance” which alone was necessary, under the Federal statutes, to make the contest a lottery. So this provision was replaced by one to the effect that in the event the Judges were not able to agree, the Einstein Editor should cast the deciding vote between the essays respectively favored by them.
Even though it wouldn't be possible to announce the identity of the Judges for some time, it was important for the potential contestants to have every opportunity for thorough preparation. So, the contest was announced, and the rules were printed in the Scientific American on July 10, 1920, as far as they could be determined on such short notice. A few points of confusion needed to be clarified after this initial publication. Specifically, Mr. Higgins suggested that if the Judges couldn't agree on the winning essay, they could choose to split the prize between the writers of the two best essays. This condition was included in the first announcement, but the Post Office Department insisted it be removed, arguing that if it remained, contestants wouldn't know if they were competing for $5,000 or $2,500, introducing an "element of chance" that could turn the contest into a lottery under federal law. So, this rule was changed to state that if the Judges couldn't come to an agreement, the Einstein Editor would cast the deciding vote for the essays they favored.
The announcement attracted the widest attention, and was copied in newspapers and magazines all over the world. Inquiries poured in from all quarters, [5]and the Einstein Editor found it almost impossible to keep himself supplied with proofs of the conditions and rules to mail in response to these inquiries. It was immediately clear that there was going to be a large number of essays submitted, and that many distinguished names would be listed among the competitors.
The announcement grabbed a lot of attention and was shared in newspapers and magazines around the globe. Inquiries flooded in from everywhere, [__A_TAG_PLACEHOLDER_0__]and the Einstein Editor found it nearly impossible to keep up with the requests for the conditions and rules to send back in response to these inquiries. It quickly became obvious that a significant number of essays would be submitted, and that many notable names would be among the competitors.
The Judges
In the Scientific American for September 18, announcement was carried in the following words:
In the Scientific American for September 18, the announcement was made in the following words:
“We are assured with complete certainty that the competition for the five-thousand-dollar prize will be very keen, and that many essays will be submitted which, if they bore the names of their authors, would pass anywhere as authoritative statements. The judges will confront a task of extraordinary difficulty in the effort to determine which of these efforts is the best; and we believe the difficulties are such that multiplication of judges would merely multiply the obstacles to an agreement. It is altogether likely that the initial impressions of two or three or five judges would incline toward two or three or five essays, and that any final decision would be attainable only after much consultation and discussion. It seems to us that by making the committee as small as possible while still preserving the necessary feature that its decision represent a consensus, we shall simplify both the mental and the physical problem of coming to an agreement. We believe that the award should if possible represent a unanimous decision, without any minority report, and that such a requirement [6]is far more likely to be met among two men than among three or five. At the same time, the bringing together of two men and the details of general administration of their work together are far simpler than if there were three or five. So we have finally decided to have but two judges, and in this we have the endorsement of all the competent opinion that we have consulted.
“We are completely confident that the competition for the five-thousand-dollar prize will be intense, and that many essays will be submitted which, if they included the authors' names, would be seen as authoritative works. The judges will face an extremely challenging task in trying to decide which of these submissions is the best; and we think the challenges are such that adding more judges would only increase the difficulties of reaching a consensus. It’s likely that the initial opinions of two, three, or five judges would lean toward two, three, or five different essays, and any final decision would probably be reached only after much discussion and deliberation. We believe that by keeping the committee as small as possible while still ensuring that its decision reflects a consensus, we can simplify both the mental and logistical aspects of reaching an agreement. We think the award should ideally represent a unanimous decision, without any minority reports, and that this requirement is much more likely to be fulfilled by two judges than by three or five. Moreover, coordinating the work between two people and managing the general administration is much simpler than if there were three or five. Therefore, we have decided to have only two judges, and we have the support of all the qualified opinions we've consulted.”
“The gentlemen who have consented to act as Judges are Professors Leigh Page and Edwin Plimpton Adams, of the departments of physics of Yale and Princeton Universities, respectively. Both are of the younger generation of physicists that has paid special attention to those phases of mathematics and physics involved in the Einstein theories, and both have paid special attention to these theories themselves. We are gratified to be able to put forward as Judges two men so eminently qualified to act. We feel that we may here appropriately quote Professor Page, who says in his acceptance: ‘As the large prize offers a great inducement, I had thought of entering the contest. However I realize that not many people in this country have made a considerable study of Einstein’s theory, and if all who have should enter the contest, it would be difficult to secure suitable Judges.’ Without any desire to put the gentleman in the position of pleading for himself, we think this suggests very well the extent to which the Scientific American, the contestants, and the public at large, are indebted to Professors Page and Adams for their willingness to serve in the difficult capacity of Judges.”
“The gentlemen who have agreed to serve as Judges are Professors Leigh Page and Edwin Plimpton Adams, from the physics departments of Yale and Princeton Universities, respectively. Both belong to the younger generation of physicists who have focused on the mathematical and physical concepts related to Einstein's theories, and they have dedicated considerable attention to these theories themselves. We are pleased to present two such highly qualified individuals as Judges. It’s fitting to quote Professor Page, who mentioned in his acceptance: ‘Since the large prize provides a strong incentive, I considered participating in the contest. However, I recognize that not many people in this country have studied Einstein’s theory in depth, and if everyone who has were to enter, it would be challenging to find suitable Judges.’ Without intending to put him in a position to advocate for himself, we believe this effectively highlights how much the Scientific American, the contestants, and the public owe to Professors Page and Adams for their willingness to take on the challenging role of Judges.”
It might appropriately have been added to this [7]announcement that it was altogether to the credit of science and the scientific spirit that the first two gentlemen approached with the invitation to act as Judges were willing to forego their prospects as contestants in order thus to contribute to the success of the contest.
It could have been properly mentioned in this [__A_TAG_PLACEHOLDER_0__]announcement that it truly showed the integrity of science and the scientific spirit that the first two individuals who came forward with the invitation to serve as judges were willing to give up their chances to compete in order to help ensure the contest's success.
3,000 Words
Of the conditions, the one which evoked most comment was that stating the word limit. This limit was decided upon after the most careful discussion of the possibilities of the situation. It was not imagined for a moment that any contestant would succeed in getting within 3,000 words a complete discussion of all aspects of the Special and the General Theories of Relativity. It was however felt that for popular reading a single essay should not be much if any longer than this. Moreover, I will say quite frankly that we should never have encouraged Mr. Higgins to offer such a prize if we had supposed that the winning essay was the only thing of value that would come from the contest, or if we had not expected to find in many of the other essays material which would be altogether deserving of the light. From the beginning we had in view the present volume, and the severe restriction in length was deliberately imposed for the purpose of forcing every contestant to stick to what he considered the most significant viewpoints, and to give his best skill to displaying the theories of Einstein to the utmost advantage from these viewpoints. We felt that divergent viewpoints would [8]be more advantageously treated in this manner than if we gave each contestant enough space to discuss the subject from all sides; and that the award of the prize to the essay which, among other requirements, seemed to the Judges to embody the best choice of material, would greatly simplify the working of the contest without effecting any injustice against those contestants who displayed with equal skill less happily chosen material. Perhaps on this point I may again quote with profit the editorial page of the Scientific American:
Of all the conditions, the one that got the most attention was the word limit. This limit was established after careful consideration of the situation's potential. It was never assumed that any contestant could fully cover all aspects of the Special and General Theories of Relativity in under 3,000 words. However, it was thought that for general readers, a single essay shouldn’t be much longer than this. Additionally, I must be honest: we wouldn’t have encouraged Mr. Higgins to offer such a prize if we believed that only the winning essay would be valuable or if we hadn’t expected to find substantial content in many other essays worthy of attention. From the start, we envisioned this volume, and the strict length restriction was intentionally set to compel each contestant to focus on what they felt were the most important viewpoints, showcasing the theories of Einstein as effectively as possible from those angles. We believed that differing viewpoints would be better presented this way than if we allowed each contestant enough space to explore the topic from every angle; and that awarding the prize to the essay which, among other criteria, seemed to the judges to have the best selection of material would significantly streamline the contest without unfairly disadvantaging those contestants who, while equally skilled, chose less fortunate material. Perhaps on this matter, I can again quote usefully from the editorial page of the Sci American:
“An essay of three thousand words is not long enough to lose a reader more than once; if it does lose him it is a failure, and if it doesn’t it is a competitor that will go into the final elimination trials for the prize. If we can present, as a result of the contest, six or a dozen essays of this length that will not lose the lay reader at all, we shall have produced something amply worth the expenditure of Mr. Higgins’ money and our time. For such a number of essays of such character will of necessity present many different aspects of the Einstein theories, and in many different ways, and in doing so will contribute greatly to the popular enlightenment.
“An essay of three thousand words isn’t long enough to lose a reader more than once; if it loses them, it’s a failure, and if it doesn’t, it’s a contender that will go into the final elimination rounds for the prize. If we can present, as a result of the contest, six or a dozen essays of this length that will keep the general reader engaged, we will have created something well worth the investment of Mr. Higgins’ money and our time. A collection of essays like this will inevitably showcase many different perspectives on Einstein’s theories, and in various ways, contributing significantly to the public’s understanding.
“Really the significant part of what has already appeared is not the part that is intelligible, but rather the part that, being unintelligible, casts the shadow of doubt and suspicion on the whole. The successful competitor for the prize and his close contestants will have written essays that, without any claim to completeness, will emphasize what seems to each author the big outstanding feature; and every one of them will be intelligible. Together they will [9]in all probability be reasonably complete, and will retain the individual characteristic of intelligibility. They will approach the various parts of the field from various directions—we could fill this page with suggestions as to how the one item of the four-dimensional character of Einstein’s time-space might be set forth for the general reader. And when a man must say in three thousand words as much as he can of what eminent scientists have said in whole volumes—well, the result in some cases will be sheer failure, and in others a product of the first water. The best of the essays will shine through intelligent selection of what is to be said, and brilliant success in saying it. It is to get a group of essays of this character, not to get the single essay which will earn the palm, that the prize is offered.”
“Honestly, the important part of what’s already been shared isn’t the understandable parts, but the parts that are unclear, which cast doubt and suspicion on everything. The winner of the prize and his close rivals will have written essays that, while not claiming to be complete, will highlight what each author sees as the main point; and all of them will be clear. Together, they will [__A_TAG_PLACEHOLDER_0__] likely be fairly complete and will keep the unique quality of clarity. They will approach different aspects of the topic from various angles—we could easily fill this page with ideas on how to explain the four-dimensional nature of Einstein’s time-space for a general audience. And when someone has to convey in three thousand words what notable scientists have said in entire books—well, sometimes it will be a total failure, and at other times, it will be outstanding. The best essays will stand out through smart choices of what to include and skillful expression. The prize is intended to gather a collection of essays like this, rather than just to reward one essay that stands out.”
The Competition Essays
At all times after the first announcement the Einstein Editor had a heavy correspondence; but the first real evidence that the contest was under way came with the arrival of the first essay, which wandered into our office in the middle of September. About a week later they began to filter in at the rate of one or two per day—mostly from foreign contestants who were taking no chances on the mails. Heavy returns did not commence until about ten days before the closing date. The great avalanche, however, was reserved for the morning of Monday, November 1st. Here we had the benefit of three days’ mail; there were about 120 essays. Among those which were thrown out on the ground of lateness [10]the honors should no doubt go to the man who mailed his offering in The Hague on October 31st.
At all times after the initial announcement, the Einstein Editor was bogged down with correspondence; however, the first real sign that the contest was officially underway came when the first essay arrived at our office in mid-September. About a week later, submissions started coming in at a rate of one or two each day—mostly from foreign contestants who weren’t taking any chances with the mail. Significant submissions didn’t begin until about ten days before the deadline. The major influx, though, came on the morning of Monday, November 1st. We benefited from three days’ worth of mail, totaling around 120 essays. Among those disqualified for being late [__A_TAG_PLACEHOLDER_0__], the honor likely goes to the person who mailed in his submission from The Hague on October 31st.
Essays were received in greater quantity from Germany than from any other foreign country, doubtless because of the staggering value of $5,000 when converted into marks at late 1920 rates. England stood next on the list; and one or more essays were received from Austria, Czechoslovakia, Jugoslavia, France, Switzerland, the Netherlands, Denmark, Italy, Chile, Cuba, Mexico, India, Jamaica, South Africa and the Fiji Islands. Canada, of course, contributed her fair share; and few of our own states were missing on the roll-call.
Essays came in larger numbers from Germany than from any other foreign country, likely because $5,000 had an impressive value when converted into marks at the late 1920 rates. England was next on the list, and one or more essays were submitted from Austria, Czechoslovakia, Yugoslavia, France, Switzerland, the Netherlands, Denmark, Italy, Chile, Cuba, Mexico, India, Jamaica, South Africa, and the Fiji Islands. Canada, of course, did its part, and few of our own states were absent from the roll call.
The general level of English composition among the essays from non-English-speaking sources was about what might have been expected. A man may have a thorough utilitarian knowledge of a foreign tongue, but when he attempts intensive literary competition with a man who was brought up in that tongue he is at a disadvantage. We read French and German with ease and Spanish and Italian without too much difficulty, ourselves; we should never undertake serious writing in any of these languages. Not many of the foreign contributions, of course, were as ludicrous as the one we quote to some extent in our concluding chapter, but most of them were distinctly below par as literary compositions. Drs. De Sitter and Schlick were the notable exceptions to this; both showed the ability to compete on a footing of absolute equality with the best of the native product.
The overall quality of English writing in essays from non-English-speaking sources was pretty much what we expected. A person can have a solid practical knowledge of a foreign language, but when trying to compete in serious literary work against someone who grew up speaking that language, they are at a disadvantage. We can read French and German easily and Spanish and Italian without too much trouble ourselves; however, we would never attempt serious writing in any of these languages. Not many of the foreign contributions were as ridiculous as the one we reference to some extent in our concluding chapter, but most of them were definitely below average as literary pieces. Drs. De Sitter and Schlick were the notable exceptions; both showed they could compete on equal footing with the best of the native works.
We dare say it was a foregone conclusion that [11]many essays should have been over the limit, and that a few should have been over it to the point of absurdity. The winning essay contains 2,919 words, plus or minus a reasonable allowance for error in counting; that it should come so far from being on the ragged edge should be sufficient answer to those who protested against the severity of the limitation. One inquirer, by the way, wanted to know if 3,000 words was not a misprint for 30,000. Another contestant suggested that instead of disqualifying any essay that was over the line, we amputate the superfluous words at the end. This was a plausible enough suggestion, since any essay able to compete after such amputation must necessarily have been one of extreme worth; but fortunately we did not have to decide whether we should follow the scheme. Perhaps twenty of the essays submitted were so seriously in excess of the limit that it was not even necessary to count their words in detail; most of these offenders ran to 3,500 words or thereabouts, and one—a good one, too, from which we use a good deal of material in this volume—actually had 4,700. On the other extreme were a few competitors who seemed to think that the shortest essay was necessarily the best, and who tried to dismiss the subject with 500 or 1,000 words.
We can confidently say that many essays were definitely over the limit, and a few were absurdly so. The winning essay has 2,919 words, give or take a bit for counting errors; the fact that it was so far from the limit should effectively address those who complained about the strictness of the word count. One person even asked if 3,000 words was a typo for 30,000. Another contestant suggested that instead of disqualifying any essay that exceeded the limit, we should just cut off the extra words at the end. This suggestion made sense, since any essay able to compete after such a cut would have to be of high quality; however, we were fortunate not to have to decide whether to implement that idea. Around twenty of the submitted essays were so far over the limit that we didn't even need to count the words in detail; most of these offenders were about 3,500 words long, and one—quite a good one, from which we drew a lot of material for this volume—actually had 4,700 words. On the opposite end, there were a few competitors who seemed to think that the shortest essay was automatically the best and attempted to tackle the topic in just 500 or 1,000 words.
By a curious trick of chance there were submitted in competition for the prize exactly 300 essays. Of course a few of these did not require serious consideration—this is inevitable in a contest of such magnitude. But after excluding all the essays that were admittedly not about the Einstein theories at all, and all those whose English was so execrable [12]as to make them quite out of the question, and all those which took the subject so lightly as not to write reasonably close to the limit of 3,000 words, and all those which were given over to explanation of the manner in which Einstein’s theories verify those of the writer, and all those in which the writer attempted to substitute his own cosmic scheme for Einstein’s—after all this, there remained some 275 essays which were serious efforts to explain in simple terms the nature and content and consequences of Special and General Relativity.
By a strange twist of fate, exactly 300 essays were submitted for the prize competition. Naturally, a few of these didn't warrant serious consideration—this is to be expected in a contest of this scale. However, after excluding all the essays that clearly didn’t relate to Einstein’s theories, and those with such poor English [__A_TAG_PLACEHOLDER_0__]that they were completely unmanageable, and all the ones that took the topic too lightly and didn’t come close to the 3,000-word limit, and those that focused on explaining how Einstein’s theories support the writer’s own views, as well as those where the writer tried to replace Einstein’s concepts with their own cosmic ideas—after all that, there were about 275 essays left that genuinely attempted to explain in simple terms the nature, content, and implications of Special and General Relativity.
Searching for the Winner
The Einstein Editor was in sufficiently close touch with the details of the adjudication of the essays to have every realization of the difficulty of this work. The caliber of the essays submitted was on the whole high. There were many which would have been well worthy of the prize in the absence of others that were distinctly better—many which it was not possible to eliminate on the ground of specific faults, and which could only be adjudged “not the best” by detailed comparison with specific other essays. It was this detailed comparison which took time, and which so delayed the award that we were not able to publish the winning essay any sooner than February 5th. Especially difficult was this process of elimination after the number of surviving essays had been reduced to twenty or less. The advantages of plan possessed by one essay had to be weighed against those of execution exhibited in another. A certain essay had to be critically compared [13]with another so like it in plan that the two might have been written from a common outline, and at the same time with a third as unlike it in scope and content as day and night. And all the time there was present in the background the consciousness that a prize of $5,000 hung upon the decision to be reached. For anyone who regards this as an easy task we have no worse wish than that he may some day have to attack a similar one.
The Einstein Editor was closely involved with the details of judging the essays and fully understood how challenging this work was. Overall, the quality of the submitted essays was high. Many of them would have deserved the prize if not for others that were clearly better—plenty that couldn’t be eliminated due to specific faults and could only be deemed "not the best" through detailed comparisons with other essays. This thorough comparison took time, which delayed the announcement of the winning essay until February 5th. The process of elimination became especially difficult once the number of remaining essays dropped to twenty or fewer. We had to weigh the strengths of one essay’s plan against the execution shown in another. One essay had to be critically compared [__A_TAG_PLACEHOLDER_0__] with another that was so similar in structure that they might have come from a shared outline, while also comparing it with a third that was as different in scope and content as night and day. Throughout this process, we were always aware that a $5,000 prize was on the line with the decision we had to make. For anyone who thinks this is an easy task, we can only hope that they one day have to take on something similar.
We had anticipated that the bulk of the superior essays would be among those received during the last day or two of the contest; for we felt that the men best equipped to attack the subject would be the most impressed with its seriousness. Here we were quite off the track. The seventeen essays which withstood most stubbornly the Judges’ efforts at elimination were, in order of receipt, numbers 8, 18, 28, 40, 41, 43, 92, 95, 97, 130, 181, 194, 198, 223, 267, 270, 275: a fairly even distribution. The winner was the 92nd essay received.
We expected that most of the best essays would come in during the last day or two of the contest because we thought that the people most prepared to tackle the topic would be the ones who took it most seriously. We were completely wrong. The seventeen essays that resisted the Judges’ attempts to eliminate them the longest, in the order they were received, were numbers 8, 18, 28, 40, 41, 43, 92, 95, 97, 130, 181, 194, 198, 223, 267, 270, 275: a pretty even spread. The winner was the 92nd essay received.
The Judges held their final meeting in the editorial office on January 18, 1921. The four essays which were before the committee at the start of the session were speedily cut to three, and then to two; and after an all-day session the Judges found themselves conscientiously able to agree on one of these as the best. This unanimity was especially gratifying, the more so since it by no means was to be confidently expected, on a priori grounds, that it would be possible of attainment. Even the Einstein Editor, who might have been called upon for a final decision but wasn’t, can hardly be classed as a dissenter; for with some slight mental reservations in favor of the essay by [14]Mr. Francis which did not enter the Judges’ final discussion at all, and which he rather suspects appeals more to his personal taste than to his soundest judgment, he is entirely in accord with the verdict rendered.
The Judges had their final meeting in the editorial office on January 18, 1921. The four essays that were presented at the beginning of the session were quickly narrowed down to three, and then to two; after a full day of discussion, the Judges were able to agree on one of these as the best. This agreement was particularly satisfying, especially since it wasn't something that could be confidently anticipated from the outset. Even the Einstein Editor, who could have been called on for a final decision but wasn’t, can hardly be considered a dissenting voice; because despite some slight personal preferences for the essay by [__A_TAG_PLACEHOLDER_0__] Mr. Francis, which didn’t even make it to the Judges’ final discussions and which he suspects he prefers more for personal reasons than sound judgment, he completely supports the decision made.
The fact that the prize went to England was no surprise to those acquainted with the history of Einstein’s theories. The Special Theory, promulgated fifteen years ago, received its fair share of attention from mathematicians all over the world, and is doubtless as well known and as fully appreciated here as elsewhere. But it has never been elevated to a position of any great importance in mathematical theory, simply because of itself, in the absence of its extension to the general case, it deserves little importance. It is merely an interesting bit of abstract speculation.
The fact that the award went to England wasn’t surprising to anyone who knows the background of Einstein’s theories. The Special Theory, introduced fifteen years ago, got a lot of attention from mathematicians around the globe and is certainly just as well known and appreciated here as anywhere else. However, it has never really been considered of great significance in mathematical theory because, on its own, without being expanded to the general case, it doesn’t hold much weight. It’s just an intriguing piece of abstract speculation.
The General Theory was put out by Einstein in finished form during the war. Owing to the scientific moratorium, his paper, and hence a clear understanding of the new methods and results and of the sweeping consequences if the General Theory should prevail, did not attain general circulation outside Germany until some time in 1918 or even later. Had it not been for Eddington it is doubtful that the British astronomers would have realized that the eclipse expeditions were of particular consequence. Therefore at the time of these expeditions, and even as late as the November announcement of the findings, the general body of scientific men in America had not adequately realized the immense distinction between the Special and the General Theories, had not adequately appreciated that the latter led to [15]distinctive consequences of any import, and we fear in many cases had not even realized explicitly that the deflection of light and the behavior of Mercury were matters strictly of the General and in no sense of the Special Theory. Certainly when the American newspapers were searching frantically for somebody to interpret to their public the great stir made by the British announcement that Einstein’s predictions had been verified, they found no one to do this decently; nor were our magazines much more successful in spite of the greater time they had to devote to the search. In a word, there is not the slightest room for doubt that American science was in large measure caught asleep at the switch—perhaps for no reason within its control; and that American writers were in no such favorable case to write convincingly on the subject as were their British and continental contemporaries.
The General Theory was published by Einstein in its final form during the war. Because of the scientific moratorium, his paper, and therefore a clear understanding of the new methods and results and the significant consequences if the General Theory was accepted, didn’t gain widespread attention outside Germany until sometime in 1918 or even later. If it hadn't been for Eddington, it’s doubtful that British astronomers would have recognized the importance of the eclipse expeditions. So, at the time of these expeditions, and even as late as the November announcement of the findings, many scientists in America didn’t fully grasp the huge difference between the Special and General Theories. They didn’t appreciate that the latter led to [__A_TAG_PLACEHOLDER_0__]distinctive and significant consequences, and we fear that in many cases they didn’t even realize that the deflection of light and the behavior of Mercury were strictly matters of the General Theory and not at all related to the Special Theory. When American newspapers were desperately looking for someone to explain to their audience the excitement caused by the British announcement that Einstein’s predictions had been confirmed, they couldn’t find anyone to do so adequately; nor were our magazines much more successful, despite having more time to search. In short, there’s no doubt that American science was largely caught off guard—perhaps for reasons beyond its control; and American writers were not in as good a position to write convincingly on the topic as their British and continental contemporaries.
So it was quite in accord with what might have been expected to find, on opening the identifying envelopes, that not alone the winning essay, but its two most immediate rivals, come from members of that school of British thought which had been in contact with the Einstein theories in their entirety for two years longer than the average American of equal competence. This riper familiarity with the subject was bound to yield riper fruit. Indeed, had it not been for the handicap of writing in a strange language, it is reasonable to assume that the scientists of Germany would have made a showing superior to that of either Americans or British—and for the same reason that Britain showed to better advantage than America. [16]
It was exactly what you would expect to find that, upon opening the identifying envelopes, not only the winning essay but also its two closest competitors came from members of that school of British thought that had engaged with the Einstein theories fully for two years longer than the average American with similar skills. This deeper familiarity with the topic was sure to produce better results. In fact, if it hadn't been for the challenge of writing in a foreign language, it's reasonable to think that German scientists would have outperformed both American and British competitors—just as Britain performed better than America for the same reason. [__A_TAG_PLACEHOLDER_0__]
The Prize Winner
Mr. Bolton, the winner of the big prize, we suppose may fairly be referred to as unknown in a strict scientific sense. Indeed, at the time of the publication of his essay in the Scientific American nothing could be learned about him on the American side of the water beyond the bare facts that he was not a young man, and that he had for a good many years occupied a position of rank in the British Patent Office. (It will be recalled that Einstein himself was in the Swiss Patent Office for some time.) In response to the request of the Scientific American for a brief biographical sketch that would serve to introduce him better to our readers, Mr. Bolton supplied such a concise and apparently such a characteristic statement that we can do no better than quote it verbatim.
Mr. Bolton, the winner of the big prize, might fairly be called unknown in a strict scientific sense. In fact, when his essay was published in the SciAm, there was little information available about him on this side of the ocean, other than the basic facts that he wasn't a young man and that he had held a high-ranking position in the British Patent Office for several years. (It’s worth noting that Einstein himself spent some time in the Swiss Patent Office.) In response to the Scientific American's request for a brief biographical sketch to better introduce him to our readers, Mr. Bolton provided such a concise and apparently characteristic statement that we can do no better than quote it verbatim.
“I was born in Dublin in 1860, but I have lived in England since 1869. My family belonged to the landed gentry class, but I owe nothing to wealth or position. I was in fact put through school and college on an income which a workman would despise nowadays. After attending sundry small schools, I entered Clifton College in 1873. My career there was checkered, but it ended well. I was always fairly good at natural science and very fond of all sorts of mechanical things. I was an honest worker but no use at classics, and as I did practically nothing else for the first four years at Clifton, I came to consider myself something of a dunce. But a big public school is a little world. Everyone gets an opportunity, often seemingly by accident, and it is [17]up to him to take it. Mine did not come till I was nearly 17. As I was intended for the engineering profession, I was sent to the military side of the school in order to learn some mathematics, at which subject I was then considered very weak. This was certainly true, as at that time I barely knew how to solve a quadratic, I was only about halfway through the third book of Euclid, and I knew no trigonometry. But the teaching was inspiring, and I took readily to mathematics. One day it came out that I had been making quite a good start with the differential calculus on my own without telling anybody. After that all was well. I left Clifton in 1880 with a School Exhibition and a mathematical scholarship at Clare College, Cambridge.
“I was born in Dublin in 1860, but I’ve lived in England since 1869. My family was part of the landed gentry, but I don’t owe anything to wealth or status. I actually went through school and college on an income that a worker today would look down on. After attending several small schools, I entered Clifton College in 1873. My experience there was mixed, but it ended positively. I was always decent at natural science and really enjoyed all kinds of mechanical things. I was a hard worker but didn’t excel at classics, and since I focused almost exclusively on that for the first four years at Clifton, I started to think of myself as somewhat of a dummy. But a big public school is like a little world. Everyone gets a chance, often seemingly by chance, and it’s [__A_TAG_PLACEHOLDER_0__]up to them to seize it. Mine didn’t come until I was nearly 17. Since I was meant to go into engineering, I was sent to the military side of the school to learn some mathematics, a subject I was considered very weak in at that time. This was definitely true, as I barely knew how to solve a quadratic, I was only about halfway through the third book of Euclid, and I didn’t know any trigonometry. But the teaching was inspiring, and I quickly took to mathematics. One day, it came out that I had been making quite a good start with differential calculus on my own without telling anyone. After that, everything went well. I left Clifton in 1880 with a School Exhibition and a mathematical scholarship at Clare College, Cambridge.”
“After taking my degree in 1883 as a Wrangler, I taught science and mathematics at Wellington College, but I was attracted by what I had heard of the Patent Office and I entered it through open competition in 1885. During my official career I have been one of the Comptroller’s private secretaries and I am now a Senior Examiner. During the war I was attached to the Inventions Department of the Ministry of Munitions, where my work related mainly to anti-aircraft gunnery. I have contributed, and am still contributing to official publications on the subject.
“After graduating as a Wrangler in 1883, I taught science and math at Wellington College, but I was drawn to what I heard about the Patent Office, so I entered it through open competition in 1885. Throughout my career, I've served as one of the Comptroller’s private secretaries and I'm now a Senior Examiner. During the war, I was assigned to the Inventions Department of the Ministry of Munitions, where my work mainly focused on anti-aircraft gunnery. I've contributed, and continue to contribute, to official publications on the subject.”
“I have written a fair number of essays on various subjects, even on literature, but my only extra-official publications relate to stereoscopic photography. I read a paper on this subject before the Royal Photographic Society in 1903 which was favorably noticed by Dr. von Rohr of Messrs. Zeiss of Jena. [18]I have also written in the Amateur Photographer.
“I've written quite a few essays on different topics, including literature, but my only unofficial publications are about stereoscopic photography. I presented a paper on this topic at the Royal Photographic Society in 1903, which was positively acknowledged by Dr. von Rohr from Zeiss in Jena. [__A_TAG_PLACEHOLDER_0__] I've also contributed to the Amateur Photographer.
“I have been fairly successful at athletics, and I am a member of the Leander Club.”
“I have done quite well in athletics, and I’m a member of the Leander Club.”
That Mr. Bolton did not take the prize through default of serious competition should be plain to any reader who examines the text from competing essays which is to be found in this volume. The reference list of these competitors, too, supplemented by the names that appear at the heads of complete essays, shows a notable array of distinguished personalities, and I could mention perhaps a dozen more very well known men of science whose excellent essays have seemed a trifle too advanced for our immediate use, but to whom I am under a good deal of obligation for some of the ideas which I have attempted to clothe in my own language.
That Mr. Bolton didn’t win the prize due to a lack of serious competition should be clear to anyone who looks at the competing essays included in this volume. The list of these competitors, along with the names at the beginning of the complete essays, showcases a remarkable group of distinguished individuals. I could also name about a dozen more well-known scientists whose outstanding essays might have felt a bit too advanced for our current purposes, but I owe a lot to them for some of the ideas I’ve tried to express in my own words.
Before leaving the subject, we wish to say here a word of appreciation for the manner in which the Judges have discharged their duties. The reader will have difficulty in realizing what it means to read such a number of essays on such a subject. We were fortunate beyond all expectation in finding Judges who combined a thorough scientific grasp of the mathematical and physical and philosophical aspects of the matter with an extremely human viewpoint which precluded any possibility of an award to an essay that was not properly a popular discussion, and with a willingness to go to meet each other’s opinions that is rare, even among those with less ground for confidence in their own views than is possessed by Drs. Page and Adams. [19]
Before wrapping up this topic, we want to express our appreciation for how the Judges have handled their responsibilities. It’s hard for the reader to grasp what it’s like to go through so many essays on such a complex subject. We were incredibly fortunate to find Judges who not only had a deep understanding of the scientific, mathematical, and philosophical aspects of the topic but also approached it with a very human perspective. This perspective ensured that no award would be given to an essay that didn't truly engage the general audience. Their openness to consider each other’s views is rare, even among those who might be less confident in their opinions than Drs. Page and Adams. [__A_TAG_PLACEHOLDER_0__]
II.
THE WORLD—AND US
An Introductory Discussion of the Philosophy of Relativity, and of the Mechanism of Our Contact with Time and Space
A Beginner's Guide to the Philosophy of Relativity and Our Interaction with Time and Space
From a time beyond the dawn of history, mankind has been seeking to explain the universe. At first the effort did not concern itself further probably than to make a supposition as to what were the causes of the various phenomena presented to the senses. As knowledge increased, first by observation and later by experiment also, the ideas as to these causes passed progressively through three stages—the theological (the causes were thought to be spirits or gods); the metaphysical (the causes were thought in this secondary or intermediate stage to be some inherent, animating, energizing principles); and the scientific (the causes were finally thought of as simply mechanical, chemical, and magneto-electrical attractions and repulsions, qualities or characteristics of matter itself, or of the thing of which matter is itself composed.)
From a time long before recorded history, humanity has been trying to understand the universe. Initially, this effort likely focused on making guesses about the reasons behind the various phenomena we experience. As knowledge grew through observation and later experimentation, ideas about these causes evolved through three stages: the theological (where causes were believed to be spirits or gods), the metaphysical (where causes were thought of as inherent, animating, energizing principles), and the scientific (where causes were ultimately seen as simple mechanical, chemical, and magneto-electrical forces, characteristics of matter itself, or of what matter is composed of).
With increase of knowledge, and along with the inquiry as to the nature of causes, there arose an inquiry also as to what reality was. What was the essential nature of the stuff of which the universe was made, what was matter, what were things in themselves, what were the noumena (the realities), [20]lying back of the phenomena (the appearances)? Gradually ideas explaining motion, force, and energy were developed. At the same time inquiry was made as to the nature of man, the working of his mind, the nature of thought, the relation of his concepts (ideas) to his perceptions (knowledge gained through the sense) and the relations of both to the noumena (realities).]283
With the growth of knowledge and the exploration of the nature of causes, questions about what reality really is also emerged. What is the fundamental nature of the material that makes up the universe? What is matter? What are things in themselves? What are the noumena (the realities) [__A_TAG_PLACEHOLDER_0__] that underlie the phenomena (the appearances)? Gradually, ideas explaining motion, force, and energy were developed. At the same time, questions were raised about the nature of humanity, the function of the mind, the nature of thought, and how our concepts (ideas) relate to our perceptions (knowledge acquired through the senses) and how both relate to the noumena (realities).]283
[The general direction taken by this inquiry has been that of a conflict between two schools of thought which we may characterize as those of absolutism and of relativism.]* [The ancient Greek philosophers believed that they could tap a source of knowledge pure and absolute by sitting down in a chair and reasoning about the nature of time and space, and the mechanism of the physical world.]221 [They maintained that the mind holds in its own right certain concepts than which nothing is more fundamental. They considered it proper to conceive of time and space and matter and the other things presented to their senses by the world as having a real existence in the mind, regardless of whether any external reality could be identified with the concept as ultimately put forth. They could even dispute with significance the qualities which were to be ascribed to this abstract conceptual time and space and matter. All this was done without reference to the external reality, often in defiance of that reality. The mind could picture the world as it ought to be; if the recalcitrant facts refused to fit into the picture, so much the worse for them. We all have heard the tale of how generation after generation of Greek philosophers disputed learnedly why and how it was [21]that a live fish could be added to a brimming pail of water without raising the level of the fluid or increasing the weight; until one day some common person conceived the troublesome idea of trying it out experimentally to learn whether it were so—and found that it was not. True or false, the anecdote admirably illustrates the subordinate place which the externals held in the absolutist system of Greek thought.]*
[The general direction taken by this inquiry has been that of a conflict between two schools of thought, which we can describe as absolutism and relativism.]* [The ancient Greek philosophers believed that by simply sitting down and thinking about the nature of time, space, and the workings of the physical world, they could tap into a pure and absolute source of knowledge.]221 [They argued that the mind inherently possesses certain concepts that are more fundamental than anything else. They thought it was appropriate to view time, space, matter, and other things perceived through their senses as having real existence in the mind, no matter if any external reality could ultimately be matched with those concepts. They could even meaningfully debate the qualities ascribed to this abstract conceptualization of time, space, and matter. All of this was done without considering external reality, often against it. The mind could envision the world as it should be; if the stubborn facts didn't align with that vision, that was their problem. We all know the story of how generation after generation of Greek philosophers earnestly debated why and how it was [__A_TAG_PLACEHOLDER_0__] possible to add a live fish to a full bucket of water without raising the water level or adding weight; until one day, a regular person thought to test it out experimentally to see if it were true—and discovered that it wasn't. True or false, this anecdote wonderfully illustrates the secondary role that external factors played in the absolutist system of Greek thought.]*
[Under this system a single observer is competent to examine a single phenomenon, and to write down the absolute law of nature by referring the results to his innate ideas of absolute qualities and states. The root of the word absolute signifies “taking away,” and in its philosophical sense the word implies the ability of the mind to subtract away the properties or qualities from things, and to consider these abstract qualities detached from the things; for example, to take away the coldness from ice, and to consider pure or abstract coldness apart from anything that is cold; or to take away motion from a moving body, and to consider pure motion apart from anything that moves. This assumed power is based upon the Socratic theory of innate ideas. According to this theory the mind is endowed by nature with the absolute ideas of hardness, coldness, roundness, equality, motion, and all other absolute qualities and states, and so does not have to learn them. Thus a Socratic philosopher could discuss pure or absolute being, absolute space and absolute time.]121
[Under this system, one observer is capable of examining a single phenomenon and writing down the absolute law of nature by linking the results to their innate ideas of absolute qualities and states. The root of the word absolute means “taking away,” and in a philosophical context, it suggests the mind's ability to remove the properties or qualities from things and to view these abstract qualities independently from their objects; for instance, to separate coldness from ice and consider pure or abstract coldness apart from anything cold, or to detach motion from a moving body and think about pure motion without regard to anything that moves. This assumed power is based on the Socratic theory of innate ideas. According to this theory, the mind is naturally equipped with absolute ideas of hardness, coldness, roundness, equality, motion, and all other absolute qualities and states, meaning it doesn't need to learn them. Thus, a Socratic philosopher could discuss pure or absolute being, absolute space, and absolute time.]121
Moving Away from Greek Concepts
[This Greek mode of thought persisted into the [22]late Middle Ages, at which time it was still altogether in order to dispose of a troublesome fact of the external world by quoting Aristotle against it. During the Renaissance, which intellectually at least marks the transition from ancient to modern, there came into being another type of absolutism, equally extreme, equally arbitrary, equally unjustified. The revolt against the mental slavery to Greek ideas carried the pendulum too far to the other side, and early modern science as a consequence is disfigured by what we must now recognize as gross materialism. The human mind was relegated to the position of a mere innocent bystander. The external reality was everything, and aside from his function as a recorder the observer did not in the least matter. The whole aim of science was to isolate and classify the elusive external fact. The rôle of the observer was in every possible way minimized. It was of course his duty to get the facts right, but so far as any contribution to these was concerned he did not count—he was definitely disqualified. He really played the part of an intruder; from his position outside the phenomena he was searching for the absolute truth about these phenomena. The only difference between his viewpoint and that of Aristotle was that the latter looked entirely inside himself for the elusive “truth,” while the “classical” scientist, as we call him now, looked for it entirely outside himself.
[This Greek way of thinking continued into the [__A_TAG_PLACEHOLDER_0__]late Middle Ages, when it was still typical to dismiss a troublesome fact about the external world by quoting Aristotle. During the Renaissance, which marks the intellectual shift from ancient to modern, another form of absolutism emerged—equally extreme, arbitrary, and unjustified. The reaction against the mental bondage to Greek ideas swung the pendulum too far the other way, resulting in early modern science being marred by what we now recognize as blatant materialism. The human mind was reduced to the role of a mere bystander. External reality was everything, and aside from being a recorder, the observer didn’t really matter. The goal of science became to isolate and classify elusive external facts. The observer's role was minimized in every possible way. It was certainly his responsibility to get the facts right, but he didn’t count in terms of any contribution—he was definitively disqualified. He essentially acted as an intruder; from his outside position, he was searching for the absolute truth about these phenomena. The only difference between his viewpoint and Aristotle’s was that while Aristotle searched entirely within himself for the elusive “truth,” the “classical” scientist, as we call him now, looked for it entirely outside himself.]
Let me illustrate the difference between the two viewpoints which I have discussed, and the third one which I am about to outline, by another concrete instance. The Greeks, and the medievals as well, were [23]fond of discussing a question which embodies the whole of what I have been saying. This question involved, on the part of one who attempted to answer it, a choice between the observer and the external world as the seat of reality. It was put in many forms; a familiar one is the following: “If the wind blew down a great tree at a time and place where there was no conscious being to hear, would there be any noise?” The Greek had to answer this question in the negative because to him the noise was entirely a phenomenon of the listener. The classical scientist had to answer it in the affirmative because to him the noise was entirely a phenomenon of the tree and the air and the ground. Today we answer it in the negative, but for a very different reason from that which swayed the Greek. We believe that the noise is a joint phenomenon of the observer and the externals, so that in the absence of either it must fail to take existence. We believe there are sound waves produced, and all that; but what of it? There is no noise in the presence of the falling tree and the absence of the observer, any more than there would be in the presence of the observer and the absence of the tree and the wind; the noise, a joint phenomenon of the observer and the externals, exists only in their joint presence.
Let me show you the difference between the two perspectives I've discussed and the third one I'm about to explain, using another clear example. The Greeks, and medieval thinkers too, were [__A_TAG_PLACEHOLDER_0__]keen on debating a question that encapsulates everything I've been saying. This question required anyone attempting to answer it to choose between the observer and the external world as the source of reality. It was phrased in many ways; a common one is: “If the wind knocked down a large tree in a place and at a time where there was no one around to hear it, would it make any noise?” The Greek would have to respond negatively because, for him, noise was solely a phenomenon of the listener. The classical scientist would answer affirmatively because, for him, noise was a phenomenon of the tree, the air, and the ground. Today, we answer negatively, but for a very different reason from that which influenced the Greek. We believe that noise is a joint phenomenon of the observer and the external world, so without either, it cannot exist. We acknowledge that sound waves are produced and all that; but so what? There's no noise when the tree falls and the observer is absent, just as there would be none in the presence of the observer and the absence of the tree and wind; the noise, a joint phenomenon of the observer and the external world, only exists when both are present together.
Relativism and Reality
This is the viewpoint of relativism. The statue is golden for one observer and silver to the other. The sun is rising here and setting in another part of the world. It is raining here and clear in Chicago. The [24]observer in Delft hears the bombardment of Antwerp and the observer in London does not. If they were to be consistent, both the Greek and the medieval-modern absolutist would have to dispute whether the statue were “really” golden or silver, whether the sun were “really” rising or setting, whether the weather were “really” fair or foul, whether the bombardment were “really” accompanied by loud noises or not; and on each of these questions they would have to come to an agreement or confess their methods inadequate. But to the relativist the answer is simple—whether this or that be true depends upon the observer. In simple cases we understand this full well, as we have always realized it. In less simple cases we recognize it less easily or not at all, so that some of our thought is absolutist in its tendencies while the rest is relativistic. Einstein is the first ever to realize this fully—or if not this, then the first ever to realize it so fully as to be moved toward a studied effort to free human thought from the mixture of relativism and absolutism and make it consistently the one or the other.
This is the perspective of relativism. The statue looks gold to one person and silver to another. The sun is rising here while it's setting in another part of the world. It's raining here, but it's clear in Chicago. The [__A_TAG_PLACEHOLDER_0__]observer in Delft hears the bombardment in Antwerp, while the observer in London does not. If they were to be consistent, both the Greek and the medieval-modern absolutists would have to argue about whether the statue is “really” gold or silver, whether the sun is “really” rising or setting, whether the weather is “really” nice or bad, and whether the bombardment is “really” loud or not; and on each of these points, they would need to come to an agreement or admit their methods are lacking. But to the relativist, the answer is straightforward—whether something is true or not depends on the observer. In simple situations, we understand this clearly, as we have always known it. In more complex situations, we recognize it less easily or not at all, leading some of our thinking to lean towards absolutism while the rest is relativistic. Einstein is the first to fully grasp this—or at least the first to understand it to a degree that drives him to make a concerted effort to disentangle human thought from the mix of relativism and absolutism, aiming to make it consistently one or the other.
This brings it about that the observed fact occupies a position of unexpected significance. For when we discuss matters of physical science under a strictly relativistic philosophy, we must put away as metaphysical everything that smacks of a “reality” partly concealed behind our observations. We must focus attention upon the reports of our senses and of the instruments that supplement them. These observations, which join our perceptions to their external objects, afford us our only objective manifestations; them we must accept as final—subject always to such [25]correction as more refined observations may suggest. The question whether a “true” length or area or mass or velocity or duration or temperature exists back of the numerical determination, or in the presence of a determination that is subject to correction, or in the absence of any determination at all, is a metaphysical one and one that the physicist must not ask. Length, area, mass, velocity, duration, temperature—none of these has any meaning other than the number obtained by measurement.]* [If several different determinations are checked over and no error can be found in any of them, the fault must lie not with the observers but with the object, which we must conclude presents different values to different observers.]33
This means that the fact we observe holds unexpected significance. When we talk about physical science through a strictly relativistic lens, we need to set aside anything metaphysical that suggests a “reality” hidden behind our observations. We should concentrate on the reports from our senses and the tools that enhance them. These observations, connecting our perceptions to their external objects, provide our only objective evidence; we must accept them as final—always open to correction by more precise observations. The question of whether a “true” length, area, mass, velocity, duration, or temperature exists behind the numerical values, in the presence of a value subject to correction, or without any value at all, is a metaphysical question that physicists should not pursue. Length, area, mass, velocity, duration, temperature—none of these mean anything beyond the measurement number obtained. If multiple determinations are checked and no errors are found, the issue must not lie with the observers but with the object itself, indicating it presents different values to different observers.33
[We are after all accustomed to this viewpoint; we do not demand that Pittsburgh shall present the same distance from New York and from Philadelphia, or that the New Yorker and the Philadelphian come to any agreement as to the “real” distance of Pittsburgh. The distance of Pittsburgh depends upon the position of the observer. Nor do we demand that the man who locates the magnetic pole in one spot in 1900 and in another in 1921 come to a decision as to where it “really” is; we accept his statement that its position depends upon the time of the observation.
[We are used to this perspective; we don’t expect Pittsburgh to be the same distance from New York as it is from Philadelphia, or for New Yorkers and Philadelphians to agree on the “real” distance to Pittsburgh. The distance to Pittsburgh depends on where you're observing from. Similarly, we don’t expect someone who pinpoints the magnetic pole in one location in 1900 and in a different one in 1921 to determine where it “really” is; we accept that its position varies with the time of the observation.]
What this really means is that the distance to Pittsburgh and the position of the magnetic pole are joint properties of the observer and the observed—relations between them, as we might put it. This is obvious enough in the case of the distance of Pittsburgh; it is hardly so obvious in the case of the [26]position of the magnetic pole, varying with the lapse of time. But if we reflect that the observation of 1900 and that of 1921 were both valid, and both represented the true position of the pole for the observer of the date in question, we must see that this is the only explanation that shows us the way out.
What this really means is that the distance to Pittsburgh and the location of the magnetic pole are shared characteristics of both the observer and the observed—relations between them, as we might say. This is pretty clear in terms of the distance to Pittsburgh; it’s not as obvious when it comes to the [__A_TAG_PLACEHOLDER_0__]position of the magnetic pole, which changes over time. However, if we consider that both the observation from 1900 and the one from 1921 were valid and accurately reflected the true position of the pole for the observer at that time, we must recognize that this is the only explanation that provides clarity.
I do not wish to speak too definitely of the Einstein theories in these introductory remarks, and so shall refrain from mentioning explicitly in this place the situation which they bring up and upon which what I have just said has direct bearing. It will be recognized when it arises. What must be pointed out here, however, is that we are putting the thing which the scientist calls the “observed value” on a footing of vastly greater consequence than we should have been willing offhand to concede to it. So far as any single observer is concerned, his own best observed values are themselves the external world; he cannot properly go behind the conditions surrounding his observations and speak of a real external world beyond these observations. Any world which he may think of as so existing is purely a conceptual world, one which for some reason he infers to exist behind the deceptive observations. Provided he makes this reservation he is quite privileged to speculate about this concealed world, to bestow upon it any characteristics that he pleases; but it can have no real existence for him until he becomes able to observe it. The only reality he knows is the one he can directly observe.
I don’t want to go too in-depth about Einstein’s theories in these opening comments, so I’ll avoid directly addressing the situation they introduce, which is closely related to what I just mentioned. It will become clear when the time comes. What I need to highlight here is that we are treating what scientists call the “observed value” as far more significant than we might have initially accepted. For any individual observer, their best observed values are the external world itself; they can't properly consider the conditions surrounding their observations and talk about a real external world beyond those observations. Any world they imagine as existing is purely conceptual, one that they infer exists behind misleading observations. As long as they keep this in mind, they’re free to speculate about this hidden world and assign it any characteristics they like; however, it won’t hold any real existence for them until they can actually observe it. The only reality they know is the one they can see directly.
Laws of Nature
The observations which we have been discussing, [27]and which we have been trying to endow with characteristics of “reality” which they are frequently not realized to possess, are the raw material of physical science. The finished product is the result of bringing together a large number of these observations.]* [The whole underlying thought behind the making of observations, in fact, is to correlate as many as possible of them, to obtain some generalization, and finally to express this in some simple mathematical form. This formulation is then called a “law of nature.”]35
The observations we've been discussing, [__A_TAG_PLACEHOLDER_0__]and that we often try to associate with the features of “reality” that they don't always have, are the basic elements of physical science. The end result comes from combining a large number of these observations. The main idea behind making observations is to connect as many of them as possible to find some general pattern, and ultimately to express this in a straightforward mathematical way. This expression is then referred to as a “law of nature.”35
[Much confusion exists because of a misunderstanding in the lay mind of what is meant by a “law of nature.” It is perhaps not a well chosen term. One is accustomed to associate the word law with the idea of necessity or compulsion. In the realm of nature the term carries no such meaning. The laws of nature are man’s imperfect attempts to explain natural phenomena; they are not inherent in matter and the universe, not an iron bar of necessity running through worlds, systems and suns. Laws of nature are little more than working hypotheses, subject to change or alteration or enlargement or even abandonment, as man’s vision widens and deepens. No sanctity attaches to them, and if any one, or all, of them fail to account for any part, or all, of the phenomena of the universe, then it or they must be supplemented or abandoned.]102
[There's a lot of confusion because people often misunderstand what is meant by a “law of nature.” The term might not be the best choice. People typically associate the word law with necessity or compulsion. However, in nature, that term doesn't carry the same meaning. The laws of nature are simply humanity's imperfect attempts to explain natural phenomena; they aren’t built into matter and the universe, nor are they a strict necessity running through worlds, systems, and suns. Laws of nature are essentially just working hypotheses that can change, be modified, expanded, or even discarded as our understanding improves. There’s no inherent authority to them, and if any one or all of them fail to explain some or all of the phenomena in the universe, then they need to be updated or replaced.]102
[The test of one of these laws is that it can be shown to include all the related phenomena hitherto known and that it enables us to predict new phenomena which can then be verified. If new facts are discovered that are not in agreement with one of [28]these generalized statements, the assumptions on which the latter is based are examined, those which are not in accordance with the new facts are given up, and the statement is modified so as to include the new facts.]10 [And if one remembers that the laws of physics were formerly based on a range of observations much narrower than at present available, it seems natural that in the light of this widening knowledge one law or another may be seen to be narrow and insufficient. New theories and laws do not necessarily disprove old ones, but explain certain discrepancies in them and penetrate more deeply into their underlying principles, thereby broadening our ideas of the universe. To follow the new reasoning we must rid ourselves of the prejudice behind the old, not because it is wrong but because it is insufficient. The universe will not be distorted to fit our rules, but will teach us the rules of existence.]125
[The test of one of these laws is that it can show all the related phenomena known so far and that it allows us to predict new phenomena that can be verified later. If new facts are found that do not align with one of [__A_TAG_PLACEHOLDER_0__]these generalized statements, the assumptions behind the statement are examined, the ones that don't agree with the new facts are discarded, and the statement is adjusted to include the new facts.]10 [And if we remember that the laws of physics were once based on a much narrower range of observations than what we have today, it seems natural that, with this expanding knowledge, one law or another might be seen as limited and inadequate. New theories and laws don’t necessarily invalidate old ones but rather clarify certain discrepancies in them and delve deeper into their underlying principles, thus expanding our understanding of the universe. To embrace the new reasoning, we need to let go of the biases from the old, not because they are wrong, but because they are not enough. The universe will not bend to fit our rules but will instead teach us the rules of existence.]125
[Always, however, we must guard against the too easy error of attributing to these rules anything like absolute truth.]* [The modern scientist has attained a very business-like point of view toward his “laws of nature.” To him a law is fundamentally nothing but a short-hand way of expressing the results of a large number of experiments in a single statement. And it is important to remember that this mere shortening of the description of a lot of diverse occurrences is by no means any real explanation of how and why they happened. In other words, the aim of science is not ultimately to explain but only to discover the relations that hold good among physical quantities and to embody all these relations in as few and as simple physical laws as possible.]221 [This is [29]inherently the method of relativism.]* [Under it a set of phenomena is observed. There are two or many observers, and they write down their several findings. These are reviewed by a final observer or judge, who strains out the bias due to the different viewpoints of the original observers. He then writes down, not any absolute law of nature governing the observed phenomena, but a law as general as possible expressing their interrelations.]121 [And through this procedure modern science and philosophy reveal with increasing emphasis that we superimpose our human qualities on external nature to such an extent that]106 [we have at once the strongest practical justification, in addition to the arguments of reason, for our insistence that the contact between objective and subjective represented by the observation is the only thing which we shall ever be able to recognize as real. We may indulge in abstract metaphysical speculation to our heart’s content, if we be metaphysically inclined; we may not attempt to impose the dicta of metaphysics upon the physical scientist.]*
[However, we must always be careful not to mistakenly assume that these rules represent anything like absolute truth.]* [Today’s scientist has adopted a very pragmatic approach to their “laws of nature.” To them, a law is essentially just a shorthand way of summarizing the results of many experiments in one statement. It’s important to remember that this simplification of a variety of events doesn’t actually provide any real explanation of how or why these events occur. In other words, the goal of science is not ultimately to explain but simply to uncover the relationships that exist among physical quantities and to express all these relationships in as few and as simple physical laws as possible.]221 [This is [__A_TAG_PLACEHOLDER_0__]inherently the method of relativism.]* [In this approach, a set of phenomena is observed. There are two or more observers who document their individual findings. These findings are then reviewed by a final observer or judge, who filters out the biases caused by the different perspectives of the original observers. They then write down not an absolute law of nature governing the observed phenomena, but the most general law possible that expresses their interconnections.]121 [Through this process, modern science and philosophy increasingly reveal that we impose our human characteristics onto the external world to such a degree that]106 [we have the strongest practical justification, along with logical arguments, for insisting that the interaction between the objective and subjective represented by observation is the only thing we can ever truly recognize as real. We can freely engage in abstract metaphysical speculation if we wish; however, we must not try to impose the conclusions of metaphysics on the physical scientist.]*
Ideas and Facts
[From the inquiry and criticism which have gone on for centuries has emerged the following present-day attitude of mind toward the sum total of our knowledge. The conceptual universe in our minds in some mysterious way parallels the real universe, but is totally unlike it. Our conceptions (ideas) of matter, molecules, atoms, corpuscles, electrons, the ether, motion, force, energy, space, and time stand in the same or similar relation to reality as the x’s and y’s [30]of the mathematician do to the entities of his problem. Matter, molecules, atoms, corpuscles, electrons, the ether, motion, force, energy, space, and time do not exist actually and really as we conceive them, nor do they have actually and really the qualities and characteristics with which we endow them. The concepts are simply representations of things outside ourselves; things which, while real, have an essential nature not known to us. Matter, molecules, atoms, corpuscles, electrons, the ether, motion, force, energy, space and time are merely devices, symbols, which enable us to reason about reality. They are parts of a conceptual mechanism in our minds which operates, or enables our minds to operate, in the same sequence of events as the sequence of phenomena in the external universe, so that when we perceive by our senses a group of phenomena in the external universe, we can reason out what result will flow from the interaction of the realities involved, and thus predict what the situation will be at a given stage in the sequence.
[From the inquiry and criticism that have taken place for centuries has come the following modern mindset towards the totality of our knowledge. The conceptual universe in our minds somehow mirrors the real universe, yet is completely different from it. Our ideas about matter, molecules, atoms, corpuscles, electrons, the ether, motion, force, energy, space, and time relate to reality much like the x's and y's [__A_TAG_PLACEHOLDER_0__] in mathematics relate to the entities in a problem. Matter, molecules, atoms, corpuscles, electrons, the ether, motion, force, energy, space, and time don’t actually exist as we imagine them, nor do they possess the qualities and characteristics we attribute to them. These concepts are just representations of things outside ourselves; things that, while real, have a fundamental nature that we do not understand. Matter, molecules, atoms, corpuscles, electrons, the ether, motion, force, energy, space, and time are simply tools, symbols that help us think about reality. They are components of a conceptual mechanism in our minds that operates, or allows our minds to function, in the same order as the sequence of events in the external universe. This way, when we perceive a set of phenomena in the external world through our senses, we can deduce the results of interactions between the realities involved, and thus anticipate what the situation will be at a specific moment in the sequence.]
But while our conceptual universe has thus a mechanical aspect, we do not regard the real universe as mechanical in its nature.]283 [This may be illustrated by a little story. Entering his friend’s house, a gentleman is seized unawares from behind. He turns his head but sees nothing. His hat and coat are removed and deposited in their proper places by some invisible agent, seats and tables and refreshments appear in due time where they are required, all without any apparent cause. The visitor shivers with horror and asks his host for an explanation. He is then told that the ideas “order” and “regularity” [31]are at work, and that it is they who acquit themselves so well of their tasks. These ideas cannot be seen nor felt nor seized nor weighed; they reveal their existence only by their thoughtful care for the welfare of mankind. I think the guest, coming home, will relate that his friend’s house is haunted. The ghosts may be kind, benevolent, even useful; yet ghosts they are. Now in Newtonian mechanics, absolute space and absolute time and force and inertia and all the other apparatus, altogether imperceptible, appearing only at the proper time to make possible a proper building up of the theory, play the same mysterious part as the ideas “order” and “regularity” in my story. Classical mechanics is haunted.]116
But while our conceptual universe has a mechanical side, we don't see the real universe as being mechanical in its essence.283 [This can be illustrated with a little story. One day, a man enters his friend's house and is unexpectedly grabbed from behind. He turns his head but sees nothing. An invisible force takes off his hat and coat and places them where they belong, and soon enough, chairs, tables, and snacks appear just where they’re needed, all with no visible cause. The visitor shivers in dread and asks his host for an explanation. He’s then told that the concepts of "order" and "regularity" [__A_TAG_PLACEHOLDER_0__] are doing their job well. These concepts can’t be seen, felt, grasped, or weighed; they only show their existence through their careful consideration for human welfare. I think when the guest goes home, he’ll say that his friend's house is haunted. The ghosts might be kind, generous, or even helpful, but they’re still ghosts. In Newtonian mechanics, absolute space, absolute time, force, inertia, and all the other tools, which are all imperceptible and show up only when needed to properly construct the theory, play the same mysterious role as the concepts of "order" and "regularity" in my story. Classical mechanics is haunted.116
[As a matter of fact, we realize this and do not allow ourselves to be imposed upon with regard to the true nature of these agencies.]* [We use a mechanistic terminology and a mechanistic mode of reasoning only because we have found by experience that they facilitate our reasoning. They are the tools which we find produce results. They are adapted to our minds, but perhaps it would be better to say that our minds are so constructed as to render our conceptual universe necessarily mechanical in its aspect in order that our minds may reason at all. Two things antithetic are involved—subject (our perceiving mind which builds up concepts) and object (the external reality); and having neither complete nor absolute knowledge of either, we cannot affirm which is more truly to be said to be mechanistic in its nature, though we may suspect that really neither is. We no longer think of cause and [32]effect as dictated by inherent necessity, we simply regard them as sequences in the routine of our sense-impressions of phenomena. In a word, we have at length grasped the idea that our notions of reality, at present at least, whatever they may become ultimately, are not absolute, but simply relative. We see, too, that we do not explain the universe, but only describe our perceptions of its contents.
[Actually, we understand this and don't let ourselves be misled about the true nature of these agencies.]* [We use a mechanical terminology and a mechanical way of thinking only because we've learned from experience that they help our reasoning. They are the tools that give us results. They fit our way of thinking, but maybe it's better to say our minds are structured in a way that makes our conceptual world necessarily mechanical for us to reason at all. Two opposing elements are involved—subject (our perceiving mind that creates concepts) and object (the external reality); and since we don't have complete or absolute knowledge of either, we can't say which is more genuinely mechanistic in nature, though we might suspect that really neither is. We no longer see cause and [__A_TAG_PLACEHOLDER_0__]effect as determined by inherent necessity; we simply see them as sequences in the routine of our sensory experiences of phenomena. In short, we have finally understood that our ideas of reality, at least for now, whatever they may ultimately become, are not absolute, but simply relative. We also realize that we do not explain the universe, but only describe our perceptions of its contents.
The so-called laws of nature are simply statements of formulæ which resume or sum up the relationships and sequences of phenomena. Our effort is constantly to find formulæ which will describe the widest possible range of phenomena. As our knowledge increases, that is, as we perceive new phenomena, our laws or formulæ break down, that is, they fail to afford a description in brief terms of all of our perceptions. It is not that the old laws are untrue, but simply that they are not comprehensive enough to include all of our perceptions. The old laws are often particular or limiting instances of the new laws.]283
The so-called laws of nature are just statements of formulas that summarize the relationships and patterns of phenomena. We constantly strive to find formulas that can describe the widest range of phenomena possible. As our knowledge grows, meaning as we notice new phenomena, our laws or formulas fail to fully explain all of our observations. It's not that the old laws are false, but rather that they aren't broad enough to account for everything we see. The old laws are often specific or limited cases of the new laws.283
[From what we have said of the reality of observations it follows that we must support that school of psychology, and the parallel school of philosophy, which hold that concepts originate in perceptions. But this does not impose so strong a restriction upon conceptions as might appear. The elements of all our concepts do come to us from outside; we manufacture nothing out of whole cloth. But when perception has supplied a sufficient volume of raw material, we may group its elements in ways foreign to actual occurrence in the perceptual world, and in so doing get [33]conceptual results so entirely different from what we have consciously perceived that we are strongly tempted to look upon them as having certainly been manufactured in our minds without reference to the externals. Of even more significance is our ability to abstract from concrete objects and concrete incidents the essential features which make them alike and different. But unlike the Greeks, we see that our concept of coldness is not something with which we were endowed from the beginning, but merely an abstraction from concrete experiences with concrete objects that have been cold.
[Based on what we’ve discussed about the reality of observations, it follows that we should support the school of psychology and the parallel school of philosophy that believe concepts come from perceptions. However, this doesn't impose as strict a limitation on our concepts as it might seem. The components of all our concepts do come from the outside; we create nothing from scratch. But once perception has provided enough raw material, we can organize its elements in ways that don’t actually occur in the perceptual world, leading to conceptual results that can be so different from what we have consciously perceived that we’re strongly tempted to think they were entirely created in our minds without any reference to the external world. Even more importantly, we can abstract essential features from concrete objects and incidents that highlight their similarities and differences. Unlike the Greeks, we recognize that our concept of coldness isn’t something we were born with; it's simply an abstraction drawn from actual experiences with cold objects.
The Concepts of Space and Time
When we have formed the abstract ideas of coldness and warmth, and have had experience indicating that the occurrence of these properties varies in degree, we are in a position to form the secondary abstract notion covered by the word “temperature.” When we have formed the abstract ideas of size and position and separation, we are similarly in a position to form a secondary abstraction to which we give the name “space.” Not quite so easy to trace to its definite source but none the less clearly an abstraction based on experience, is our idea of what we call “time.” None of us are deceived as to the reality of these abstractions.]* [We do not regard space as real in the sense that we regard a chair as real; it is merely an abstract idea convenient for the location of material objects like the chair.]198 [Nor do we regard time as real in this sense. Things occupy space, events occupy time; space and time themselves we realize are immaterial and unreal; [34]space does not exist and time does not happen in the same sense that material objects exist and events occur. But we find it absolutely necessary to have, among the mental machinery mentioned above as the apparatus by aid of which we keep track of the external world, these vessels for that world to exist in and move in.
Once we have developed the basic concepts of coldness and warmth, and have experienced how these qualities vary in intensity, we can create a secondary abstract idea that we call “temperature.” Similarly, after understanding the concepts of size, position, and separation, we can form another secondary abstraction known as “space.” Our understanding of “time” is a bit harder to pinpoint, but it is still clearly an abstraction based on our experiences. None of us are fooled about the reality of these abstractions. We don’t see space as real in the same way we see a chair as real; it’s just an abstract idea that helps us place material objects like chairs. Nor do we see time as real in that sense. Things occupy space, and events take place in time; we understand that space and time themselves are immaterial and unreal. Space doesn’t exist, and time doesn’t pass in the same way that physical objects exist and events happen. However, we find it essential to have these concepts as part of our mental framework, which helps us navigate and understand the external world.
Space and time, then, are concepts.]* [It is not strange, however, that when confronted with the vast and bewildering complexity of the universe and the difficulty of keeping separate and distinct in our minds our perceptions and conceptions, we should at times and as respects certain things project our conceptions illegitimately into the perpetual universe and mistake them for perceptions. The most notable example perhaps of this projection has occurred in the very case of space and time, most fundamental of all of our concepts. We got to think of these as absolute, as independent of each other and of all other things, and as always existing and continuing to exist whether or not we or anything else existed—space as a three-dimensional, uniform continuum, having the same properties in all directions; time as a one-dimensional, irreversible continuum, flowing in one direction. It is difficult to get back to the idea that space and time so described and defined are concepts merely, for the idea of their absolute existence is ingrained in us as the result probably of long ancestral experience.]283
Space and time are, essentially, concepts. It’s not surprising, though, that when faced with the vast and confusing complexity of the universe and the challenge of keeping our perceptions and ideas distinct in our minds, we sometimes mistakenly project our ideas onto the ongoing universe and confuse them with actual perceptions. One of the clearest examples of this projection is with space and time, which are our most basic concepts. We've come to view them as absolute, independent from each other and everything else, and as always existing, even if we or anything else didn’t exist—space as a three-dimensional, uniform continuum with the same properties in every direction, and time as a one-dimensional, irreversible continuum that flows in one direction. It's hard to return to the understanding that space and time, as we've described and defined them, are just concepts because the notion of their absolute existence is deeply embedded in us, likely due to long ancestral experiences.283
[Newton’s definitions of course represent the classical idea of time and space. He tells us that “absolute, true and mathematical time flows in virtue of its own nature, uniformly and without reference [35]to any external object;” and that “absolute space, by virtue of its own nature and without reference to any external object, always remains the same and is immovable.” Of course from modern standpoints it is absurd to call either of these pronouncements a definition; but they represent about as well as any words can the ideas which Newton had about time and space, and they make it clear enough that he regarded both as having real existence in the external world.
[Newton’s definitions definitely reflect the classic view of time and space. He states that “absolute, true, and mathematical time flows according to its own nature, uniformly and without reference [__A_TAG_PLACEHOLDER_0__]to any external object;” and that “absolute space, by its own nature and without reference to any external object, always remains the same and is unchanging.” Of course, from a modern perspective, it seems ridiculous to call either of these statements a definition; but they express as well as any words can the concepts Newton had about time and space, and they clearly indicate that he saw both as having a real existence in the external world.]
If space and time are to be the vessels of our universe, and if the only thing that really matters is measured results, it is plain enough that we must have, from the very beginning, means of measuring space and time. Whether we believe space and time to have real existence or not, it is obvious that we can measure neither directly. We shall have to measure space by measuring from one material object to another; we shall have to measure time by some similar convention based on events. We shall later have something further to say about the measurement of time; for the present we need only point out that]* [Newtonian time is measured independently of space; and the existence is presupposed of a suitable timekeeper.]10
If space and time are the frameworks of our universe, and if the only thing that truly matters is measurable outcomes, it's clear that we need ways to measure space and time from the very start. Regardless of whether we think space and time exist for real, it's evident that we can’t measure either directly. We have to measure space by taking measurements from one physical object to another; we’ll measure time through a similar system based on events. We’ll discuss the measurement of time more extensively later; for now, we just need to note that Newtonian time is measured separately from space, and we assume the existence of an appropriate timekeeping device.10
[The space of Galileo and Newton was conceived of as empty, except in so far as certain parts of it were occupied by matter. Positions of bodies in this space were in general determined by reference to]283 [a “coordinate system” of some kind. This is again something that demands a certain amount of discussion. [36]
[The space of Galileo and Newton was seen as empty, except where certain areas were filled with matter. The locations of objects in this space were generally determined by a “coordinate system” of some sort. This is another topic that needs some discussion. [__A_TAG_PLACEHOLDER_0__]
The Space Reference Frame
The mathematician, following the lead of the great French all-around genius, Descartes, shows us very clearly how to set up, for the measurement of space, the framework known as the Cartesian coordinate system. The person of most ordinary mathematical attainments will realize that to locate a point in a plane we must have two measurements; and we could probably show this person, without too serious difficulty, that we can locate a point in any surface by two measurements. An example of this is the location of points on the earth’s surface by means of their latitude and longitude. It is equally clear that if we add a third dimension and attempt to locate points in space, we must add a third measurement. In the case of points on the earth’s surface, this might be the elevation above sea level, which would define the point not as part of the spherical surface of the earth but as part of the solid sphere. Or we may fall back on Dr. Slosson’s suggestion that in order to define completely the position of his laboratory, we must make a statement about Broadway, and one about 116th Street, and one telling how many flights of stairs there are to climb. In any event, it should be clear enough that the complete definition of a point in space calls for three measurements.
The mathematician, following the example of the great French polymath, Descartes, clearly demonstrates how to establish the framework known as the Cartesian coordinate system for measuring space. Anyone with basic math skills will understand that to pinpoint a location on a plane, we need two measurements; we can probably show this person, without too much trouble, that we can identify a point on any surface with two measurements. For instance, we locate points on the earth’s surface using latitude and longitude. It’s also clear that if we add a third dimension and try to identify points in space, we must include a third measurement. In the case of points on the earth’s surface, this could be elevation above sea level, which would define the point not just on the earth's curved surface but as part of the solid sphere. Alternatively, we might refer to Dr. Slosson’s suggestion that to fully describe the location of his lab, we need to specify something about Broadway, something about 116th Street, and how many flights of stairs to climb. In any case, it should be clear that a complete definition of a point in space requires three measurements.
The mathematician formulates all this with the utmost precision. He asks us to]* [pick out any point whatever in space and call it O. We then draw or conceive to be drawn through this point three mutually perpendicular lines called coordinate axes, [37]which we may designate OX, OY and OZ, respectively. Finally, we consider the three planes also mutually perpendicular like the two walls and the floor of a room that meet in one common corner, which are formed by the lines OX and OY, OY and OZ, and OZ and OX, respectively. These three planes are called coordinate planes. And then any other point P in space can be represented with respect to O by its perpendicular distances from each of the three coordinate planes—the distances x, y, z in the figure. These quantities are called the coordinates of the point.]272
The mathematician lays this out with great precision. He asks us to pick any point in space and call it O. Then, we draw or imagine three lines through this point that are all perpendicular to each other, known as the coordinate axes, [__A_TAG_PLACEHOLDER_0__]which we can label as OX, OMG, and OZ, respectively. Finally, we consider three planes that are also perpendicular, like the two walls and the floor of a room meeting at one corner. These planes are formed by the lines OX and OMG, OY and OZ, and Oz and OX, respectively. These three planes are called coordinate planes. Now, any other point P in space can be represented in relation to O by its perpendicular distances from each of the three coordinate planes—the distances

[To the layman there seems something altogether naive in this notion of the scientist’s setting up the three sides of a box in space and using them as the basis of all his work. The layman somehow feels that while it is perfectly all right for him to tell us that he lives at 1065 (one coordinate) 156th Street (two coordinates) on the third floor (three coordinates), it is rather trivial business for the serious-minded [38]scientist to consider the up-and-down, the forward-and-back, the right-and-left of every point with which he has occasion to deal. There seems to the layman something particularly inane and foolish and altogether puerile about a set of coordinate axes, and you simply can’t make him believe that the serious-minded scientist has to monkey with any such funny business. He can’t be induced to take this coordinate-axis business seriously. Nevertheless, the fact is that the scientist takes it with the utmost seriousness. It is necessary for him to define the positions of points; and he does do it by means of a set of coordinate axes.
[To the average person, there seems to be something quite naive about the idea of a scientist creating three sides of a box in space and using them as the foundation for all their work. The average person feels that while it's perfectly fine for him to say he lives at 1065 (one coordinate) 156th Street (two coordinates) on the third floor (three coordinates), it seems pretty trivial for a serious scientist to consider the up-and-down, the forward-and-back, and the right-and-left of every point they work with. The average person finds something particularly silly and foolish about a set of coordinate [__A_TAG_PLACEHOLDER_0__]axes, and you just can't convince him that a serious scientist needs to mess around with something so absurd. He can't be persuaded to take this coordinate-axis idea seriously. However, the truth is that the scientist treats it with the utmost seriousness. It is essential for him to define the positions of points; and he does do it by using a set of coordinate axes.
The scientist, however, is not interested in points of empty space. The point is to him merely part again of the conceptual machinery which he uses in his effort to run along with the external world. He knows there are no real points, but it suits his convenience to keep track of certain things that are real by representing them as points. But these things are in practically every instance material bodies; and in practically every instance, instead of staying put in one spot, they insist upon moving about through space. The scientist has to use his coordinate system, not merely to define a single position of such a “point,” but to keep track of the path over which it moves and to define its position in that path at given moments.
The scientist, however, isn’t focused on empty points in space. For him, a point is just a part of the conceptual tools he uses to engage with the external world. He understands that there aren’t any real points, but it’s convenient for him to track certain real things by representing them as points. However, these things are almost always physical objects, and nearly every time, instead of staying in one place, they insist on moving through space. The scientist needs to use his coordinate system not just to define a single position of such a “point,” but to monitor the path it takes and specify its position along that path at different moments.
Time and the Coordinate System
This introduces the concept of time into intimate relationship with the spatial coordinate system. [39]And at once we feel the lack of a concrete, visualized fourth dimension.]* [If we want to fix objects in the floor alone, the edge of the room running toward the ceiling would become unnecessary and could be dropped from our coordinate system. That is, we need only two coordinates to fix the position of a point in a plane. Suppose instead of discarding the third coordinate, we use it to represent units of time. It then enables us to record the time it took a moving point in the floor to pass from position to position. Certain points in the room would be vertically above the corresponding points occupied by the moving point in its path across the floor; and the vertical height above the floor of such points corresponds to a value of the time-coordinate which indicates the time it took the point to move from position to position.]152 [Just as the path of the point across the floor is a continuous curve (for the mathematician, it should be understood, this term “curve” includes the straight line, as a special case in which the curvature happens to be zero); so the series of points above these in the room forms a continuous curve which records for us, not merely the path of the point across the floor, but in addition the time of its arrival at each of its successive positions. In the algebraic work connected with such a problem, the third coordinate behaves exactly the same, regardless of whether we consider it to represent time or a third spatial dimension; we cannot even tell from the algebra what it does represent.
This introduces the idea of time in close relation to the spatial coordinate system. [__A_TAG_PLACEHOLDER_0__]Right away, we notice the absence of a clear, visualized fourth dimension. If we only want to fix objects on the floor, the edge of the room extending toward the ceiling would be unnecessary and could be removed from our coordinate system. In other words, we only need two coordinates to determine a point's location on a plane. Instead of discarding the third coordinate, we can use it to represent units of time. This allows us to track the time it takes for a moving point on the floor to move from one position to another. Certain points in the room would be directly above the corresponding points occupied by the moving point on the floor; the vertical distance above the floor of those points corresponds to a time-coordinate that shows how long it took for the point to move from position to position. ]152 Just like the path of the point moving across the floor is a continuous curve (and for mathematicians, it's important to note that the term “curve” includes straight lines as a special case where the curvature is zero); the series of points above these in the room creates a continuous curve that not only shows the point's path across the floor but also the time it arrives at each of its successive positions. In the algebra related to this problem, the third coordinate behaves exactly the same whether we view it as representing time or as a third spatial dimension; we cannot even determine from the algebra what it represents.
When we come to the more general case of a point moving freely through space, we have but three coordinates at our disposal; there is not a fourth one [40]by aid of which we can actually diagram its time-space record. Nevertheless, we can write down the numerical and algebraic relations between its three space-coordinates and the time which it takes to pass from one position to another; and by this means we can make all necessary calculations. Its motion is completely defined with regard both to space and to time. We are very apt to call attention to the fact that if we did have at our disposal a fourth, space-coordinate, we could use it to represent the time graphically, as before, and actually construct a geometric picture of the path of our moving point with regard to space and time. And on this account we are very apt to speak as though the time measurements constituted a fourth coordinate, regardless of any question of our ability to construct a picture of this coordinate. The arrival of a point in a given position constitutes an event; and this event is completely defined by means of four coordinates—three in space, which we can picture on our coordinate axes, and one in time which we cannot.
When we look at the general case of a point moving freely through space, we only have three coordinates available; there isn’t a fourth one [__A_TAG_PLACEHOLDER_0__] that we can actually use to diagram its time-space record. However, we can write down the numerical and algebraic relationships between its three spatial coordinates and the time it takes to move from one position to another; this allows us to perform all necessary calculations. Its motion is completely defined in terms of both space and time. We often highlight that if we did have a fourth spatial coordinate, we could use it to represent time graphically, just like before, and actually create a geometric illustration of the path of our moving point in relation to space and time. For this reason, we often talk as if time measurements serve as a fourth coordinate, regardless of whether we can visually represent this coordinate. The arrival of a point at a specific position is an event, and this event is fully defined using four coordinates—three in space that we can visualize on our coordinate axes, and one in time that we cannot.
The set of coordinate axes in space, together with the zero point from which we measure time, constitute what we call a frame of reference. If we are not going to pay any attention to time, we can think of the space coordinate system alone as constituting our reference frame. This expression appears freely throughout the subsequent text, and always with one or the other of these interpretations.
The coordinate axes in space, along with the starting point we use to measure time, make up what we refer to as a frame of reference. If we choose to ignore time, we can simply consider the spatial coordinate system as our reference frame. This term will appear frequently in the following text, and will always have one of these meanings.
We see, then, how we can keep track of a moving point by keeping track of the successive positions which it occupies in our reference frame.]* [Now we have implied that these coordinate axes are fixed [41]in space; but there is nothing to prevent us from supposing that they move.]272 [If they do, they carry with them all their points; and any motion of these points which we may speak about will be merely motion with reference to the coordinate system. If we find something outside our coordinate system that is not moving, the motion of points in our system with regard to those outside it will be a combination of their motion with regard to our coordinate axes and that of these axes with regard to the external points. This will be a great nuisance; and it represents a state of affairs which we shall try to avoid. We shall avoid it, if at all, by selecting a coordinate system with reference to which we, ourselves, are not moving; one which partakes of any motion which we may have. Or perhaps we shall sometimes wish to reverse the process, in studying the behavior of some group of bodies, and seek a set of axes which is at rest with respect to these bodies; one which partakes of any motion they may have.
We can see how to track a moving point by monitoring the positions it takes up in our reference frame. We’ve suggested that these coordinate axes are fixed in space, but there’s nothing stopping us from imagining that they move. If they do move, they carry all their points with them, and any motion we discuss will just be relative to the coordinate system. If we find something outside our coordinate system that isn’t moving, the motion of points in our system compared to those outside will be a mix of their motion relative to our coordinate axes and that of these axes compared to the external points. This could be quite problematic, and it represents a situation we’ll try to avoid. We’ll avoid it, if possible, by choosing a coordinate system where we ourselves aren’t moving; one that accounts for any motion we may have. Alternatively, sometimes we might want to reverse this process when studying a group of bodies and look for a set of axes that is still relative to these bodies; one that considers any motion they might have.
Choosing a Coordinate Frame
All this emphasizes the fact that our coordinate axes are not picked out for us in advance by nature, and set down in some one particular spot. We select them for ourselves, and we select them in the most convenient way. But different observers, or perhaps the same observer studying different problems, will find it advantageous to utilize different coordinate systems.]* [The astronomer has found it possible, and highly convenient, to select a coordinate frame such that the great majority of the stars have, on [42]the whole, no motion with respect to it.]283 [Such a system would be most unsuited for investigations confined to the earth; for these we naturally select a framework attached to the earth, with its origin O at the earth’s center if our investigation covers the entire globe and at some more convenient point if it does not, and in either event accompanying the earth in its rotation and revolution. But such a framework, as well as the one attached to the fixed stars, would be highly inconvenient for an investigator of the motions of the planets; he would doubtless attach his reference frame to the sun.]101
All of this highlights the fact that nature doesn’t designate our coordinate axes for us in advance or place them in a specific location. We choose them ourselves, and we select them in the most convenient way. However, different observers, or even the same observer looking at different issues, might find it useful to use different coordinate systems. The astronomer has found it possible and very convenient to choose a coordinate frame where most stars have, overall, no motion concerning it. Such a system would not work well for studies focused on Earth; for those, we naturally choose a framework attached to the Earth, with its origin at the Earth’s center if our study covers the entire globe and at a more convenient point if it doesn’t, while also moving with the Earth in its rotation and revolution. But this framework, like the one linked to the fixed stars, would be very inconvenient for someone studying the movements of the planets; they would likely connect their reference frame to the sun.
[In this connection a vital question suggests itself. Is the expression of natural law independent of or dependent upon the choice of a system of coordinates? And to what extent shall we be able to reconcile the results of one observer using one reference frame, and a second observer using a different one? The answer to the second question is obvious.]* [True, if any series of events is described using two different sets of axes, the descriptions will be different, depending upon the time system adopted and the relative motion of the axes. But if the connection between the reference systems is known, it is possible by mathematical processes to deduce the quantities observed in one system if those observed in the other are known.]35 [This process of translating the results of one observer into those of another is known as a transformation; and the mathematical statement of the rule governing the transformation is called the equation or the equations (there are usually several of them) of the transformation.]* [Transformations of this character constitute [43]a well-developed branch of mathematics.]35
[In this context, a crucial question arises. Is the expression of natural law independent of, or dependent on, the choice of a coordinate system? And to what extent can we reconcile the results of one observer using one reference frame with those of another observer using a different one? The answer to the second question is clear.]* [Indeed, if a series of events is described using two different sets of axes, the descriptions will differ, based on the time system chosen and the relative motion of the axes. However, if the relationship between the reference systems is known, it is possible, through mathematical processes, to derive the quantities observed in one system from those observed in the other.]35 [This process of converting the results of one observer into those of another is known as a transformation; and the mathematical expression of the rule governing the transformation is referred to as the equation or equations (there are usually several of them) of the transformation.]* [Transformations of this type make up [__A_TAG_PLACEHOLDER_0__]a well-established branch of mathematics.]35
[When we inquire about the invariance of natural law it is necessary to be rather sure of just what we mean by this expression. The statement that a given body is moving with a velocity of 75 miles per hour is of course not a natural law; it is a mere numerical observation. But aside from such numerical results, we have a large number of mathematical relations which give us a more or less general statement of the relations that exist between velocities, accelerations, masses, forces, times, lengths, temperatures, pressures, etc., etc. There are some of these which we would be prepared to state at once as universally valid—distance travelled equals velocity multiplied by time, for instance. We do not believe that any conceivable change of reference systems could bring about a condition in which the product of velocity and time, as measured from a certain framework, would fail to equal distance as measured from this same framework. There are other relations more or less of the same sort which we probably believe to be in the same invariant category; there are others, perhaps, of which we might be doubtful; and presumably there are still others which we should suspect of restricted validity, holding in certain reference systems only and not in others.
[When we question the consistency of natural law, it's important to be clear about what we mean by this term. Saying that a particular object is moving at 75 miles per hour isn’t a natural law; it’s simply a numerical observation. Beyond such numerical data, we have many mathematical relationships that provide a more or less general overview of the connections between velocities, accelerations, masses, forces, times, lengths, temperatures, pressures, and so on. Some of these we can confidently assert as universally true—like the idea that distance traveled equals velocity multiplied by time. We don’t think that any possible change in reference systems could lead to a scenario where the product of velocity and time, measured from a specific frame, would not equal distance measured from that same frame. There are other relationships that we probably consider to be in the same invariant category; some we might be uncertain about; and likely, there are others that we suspect only hold true in certain reference systems and not in others.]
The question of invariance of natural law, then, may turn out to be one which may be answered in the large by a single statement; it may equally turn out to be one that has to be answered in the small, by considering particular laws in connection with particular transformations between particular reference systems. Or, perhaps, we may find ourselves justified [44]in taking the stand that an alleged “law of nature” is truly such a law only in the event that it is independent of the change from one reference system to another. In any event, the question may be formulated as follows:
The issue of whether natural law is invariant might be something we can answer broadly with a simple statement; it could also require us to look closely at specific laws related to specific transformations between particular reference systems. Or maybe we’ll conclude that a so-called “law of nature” is genuinely that only if it is independent of changes from one reference system to another. Regardless, we can phrase the question this way:
Observer A, using the reference system R, measures certain quantities t, w, x, y, z. Observer B, using the reference system S, measures the same items and gets the values t′, w′, x′, y′, z′. The appropriate transformation equations for calculating the one set of values from the other is found. If a mathematical relation of any sort is found to exist between the values t, w, x, y, z, will the same relation exist between the values t′, w′, x′, y′, z′? If it does not, are we justified in still calling it a law of nature? And if it does not, and we refrain from calling it such a law, may we expect in every case to find some relation that will be invariant under the transformation, and that may therefore be recognized as the natural law connecting t, w, x, y and z?
Observer A, using reference system R, measures certain quantities t, w, x, y, z. Observer B, using reference system S, measures the same quantities and gets the values t′, w′, x′, y', z'. The proper transformation equations for calculating one set of values from the other are identified. If a mathematical relationship exists between the values t, w, x, y, z, will the same relationship exist for the values t′, w′, x′, y', z′? If it does not, can we still call it a law of nature? And if it does not, and we choose not to call it such a law, can we expect to find a relationship in every case that will remain invariant under the transformation, and that can therefore be recognized as the natural law connecting t, w, x, y, and z?
I have found it advisable to discuss this point in such detail because here more than in any other single place the competing essayists betray uncertainty of thought and sloppiness of expression. It doesn’t amount to much to talk about the invariance of natural laws and their persistence as we pass from one coordinate system to another, unless we are fairly well fortified with respect to just what we mean by invariance and by natural law. We don’t expect the velocity of a train to be 60 miles per hour alike when we measure it with respect to a signal tower along the line and with respect to a moving train on the other track. We don’t expect the [45]angular displacement of Mars to change as rapidly when he is on the other side of the sun as when he is on our side. But we do, I think, rather expect that in any phenomenon which we may observe, we shall find a natural law of some sort which is dependent for its validity neither upon the units we employ, nor the place from which we make our measurements, nor anything else external to the phenomenon itself. We shall see, later, whether this expectation is justified, or whether it will have to be discarded in the final unravelling of the absolutist from the relativistic philosophy which, with Einstein, we are to undertake.]* [46]
I’ve found it important to discuss this point in detail because, more than anywhere else, the competing essayists show uncertainty in their thoughts and carelessness in their writing. It doesn’t really mean much to talk about the constancy of natural laws and their persistence as we move from one coordinate system to another unless we clearly understand what we mean by constancy and natural law. We don’t expect a train's speed to be 60 miles per hour when we measure it against a signal tower along the tracks and against another moving train on a parallel track. We also don’t expect Mars' angular displacement to change as quickly when it’s on the other side of the sun compared to when it’s on our side. However, I think we do expect to find some sort of natural law in any phenomenon we observe, one that doesn’t depend on our units, the location of our measurements, or anything else outside the phenomenon itself. We’ll see later whether this expectation is valid or if it needs to be reconsidered as we differentiate between absolutist and relativistic philosophies, which we will explore with Einstein. [__A_TAG_PLACEHOLDER_0__]
III
THE RELATIVITY OF UNIFORM MOTION
Classical Ideas on the Subject; the Ether and the Apparent Possibility of Absolute Motion; the Michelson-Morley Experiment and the Final Negation of This Possibility
Classic Ideas on the Subject; the Ether and the Potential Existence of Absolute Motion; the Michelson-Morley Experiment and the Final Rejection of This Possibility
When we speak of a body as being “in motion,” we mean that this body is changing its position “in space.” Now it is clear that the position of an object can only be determined with reference to other objects: in order to describe the place of a material thing we must, for example, state its distances from other things. If there were no such bodies of reference, the words “position in space” would have no definite meaning for us.]24 [The number of such external bodies of reference which it is necessary to cite in order to define completely the position of a given body in space depends upon the character of the space dealt with. We have seen that when we visualize the space of our experience as a surface of any character, two citations are sufficient; and that when we conceive of it as surrounding us in three dimensions we require three. It will be realized that the mathematician [47]is merely meeting this requirement when he sets up his system of coordinate axes to serve as a reference frame.]*
When we talk about a body being “in motion,” we mean that it is changing its position “in space.” It’s clear that the position of an object can only be defined in relation to other objects: to describe where a physical thing is, we need to mention its distances from other things. Without these reference bodies, the phrase “position in space” wouldn't have any clear meaning for us.24 [The number of external reference bodies needed to fully define the position of a specific body in space depends on the nature of the space in question. We’ve seen that when we picture the space of our experience as a surface of any kind, two references are enough; and when we think of it as surrounding us in three dimensions, we need three. It's understood that the mathematician [__A_TAG_PLACEHOLDER_0__] is simply addressing this need when he establishes his system of coordinate axes to serve as a reference frame.]
[What is true of “place” must be true also of “motion,” since the latter is nothing but change of place. In fact, it would be impossible to ascribe a state of motion or of rest to a body poised all alone in empty space. Whether a body is to be regarded as resting or as moving, and if the latter at what speed, depends entirely upon the objects to which we refer its positions in space.]24 [As Einstein sits at his desk he appears to us to be at rest; but we know that he is moving with the rotation of the earth on its axis, with the earth in its orbit about the sun, and with the solar system in its path through space—a complex motion of which the parts or the whole can be detected only by reference to appropriately chosen ones of the heavenly bodies. No mechanical test has ever been devised which will detect this motion,]182 [if we reserve for discussion in its proper place the Foucault pendulum experiment which will reveal the axial rotation of our globe.]* [No savage, if he were to “stand still,” could be convinced that he was moving with a very high velocity or in fact that he was moving at all.]30 [You drop a coin straight down a ship’s side: from the land its path appears parabolic; to a polar onlooker it whirls circle-wise; to dwellers on Mars it darts spirally about the sun; to a stellar observer it gyrates through the sky]263 [in a path of many complications. To you it drops in a straight line from the deck to the sea.]* [Yet its various tracks in ship-space, sea-space, earth-space, sun-space, star-space, are all [48]equally real,]263 [and the one which will be singled out for attention depends entirely upon the observer, and the objects to which he refers the motion.]* [The earth moves in the solar system, which is itself approaching a distant star-cluster. But we cannot say whether we are moving toward the cluster, or the cluster toward us,]18 [or both, or whether we are conducting a successful stern chase of it, or it of us,]* [unless we have in mind some third body with reference to which the motions of earth and star-cluster are measured.]18 [And if we have this, the measurements made with reference to it are of significance with regard to it, rather than with regard to the earth and the star-cluster alone.]*
[What is true of “place” must also be true of “motion,” since motion is just a change of place. In fact, it would be impossible to say whether a body is in motion or at rest if it is alone in empty space. Whether a body is viewed as at rest or in motion, and if it's in motion at what speed, depends entirely on the objects we use as references for its position in space.]24 [As Einstein sits at his desk, he seems to be at rest; but we know he is moving because of the rotation of the Earth on its axis, the Earth's orbit around the sun, and the solar system's movement through space—a complex motion that can only be detected by comparing it to selected celestial bodies. No mechanical test has ever been created that can detect this motion,]182 [if we save for later discussion the Foucault pendulum experiment, which shows the axial rotation of our planet.]* [No primitive person, if he were to “stand still,” could be convinced that he was moving at a very high speed, or that he was moving at all.]30 [You drop a coin straight down the side of a ship: from the land, its path looks parabolic; to a polar observer, it seems to whirl in a circle; to Martians, it darts spirally around the sun; to a stellar observer, it moves through the sky]263 [in a path of many complexities. To you, it falls straight down from the deck into the sea.]* [Yet its various paths in ship-space, sea-space, earth-space, sun-space, star-space, are all [__A_TAG_PLACEHOLDER_0__]equally real,]263 [and the one that gets the most attention depends entirely on the observer, and the objects to which he compares the motion.]* [The Earth moves within the solar system, which is also moving toward a distant star cluster. But we can’t say whether we are moving toward the cluster, or the cluster is moving toward us,]18 [or if both are moving, or whether we are successfully chasing it, or it is pursuing us,]* [unless we have some third body in mind to compare the motions of the Earth and the star cluster.]*18 [And if we use this, the measurements made in relation to it are significant for that reference point, rather than just for the Earth and the star cluster alone.]*
[We can express all this by saying “All motions are relative; there is no such thing as absolute motion.” This line of argument has in fact been followed by many natural philosophers. But is its result in agreement with actual experience? Is it really impossible to distinguish between rest and motion of a body if we do not take into consideration its relations to other objects? In fact it can easily be seen that, at least in many cases, no such distinction is possible.
[We can say it like this: “All movements are relative; there’s no such thing as absolute motion.” Many natural philosophers have actually followed this line of reasoning. But does it align with real-world experience? Is it truly impossible to tell the difference between a body at rest and one in motion without considering its relationship to other objects? It’s clear that, at least in many situations, such a distinction can’t be made.]
Who’s Moving?
Imagine yourself sitting in a railroad car with veiled windows and running on a perfectly straight track with unchanging velocity: you would find it absolutely impossible to ascertain by any mechanical means whether the car were moving or not. All mechanical instruments behave exactly the same, [49]whether the car be standing still or in motion.]24 [If you drop a ball you will see it fall to the floor in a straight line, just as though you had dropped it while standing on the station platform. Furthermore, if you drop the ball from the same height in the two cases, and measure the velocities with which it strikes the car floor and the station platform, or the times which it requires for the descent, you will find these identical in the two cases.]182
Imagine you're sitting in a train car with tinted windows, rolling down a perfectly straight track at a constant speed: you'd find it completely impossible to tell, using any mechanical method, whether the car is moving or not. All mechanical instruments react the same way, [__A_TAG_PLACEHOLDER_0__]whether the car is stationary or in motion.]24 [If you drop a ball, you’ll see it fall to the floor in a straight line, just like if you had dropped it while standing on the platform. Also, if you drop the ball from the same height in both situations and measure how fast it hits the floor of the car and the platform, or how long it takes to fall, you'll find these measurements are the same in both cases.]182
[Any changes of speed or of direction (as when the car speeds up or slows down or rounds a curve) can be detected by observing the behavior of bodies in the car, without apparent reference to any outside objects. This becomes particularly obvious with sudden irregularities of motion, which manifest themselves by shaking everything in the car. But a uniform motion in a straight line does not reveal itself by any phenomenon within the vehicle.]24
[Any changes in speed or direction (like when the car accelerates, decelerates, or goes around a curve) can be noticed by watching how things inside the car react, without needing to refer to anything outside. This is especially clear during sudden, erratic movements, which cause everything in the car to shake. However, a steady motion in a straight line doesn’t show itself through anything happening inside the vehicle.]24
[Moreover, if we remove the veil from our window to the extent that we may observe the train on the adjoining track, we shall be able to make no decision as to whether we or it be moving. This is indeed an experience which we have all had.]* [Often when seated in a train about to leave the station, we have thought ourselves under way, only to perceive as the motion becomes no longer uniform that another train has been backing into the station on the adjoining track. Again, as we were hurried on our journey, we have, raising suddenly our eyes, been puzzled to say whether the passing train were moving with us or against us or indeed standing still; or more rarely we have had the impression that both it and we seemed to be at rest, when in truth both [50]were moving rapidly with the same speed.]82 [Even this phrase “in truth” is a relative one, for it arises through using the earth as an absolute reference body. We are indeed naive if we cannot appreciate that there is no reason for doing this beyond convenience, and that to an observer detached from the earth it were just as reasonable to say that the rails are sliding under the train as that the train is advancing along the rails. One of my own most vivid childhood recollections is of the terror with which, riding on a train that passed through a narrow cut, I hid my head in the maternal lap to shut out the horrid sight of the earth rushing past my window. The absence of a background in relatively slow retrograde motion was sufficient to prevent my consciousness from drawing the accustomed conclusion that after all it was really the train that was moving.]*
Moreover, if we pull back the curtain from our window enough to see the train on the next track, we won't be able to tell if we’re moving or if it is. This is something we’ve all experienced. Often, when we’re seated on a train about to leave the station, we think we’re on our way, only to realize as the motion becomes uneven that another train is backing into the station on the next track. Similarly, as we’re rushing on our journey, we might suddenly look up and feel confused about whether the passing train is moving with us, against us, or is actually standing still; or, more rarely, we might feel that both trains are at rest when, in reality, both [__A_TAG_PLACEHOLDER_0__] are moving quickly at the same speed.82 Even the phrase “in truth” is relative since it comes from using the earth as a fixed point of reference. It’s naive not to recognize that we only do this for convenience, and that for an observer detached from the earth, it would make just as much sense to say that the rails are sliding under the train as to say the train is moving along the rails. One of my most vivid childhood memories is of the fear I felt while riding a train that went through a narrow cut, as I buried my head in my mom's lap to block out the terrifying sight of the ground rushing past my window. The lack of a background in relatively slow backward motion was enough to prevent my mind from reaching the usual conclusion that it was actually the train that was moving.
Mechanical Relativity
[So we can enunciate the following principle: When a body is in uniform rectilinear motion relatively to a second body, then all phenomena take place on the first in exactly the same manner as on the second; the physical laws for the happenings on both bodies are identical.]24 [And between a system of bodies, nothing but relative motion may be detected by any mechanical means whatever; any attempt to discuss absolute motion presupposes a super-observer on some body external to the system. Even then, the “absolute” motion is nothing but motion relative to this super-observer. By no mechanical [51]means is uniform straight-line motion of any other than relative character to be detected. This is the Principle of Mechanical Relativity.
[So we can state the following principle: When one object moves in a straight line at a constant speed relative to another object, then all events occurring on the first object happen in exactly the same way as on the second; the physical laws governing the events on both objects are the same.]24 [And within a system of objects, only relative motion can be observed by any mechanical means; any attempt to discuss absolute motion assumes a super-observer on some object outside the system. Even then, the “absolute” motion is only motion relative to this super-observer. No mechanical [__A_TAG_PLACEHOLDER_0__]means can detect uniform straight-line motion as anything other than relative. This is the Principle of Mechanical Relativity.
There is nothing new in this. It was known to Galileo, it was known to Newton, it has been known ever since. But the curious persistence of the human mind in habits of thought which confuse relativity with absolutism brought about a state of affairs where we attempted to know this and to ignore it at the same time. We shall have to return to the mathematical mode of reasoning to see how this happened. The mathematician has a way all his own of putting the statement of relativity which we have made. He recalls, what we have already seen, that the observer on the earth who is measuring his “absolute” motion with respect to the earth has merely attached his reference framework to the earth; that the passenger in the train who measures all motion naively with respect to his train is merely carrying his coordinate axes along with his baggage, instead of leaving them on the solid ground; that the astronomer who deals with the motion of the earth about the sun, or with that of the “fixed” stars against one another, does so simply by the artifice of hitching his frame of reference to the sun or to one of the fixed stars. So the mathematician points out that dispute as to which of two bodies is in motion comes right down to dispute as to which of two sets of coordinate axes is the better one, the more nearly “natural” or “absolute.” He therefore phrases the mechanical principle of relativity as follows:
There’s nothing new about this. Galileo knew it, Newton knew it, and it has been known ever since. Yet, the stubborn tendency of the human mind to mix up relativity with absolutism has led us to a situation where we try to understand this while also ignoring it. We need to go back to mathematical reasoning to figure out how this happened. Mathematicians have their own way of explaining the concept of relativity that we’ve discussed. They remind us, as we have already seen, that the observer on Earth measuring his “absolute” motion relative to Earth has only attached his reference point to the Earth; that the passenger in the train, who naively measures all motion relative to his train, is just moving his coordinate system along with his luggage instead of leaving it on solid ground; that the astronomer analyzing the motion of Earth around the sun, or that of the “fixed” stars against each other, does so by cleverly aligning his reference frame with the sun or one of the fixed stars. Thus, the mathematician points out that the debate about which of two bodies is in motion really comes down to a disagreement about which of two sets of coordinate axes is better, more “natural” or “absolute.” He therefore states the mechanical principle of relativity as follows:
Among all coordinate systems that are merely in [52]uniform straight-line motion to one another, no one occupies any position of unique natural advantage; all such systems are equivalent for the investigation of natural laws; all systems lead to the same laws and the same results.
Among all coordinate systems that are simply in [__A_TAG_PLACEHOLDER_0__] uniform straight-line motion relative to each other, none has a special natural advantage; all these systems are equivalent for studying natural laws; they all lead to the same laws and the same outcomes.
The mathematician has thus removed the statement of relativity from its intimate association with the external observed phenomena, and transferred it to the observer and his reference frame. We must either accept the principle of relativity, or seek a set of coordinate axes that have been singled out by nature as an absolute reference frame. These axes must be in some way unique, so that when we refer phenomena to them, the laws of nature take a form of exceptional simplicity not attained through reference to ordinary axes. Where shall we look for such a preferred coordinate system?]*
The mathematician has moved the concept of relativity away from its close connection with external observed phenomena and placed it onto the observer and their reference frame. We must either embrace the principle of relativity or look for a set of coordinate axes that nature has designated as an absolute reference frame. These axes need to be unique in some way so that when we relate phenomena to them, the laws of nature become exceptionally simple, unlike what we get with ordinary axes. Where should we search for such a preferred coordinate system?
The Quest for the Ultimate
[Older theory clung to the belief that there was such a thing as absolute motion in space.]197 [As the body of scientific law developed from the sixteenth century onward, the not unnatural hypothesis crept in, that these laws (that is to say, their mathematical formulations rather than their verbal statements) would reveal themselves in especially simple forms, were it possible for experimenters to make their observations from some absolute standpoint; from an absolutely fixed position in space rather than from the moving earth.]264 [Somewhere a set of coordinate axes incapable of motion was to be found,]197 [a fixed set of axes for measuring [53]absolute motion; and for two hundred years the world of science strove to find it,]147 [in spite of what should have been assurance that it did not exist. But the search failed, and gradually the universal applicability of the principle of relativity, so far as it concerned mechanical phenomena, grew into general acceptance.]* [And after the development, by the great mathematicians of the eighteenth century, of Newton’s laws of motion into their most complete mathematical form, it was seen that so far as these laws are concerned the absolutist hypothesis mentioned is quite unsupported. No complication is introduced into Newton’s laws if the observer has to make his measurements in a frame of reference moving uniformly through space; and for measurements in a frame like the earth, which moves with changing speed and direction about the sun and rotates on its axis at the same time, the complication is not of so decisive a nature as to give us any clue to the earth’s absolute motion in space.
[Older theory held onto the belief that absolute motion in space existed.]197 [As scientific laws developed from the sixteenth century onward, the reasonable hypothesis emerged that these laws (specifically, their mathematical expressions rather than their verbal descriptions) would show themselves in particularly simple forms if experimenters could make observations from an absolute standpoint; from a completely fixed position in space instead of from the moving earth.]264 [Somewhere, a set of coordinate axes that couldn't move was thought to exist,]197 [a fixed set of axes for measuring [__A_TAG_PLACEHOLDER_0__]absolute motion; for two hundred years, the scientific community tried to find it,]147 [despite the fact that there should have been assurance it didn't exist. The search was unsuccessful, and gradually the universal applicability of the principle of relativity, regarding mechanical phenomena, became widely accepted.]* [After the great mathematicians of the eighteenth century developed Newton’s laws of motion into their most complete mathematical form, it became clear that as far as these laws are concerned, the previously mentioned absolutist hypothesis lacks support. No complexity is introduced into Newton’s laws if the observer has to take measurements in a frame of reference moving uniformly through space; and for measurements in a frame like the earth, which moves with varying speed and direction around the sun while rotating on its axis, the complexity isn't significant enough to provide any insight into the earth’s absolute motion in space.
But mechanics, albeit the oldest, is yet only one of the physical sciences. The great advance made in the mathematical formulation of optical and electromagnetic theory during the nineteenth century revived the hope of discovering absolute motion in space by means of the laws derived from this theory.]264 [Newton had supposed light to be a material emanation, and if it were so, its passage across “empty space” from sun and stars to the earth raised no problem. But against Newton’s theory Huyghens, the Dutch astronomer, advanced the idea that light was a wave motion of some sort. During the Newtonian period and for many years [54]after, the corpuscular theory prevailed; but eventually the tables were turned.]* [Men made rays of light interfere, producing darkness (see page 61). From this, and from other phenomena like polarization, they had deduced that light was a form of wave motion similar to water ripples; for these interfere, producing level surfaces, or reinforce each other, producing waves of abnormal height. But if light were to be regarded as a form of wave motion—and the phenomena could apparently be explained on no other basis—then there must be some medium capable of undergoing this form of motion.]135 [Transmission of waves across empty space without the aid of an intermediary material medium would be “action at a distance,” an idea repugnant to us. Trammeled by our tactual, wire-pulling conceptions of a material universe, we could not accustom ourselves to the idea of something—even so immaterial a something as a wave—being transmitted by nothing. We needed a word—ether—to carry light if not to shed it; just as we need a word—inertia—to carry a projectile in its flight.]231 [It was necessary to invest this medium with properties to account for the observed facts. On the whole it was regarded as the perfect fluid.]235 [The ether was imagined as an all-pervading, imponderable substance filling the vast emptiness through which light reaches us, and as well the intermolecular spaces of all matter. Nothing more was known definitely, yet this much served as a good working hypothesis on the basis of which Maxwell was enabled to predict the possibility of radio communication. By its fruits the ether hypothesis justified [55]itself; but does the ether exist?]231
But mechanics, although the oldest, is still just one of the physical sciences. The significant progress made in mathematically formulating optical and electromagnetic theories during the nineteenth century rekindled the hope of finding absolute motion in space through the laws derived from this theory.]264 [Newton believed light was a material emission, and if that were true, its journey across “empty space” from the sun and stars to Earth posed no problem. However, against Newton’s theory, the Dutch astronomer Huygens proposed that light was a type of wave motion. Throughout the Newtonian era and for many years [__A_TAG_PLACEHOLDER_0__] afterward, the particle theory dominated; but eventually, things changed.]* [People made rays of light interfere, creating darkness (see page 61). From this and other phenomena like polarization, they concluded that light was a form of wave motion similar to water ripples; because these can interfere, leveling surfaces, or reinforce each other, creating waves of unusual height. But if light is considered a form of wave motion—and the phenomena could seemingly only be explained this way—then there must be some medium that can carry this wave motion.]135 [Sending waves through empty space without a material medium would represent “action at a distance,” which is an idea we find hard to accept. Limited by our tangible, hands-on understanding of a material universe, we struggled to grasp the concept of something—even as intangible as a wave—being transmitted through nothing. We needed a term—ether—to carry light, if not to produce it; just like we needed a term—inertia—to sustain a projectile in its trajectory.]231 [It was essential to give this medium properties to explain the observed facts. In general, it was seen as a perfect fluid.]235 [The ether was imagined as an all-encompassing, weightless substance filling the vast void through which light reaches us, including the intermolecular spaces of all matter. Nothing more was definitively known, yet this much served as a solid working hypothesis, which enabled Maxwell to predict the potential for radio communication. The outcomes of the ether hypothesis validated [__A_TAG_PLACEHOLDER_0__] itself; but does the ether exist?]231
The Ether and Absolute Motion
[If it does exist, it seems quite necessary, on mere philosophical grounds, that it shall be eligible to serve as the long-sought reference frame for absolute motion. Surely it does not make sense to speak of a homogeneous medium filling all space, sufficiently material to serve as a means of communication between remote worlds, and in the next breath to deny that motion with respect to this medium is a concept of significance.]* [Such a system of reference as was offered by the ether, coextensive with the entire known region of the universe, must necessarily serve for all motions within our perceptions.]186 [The conclusion seems inescapable that motion with respect to the ether ought to be of a sufficiently unique character to stand out above all other motion. In particular, we ought to be able to use the ether to define, somewhere, a system of axes fixed with respect to the ether, the use of which would lead to natural laws of a uniquely simply description.
[If it does exist, it seems essential, based on philosophical reasoning, that it should serve as the long-sought reference point for absolute motion. It certainly doesn’t make sense to talk about a uniform medium filling all of space, which is substantial enough to serve as a means of communication between distant worlds, and then in the next breath deny that motion in relation to this medium is a meaningful concept.]186 [A reference system like the ether, which extends throughout the entire known universe, must necessarily account for all motions within our perception.] The conclusion seems unavoidable that motion concerning the ether should be uniquely characteristic enough to distinguish it from all other types of motion. Specifically, we should be able to use the ether to define, at some point, a system of axes fixed with respect to the ether, which would enable natural laws to be described in a uniquely simple way.
Maxwell’s work added fuel to this hope.]* [During the last century, after the units of electricity had been defined, one set for static electrical calculations and one for electromagnetic calculations, it was found that the ratio of the metric units of capacity for the two systems was numerically equal to what had already been found as the velocity with which light is transmitted through the hypothetical ether. One definition refers to electricity at rest, the other [56]to electricity in motion. Maxwell, with little more working basis than this, undertook to prove that electrical and optical phenomena were merely two aspects of a common cause,]235 [to which the general designation of “electromagnetic waves” was applied. Maxwell treated this topic in great fullness and with complete success. In particular, he derived certain equations giving the relations between the various electrical quantities involved in a given phenomenon. But it was found, extraordinarily enough, that these relations were of such character that, when we subject the quantities involved to a change of coordinate axes, the transformed quantities did not preserve these relations if the new axes happened to be in motion with respect to the original ones. This, of course, was taken to indicate that motion really is absolute when we come to deal with electromagnetic phenomena, and that the ether which carries the electromagnetic waves really may be looked to to display the properties of an absolute reference frame.
Maxwell’s work boosted this optimism. During the last century, after defining the units of electricity—one set for static electrical calculations and another for electromagnetic calculations—it was discovered that the ratio of the metric units of capacity for both systems was numerically equal to the speed at which light travels through the hypothetical ether. One definition pertains to electricity at rest, while the other [__A_TAG_PLACEHOLDER_0__] relates to electricity in motion. With little more than this foundation, Maxwell set out to demonstrate that electrical and optical phenomena were simply two sides of the same coin,]235 [which came to be known as “electromagnetic waves.” Maxwell explored this topic in great detail and successfully. Specifically, he derived certain equations that described the relationships between the various electrical quantities involved in a given phenomenon. However, it was surprisingly found that these relationships were such that when we changed the coordinate axes, the transformed quantities did not maintain these relationships if the new axes were in motion relative to the original ones. This, of course, indicated that motion really is absolute when dealing with electromagnetic phenomena, and that the ether carrying the electromagnetic waves could indeed be considered to exhibit the properties of an absolute reference frame.
Reference to the phenomenon of aberration, which Dr. Pickering has discussed adequately in his essay and which I need therefore mention here only by name, indicated that the ether was not dragged along by material bodies over and through which it might pass. It seemed that it must filter through such bodies, presumably via the molecular interstices, without appreciable opposition. Were this not the case, we should be in some doubt as to the possibility of observing the velocity through the ether of material bodies; if the ether adjacent to such bodies is not dragged along or thrown into [57]eddies, but “stands still” while the bodies pass, there seems no imaginable reason for anything other than the complete success of such observations. And of course these are of the utmost importance, the moment we assign to the ether the rôle of absolute reference frame.
Reference to the phenomenon of aberration, which Dr. Pickering has thoroughly discussed in his essay, need only be mentioned by name here, indicated that the ether wasn’t dragged along by the material bodies it flows over and through. It appeared that it must filter through these bodies, likely via the molecular gaps, without significant resistance. If this weren’t true, we would question whether it’s even possible to observe the velocity of material bodies through the ether; if the ether next to these bodies isn’t dragged or thrown into [__A_TAG_PLACEHOLDER_0__]eddies, but instead “stays still” while the bodies move, there would be no reason to doubt the complete success of such observations. And of course, these observations are extremely important once we assign the ether the role of an absolute reference frame.
The Earth and the Universe
One body in motion with respect to the ether is our earth itself. We do not know in advance in what direction to expect this motion or what magnitude to anticipate that it will have. But one thing is clear.]* [In its motion around the sun, the earth has, at opposite points on its orbit, a difference in velocity with respect to the surrounding medium which is double its orbital velocity with respect to the sun. This difference comes to 37 miles per second. The earth should therefore, at some time in the year, show a velocity equal to or greater than 18½ miles per second, with reference to the universal medium. The famous Michelson-Morley experiment of 1887 was carried out with the expectation of observing this velocity.]267
One moving body in relation to the ether is our planet Earth. We can't predict which way this motion will go or how fast it will be. But one thing is certain. In its orbit around the sun, the Earth has, at opposite points on its path, a difference in speed regarding the surrounding medium that is double its orbital speed relative to the sun. This difference amounts to 37 miles per second. Therefore, at some point during the year, the Earth should show a speed equal to or greater than 18½ miles per second in relation to the universal medium. The famous Michelson-Morley experiment of 1887 was conducted with the hope of measuring this speed. 267
[The ether, of course, and hence velocities through it, cannot be observed directly.
But it acts as the medium for the transmission of light.]* [If the velocity of light
through the ether is C and that of the earth through the ether is v, then the velocity of light past the earth, so the argument runs, must vary from
to
, according as the light is moving exactly in the same direction as the earth, or
in the opposite [58]direction,]182 [or diagonally across the earth’s path so as to get the influence only of a part
of the earth’s motion. This of course assumes that C has always the same value; an assumption that impresses one as inherently probable,
and one that is at the same time in accord with ordinary astronomical observation.
[The ether, of course, and the speeds through it, can't be seen directly. But it acts as the medium for transmitting light.] * [If the speed of light through the ether is C and the speed of the earth through the ether is v, then the speed of light passing the earth, as the argument goes, must range from to
, depending on whether the light is moving exactly in the same direction as the earth or in the opposite [__A_TAG_PLACEHOLDER_0__]direction,]182 [or diagonally across the earth’s path, capturing only part of the earth’s motion. This naturally assumes that C always has the same value; an assumption that seems inherently likely and is also consistent with regular astronomical observations.]
It is not possible to measure directly the velocity of light (186,330 miles per second, more or less) with sufficient accuracy to give any meaning to the variation in this velocity which might be effected by adding or subtracting that of the earth in its orbit (a mere 18½ miles per second). It is, however, possible to play a trick on the light by sending it back and forth over several paths, and comparing (not measuring absolutely, but merely comparing) with great minuteness the times consumed in these several round trips.
It’s not possible to measure the speed of light (about 186,330 miles per second) directly with enough accuracy to make sense of any changes in that speed caused by the Earth’s orbit (which is only about 18½ miles per second). However, we can play a trick on the light by bouncing it back and forth along different paths and comparing (not measuring absolutely, but simply comparing) the time it takes for these various round trips with great precision.
A Trip Upstream and Back
The number of letters the Scientific American has received questioning the Michelson-Morley experiment indicates that many people are not acquainted with the fundamental principle on which it is based. So let us look at a simple analogous case. Suppose a swimmer or a rower make a return trip upstream and down, contending with the current as he goes up and getting its benefit when he comes down. Obviously, says snap judgment, since the two legs of the journey are equal, he derives exactly as much benefit from the current when he goes with it as he suffers handicap from it when [59]he goes against it; so the round trip must take exactly the same time as a journey of the same length in still water, the argument applying equally in the case where the “swimmer” is a wave of light in the ether stream.
The number of letters that Scientific American has received questioning the Michelson-Morley experiment shows that many people are not familiar with the basic principle it relies on. So, let’s consider a simple similar case. Imagine a swimmer or a rower making a round trip upstream and downstream, fighting against the current on the way up and benefiting from it on the way down. Obviously, it seems at first glance that because the two parts of the journey are the same, they experience equal advantages from the current when going with it and equal disadvantages when going against it; therefore, the round trip should take exactly the same amount of time as a journey of the same distance in still water. This reasoning applies equally to the scenario where the “swimmer” is a light wave moving through the ether stream.
But let us look now at a numerical case. A man can row in still water at four miles per hour. He rows twelve miles upstream and back, in a current of two miles per hour. At a net speed of two miles per hour he arrives at his turning point in six hours. At a net speed of six miles per hour he makes the down-stream leg in two hours. The elapsed time for the journey is eight hours; in still water he would row the twenty-four miles in six hours.
But now let's examine a specific example. A man can row in calm water at four miles per hour. He rows twelve miles upstream and back, against a current of two miles per hour. At an effective speed of two miles per hour, he reaches his turnaround point in six hours. At an effective speed of six miles per hour, he completes the downstream part of the trip in two hours. The total time for the journey is eight hours; if there were no current, he would row the twenty-four miles in six hours.
If we were to attempt an explanation of this result in words we should say that by virtue of the very fact that it does delay him, the adverse current prolongs the time during which it operates; while by virtue of the very fact that it accelerates his progress, the favoring current shortens its venue. The careless observer realizes that distances are equal between the two legs of the journey, and unconsciously assumes that times are equal.
If we were to explain this result in simpler terms, we would say that because the opposing current delays him, it extends the time it takes to affect him; whereas, since the helpful current speeds him up, it reduces the time it takes. The casual observer notices that the distances are the same for both parts of the journey and unknowingly assumes that the times are equal.
If the journey be made directly with and directly against the stream of water or ether or what not, retardation is effected to its fullest extent. If the course be a diagonal one, retardation is felt to an extent measurable as a component, and depending for its exact value upon the exact angle of the path. Felt, however, it must always be.
If the journey is made straight with and straight against the flow of water or air or whatever, the slowdown is maximized. If the path is diagonal, the slowdown is experienced to a measurable extent, and its exact value depends on the specific angle of the path. However, it must always be felt.
Here is where we begin to get a grip on the problem of the earth and the ether. In any problem involving the return-trip principle, there will enter [60]two velocities—that of the swimmer and that of the medium; and the time of retardation. If we know any two of these items we can calculate the third. When the swimmer is a ray of light and the velocity of the medium is that of the ether as it flows past the earth, we know the first of these two; we hope to observe the retardation so that we may calculate the second velocity. The apparatus for the experiment is ingenious and demands description.
Here is where we start to understand the issue of the earth and the ether. In any situation involving the return-trip principle, there will be two speeds involved—one for the swimmer and one for the medium; and the time of delay. If we know any two of these factors, we can figure out the third. When the swimmer is a beam of light, and the speed of the medium is that of the ether as it moves past the earth, we know the first of these two; we aim to observe the delay so we can calculate the second speed. The setup for the experiment is clever and needs to be described.
The Michelson-Morley Experiment
The machine is of structural steel, weighing 1,900 pounds. It has two arms which form a Greek cross. Each arm is 14 feet in length. The whole apparatus is floated in a trough containing 800 pounds of mercury.
The machine is made of structural steel and weighs 1,900 pounds. It has two arms that form a Greek cross. Each arm measures 14 feet in length. The entire setup is floated in a trough filled with 800 pounds of mercury.

Four mirrors are arranged on the end of each arm, sixteen in all, with a seventeenth mirror, M, set at one of the inside corners of the cross, as [61]diagrammed. A source of light (in this case a calcium flame) is provided, and its rays directed by a lens toward the mirror M. Part of the light is allowed to pass straight through M to the opposite arm of the cross, where it strikes mirror 1. It is reflected back across the arm to mirror 2, thence to 3, and so on until it reaches mirror 8. Thence it is reflected back to mirror 7, to 6, and so on, retracing its former path, and finally is caught by the reverse side of the mirror M and is sent to an observer at O. In retracing its path the light sets up an interference phenomenon (see below) and the interference bands are visible to the observer, who is provided with a telescope to magnify the results.
Four mirrors are positioned at the end of each arm, making a total of sixteen, with a seventeenth mirror, M, placed at one of the inside corners of the cross, as [__A_TAG_PLACEHOLDER_0__]illustrated. A light source (in this instance, a calcium flame) is used, and its rays are directed through a lens toward mirror M. Some of the light passes directly through M to the opposite arm of the cross, where it hits mirror 1. It reflects back across the arm to mirror 2, then to mirror 3, and continues until it reaches mirror 8. From there, it reflects back to mirror 7, to 6, and so on, retracing its previous path, and finally gets caught by the reverse side of mirror M, which sends it to an observer at O. As the light retraces its path, it creates an interference phenomenon (see below), and the interference bands are visible to the observer, who has a telescope to enhance the results.
A second part of the original light-beam is reflected off at right angles by the mirror M, and is passed to and fro on the adjacent arms of the machine, in exactly the same manner and over a similar path, by means of the mirrors I, II, III, … VIII. This light finally reaches the observer at the telescope, setting up a second set of interference bands, parallel to the first.
A second part of the original light beam is reflected at a right angle by mirror M and is directed back and forth along the neighboring arms of the machine, in exactly the same way and over a similar path, using mirrors I, II, III, … VIII. This light ultimately reaches the observer at the telescope, creating a second set of interference bands that are parallel to the first.
A word now about this business of light interference. Light is a wave motion. The length of a wave is but a few millionths of an inch, and the amplitude is correspondingly minute; but none the less, these waves behave in a thoroughly wave-like manner. In particular, if the crests of two waves are superposed, there is a double effect; while if a crest of one wave falls with a trough of another, there is a killing-off or “interference”.
A quick note about light interference. Light acts like a wave. The wavelength is just a few millionths of an inch, and the amplitude is also very small; however, these waves still behave like true waves. Specifically, when the crests of two waves overlap, they amplify each other; but when a crest from one wave coincides with a trough from another, they cancel each other out or cause "interference."
Under ordinary circumstances interference of [62]light waves does not occur. This is simply because under ordinary circumstances light waves are not piled up on one another. But sometimes this piling up occurs; and then, just so sure as the piled-up waves are in the same phase they reinforce one another, while if they are in opposite phase they interfere. And the conditions which we have outlined above, with the telescope and the mirrors and the ray of light retracing the path over which it went out, are conditions under which interference does occur. If the returning wave is in exact phase with the outgoing one, the effect is that of uniform double illumination; if it is in exactly opposite phase the effect is that of complete extinguishing of the light, the reversed wave exactly cancelling out the original one. If the two rays are partly in phase, there is partial reinforcement or partial cancelling out, according to whether they are nearly in phase or nearly out of phase. Finally, if the mirrors are not set absolutely parallel—as must in practice be the case when we attempt to measure their parallelism in terms of the wave-length of light—adjacent parts of the light ray will vary in the extent to which they are out of phase, since they will have travelled a fraction of a wave-length further to get to and from this, that or the other mirror. There will then appear in the telescope alternate bands of illumination and darkness, whose width and spacing depend upon all the factors entering into the problem.
Under normal conditions, the interference of [__A_TAG_PLACEHOLDER_0__] light waves doesn’t happen. This is mainly because light waves typically don’t overlap each other. However, sometimes they do stack up, and when that happens, if the stacked waves are in phase, they reinforce each other. But if they are out of phase, they interfere. The conditions we mentioned earlier—with the telescope, mirrors, and the ray of light retracing its path—are situations where interference does occur. If the returning wave matches the outgoing one perfectly, you get consistent double illumination; if it’s exactly out of phase, it completely cancels the original light, making it vanish. If the two rays are somewhat in phase, there will be partial reinforcement or partial cancellation, depending on how close they are to being in phase or out of phase. Lastly, if the mirrors aren’t perfectly parallel—which is what we would need to measure their alignment in terms of the wavelength of light—different parts of the light ray will be at varying degrees of out-of-phase since they will have traveled slightly different distances to reach each mirror. This will then create alternating bands of light and dark in the telescope, with their width and spacing influenced by all the factors involved.
If it were possible for us to make the apparatus with such a degree of refinement that the path from mirror M via mirrors 1, 2, 3, etc., back through M and into the telescope, were exactly the same length [63]as that from flame to telescope by way of the mirrors I, II, III, etc.—exactly the same to a margin of error materially less than a single wave-length of light—why, then, the two sets of interference fringes would come out exactly superposed provided the motion of the earth through the “ether” turn out to have no influence upon the velocity of light; or, if such influence exist, these fringes would be displaced from one another to an extent measuring the influence in question. But our ability to set up this complicated pattern of mirrors at predetermined distances falls far short of the wave-length as a measure of error. So in practice all that we can say is that having once set the instrument up, and passed a beam of light through it, there will be produced two sets of parallel interference fringes. These sets will fail of superposition—each fringe of one set will be removed from the corresponding fringe of the other set—by some definite distance. Then, any subsequent variation in the speed of light along the two arms will at once be detected by a shifting of the interference bands through a distance which we shall be able to measure.
If we could build the apparatus with such precision that the path from mirror M through mirrors 1, 2, 3, etc., back through M and into the telescope was exactly the same length [__A_TAG_PLACEHOLDER_0__]as the path from the flame to the telescope via mirrors I, II, III, etc.—precisely the same within an error margin significantly less than a single wavelength of light—then the two sets of interference fringes would perfectly overlap, assuming the Earth's motion through the “ether” has no effect on the speed of light; or, if such an effect does exist, these fringes would be shifted apart by an amount that reflects that influence. However, our ability to arrange this complex pattern of mirrors at predetermined distances is far from accurate at the wavelength level. So practically, all we can say is that once we set up the instrument and pass a beam of light through it, we will generate two sets of parallel interference fringes. These sets will not overlap—each fringe of one set will be offset from its corresponding fringe in the other set—by a specific distance. Then, any subsequent change in the speed of light along the two paths will immediately be noticeable by a shift in the interference bands by a measurable distance.
The Decision
Under the theories and assumptions governing at the time of the original performance of this experiment, it will be readily seen that if this machine be set up in an “ether stream” with one arm parallel to the direction of the stream and the other at right angles thereto, there will be a difference in the speed of the light along the two arms. Then if the apparatus [64]be shifted to a position oblique to the ether stream, the excess velocity of the light in the one arm would be diminished, and gradually come to zero at the 45-degree angle, after which the light traveling along the other arm would assume the greater speed. In making observations, therefore, the entire apparatus was slowly rotated, the observers walking with it, so that changes of the sort anticipated would be observed.
Under the theories and assumptions that applied when this experiment was originally conducted, it’s easy to see that if this machine is set up in an “ether stream” with one arm aligned with the direction of the stream and the other arm at a right angle to it, there will be a difference in the speed of light along the two arms. If the apparatus [__A_TAG_PLACEHOLDER_0__] is then moved to a position at an angle to the ether stream, the extra speed of light in one arm would decrease and eventually reach zero at a 45-degree angle, at which point the light traveling in the other arm would move faster. To make observations, the entire apparatus was slowly rotated, with the observers moving along with it to notice the expected changes.
The investigators were, however, ignorant of the position in which the apparatus ought to be set to insure that one of the arms lie across the ether drift; and they were ignorant of the time of year at which the earth’s maximum velocity through the ether was to be looked for. In particular, it is plain that if the solar system as a whole is moving through the ether at a rate less than the earth’s orbital velocity, there is a point in our orbit where our velocity through the ether and that around the sun just cancel out and leave us temporarily in a state of “absolute rest.” So it was anticipated that the experiment might have to be repeated in many orientations of the machine and at many seasons of the year in order to give a series of readings from which the true motion of the earth through the ether might be deduced.
The investigators didn’t know how to position the apparatus to ensure that one of the arms aligned with the ether drift, and they were unaware of the time of year when the earth’s maximum velocity through the ether would occur. Specifically, it’s clear that if the solar system is moving through the ether at a speed less than the earth’s orbital velocity, there will be a point in our orbit where our velocity through the ether and our speed around the sun cancel each other out, leaving us in a state of “absolute rest” for a moment. Therefore, it was expected that the experiment might need to be repeated in various orientations of the machine and at different times of the year to gather a range of readings that would reveal the true motion of the earth through the ether.
For those who have a little algebra the demonstration which Dr. Russell gives on a subsequent page will be interesting as showing the situation in perfectly general terms. It will be realized that the more complicated arrangement of mirrors in the experiment as just described is simply an eightfold repetition of the simple experiment as outlined by [65]Dr. Russell, and that it was done so for the mere sake of multiplying by eight the distances travelled and hence the difference in time and in phase.
For those with some algebra knowledge, the demonstration that Dr. Russell presents on the next page will be intriguing as it illustrates the situation in general terms. It's clear that the more complex arrangement of mirrors in the experiment described is just an eightfold repetition of the straightforward experiment outlined by [__A_TAG_PLACEHOLDER_0__]Dr. Russell, and it was done simply to multiply the distances traveled by eight, thus affecting the difference in time and phase.
And now for the grand climax. The experiment was repeated many times, with the original and with other apparatus, indoors and outdoors, at all seasons of the year, with variation of every condition that could imaginably affect the result. The apparatus was ordinarily such that a shift in the fringes of anywhere from one-tenth to one one-hundredth of that which would have followed from any reasonable value for the earth’s motion through the ether would have been systematically apparent. The result was uniformly negative. At all times and in all directions the velocity of light past the earth-bound observer was the same. The earth has no motion with reference to the ether!
And now for the big finale. The experiment was conducted multiple times, using the original setup and other equipment, both indoors and outdoors, throughout all seasons, with every possible change to conditions that could impact the outcome. The equipment was typically sensitive enough that a shift in the fringes of anywhere from one-tenth to one-hundredth of what would be expected from any reasonable value for the earth's movement through the ether should have been clearly noticeable. The results were consistently negative. At all times and in all directions, the speed of light as observed from Earth was the same. The Earth has no motion relative to the ether!
[The amazing character of this result is not by any possibility to be exaggerated.]* [According to one experiment the ether was carried along by a rapidly moving body and according to another equally well-planned and well-executed experiment a rapidly moving body did not disturb the ether at all. This was the blind alley into which science had been led.]232
[The incredible nature of this outcome can't be overstated.]* [In one experiment, the ether was moved along by a fast-moving object, while in another well-designed and executed experiment, a fast-moving object didn’t affect the ether at all. This was the dead end that science had ended up in.]232
The “Contraction” Theory
[Numerous efforts were made to explain the contradiction.]* [It is indeed a very puzzling one, and it gave physicists no end of trouble. However Lorentz and Fitzgerald finally put forward an ingenious explanation, to the effect that the actual motion of the earth through the ether is balanced, as far [66]as the ability of our measuring instruments is concerned, by a contraction of these same instruments in the direction of their motion. This contraction obviously cannot be observed directly because all bodies, including the measuring instruments themselves (which after all are only arbitrary guides), will suffer the contraction equally. According to this theory, called the Lorentz-Fitzgerald contraction theory,]272 [all bodies in motion suffer such contraction of their length in the direction of their motion;]283 [the contraction being made evident by our inability to observe the absolute motion of the earth, which it is assumed must exist.]272 [This would suffice to show why the Michelson-Morley experiment gave a negative result, and would preserve the concept of absolute motion with reference to the ether.]283
[Numerous efforts were made to explain the contradiction.]* [It is indeed a very puzzling one, and it gave physicists endless trouble. However, Lorentz and Fitzgerald finally proposed an clever explanation, suggesting that the actual motion of the earth through the ether is balanced, as far [__A_TAG_PLACEHOLDER_0__]as our measuring instruments can tell, by a contraction of these same instruments in the direction of their motion. This contraction obviously cannot be observed directly because all bodies, including the measuring instruments themselves (which are basically arbitrary guides), will experience the contraction equally. According to this theory, called the Lorentz-Fitzgerald contraction theory,]272 [all bodies in motion experience such contraction of their length in the direction of their motion;]283 [the contraction is highlighted by our inability to observe the absolute motion of the earth, which it is assumed must exist.]272 [This would explain why the Michelson-Morley experiment produced a negative result and would maintain the concept of absolute motion concerning the ether.]283
[This proposal of Lorentz and Fitzgerald loses its startling aspect when we consider that all matter appears to be an electrical structure, and that the dimensions of the electric and magnetic fields which accompany the electrons of which it is constituted change with the velocity of motion.]267 [The forces of cohesion which determine the form of a rigid body are held to be electromagnetic in nature; the contraction may be regarded as due to a change in the electromagnetic forces between the molecules.]10 [As one writer has put it, the orientation, in the electromagnetic medium, of a body depending for its very existence upon electromagnetic forces is not necessarily a matter of indifference.]*
[This idea from Lorentz and Fitzgerald seems less surprising when we realize that all matter is essentially made up of electrical structures, and that the sizes of the electric and magnetic fields surrounding the electrons that make it up change with the speed of motion.]267 [The forces that hold a solid object's shape are thought to be electromagnetic; the contraction can be seen as a result of changes in the electromagnetic forces between the molecules.]10 [As one writer noted, the orientation of an object that relies on electromagnetic forces for its very existence in the electromagnetic medium is not something to be taken lightly.]*
[Granting the plausibility of all this, on the basis of an electromagnetic theory of matter, it leaves us [67]in an unsatisfactory position. We are left with a fixed ether with reference to which absolute motion has a meaning, but that motion remains undetected and apparently undetectable. Further, if we on shore measure the length of a moving ship, using a yard-stick which is stationary on shore, we shall obtain one result. If we take our stick aboard it contracts, and so we obtain a greater length for the ship. Not knowing our “real” motion through the ether, we cannot say which is the “true” length. Is it not, then, more satisfactory to discard all notion of true length as an inherent quality of bodies, and, by regarding length as the measure of a relation between a particular object and a particular observer, to make one length as true as the other?]182 [The opponents of such a viewpoint contend that Michelson’s result was due to a fluke; some mysterious counterbalancing influence was for some reason at work, concealing the result which should normally have been expected. Einstein refuses to accept this explanation;]192 [he refuses to believe that all nature is in a contemptible conspiracy to delude us.]*
[Assuming all of this is plausible based on an electromagnetic theory of matter, it leaves us [__A_TAG_PLACEHOLDER_0__] in a frustrating situation. We're left with a fixed ether that gives absolute motion meaning, but that motion remains undetected and seemingly undetectable. Additionally, when we measure the length of a moving ship from shore using a stationary yardstick, we get one result. If we take the yardstick onto the ship, it contracts, and we end up measuring a longer length for the ship. Since we don’t know our “real” motion through the ether, we can’t determine which measurement is the “true” length. Isn’t it better to let go of the idea of true length as an inherent quality of objects, and instead see length as a measure of the relationship between a specific object and a specific observer, making one measurement just as valid as the other?]182 [Those who oppose this viewpoint argue that Michelson’s results were a fluke; some mysterious counterbalancing force must have been at play, hiding the result that we would normally expect. Einstein rejects this explanation;]192 [he refuses to accept that all of nature is in a ridiculous conspiracy to mislead us.]*
[The Fitzgerald suggestion is further unsatisfactory because it assumes all substances, of whatever density, to undergo the same contraction; and above all for the reason that it sheds no light upon other phenomena.]194 [It is indeed a very special explanation; that is, it applies only to the particular experiment in question. And indeed it is only one of many possible explanations. Einstein conceived the notion that it might be infinitely more valuable to take the most general explanation possible, and then try to find from this its logical consequences. This “most [68]general explanation” is, of course, simply that it is impossible in any way whatever to measure the absolute motion of a body in space.]272 [Accordingly Einstein enunciated, first the Special Theory of Relativity, and later the General Theory of Relativity. The special theory was so called because it was, limited to uniform rectilinear and non-rotary motions. The general theory, on the other hand, dealt not only with uniform rectilinear motions, but with any arbitrary motion whatever.
[The Fitzgerald suggestion is still not ideal because it assumes that all substances, regardless of their density, experience the same contraction; and most importantly, it doesn’t clarify other phenomena.]194 [It is indeed a very specific explanation; it only applies to the specific experiment at hand. In fact, it is just one of many potential explanations. Einstein came up with the idea that it might be much more useful to develop the broadest explanation possible, and then see what logical implications arise from that. This “most [__A_TAG_PLACEHOLDER_0__]general explanation” is, of course, simply that it is impossible to measure the absolute motion of a body in space in any way.]272 [As a result, Einstein articulated first the Special Theory of Relativity, and later the General Theory of Relativity. The special theory is called so because it is limited to uniform straight-line and non-rotational motions. The general theory, however, addressed not just uniform straight-line motions, but any kind of motion at all.]
Taking Charge of the Situation
The hypothesis of relativity asserts that there can be no such concept as absolute position, absolute motion, absolute time; that space and time are inter-dependent, not independent; that everything is relative to something else. It thus accords with the philosophical notion of the relativity of all knowledge.]283 [Knowledge is based, ultimately, upon measurement; and clearly all measurement is relative, consisting merely in the application of a standard to the magnitude measured. All metric numbers are relative; dividing the unit multiplies the metric number. Moreover, if measure and measured change proportionately, the measuring number is unchanged. Should space with all its contents swell in fixed ratio throughout, no measurement could detect this; nor even should it pulse uniformly throughout. Furthermore, were space and space-contents in any way systematically transformed (as by reflection in curved mirrors) point for point, continuously, without rending, no measurement [69]could reveal this distortion; experience would proceed undisturbed.]263
The hypothesis of relativity states that there’s no such thing as absolute position, absolute motion, or absolute time; that space and time are interconnected, not independent; and that everything is relative to something else. This aligns with the philosophical idea that all knowledge is relative.283 Knowledge ultimately relies on measurement, and clearly all measurement is relative, as it involves applying a standard to the quantity being measured. All metric numbers are relative; if you divide the unit, you increase the metric number. Additionally, if the measure and the measured change proportionately, the measuring number stays the same. If space and everything in it expanded uniformly and proportionally, no measurement could detect this; nor could it even if it pulsed uniformly. Furthermore, if space and its contents were systematically transformed (like with reflections in curved mirrors) point by point, continuously, without tearing, no measurement [__A_TAG_PLACEHOLDER_0__]could show this distortion; experience would continue without interruption.263
[Mark Twain said that the street in Damascus “which is called straight,” is so called because while it is not as straight as a rainbow it is straighter than a corkscrew. This expresses the basic idea of relativity—the idea of comparison. All our knowledge is relative, not absolute. Things are big or little, long or short, light or heavy, fast or slow, only by comparison. An atom may be as large, compared to an electron, as is a cathedral compared to a fly. The relativity theory of Einstein emphasizes two cases of relative knowledge; our knowledge of time and space, and our knowledge of motion.]216 [And in each case, instead of allowing the notions of relativity to guide us only so far as it pleases us to follow them, there abandoning them for ideas more in accord with what we find it easy to take for granted, Einstein builds his structure on the thesis that relativity must be admitted, must be followed out to the bitter end, in spite of anything that it may do to our preconceived notions. If relativity is to be admitted at all, it must be admitted in toto; no matter what else it contradicts, we have no appeal from its conclusions so long as it refrains from contradicting itself.]*
[Mark Twain mentioned that the street in Damascus "which is called straight," is named that way because, while it’s not as straight as a rainbow, it is straighter than a corkscrew. This illustrates the core concept of relativity—the idea of comparison. All our knowledge is relative, not absolute. Things are considered big or small, long or short, light or heavy, fast or slow, only through comparison. An atom can be as large, compared to an electron, as a cathedral is compared to a fly. Einstein's theory of relativity highlights two aspects of relative knowledge: our understanding of time and space, and our understanding of motion.216 [In both cases, rather than allowing the ideas of relativity to take us only as far as we find convenient, and then abandoning them for more agreeable concepts, Einstein constructs his framework on the premise that relativity must be accepted, it has to be pursued to its ultimate conclusion, regardless of how it challenges our existing beliefs. If we are to accept relativity at all, we must accept it in toto; no matter what else it contradicts, we can't contest its conclusions as long as it doesn’t contradict itself.] *
[The hypothesis of relativity was developed by Einstein through a priori methods, not the more usual a posteriori ones. That is, certain principles were enunciated as probably true, the consequences of these were developed, and these deductions tested by comparison of the predicted and the observed phenomena. It was in no sense attained by the [70]more usual procedure of observing groups of phenomena and formulating a law or formula which would embrace them and correctly describe the routine or sequence of phenomena.
[The theory of relativity was formulated by Einstein using a priori methods, rather than the more common a posteriori approaches. This means that certain principles were stated as likely true, the implications of these were explored, and these deductions were tested by comparing the predicted outcomes with the observed phenomena. It was not achieved through the [__A_TAG_PLACEHOLDER_0__]usual practice of observing groups of phenomena and creating a law or formula that would encompass them and accurately describe the pattern or sequence of events.]
The first principle thus enunciated is that it is impossible to measure or detect absolute translatory motion through space, under any circumstances or by any means. The second is that the velocity of light in free space appears the same to all observers regardless of the relative motion of the source of light and the observer. This velocity is not affected by motion of the source toward or away from the observer,]283 [if we may for the moment use this expression with its implication of absolute motion.]* [But universal relativity insists that motion of the source toward the observer is identical with motion of the observer toward the source.]283
The first principle stated is that it's impossible to measure or detect absolute translatory motion through space, under any conditions or by any methods. The second principle is that the speed of light in free space seems the same to all observers, no matter how the source of light or the observer is moving relative to each other. This speed isn’t influenced by the source moving towards or away from the observer,]283 [if we can use this term for now, implying absolute motion.] * [However, universal relativity argues that the motion of the source towards the observer is the same as the motion of the observer towards the source.]283
[It will be seen that we are at once on the horns of a dilemma. Either we must give up relativity before we get fairly started on it, or we must overturn the foundations of common sense by admitting that time and space are so constituted that when we go to meet an advancing light-impulse, or when we retreat from it, it still reaches us with the same velocity as though we stood still waiting for it. We shall find when we are through with our investigation that common sense is at fault; that our fixed impression of the absurdity of the state of affairs just outlined springs from a confusion between relativism and absolutism which has heretofore dominated our thought and gone unquestioned. The impression of absurdity will vanish when we have resolved this confusion.]* [71]
[It’s clear that we’re facing a tough choice. We either give up on relativity before we even begin, or we have to challenge the basics of common sense by accepting that time and space are set up in such a way that whether we approach an advancing light wave or move away from it, it still reaches us at the same speed as if we were standing still, waiting for it. By the end of our investigation, we’ll discover that common sense is mistaken; our strong feeling that this situation is absurd comes from a mix-up between relativism and absolutism that has influenced our thinking and gone unchallenged. The sense of absurdity will fade once we clear up this confusion.]* [__A_TAG_PLACEHOLDER_0__]
Common Sense Questions
[But it is obvious from what has just been said that if we are to adopt Einstein’s theory, we must make very radical changes in some of our fundamental notions, changes that seem in violent conflict with common sense. It is unfortunate that many popularizers of relativity have been more concerned to astonish their readers with incredible paradoxes than to give an account such as would appeal to sound judgment. Many of these paradoxes do not belong essentially to the theory at all. There is nothing in the latter that an enlarged and enlightened common sense would not readily endorse. But common sense must be educated up to the necessary level.]141
[But it's clear from what we've just discussed that if we want to embrace Einstein's theory, we need to make some significant changes to our fundamental ideas, changes that seem to clash with common sense. It's unfortunate that many people explaining relativity focus more on shocking their readers with unbelievable paradoxes than on providing a clear explanation that resonates with good judgment. Many of these paradoxes aren't actually part of the theory itself. There's nothing in the theory that a broader and more informed common sense wouldn't accept. However, common sense needs to be elevated to the appropriate level.]141
[There was a time when it was believed, as a result of centuries of experience, that the world was flat. This belief checked up with the known facts, and it could be used as the basis for a system of science which would account for things that had happened and that were to happen. It was entirely sufficient for the time in which it prevailed.
[There was a time when people thought, based on centuries of experience, that the world was flat. This belief aligned with what was known, and it could be used as the foundation for a scientific system that explained past events and predicted future ones. It was completely adequate for the era in which it was accepted.]
Then one day a man arose to point out that all the known facts were equally accounted for on the theory that the earth was a sphere. It was in order for his contemporaries to admit this, to say that so far as the facts in hand were concerned they could not tell whether the earth was flat or round—that new facts would have to be sought that would contradict one or the other hypothesis. Instead of this the world laughed and insisted that the earth could not be round because it was flat; that it could [72]not be round because then the people would fall off the other side.
Then one day, a man came forward to point out that all the known facts were equally explained by the idea that the earth was a sphere. His contemporaries needed to accept this and acknowledge that, based on the facts available, they couldn't determine if the earth was flat or round; new evidence would need to be found that would disprove one of the two ideas. Instead, people laughed and insisted that the earth couldn't be round because it was flat; they believed it couldn't be round because then people would fall off the other side. [__A_TAG_PLACEHOLDER_0__]
But the field of experimentation widened, and men were able to observe facts that had been hidden from them. Presently a man sailed west and arrived east; and it became clear that in spite of previously accepted “facts” to the contrary, the earth was really round. The previously accepted “facts” were then revised to fit the newly discovered truth; and finally a new system of science came into being, which accounted for all the old facts and all the new ones.
But the area of experimentation expanded, and people began to see facts that had been hidden from them. Soon, a man sailed west and ended up east, proving that despite what was previously accepted as "facts," the earth was actually round. The old "facts" were then updated to align with this new truth, leading to the creation of a new scientific system that explained both the old and new facts.
At intervals this sort of thing has been repeated. A Galileo shows that preconceived ideas with regard to the heavens are wrong, and must be revised to accord with his newly promulgated principles. A Newton does the same for physics—and people unlearn the “fact” that motion has to be supported by continued application of force, substituting the new idea that it actually requires force to stop a moving body. A Harvey shows that the things which have been “known” for generations about the human body are not so. A Lyell and a Darwin force men to throw overboard the things they have always believed about the way in which the earth and its creatures came into being. Every science we possess has passed through one or more of these periods of readjustment to new facts.
At times, this kind of thing has happened repeatedly. A Galileo shows that the preconceived notions about the universe are wrong and need to be changed to fit his newly introduced principles. A Newton does the same for physics—and people unlearn the “fact” that motion requires ongoing force, replacing it with the new idea that it actually takes force to stop a moving object. A Harvey reveals that the things that have been “known” for generations about the human body aren’t true. A Lyell and a Darwin compel people to discard long-held beliefs about how the Earth and its creatures came into existence. Every science we have has gone through one or more of these periods of adjustment to new facts.
Changing the Mindset
Now we are apt to lose sight of the true significance of this. It is not alone our opinions that are [73]altered; it is our fundamental concepts. We get concepts wholly from our perceptions, making them to fit those perceptions. Whenever a new vista is opened to our perceptions, we find facts that we never could have suspected from the restricted viewpoint. We must then actually alter our concepts to make the new facts fit in with the greatest degree of harmony. And we must not hesitate to undertake this alteration, through any feeling that fundamental concepts are more sacred and less freely to be tampered with than derived facts.]* [We do, to be sure, want fundamental concepts that are easy for a human mind to conceive; but we also want our laws of nature to be simple. If the laws begin to become, intricate, why not reshape, somewhat, the fundamental concepts, in order to simplify the scientific laws? Ultimately it is the simplicity of the scientific system as a whole that is our principal aim.]178
Now we can easily lose sight of what this really means. It's not just our opinions that change; it’s our core concepts. We form concepts entirely from our perceptions, adjusting them to fit what we see. Whenever we gain a new perspective, we discover facts that we couldn't have imagined from our previous limited viewpoint. We then need to actually change our concepts to incorporate these new facts in the most harmonious way possible. We shouldn't hesitate to make these changes, despite any belief that core concepts are more sacred and should be less easily altered than derived facts.]* [While we certainly want core concepts that are easy for the human mind to grasp, we also desire our laws of nature to be straightforward. If the laws start to become complicated, why not adjust some of the fundamental concepts to simplify the scientific laws? Ultimately, the simplicity of the entire scientific system is our main goal.]178
[As a fair example, see what the acceptance of the earth’s sphericity did to the idea represented by the word “down.” With a flat earth, “down” is a single direction, the same throughout the universe; with a round earth, “down” becomes merely the direction leading toward the center of the particular heavenly body on which we happen to be located. It is so with every concept we have. No matter how intrinsic a part of nature and of our being a certain notion may seem, we can never know that new facts will not develop which will show it to be a mistaken one. Today we are merely confronted by a gigantic example of this sort of thing. Einstein tells us that when velocities are attained which have just now come within the range of our close investigation, [74]extraordinary things happen—things quite irreconcilable with our present concepts of time and space and mass and dimension. We are tempted to laugh at him, to tell him that the phenomena he suggests are absurd because they contradict these concepts. Nothing could be more rash than this.
[As a clear example, look at how accepting that the Earth is round changed the idea behind the word “down.” With a flat Earth, “down” is a single direction, the same everywhere in the universe; with a round Earth, “down” just refers to the direction toward the center of the specific celestial body we’re on. This applies to every concept we have. No matter how fundamental a certain idea may seem in nature and our existence, we can never be sure that new information won't emerge that reveals it as incorrect. Right now, we’re faced with a massive example of this. Einstein tells us that when we reach speeds that are just now within the limits of our close examination, [__A_TAG_PLACEHOLDER_0__]extraordinary things occur—things that completely clash with our current understanding of time, space, mass, and dimensions. It’s tempting to dismiss him, to say that the phenomena he describes are ridiculous because they contradict these ideas. But nothing could be more foolish than that.]
When we consider the results which follow from physical velocities comparable with that of light, we must confess that here are conditions which have never before been carefully investigated. We must be quite as well prepared to have these conditions reveal some epoch-making fact as was Galileo when he turned the first telescope upon the skies. And if this fact requires that we discard present ideas of time and space and mass and dimension, we must be prepared to do so quite as thoroughly as our medieval fathers had to discard their notions of celestial “perfection” which demanded that there be but seven major heavenly bodies and that everything center about the earth as a common universal hub. We must be prepared to revise our concepts of these or any other fundamentals quite as severely as did the first philosopher who realized that “down” in London was not parallel to “down” in Bagdad or on Mars.]*
When we think about the outcomes that come from physical speeds similar to the speed of light, we have to admit that these are conditions that have never been thoroughly examined before. We need to be just as ready for these conditions to uncover some groundbreaking truth as Galileo was when he first pointed a telescope at the sky. And if this truth means that we have to let go of our current ideas about time, space, mass, and dimensions, we must be prepared to do so just as completely as our medieval ancestors had to abandon their beliefs about celestial “perfection,” which insisted on having only seven major celestial bodies and that everything revolved around the Earth as a universal center. We must be willing to revise our understanding of these or any other fundamental concepts as rigorously as the first philosopher who recognized that “down” in London is not side by side to “down” in Baghdad or on Mars.
[In all ordinary terrestrial matters we take the earth as a fixed body, light as instantaneous. This is perfectly proper, for such matters. But we carry our earth-acquired habits with us into the celestial regions. Though we have no longer the earth to stand on, yet we assume, as on the earth, that all measurements and movements must be referred to some fixed body, and are only then valid. We [75]cling to our earth-bound notion that there is an absolute up-and-down, back-and-forth, right-and-left, in space. We may admit that we can never find it, but we still think it is there, and seek to approach it as nearly as possible. And similarly from our earth experiences, which are sufficiently in a single place to make possible this simplifying assumption, we get the idea that there is one universal time, applicable at once to the entire universe.]141 [The difficulty in accepting Einstein is entirely the difficulty in getting away from these earth-bound habits of thought.]* [76]
[In everyday life on Earth, we see it as a stable place and light as instant. This is totally appropriate for practical purposes. However, we end up bringing our earth-based habits with us into space. Even though we no longer have solid ground beneath us, we still act like all measurements and movements need to be connected to a fixed point to be meaningful. We [__A_TAG_PLACEHOLDER_0__] cling to the earthly idea that there is an absolute up and down, back and forth, right and left in space. We might acknowledge that we'll never actually find it, but we still believe it exists and try to get as close as possible to it. Similarly, from our earthly experiences, which happen in one location allowing for this simplified view, we develop the notion of one universal time that applies to the whole universe.]141 [The challenge in accepting Einstein is really about breaking free from these earth-centered ways of thinking.]* [__A_TAG_PLACEHOLDER_0__]
IV
THE SPECIAL THEORY OF RELATIVITY
What Einstein’s Study of Uniform Motion Tells Us About Time and Space and the Nature of the External Reality
What Einstein’s Research on Uniform Motion Shows About Time, Space, and the Nature of External Reality
Whatever the explanation adopted for the negative result of the Michelson-Morley experiment, one thing stands out clearly: the attempt to isolate absolute motion has again failed.]* [Einstein generalizes this with all the other and older negative results of similar sort into a negative deduction to the effect that no experiment is possible upon two systems which will determine that one of them is in motion and the other at rest.]121 [He elevates the repeated failure to detect absolute motion through space into the principle that experiment will never reveal anything in the nature of absolute velocities. He postulates that all laws of nature can and should be enunciated in such forms that they are as true in these forms for one observer as for another, even though these observers with their frames of reference be in motion relative to one another.]264
Whatever explanation is taken for the negative result of the Michelson-Morley experiment, one thing is clear: the attempt to isolate absolute motion has failed again. Einstein generalizes this, along with other similar older negative results, into a negative conclusion that no experiment can determine which of two systems is in motion and which one is at rest. He raises the repeated failure to detect absolute motion through space to the principle that experiments will never reveal anything about absolute velocities. He asserts that all laws of nature can and should be stated in such a way that they are equally true for one observer as for another, even if these observers, with their different reference frames, are in motion relative to each other.
[There are various ways of stating the principle of the relativity of uniform motion which has been [77]thus arrived at, and which forms the basis of the Special Theory of Einstein. If we care to emphasize the rôle of mathematics and the reference frame we may say that]* [any coordinate system having a uniform rectilinear motion with respect to the bodies under observation may be interchangeably used with any other such system in describing their motions;]232 [or that the unaccelerated motion of a system of reference cannot be detected by observations made on this system alone.]194 [Or we can let this aspect of the matter go, and state the relativity postulate in a form more intelligible to the non-mathematician by simply insisting that it is impossible by any means whatever to distinguish any other than the relative motion between two systems that are moving uniformly. As Dr. Russell puts it on a later page, we can assume boldly that the universe is so constituted that uniform straight-ahead motion of an observer and all his apparatus will not produce any difference whatever in the result of any physical process or experiment of any kind.
[There are different ways to explain the principle of the relativity of uniform motion, which has been [__A_TAG_PLACEHOLDER_0__] developed and is the foundation of Einstein's Special Theory. If we want to highlight the role of mathematics and the reference frame, we can say that]* [any coordinate system that moves in a straight line at a constant speed relative to the objects being observed can be used interchangeably with any other such system to describe their motions;]232 [or that the unaccelerated motion of a reference system can’t be detected just by making observations within that system alone.]194 [Alternatively, we can simplify this idea for those who aren’t mathematicians by stating that it’s impossible to distinguish any motion other than the relative motion between two systems that are moving uniformly. As Dr. Russell notes on a later page, we can confidently assume that the universe is arranged in such a way that the uniform straight-line motion of an observer and all their instruments will not affect the outcome of any physical process or experiment.]
As we have seen, this is entirely reasonable, on philosophical grounds, until we come to consider the assumptions of the past century with regard to light and its propagation. On the basis of these assumptions we had expected the Michelson-Morley experiment to produce a result negativing the notion of universal relativity. It refused to do this, and we agree with Einstein that the best explanation is to return to the notion of relativity, rather than to invent a forced and special hypothesis to account for the experiment’s failure. But we must now investigate [78]the assumptions underlying the theory of light, and remove the one that requires the ether to serve as a universal standard of absolute motion.
As we've seen, this makes complete sense from a philosophical standpoint, until we consider the assumptions made over the past century regarding light and how it travels. Based on those assumptions, we expected the Michelson-Morley experiment to show that the idea of universal relativity was wrong. It didn’t do that, and we agree with Einstein that the best explanation is to revisit the concept of relativity, rather than come up with a forced and special hypothesis to explain the experiment's results. But now we need to examine [__A_TAG_PLACEHOLDER_0__]the assumptions behind the theory of light and eliminate the one that requires ether to act as a universal standard of absolute motion.
Light and the Aether
It is among the possibilities that the wave theory of light itself will in the end be more or less seriously modified. It is even more definitely among the possibilities that the ether will be discarded.]* [Certainly when Lord Kelvin estimates that its mass per cubic centimeter is .000,000,000,000,000,001 gram, while Sir Oliver Lodge insists that the correct figure is 1,000,000,000,000,000 grams, it is quite evident that we know so little about it that it is better to get along without it if we can.]216 [But to avoid confusion we must emphasize that Einstein makes no mention whatsoever of the ether; his theory is absolutely independent of any theory of the ether.]139 [Save as he forbids us to employ the ether as a standard of absolute motion, Einstein does not in the least care what qualities we assign to it, or whether we retain it at all. His demands are going to be made upon light itself, not upon the alleged medium of light transmission.
It’s possible that the wave theory of light may eventually be somewhat altered. It’s even more likely that the concept of ether will be abandoned. Certainly, when Lord Kelvin estimates its mass at .000000000000000001 gram per cubic centimeter, while Sir Oliver Lodge argues that the correct figure is 1,000,000,000,000,000 grams, it’s clear that our understanding of it is so limited that it might be better to do without it if possible.216 But to avoid confusion, we must emphasize that Einstein does not mention the ether at all; his theory is completely independent of any ether theory.139 Aside from saying that we shouldn’t use the ether as a standard for absolute motion, Einstein doesn’t care about the properties we ascribe to it, or whether we even keep it around. His focus is on the characteristics of light itself, not on the supposed medium for light transmission.
When two observers in relative motion to one another measure their velocities with respect to a third material object, they expect to get different results. Their velocities with regard to this object properly differ, for it is no more to be taken as a universal super-observer than either of them. But if they get different results when they come to measure the velocity with which light passes their respective [79]systems, relativity is challenged. Light is with some propriety to be regarded as a universal observer; and if it will measure our velocities against each other we cannot deny it rank as an absolute standard. If we are not prepared to abandon universal relativity, and adopt one of the “fluke” explanations for the Michelson-Morley result, we must boldly postulate that in free space light presents the same velocity C to all observers—whatever the source of the light, whatever the relative motion between source and observer, whatever the relative motion between the several observers. The departure here from the old assumption lies in the circumstance that the old physics with its ether assigned to light a velocity universally constant in this ether; we have stopped talking about the medium and have made the constant C refer to the observer’s measured value of the velocity of light with regard to himself.
When two observers moving relative to each other measure their speeds against a third physical object, they expect to get different results. Their speeds compared to this object properly differ, as it shouldn't be considered a universal super-observer any more than either of them. However, if they get different results when measuring the speed of light in their respective [__A_TAG_PLACEHOLDER_0__]systems, that challenges relativity. Light is rightly seen as a universal observer; if it measures our speeds against each other, we can't deny its status as an absolute standard. If we're not ready to give up universal relativity and accept one of the “fluke” explanations for the Michelson-Morley result, we must confidently assert that in free space, light always has the same speed C for all observers—regardless of the light's source, the relative motion between the source and the observer, or the relative motion between the different observers. The shift from the old assumption comes from the fact that the old physics with its ether assigned a universally constant speed to light in this ether; we’ve stopped discussing the medium and have made the constant C refer to the velocity of light as measured by the observer themselves.
We are fortified in this assumption by the Michelson-Morley result and by all other observations bearing directly upon the matter. Nevertheless, as Mr. Francis says in his essay, we feel instinctively that space and time are not so constituted as to make it possible, if I pass you at 100 miles per hour, for the same light-impulse to pass us both at the same speed C.]* [The implicit assumptions underlying this feeling, be they true or false, are now so interwoven with the commonly received notions of space and time that any theory which questions them has all the appearance of a fantastic and unthinkable thing.]115 [We cannot, however, go back on our relativity; so when]* [Einstein shows us that an [80]entirely new set of time and space concepts is necessary to reconcile universe relativity with this fundamental fact of the absolute constancy of the observed velocity of light in vacuo,]18 [all that is left for us to do is to inquire what revisions are necessary, and submit to them.]*
We are strengthened in this belief by the Michelson-Morley experiment and all other observations directly related to the topic. Still, as Mr. Francis points out in his essay, we have an instinctive feeling that space and time aren't structured in a way that allows me to pass you at 100 miles per hour while the same light signal passes us both at the same speed C.]* [The underlying assumptions related to this feeling, whether they are accurate or not, are now so intertwined with common ideas of space and time that any theory questioning them seems utterly bizarre and unimaginable.]115 [However, we cannot abandon our understanding of relativity; so when]* [Einstein reveals that a [__A_TAG_PLACEHOLDER_0__]completely new framework of time and space concepts is essential to align the relativity of the universe with the fundamental fact of the absolute constancy of the observed speed of light in vacuo,]18 [all we can do is examine what changes are needed and accept them.]*
[The conceptual difficulties of the theory arise principally from attributing to space and time the properties of things. No portion of space can be compared with another, save by convention; it is things which we compare. No interval of time can be compared with another, save by convention. The first has gone when the second becomes “now”.]149 [It is events that we compare, through the intervention of things. Our measurements are never of space or of time, but only of the things and the events that occupy space and time. And since the measurements which we deal with as though they were of space and of time lie at the foundation of all physical science, while at the same time themselves constituting, as we have seen, the only reality of which we are entitled to speak, it is in order to examine with the utmost care the assumptions underlying them. That there are such assumptions is clear—the very possibility of making measurements is itself an assumption, and every technique for carrying them out rests on an assumption. Let us inquire which of these it is that relativity asks us to revise.]*
[The conceptual challenges of the theory mainly come from assigning the properties of objects to space and time. You can't really compare different parts of space without a mutual agreement; we actually compare the objects instead. Similarly, you can't compare different time intervals without a common understanding. The first interval has passed when the second one becomes “now.”]149 [It's the events we compare, through the context of the objects involved. Our measurements aren't truly of space or time, but rather of the objects and events that exist within space and time. Since the measurements we treat as if they are of space and time are the basis of all physical science, and at the same time represent the only reality we can actually talk about, we must carefully examine the assumptions that underlie them. It's evident that such assumptions exist—the very ability to make measurements is an assumption in itself, and every method we use to do so relies on some assumption. Let's explore which of these assumptions relativity asks us to question or change.]
The Measurement of Time and Space
[Time is generally conceived as perfectly uniform. [81]How do we judge about it? What tells us that the second just elapsed is equal to the one following? By the very nature of time the superposition of its successive intervals is impossible. How then can we talk about the relative duration of these intervals? It is clear that any relationship between them can only be conventional.]178 [As a matter of fact, we habitually measure time in terms of moving bodies. The simplest method is to agree that some entity moves with uniform velocity. It will be considered as travelling equal distances in equal intervals of time, the distances to be measured as may be specified by our assumptions governing this department of investigation.]179 [The motions of the earth through which we ultimately define the length of day and year, the division of the former into 86,400 “equal” intervals as defined by the motions of pendulum or balance wheel through equal distances, are examples of this convention of time measurement. Even when we correct the motions of the earth, on the basis of what our clocks tell us of these motions, we are following this lead; the earth and the clocks fall out, it is plain that one of them does not satisfy our assumption of equal lengths in equal times, and we decide to believe the clock.]*
[Time is usually thought of as perfectly consistent. [__A_TAG_PLACEHOLDER_0__]How do we determine this? What indicates that the second that just passed is the same as the next one? By the very essence of time, overlapping its successive intervals is impossible. So how can we discuss the relative duration of these intervals? It's clear that any connection between them can only be conventional.]178 [In reality, we typically measure time based on moving objects. The simplest method is to agree that something moves at a constant speed. It's understood to cover equal distances in equal time intervals, with distances measured according to the assumptions we've set for this area of study.]179 [The movements of the Earth, which ultimately define the length of a day and a year, and the division of a day into 86,400 "equal" intervals—as defined by the movements of a pendulum or balance wheel over equal distances—are examples of this convention in measuring time. Even when we adjust the Earth's motions based on what our clocks indicate about those motions, we are following this approach; if the Earth and the clocks don’t align, it becomes clear that one of them doesn't meet our assumption of equal lengths in equal times, and we choose to trust the clock.]*
[The foregoing concerning time may be accepted as inherent in time itself. But concerning lengths it may be thought that we are able to verify absolutely their equality and especially their invariability. Let us have the audacity to verify this statement. We have two lengths, in the shape of two rods, which coincide perfectly when brought together. What may we conclude from this coincidence? Only that [82]the two rods so considered have equal lengths at the same place in space and at the same moment. It may very well be that each rod has a different length at different locations in space and at different times; that their equality is purely a local matter. Such changes could never be detected if they affected all objects in the universe. We cannot even ascertain that both rods remain straight when we transport them to another location, for both can very well take the same curvature and we shall have no means of detecting it.
[The previous discussion about time can be seen as a fundamental aspect of time itself. However, when it comes to lengths, we might believe that we can definitively confirm their equality and especially their consistency. Let’s take the bold step to test this claim. We have two lengths, represented by two rods, that line up perfectly when placed together. What can we conclude from this alignment? Only that [__A_TAG_PLACEHOLDER_0__]the two rods, in this specific situation, have equal lengths at the same point in space and at the same moment. It’s entirely possible that each rod has a different length at other locations in space and at different times; their equality could purely be a local occurrence. Any such variations would go unnoticed if they impacted all objects in the universe. We also can't confirm that both rods stay straight when we move them to a different location, as they could very well bend in the same way, and we wouldn’t have any way to detect it.
Euclidean geometry assumes that geometrical objects have sizes and shapes independent of position and of orientation in space, and equally invariable in time. But the properties thus presupposed are only conventional and in no way subject to direct verification. We cannot even ascertain space to be independent of time, because when comparing geometrical objects we have to conceive them as brought to the same place in space and in time.]178 [Even the statement that when they are made to coincide their lengths are equal is, after all, itself an assumption inherent in our ideas of what constitutes length. And certainly the notion that we can shift them from place to place and from moment to moment, for purposes of comparison, is an assumption; even Euclid, loose as he was from modern standards in this business of “axioms,” knew this and included a superposition axiom among his assumptions.
Euclidean geometry assumes that geometric objects have sizes and shapes that are independent of their position and orientation in space, and remain constant over time. However, these properties are merely conventional and cannot be directly verified. We can't even determine that space is independent of time, because when we compare geometric objects, we have to imagine them as being in the same location both in space and time.178 Even the claim that when they overlap their lengths are equal is just an assumption based on our understanding of what length is. The idea that we can move them around in space and time for comparison is also an assumption; even Euclid, who was relaxed by modern standards regarding "axioms," recognized this and included a superposition axiom in his assumptions.
As a matter of fact, this procedure for determining equality of lengths is not always available. It assumes, it will be noted, that we have free access to the object which is to be measured—which is to [83]say, it assumes that this object is at rest with respect to us. If it is not so at rest, we must employ at least a modification of this method; a modification that will in some manner involve the sending of signals. Even when we employ the Euclidean method of superposition directly, we must be assured that the respective ends of the lengths under comparison coincide at the same time. The observer cannot be present at both ends simultaneously; at best he can only be present at one end and receive a signal from the other end.
Actually, this method for determining the equality of lengths isn't always feasible. It assumes that we have direct access to the object being measured—specifically, that this object is stationary in relation to us. If it's not at rest, we need to use at least a modified version of this method, which will involve sending signals in some way. Even when we directly apply the Euclidean method of superposition, we need to ensure that the ends of the lengths being compared align at the same time. The observer can't be at both ends at once; at most, they can be at one end and receive a signal from the other.
Communication Issues
Accordingly, in making the necessary assumptions to cover the matter of measuring lengths, we must make one with regard to the character of the signals which are to be employed for this purpose. If we could assume a system of signalling that would consume no time in transmission all would be simple enough. But we have no experience with such a system. Even if we believe that it ought to be possible thus to transmit signals at infinite velocity, we may not, in the absence of our present ability to do this, assume that it is possible. So we may only assume, with Einstein, that for our signals we shall employ the speediest messenger with which we are at present acquainted. This of course is light, the term including any of the electromagnetic impulses that travel at the speed C.
Accordingly, when we make the necessary assumptions for measuring lengths, we need to consider the nature of the signals we will use for this purpose. If we could assume a signaling system that takes no time to transmit, everything would be straightforward. However, we have no experience with such a system. Even if we think it should be possible to send signals at infinite speed, we cannot assume it's actually possible given our current capabilities. Therefore, we can only assume, along with Einstein, that we'll use the fastest messenger we know of. This, of course, is light, which refers to any electromagnetic impulses that travel at the speed C.
Of course in the vast majority of cases the distance that any light signal in which we are interested must go to reach us is so small that the time taken [84]by its transmission can by no means be measured. We are then, to all intents and purposes, at both places—the point of origin of the signal and the point of receipt—simultaneously. But this is not the question at all. Waiving the fact that in astronomical investigations this approximation no longer holds, the fact remains that it is, in every case, merely an approximation. Approximations are all right in observations, where we know that they are approximations and act accordingly. But in the conceptual universe that parallels the external reality, computation is as good an agent of observation as visual or auditory or tactile sensation; if we can compute the error involved in a wrong procedure the error is there, regardless of whether we can see it or not. We must have methods which are conceptually free from error; and if we attempt to ignore the velocity of our light signals we do not meet this condition.
Of course, in most cases, the distance that any light signal we’re interested in has to travel to reach us is so small that the time taken for its transmission can't really be measured. So, for all practical purposes, we're at both the origin of the signal and the point of receipt at the same time. But that's not really the issue. Aside from the fact that this assumption doesn't hold in astronomical studies, it's true that it’s just an approximation. Approximations are fine in observations when we recognize they are approximations and adjust accordingly. However, in the conceptual realm that mirrors external reality, computation is as valid a method of observation as seeing, hearing, or touching; if we can calculate the error from a faulty procedure, that error exists whether we can perceive it or not. We need methods that are conceptually error-free; if we try to overlook the speed of our light signals, we fail to meet this requirement.
The measurement of lengths demands that we have a criterion of simultaneity between two remote points—remote in inches or remote in light-years, it does not matter which. There is no difficulty in defining simultaneity of two events that fall in the same point—or rather, in agreeing that we know what we mean by such simultaneity. But with regard to two events that occur in remote places there may be a question. A scientific definition differs from a mere description in that it must afford us a means of testing whether a given item comes under the definition or not. There is some difficulty in setting up a definition of simultaneity between distant events that satisfies this requirement. If we [85]try simply to fall back upon our inherent ideas of what we mean by “the same instant” we see that this is not adequate. We must lay down a procedure for determining whether two events at remote points occur at “the same instant,” and check up alleged simultaneity by means of this procedure.
Measuring lengths requires a way to determine whether two distant points are simultaneous—whether they're far apart in inches or light-years doesn't matter. It's easy to define simultaneity for two events happening at the same location—or at least, we all agree on what that means. But when it comes to events that occur in different places, things get complicated. A scientific definition needs to provide a way to test whether something fits that definition. Creating a definition of simultaneity for distant events that meets this requirement is tricky. If we just rely on our basic understanding of what “the same instant” means, we find that's not enough. We need to establish a method for figuring out if two events at different locations happen at “the same instant” and verify their claimed simultaneity using that method.
Einstein says, and we must agree with him, that he can find but one reasonable definition to cover this ground. An observer can tell whether he is located half way between two points of his observation; he can have mirrors set up at these points, send out light-signals, and note the time at which he gets back the reflection. He knows that the velocity of both signals, going and coming, is the same; if he observes that they return to him together so that their time of transit for the round trip is the same, he must accept the distances as equal. He is then at the mid-point of the line joining the two points under observation; and he may define simultaneity as follows, without introducing anything new or indeterminate: Two events are simultaneous if an observer midway between them sees them at the same instant, by means, of course, of light originating at the points of occurrence.]*
Einstein states, and we must agree with him, that he can find only one reasonable definition for this situation. An observer can determine whether they are positioned halfway between two points of observation; they can set up mirrors at these points, send out light signals, and track the time it takes for the reflections to return. They know that the speed of both signals, going and coming, is the same; if they notice that both return to them simultaneously so the total travel time is equal, they must accept the distances as equal. They are then at the midpoint of the line connecting the two points being observed; and they can define simultaneity as follows, without adding anything new or indefinite: Two events are simultaneous if an observer positioned halfway between them sees them at the same instant, using light emitted from the points where the events occur.
[It is this definition of simultaneity, coupled with the assumption that all observers, on whatever uniformly moving systems, would obtain the same experimental value for the velocity of light, that leads to the apparent paradoxes of the Special Theory of Relativity. If it be asked why we adopt it, we must in turn ask the inquirer to propose a better system for defining simultaneous events on different moving bodies.]198 [86]
[This definition of simultaneity, along with the idea that all observers in uniformly moving systems would measure the same speed of light, leads to the seeming paradoxes of the Special Theory of Relativity. If someone asks why we use this definition, we should ask them to suggest a better way to define simultaneous events on different moving bodies.]198 [__A_TAG_PLACEHOLDER_0__]
[There is nothing in this definition to indicate, directly, whether simultaneity persists for all observers, or whether it is relative, so that events simultaneous to one observer are not so to another. The question must then be investigated; and the answer, of course, will hinge upon the possibility of making proper allowances for the time of transit of the light signals that may be involved. It seems as though this ought to be possible; but a simple experiment will indicate that it is not, unless the observers involved are at rest with respect to one another.
[This definition doesn't clearly indicate whether simultaneity is the same for all observers or if it's relative, meaning that what one observer sees as simultaneous might not be seen the same way by another. We need to explore this question, and the answer will depend on whether we can properly account for the time it takes for light signals to travel. It seems like this should be possible, but a simple experiment shows that it isn't, unless the observers are at rest relative to each other.]
An Einstein Experiment
Let us imagine an indefinitely long, straight railroad track, with an observer located somewhere along it at the point M. According to the convention suggested above, he has determined points A and B in opposite directions from him along the
Let’s picture an infinitely long, straight railroad track, with an observer positioned somewhere along it at point M. Following the earlier suggestion, he has identified points A and B in opposite directions from him along the

track, and equally distant from him. We shall imagine, further, than a beneficent Providence supplies two lightning flashes, one striking at A and one at B, in such a way that observer M finds them to be simultaneous.
track, and equally far from him. Let's imagine that a kind Providence sends two lightning bolts, one hitting at A and the other at B, so that observer M perceives them as happening at the same time.
While all this is going on, a train is passing—a very long train, amply long enough to overlap the section AMB of the track. Among the passengers [87]there is one, whom we may call M′, who is directly opposite M at the instant when, according to M, the lightning strikes. Observe he is not opposite M when M sees the flashes, but a brief time earlier—at the instant when, according to M’s computation, the simultaneous flashes occurred. At this instant there are definitely determined the points A′ and B′, on the train; and since we may quite well think of the two systems—train-system and track-system—as in coincidence at this instant, M′ is midway between A′ and B′, and likewise is midway between A and B.
While all this is happening, a train is passing—a very long train, plenty long enough to cover the section AMB of the track. Among the passengers [__A_TAG_PLACEHOLDER_0__] is one, whom we can call M′, who is directly across from M at the moment when, according to M, the lightning strikes. It's important to note that he is not across from M when M actually sees the flashes, but a moment earlier—at the instant when, according to M happened. Right now, the points A′ and B′ on the train are clearly defined; and since we can view the two systems—the train system and the track system—as aligned at this moment, M′ is positioned halfway between A′ and B′, and also halfway between A and B.
Now if we think of the train as moving over the track in the direction of the arrow, we see very easily that M′ is running away from the light from A and toward that from B, and that, despite—or if you prefer because of—the uniform velocity of these light signals, the one from B reaches him, over a slightly shorter course, sooner than the one from A, over the slightly longer course. When the light signals reach M, M′ is no longer abreast of him but has moved along a wee bit, so that at this instant when M has the two signals, one of these has passed M′ and the other has yet to reach him. The upshot is that the events which were simultaneous to M are not so to M′.
Now, if we think of the train moving along the track in the direction of the arrow, it’s clear that M′ is moving away from the light coming from A and toward the light from B. Because of the constant speed of these light signals, the one from B reaches him sooner over a slightly shorter path than the one from A, which takes a slightly longer path. By the time the light signals reach M, M′ has moved forward a little, so at that moment when M receives the two signals, one of them has passed M′, and the other hasn’t reached him yet. The point is that the events that were simultaneous for M are not simultaneous for M′.
It will probably be felt that this result is due to our having, somewhat unjustifiably and inconsistently, localized on the train the relative motion between train and track. But if we think of the track as sliding back under the train in the direction opposite to the arrow, and carrying with it the points A and B; and if we remember that this in no way [88]affects M’s observed velocity of light or the distances AM and BM as he observes them: we can still accept his claim that the flashes were simultaneous. Then we have again the same situation: when the flashes from A and from B reach M at the same moment, in his new position a trifle to the left of his initial position of the diagram, the flash from A has not yet reached M′ in his original position while that from B has passed him. Regardless of what assumption we make concerning the motion between train-system and track-system, or more elegantly regardless of what coordinate system we use to define that motion, the event at B precedes that at A in the observation of M′. If we introduce a second train moving on the other track in the opposite direction, the observer on it will of course find that the flash at A precedes that at B—a disagreement not merely as to simultaneity but actually as to the order of two events! If we conceive the lightning as striking at the points A′ and B′ on the train, these points travel with M′ instead of with M; they are fixed to his coordinate system instead of to the other. If you carry out the argument now, you will find that when the flashes are simultaneous to M′, the one at A precedes that at B in M’s observation.
It might be thought that this result comes from us somewhat unfairly and inconsistently localizing on the train the relative motion between the train and the track. But if we visualize the track sliding back under the train in the opposite direction of the arrow, bringing along the points A and B; and if we remember that this does not affect M’s observation of the speed of light or the distances AM and BM as he sees them: we can still agree with his claim that the flashes were simultaneous. So, we have the same situation again: when the flashes from A and from B reach M at the same time, in his new position slightly to the left of his original spot in the diagram, the flash from A has not yet reached M' in his original position while the one from B has already passed him. Regardless of which assumption we make about the motion between the train system and the track system, or more elegantly, regardless of which coordinate system we use to define that motion, the event at B happens before the event at A in the observation of M′. If we introduce a second train moving on the other track in the opposite direction, the observer on it will, of course, find that the flash at A happens before the flash at B—a disagreement not just about simultaneity but actually about the order of two events! If we imagine the lightning striking at the points A′ and B′ on the train, these points move with M′ instead of with M; they are tied to his coordinate system rather than the other. If you follow through with the argument now, you will find that when the flashes are simultaneous for M′, the flash at A happens before the one at B in M’s observation.
A large number of experiments more or less similar in outline to this one can be set up to demonstrate the consequences, with regard to measured values of time and space, of relative motion between two observers. I do not believe that a multiplicity of such demonstrations contributes to the intelligibility of the subject, and it is for this reason that I have cut loose from immediate dependence upon [89]the essayists in this part of the discussion, concentrating upon the single experiment to which Einstein himself gives the place of importance.
A lot of experiments that are somewhat similar to this one can be designed to show the effects of relative motion between two observers on measured values of time and space. I don’t think having a bunch of these demonstrations makes the topic any clearer, which is why I've stepped away from relying on the essayists in this section of the discussion, focusing instead on the one experiment that Einstein considers most significant.
Who’s right?
We may permit Mr. Francis to remind us here that neither M nor M′ may correct his observation to make it accord with the other fellow’s. The one who does this is admitting that the other is at absolute rest and that he is himself in absolute motion; and this cannot be. They are simply in disagreement as to the simultaneity of two events, just as two observers might be in disagreement about the distance or the direction of a single event. This can mean nothing else than that, under the assumptions we have made, simultaneity is not an absolute characteristic as we had supposed it to be, but, like distance and direction, is in fact merely a relation between observer and objective, and therefore depends upon the particular observer who happens to be operating and upon the reference frame he is using.
We can let Mr. Francis remind us that neither M nor M′ can change his observation to match the other person’s. The one who does this is admitting that the other is completely still while they themselves are in motion, which isn't possible. They just disagree about whether two events happen at the same time, similar to how two observers might disagree about the distance or direction of a single event. This means that, based on the assumptions we've made, simultaneity isn’t an absolute quality like we thought; instead, like distance and direction, it's just a relationship between the observer and the objective, depending on the specific observer and the reference frame they are using.
But this is serious. My time measurements depend ultimately upon my space measurements; the latter, and hence both, depend closely upon my ideas of simultaneity. Yours depend upon your reading of simultaneity in precisely the same way.]* [Suppose the observer on the track, in the above experiment, wants to measure the length of something on the car, or the observer on the car something on the track. The observer, or his assistant, must be at both ends of the length to be measured at the same time, or get simultaneous reports in some way from [90]these ends; else they will obtain false results. It is plain, then, that with different criteria of what the “same time” is, the observers in the two systems may get different values for the measured lengths in question.]220
But this is serious. My time measurements ultimately depend on my space measurements; the latter, and therefore both, are closely tied to my understanding of simultaneity. Yours are based on your interpretation of simultaneity in the exact same way. Suppose the observer on the track, in the experiment above, wants to measure something on the car, or the observer on the car wants to measure something on the track. The observer, or their assistant, has to be at both ends of the length being measured at the same time, or they need to receive simultaneous reports from those ends; otherwise, they'll get incorrect results. It's clear, then, that with different criteria for what “the same time” means, the observers in the two systems may come up with different values for the lengths being measured.
[Who is right? According to the principle of relativity a decision on this question is absolutely impossible. Both parties are right from their own points of view; and we must admit that two events in two different places may be simultaneous for certain observers, and yet not simultaneous for other observers who move with respect to the first ones. There is no contradiction in this statement, although it is not in accordance with common opinion, which believes simultaneousness to be something absolute. But this common opinion lacks foundation. It cannot be proved by direct perception, for simultaneity of events can be perceived directly,]24 [and in a manner involving none of our arbitrary assumptions,]* [only if they happen at the same place; if the events are distant from each other, their simultaneity or succession can be stated only through some method of communicating by signals. There is no logical reason why such a method should not lead to different results for observers who move with regard to one another.
[Who is right? According to the principle of relativity, deciding this question is completely impossible. Both sides are correct from their own perspectives; we have to acknowledge that two events occurring in different places can be simultaneous for some observers and yet not simultaneous for others who are moving relative to the first group. There’s no contradiction in this statement, even though it goes against the common belief that simultaneity is an absolute concept. But this widely accepted view has no real basis. It can’t be proven through direct perception, because the simultaneity of events can only be directly perceived,]24 [and in a way that doesn’t rely on arbitrary assumptions,]* [if they occur in the same location; if the events are far apart, we can only determine their simultaneity or sequence through some method of signaling. There’s no logical reason why such a method couldn’t yield different results for observers who are in motion relative to one another.
From what we have said, it follows immediately that in the new theory not only the concept of simultaneousness but also that of duration is revealed as dependent on the motion of the observer.]24 [Demonstration of this should be superfluous; it ought to be plain without argument that if two observers cannot agree whether two instants are [91]the same instant or not, they cannot agree on the interval of time between instants. In the very example which we have already examined, one observer says that a certain time-interval is zero, and another gives it a value different from zero. The same thing happens whenever the observers are in relative motion.]* [Two physicists who measure the duration of a physical process will not obtain the same result if they are in relative motion with regard to one another.
From what we've discussed, it immediately follows that in the new theory, not only the idea of simultaneity but also that of duration is shown to depend on the observer's motion.24 [This should be obvious; it should be clear without any debate that if two observers can't agree on whether two moments are the same or not, they can't agree on the amount of time between those moments. In the example we already looked at, one observer claims that a certain time interval is zero, while another assigns it a value that is not zero. The same situation occurs whenever the observers are in relative motion.][__A_TAG_PLACEHOLDER_0__] [Two physicists measuring the duration of a physical process will not get the same result if they are in relative motion to each other.]
They will also find different results for the length of a body. An observer who wants to measure the length of a body which is moving past him must in one way or another hold a measuring rod parallel to its motion and mark those points on his rod with which the ends of the body come into simultaneous coincidence. The distance between the two marks will then indicate the length of the body. But if the two markings are simultaneous for one observer, they will not be so for another one who moves with a different velocity, or who is at rest, with regard to the body under observation. He will have to ascribe a different length to it. And there will be no sense in asking which of them is right: length is a purely relative concept, just as well as duration.]24
They will also find different results for the length of an object. An observer who wants to measure the length of a body moving past them must somehow position a measuring stick parallel to its motion and mark the points on the stick where the ends of the object align. The distance between these marks will indicate the length of the object. However, if these markings are simultaneous for one observer, they won't be for another observer moving at a different speed or who is stationary relative to the object. That observer will have to assign a different length to it. And there’s no point in debating who is correct: length is a purely relative concept, just like duration.24
The Relativity of Time and Space
[The degree to which distance and time become relative instead of absolute quantities under the Special Theory of Relativity can be stated very definitely. In the first place, we must point out [92]that the relativity of lengths applies with full force only to lengths that lie parallel to the direction of relative motion. Those that lie exactly perpendicular to that direction come out the same for both observers; those that lie obliquely to it show an effect, depending upon the angle, which of course becomes greater and greater as the direction of parallelism is approached.
[The extent to which distance and time become relative rather than absolute concepts in the Special Theory of Relativity is quite clear. First, we should note [__A_TAG_PLACEHOLDER_0__] that the relativity of lengths fully applies only to lengths that are aligned with the direction of relative motion. Lengths that are perfectly perpendicular to that direction remain unchanged for both observers; lengths that are at an angle to it exhibit an effect that depends on the angle, which obviously increases as the alignment with the direction of motion improves.]
The magnitude of the effect is easily demonstrated, but with this demonstration we
do not need to be concerned here. It turns out that if an observer moving with a system
finds that a certain time interval in the system is T seconds and that a certain length in the system is L inches, then an observer moving parallel with L and with a velocity v relative to the system will find for these the respective values and
, where
C in this expression of course represents the velocity of light. It will be noted that
the fraction
is ordinarily very small; that the expression under the radical is therefore less
than 1 but by a very slight margin; and that the entire expression K is itself therefore less than 1 but by an even slighter margin. This means, then,
that the observer outside the system finds the lengths in the system to be a wee bit
shorter and the time intervals a wee bit longer than does the observer in the system.
Another way of putting the matter is based, ultimately, upon the fact that in order
for the observer in the system to get the larger value for distance and the smaller [93]value for time, his measuring rod must go into the distance under measurement more
times than that of the moving observer, while his clock must beat a longer second
in order that less of them shall be recorded in a given interval between two events.
So it is often said that the measuring rod as observed from without is contracted
and the clock runs slow. This does not impress me as a happy statement, either in
form or in content.]*
The extent of the effect is easy to show, but we don’t need to focus on that here. If an observer moving with a system notices that a specific time interval in the system is T seconds and that a specific length in the system is L inches, then another observer moving parallel to L and at a speed v relative to the system will measure the values and
. Here,
C in this equation represents the speed of light. It’s worth noting that the fraction
is usually very small; therefore, the expression under the square root is slightly less than 1, and the overall value K is also slightly less than 1. This implies that the outside observer finds the lengths in the system to be a little shorter and the time intervals to be a little longer compared to what the observer in the system measures. Another way to explain it is that in order for the observer in the system to measure a greater distance and a smaller [__A_TAG_PLACEHOLDER_0__] value for time, their measuring stick must reach the distance being measured more times than the moving observer's, while their clock must take longer to tick a second so that fewer ticks are counted in the time between two events. Thus, it's often said that the measuring stick observed from outside appears contracted and that the clock ticks more slowly. However, I don’t think this is a clear statement, either in its wording or its meaning.
[The argument that these formulae are contradicted by human experience can be refuted by examining a concrete instance. If a train is 1,000 feet long at rest, how long will it be when running a mile a minute?]232 [I have quoted this question exactly as it appears in the essay from which it is taken, because it is such a capital example of the objectionable way in which this business is customarily put. For the statement that lengths decrease and time-intervals increase “with velocity” is not true in just this form. The velocity, to have meaning, must be relative to some external system; and it is the observations from that external system that are affected. So long as we confine ourselves to the system in which the alleged modifications of size are stated as having taken place, there is nothing to observe that is any different from what is usual; there is no way to establish that we are enjoying a velocity, and in fact within the intent of the relativity theory we are not enjoying a velocity, for we are moving with the objects which we are observing. It is inter-systemic observations, and these alone, that show the effect. When we travel with the system under observation, we get the same results as any [94]other observer on this system; when we do not so travel, we must conduct our observations from our own system, in relative motion to the other, and refer our results to our system.
[The idea that these formulas clash with human experience can be disproven by looking at a specific example. If a train is 1,000 feet long when it's at rest, how long will it be when it's moving at a mile a minute?]232 [I’ve quoted this question exactly as it’s presented in the essay because it's a perfect case of how this issue is usually expressed in an unhelpful way. The claim that lengths shrink and time intervals lengthen “with velocity” isn't accurate in that form. For velocity to make sense, it has to be relative to some outside system; and it’s the observations from that outside system that are influenced. As long as we stay within the system where these supposed changes in size are said to happen, there’s nothing to observe that’s any different from usual; we can’t prove we’re moving at a certain velocity, and in fact, according to relativity theory, we’re not moving with respect to what we’re observing since we’re traveling alongside it. Only inter-systemic observations show this effect. When we move with the system being observed, we get the same results as any [__A_TAG_PLACEHOLDER_0__]other observer in that system; when we don’t, we have to make our observations from our own system, which is in motion relative to the other one, and relate our findings to our own system.
Now when no particular observer is specified, we must of course assume an observer
connected with the train, or with whatever the body mentioned. To that observer it
doesn’t make the slightest difference what the train does; it may stand at rest with
respect to some external system or it may move at any velocity whatsoever; its length
remains always 1,000 feet. In order for this question to have the significance which
its propounder means it to have, I must restate it as follows: A train is 1,000 feet
long as measured by an observer travelling with it. If it passes a second observer
at 60 miles per hour, what is its length as observed by him? The answer is now easy.]*
[According to the formula the length of the moving train as seen from the ground will
be
feet, a change entirely too small for detection by the most delicate instruments.
Examination of the expression K shows that in so far as terrestrial movements of material
objects are concerned it is equal to 1]232 [within a far smaller margin than we can ever hope to make our observations. Even
the diameter of the earth, as many of the essayists point out, will be shortened only
2½ inches for an outside observer past whom it rushes with its orbital speed [95]of 18.5 miles per second. But slight as the difference may be in these familiar cases,
its scientific importance remains the same.]*
Now, when no specific observer is mentioned, we have to assume that there’s an observer connected to the train or whatever body is being referred to. For that observer, it doesn’t matter at all what the train does; it could be stationary relative to some external system or moving at any speed; its length is always 1,000 feet. To give this question the significance that the person asking intends, I need to rephrase it like this: A train is 1,000 feet long as measured by an observer riding on it. If it passes a second observer at 60 miles per hour, what is its length as seen by that observer? The answer is straightforward now. According to the formula, the length of the moving train as seen from the ground will be
feet, a change that’s too small for even the most sensitive instruments to detect. Analyzing the expression K shows that regarding the movements of material objects on Earth, it is equal to 1]232 [within a margin much smaller than what we could ever realistically measure. As many essayists have noted, even the diameter of the Earth will only be reduced by 2½ inches for an outside observer moving past it at its orbital speed [__A_TAG_PLACEHOLDER_0__] of 18.5 miles per second. But even though the difference may be small in these familiar situations, its scientific significance remains unchanged.]*
Relativity and Reality
[A simple computation shows that this effect is exactly the amount suggested by Lorentz and Fitzgerald to explain the Michelson-Morley experiment.]188 [This ought not to surprise us, since both that explanation and the present one are got up with the same purpose. If they both achieve that purpose they must, numerically, come to the same thing in any numerical case. It is, however, most emphatically to be insisted that the present “shortening” of lengths]* [no longer appears as a “physical” shortening caused by absolute motion through the ether but is simply a result of our methods of measuring space and time.]188 [Where Fitzgerald and Lorentz had assumed that a body in motion has its dimensions shortened in the direction of its motion,]220 [this very form of statement ceases to possess significance under the relativity assumption.]* [For if we cannot tell which of two bodies is moving, which one is shortened? The answer is, both—for the other fellow. For each frame of reference there is a scale of length and a scale of time, and these scales for different frames are related in a manner involving both the length and the time.]220 [But we must not yield to the temptation to say that all this is not real; the confinement of a certain scale of length and of time to a single observation system does not in the least make it unreal.]* [The situation [96]is real—as real as any other physical event.]165
[A simple calculation shows that this effect is exactly the amount suggested by Lorentz and Fitzgerald to explain the Michelson-Morley experiment.]188 [This shouldn't surprise us, since both that explanation and the current one are created with the same objective. If they both achieve that goal, they must, numerically, arrive at the same conclusion in any numerical case. It is, however, crucial to emphasize that the current “shortening” of lengths] * [no longer appears as a “physical” shortening caused by absolute motion through the ether but is simply a result of our methods of measuring space and time.]188 [Where Fitzgerald and Lorentz assumed that a moving object has its dimensions shortened in the direction of its motion,] 220 [this kind of statement loses its meaning under the relativity assumption.] * [For if we can't tell which of two bodies is moving, which one is shortened? The answer is, both— for the other guy. For each frame of reference, there is a scale for length and a scale for time, and these scales for different frames are related in a way that involves both length and time.] 220 [But we must not give in to the temptation to say that all this isn’t real; the restriction of a certain scale of length and time to a single observation system doesn’t make it unreal at all.] * [The situation [__A_TAG_PLACEHOLDER_0__]is real—as real as any other physical event.]165
[The word physical is used in two senses in the above paragraph. It is denied that the observed variability in lengths indicates any “physical” contraction or shrinkage; and on the heels of this it is asserted that this observed variability is of itself an actual “physical” event. It is difficult to express in words the distinction between the two senses in which the term physical is employed in these two statements, but I think this distinction ought to be clear once its existence is emphasized. There is no material contraction; it is not right to say that objects in motion contract or are shorter; they are not shorter to an observer in motion with them. The whole thing is a phenomenon of observation. The definitions which we are obliged to lay down and the assumptions which we are obliged to make in order, first, that we shall be able to measure at all, and second, that we shall be able to escape the inadmissible concept of absolute motion, are such that certain realities which we had supposed ought to be the same for all observers turn out not to be the same for observers who are in relative motion with respect to one another. We have found this out, and we have found out the numerical relation which holds between the reality of the one observer and that of the other. We have found that this relation depends upon nothing save the relative velocity of the two observers. As good a way of emphasizing this as any is to point out that two observers who have the same velocity with respect to the system under examination (and whose mutual relative velocity is therefore zero) will always get [97]the same results when measuring lengths and times on that system. The object does not go through any process of contraction; it is simply shorter because it is observed from a station with respect to which it is moving. Similar remarks might be made about the time effect; but the time-interval is not so easily visualized as a concrete thing and hence does not offer such temptation for loose statement.
[The term physical is used in two ways in the paragraph above. It is stated that the observed variability in lengths doesn’t indicate any "physical" contraction or shrinking; and immediately after, it is claimed that this variability is, in itself, an actual "physical" event. It’s tricky to put into words the difference between the two ways the term physical is used in these statements, but this difference should be clear once its existence is pointed out. There is no material contraction; it’s incorrect to say that objects in motion contract or become shorter; they don’t seem shorter to an observer moving with them. The entire situation is a phenomenon of observation. The definitions and assumptions we have to establish to first be able to measure anything and second to avoid the not allowed idea of absolute motion lead to specific realities that we thought would be the same for all observers but turn out to be different for observers in relative motion with respect to one another. We’ve discovered this, along with the numerical relationship that exists between the realities of different observers. We found that this relationship depends solely on the relative velocity between the two observers. A clear way to emphasize this is to note that two observers with the same velocity concerning the system being examined (and whose mutual relative velocity is therefore zero) will always obtain [__A_TAG_PLACEHOLDER_0__] the same results when measuring lengths and times in that system. The object doesn’t undergo any process of contraction; it just appears shorter because it is observed from a point of view where it is in motion. Similar points could be made about the effect on time; however, the time interval is not as easily visualized as something concrete and thus doesn’t lead to loose statements as readily.]
The purely relative aspect of the matter is further brought out if we consider a single example both backwards and forwards. Systems S and S′ are in relative motion. An object in S which to an observer in S is L units long, is shorter for an observer in S′—shorter by an amount indicated through the “correction factor” K. Now if we have, in the first instance, made the objectionable statement that objects are shorter in system S′ than they are in S, it will be quite natural for us to infer from this that objects in S must be longer than those in S′; and from this to assert that when the observer in S measures objects lying in S′, he gets for them greater lengths than does the home observer in S′. But if we have, in the first instance, avoided the objectionable statement referred to, we shall be much better able to realize that the whole business is quite reciprocal; that the phenomena are symmetric with respect to the two systems, to the extent that we can interchange the systems in any of our statements without modifying the statements in any other way.
The completely relative nature of the situation becomes clearer when we look at a single example from both directions. Systems S and S′ are in relative motion. An object in S that appears to be L units long to an observer in S seems shorter to an observer in S′—by an amount defined by the “correction factor” K. If, initially, we make the incorrect claim that objects are shorter in system S′ than in S, it's easy to conclude that objects in S must be longer than those in S′; and that when the observer in S measures objects in S′, they get longer measurements than the observer in S′ does. However, if we avoid that problematic statement from the start, we can better understand that the entire situation is reciprocal; that the phenomena are symmetric with respect to both systems, to the point that we can swap the systems in any of our statements without changing the meaning in any other way.
Objects in S appear shorter and times in S appear longer to the external “moving” observer in S′ than they do to the domestic observer in S. Exactly in the same way, objects in S′ appear shorter to observers [98]in the foreign system S than to the home observer in S′, who remains at rest with respect to them. I think that when we get the right angle upon this situation, it loses the alleged startling character which has been imposed upon it by many writers. The “apparent size” of the astronomer is an analogy in point. Objects on the moon, by virtue of their great distance, look smaller to observers on the earth than to observers on the moon. Do objects on the earth, on this account, look larger to a moon observer than they do to us? They do not; any suggestion that they do we should receive with appropriate scorn. The variation in size introduced by distance is reciprocal, and this reciprocity does not in the least puzzle us. Why, then, should that introduced by relative motion puzzle us?
Objects in S seem shorter, and time in S seems longer to the external “moving” observer in S′ compared to how they appear to the stationary observer in S. Similarly, objects in S′ look shorter to observers [__A_TAG_PLACEHOLDER_0__] in the foreign system S than they do to the home observer in S′, who stays at rest relative to them. I believe that when we view this situation from the right perspective, it loses the exaggerated shock that many writers have assigned to it. The “apparent size” of the astronomer serves as a relevant analogy. Objects on the moon, due to their vast distance, appear smaller to observers on Earth than they do to those on the moon. Do objects on Earth, therefore, look larger to a moon observer than they do to us? They do not; any suggestion that they do should be met with proper disdain. The change in size caused by distance is reciprocal, and this reciprocity doesn’t confuse us at all. So why should the changes caused by relative motion confuse us?
Time and Space in One Package
Our old, accustomed concepts of time and space, which have grown up through countless generations of our ancestors, and been handed down to us in the form in which we are familiar with them, leave no room for a condition where time intervals and space intervals are not universally fixed and invariant. They leave no room for us to say that]* [one cannot know the time until he knows where he is, nor where he is until he knows the time,]220 [nor either time or place until he knows something about velocity. But in this concise formulation of the difference between what we have always believed and what we have seen to be among the consequences of Einstein’s postulates of the universal [99]relativity of uniform motion, we may at once locate the assumption which, underlying all the old ideas, is the root of all the trouble. The fact is we have always supposed time and space to be absolutely distinct and independent entities.]*
Our old, familiar ideas of time and space, developed over countless generations and passed down to us in forms we recognize, don’t allow for a situation where time and space intervals aren’t universally fixed and unchanging. They don’t give us the space to say that one can’t know the time until they know where they are, nor where they are until they know the time, nor either time or place until they know something about speed. But in this clear summary of the difference between what we’ve always believed and what we’ve found to be true from Einstein’s principles of the universal relativity of uniform motion, we can immediately identify the assumption that underlies all the old ideas and is the root of all the problems. The truth is, we’ve always thought of time and space as completely separate and independent things.
[The concept of time has ever been one of the most absolute of all the categories. It is true that there is much of the mysterious about time; and philosophers have spent much effort trying to clear up the mystery—with unsatisfactory results. However, to most persons it has seemed possible to adopt an arbitrary measure or unit of duration and to say that this is absolute, independent of the state of the body or bodies on which it is used for practical purposes.]272 [Time has thus been regarded as something which of itself flows on regularly and continuously, regardless of physical events concerning matter.]150 [In other words, according to this view, time is not affected by conditions or motions in space.]272 [We have deliberately chosen to ignore the obvious fact that time can never appear to us, be measured by us, or have the least significance for us, save as a measure of something that is closely tied up with space and with material space-dimensions. Not merely have we supposed that time and space are separated in nature as in our easiest perceptions, but we have supposed that they are of such fundamentally distinct character that they can never be tied up together. In no way whatever, assumes the Euclidean and Newtonian intellect, may space ever depend upon time or time upon space. This is the assumption which we must remove in order to attain universal relativity; and while it may come [100]hard, it will not come so hard as the alternative. For this alternative is nothing other than to abandon universal relativity. This course would leave us with logical contradictions and discrepancies that could not be resolved by any revision of fundamental concepts or by any cleaning out of the Augean stables of old assumptions; whereas the relativity doctrine as built up by Einstein requires only such a cleaning out in order to leave us with a strictly logical and consistent whole. The rôle of Hercules is a very difficult one for us to play. Einstein has played it for the race at large, but each of us must follow him in playing it for himself.
[The idea of time has always been one of the most absolute concepts. It’s true that time has an air of mystery about it; philosophers have invested a lot of effort trying to unravel that mystery—with limited success. Still, for most people, it has seemed feasible to adopt an arbitrary measurement or unit of time and to claim that this is absolute, independent of the state of any bodies that it's applied to for practical purposes.]272 [Thus, time has been seen as something that flows continuously and regularly, regardless of physical events related to matter.]150 [In other words, from this perspective, time isn't influenced by conditions or movements in space.]272 [We have intentionally chosen to overlook the obvious fact that time can never be perceived, measured, or hold any significance for us, except as a measure of something that is closely tied to space and material dimensions. We have not only assumed that time and space are separate in nature as they appear to us easily, but we have also assumed that they are so fundamentally different that they can never be interconnected. In no way, as assumed by the Euclidean and Newtonian mindset, can space depend on time or time on space. This is the assumption we must dismantle to achieve universal relativity; and while it may be challenging, it won’t be as difficult as the alternative. That alternative would mean abandoning universal relativity altogether. Choosing that path would leave us with logical contradictions and inconsistencies that couldn't be fixed by any revisions of fundamental concepts or by cleaning out the mess of old assumptions; whereas the theory of relativity as developed by Einstein only requires such a cleanup to present us with a clear and logical framework. The task of Hercules is a very challenging one for us to undertake. Einstein has taken it on for humanity as a whole, but each of us must individually follow his lead in tackling it for ourselves.]
Additional Consequences
I need not trespass upon the subject matter of those essays which appear in full by
going here into any details with regard to the manner in which time and space are
finally found to depend upon one another and to form the parts of a single universal
whole. But I may appropriately point out that if time and space are found to be relative,
we may surely expect some of the less fundamental concepts that depend upon them to
be relative also. In this expectation we are not disappointed. For one thing,]* [mass
has always been assumed to be a constant, independent of any motion or energy which
it might possess. Just as lengths and times depend upon relative motion, however,
it is found that mass, which is the remaining factor in the expression for energy
due to motion, also depends upon relative velocities. The dependence is such that
if a body [101]takes up an amount of energy E with respect to a certain system, the body behaves, to measurements made from that system, as though its mass had been increased by an amount , where C is as usual the velocity of light.]194
I won't go into the details of the essays that discuss how time and space ultimately depend on each other and come together to form a single universal whole. However, I can point out that if time and space are found to be relative, we can also expect some of the less fundamental concepts that depend on them to be relative as well. This expectation holds true. For instance, mass has always been thought of as a constant, not affected by any motion or energy it may have. Just as lengths and times rely on relative motion, it turns out that mass, which is the remaining factor in the expression for energy due to motion, also depends on relative velocities. Specifically, if a body [__A_TAG_PLACEHOLDER_0__] gains an amount of energy E relative to a certain system, the body acts, from the perspective of measurements made from that system, as if its mass has increased by an amount , where C is the speed of light. ]194
[This should not startle us. The key to the situation lies in the italicized words above, which indicate that the answer to the query whether a body has taken up energy or not depends upon the seat of observation. If I take up my location on the system S, and you on the system S′, and if we find that we are in relative motion, we must make some assumption about the energy which was necessary, initially, to get us into this condition. Suppose we are on two passing trains.]* [The chances are that either of us will assume that he is at rest and that it is the other train which moves, although if sufficiently sophisticated one of us may assume that he is moving and that the other train is at rest.]272 [Whatever our assumption, whatever the system, the localization of the energy that is carried in latent form by our systems depends upon this assumption. Indeed, if our systems are of differing mass, our assumptions will even govern our ideas of the amount of energy which is represented by our relative motion; if your system be the more massive, more energy would have to be localized in it than in mine to produce our relative motion. If we did not have the universal principle of relativity to forbid, we might make an arbitrary assumption about our motions and hence about our respective latent energies; in the presence of this veto, the only chance of adjustment lies in our masses, which must differ according to whether [102]you or I observe them.]*
[This shouldn't surprise us. The key to the situation lies in the italicized words above, which indicate that whether a body has taken up energy or not depends on the observer's perspective. If I position myself on system S and you on system S' , and we find ourselves in relative motion, we must make some assumption about the energy that was initially required to place us in this condition. Imagine we are on two passing trains.]* [It's likely that either of us will think of ourselves as at rest while believing the other train is moving, although if we're sophisticated enough, one of us may recognize that we are the ones in motion while the other train is at rest.]272 [Regardless of our assumptions or the system we are in, the localization of the energy carried in latent form by our systems depends on these assumptions. In fact, if our systems have different masses, our assumptions will even influence our understanding of the amount of energy represented by our relative motion; if your system is more massive, more energy would need to be localized in it than in mine to produce our relative motion. If there were no universal principle of relativity preventing us, we might make arbitrary assumptions about our movements and thus about our respective latent energies; with this limitation in place, our only chance of adjustment lies in our masses, which must differ based on whether [__A_TAG_PLACEHOLDER_0__]you or I observe them.]*
[For most of the velocities with which we are familiar is, like the difference between K and unity, such an extremely small quantity that the most delicate measurements fail
to detect it. But the electrons in a highly evacuated tube and the particles shot
out from radioactive materials attain in some cases velocities as high as eight-tenths
that of light. When we measure the mass of such particles at different velocities
we find that it actually increases with the velocity, and in accordance with the foregoing
law.]194 [This observation, in fact, antedates Einstein’s explanation, which is far more satisfactory
than the earlier differentiation between “normal mass” and “electrical mass” which
was called upon to account for the increase.]*
[For most of the speeds we're familiar with, is, similar to the difference between K and one, such an extremely small amount that even the most sensitive measurements can't detect it. However, electrons in a highly evacuated tube and particles ejected from radioactive materials can reach velocities as high as eighty percent of the speed of light. When we measure the mass of such particles at different speeds, we find that it actually increases with the speed, in line with the previously mentioned law.]194 [This observation actually predates Einstein’s explanation, which is much more satisfying than the older distinction between “normal mass” and “electrical mass” that was used to explain the increase.]*
[But if the quantity is to be considered as an actual increase in mass, may it not be possible that all
mass is energy? This would lead to the conclusion that the energy stored up in any mass is
. The value is very great, since C is so large; but it is in good agreement with the internal energy of the atom as
calculated from other considerations. It is obvious that conservation of mass and
of momentum cannot both hold good under a theory that translates the one into the
other. Mass is then not considered by Einstein as conservative in the ordinary sense,
but it is the total quantity of mass plus energy in any closed system that remains
constant. Small amounts of energy may be transformed into mass, and vice versa.]194
[But if the quantity is seen as an actual increase in mass, could it be possible that all mass is energy? This would imply that the energy contained in any mass is
. This value is quite large since C is so significant; however, it aligns well with the internal energy of the atom as determined by other factors. It's clear that conservation of mass and momentum can't both be valid in a theory that converts one into the other. Thus, Einstein doesn't regard mass as conserved in the usual way, but rather it is the total amount of mass plus energy in any closed system that stays constant. Small amounts of energy can be converted into mass, and vice versa.]194
[Other features of the theory which are often displayed as consequences are really more in the [103]nature of assumptions. It will be recalled that when we had agreed upon the necessity of employing signals of some sort, we selected as the means of signalling the speediest messenger with which we happened to be acquainted. Our subsequent difficulties were largely due to the impossibility of making a proper allowance for this messenger’s speed, even though we knew its numerical value; and as a consequence, this speed enters into our formulae. Now we have not said in so many words that C is the greatest speed attainable, but we have tacitly assumed that it is. We need not, therefore, be surprised if our formulae give us absurd results for speeds higher than C, and indicate the impossibility of ever attaining these. Whatever we put into a problem the algebra is bound to give us back. If we look at our formula for K, we see that in the event of v equalling C, lengths become zero and times infinite. The light messenger itself, then, has no dimension; and for it time stands still.
[Other features of the theory that are often shown as outcomes are actually more about the [__A_TAG_PLACEHOLDER_0__]nature of assumptions. Remember that when we agreed on the need to use some kind of signals, we chose the fastest messenger we knew of as our means of signaling. Our later challenges mostly came from the difficulty of properly accounting for this messenger’s speed, even though we knew its exact value; and as a result, this speed is part of our formulas. Now, we haven't explicitly stated that C is the highest speed possible, but we've implicitly assumed that it is. So, we shouldn't be surprised if our formulas give us nonsensical results for speeds greater than C, indicating that it's impossible to reach them. Whatever we input into a problem, the algebra will return to us. Looking at our formula for K, we find that if v equals C, lengths shrink to zero and time becomes infinite. The light messenger itself, then, has no dimensions; for it, time is at a standstill.
If we suppose v to be greater than C, we get even more bizarre results, for then the factor K is the square root of a negative number, or as the mathematician calls it an “imaginary” quantity; and with it, lengths and times become imaginary too.
If we assume v is greater than C, we end up with even stranger results, because then the factor K is the square root of a negative number, or as mathematicians put it, an “imaginary” quantity; and with that, lengths and times also become imaginary.
The fact that time stops for it, and the fact that it is the limiting velocity, give
to C certain of the attributes of the mathematician’s infinity. Certainly if it can never
be exceeded, we must have a new formula for the composition of velocities. Otherwise
when my system passes yours at a speed of 100,000 miles per second, while yours passes
a third in the same direction at the same velocity, I shall be [104]passing this third framework at the forbidden velocity of 200,000 miles per second—greater
than C. In fact Einstein is able to show that an old formula, which had already been found
to connect the speed of light in a material medium with the speed of that medium, will now serve universally for the composition
of velocities. When we combine the velocities v and u, instead of getting the resultant as we would have supposed, we get the resultant
or
The fact that time stops for it and that it is the ultimate speed gives C some traits of a mathematician's infinity. Since it can never be surpassed, we need a new formula for combining speeds. Otherwise, when my system overtakes yours at 100,000 miles per second, while yours moves a third in the same direction at the same speed, I will be [__A_TAG_PLACEHOLDER_0__] surpassing this third frame at the impossible speed of 200,000 miles per second—greater than C. In fact, Einstein shows that an old formula, which already connected the speed of light in a material medium to the speed of that medium, will now work universally for combining speeds. When we combine the speeds v and u, instead of getting the resultant that we might expect, we get the resultant
or
This need not surprise us either, if we will but reflect that the second velocity effects a second revision of length and time measurements between the systems involved. And now, if we let either v, or u, or even both of them, take the value C, the resultant still is C. In another way we have found C to behave like the mathematician’s infinity, to which, in the words of the blind poet, if we add untold thousands, we effect no real increment.
This shouldn’t surprise us, especially if we consider that the second speed leads to a second adjustment of length and time measurements between the systems involved. Now, if we let either v, or you, or even both, equal C, the result is still C. In another sense, we’ve discovered that C behaves like the mathematician’s concept of infinity, to which, as the blind poet said, if we add countless thousands, we still see no real increase.
Assumption and Outcome
A good many correspondents who have given the subject sufficient thought to realize that the limiting character of the velocity C is really read into Einstein’s system by assumption have written, in more or less perturbed inquiry, to know whether this does not invalidate the whole structure. The answer, of course, is yes—provided you can show this assumption [105]to be invalid. The same answer may be made of any scientific doctrine whatever, and in reference to any one of the multitudinous assumptions underlying it. If we were to discover, tomorrow, a way of sending signals absolutely instantaneously, Einstein’s whole structure would collapse as soon as we had agreed to use this new method. If we were to discover a signalling agent with finite velocity greater than that of light, relativity would persist with this velocity written in its formulae in the place of C.
A lot of correspondents who have thought carefully about this issue understand that the limiting nature of the velocity C is something that’s assumed in Einstein’s theory. They have written, sometimes quite anxiously, to ask if this assumption undermines the entire framework. The answer is yes—if you can prove that this assumption [__A_TAG_PLACEHOLDER_0__] is invalid. The same response applies to any scientific principle, concerning any of the numerous assumptions that support it. If we were to discover a way to send signals instantly by tomorrow, Einstein’s entire theory would fall apart as soon as we agreed to use this new method. If we found a signaling medium with a finite speed greater than that of light, relativity would still hold, but with this new speed replacing C in its equations.
It is a mistake to quote Einstein’s theory in support of the statement that such a velocity can never be. An assumption proves its consequences, but never can prove itself; it must remain always an assumption. But in the presence of long human experience supporting Einstein’s assumption that no velocity in excess of C can be found, it is fair to demand that it be disputed not with argument but with demonstration. The one line of argument that would hold out a priori hope of reducing the assumption to an absurdity would be one based on the familiar idea of adding velocities; but Einstein has spiked this argument before it is started by replacing the direct addition of velocities with another method of combining them that fits in with his assumption and as well with the observed facts. The burden of proof is then on the prosecution; anyone who would contradict our assertion that C is the greatest velocity attainable may do so only by showing us a greater one. Until this has been done, the admission that it may properly be attempted can in no way be construed as a confession of weakness on the part of Einstein. [106]
It’s a mistake to use Einstein’s theory to argue that such a speed can never exist. An assumption can prove its consequences, but it can never prove itself; it will always remain an assumption. However, given the extensive human experience backing Einstein’s assumption that no speed greater than C can exist, it’s reasonable to expect that it be challenged not with arguments but with evidence. The only line of reasoning that might offer a chance of rendering the assumption absurd would be one based on the common idea of adding velocities; however, Einstein has already invalidated this argument by replacing direct addition of speeds with a method that aligns with his assumption and the observed facts. The responsibility of proof lies with the challenger; anyone who wants to contradict our claim that C is the maximum speed achievable can only do so by demonstrating a greater one. Until this is accomplished, the recognition that it could possibly be attempted cannot be seen as a weakness on Einstein’s part. [__A_TAG_PLACEHOLDER_0__]
It may be well to point out that in no event may analogy be drawn with sound, as many have tried to do. In the first place sound requires a material medium and its velocity with regard to this rather than relative to the observer we know to be fixed; in the second place, requiring a material medium, sound is not a universal signalling agent; in the third place, we know definitely that its velocity can be exceeded, and are therefore barred from making the assumption necessary to establish the analogy. The very extraordinary behavior of light in presenting a velocity that is the same for all observers, and in refusing to betray the least material evidence of any medium for its transmission, rather fortifies us in believing that Einstein’s assumption regarding the ultimate character of this velocity is in accord with the nature of things.
It’s important to note that we can’t compare this to sound, despite what many people have tried to do. First, sound needs a physical medium to travel through, and its speed is related to that medium rather than to a fixed observer; second, since it requires a physical medium, sound isn't a universal way to signal; third, we definitely know that light's speed can be surpassed, which stops us from assuming what we need to make that comparison. The unusual behavior of light, which shows a speed that is the same for all observers and doesn’t show any physical evidence of a medium for its transmission, strengthens our belief that Einstein’s idea about the fundamental nature of this speed aligns with reality.
Relativity for the Everyday Person
A great deal can be said in the direction of general comment making the Special Theory and its surprising accompaniments easier of acceptance, and we shall conclude the present discussion by saying some of these things.]* [It has been objected that the various effects catalogued above are only apparent, due to the finite velocity of light—that the real shape and size of a body or the real time of an event cannot be affected by the point of view or the motion of an observer. This argument would be perfectly valid, if there were real times and distances; but there are not. These are earth-bound notions, due to our experience on an apparently motionless platform, [107]with slow-moving bodies. Under these circumstances different observations of the same thing or of the same event agree. But when we no longer have the solid earth to stand on, and are dealing with velocities so high that the relativity effects become appreciable, there is no standard by which to resolve the disagreements. No one of the observations can claim to be nearer reality than any other. To demand the real size of a thing is to demand a stationary observer or an instantaneous means of information. Both are impossible.
A lot can be said about general comments that make the Special Theory and its surprising aspects easier to accept, and we'll wrap up this discussion by touching on some of these points. It has been argued that the various effects listed above are just apparent, caused by the finite speed of light—that the actual shape and size of an object or the actual timing of an event cannot be influenced by the perspective or motion of an observer. This argument would hold true if there were actual times and distances; but there aren't. These are concepts tied to our earthly experiences, based on an apparently motionless frame of reference, [__A_TAG_PLACEHOLDER_0__]with slow-moving objects. In such cases, different observations of the same thing or event tend to agree. But when we step away from solid ground and look at speeds so high that the effects of relativity become significant, there's no standard to settle the differences. None of the observations can claim to represent reality more accurately than the others. To ask for the true size of something is to ask for a stationary observer or instantaneous information, and both of those are impossible.
When relativity asks us to give up our earth-bound notions of absolute space and absolute time the sensation, at first, is that we have nothing left to stand on. So must the contemporaries of Columbus have felt when told that the earth rested on—nothing. The remedy too is similar. Just as they had to be taught that falling is a local affair, that the earth is self-contained, and needs no external support—so we must be taught that space and time standards are local affairs. Each moving body carries its own space and time standards with it; it is self-contained. It does not need to reach out for eternal support, for an absolute space and time that can never quite be attained. All we ever need to know is the relation of the other fellow’s space and time standards to our own. This is the first thing relativity teaches us.]141
When relativity asks us to let go of our earth-centered ideas of absolute space and absolute time, it can feel like we have nothing solid to stand on. This must have been how Columbus's contemporaries felt when they were told that the earth was resting on—nothing. The solution is pretty much the same. Just as they had to learn that falling is specific to a location, that the earth is complete on its own, and doesn't need anything else to hold it up—so we need to learn that concepts of space and time are also local. Every moving object carries its own space and time with it; it is complete in itself. It doesn't need to reach out for some eternal support or an absolute space and time that can never really be reached. All we really need to understand is how someone else's space and time relate to ours. This is the first lesson that relativity teaches us.141
[The consequences of Einstein’s assumptions have led many to reject the theory of relativity, on the ground that its conclusions are contrary to common sense—as they undoubtedly are. But to the contemporaries of Copernicus and Galileo the theory that [108]the earth rotates on its axis and revolves around the sun was contrary to common sense; yet this theory prevailed. There is nothing sacred about common sense; in the last analysis its judgments are based on the accumulated experience of the human race. From the beginning of the world up to the present generation, no bodies were known whose velocities were not extremely small compared with that of light. The development of modern physics has led to discovery of very much larger velocities, some as high as 165,000 miles per second. It is not to be wondered at that such an enlargement of our experience requires a corresponding enlargement or generalization of the concepts of space and time. Just as the presupposition of primitive man that the earth was flat had to be given up in the light of advancing knowledge, so we are now called upon to give up our presupposition that space and time are absolute and independent in their nature.
[The consequences of Einstein’s assumptions have led many to dismiss the theory of relativity, arguing that its conclusions clash with common sense—which they undoubtedly do. However, to the contemporaries of Copernicus and Galileo, the idea that the earth rotates on its axis and orbits the sun also seemed contrary to common sense; yet this theory ultimately triumphed. Common sense isn’t sacred; ultimately, its judgments stem from the cumulative experience of humanity. From the dawn of time up to now, no objects were known that moved at speeds significantly close to that of light. The advancement of modern physics has revealed much larger speeds, some reaching up to 165,000 miles per second. It’s not surprising that such an expansion of our experience necessitates a corresponding broadening or rethinking of our concepts of space and time. Just as early humans had to abandon the belief that the earth was flat as knowledge progressed, we are now challenged to let go of the belief that space and time are absolute and independent in nature.
The reader must not expect to understand the theory of relativity in the sense of making it fit in with his previous ideas. If the theory be right these ideas are wrong and must be modified, a process apt to be painful.]223 [All the reader can do is to become familiar with the new concepts, just as a child gets used to the simple relations and quantities he meets until he “understands” them.]221 [Mr. Francis has said something of the utmost significance when he points out that “understanding” really means nothing in the world except familiarity and accustomedness.]* [The one thing about the relativity doctrine that we can hope thus to understand at once and without pain is the logical process used [109]in arriving at our results.]221 [Particularly is it hard to give a satisfactory explanation of the theory in popular language, because the language itself is based on the old concepts; the only language which is really adequate is that of mathematics.]223 [Unless we have, in addition to the terms of our ordinary knowledge, a set of definitions that comes with a wide knowledge of mathematics and a lively sense of the reality of mathematical constructions, we are likely to view the theory of relativity through a fog of familiar terms suddenly become self-contradictory and deceptive. Not that we are unfamiliar with the idea that some of our habitual notions may be wrong; but knowledge of their illusory nature arises and becomes convincing only with time. We may now be ready to grant that the earth, seemingly so solid, is really a whirling globe rushing through space; but we are no more ready immediately to accept the bald assertion that this space is not what it seems than our ancestors were to accept the idea that the earth was round or that it moved.]156 [What we must have, if we are to comprehend relativity with any degree of thoroughness, is the mathematician’s attitude toward his assumptions, and his complete readiness to swap one set of assumptions for another as a mere part of the day’s work, the spirit of which I have endeavored to convey in the chapter on non-Euclidean geometry.]*
The reader shouldn't expect to understand the theory of relativity by trying to fit it into their existing ideas. If the theory is correct, those ideas are wrong and need to change, which can be a challenging process. All the reader can do is get used to the new concepts, much like a child learns to grasp the simple relationships and quantities they encounter until they "understand" them. Mr. Francis made an important point when he said that “understanding” really just means being familiar with something and having become accustomed to it. The one thing about the relativity theory that we can hope to grasp quickly and without discomfort is the logical process used in arriving at our conclusions. It is especially difficult to explain the theory in everyday language because that language is based on older concepts; the only language that is truly adequate is mathematics. Unless we have, in addition to our common terminology, a set of definitions supported by a strong understanding of mathematics and a clear sense of the reality of mathematical concepts, we are likely to perceive the theory of relativity through a haze of familiar terms that suddenly appear contradictory and misleading. While we might acknowledge that some of our usual beliefs could be incorrect, true understanding of their deceptive nature develops gradually over time. We might be prepared to accept that the earth, which seems solid, is actually a spinning ball speeding through space; however, we’re not immediately ready to believe the blunt claim that this space isn’t what it appears to be, just as our ancestors had difficulty accepting that the earth was round or that it moved. To truly grasp relativity with any depth, we need to adopt the mathematician’s mindset towards their assumptions and their openness to replacing one set of assumptions with another as a normal part of their work, a mindset that I've tried to convey in the chapter on non-Euclidean geometry.
Physics vs. Metaphysics
[The ideas of relativity may seem, at first sight, to be giving us a new and metaphysical theory of [110]time and space. New, doubtless; but certainly the theory was meant by its author to be quite the opposite of metaphysical. Our actual perception of space is by measurement, real and imagined, of distances between objects, just as our actual perception of time is by measurement. Is it not less metaphysical to accept space and time as our measurements present them to us, than to invent hypotheses to force our perceptual space into an absolute space that is forever hidden from us?]182 [In order not to be metaphysical, we must eliminate our preconceived notions of space and time and motion, and focus our attention upon the indications of our instruments of observation, as affording the only objective manifestations of these qualities and therefore the only attributes which we can consider as functions of observed phenomena.]47 [Einstein has consistently followed the teachings of experience, and completely freed himself from metaphysics.]114 [That this is not always easy to do is clear, I think, if we will recall the highly metaphysical character often taken by the objections to action-at-a-distance theories and concepts; and if we will remind ourselves that it was on purely metaphysical grounds that Newton refused to countenance Huyghens’ wave theory of light. Whether, as in the one case, it leads us to valid conclusions, or, as in the other, to false ones, metaphysical reasoning is something to avoid. Einstein, I think, has avoided it about as thoroughly as anyone ever did.]* [111]
[The ideas of relativity might, at first glance, seem to be presenting us with a new, metaphysical theory of [__A_TAG_PLACEHOLDER_0__]time and space. New, for sure; but the author intended this theory to be quite the opposite of metaphysical. Our actual understanding of space comes from measuring, both real and imagined, the distances between objects, just as our understanding of time comes from measurement. Isn’t it less metaphysical to accept space and time as our measurements show them to us than to create theories that try to fit our perceived space into an absolute space that’s always out of reach?]182 [To avoid being metaphysical, we need to set aside our preconceived ideas about space, time, and motion, and concentrate on what our observation instruments indicate, as they provide the only objective evidence of these qualities and are the only attributes we can consider as functions of observed phenomena.]47 [Einstein has consistently adhered to the lessons from experience and completely distanced himself from metaphysics.]114 [It’s clear that this isn’t always easy, especially if we remember how often the objections to action-at-a-distance theories have a highly metaphysical tone; and if we recall that Newton dismissed Huyghens’ wave theory of light based on purely metaphysical reasons. Whether it leads us to valid conclusions, as in one case, or to false ones, as in the other, metaphysical reasoning is something we should steer clear of. I believe Einstein has avoided it as thoroughly as anyone could.]* [__A_TAG_PLACEHOLDER_0__]
V
THAT PARALLEL POSTULATE
Modern Geometric Methods; the Dividing Line Between Euclidean and Non-Euclidean; and the Significance of the Latter
Modern Geometric Methods: The Gap Between Euclidean and Non-Euclidean Geometry and the Significance of the Non-Euclidean Approach
The science of geometry has undergone a revolution of which the outsider is not informed, but which it is necessary to understand if we are to attain any comprehension of the geometric formulation of Einstein’s results; and especially if we are to appreciate why it is proper and desirable to formulate these results geometrically at all. The classical geometer regarded his science from a narrow viewpoint, as the study of a certain set of observed phenomena—those of the space about us, considered as an entity in itself and divorced from everything in it. It is clear that some things about that space are not as they appear (optical illusions), and that other things about it are true but by no means apparent (the sum-of-squares property of a right triangle, the formulæ for surface and volume of a sphere, etc.). While many things about space are “obvious,” these need in the one case disproof and in the other discovery and proof. With all their love of mental processes for their own sake, it is [112]then not surprising that the Greeks should have set themselves the task of proving by logical process the properties of space, which a less thoughtful folk would have regarded as a subject only for observational and experimental determination.
The field of geometry has completely changed in ways that outsiders might not know about, but understanding this change is crucial if we want to grasp the geometric interpretation of Einstein’s findings. It's especially important to see why it makes sense and is beneficial to present these results geometrically. Traditional geometers looked at their field from a limited perspective, treating it as the study of specific observed phenomena—our surrounding space viewed as an isolated entity, separate from everything within it. It's clear that some aspects of that space can be misleading (like optical illusions), while others are true but not immediately obvious (such as the sum of squares in a right triangle and the formulas for the surface area and volume of a sphere). While many properties of space seem “obvious,” they actually require disproof or, in the case of others, discovery and proof. Given their deep appreciation for reasoning and thought processes, it’s not surprising that the Greeks took on the challenge of logically proving the properties of space, which a less reflective group might have thought only required observation and experimentation.
But, abstract or concrete, the logical structure must have a starting point; and it is fair to demand that this consist in a statement of the terms we are going to use and the meanings we are going to attach to them. In other words, the first thing on the program will be a definition, or more probably, several definitions.
But whether it's abstract or concrete, the logical structure needs a starting point, and it's reasonable to expect that this should include a clear statement of the terms we’re going to use and the meanings we’re going to assign to them. In other words, the first thing in the program will be a definition, or more likely, several definitions.
Now the modern scientist has a somewhat iconoclastic viewpoint toward definitions, and especially toward the definition of his very most fundamental ideas.
Now, the modern scientist has a somewhat unconventional perspective on definitions, especially regarding his most fundamental concepts.
We do not speak here in terms of dictionary definitions. These have for object the eminent necessity of explaining the meaning and use of a word to some one who has just met it for the first time. It is easy enough to do this, if the doer possesses a good command of the language. It is not even a matter of grave concern that the words used in the definition be themselves known to the reader; if they are not, he must make their acquaintance too. Dr. Johnson’s celebrated definition of a needle stands as perpetual evidence that when he cannot define a simple thing in terms of things still simpler, the lexicographer is forced to define it in terms of things more complex. Or we might demonstrate this by noting that the best dictionaries are driven to define such words as “and” and “but” by using such complex [113]notions as are embodied in “connective,” “continuative,” “adversative,” and “particle.”
We’re not talking about dictionary definitions here. Those are aimed at explaining what a word means and how to use it for someone encountering it for the first time. This is pretty straightforward if the person explaining has a solid grasp of the language. It's not a big deal if the words in the definition aren’t familiar to the reader; if they aren’t, they’ll have to learn those too. Dr. Johnson’s famous definition of a needle shows that when he can’t break down a simple concept using even simpler terms, the lexicographer has to explain it using more complex ideas. We can also see this with the best dictionaries, which end up defining words like “and” and “but” with complicated ideas like “connective,” “continuative,” “adversative,” and “particle.”
It is otherwise with the scientist who undertakes to lay down a definition as the basis of further procedure in building up the tissue of his science. Here a degree of rigorous logic is called for which would be as superfluous in the dictionary as the effort there to attain it would be out of place. The scientist, in building up a logical structure that will withstand every assault, must define everything, not in terms of something which he is more or less warranted in supposing his audience to know about, but actually in terms of things that have already been defined. This really means that he must explain what he is talking about in terms of simpler ideas and simpler things, which is precisely what the lexicographer does not have to worry about. This is why it is quite trivial to quote a dictionary definition of time or space or matter or force or motion in settlement of a controversy of scientific or semi-scientific nature.
It’s different for the scientist who sets out to create a definition as the foundation for further work in developing the framework of his science. Here, a strict logic is needed that would be unnecessary in a dictionary, where such an effort would feel out of place. The scientist, while constructing a logical framework that can endure any challenge, must define everything, not based on what he assumes his audience already knows, but referring to things that have been clearly defined already. This really means he needs to explain what he’s discussing in terms of simpler concepts and basic ideas, which is something the dictionary writer doesn’t have to worry about. That’s why it’s quite simplistic to use a dictionary definition of time, space, matter, force, or motion to resolve a scientific or semi-scientific debate.
Terms We Can't Define
But the scientist who attempts to carry out this ideal system of defining everything in terms of what precedes meets one obstacle which he cannot surmount directly. Even a layman can construct a passable definition of a complex thing like a parallelopiped, in terms of simpler concepts like point, line, plane and parallel. But who shall define point in terms of something simpler and something which precedes point in the formulation of geometry? The scientist is embarrassed, not in handling the complicated later parts of his work, but in the very beginning, [114]in dealing with the simplest concepts with which he has to deal.
But the scientist trying to create this ideal system of defining everything based on what comes before faces one challenge that he can't overcome directly. Even a non-expert can come up with a decent definition of a complex shape like a parallelepiped using simpler ideas like point, line, plane, and parallel. But who will define a point in terms of something simpler and something that comes before the point in geometry? The scientist struggles, not with the complicated later parts of his work, but right at the beginning, [__A_TAG_PLACEHOLDER_0__]when dealing with the simplest concepts he has to work with.
Suppose a dictionary were to be compiled with the definitions arranged in logical rather than alphabetical order: every word to be defined by the use only of words that have already been defined. The further back toward the beginning we push this project, the harder it gets. Obviously we can never define the first word, or the second, save as synonymous with the first. In fact we should need a dozen words, more or less, to start with—God-given words which we cannot define and shall not try to define, but of which we must agree that we know the significance. Then we have tools for further procedure; we can start with, say, the thirteenth word and define all the rest of the words in the language, in strictly logical fashion.
Suppose a dictionary was put together with the definitions organized in logical order instead of alphabetical order: every word would be defined using only words that have already been defined. The further back toward the start we go with this project, the more challenging it becomes. Obviously, we can never define the first word, or the second, except as same meaning with the first. In fact, we would need about a dozen words to begin with—essential words that we cannot define and won't try to define, but that we must agree that we know the meaning of. Then we have the tools for moving forward; we can start with, say, the thirteenth word and define all the other words in the language, in a strictly logical way.
What we have said about definitions applies equally to statements of fact, of the sort which are going to constitute the body of our science. In the absence of simpler facts to cite as authority, we shall never be able to prove anything, however simple this may itself be; and in fact the simpler it be, the harder it is to find something simpler to underlie it. If we are to have a logical structure of any sort, we must begin by laying down certain terms which we shall not attempt to define, and certain statements which we shall not try to prove. Mathematics, physics, chemistry—in the large and in all their many minor fields—all these must start somewhere. Instead of deceiving ourselves as to the circumstances surrounding their start, we prefer to be quite frank in recognizing that they start where we decide [115]to start them. If we don’t like one set of undefined terms as the foundation, by all means let us try another. But always we must have such a set.
What we’ve said about definitions applies just as much to statements of fact, which will make up the core of our science. Without simpler facts to use as authority, we can’t prove anything, no matter how simple it is; and in fact, the simpler it is, the harder it is to find something even simpler to support it. If we want any kind of logical structure, we need to start by establishing certain terms that we won’t define and certain statements that we won’t prove. Mathematics, physics, chemistry—in all their branches—must begin somewhere. Rather than fooling ourselves about the circumstances of their beginnings, we prefer to be honest in acknowledging that they start where we decide [__A_TAG_PLACEHOLDER_0__] to start them. If we don’t like one set of undefined terms as the foundation, then let’s try another. But we must always have such a set.
The classical geometer sensed the difficulty of defining his first terms. But he supposed that he had met it when he defined these in words free of technical significance. “A point is that which has position without size” seemed to him an adequate definition, because “position” and “size” are words of the ordinary language with which we may all be assumed familiar. But today we feel that “position” and “size” represent ideas that are not necessarily more fundamental than those of “line” and “point,” and that such a definition begs the question. We get nowhere by replacing the undefined terms “point” and “line” and “plane,” which really everybody understands, by other undefined terms which nobody understands any better.
The classic geometer recognized the challenge of defining his basic terms. However, he thought he addressed it when he defined them using everyday language. "A point is something that has a position but no size" seemed like a good definition to him because "position" and "size" are common words that everyone knows. But today, we realize that "position" and "size" are not necessarily simpler concepts than "line" and "point," and this type of definition avoids the issue. We don't make progress by swapping out the undefined terms "point," "line," and "plane," which everyone truly understands, for other undefined terms that no one understands any better.
In handling the facts that it was inconvenient to prove, the classical geometer came closer to modern practice. He laid down at the beginning a few statements which he called “axioms,” and which he considered to be so self-evident that demonstration was superfluous. That the term “self-evident” left room for a vast amount of ambiguity appears to have escaped him altogether. His axioms were axioms solely because they were obviously true.
In dealing with the facts that were hard to prove, the classical geometer moved closer to modern practices. He started by stating a few things he called “axioms,” which he believed were so obviously true that proving them was unnecessary. It seems he completely overlooked that the term “obviously true” allowed for a lot of ambiguity. His axioms were considered axioms simply because they seemed true.
Laying the groundwork
The modern geometer falls in with Euclid when he writes an elementary text, satisfying the beginner’s demand for apparent rigor by defining point [116]and line in some fashion. But when he addresses to his peers an effort to clarify the foundations of geometry to a further degree of rigor and lucidity than has ever before been attained, he meets these difficulties from another quarter. In the first place he is always in search of the utmost possible generality, for he has found this to be his most effective tool, enabling him as it does to make a single general statement take the place and do the work of many particular statements. The classical geometer attained generality of a sort, for all his statements were of any point or line or plane. But the modern geometer, confronted with a relation that holds among points or between points and lines, at once goes to speculating whether there are not other elements among or between which it holds. The classical geometer isn’t interested in this question at all, because he is seeking the absolute truth about the points and lines and planes which he sees as the elements of space; to him it is actually an object so to circumscribe his statements that they may by no possibility refer to anything other than these elements. Whereas the modern geometer feels that his primary concern is with the fabric of logical propositions that he is building up, and not at all with the elements about which those propositions revolve.
The modern geometer agrees with Euclid when he writes an introductory text, meeting beginners' need for clear definitions of point [__A_TAG_PLACEHOLDER_0__] and line. However, when he works with his colleagues to clarify the foundations of geometry with greater precision and clarity than ever before, he encounters challenges from a different angle. First, he always seeks the greatest possible generality because he has found that it is his most effective tool, allowing one general statement to serve the function of many specific ones. The classical geometer achieved some level of generality, as all his statements applied to any point, line, or plane. But the modern geometer, faced with relationships among points or between points and lines, immediately wonders if there are other elements where these relationships also apply. The classical geometer isn't interested in this question at all because he is focused on uncovering absolute truths about the points, lines, and planes that he views as the building blocks of space; his goal is to frame his statements so that they can only refer to these elements. In contrast, the modern geometer believes that his main focus is on the structure of the logical statements he is creating, rather than the specific elements those statements are based on.
It is of obvious value if the mathematician can lay down a proposition true of points, lines and planes. But he would much rather lay down a proposition true at once of these and of numerous other things; for such a proposition will group more phenomena under a single principle. He feels that on pure [117]scientific grounds there is quite as much interest in any one set of elements to which his proposition applies as there is in any other; that if any person is to confine his attention to the set that stands for the physicist’s space, that person ought to be the physicist, not the geometer. If he has produced a tool which the physicist can use, the physicist is welcome to use it; but the geometer cannot understand why, on that ground, he should be asked to confine his attention to the materials on which the physicist employs that tool.
It’s clearly valuable if a mathematician can state a proposition that’s true for points, lines, and planes. But they would prefer to establish a proposition that’s true for these and many other things at the same time, as this type of proposition can explain more phenomena under a single principle. They believe that, from a purely scientific standpoint, there’s just as much interest in any particular set of elements their proposition applies to as there is in any other. If someone is going to focus solely on the set relevant to a physicist's space, that should be the physicist, not the geometer. If the mathematician has created a tool that the physicist can use, the physicist is welcome to utilize it; however, the geometer can’t understand why they should be expected to limit their focus to the materials that the physicist uses that tool on.
It will be alleged that points and lines and planes lie in the mathematician’s domain, and that the other things to which his propositions may apply may not so lie—and especially that if he will not name them in advance he cannot expect that they will so lie. But the mathematician will not admit this. If mathematics is defined on narrow grounds as the science of number, even the point and line and plane may be excluded from its field. If any wider definition be sought—and of course one must be—there is just one definition that the mathematician will accept: Dr. Keyser’s statement that “mathematics is the art or science of rigorous thinking.”
It will be claimed that points, lines, and planes fall under the mathematician’s scope, while other things that his propositions might relate to do not—especially if he doesn't define them beforehand, he can't expect them to fit. But the mathematician will reject this. If mathematics is narrowly defined as the study of numbers, even points, lines, and planes could be excluded from its domain. If a broader definition is sought—and it certainly should be—there's only one definition the mathematician will accept: Dr. Keyser’s assertion that “mathematics is the art or science of rigorous thinking.”
The immediate concern of this science is the means of rigorous thinking—undefined terms and definitions, axioms and propositions. Its collateral concern is the things to which these may apply, the things which may be thought about rigorously—everything. But now the mathematician’s domain is so vastly extended that it becomes more than ever important for him to attain the utmost generality in all his pronouncements. [118]
The main focus of this science is on how to think rigorously—using undefined terms and definitions, axioms, and propositions. Another area of interest is the things these concepts can apply to, which is essentially everything that can be thought about rigorously. However, the mathematician’s field has grown so large that it’s more crucial than ever for him to be as general as possible in all his statements. [__A_TAG_PLACEHOLDER_0__]
One barrier to such generalization is the very name “geometry,” with the restricted significance which its derivation and long usage carry. The geometer therefore must have it distinctly understood that for him “geometry” means simply the process of deducing a set of propositions from a set of undefined primitive terms and axioms; and that when he speaks of “a geometry” he means some particular set of propositions so deduced, together with the axioms, etc., on which they are based. If you take a new set of axioms you get a new geometry.
One obstacle to this generalization is the term “geometry,” which has a specific meaning based on its origin and long history. So, the geometer needs to be clear that for him, “geometry” just refers to the process of deriving a set of propositions from a collection of undefined basic terms and axioms. When he talks about “a geometry,” he’s referring to a specific set of propositions derived in that way, along with the axioms and other foundations they rely on. If you start with a different set of axioms, you end up with a different geometry.
The geometer will, if you insist, go on calling his undefined terms by the familiar names “point,” “line,” “plane.” But you must distinctly understand that this is a concession to usage, and that you are not for a moment to restrict the application of his statements in any way. He would much prefer, however, to be allowed new names for his elements, to say “We start with three elements of different sorts, which we assume to exist, and to which we attach the names A, B and C—or if you prefer, primary, secondary and tertiary elements—or yet again, names possessing no intrinsic significance at all, such as ching, chang and chung.” He will then lay down whatever statements he requires to serve the purposes of the ancient axioms, all of these referring to some one or more of his elements. Then he is ready for the serious business of proving that, all his hypotheses being granted, his elements A, B and C, or I, II and III, or ching, chang and chung, are subject to this and that and the other propositions. [119]
The geometer will, if you insist, continue to call his undefined terms by the familiar names “point,” “line,” and “plane.” But you need to clearly understand that this is a concession to common usage, and you should not limit the application of his statements in any way. He would much rather be allowed to use new names for his elements, saying, “We start with three elements of different types, which we assume exist, and we name them A, B, and C—or if you prefer, primary, secondary, and tertiary elements—or even names that have no intrinsic meaning, like ching, chang, and chung.” He will then establish whatever statements he needs to meet the purposes of the ancient axioms, all of which will refer to one or more of his elements. After that, he is ready to get down to the serious task of proving that, assuming all his hypotheses are granted, his elements A, B, and C, or I, II, and III, or ching, chang, and chung, are subject to this, that, and the other propositions. [__A_TAG_PLACEHOLDER_0__]
The objection will be urged that the mathematician who does all this usurps the place of the logician. A little reflection will show this not to be the case. The logician in fact occupies the same position with reference to the geometer that the geometer occupies with reference to the physicist, the chemist, the arithmetician, the engineer, or anybody else whose primary interest lies with some particular set of elements to which the geometer’s system applies. The mathematician is the tool-maker of all science, but he does not make his own tools—these the logician supplies. The logician in turn never descends to the actual practice of rigorous thinking, save as he must necessarily do this in laying down the general procedures which govern rigorous thinking. He is interested in processes, not in their application. He tells us that if a proposition is true its converse may be true or false or ambiguous, but its contrapositive is always true, while its negative is always false. But he never, from a particular proposition “If A is B then C is D,” draws the particular contrapositive inference “If C is not D then A is not B.” That is the mathematician’s business.
The argument will be made that the mathematician who does all this takes the place of the logician. A bit of thought will reveal that this isn't true. The logician actually occupies the same role in relation to the geometer that the geometer has in relation to the physicist, the chemist, the arithmetician, the engineer, or anyone else whose main focus is a specific set of elements to which the geometer’s system applies. The mathematician is the toolmaker of all science, but he doesn’t create his own tools—those are provided by the logician. The logician, in turn, rarely engages in the actual practice of rigorous thinking, except when he must do so to establish the general procedures that govern rigorous thinking. He is focused on processes, not their application. He tells us that if a proposition is true, its converse might be true, false, or ambiguous, but its contrapositive is always true, while its negative is always false. However, he never derives a specific contrapositive inference like “If C is not D then A is not B” from a particular proposition like “If A is B then C is D.” That’s the mathematician’s role.
The Role of Geometry
The mathematician is the quantity-production man of science. In his absence, the worker in each narrower field where the elements under discussion take particular concrete forms could work out, for himself, the propositions of the logical structure that applies to those elements. But it would then be found that the engineer had duplicated the work of the physicist, and so for many other cases; for the [120]whole trend of modern science is toward showing that the same background of principles lies at the root of all things. So the mathematician develops the fabric of propositions that follows from this, that and the other group of assumptions, and does this without in the least concerning himself as to the nature of the elements of which these propositions may be true. He knows only that they are true for any elements of which his assumptions are true, and that is all he needs to know. Whenever the worker in some particular field finds that a certain group of the geometer’s assumptions is true for his elements, the geometry of those elements is ready at hand for him to use.
The mathematician is the person who produces quantities in science. Without him, specialists in narrower fields could figure out the logical propositions related to their specific elements on their own. However, this would lead to situations where engineers end up repeating the work of physicists, and many other similar scenarios arise; because the overall direction of modern science is to demonstrate that the same fundamental principles underlie everything. So, the mathematician creates the framework of propositions based on various groups of assumptions, without worrying about the specific nature of the elements these propositions may apply to. He only needs to know that they hold true for any elements that meet his assumptions, and that's sufficient. Whenever a specialist in a particular area discovers that a certain set of the geometer’s assumptions is valid for his elements, the geometry relevant to those elements is readily available for him to use.
Now it is all right purposely to avoid knowing what it is that we are talking about, so that the names of these things shall constitute mere blank forms which may be filled in, when and if we wish, by the names of any things in the universe of which our “axioms” turn out to be true. But what about these axioms themselves? When we lay them down in ignorance of the identity of the elements to which they may eventually apply, they cannot by any possibility be “self-evident.” We may, at pleasure, accept as self-evident a statement about points and lines and planes; or one about electrons, centimeters and seconds; or one about integers, fractions, and irrational numbers; or one about any other concrete thing or things whatever. But we cannot accept as self-evident a statement about chings, changs and chungs. So we must base our “axioms” on some other ground than this; and our modern geometer has his ground ready and waiting. He accepts his [121]axioms on the ground that it pleases him to do so. To avoid all suggestion that they are supposed to be self-evident, or even necessarily true, he drops the term “axiom” and substitutes for it the more color-less word “postulate.” A postulate is merely something that we agreed to accept, for the time being, as a basis of further argument. If it turns out to be true, or if we can find circumstances under which and elements to which it applies, any conclusions which we deduce from it by trustworthy processes are valid within the same limitations. And the propositions which tell us that, if our postulates are true, such and such conclusions are true—they, too, are valid, but without any reservation at all!
Now it’s perfectly fine to purposely avoid knowing what we’re actually talking about, so that the names of these things will serve as blank forms to be filled in when and if we choose with the names of any things in the universe where our “axioms” turn out to be true. But what about these axioms themselves? When we set them down without knowing the identity of the elements they might eventually apply to, they can’t possibly be “self-evident.” We might, at will, accept as self-evident a statement about points, lines, and planes; or about electrons, centimeters, and seconds; or about integers, fractions, and irrational numbers; or about any other concrete things at all. But we can’t accept as self-evident a statement about chings, changs, and chungs. So we need to base our “axioms” on something other than this; and our modern geometer has his foundation ready and waiting. He accepts his [__A_TAG_PLACEHOLDER_0__]axioms simply because he chooses to do so. To avoid any suggestion that they are meant to be self-evident, or even necessarily true, he drops the term “axiom” and replaces it with the more neutral term “postulate.” A postulate is just something we agree to accept, for now, as a foundation for further argument. If it turns out to be true, or if we can find situations where it applies, then any conclusions we draw from it through reliable processes are valid within the same limitations. And the propositions that tell us that, if our postulates are true, then certain conclusions are true—they, too, are valid, with no reservations whatsoever!
Perhaps an illustration of just what this means will not be out of place. Let it be
admitted, as a postulate, that is greater, by 1, than
. Let us then consider the statement: “If
, then
.” We know—at least we are quite certain—that
is not equal to 65, if by “7” and “19” and “65” we mean what you think we mean. We
are equally sure, on the same grounds, that
is not equal to 66. But, under the one assumption that we have permitted ourselves,
it is unquestionable that if
were equal to 65, then
certainly would be equal to 66. So, while the conclusion of the proposition which I have put in quotation
marks is altogether false, the proposition itself, under our assumption, is entirely
true. I have taken an illustration designed to be striking rather than to possess
scientific interest; I could just as easily have shown a true proposition leading
to a false conclusion, but of [122]such sort that it would be of decided scientific interest as telling us one of the
consequences of a certain assumption.
Perhaps an example of what this means will be useful. Let’s agree, as a starting point, that is greater by 1 than
. Now, let’s look at the statement: “If
, then
.” We know—at least we are pretty sure—that
is not equal to 65, if we take “7,” “19,” and “65” to mean what you think they mean. We are also certain, for the same reasons, that
is not equal to 66. However, under the one assumption we’ve made, it is undeniable that if
were equal to 65, then
would definitely be equal to 66. So, while the conclusion of the statement I've quoted is completely false, the statement itself, under our assumption, is entirely true. I’ve used an example meant to be striking rather than scientifically significant; I could just as easily have shown a true statement leading to a false conclusion, but of [__A_TAG_PLACEHOLDER_0__] such a nature that it would be of significant scientific interest as telling us about one of the outcomes of a certain assumption.
What Might We Take for Granted?
This is all very fine; but how does the geometer know what postulates to lay down? One is tempted to say that he is at liberty to postulate anything he pleases, and investigate the results; and that whether or not his postulate ever be realized, the propositions that he deduces from it, being true, are of scientific interest. Actually, however, it is not quite as simple as all that. If it were sufficient to make a single postulate it would be as simple as all that; but it turns out that this is not sufficient any more than it is sufficient to have a single undefined term. We must have several postulates; and they must be such, as a whole, that a geometry flows out of them. The requirements are three.
This is all very nice, but how does the geometer know what postulates to establish? One might think that he can just postulate anything he wants and explore the results; whether or not his postulate is ever realized, the propositions he derives from it are true and scientifically interesting. However, it’s not quite that straightforward. If it were enough to make just one postulate, it would be that simple, but it turns out that this is not enough, just like having a single undefined term isn't enough. We need multiple postulates that collectively allow a geometry to emerge from them. The requirements are three.
In the first place, the system of postulates must be “categorical” or complete—there must be enough of them, and they must cover enough ground, for the support of a complete system of geometry. In practice the test for this is direct. If we got to a point in the building up of a geometry where we could not prove whether a certain thing was one way always, or always the other way, or sometimes one way and sometimes the other, we should conclude that we needed an additional postulate covering this ground directly or indirectly. And we should make that postulate—because it is precisely the [123]things that we can’t prove which, in practical work, we agree to assume. Even Euclid had to adopt this philosophy.
First of all, the system of postulates has to be “categorical” or complete—there need to be enough of them, and they must cover enough territory to support a complete geometry system. In practice, the way to test this is straightforward. If we reach a point in developing a geometry where we can’t determine if something is always one way, always the other way, or sometimes one way and sometimes the other, we should conclude that we need an additional postulate that addresses this issue directly or indirectly. And we should establish that postulate—because it's exactly the things we can't prove that, in practical applications, we agree to assume. Even Euclid had to embrace this approach.
In the second place, the system of postulates must be consistent—no one or more of them may lead, individually or collectively, to consequences that contradict the results or any other or others. If in the course of building up a geometry we find we have proved two propositions that deny one another, we search out the implied contradiction in our postulates and remedy it.
In the second place, the system of postulates must be consistent—none of them, whether individually or together, can lead to outcomes that contradict the results or each other. If, while developing a geometry, we discover that we've proved two statements that contradict one another, we look for the implied contradiction in our postulates and fix it.
Finally, the postulates ought to be independent. It should not be possible to prove any one of them as a consequence of the others. If this property fails, the geometry does not fail with it; but it is seriously disfigured by the superfluity of assumptions, and one of them should be eliminated. If we are to assume anything unnecessarily, we may as well assume the whole geometry and be done with it.
Finally, the postulates should be self-sufficient. It shouldn’t be possible to prove one of them based on the others. If this characteristic is lost, the geometry isn’t invalidated, but it becomes seriously complicated by unnecessary assumptions, and one of them needs to be removed. If we’re going to assume anything without need, we might as well just assume the entire geometry and call it a day.
The geometer’s business then is to draw up a set of postulates. This he may do on any basis whatever. They may be suggested to him by the behavior of points, lines and planes, or by some other concrete phenomena; they may with equal propriety be the product of an inventive imagination. On proceeding to deduce their consequences, he will discover and remedy any lack of categoricity or consistence or independence which his original system of postulates may have lacked. In the end he will have so large a body of propositions without contradiction or failure that he will conclude the propriety of his postulates to have been established, and the geometry based on them to be a valid one. [124]
The geometer's task is to create a set of postulates. He can base them on anything. They might come from how points, lines, and planes behave or from other real-life phenomena; they can also be the result of creative thinking. As he works to figure out their implications, he'll find and fix any issues with clarity, consistency, or independence that his original set of postulates might have had. Eventually, he will have developed a substantial collection of propositions that don’t contradict or fail, leading him to affirm that his postulates are sound and that the geometry built on them is valid. [__A_TAG_PLACEHOLDER_0__]
So, what’s it all about?
Is this geometry ever realized? Strictly it is not the geometer’s business to ask or answer this question. But research develops two viewpoints. There is always the man who indulges in the pursuit of facts for their sake alone, and equally the man who wants to see his new facts lead to something else. One great mathematician is quoted as enunciating a new theory of surpassing mathematical beauty with the climacteric remark “And, thank God, no one will ever be able to find any use for it.” An equally distinguished contemporary, on being interrogated concerning possible applications for one of his most abstruse theorems, replied that he knew no present use for it; but that long experience had made him confident that the mathematician would never develop any tool, however remote from immediate utility, for which the delvers in other fields would not presently find some use.
Is this geometry ever actually achieved? Technically, it’s not the geometer’s role to ask or answer this question. But research brings up two perspectives. There’s always someone who pursues facts just for their own sake, and then there’s also the person who wants to see those new facts lead to something else. One great mathematician is quoted as declaring a new theory of incredible mathematical beauty with the noteworthy comment, “And, thank God, no one will ever be able to find any use for it.” A similarly distinguished contemporary, when asked about possible applications for one of his most complex theorems, said he didn’t see any current use for it; however, his long experience had convinced him that a mathematician would never create a tool, no matter how unrelated to immediate utility, that others wouldn’t eventually find some use for.
If we wish, however, we may inquire with perfect propriety, from the side lines, whether a given geometry is ever realized. We may learn that so far as has yet been discovered there are no elements for which all its postulates are verified, and that there is therefore no realization known. On the other hand, we may more likely find that many different sets of elements are such that the postulates can be interpreted as applying to them, and that we therefore have numerous realizations of the geometry. As a human being the geometer may be interested in all this, but as a geometer it really makes little difference to him. [125]
If we want, we can ask from the sidelines whether a certain geometry ever exists in reality. So far, it seems that there are no elements where all its principles are validated, and thus no known realization exists. On the other hand, we are more likely to discover that various sets of elements can interpret the principles as relevant to them, which means we have many realizations of the geometry. As a person, the geometer might find this interesting, but as a geometer, it doesn’t really matter much to him. [__A_TAG_PLACEHOLDER_0__]
When we look at space about us, we see it, for some reason grounded in the psychological history of the human race, as made up in the small of points, which go to make up lines, which in turn constitute planes. Or we can start at the other end and break space down first into planes, then into lines, finally into points. Our perceptions and conceptions of these points, lines and planes are very definite indeed; it seems indeed, as the Greeks thought, that certain things about them are self-evident. If we wish to take these self-evident properties of point, line and plane, and combine with them enough additional hair-splitting specifications to assure the modern geometer that we have really a categorical system of assumptions, we shall have the basis of a perfectly good system of geometry. This will be what we unavoidably think of as the absolute truth with regard to the space about us; but you mustn’t say so in the presence of the geometer. It will also be what we call the Euclidean geometry. It has been satisfactory in the last degree, because not only space, but pretty much every other system of two or three elements bearing any relations to one another can be made, by employing as a means of interpretation the Cartesian scheme of plotting, to fit into the framework of Euclidean geometry. But it is not the only thing in the world of conceptual possibilities, and it begins to appear that it may not even be the only thing in the world of cold hard fact that surrounds us. To see just how this is so we must return to Euclid, and survey the historical development of geometry from his day to the present time. [126]
When we look around at space, we perceive it, for some reason rooted in our psychological history, as being made up of small points that form lines, which in turn create planes. Alternatively, we can start with planes, break them down into lines, and then into points. Our understanding of these points, lines, and planes is quite clear; it seems that, like the Greeks believed, some aspects of them are obvious. If we take these obvious properties of points, lines, and planes and add enough detailed specifications to satisfy modern geometers, we will have the foundation of a solid system of geometry. This will be what we typically think of as absolute truth regarding the space around us, but don’t mention this in front of a geometer. This is also what we refer to as Euclidean geometry. It has been extremely useful because not only can space be described by it, but just about any other system of two or three elements that have relationships can also fit into the Euclidean framework using the Cartesian plotting method. However, it’s not the only possibility out there, and it appears it may not even be the only reality in the tangible world around us. To understand this better, we need to return to Euclid and examine the historical evolution of geometry from his time to the present. [__A_TAG_PLACEHOLDER_0__]
Euclid's Geometry
Point, line and plane Euclid attempts to define. Modern objection to these efforts was made clear above. Against Euclid’s specific performance we urge the further specific fault that his “definitions” are really assumptions bestowing certain properties upon points, lines and planes. These assumptions Euclid supplements in his axioms; and in the process of proving propositions he unconsciously supplements them still further. This is to be expected from one whose justification for laying down an axiom was the alleged obvious character of the statement made. If some things are too obvious to require demonstration, others may be admitted as too obvious to demand explicit statement at all.
Euclid tries to define points, lines, and planes. The modern objections to his efforts were mentioned earlier. In addition to Euclid's specific shortcomings, we point out that his “definitions” are really assumptions that attribute certain properties to points, lines, and planes. He builds on these assumptions in his axioms, and while proving suggestions, he unknowingly adds to them even more. This is to be expected from someone who justified stating an axiom based on the supposed obviousness of the claims made. If some things are too obvious to need proof, others might also be considered too obvious to require explicit mention at all.

Thus, if Euclid has two points A and B in a plane, on opposite sides of a line M, he will draw the line AB and without further formality speak of the point C in which it intersects M. That it does so intercept M, rather than in some way dodges it, is [127]really an assumption as to the nature of lines and planes. Or again, Euclid will speak of a point D on the line AB, between or outside the points A and B, without making the formal assumption necessary to insure that the line is “full” of points so that such a point as D must exist. That such assumptions as these are necessary follows from our previous remarks. If we think of our geometry as dealing with “chings,” “changs,” and “chungs,” or with elements I, II and III, it is no longer in the least degree obvious that the simplest property in the world applies to these elements. If we wish any property to prevail we must state it explicitly.
Thus, if Euclid has two points A and B in a plane, on opposite sides of a line M, he will draw the line AB and casually mention the point C where it intersects M. The fact that it intersects M, rather than avoiding it somehow, is [__A_TAG_PLACEHOLDER_0__]actually an assumption about the nature of lines and planes. Similarly, Euclid will refer to a point D on the line AB, either between or outside the points A and B, without making the formal assumption necessary to ensure that the line is “fullIt seems you didn't provide a text to modernize. Please provide a phrase for me to assist you with. of points, so that a point like D must exist. The need for such assumptions comes from our earlier remarks. If we think of our geometry as involving “chings,” “changs,” and “chungs,” or with elements I, II, and III, it’s no longer obvious that the simplest property in the world applies to these elements. If we want any property to hold true, we must state it clearly.
With the postulates embodied in his definitions, those stated in his axioms, and those which he reads into his structure by his methods of proof, Euclid has a categorical set—enough to serve as foundation for a geometry. We may then climb into Euclid’s shoes and take the next step with him. We follow him while he proves a number of things about intersecting lines and about triangles. To be sure, when he proves that two triangles are identically constituted by moving one of them over on top of the other, we may protest on the ground that the admission of motion, especially of motion thus imposed from without, into a geometry of things is not beyond dispute. If Euclid has caught our modern viewpoint, he will rejoin that if we have any doubts as to the admissibility of motion he will lay down a postulate admitting it, and we shall be silenced.
With the principles outlined in his definitions, those listed in his axioms, and those he incorporates into his framework through his proof methods, Euclid has established a definitive set—sufficient to serve as a foundation for geometry. We can then step into Euclid's perspective and move forward with him. We follow as he proves various things about intersecting lines and triangles. Certainly, when he demonstrates that two triangles are exactly the same by sliding one over the other, we might object, arguing that accepting movement, especially one imposed externally, into a geometry of static objects is debatable. If Euclid were aware of our modern perspective, he would respond that if we have any doubts about allowing movement, he will establish a principle that permits it, and we would have no further objections.
Having exhausted for the present the interest of intersecting lines, our guide now passes to a consideration [128]of lines in the same plane that never meet. He defines such lines as parallel. If we object that he should show the existence of a derived concept like this before laying down a definition that calls for it to exist, he can show that two lines drawn perpendicular to the same line never meet. He will execute this proof by a special sort of superposition, which requires that the plane be folded over on itself, through the third dimension of surrounding space, rather than merely slid along upon itself.
Having exhausted the topic of intersecting lines for now, our guide now moves on to a discussion [__A_TAG_PLACEHOLDER_0__] of lines in the same plane that never meet. He defines these lines as parallel. If we argue that he needs to demonstrate the existence of a derived concept like this before establishing a definition that relies on it, he can prove that two lines drawn perpendicular to the same line never meet. He will do this proof using a specific type of superposition, which requires the plane to be folded over itself in the third dimension of surrounding space, rather than just sliding along its surface.

We remain quiet while Euclid demonstrates that if two lines are cut by any transversal in such a way as to make corresponding angles at the two intersections equal, the lines are parallel. It is then in order to investigate the converse: if the lines are parallel to begin with, are the angles equal?
We stay silent as Euclid shows that if two lines are crossed by any transversal in such a way that the corresponding angles at both intersections are equal, then the lines are parallel. Next, we need to explore the opposite: if the lines are parallel from the start, are the angles equal?
Custom Axioms
This sounds innocent enough; but in no way was Euclid able to devise a proof—or, for that matter, a disproof. So he took the only way out, and said that if the lines were parallel, obviously they extended in the same direction and made the angles equal. The thing was so obvious, he argued, that [129]it was really an axiom and he didn’t have to prove it; so he stated it as an axiom and proceeded. He didn’t state it in precisely the form I have used; he apparently cast about for the form in which it would appear most obvious, and found a statement that suited him better than this one, and that comes to the same thing. This statement tells us that if the transversal makes two corresponding angles unequal, the lines that it cuts are not parallel and do meet if sufficiently prolonged. But wisely enough, he did not transplant this axiom, once he had arrived at it, to the beginning of the book where the other axioms were grouped; he left it right where it was, following the proposition that if the angles were equal the lines were parallel. This of course was so that it might appeal back, for its claim to obviousness, to its demonstrated converse of the proposition.
This sounds harmless enough; but Euclid couldn't come up with a proof—or a disproof, for that matter. So he took the only option he had and said that if the lines were parallel, they obviously extended in the same direction and created equal angles. He argued that it was so obvious that [__A_TAG_PLACEHOLDER_0__] it was really an axiom, so he didn’t need to prove it; he stated it as an axiom and moved on. He didn’t phrase it exactly as I have; he seems to have looked for a way to present it that seemed most evident to him, and he found a statement that worked better for him than this one, but it means the same thing. This statement tells us that if a transversal creates two corresponding angles that are unequal, the lines it intersects are not parallel and will meet if extended enough. But wisely, he didn’t move this axiom to the beginning of the book where the other axioms were grouped; he left it exactly where it was, after the proposition stating that if the angles were equal, the lines were parallel. This was so it could refer back, for its claim to obviousness, to the proven converse of the proposition.
Euclid must have been dissatisfied with this cutting of the Gordian knot; his successors were acutely so. For twenty centuries the parallel axiom was regarded as the one blemish in an otherwise perfect work; every respectable mathematician had his shot at removing the defect by “proving” the objectionable axiom. The procedure was always the same: expunge the parallel axiom, in its place write another more or less “obvious” assumption, and from this derive the parallel statement more or less directly. Thus if we may assume that the sum of the angles of a triangle is always exactly 180 degrees, or that there can be drawn only one line through a given point parallel to a given line, we can prove Euclid’s axiom. Sometimes the substitute assumption [130]was openly made and stated, as in the two instances cited; as often it was admitted into the demonstration implicitly, as when it is quietly assumed that we can draw a triangle similar to any given triangle and with area as great as we please, or when parallels are “defined” as everywhere equidistant. But such “proofs” never satisfied anyone other than the man who made them; the search went merrily on for a valid “proof” that should not in substance assume the thing to be proved.
Euclid must have been unhappy with this solution to the Gordian knot; his followers certainly were. For twenty centuries, the parallel axiom was seen as the one flaw in an otherwise flawless work; every respectable mathematician tried to fix this issue by “proving” the troublesome axiom. The approach was always the same: remove the parallel axiom and replace it with another more or less “obvious” assumption, then derive the parallel statement more or less directly from this. So, if we assume that the sum of the angles in a triangle is always exactly 180 degrees, or that only one line can be drawn through a certain point parallel to a given line, we can prove Euclid’s axiom. Sometimes the substitute assumption [__A_TAG_PLACEHOLDER_0__] was explicitly stated, as in the two examples mentioned; often, it was introduced into the demonstration implicitly, such as when it’s assumed that we can draw a triangle similar to any given triangle with an area as large as we want, or when parallels are “defined” as always being equidistant. But these “proofs” never satisfied anyone except the person who created them; the search continued happily for a valid “proof” that didn’t essentially assume what it was trying to prove.
Finding the Discrepancy
Saccheri, an Italian Jesuit, would have struck bottom if he had had a little more imagination. He gave an exhaustive reductio ad absurdum, on the basis of the angle-sum theorem. This sum must be (a) greater than or (b) equal to or (c) less than 180 degrees. Saccheri showed that if one of these alternatives occurs in a single triangle, it must occur in every triangle. The first case gave little trouble; admitting the possibility of superposing in the special manner mentioned above, which he did implicitly, he showed that this “obtuse-angled hypothesis” contradicted itself. He pursued the “acute-angled hypothesis” for a long time before he satisfied himself that he had caught it, too, in an inconsistency. This left only the “right-angled hypothesis,” proving the Euclidean angle-sum theory and through it the parallel postulate. But Saccheri was wrong: he had found no actual contradiction in the acute-angled hypothesis—for none exists therein.
Saccheri, an Italian Jesuit, would have reached a breakthrough if he had a bit more imagination. He conducted a thorough reductio ad absurdum based on the angle-sum theorem. This sum has to be (a) greater than or (b) equal to or (c) less than 180 degrees. Saccheri demonstrated that if any of these scenarios happens in one triangle, it must apply to every triangle. The first case wasn’t too difficult; by allowing the possibility of overlapping in the specific way he mentioned, which he did implicitly, he showed that this "obtuse-angled hypothesis" contradicted itself. He chased the "acute-angled hypothesis" for quite some time before concluding that he had also found it to be inconsistent. This left only the "right-angled hypothesis," which proved the Euclidean angle-sum theory and, through it, the parallel postulate. But Saccheri was mistaken: he hadn’t discovered any actual contradiction in the acute-angled hypothesis—because none exists.
The full facts were probably first known to Gauss, [131]who had a finger in every mathematical pie that had to do with the transition to modern times. They were first published by Lobatchewsky, the Russian, who anticipated the Hungarian John Bolyai by a narrow margin. All three worked independently of Saccheri, whose book, though theoretically available in Italian libraries, was actually lost to sight and had to be rediscovered in recent years.
The complete facts were likely first known to Gauss, [__A_TAG_PLACEHOLDER_0__] who was involved in every mathematical development related to the shift to modern times. They were first published by the Russian Lobatchewsky, who was just ahead of the Hungarian John Bolyai. All three worked independently of Saccheri, whose book, although theoretically accessible in Italian libraries, was practically forgotten and had to be found again in recent years.
Like Saccheri, Lobatchewsky investigated alternative possibilities. But he chose another point of attack: through a given point it must be possible to draw, in the same plane with a given line (a) no lines or (b) one line or (c) a plurality of lines, which shall not meet the given line. The word parallel is defined only in terms of the second of these hypotheses, so we avoid it here. These three cases correspond, respectively, to those of Saccheri.
Like Saccheri, Lobatchewsky explored different possibilities. However, he took a different approach: through a given point, it must be possible to draw, in the same plane as a given line (a) no lines, (b) one line, or (c) multiple lines, which do not intersect the given line. The term parallel is defined only based on the second of these scenarios, so we will avoid using it here. These three cases align, respectively, with those of Saccheri.
The first case Lobatchewsky ruled out just as did Saccheri, but accepting consciously the proviso attached to its elimination; the third he could not rule out. He developed the consequences of this hypothesis as far as Euclid develops those of the second one, sketching in a full outline for a system of geometry and trigonometry based on a plurality of “non-cutters.” This geometry constitutes a coherent whole, without a logical flaw.
The first case Lobatchewsky dismissed just like Saccheri did, but he was fully aware of the condition linked to its dismissal; he couldn't dismiss the third one. He explored the implications of this hypothesis as thoroughly as Euclid explored those of the second one, outlining a complete system of geometry and trigonometry based on multiple “non-cutters.” This geometry forms a cohesive structure without any logical inconsistencies.
This made it plain what was the matter with Euclid’s parallel axiom. Nobody could prove it from his other assumptions because it is not a consequence of these. True or false, it is independent of them. Trinity Church is in New York, Faneuil Hall is in Boston, but Faneuil Hall is not in Boston because Trinity is in New York; and we could not [132]prove that Faneuil Hall was in Boston if we knew nothing about America save that Trinity is in New York. The mathematicians of 2,000 years had been pursuing, on a gigantic scale, a delusion of post hoc, ergo propter hoc.
This made it clear what the issue was with Euclid’s parallel axiom. Nobody could prove it from his other assumptions because it isn’t a result of them. Whether it’s true or false, it stands alone. Trinity Church is in New York, Faneuil Hall is in Boston, but Faneuil Hall is not in Boston because Trinity is in New York; and we could not [__A_TAG_PLACEHOLDER_0__]prove that Faneuil Hall was in Boston if we only knew that Trinity is in New York. For 2,000 years, mathematicians had been chasing, on a massive scale, an illusion of post hoc, ergo propter hoc.
What the Postulate Actually Does
Moreover, in the absence of an assumption covering the ground, we shall not know which of the alternatives (a), (b), (c) holds. But when one holds in a single case it holds permanently, as Saccheri and Lobatchewsky both showed. So we cannot proceed on this indefinite basis; we must know which one is to hold. Without the parallel postulate or a substitute therefor that shall tell us the same thing or tell us something different, we have not got a categorical set of assumptions—we cannot build a geometry at all. That is why Euclid had to have his parallel postulate before he could proceed. That is why his successors had to have an assumption equivalent to his.
Moreover, without an assumption covering the ground, we won’t know which of the alternatives (a), (b), or (c) is true. But when one is true in a specific case, it remains true permanently, as both Saccheri and Lobatchewsky demonstrated. So we can’t move forward on this uncertain foundation; we need to know which one is valid. Without the parallel postulate or a substitute that tells us the same thing or something different, we don’t have a definite set of assumptions—we can’t create any geometry at all. That’s why Euclid needed his parallel postulate before he could continue. That’s why his successors needed an assumption equivalent to his.
The reason why it took so long for this to percolate into the understanding of the mathematicians was that they were thinking, not in terms of the modern geometry and about undefined elements; but in terms of the old geometry and about strictly defined and circumscribed elements. If we understand what is meant by Euclidean line and plane, of course the parallel postulate, to use the old geometer’s word, is true—of course, to adopt the modern viewpoint, if we agree to employ an element to which that assumption applies, the assumption is [133]realized. The very fact of accepting the “straight” line and the “flat” plane of Euclid constitutes acceptance of his parallel postulate—the only thing that can separate his geometry from other geometries. But of course we can’t prove it; the prior postulates which we would have to use in such an attempt apply where it does not apply, and hence it cannot possibly be consequences of.
The reason it took so long for this to sink in for mathematicians was that they weren't thinking in terms of modern geometry and undefined elements; instead, they were looking at the old geometry with strictly defined and limited elements. If we understand what is meant by a Euclidean line and plane, then, of course, the parallel postulate, as the old geometers would say, is true—naturally, if we take the modern view and agree to use an element to which that assumption applies, that assumption is [__A_TAG_PLACEHOLDER_0__]realized. Simply accepting the "straight" line and the "flat" plane of Euclid means accepting his parallel postulate—the one thing that sets his geometry apart from others. But of course we can’t prove it; the earlier postulates that we would need for such a proof apply where it doesn’t apply, and therefore it can't possibly be a consequence of those.
To all this the classical Euclidean rejoins that we seem to have in mind elements of some sort to which, with one reservation, his postulates apply. He wants to know what these elements look like. We can, and must, produce them—else our talk about generality is mere drivel. But we must take care that the Euclidean geometer does not try to apply to our elements the notions of straightness and flatness which inhere in the parallel postulate. We cannot satisfy and defy that postulate at the same time. If we do not insist on this point, we shall find that we are reading non-Euclidean properties into Euclidean geometry, and interpreting the elements of the latter as straight lines that are not straight, flat planes that are not flat. It is not the mission of non-Euclidean geometry thus to deny the possibility of Euclidean geometry; it merely demands a place of equal honor.
To all this, the classical Euclidean argues that we seem to have some elements in mind to which, with one exception, his basic assumptions apply. He wants to understand what these elements are like. We can, and must, define them—otherwise, our discussions about generality are just nonsense. However, we need to ensure that the Euclidean geometer doesn’t try to apply the ideas of straightness and flatness that are inherent in the parallel postulate to our elements. We cannot satisfy and contradict that postulate at the same time. If we don't emphasize this point, we will end up interpreting non-Euclidean properties within Euclidean geometry, viewing elements of the latter as straight lines that aren’t straight, flat planes that aren’t flat. Non-Euclidean geometry is not tasked with rejecting the existence of Euclidean geometry; it simply asks for an equal level of respect.
Surface Geometry
Let us ask the Euclidean geometer whether he can recognize his plane after we have crumpled it up like a piece of paper en route to the waste basket. He will hesitate only long enough to recall that in the [134]special case of superposition he has reserved for himself the privilege of deforming his own plane, and to realize that he can always iron his plane out smooth again after we are through with it. This emphasizes the true nature of the two-dimensionality which is the fundamental characteristic of the plane (and of other things, as we shall directly see). The plane is two-dimensional in points not because two sets of mutually perpendicular Euclidean straight lines can be drawn in it defining directions of north-south and east-west, but because a point in it can be located by means of two measures. The same statement may be made of anything whatever to which the term “surface” is applicable; anything, however crumpled or irregular it be, that possesses length and breadth without thickness. The surface of a sphere, of a cylinder, of an ellipsoid, of a cone, of a doughnut (mathematically known as a torus), of a gear wheel, of a French horn, all these possess two-dimensionality in points; on all of them we can draw lines and curves and derive a geometry of these figures. If we get away from the notion that geometry of two dimensions must deal with planes, and adopt in place of this idea the broader restriction that it shall deal with surfaces, we shall have the generalization which the Euclidean has demanded that we produce, and the one which in the hands of the modern geometer has shown results.
Let’s ask a Euclidean geometer if he can recognize his plane after we’ve crumpled it up like a piece of paper on the way to the trash can. He will pause long enough to remember that in the [__A_TAG_PLACEHOLDER_0__]special case of superposition, he’s allowed to deform his own plane, and to realize that he can always smooth it out again after we’re done. This highlights the true nature of the two-dimensionality that is the core characteristic of the plane (and other things, as we will soon see). The plane is two-dimensional in points not because we can draw two sets of mutually perpendicular Euclidean straight lines defining north-south and east-west directions, but because a point in it can be pinpointed using two measures. This statement applies to anything that can be called a “surface”; anything, no matter how crumpled or irregular, that has length and width but no thickness. The surface of a sphere, a cylinder, an ellipsoid, a cone, a doughnut (mathematically referred to as a torus), a gear wheel, or a French horn—all of these have two-dimensionality in points; on all of them, we can draw lines and curves and create a geometry of these shapes. If we move away from the idea that two-dimensional geometry must involve planes and instead adopt the broader idea that it involves surfaces, we will achieve the generalization that the Euclidean versions have requested, and one that has proven effective in the hands of modern geometers.
In this two-dimensional geometry of surfaces in general, that of the plane is merely one special case. Certain of the features met in that case are general. If we agree that we know what we mean by distance, we find that on every surface there is a shortest [135]distance between two points, together with a series of lines or curves along which such distances are taken. These lines or curves we call geodesics. On the plane the geodesic is the straight line. On surfaces in general the geodesic, whatever its particular and peculiar shape, plays the same rôle that is played by the straight line in the plane; it is the secondary element of the geometry, the surface itself and all other surfaces of its type are the tertiary elements. And it is a fact that we can take all the possible spheres, or all the possible French-horn surfaces, and conceive of space as we know it being broken down by analysis into these surfaces instead of into planes. The only reason we habitually decompose space into planes is because it comes natural to us to think that way. But geometric points, lines and surfaces must be recognized as abstractions without actual existence, for all of them lack one or more of the three dimensions which such existence implies. These figures exist in our minds but not in the external world about us. So any decomposition of space into geometric elements is a phenomenon of the mind only; it has no parallel and no significance in the external world, and is made in one way or in another purely at our pleasure. There isn’t a true, honest-to-goodness geometrical plane in existence any more than there is an honest-to-goodness spherical surface: so on intrinsic grounds one decomposition is as reasonable as another.
In this two-dimensional geometry of surfaces, the plane is just one specific case. Some of the features we encounter in that case are general. If we agree on what we mean by distance, we discover that on every surface, there's a shortest distance between two points, along with a series of lines or curves that represent such distances. We call these lines or curves geodesics. On the plane, the geodesic is the straight line. On other surfaces, the geodesic, regardless of its unique shape, plays the same role that the straight line does in the plane; it’s the secondary element of geometry, while the surface itself and all other similar surfaces are the tertiary elements. It’s a fact that we can imagine all possible spheres, or all possible French-horn surfaces, breaking down space as we know it into these surfaces instead of into planes. The only reason we usually break space down into planes is that it feels natural to think that way. However, geometric points, lines, and surfaces must be understood as abstractions without real existence, since they all lack one or more of the three dimensions that such existence suggests. These figures exist in our minds but not in the external world. So, any breakdown of space into geometric elements is purely a mental phenomenon; it has no equivalent and no significance in the external world, and is done purely at our discretion. There isn’t a real, tangible geometrical plane any more than there is a true spherical surface: therefore, from an intrinsic perspective, one breakdown is just as reasonable as another.
Certain of the most fundamental postulates are obeyed by all surfaces. As we attempt to discriminate between surfaces of different types, and get, for instance, a geometry that shall be valid for spheres [136]and ellipsoids but not for conicoids in general, we must do so by bringing in additional postulates that embody the necessary restrictions. A characteristic shared by planes, spheres, and various other surfaces is that the geodesics can be freely slid along upon themselves and will coincide with themselves in all positions when thus slid; with a similar arrangement for the surface itself. But the plane stands almost unique among surfaces in that it does not force us to distinguish between its two sides; we can turn it over and still it will coincide with itself; and this property belongs also to the straight line. It does not belong to the sphere, or to the great circles which are the geodesics of spherical geometry; when we turn one of these over, through the three-dimensional space that surrounds it, we find that the curvature lies in the wrong way to make superposition possible. If we postulate that superposition be possible under such treatment, we throw out the sphere and spherical geometry; if we postulate that superposition be only by sliding the surface upon itself we admit that geometry—as Saccheri failed to see, as Lobatchewsky realized, and as Riemann showed at great length in rehabilitating the “obtuse-angled hypothesis.” Lobatchewsky’s acute-angled geometry is realized on a surface of the proper sort, which admits of unrestricted superposition; but it is not the sort of a surface that I care to discuss in an article of this scope.
Certain fundamental principles apply to all surfaces. When we try to differentiate between various types of surfaces and create a geometry that works for spheres [__A_TAG_PLACEHOLDER_0__] and ellipsoids, but not for conicoids in general, we need to introduce extra principles that include the necessary limitations. A common feature among planes, spheres, and many other surfaces is that the geodesics can slide freely over themselves and will match up in all positions when slid in that way; a similar property applies to the surface itself. However, the plane is quite unique because it doesn’t require us to differentiate between its two sides; we can flip it over and it will still align perfectly with itself, and this property also belongs to the straight line. This doesn’t hold true for the sphere or the great circles, which are the geodesics of spherical geometry; when we flip one of these through the surrounding three-dimensional space, we find that the curvature is positioned incorrectly for superposition. If we assume that superposition is possible under such conditions, we exclude the sphere and spherical geometry; if we assume that superposition is only possible by sliding the surface over itself, we accept that geometry—as Saccheri failed to recognize, as Lobatchewsky understood, and as Riemann extensively explained in revisiting the “obtuse-angled hypothesis.” Lobatchewsky’s acute-angled geometry exists on a type of surface that allows unrestricted superposition, but it's not the kind of surface I want to discuss in an article of this extent.
Euclidean geometry is the natural and easy one, I suppose, because it makes it easy to stop with three dimensions. If we take a secondary element, a geodesic, which is “curved” in the Euclidean sense, [137]we get a tertiary element, a surface, which is likewise curved. Then unless we are to make an altogether abrupt and unreasonable break, we shall find that just as the curved geodesic generated a curved surface, the curved surface must give rise to a “curved space”; and just as the curved geodesic needed a second dimension to curve into, and the curved surface a third, so the curved three-space requires a fourth. Once started on this sort of thing, there doesn’t really seem to be any end.
Euclidean geometry is the straightforward and uncomplicated type, I guess, because it allows us to stop at three dimensions. If we introduce a secondary element, a geodesic, which is “curved” in the Euclidean way, [__A_TAG_PLACEHOLDER_0__]we create a tertiary element, a surface, which is also curved. So unless we want to make a completely sudden and unreasonable jump, we'll find that just as the curved geodesic produced a curved surface, the curved surface must lead to a “curved space”; and just as the curved geodesic needed a second dimension to curve into, and the curved surface required a third, the curved three-space needs a fourth. Once you start down this path, it really seems like there’s no end in sight.
Euclidean or Non-Euclidean
Nevertheless, we must face the possibility that the space we live in, or any other manifold of any sort whatever with which we deal on geometric principles, may turn out to be non-Euclidean. How shall we finally determine this? By measures—the Euclidean measures the angles of an actual triangle and finds the sum to be exactly 180 degrees; or he draws parallel lines of indefinite extent and finds them to be everywhere equally distant; and from these data he concludes that our space is really Euclidean. But he is not necessarily right.
Nevertheless, we need to consider the possibility that the space we inhabit, or any other kind of manifold we encounter based on geometric principles, might actually be non-Euclidean. How will we ultimately figure this out? Through measurements—the Euclidean method measures the angles of a real triangle and finds the sum to be exactly 180 degrees; or it draws parallel lines that go on forever and finds them to be equally distant from each other everywhere; and from these observations, it concludes that our space is indeed Euclidean. But it might not be correct.
We ask him to level off a plot of ground by means of a plumb line. Since the line always points to the earth’s center, the “level” plot is actually a very small piece of a spherical surface. Any test conducted on this plot will exhibit the numerical characteristics of the Euclidean geometry; yet we know the geometry of this surface is Riemannian. The angle-sum is really greater than 180 degrees; lines that are everywhere equidistant are not both geodesics. [138]
We ask him to even out a piece of land using a plumb line. Since the line always points toward the center of the Earth, the "level" plot is actually a tiny section of a spherical surface. Any test performed on this plot will show the numerical traits of Euclidean geometry; however, we know the geometry of this surface is Riemannian. The angle sum is actually greater than 180 degrees; lines that remain equidistant are not both geodesics. [__A_TAG_PLACEHOLDER_0__]
The trouble, of course, is that on this plot we deal with so minute a fraction of the whole sphere that we cannot make measurements sufficiently refined to detect the departure from Euclidean standards. So it is altogether sensible for us to ask: “Is the universe of space about us really Euclidean in whatever of realized geometry it presents to us? Or is it really non-Euclidean, but so vast in size that we have never yet been able to extend our measures to a sufficiently large portion of it to make the divergence from the Euclidean standard discernible to us?”
The issue, of course, is that on this plot we are dealing with such a tiny portion of the entire space that we can't make precise enough measurements to see any differences from Euclidean standards. So it makes perfect sense for us to ask: “Is the space around us really Euclidean in the geometry we experience? Or is it actually non-Euclidean, but so enormous that we haven't been able to measure a large enough part of it to notice any differences from the Euclidean standard?”
This discussion is necessarily fragmentary, leaving out much that the writer would prefer to include. But it is hoped that it will nevertheless make it clear that when the contestants in the Einstein competition speak of a non-Euclidean universe as apparently having been revealed by Einstein, they mean simply that to Einstein has occurred a happy expedient for testing Euclideanism on a smaller scale than has heretofore been supposed possible. He has devised a new and ingenious sort of measure which, if his results be valid, enables us to operate in a smaller region while yet anticipating that any non-Euclidean characteristics of the manifold with which we deal will rise above the threshold of measurement. This does not mean that Euclidean lines and planes, as we picture them in our mind, are no longer non-Euclidean, but merely that these concepts do not quite so closely correspond with the external reality as we had supposed.
This discussion is inherently limited, leaving out a lot that the writer would like to include. However, the hope is that it will still clarify that when the participants in the Einstein competition talk about a non-Euclidean universe as seemingly revealed by Einstein, they simply mean that Einstein has discovered a practical way to test Euclidean concepts on a smaller scale than we previously thought was possible. He has created a new and clever type of measurement that, if his results are accurate, allows us to operate in a smaller area while still expecting that any non-Euclidean traits of the space we’re studying will become noticeable. This doesn't mean that Euclidean lines and planes, as we visualize them, are no longer non-Euclidean, but rather that these ideas don’t quite align with the actual external reality as closely as we believed.
As to the precise character of the non-Euclideanism which is revealed, we may leave this to later chapters and to the competing essayists. We need [139]only point out here that it will not necessarily be restricted to the matter of parallelism. The parallel postulate is of extreme interest to us for two reasons; first because historically it was the means by which the possibilities and the importance of non-Euclidean geometry were forced upon our attention; and second because it happens to be the immediate ground of distinction between Euclidean geometry and two of the most interesting alternatives. But Euclidean geometry is characterized, not by a single postulate, but by a considerable number of postulates. We may attempt to omit any one of these so that its ground is not specifically covered at all, or to replace any one of them by a direct alternative. We might conceivably do away with the superposition postulate entirely, and demand that figures be proved equivalent, if at all, by some more drastic test. We might do away with the postulate, first properly formulated by Hilbert, on which our ideas of the property represented in the word “between” depend. We might do away with any single one of the Euclidean postulates, or with any combination of two or more of them. In some cases this would lead to a lack of categoricity and we should get no geometry at all; in most cases, provided we brought a proper degree of astuteness to the formulation of alternatives for the rejected postulates, we should get a perfectly good system of non-Euclidean geometry: one realized, if at all, by other elements than the Euclidean point, line and plane, and one whose elements behave toward one another differently from the Euclidean point, line and plane. [140]
Regarding the specific nature of the non-Euclideanism that is revealed, we can leave this for later chapters and the various essayists. We only need to mention here that it won’t necessarily be limited to the issue of parallelism. The parallel postulate is extremely interesting to us for two reasons: first, because historically it was the means through which the possibilities and significance of non-Euclidean geometry became apparent to us; and second, because it serves as the immediate distinction between Euclidean geometry and two of the most fascinating alternatives. However, Euclidean geometry is defined not by a single postulate, but by a significant number of them. We might try to leave out any one of these so that its foundation is not specifically addressed at all, or replace any of them with a direct alternative. We could potentially eliminate the superposition postulate entirely and require that figures be proven equivalent, if at all, by some more rigorous test. We could also remove the postulate, first clearly formulated by Hilbert, on which our understanding of the concept represented by the term “between” relies. We might discard any single Euclidean postulate, or any combination of two or more of them. In some instances, this would lead to a lack of certainty, resulting in no geometry at all; in most cases, as long as we approach the formulation of alternatives for the omitted postulates with the right degree of insight, we would have a perfectly workable system of non-Euclidean geometry: one realized, if at all, through different elements than the Euclidean point, line, and plane, and one whose elements interact with each other in a way that’s different from the Euclidean point, line, and plane. [__A_TAG_PLACEHOLDER_0__]
Merely to add definiteness to this chapter, I annex here the statement that in the geometry which Einstein builds up as more nearly representing the true external world than does Euclid’s, we shall dispense with Euclid’s (implicit) assumption, underlying his (explicitly stated) superposition postulate, to the effect that the act of moving things about does not affect their lengths. We shall at the same time dispense with his parallel postulate. And we shall add a fourth dimension to his three—not, of course, anything in the nature of a fourth Euclidean straight line perpendicular, in Euclidean space, to three lines that are already perpendicular to each other, but something quite distinct from this, whose nature we shall see more exactly in the next chapter. If the present chapter has made it clear that it is proper for us to do this, and has prevented anyone from supposing that the results of doing it must be visualized in a Euclidean space of three dimensions or of any number of dimensions, it will have served its purpose. [141]
To clarify this chapter, I want to state that in the geometry that Einstein develops as more accurately representing the real external world than Euclid’s, we will discard Euclid’s (implicit) assumption, which underlies his (explicitly stated) superposition postulate, that moving things around doesn’t change their lengths. We will also disregard his parallel postulate. Furthermore, we will introduce a fourth dimension to his three—not anything resembling a fourth Euclidean straight line that is perpendicular, in Euclidean space, to three lines that are already perpendicular to one another, but something entirely different, the specifics of which we will explore in the next chapter. If this chapter has made it clear that it is appropriate for us to take this approach, and has prevented anyone from thinking that the outcomes of this must be visualized in a three-dimensional Euclidean space or any other number of dimensions, then it will have achieved its goal. [__A_TAG_PLACEHOLDER_0__]
VI
THE SPACE-TIME CONTINUUM
Minkowski’s World of Events, and the Way It Fits Into Einstein’s Structure
Minkowski's World of Events and Its Role in Einstein's Framework
Seeking a basis for the secure formulation of his results, and especially a means for expressing mathematically the facts of the dependence which he had found to exist between time and space, Einstein fell back upon the prior work of Minkowski. It may be stated right here that the idea of time as a fourth dimension is not particularly a new one. It has been a topic of abstract speculation for the best part of a century, even on the part of those whose notions of the fourth dimension were pretty closely tied down to the idea of a fourth dimension of Euclidean point-space, which would be marked by a fourth real line, perpendicular to the other three, and visible to us if we were only able to see it. Moreover, every mathematician, whether or not he be inclined to this sort of mental exercise, knows well that whenever time enters his equations at all, it does so on an absolutely equal footing with each of his space coordinates, so that as far as his algebra is concerned he could never distinguish between them. When the variables x, y, z, t come to the mathematician in connection [142]with some physical investigation, he knows before he starts that the first three represent the dimensions of Euclidean three-space and that the last stands for time. But if the algebraic expressions of such a problem were handed to him independently of all physical tie-up, he would never be able to tell, from them alone, whether one of the four variables represented time, or if so, which one to pick out for this distinction.
Looking for a solid foundation for his conclusions, and especially a way to mathematically express the relationship he discovered between time and space, Einstein turned to Minkowski's earlier work. It’s important to note that the concept of time as a fourth dimension isn’t exactly new. It has been a subject of deep thought for nearly a century, even among those whose understanding of the fourth dimension was closely linked to the idea of a fourth dimension in Euclidean space, which would be represented by a real line that is perpendicular to the other three and visible to us if we could only perceive it. Furthermore, every mathematician, whether or not they enjoy this type of theoretical exploration, understands that whenever time appears in their equations, it does so on the same level as the space coordinates. So, in terms of algebra, they can't differentiate between them. When the variables x, y, z, t come to a mathematician in relation [__A_TAG_PLACEHOLDER_0__] to some physical investigation, they know from the outset that the first three represent the dimensions of Euclidean three-space and that the last one represents time. However, if he were given the algebraic expressions of such a problem without any physical context, he would have no way of knowing, just from them, whether one of the four variables represented time or which one it might be.
It was Minkowski who first formulated all this in a form susceptible of use in connection with the theory of relativity. His starting point lies in the distinction between the point and the event. Mr. Francis has brought this out rather well in his essay, being the only competitor to present the Euclidean geometry as a real predecessor of Newtonian science, rather than as a mere part of the Newtonian system. I think his point here is very well taken. As he says, Euclid looked into the world about him and saw it composed of points. Ignoring all dynamic considerations, he built up in his mind a static world of points, and constructed his geometry as a scientific machine for dealing with this world in which motion played no part. It could to be sure be introduced by the observer for his own purposes, but when so introduced it was specifically postulated to be a matter of no moment at all to the points or lines or figures that were moved. It was purely an observational device, intended for the observer’s convenience, and in the bargain a mental device, calling for no physical action and the play of no force. So far as Euclid in his daily life was obliged to take cognizance of the fact that in the world of work-a-day [143]realities motion existed, he must, as a true Greek, have looked upon this as a most unfortunate deviation of the reality from his beautiful world of intellectual abstraction, and as something to be deplored and ignored. Even in their statuary the Greeks clung to this idea. A group of marvelous action, like the Laocoon, they held to be distinctly a second rate production, a prostitution of the noble art; their ideal was a figure like the majestic Zeus—not necessarily a mere bust, be it understood, but always a figure in repose without action. Their statuary stood for things, not for action, just as their geometry stood for points, not for events.
It was Minkowski who first put all this into a form that could be used with the theory of relativity. His starting point is the distinction between a point and an event. Mr. Francis has highlighted this well in his essay, being the only competitor to present Euclidean geometry as a real predecessor to Newtonian science, rather than just a part of the Newtonian system. I think his point is spot on. As he mentions, Euclid looked around him and saw the world made up of points. Ignoring all dynamic factors, he imagined a static world of points and created his geometry as a scientific tool for interacting with this world, where motion was irrelevant. It could certainly be introduced by the observer for their own purposes, but when it was, it was specifically assumed to have no impact on the points, lines, or figures that were moved. It was purely an observational tool, meant for the convenience of the observer, and also a mental tool that required no physical action or force. As far as Euclid was concerned in his everyday life, the fact that motion existed in the real world of daily realities must have seemed, to him as a true Greek, a regrettable departure from his beautiful world of intellectual abstraction—something to be lamented and ignored. Even in their sculptures, the Greeks adhered to this concept. A group depicting marvelous action, like the Laocoon, was considered distinctly second-rate, a distortion of the noble art; their ideal was a figure like the majestic Zeus—not necessarily just a bust, but always a figure at rest, without action. Their sculptures represented things, not actions, just as their geometry represented points, not events.
Galileo and Newton took a different viewpoint. They were interested in the world as it is, not as it ought to be; and if motion appears to be a fundamental part of that world, they were bound to include it in their scheme. This made it necessary for them to pay much more attention to the concept of time and its place in the world than did the Greeks. In the superposition process, and even when he allowed a curve to be generated by a moving point, the sole interest which Euclid had in the motion was the effect which was to be observed upon his static figures after its completion. In this effect the rate of the motion did not enter. So all questions of velocity and time are completely ignored, and we have in fact the curious spectacle of motion without time.
Galileo and Newton had a different perspective. They were focused on the world as it is, not as it should be; and since motion seemed to be a fundamental aspect of that world, they had to include it in their theories. This meant they needed to pay much more attention to the concept of time and its role in the world than the Greeks did. In the superposition process, and even when he allowed a curve to be formed by a moving point, Euclid's only interest in the motion was in the effect it had on his static figures after it was finished. In this effect, the speed of the motion wasn't considered. So, all questions about velocity and time are completely overlooked, resulting in the strange scenario of motion without time.
To Galileo and Newton, on the other hand, the time which it took a body to pass from one point of its path to another was of paramount importance. The motion itself was the object of their study, and they recognized the part played by velocity. But [144]Galileo and Newton were still sufficiently under the influence of Euclid to fit the observed phenomena of motion, so far as they could, upon Euclid’s static world of points. This they effected by falling in with the age-old procedure of regarding time and space as something entirely disassociated and distinct. The motion of an object—in theory, of a point—was to be recorded by observing its successive positions. With each of these positions a time was to be associated, marking the instant at which the point attained that position. But in the face of this association, space and time were to be maintained as entirely separate entities.
To Galileo and Newton, however, the time it took for something to move from one point to another was crucial. The motion itself was their focus, and they understood the role of speed. But [__A_TAG_PLACEHOLDER_0__]Galileo and Newton were still influenced enough by Euclid to try to fit the observed phenomena of motion into Euclid’s static world of points. They did this by following the long-standing approach of treating time and space as completely separate and distinct. The motion of an object—in theory, of a point—was to be recorded by observing its successive positions. Each of these positions would be linked to a specific time, indicating when the point reached that position. But despite this connection, space and time were to be kept as entirely separate entities.
The 4D World of Events
This severe separation of time and space Minkowski has now questioned, with the statement that the elements of which the external world is composed, and which we observe, are not points at all, but are events. This calls for a revision of our whole habit of thought. It means that the perceptual world is four-dimensioned, not three-dimensioned as we have always supposed; and it means, at the very least, that the distinction between time and space is not so fundamental as we had supposed.
This strict divide between time and space is something Minkowski has now challenged, by stating that the components of the external world we observe aren't points, but are events. This requires us to rethink our entire way of thinking. It indicates that the perceptual world has four dimensions instead of the three dimensions we've always thought; and it suggests, at the very least, that the difference between time and space isn't as essential as we once believed.
[This should not impress us as strange or incomprehensible. What do we mean when we say that a plane is two-dimensional? Simply that two coordinates, two numbers, must be given to specify the position of any point of the plane. Similarly for a point in the space of our accustomed concepts we must give three numbers to fix the position—as [145]by giving the latitude and longitude of a point on the earth and its height above sea-level. So we say this space is three-dimensional. But a material body is not merely somewhere; it is somewhere now,]182 or was somewhere yesterday, or will be somewhere tomorrow. The statement of position for a material object is meaningless unless we at the same time specify the time at which it held that position. [If I am considering the life-history of an object on a moving train, I must give three space-coordinates and one time-coordinate to fix each of its positions.]182 And each of its positions, with the time pertaining to that position, constitutes an event. The dynamic, ever-changing world about us, that shows the same aspect at no two different moments, is a world of events; and since four measures or coordinates are required to fix an event, we say this world of events is four-dimensional. If we wish to test out the soundness of this viewpoint, we may well do so by asking whether the naming of values for the four coordinates fixes the event uniquely, as the naming of three under the old system fixes the point uniquely.
[This shouldn't seem strange or confusing. What do we mean when we say that a plane is two-dimensional? It simply means that we need two coordinates, two numbers, to specify the position of any point on the plane. Similarly, to identify a point in space as we typically understand it, we must provide three numbers to establish its location—like [__A_TAG_PLACEHOLDER_0__] by giving the latitude and longitude of a point on Earth and its elevation above sea level. Thus, we say this space is three-dimensional. However, a physical object is not just located somewhere; it is located somewhere right now,]182 or was located somewhere yesterday, or will be located somewhere tomorrow. Describing the position of a physical object is pointless unless we also specify the time when it was at that position. [If I’m looking at the life story of an object on a moving train, I need to give three spatial coordinates and one time coordinate to specify each of its positions.]182 Each of those positions, along with the time associated with that position, makes up an event. The dynamic, ever-changing world around us, which looks different at every moment, is a world of events; and since four measurements or coordinates are necessary to pinpoint an event, we describe this world of events as four-dimensional. If we want to evaluate the validity of this perspective, we can do so by asking whether naming the values for the four coordinates uniquely identifies the event, just as naming three coordinates in the old system uniquely identified a point.
Suppose we take some particular event as the one from which to measure, and agree upon the directions to be taken by our space axes, and make any convention about our time-axis which subsequent investigation may show to be necessary. Certainly then the act of measuring so many miles north, and so many west, and so many down, and so many seconds backward, brings us to a definite time and place—which is to say, to a definite event. Perhaps nothing “happened” there, in the sense in which we [146]usually employ the word; but that is no more serious than if we were to locate a point with reference to our familiar space coordinate system, and find it to lie in the empty void of interstellar space, with no material body occupying it. In this second case we still have a point, which requires, to insure its existence and location, three coordinates and nothing more; in the first case we still have an event, which requires for its existence and definition four coordinates and nothing more. It is not an event about which the headline writers are likely to get greatly excited; but what of that? It is there, ready and waiting to define any physical happening that falls upon it, just as the geometer’s point is ready and waiting to define any physical body that chances to fall upon it.
Suppose we choose a specific event as our reference point, agree on the directions for our spatial axes, and establish any conventions for our time axis that further exploration might require. Then, measuring several miles north, a few miles west, some distance down, and a handful of seconds back will lead us to a specific time and place—which means a specific event. Maybe nothing "happened" there in the way we usually think of it, but that’s no more significant than pinpointing a location in our familiar spatial coordinate system, only to find it in the emptiness of interstellar space, without any material object there. In this second scenario, we still identify a point, which needs just three coordinates to confirm its existence and location; similarly, in the first scenario, we have an event that requires four coordinates for its existence and definition. It’s not an event that headline writers would get too excited about; but so what? It exists, ready to define any physical occurrence that happens on it, just as a geometric point is ready to define any physical object that happens to occupy it.
A Range of Points
It is now in order to introduce a word, which I shall have to confess the great majority of the essayists introduce, somewhat improperly, without explanation. But when I attempt to explain it, I realize quite well why they did this. They had to have it; and they didn’t have space in their three thousand words to talk adequately about it and about anything else besides. The mathematician knows very well indeed what he means by a continuum; but it is far from easy to explain it in ordinary language. I think I may do best by talking first at some length about a straight line, and the points on it.
It’s time to introduce a term that, I must admit, most essayists use somewhat incorrectly without explaining. But when I try to explain it, I understand why they did that. They needed it, and they didn’t have enough space in their three thousand words to cover it properly along with everything else. The mathematician knows exactly what a continuum means, but it’s not easy to explain it in everyday language. I think the best approach is to start by discussing, in some detail, a straight line and the points on it.
If the line contains only the points corresponding [147]to the integral distances 1, 2, 3, etc., from the starting point, it is obviously
not continuous—there are gaps in it vastly more inclusive than the few (comparatively
speaking) points that are present. If we extend the limitations so that the line includes
all points corresponding to ordinary proper and improper fractions like ¼ and 17⁄29
and 1633⁄7—what the mathematician calls the rational numbers—we shall apparently
fill in these gaps; and I think the layman’s first impulse would be to say that the
line is now continuous. Certainly we cannot stand now at one point on the line and
name the “next” point, as we could a moment ago. There is no “next” rational number
to 116⁄125, for instance; 115⁄124 comes before it and 117⁄126 comes after it, but
between it and either, or between it and any other rational number we might name,
lie many others of the same sort. Yet in spite of the fact that the line containing
all these rational points is now “dense” (the technical term for the property I have
just indicated), it is still not continuous; for I can easily define numbers that
are not contained in it—irrational numbers in infinite variety like ; or, even worse, the number pi = 3.141592 … which defines the ratio of the circumference
of a circle to the diameter, and many other numbers of similar sort.
If the line only includes the points that correspond to the whole numbers 1, 2, 3, etc., from the starting point, it’s clearly not continuous—there are big gaps in it compared to the few (in relative terms) points that are there. If we broaden the limits so that the line includes all points that correspond to regular proper and improper fractions like ¼, 17⁄29, and 1633⁄7—what mathematicians refer to as rational numbers—we seemingly fill in these gaps; and I think most people would first assume that the line is now continuous. We certainly can’t stand at one point on the line and identify the “next” point like we could before. There’s no “next” rational number after 116⁄125, for instance; 115⁄124 comes before it and 117⁄126 comes after it, but between it and either, or between it and any other rational number we might name, there are many others of the same kind. Yet despite the fact that the line containing all these rational points is now “dense” (the technical term for the property I just described), it’s still not continuous; I can easily define numbers that are not included in it—irrational numbers in endless variety like ; or, even worse, the number pi = 3.141592 … which defines the ratio of the circumference of a circle to the diameter, along with many other numbers of a similar nature.
If the line is to be continuous, there may be no holes in it at all; it must have a point corresponding to every number I can possibly name. Similarly for the plane, and for our three-space; if they are to be continuous, the one must contain a point for every possible pair of numbers x and y, and the other for [148]every possible set of three numbers x, y and z, that I can name. There may be no holes in them at all.
If the line is to be continuous, it can't have any gaps; there has to be a point for every number I can name. The same goes for the plane and our three-dimensional space; for them to be continuous, one must include a point for every possible pair of numbers x and y, and the other must have a point for [__A_TAG_PLACEHOLDER_0__]every possible set of three numbers x, y, and z that I can name. There can’t be any gaps in them either.
A line is a continuum of points. A plane is a continuum of points. A three-space is a continuum of points. These three cases differ only in their dimensionality; it requires but one number to determine a point of the first continuum, two and three respectively in the second and third cases. But the essential feature is not that a continuum shall consist of points, or that we shall be able to visualize a pseudo-real existence for it of just the sort that we can visualize in the case of line, plane and point. The essential thing is merely that it shall be an aggregate of elements numerically determined in such a way as to leave no holes, but to be just as continuous as the real number system itself. Examples, however, aside from the three which I have used, are difficult to construct of such sort that the layman shall grasp them readily; so perhaps, fortified with the background of example already presented, I may venture first upon a general statement.
A line is a series of points. A plane is a series of points. A three-dimensional space is a series of points. These three cases only differ in how many dimensions they have; it takes one number to pinpoint a location in the first case, and two and three numbers respectively in the second and third cases. But the key aspect isn’t that a continuous set consists of points, or that we can visualize some kind of pseudo-reality for it, just like we can with a line, plane, and point. The important thing is that it should be a collection of elements defined in a way that leaves no gaps, but is just as continuous as the real number system itself. However, aside from the three examples I've used, it's tough to come up with more that a general audience can easily understand; so perhaps, with the examples given, I can first make a general statement.
The Continuum in General
Suppose we have a set of “elements” of some sort—any sort. Suppose that these elements possess one or more fundamental identifying characteristics, analogous to the coordinates of a point, and which, like these coordinates, are capable of being given numerical values. Suppose we find that no two elements of the set possess identically the same set of defining values. Suppose finally—and this is the critical test—that the elements of the set are such [149]that, no matter what numerical values we may specify, it we do specify the proper number of defining magnitudes we define by these an actual element of the set, that corresponds to this particular collection of values. Our elements then share with the real number system the property of leaving no holes, of constituting a continuous succession in every dimension which they possess. We have then a continuum. Whatever its elements, whatever the character of their numerical identifiers, whatever the number n of these which stands for its dimension, there may be no holes or we have no continuum. There must be an element for every possible combination of n numbers we can name, and no two of these combinations may give the same element. Granted this condition, our elements constitute a continuum.
Imagine we have a group of "elements" of any kind. Let's say these elements have one or more basic identifying features, similar to the coordinates of a point, and these features can be assigned numerical values. Now, let’s assume that no two elements in this group have exactly the same set of defining values. Finally—and this is the essential test—these elements are arranged [__A_TAG_PLACEHOLDER_0__]so that, regardless of the numerical values we choose to specify, if we do select the correct number of defining attributes, we identify an actual element of the set that corresponds to this specific collection of values. Our elements then share with the real number system the characteristic of having no gaps, forming a continuous sequence in every dimension they possess. Thus, we have a continuum. Regardless of what its elements are, what their numerical identifiers look like, or how many n dimensions there are, there must be no gaps for it to be a continuum. There needs to be an element for every possible combination of n numbers we can define, and no two combinations can produce the same element. If this condition holds, then our elements create a continuum.
As I have remarked, it is not easy to cite examples of continua which shall mean anything
to the person unaccustomed to the term. The totality of carbon-oxygen-nitrogen-hydrogen
compounds suggested by one essayist as an example is not a continuum at all, for the
set contains elements corresponding only to integer values of the numbers which tell
us how many atoms of each substance occur in the molecule. We cannot have a compound
containing carbon atoms, or
oxygen atoms. Perhaps the most satisfactory of the continua, outside the three Euclidean
space-continua already cited, [is the manifold of music notes. This is four-dimensional;
each note has four distinctions—length, pitch, intensity, timbre—to distinguish it
perfectly, to tell how long, how high, how loud, how rich.]263 We might have a little difficulty in reducing the characteristic of richness to [150]numerical expression, but presumably it could be done; and we should then be satisfied
that every possible combination of four values, l, p, i, t for these four identifying characteristics would give us a musical effect, and one
to be confused with no other.
As I mentioned, it's not easy to give examples of continuums that mean anything to someone unfamiliar with the term. The complete set of carbon-oxygen-nitrogen-hydrogen compounds suggested by one writer as an example isn't a continuum at all, since the set consists of elements that correspond only to whole number values indicating how many atoms of each substance are in the molecule. We can't have a compound with carbon atoms, or
oxygen atoms. Perhaps the most satisfying continuum, aside from the three Euclidean space-continuums already mentioned, is the variety of music notes. This is four-dimensional; each note has four features—length, pitch, intensity, timbre—to distinguish it perfectly, explaining how long, how high, how loud, and how rich it is. We might struggle a bit to express the characteristic of richness in numerical terms, but it could likely be done; and we would then be confident that every possible combination of four values, l, p, i, t for these four identifying characteristics, would produce a musical effect that isn’t confused with any other.
There is in the physical world a vast quantity of continua of one sort or another. The music-note continuum brings attention to the fact that not all of these are such that their elements make their appeal to the visual sense. This remark is a pertinent one; for we are by every right of heritage an eye-minded race, and it is frequently necessary for us to be reminded that so far as the external world is concerned, the verdict of every other sense is entirely on a par with that of sight. The things which we really see, like matter, and the things which we abstract from these visual impressions, like space, are by no means all there is to the world.
There is a vast array of continua in the physical world. The music-note continuum highlights that not all of these appeal to our sense of sight. This is an important point to consider; as a visually-oriented species, we often need to be reminded that, regarding the external world, every other sense is just as valid as sight. The things we actually see, like physical objects, and the concepts we derive from these visual experiences, like space, are far from everything that exists in the world.
Euclidean and Non-Euclidean Spaces
If we are dealing with a continuum of any sort whatever having one or two or three dimensions, we are able to represent it graphically by means of the line, the plane, or the three-space. The same set of numbers that defines an element of the given continuum likewise defines an element of the Euclidean continuum of the same dimensionality; so the one continuum corresponds to the other, element for element, and either may stand for the other. But if we have a continuum of four or more dimensions, this representation breaks down in the absence of a real, four-dimensional Euclidean point-space to [151]serve as a picture. This does not in the least detract from the reality of the continuum which we are thus prevented from representing graphically in the accustomed fashion.
If we're talking about a continuum of any kind, whether it's one, two, or three dimensions, we can visually represent it using a line, a plane, or three-dimensional space. The same set of numbers that defines an element of the given continuum also defines an element of the Euclidean continuum with the same number of dimensions; so one continuum corresponds to the other, element by element, and either can represent the other. However, when we deal with a continuum of four or more dimensions, this representation fails without a true four-dimensional Euclidean point-space to [__A_TAG_PLACEHOLDER_0__]serve as a visual reference. This doesn’t change the reality of the continuum we can't represent graphically in the usual way.
The Euclidean representation, in fact, may in some cases be unfortunate—it may be so entirely without significance as to be actually misleading. For in the Euclidean continuum of points, be it line, plane or three-space, there are certain things which we ordinarily regard as secondary derived properties, but which possess a great deal of significance none the less.
The Euclidean representation can sometimes be unhelpful—it can be so totally insignificant that it ends up being misleading. In the Euclidean continuum of points, whether it’s a line, plane, or three-dimensional space, there are things we usually consider secondary derived properties, yet they still have a lot of significance.
In particular, in the Euclidean plane and in Euclidean three-space, there is the distance between two points. I have indicated, in the chapter on non-Euclidean geometry, that the parallel postulate of Euclid, which distinguishes his geometry from others, could be replaced by any one of numerous other postulates. Grant Euclid’s postulate and you can prove any of these substitutes; grant any of the substitutes and you can prove Euclid’s postulate. Now it happens that there is one of these substitutes to which modern analysis has given a position of considerable importance. It is merely our good old friend the Pythagorean theorem, that the square on the hypotenuse equals the sum of the squares on the sides; but it is dressed in new clothes for the present occasion.
In the Euclidean plane and in Euclidean three-dimensional space, there is a distance between two points. I mentioned in the chapter on non-Euclidean geometry that Euclid's parallel postulate, which sets his geometry apart from others, could be swapped out for any number of other postulates. Accept Euclid's postulate, and you can prove any of these alternatives; accept any of the alternatives, and you can prove Euclid's postulate. Interestingly, one of these alternatives has gained significant importance in modern analysis. It’s simply our old friend, the Pythagorean theorem, which states that the square of the hypotenuse equals the sum of the squares of the other two sides; but it's been presented in a new way for this discussion.
Mr. Francis’ discussion of this part of the subject, and especially his figure, ought
to make it clear that this theorem can be considered as dealing with the distance
between any two points. When we so consider it, and take it as the fundamental, defining
postulate [152]of Euclidean geometry which distinguishes this geometry from others, we have a statement
of considerable content. We have, first, that the characteristic property of Euclidean
space is that the distance between two points is given by the square root of the sum
of the squares of the coordinate-differences for these points—by the expression
where the large letters represent the coordinates of the one point and the small ones
those of the other. We have more than this, however; we have that this distance is
the same for all observers, no matter how different their values for the individual
coordinates of the individual points. And we have, finally, as a direct result of
looking upon the thing from this viewpoint, that the expression for D is an “invariant”; which simply means that every observer may use the same expression
in calculating the value of D in terms of his own values for the coordinates involved. The distance between two points in our space
is given numerically by the square root of the sum of the squares of my coordinate-differences for the two points involved; it is given equally by the square
root of the sum of the squares of your coordinate-differences, or those of any other observer whatsoever. We have then a
natural law—the fundamental natural law characterizing Euclidean space. If we wish to apply it to
the Euclidean two-space (the plane) we have only to drop out the superfluous coordinate-difference;
if we wish to see by analogy what would be the fundamental natural law for a [153]four-dimensional Euclidean space, we have only to introduce under the radical a fourth
coordinate-difference for the fourth dimension.
Mr. Francis's discussion on this topic, particularly his illustration, should make it clear that this theorem relates to the distance between any two points. When we view it this way, and treat it as the foundational postulate [__A_TAG_PLACEHOLDER_0__] of Euclidean geometry that sets it apart from other geometries, we arrive at a statement with significant meaning. First, the key feature of Euclidean space is that the distance between two points is determined by the square root of the sum of the squares of the differences in their coordinates—expressed as
where the uppercase letters represent the coordinates of one point and the lowercase letters represent those of the other. However, there is more; this distance remains the same for all observers, regardless of how different their values are for the coordinates of each point. Finally, from this perspective, we find that the expression for D is an “invariant,” which means that every observer can use the same expression to calculate the value of D based on his own values for the involved coordinates. The distance between two points in our space is numerically expressed as the square root of the sum of the squares of my coordinate differences for the two points; it can also be expressed as the square root of the sum of the squares of your coordinate differences, or those of any other observer. This gives us a natural law—the fundamental natural law defining Euclidean space. If we wish to apply this to two-dimensional Euclidean space (the plane), we only need to exclude the unnecessary coordinate difference; if we want to understand the fundamental natural law of a [__A_TAG_PLACEHOLDER_0__] four-dimensional Euclidean space, we just introduce a fourth coordinate difference under the radical.
If we were not able to attach any concrete meaning to the expression for D the value of all this would be materially lessened. Consider, for instance, the continuum of music notes. There is no distance between different notes. There is of course significance in talking about the difference in pitch, in intensity, in duration, in timbre, between two notes; but there is none in a mode of speech that implies a composite expression indicating how far one note escapes being identical with another in all four respects at once. The trouble, of course, is that the four dimensions of the music-note continuum are not measurable in terms of a common unit. If they were, we should expect to measure their combination more or less absolutely in terms of this same unit. We can make measurements in all three dimensions of Euclidean space with the same unit, with the same measuring rod in fact. [This presents a peculiarity of our three-space which is not possessed by all three-dimensional manifolds. Riemann has given another illustration in the system of all possible colors, composed of arbitrary proportions of the three primaries, red, green and violet. This system forms a three-dimensional continuum; but we cannot measure the “distance” or difference between two colors in terms of the difference between two others.]130
If we can't give any concrete meaning to the term D, everything would lose much of its value. Take, for example, the continuum of musical notes. There's no distance between different notes. Of course, there's importance in discussing differences in pitch, intensity, duration, and timbre between two notes; but there's no use in a way of expressing that suggests a complex comparison about how one note isn’t identical to another in all four aspects at the same time. The issue, of course, is that the four dimensions of the musical note continuum can’t be measured using a common unit. If they could, we would expect to measure their combination fairly accurately using that same unit. We can measure in all three dimensions of Euclidean space with the same unit, using the same measuring device, actually. [This highlights a unique aspect of our three-dimensional space that isn’t found in all three-dimensional manifolds. Riemann provided another example with the system of all possible colors, made up of various combinations of the three primary colors: red, green, and violet. This system forms a three-dimensional continuum; however, we can’t measure the “distance” or difference between two colors based on the difference between two others.]130
Accordingly, in spite of the fact that the Euclidean three-space gives us a formal representation of the color continuum, and in spite of the fact that the hypothetical four-dimensional Euclidean space would [154]perform a like office for the music-note continuum, this representation would be without significance. We should not say that the geometry of these two manifolds is Euclidean. We should realize that any set of numerical elements can be plotted in a Euclidean space of the appropriate dimensionality; and that accordingly, before allowing such a plot to influence us to classify the geometry of the given manifold as Euclidean, we must pause long enough to ask whether the rest of the Euclidean system fits into the picture. If the square root of the sum of the squares of the coordinate-differences between two elements possesses significance in the given continuum, and if it is invariant between observers of that continuum who employ different bases of reference, then and only then may we allege the Euclidean character of the given continuum.
Therefore, even though Euclidean three-dimensional space provides a formal way to represent the color continuum, and even though a hypothetical four-dimensional Euclidean space would [__A_TAG_PLACEHOLDER_0__]serve a similar purpose for the music-note continuum, this representation would be meaningless. We should not claim that the geometry of these two manifolds is Euclidean. We need to recognize that any set of numerical values can be plotted in a Euclidean space of the right dimensions; thus, before letting such a plot sway us into categorizing the geometry of the given manifold as Euclidean, we must take a moment to consider whether the entire Euclidean system aligns with the situation. If the square root of the sum of the squares of the differences between the coordinates of two elements holds meaning in the given continuum and is consistent between observers of that continuum who use different reference frames, then and only then can we assert the Euclidean nature of that continuum.
If under this test the given continuum fails of Euclideanism, it is in order to ask what type of geometry it does present. If it is of such character that the “distance” between two elements possesses significance, we should answer this question by investigating that distance in the hope of discovering a non-Euclidean expression for it which will be invariant. If it is not of such character, we should seek some other characteristic of single elements or groups of elements, of real physical significance and of such sort that the numerical expression for it would be invariant.
If the given continuum doesn't meet the standards of Euclidean geometry under this test, we should then ask what kind of geometry it actually represents. If the "distance" between two elements is meaningful, we should explore that distance to find a non-Euclidean way to express it that remains consistent. If it's not meaningful in that way, we should look for another characteristic of individual elements or groups of elements that has real physical significance and for which the numerical expression would be consistent.
If the continuum with which we have to do is one in which the “distance” between two elements possesses significance, and if it turns out that the invariant expression for this distance is not the Pythagorean [155]one, but one indicating the non-Euclideanism of our continuum, we say that this continuum has a “curvature.” This means that, if we interpret the elements of our continuum as points in space (which of course we may properly do) and if we then try to superpose this point-continuum upon a Euclidean continuum, it will not “go”; we shall be caught in some such absurdity as trying to force a sphere into coincidence with a plane. And of course if it won’t go, the only possible reason is that it is curved or distorted, like the sphere, in such a way as to prevent its going. It is unfortunate that the visualizing of such curvature requires the visualizing of an additional dimension for the curved continuum to curve into; so that while we can picture a curved surface easily enough, we can’t picture a curved three-space or four-space. But that is a barrier to visualization alone, and in no sense to understanding.
If the continuum we’re dealing with has a “distance” between two elements that matters, and it turns out that the consistent definition of this distance isn’t the traditional Pythagorean one, but rather one that shows the non-Euclidean nature of our continuum, we say this continuum has “curvature.” This means that if we think of the elements of our continuum as points in space (which we can certainly do), and then we try to overlay this point-continuum on a Euclidean continuum, it won’t work; we’ll end up in a ridiculous situation, like trying to fit a sphere into a plane. And obviously, if it won’t fit, the only reasonable explanation is that it is curved or distorted, like the sphere, in such a way that makes fitting impossible. Unfortunately, visualizing such curvature requires us to imagine an additional dimension for the curved continuum to curve into; while we can easily picture a curved surface, visualizing a curved three-dimensional or four-dimensional space is challenging. But that’s just a limitation of visualization, not of understanding.
Our 4D World
It will be observed that we have now a much broader definition of non-Euclideanism than the one which served us for the investigation of Euclid’s parallel postulate. If we may at pleasure accept this postulate or replace it by another and different one, we may presumably do the same for any other or any others of Euclid’s postulates. The very statement that the distance between elements of the continuum shall possess significance, and shall be measurable by considering a path in the continuum which involves other elements, is an assumption. If we discard it altogether, or replace it by one postulating [156]that some other joint property of the elements than their distance be the center of interest, we get a non-Euclidean geometry. So for any other of Euclid’s postulates; they are all necessary for a Euclidean system, and in the absence of any one of them we get a non-Euclidean system.
It will be noted that we now have a much broader definition of non-Euclideanism than what we used to investigate Euclid’s parallel postulate. If we can freely accept this postulate or swap it out for a different one, we can presumably do the same with any other of Euclid’s postulates. The very assertion that the distance between elements of the continuum should have significance and can be measured by considering a path in the continuum that involves other elements is an assumption. If we ignore it entirely or replace it with one that suggests a different joint property of the elements is the main focus, we end up with a non-Euclidean geometry. This applies to any of Euclid’s postulates; they are all essential for a Euclidean system, and if any one of them is missing, we create a non-Euclidean system.
Now the four-dimensional time-space continuum of Minkowski is plainly of a sort which ought to make susceptible of measurement the separation between two of its events. We can pass from one element to another in this continuum—from one event to another—by traversing a path involving “successive” events. Our very lives consist in doing just this: we pass from the initial event of our career to the final event by traversing a path leading us from event to event, changing our time and space coordinates continuously and simultaneously in the process. And while we have not been in the habit of measuring anything except the space interval between two events and the time interval between two events, separately, I think it is clear enough that, considered as events, as elements in the world of four dimensions, there is a less separation between two events that occur in my office on the same day than between two which occur in my office a year apart; or between two events occurring 10 minutes apart when both take place in my office than when one takes place there and one in London or on Betelgeuse.
Now, the four-dimensional space-time continuum of Minkowski clearly offers a way to measure the distance between two of its events. We can move from one point to another in this continuum—from one event to the next—by following a path that includes "successive" events. Our lives are essentially made up of this: we go from the starting event of our lives to the final event by following a path that takes us from one event to another, continuously and simultaneously adjusting our time and space coordinates. While we've mainly focused on measuring just the space interval between two events and the time interval between them, I think it's evident that, when viewed as events, as elements in the four-dimensional world, there is less separation between two events that happen in my office on the same day than between two that occur a year apart; or between two events that happen 10 minutes apart when both occur in my office than when one takes place there and one in London or on Betelgeuse.
It is not at all unreasonable, a priori, then, to seek a numerical measure for the separation, in space-time of four-dimensions, of two events. If we find it, we shall doubtless be asked just what its [157]subjective significance to us is. This must be answered with some circumspection. It will presumably be something which we cannot observe with the visual sense alone, or it would have forced itself upon our attention thousands of years ago. It ought, I should think, to be something that we would sense by employing at the same time the visual sense and the sense of time-passage. In fact, I might very plausibly insist that, by my very remarks about it in the above paragraph, I have sensed it.
It’s completely reasonable, a priori, to look for a numerical measure of the separation, in space-time across four dimensions, between two events. If we find one, people will likely ask what its [__A_TAG_PLACEHOLDER_0__] subjective significance is to us. We need to respond thoughtfully to this question. It's likely something we can't perceive with just our eyesight, or else it would have caught our attention long ago. I think it should be something we experience by using both our visual sense and our perception of time passing. In fact, I might reasonably argue that, by discussing it in the previous paragraph, I have sensed it.
Minkowski, however, was not worried about this phase of the matter. He had only to
identify the invariant expression for distance; sensing it could wait. He found, of
course, that this expression was not the Euclidean expression for a four-dimensional
interval. He had discarded several of the Euclidean assumptions and could not expect
that the postulate governing the metric properties of Euclid’s space would persist.
Especially had he violated the Euclidean canons in discarding, with Einstein, the
notion that nothing which may happen to a measuring rod in the way of uniform translation
at high velocity can affect its measures. So he had to be prepared to find that his
geometry was non-Euclidean; yet it is surprising to learn how slightly it deviates
from that of Euclid. Without any extended discussion to support the statement, we
may say that he found that when two observers measure the time- and the space-coordinates
of two events, using the assumptions and therefore the methods of Einstein and hence
subjecting themselves to the condition that their measures of the pure time-interval
and of the pure space-interval between these [158]events will not necessarily be the same, they will discover that they both get the
same value for the expression
If our acceptance of this as the numerical measure of the separation in space-time
between the two events should lead to contradiction we could not so accept it. No
contradiction arises however and we may therefore accept it. And at once the mathematician
is ready with some interpretative remarks.
Minkowski wasn't concerned about this aspect of the matter. He just needed to find the consistent way to express distance; he felt that could wait. He discovered, of course, that this expression wasn’t the same as the Euclidean one for a four-dimensional interval. He had rejected several Euclidean assumptions and couldn’t expect the principles governing the metric characteristics of Euclid's space to hold true. In particular, he had gone against Euclidean principles by discarding, along with Einstein, the idea that nothing affecting a measuring rod during uniform motion at high speeds can alter its measurements. So, he had to be prepared to realize that his geometry was non-Euclidean; yet it’s surprising how little it diverges from Euclid's geometry. Without going into a lengthy explanation, we can say that he found when two observers measure the time and space coordinates of two events, using Einstein's principles and thus accepting that their measures of the pure time interval and the pure space interval between these [__A_TAG_PLACEHOLDER_0__]events might not be the same, they will both arrive at the same value for the expression
If accepting this as the numerical measure of the separation in space-time between the two events leads to any contradictions, we couldn’t accept it. However, no contradiction arises, so we can accept it. Immediately, the mathematician is ready with some explanatory comments.
The Curvature of Space-Time
The invariant expression for separation, it will be seen, is in the same form as that of the Euclidean four-dimensional invariant save for the minus sign before the time-difference (the appearance of the constant C in connection with the time coordinate t is merely an adjustment of units; see page 153). This tells us that not alone is the geometry of the time-space continuum non-Euclidean in its methods of measurement, but also in its results, to the extent that it possesses a curvature. It compares with the Euclidean four-dimensional continuum in much the same way that a spherical surface compares with a plane. As a matter of fact, a more illuminating analogy here would be that between the cylindrical surface and the plane, though neither is quite exact. To make this clear requires a little discussion of an elementary notion which we have not yet had to consider. [159]
The expression for separation remains unchanged, except for the minus sign in front of the time difference, similar to the Euclidean four-dimensional invariant (the inclusion of the constant C related to the time coordinate t is just a unit adjustment; see page 153). This indicates that not only is the geometry of the time-space continuum non-Euclidean in how it measures, but it also yields non-Euclidean results due to its curvature. It relates to the Euclidean four-dimensional continuum much like a spherical surface relates to a flat plane. In fact, a more helpful comparison would be between a cylindrical surface and a plane, although neither is completely accurate. To clarify this, we need to discuss a basic concept that we haven’t addressed yet. [__A_TAG_PLACEHOLDER_0__]
Our three-dimensional existence often reduces, for all practical purposes, to a two-dimensional one. The objects and the events of a certain room may quite satisfactorily be defined by thinking of them, not as located in space, but as lying in the floor of the room. Mathematically the justification for this viewpoint is got by saying that we have elected to consider a slice of our three-dimensional world of the sort which we know as a plane. When we consider this plane and the points in it, we find that we have taken a cross-section of the three-dimensional world. A line in that world is now reduced, for us, to a single point—the point where it cuts our plane; a plane is reduced to a line—the line where it cuts our plane; the three-dimensional world itself is reduced to our plane itself. Everything three-dimensional falls down into its shadow in our plane, losing in the process that one of the three dimensions which is not present in our plane.
Our three-dimensional existence often boils down, for all practical purposes, to a two-dimensional one. The objects and events in a certain room can be effectively described by thinking of them not as existing in space, but as lying flat on the floor of the room. Mathematically, this perspective is justified by saying that we have chosen to consider a slice of our three-dimensional world, which we know as a plane. When we look at this plane and the points within it, we realize that we've taken a cross-section of the three-dimensional world. A line in that world now reduces, for us, to a single point—the point where it intersects our plane; a plane becomes a line—the line where it intersects our plane; and the three-dimensional world itself is simplified to our plane. Everything three-dimensional collapses into its projection on our plane, losing the one dimension that isn't present in our plane.
For simplicity’s sake it is usual to take a cross-section of space parallel to one of our coordinate axes. We think of our three dimensions as extending in the directions of those axes; and it is easier to take a horizontal or vertical section which shall simply wipe out one of these dimensions than to take an oblique section which shall wipe out a dimension that consists partly of our original length, and partly of our original width, and partly of our original height.
For simplicity, we usually take a cross-section of space that is parallel to one of our coordinate axes. We view our three dimensions as extending in the directions of those axes; and it's easier to take a horizontal or vertical section that just eliminates one of these dimensions than to take a diagonal section that removes a dimension made up of parts of our original length, width, and height.
If we have a four-dimensional manifold to begin with, we may equally shake out one of the four dimensions, one of the four coordinates, and consider the three-dimensional result of this process as [160]a cross-section of the original four-dimensional continuum. And where, in cross-sectioning a three-dimensioned world, we have but three choices of a coordinate to eliminate, in cross-sectioning a world of four dimensions we have four choices. By dropping out either the x, or the y, or the z, or the t, we get a three-dimensioned cross-section.
If we start with a four-dimensional manifold, we can remove one of the four dimensions, one of the four coordinates, and view the resulting three-dimensional outcome as [__A_TAG_PLACEHOLDER_0__] a cross-section of the original four-dimensional continuum. In a three-dimensional world, we have three options for which coordinate to drop, but in a four-dimensional world, we have four options. By eliminating either the x, y, z, or t, we obtain a three-dimensional cross-section.
Now our accustomed three-dimensional space is strictly Euclidean. When we cross-section it, we get a Euclidean plane no matter what the direction in which we make the cut. Likewise the Euclidean plane is wholly Euclidean, because when we cross-section it in any direction whatever we get a Euclidean line. A cylindrical surface, on the other hand, is neither wholly Euclidean nor wholly non-Euclidean in this matter of cross-sectioning. If we take a section in one direction we get a Euclidean line and if we take a section in the other direction we get a circle (if the cylindrical surface be a circular one). And of course if we take an oblique section of any sort, it is neither line nor circle, but a compromise between the two—the significant thing being that it is still not a Euclidean line.
Now, our familiar three-dimensional space is purely Euclidean. When we make a cross-section, we end up with a Euclidean plane no matter which way we cut it. Similarly, the Euclidean plane is completely Euclidean because when we take a cross-section in any direction, we get a Euclidean line. In contrast, a cylindrical surface is neither entirely Euclidean nor completely non-Euclidean when it comes to cross-sectioning. If we take a section in one direction, we get a Euclidean line, and if we cut it in another direction, we get a circle (if the cylindrical surface is circular). And, of course, if we make an oblique section of any kind, it is neither a line nor a circle, but a blend of the two—the crucial point being that it is still not a Euclidean line.
The space-time continuum presents an analogous situation. When we cross-section it by dropping out any one of the three space dimensions, we get a three-dimensional complex in which the distance formula is still non-Euclidean, retaining the minus sign before the time-difference and therefore retaining the geometric character of its parent. But if we take our cross-section in such a way as to eliminate the time coordinate, this peculiarity disappears. The signs in the invariant expression are then all [161]plus, and the cross-section is in fact our familiar Euclidean three-space.
The space-time continuum creates a similar situation. When we slice through it by removing one of the three space dimensions, we end up with a three-dimensional complex where the distance formula remains non-Euclidean, keeping the minus sign in front of the time difference and thus preserving the geometric character of its original form. However, if we take our cross-section in a way that removes the time coordinate, this unique feature disappears. The signs in the invariant expression then become all [__A_TAG_PLACEHOLDER_0__]plus, and the cross-section essentially becomes our familiar Euclidean three-space.
If we set up a surface geometry on a sphere, we find that the elimination of one dimension leaves us with a line-geometry that is still non-Euclidean since it pertains to the great circles of the sphere rather than to Euclidean straight lines. In shaking Minkowski’s continuum down into a three-dimensional one by eliminating any one of his coordinates, if we eliminate either the x, the y or the z, we have left a three-dimensional geometry in which the disturbing minus sign still occurs in the distance-formula, and which is therefore still non-Euclidean. If we omit the t, this does not occur. We see, then, that the time dimension is the disturbing factor, the one which gives to space-time its non-Euclidean character so far as the possession of curvature is concerned. And we see that this curvature is not the same in all directions, and in one direction is actually zero—whence the attempted analogy with a cylinder instead of with a sphere.
If we create a surface geometry on a sphere, we notice that removing one dimension gives us a line geometry that remains non-Euclidean because it relates to the great circles of the sphere instead of Euclidean straight lines. When we simplify Minkowski's continuum into three dimensions by dropping one of his coordinates, whether it's the x, the y, or the z, we end up with a three-dimensional geometry that still has the troubling minus sign in the distance formula, meaning it is still non-Euclidean. However, if we drop the t, that issue disappears. This shows us that the time dimension is the problematic element, the one that gives space-time its non-Euclidean nature when it comes to curvature. Additionally, we find that this curvature isn't uniform in all directions, and in one direction, it is actually zero—leading to the analogy with a cylinder instead of a sphere.
Many writers on relativity try to give the space-time continuum an appeal to our reason and a character of inevitableness by insisting on the lack of any fundamental distinction between space and time. The very expression for the space-time invariant denies this. Time is distinguishable from space. The three dimensions of space are quite indistinguishable—we can interchange them without affecting the formula, we can drop one out and never know which is gone. But the very formula singles out time as distinct from space, as inherently different in some way. It is not so inherently different as we have [162]always supposed; it is not sufficiently different to offer any obstacle to our thinking in terms of the four-dimensional continuum. But while we can group space and time together in this way, [this does not mean at all that space and time cease to differ. A cook may combine meat with potatoes and call the product hash, but meat and potatoes do not thereby become identical.]223
Many writers on relativity try to make the space-time continuum appealing to our reason and give it a sense of inevitability by emphasizing there’s no real difference between space and time. However, the very term for the space-time invariant contradicts this. Time is distinguishable from space. The three dimensions of space can be considered identical—we can switch them around without changing the formula, and we can leave one out and never realize it’s missing. But the formula itself treats time as distinct from space, as inherently different in some way. It’s not as distinctly different as we’ve always thought; it’s not different enough to prevent us from thinking in terms of the four-dimensional continuum. However, while we can group space and time like this, [it doesn’t mean that space and time are no longer different. A cook might mix meat with potatoes and call the dish hash, but that doesn’t make meat and potatoes the same.]223
The Visualization Question
To the layman there is a great temptation to say that while, mathematically speaking, the space-time continuum may be a great simplification, it does not really represent the external world. To be sure, you can’t see the space-time continuum in precisely the same way that you can the three-dimensional space continuum, but this is only because Einstein finds the time dimension to be not quite freely interchangeable with the space dimension. Yet you do perceive this space-time continuum, in the manner appropriate for its perception; and it would be just as sensible to throw out the space continuum itself on the ground that perception of the two is not of exactly the same sort, as to throw out the space-time continuum on this ground. With appropriate conventions, either may stand as the mental picture of the external world; it is for us to choose which is the more convenient and useful image. Einstein tells us that his image is the better, and tells us why.
To the average person, it’s really tempting to argue that while the space-time continuum might be a useful simplification in math, it doesn't truly reflect the outside world. Sure, you can't visualize the space-time continuum in the same way you can with three-dimensional space, but that's just because Einstein views the time dimension as not being completely interchangeable with the space dimension. However, you do perceive this space-time continuum in a way that's fitting for its perception; it would be just as unreasonable to dismiss the space continuum just because our perception of the two isn’t exactly the same, as it would be to disregard the space-time continuum for that reason. With the right conventions, either can represent our mental image of the external world; it's up to us to determine which one is the more practical and useful representation. Einstein claims that his model is the superior one and explains why.
Before we look into this, we must let him tell us something more about the geometry of his continuum. What he tells us is, in its essentials, just [163]this. The observer in a pure space continuum of three dimensions finds that as he changes his position, his right-and-left, his backward-and-forward, and his up-and-down are not fixed directions inherent in nature, but are fully interchangeable. The observers, in the adjoined sketch, whose verticals are as indicated by the arrows, find very different vertical and horizontal components for the distance between the points O and P; a similar situation would prevail if we used all three space directions. The statement analogous to this for Einstein’s four-dimensional continuum of space and time combined
Before we dive into this, we need to have him explain a bit more about the geometry of his continuum. Essentially, what he tells us is just [__A_TAG_PLACEHOLDER_0__] this. An observer in a pure three-dimensional space continuum finds that as he moves, his right/left, forward/backward, and up/down directions aren’t fixed in nature, but are completely interchangeable. The observers in the accompanying sketch, indicated by the arrows, notice very different vertical and horizontal components for the distance between points O and P; a similar situation would happen if we considered all three spatial directions. The analogous statement for Einstein’s four-dimensional continuum of combined space and time is similar.

is that, as observers change their relative motion, their time axes take slightly different directions, so that what is purely space or purely time for the one becomes space with a small component in the time direction, or time with a small component in the space direction, for the other. This it will be seen explains fully why observers in relative motion can [164]differ about space and time measurements. We should not be at all surprised if the two observers of the figure reported different values for horizontals and verticals; we should realize that what was vertical for one had become partly horizontal for the other. It is just so, says Einstein, with his observers of time and space who are in relative motion to one another; what one sees as space the other sees as partly time, because their time axes do not run quite parallel.
As observers change their relative motion, their time axes shift slightly in different directions. This means that what is purely space or purely time for one observer becomes space with a small time component for the other, or time with a small space component. This explains why observers in relative motion can [__A_TAG_PLACEHOLDER_0__]differ in their measurements of space and time. It's not surprising if the two observers shown report different values for horizontal and vertical; what one person sees as vertical may partly appear horizontal to the other. According to Einstein, this is how observers of time and space in relative motion perceive things: what one sees as space, the other sees as partly time, because their time axes are not perfectly parallel.
The natural question here, of course, is “Well, where are their time axes?” If you know what to look for, of course, you ought to be able to perceive them in just the way you perceive ordinary time intervals—with the reservation that they are imaginary, after all, just like your space axes, and that you must only expect to see them in imagination. If you look for a fourth axis in Euclidean three-space to represent your time axis, you will of course not find it. But you will by all means agree with me that your time runs in a definite direction; and this it is that defines your time axis. Einstein adds that if you and I are in relative motion, my time does not run in quite the same direction as yours.
The natural question here is, “Well, where are their time axes?” If you know what to look for, you should be able to see them just like you perceive regular time intervals—with the caveat that they are imaginary, just like your space axes, and you should only expect to visualize them in your imagination. If you try to find a fourth axis in three-dimensional space to represent your time axis, you won’t find it. But you will definitely agree with me that your time moves in a specific direction; this is what defines your time axis. Einstein adds that if you and I are moving relative to each other, my time doesn’t run in quite the same direction as yours.
How shall we prove it? Well, how would we prove it if he told us that our space axes did not run in precisely the same direction? Of course we could not proceed through direct measures upon the axes themselves; we know these are imaginary. What we should do would be to strike out, each of us, a very long line indeed in what seemed the true horizontal direction; and we should hope that if we made them long enough, and measured them [165]accurately enough, we should be able to detect any divergence that might exist. This is precisely what we must do with our time axes if we wish to verify Einstein’s statement that they are not precisely parallel; and what better evidence could we demand of the truth of this statement than the evidence already presented—that when we measure our respective time components between two events, we get different results?
How should we prove it? Well, how would we prove it if he said our time axes didn’t align perfectly? Obviously, we couldn’t measure the axes directly since we know they’re imaginary. Instead, each of us should draw a very long line in what appears to be the true horizontal direction; and we’d hope that if we made them long enough and measured them [__A_TAG_PLACEHOLDER_0__]accurately enough, we’d be able to spot any divergence that might exist. This is exactly what we need to do with our time axes if we want to confirm Einstein’s claim that they aren’t exactly parallel; and what better proof could we ask for than the evidence we've already presented—that when we measure our respective time components between two events, we get different results?
What It All Leads To
The preceding chapters have been compiled and written with a view to putting the reader in a state of mind and in a state of informedness which shall enable him to derive profit from the reading of the actual competing essays which make up the balance of the book. For this purpose it has been profitable to take up in detail the preliminaries of the Special Theory of Relativity, and to allow the General Theory to go by default, in spite of the fact that it is the latter which constitutes Einstein’s contribution of importance to science. The reason for this is precisely the same as that for taking up Euclidean geometry and mastering it before proceeding to the study of Newtonian mechanics. The fundamental ideas of the two theories, while by no means identical, are in general terms the same; and the conditions surrounding their application to the Special Theory are so very much simpler than those which confront us when we apply them to the more general case, that this may be taken as the controlling factor in a popular presentation. We cannot omit [166]the General Theory from consideration, of course; but we can omit it from our preliminary discussion, and leave its development to the complete essays which follow, and which in almost every case give it the larger half of their space which its larger content demands. In the process of the slow and difficult preparation of the lay mind for the assimilation of an altogether new set of fundamental ideas, it is altogether desirable to give the Special Theory, with its simpler applications of these ideas, a place out of proportion to its importance in Einstein’s completed structure; and this we have therefore done.
The earlier chapters have been put together to help readers get into the right mindset and gain the knowledge they need to benefit from the competing essays that make up the rest of the book. To achieve this, it has been useful to focus closely on the basics of the Special Theory of Relativity and to skip over the General Theory, even though the latter is Einstein's significant contribution to science. The reason for this is similar to why one should learn Euclidean geometry before tackling Newtonian mechanics. While the core ideas of the two theories aren't exactly the same, they are fundamentally similar. The situations in which we apply them to the Special Theory are much simpler than those we face when dealing with the more general case, and this simplicity is key for a straightforward explanation. We can't completely ignore the General Theory, of course; however, we can leave it out of our initial discussion and allow it to be explored in the complete essays that follow, which in nearly all cases dedicate a significant portion of their content to it. As we gradually prepare the average reader to understand a completely new set of fundamental concepts, it's better to give the Special Theory, with its simpler applications, more emphasis than its proportionate significance in Einstein's overall work would suggest; and that's exactly what we've done.
The Special Theory, postulating the relativity of uniform motion and deducing the consequences of that relativity, is often referred to as a “special case” of the General Theory, in which this restriction of uniformity is removed. This is not strictly speaking correct. The General Theory, when we have formulated it, will call our attention to something which we really knew all the time, but to which we chose not to give heed—that in the regions of space to which we have access, uniform motion does not exist. All bodies in these regions are under the gravitational influence of the other bodies therein, and this influence leads to accelerated motion. Nothing in our universe can possibly travel at uniform velocity; the interference of the rest of the bodies in the universe prevents this.
The Special Theory, which suggests that uniform motion is relative and explores the implications of that relativity, is often called a “special case” of the General Theory, where the limitation of uniformity is eliminated. However, this isn't entirely accurate. The General Theory, once we have fully developed it, will highlight something we’ve always known but often ignored—that in the areas of space we can observe, uniform motion doesn’t actually exist. All objects in these areas are influenced by the gravitational pull of other bodies nearby, which results in accelerated motion. Nothing in our universe can travel at a constant speed; the presence of all other bodies in the universe makes that impossible.
Obviously, we ought not to apply the term “special case” to a case that never occurs. Nevertheless, this case is of extreme value to us in our mental processes. Many of the motions with which we are [167]concerned are so nearly at constant velocity that we find it convenient to treat them as though they were uniform, either ignoring the resulting error or correcting for it at the end of our work. In many other cases we are able to learn what actually occurs under accelerated motion by considering what would have occurred under uniform motion were such a thing possible. Science is full of complications which we unravel in this fashion. The physicist deals with gas pressures by assuming temperatures to be constant, though he knows temperature never is constant; and in turn he deals with temperatures by assuming pressures to be constant. After this, he is able to predict what will happen when, as in nature, pressures and temperatures are varying simultaneously. By using as a channel of attack the artificially simple case that never occurs, we get a grip on the complex case that gives us a true picture of the phenomenon. And because in actual nature we can come as close as we please to this artificial case, by supposing the variable factor to approach constancy, so when we assume it to be absolutely constant we speak of the result as the limiting case. This situation does not occur, but is the limiting case for those that do occur.
Clearly, we shouldn't call something a “special case” if it never happens. However, this scenario is extremely valuable for our thinking. Many of the movements we're working with are so close to constant speed that it's easier for us to treat them as if they were uniform, either ignoring the resulting error or correcting for it later. In many other situations, we can understand what actually happens under accelerated motion by considering what would have happened under uniform motion if that were possible. Science is full of complexities that we figure out this way. The physicist analyzes gas pressures by assuming temperatures are constant, even though he knows that temperatures never really are; and he also considers temperatures by assuming pressures are constant. After that, he can predict what will happen when, like in nature, pressures and temperatures are changing at the same time. By examining the artificially simple case that never occurs, we gain insight into the complex case that accurately represents the phenomenon. Moreover, since in real life we can get as close as we want to this artificial case by assuming the variable factor is almost constant, when we treat it as if it is completely constant, we refer to the outcome as the limiting case. This situation doesn’t actually happen, but it serves as the limiting case for those that do.
When, in the matter of motion, we abandon the artificial, limiting case of uniform velocity and look into the general, natural one of unrestricted motion, we find that the structure which we have built up to deal with the limiting case provides us with many of the necessary ideas and viewpoints. This is what we expect—in it lies the value of the limiting case. We shall see that the relativity of time and space, [168]established for the limiting case, holds good in the general one. We shall see that the idea of the four-dimensional space-time continuum as representing the external world persists, forming the whole background of the General Theory much more definitely than in the Special Theory. Incidentally we shall see that the greater generality of the case under consideration will demand a greater degree of generality in the geometry of this continuum, a non-Euclideanism of a much more whole-hearted type than that of the Special Theory. But all the revisions of fundamental concepts which we have been at such pains to make for the sake of the Special Theory will remain with us in the General. With this we may consider our preliminary background as established, and give our attention to the essayists, who will try to take us more deeply into the subject than we have yet gone, without losing us in its intricacies.
When we move beyond the artificial constraint of uniform motion and explore the more natural concept of unrestricted motion, we discover that the framework we've developed for the restricted case offers many of the essential ideas and perspectives needed. This is what we anticipate—this is the significance of the restricted case. We will observe that the relativity of time and space, [__A_TAG_PLACEHOLDER_0__]established for the restricted case, also applies to the more general scenario. We will recognize that the concept of the four-dimensional space-time continuum, representing the external world, remains intact, providing a more solid foundation for the General Theory compared to the Special Theory. Additionally, we will find that the broader context we are exploring requires a more comprehensive approach to the geometry of this continuum, embracing a more complete form of non-Euclidean geometry than that found in the Special Theory. However, all the revisions of fundamental concepts that we painstakingly made for the Special Theory will still be applicable in the General Theory. With this established background, we can turn our attention to the essayists, who will guide us deeper into the subject without getting lost in its complexities.

Lyndon Bolton,
Winner of the Einstein Prize Essay Contest
Lyndon Bolton,
Winner of the Einstein Prize Essay Contest
[169]
[__A_TAG_PLACEHOLDER_0__]
VII
RELATIVITY
The Winning Essay in the Contest for the Eugene Higgins $5,000 Prize
The Winning Essay in the Contest for the Eugene Higgins $5,000 Prize
The reader is probably acquainted with the method of specifying positions of points in a plane by their distances from two mutually perpendicular lines, or if the points are in space by their distances from three mutually perpendicular planes like adjacent sides of a flat-sided box. The method is in fact in common use for exhibiting relations between quantities by graphs or diagrams. These sets of axes, as they are called, together with any scales used for measuring, must be supposed rigid, otherwise the events or points which they are used to specify are indefinite. The lengths which locate any point with reference to a set of axes are called its coordinates.
The reader is likely familiar with the way of defining the location of points on a plane by measuring their distances from two intersecting lines, or if the points are in three-dimensional space, by their distances from three intersecting planes like the adjacent sides of a box. This approach is widely used to show relationships between quantities through graphs or diagrams. These coordinate systems, as they are known, along with any scales used for measuring, are assumed to be fixed; otherwise, the events or points they represent are unclear. The distances that pinpoint any point relative to a coordinate system are called its coordinates.
When such systems are used for physical purposes, they must be supplemented by clocks to enable the times at which events occur to be determined. The clocks must be synchronized, and must go at the same rate, but it must suffice here to state that this is possible without indicating how these conditions [170]can be attained. A system of axes with its clocks will hereinafter be called a Frame of Reference, and every observer will be supposed to be provided with such a frame partaking of his motion. All the objects which partake of an observer’s motion will be called his system.
When these systems are used for physical purposes, they need to be paired with clocks to determine when events happen. The clocks must be synchronized and run at the same speed, but it’s enough to say that this is possible without going into how these conditions [__A_TAG_PLACEHOLDER_0__]can be met. A system of axes along with its clocks will be referred to as a Frame of Reference, and each observer will be assumed to have such a frame that reflects their motion. All the objects that are moving with the observer will be called their system.
It is a question whether among all possible frames of reference any one frame or class of frames is more suited than another for the mathematical statement of physical laws. This is for experience to decide, and a Principle of Relativity is a statement embodying the answer.
It’s a question of whether any specific frame of reference or group of frames is better than others for mathematically expressing physical laws. This is something that experience will determine, and a Principle of Relativity is a statement that represents the answer.
The Mechanical Principle of Relativity
It has been ascertained that all such frames are equally suitable for the mathematical statement of general mechanical laws, provided that their motion is rectilinear and uniform and without rotation. This fact is comprehended in the general statement that all unaccelerated frames of reference are equivalent for the statement of the general laws of mechanics. This is the mechanical principle of relativity.
It has been determined that all such frames are equally suitable for expressing the mathematical statement of general mechanical laws, as long as their motion is straight, uniform, and without rotation. This idea is captured in the general statement that all unaccelerated frames of reference are equivalent for expressing the general laws of mechanics. This is the mechanical principle of relativity.
It is well recognized however that the laws of dynamics as hitherto stated involve the assumptions that the lengths of rigid bodies are unaffected by the motion of the frame of reference, and that measured times are likewise unaffected; that is to say that any length measured on his own system by either of two relatively moving observers appears the same to both observers, or that lengths of objects and rates of clocks do not alter whatever the motion relative to an observer. These assumptions [171]seem so obvious that it is scarcely perceived that they are assumptions at all. Yet this is the case, and as a matter of fact they are both untrue.
It is widely understood, however, that the laws of dynamics as previously stated depend on the assumptions that the lengths of rigid objects are not affected by the motion of the reference frame, and that measured times are likewise unaffected. In other words, any length measured on his own system by either of two observers in relative motion appears the same to both, or that the lengths of objects and the rates of clocks remain unchanged regardless of the motion in relation to an observer. These assumptions [__A_TAG_PLACEHOLDER_0__]seem so obvious that it’s hardly recognized that they are assumptions at all. Yet this is the case, and in fact, they are both incorrect.
The Special Theory of Relativity
Although all unaccelerated frames of reference are equivalent for the purposes of mechanical laws, this is not the case for physical laws generally as long as the above suppositions are adhered to. Electromagnetic laws do alter their form according to the motion of the frame of reference; that is to say, if these suppositions are true, electromagnetic agencies act in different ways according to the motion of the system in which they occur. There is nothing a priori impossible in this, but it does not agree with experiment. The motion of each locality on the earth is continually changing from hour to hour but no corresponding changes occur in electromagnetic actions. It has however been ascertained that on discarding these suppositions the difficulty disappears, and electromagnetic laws retain their form under all circumstances of unaccelerated motion. According to the theory of relativity, the correct view which replaces these suppositions is deducible from the following postulates:
Although all non-accelerated frames of reference are essentially equivalent for mechanical laws, the same doesn’t apply to physical laws in general as long as the aforementioned assumptions hold true. Electromagnetic laws do change their form based on the motion of the frame of reference; in other words, if these assumptions are valid, electromagnetic forces behave differently depending on the motion of the system in which they are found. There’s nothing a priori impossible about this, but it doesn’t align with experimental results. The motion of each place on Earth is constantly changing from hour to hour, yet no similar changes are observed in electromagnetic actions. However, it has been determined that if these assumptions are discarded, the issue disappears, and electromagnetic laws maintain their form under all conditions of non-accelerated motion. According to the theory of relativity, the accurate perspective that replaces these assumptions can be derived from the following postulates:
- (1) By no experiment conducted on his own system can an observer detect the unaccelerated motion of his system.
- (2) The measure of the velocity of light in vacuo is unaffected by relative motion between the observer and the source of light.
Both these postulates are well established by experiment. [172]The first may be illustrated by the familiar difficulty of determining whether a slowly moving train one happens to be sitting in, or an adjacent one, is in motion. The passenger has either to wait for bumps (that is, accelerations) or else he has to look out at some adjacent object which he knows to be fixed, such as a building (that is, he has to perform an experiment on something outside his system), before he can decide.
Both of these principles are well established by experiments. [__A_TAG_PLACEHOLDER_0__]The first can be demonstrated by the common challenge of figuring out whether the train you're sitting on is moving or if another one next to it is in motion. A passenger has to either wait for bumps (which are accelerations) or look out at some fixed object, like a building (meaning they have to conduct an experiment on something outside of their system), before they can make a decision.
The second postulate is an obvious consequence of the wave theory of light. Just as waves in water, once started by a ship, travel through the water with a velocity independent of the ship, so waves in space travel onward with a speed bearing no relation to that of the body which originated them. The statement however is based on experiment, and can be proved independently of any theory of light.
The second postulate is a clear result of the wave theory of light. Just like waves in water, which, once created by a ship, move through the water at a speed that has nothing to do with the ship's speed, waves in space move forward at a speed that is unrelated to the object that created them. However, this statement is based on experiments and can be proven without relying on any theory of light.
It is not difficult to deduce from these postulates certain remarkable conclusions relating to the systems of two observers, A and B, in relative motion, among them the following:
It’s not hard to figure out from these basic ideas some interesting conclusions about the systems of two observers, A and B, who are in relative motion, including the following:
- (1) Objects on B’s system appear to A to be shorter in the direction of relative motion than they appear to B.
- (2) This opinion is reciprocal. B thinks that A’s measurements on A’s system are too great.
- (3) Similarly for times: each observer thinks that the other’s clocks have a slower rate than his own, so that B’s durations of time appear shorter to B than to A, and conversely.
- (4) Events which appear simultaneous to A do not in general appear so to B, and conversely. [__A_TAG_PLACEHOLDER_0__]
- (5) Lengths at right angles to the direction of motion are unaffected.
- (6) These effects vary with the ratio of the relative velocity to that of light. The greater the relative velocity, the greater the effects. They vanish if there is no relative velocity.
- (7) For ordinary velocities the effects are so small as to escape notice. The remarkable point however is their occurrence rather than their magnitude.
- (8) The observers similarly form different estimates of the velocities of bodies on each other’s systems. The velocity of light however appears the same to all observers.
Taking into account these revised views of lengths and times the mechanical principle of relativity may be extended to physical laws generally as follows: All unaccelerated frames of reference are equivalent for the statement of the general laws of physics. In this form the statement is called the Special, or Restricted, Principle of Relativity, because it is restricted to unaccelerated frames of reference. Naturally the laws of classical mechanics now require some modification, since the suppositions of unalterable lengths and times no longer apply.
Considering these updated perspectives on lengths and times, the mechanical principle of relativity can be applied to physical laws in general like this: All unaccelerated frames of reference are equivalent for stating the general laws of physics. In this form, it's referred to as the Special, or Restricted, Principle of Relativity, because it only applies to unaccelerated frames of reference. Naturally, the laws of classical mechanics now need some adjustments, since the assumptions of fixed lengths and times no longer hold.
The 4D Continuum
Lengths and times therefore have not the absolute character formerly attributed to them. As they present themselves to us they are relations between the object and the observer which change as their motion relative to him changes. Time can no longer be regarded as something independent of position [174]and motion, and the question is what is the reality? The only possible answer is that objects must be regarded as existing in four dimensions, three of these being the ordinary ones of length, breadth and thickness, and the fourth, time. The term “space” is applicable only by analogy to such a region; it has been called a “continuum,” and the analogue of a point in ordinary three-dimensional space has been appropriately called an “event.” By “dimension” must be understood merely one of four independent quantities which locate an event in this continuum. In the nature of the case any clear mental picture of such a continuum is impossible; mankind does not possess the requisite faculties. In this respect the mathematician enjoys a great advantage. Not that he can picture the thing mentally any better than other people, but his symbols enable him to abstract the relevant properties from it and to express them in a form suitable for exact treatment without the necessity of picturing anything, or troubling whether or not the properties are those on which others rely for their conceptions.
Lengths and times no longer have the absolute quality that was once attributed to them. As we perceive them, they are relationships between the object and the observer, which change as the object's motion relative to the observer changes. Time cannot be seen as something independent of position [__A_TAG_PLACEHOLDER_0__] and motion, leading to the question of what is real. The only answer is that objects should be considered to exist in four dimensions: the three usual ones of length, width, and height, plus time. The term “space” only applies by analogy to this region; it has been referred to as a “continuum,” and the equivalent of a point in typical three-dimensional space is aptly called an “event.” By “dimension,” we mean one of four independent quantities that pinpoint an event in this continuum. It’s inherently impossible for anyone to have a clear mental image of such a continuum; humanity lacks the necessary capabilities. In this regard, mathematicians have a significant advantage. It's not that they can visualize it any better than others, but their symbols allow them to isolate the relevant properties and express them in a format suitable for precise analysis without needing to visualize anything or worry about whether the properties align with what others rely on for their understanding.
Gravity and Acceleration
The limitation of statements of general law to uniformly moving systems is hardly satisfactory. The very concept of general law is opposed to the notion of limitation. But the difficulties of formulating a law so that the statement of it shall hold good for all observers, whose systems may be moving with different and possibly variable accelerations, are very great. Accelerations imply forces which might be expected to upset the formulation [175]of any general dynamical principles, and besides, the behavior of measuring rods and clocks would be so erratic as to render unmeaning such terms as rigidity and measured time, and therefore to preclude the use of rigid scales, or of a rigid frame of reference which is the basis of the foregoing investigation.
Limiting general laws to systems that are moving uniformly is not really ideal. The idea of a general law contradicts the idea of limits. However, it's very challenging to create a law that is valid for all observers whose systems might be moving at different and possibly changing accelerations. Accelerations involve forces that could disrupt the formulation of any general dynamic principles. Plus, the behavior of measuring rods and clocks would be so unpredictable that terms like rigidity and measured time would become meaningless, making it impossible to use rigid scales or a stable frame of reference, which is the foundation of the previous investigation. [__A_TAG_PLACEHOLDER_0__]
The following example taken from Einstein will make this clear, and also indicate a way out of the difficulty. A rotating system is chosen, but since rotation is only a particular case of acceleration it will serve as an example of the method of treating accelerated systems generally. Moreover, as it will be seen, the attribution of acceleration to the system is simply a piece of scaffolding which can be discarded when the general theory has been further developed.
The following example from Einstein clarifies this point and suggests a solution to the issue. A rotating system is selected, but since rotation is just a specific example of acceleration, it will demonstrate how to handle accelerated systems in general. Furthermore, as you'll see, assigning acceleration to the system is merely a temporary tool that can be removed as the general theory evolves.
Let us note the experiences of an observer on a rotating disk which is isolated so that the observer has no direct means of perceiving the rotation. He will therefore refer all the occurrences on the disk to a frame of reference fixed with respect to it, and partaking of its motion.
Let’s consider the experiences of someone on a spinning disk that is isolated, meaning they can’t directly sense the rotation. This person will thus relate everything happening on the disk to a frame of reference that is stationary in relation to it and moves along with it.
He will notice as he walks about on the disk that he himself and all the objects on it, whatever their constitution or state, are acted upon by a force directed away from a certain point upon it and increasing with the distance from that point. This point is actually the center of rotation, though the observer does not recognize it as such. The space on the disk in fact presents the characteristic properties of a gravitational field. The force differs from gravity as we know it by the fact that it is directed [176]away from instead of toward a center, and it obeys a different law of distance; but this does not affect the characteristic properties that it acts on all bodies alike, and cannot be screened from one body by the interposition of another. An observer aware of the rotation of the disk would say that the force was centrifugal force; that is, the force due to inertia which a body always exerts when it is accelerated.
He will notice as he walks around on the disk that he and all the objects on it, regardless of their makeup or condition, are influenced by a force that pushes away from a certain point on it and increases with distance from that point. This point is actually the center of rotation, although the observer doesn’t recognize it as such. The space on the disk actually exhibits the typical properties of a gravitational field. The force differs from gravity as we understand it in that it is directed [__A_TAG_PLACEHOLDER_0__]away from rather than toward a center, and it follows a different law of distance; but this doesn’t change the characteristic properties that it affects all bodies the same way and cannot be blocked from one body by placing another in between. An observer aware of the rotation of the disk would say that the force was centrifugal force; that is, the force due to inertia that a body always exerts when it is accelerated.
Next suppose the observer to stand at the point of the disk where he feels no force, and to watch someone else comparing, by repeated applications of a small measuring rod, the circumference of a circle having its center at that point, with its diameter. The measuring rod when laid along the circumference is moving lengthwise relatively to the observer, and is therefore subject to contraction by his reckoning. When laid radially to measure the diameter this contraction does not occur. The rod will therefore require a greater proportional number of applications to the circumference than to the diameter, and the number representing the ratio of the circumference of the circle to the diameter thus measured will therefore be greater than 3.14159+, which is its normal value. Moreover the relative velocity decreases as the center is approached, so that the contraction of the measuring rod is less when applied to a smaller circle; and the ratio of the circumference to the diameter, while still greater than the normal, will be nearer to it than before, and the smaller the circle the less the difference from the normal. For circles whose centers are not at the point of zero force the confusion is still greater, since the velocities relative to the observer of points [177]on them now change from point to point. The whole scheme of geometry as we know it is thus disorganized. Rigidity becomes an unmeaning term since the standards by which alone rigidity can be tested are themselves subject to alteration. These facts are expressed by the statement that the observer’s measured space is non-Euclidean; that is to say, in the region under consideration measurements do not conform to the system of Euclid.
Next, imagine the observer standing at the point on the disk where they feel no force, watching someone else repeatedly use a small measuring rod to compare the circumference of a circle centered at that point with its diameter. When the measuring rod is laid along the circumference, it moves lengthwise relative to the observer, so it appears to contract. However, when measuring the diameter from the center, this contraction doesn’t happen. As a result, the rod will need more applications along the circumference than along the diameter, meaning the ratio of the circumference to the diameter measured will be greater than 3.14159+, which is the standard value. Additionally, the relative velocity decreases as the observer gets closer to the center, so the contraction of the measuring rod is less for a smaller circle; thus, the ratio of circumference to diameter, while still greater than normal, will be closer to it compared to larger circles. The smaller the circle, the smaller the difference from the standard. For circles whose centers aren’t at the zero-force point, the confusion is even greater, as the velocities of points on the circumference change from point to point relative to the observer. This disrupts the entire framework of geometry as we know it. Rigidity becomes a meaningless concept since the standards for testing rigidity are themselves subject to change. These facts are summarized by stating that the observer’s measured space is non-Euclidean; that is, in this area, measurements do not align with Euclid’s system.
The same confusion arises in regard to clocks. No two clocks will in general go at the same rate, and the same clock will alter its rate when moved about.
The same confusion happens with clocks. Generally, no two clocks run at the same speed, and the same clock will change its speed when it’s moved around.
The General Principle of Relativity
The region therefore requires a space-time geometry of its own, and be it noted that with this special geometry is associated a definite gravitational field, and if the gravitational field ceases to exist, for example if the disk were brought to rest, all the irregularities of measurement disappear, and the geometry of the region becomes Euclidean. This particular case illustrates the following propositions which form the basis of this part of the theory of relativity:
The region needs its own space-time geometry, and it's important to note that this specific geometry is linked to a distinct gravitational field. If the gravitational field were to stop existing, like if the disk came to a stop, all measurement irregularities would vanish, and the geometry of the region would become Euclidean. This particular case demonstrates the following propositions that are foundational to this section of the theory of relativity:
- (1) Associated with every gravitational field is a system of geometry, that is, a structure of measured space peculiar to that field.
- (2) Inertial mass and gravitational mass are one and the same.
- (3) Since in such regions ordinary methods of measurement fail, owing to the indefiniteness [__A_TAG_PLACEHOLDER_0__]of the standards, the systems of geometry must be independent of any particular measurements.
- (4) The geometry of space in which no gravitational field exists in Euclidean.1
The connection between a gravitational field and its appropriate geometry suggested by a case in which acceleration was their common cause is thus assumed to exist from whatever cause the gravitational field arises. This of course is pure hypothesis, to be tested by experimental trial of the results derived therefrom.
The relationship between a gravitational field and its corresponding geometry, indicated by a situation where acceleration was their shared cause, is assumed to exist regardless of what creates the gravitational field. This is, of course, purely a hypothesis that needs to be tested through experimental trials of the results derived from it.
Gravitational fields arise in the presence of matter. Matter is therefore presumed to be accompanied by a special geometry, as though it imparted some peculiar kink or twist to space which renders the methods of Euclid inapplicable, or rather we should say that the geometry of Euclid is the particular form which the more general geometry assumes when matter is either absent or so remote as to have no influence. The dropping of the notion of acceleration is after all not a very violent change in point of view, since under any circumstances the observer is supposed to be unaware of the acceleration. All that he is aware of is that a gravitational field and his geometry coexist.
Gravitational fields appear when there's matter present. Therefore, matter is thought to come with a unique geometry, as if it creates some kind of twist or distortion in space that makes Euclidean methods unusable. To be more precise, we could say that Euclidean geometry is the specific form that a more general geometry takes when matter is absent or too far away to have any effect. Letting go of the concept of acceleration isn't a drastic shift in perspective, since in any situation, the observer is assumed to be unaware of the acceleration. What they do notice is that a gravitational field exists alongside their geometry.
The prospect of constructing a system of geometry which does not depend upon measurement may not at first sight seem hopeful. Nevertheless this has been done. The system consists in defining [179]points not by their distances from lines or planes (for this would involve measurement) but by assigning to them arbitrary numbers which serve as labels bearing no relation to measured distances, very much as a house is located in a town by its number and street. If this labeling be done systematically, regard being had to the condition that the label-numbers of points which are close together should differ from one another by infinitesimal amounts only, it has been found that a system of geometry can actually be worked out. Perhaps this will appear less artificial when the fact is called to mind that even when standards of length are available no more can be done to render lengths of objects amenable to calculation than to assign numbers to them, and this is precisely what is done in the present case. This system of labeling goes by the name of “Gaussian coordinates” after the mathematician Gauss who proposed it.
The idea of creating a system of geometry that doesn't rely on measurement might not seem promising at first. However, it has been accomplished. The system involves defining [__A_TAG_PLACEHOLDER_0__] points not by their distances from lines or planes (since that would require measurement) but by giving them arbitrary numbers that act as labels with no connection to measured distances, similar to how a house is identified in a town by its number and street. If this labeling is done in a systematic way, ensuring that the label numbers of points that are close together differ by only infinitesimal amounts, it turns out that a functional system of geometry can be developed. This might seem less contrived when you consider that even when length standards are available, the only way to make the lengths of objects suitable for calculation is to assign numbers to them, which is exactly what is happening here. This labeling system is known as "Gaussian coordinates," named after the mathematician Gauss who first proposed it.
It is in terms of Gaussian coordinates that physical laws must be formulated if they are to have their widest generality, and the general principle of relativity is that all Gaussian systems are equivalent for the statement of general physical laws. For this purpose the labeling process is applied not to ordinary space but to the four dimensional space-time continuum. The concept is somewhat difficult and it may easily be aggravated into impossibility by anyone who thinks that he is expected to visualize it. Fortunately this is not necessary; it is merely one of these irrelevancies to which those who are unaccustomed to think in symbols are liable.
It is through Gaussian coordinates that physical laws should be expressed if they are to be as broadly applicable as possible, and the general principle of relativity states that all Gaussian systems are equivalent when it comes to stating general physical laws. For this reason, the labeling process is applied not to ordinary space but to the four-dimensional space-time continuum. The concept is a bit tricky, and it can easily turn into a confusion for anyone who thinks they need to visualize it. Luckily, this isn't necessary; it's just one of those distractions that can affect those who aren't used to thinking in symbols.
It will now be seen that among physical laws the [180]law of gravitation stands pre-eminent, for it is gravitating matter which determines the geometry, and the geometry determines the form of every other law. The connection between the geometry and gravitation is the law of gravitation. This law has been worked out, with the result that Newton’s law of the inverse square is found to be approximate only, but so closely approximate as to account for nearly all the motions of the heavenly bodies within the limits of observation. It has already been seen that departure from the Euclidean system is intensified by rapidity of motion, and the movements of these bodies are usually too slow for this departure to be manifest. In the case of the planet Mercury the motion is sufficiently rapid, and an irregularity in its motion which long puzzled astronomers has been explained by the more general law.
It can now be observed that among physical laws, the [__A_TAG_PLACEHOLDER_0__]law of gravitation is the most important, as it is gravitating matter that shapes the geometry, and the geometry influences the form of all other laws. The link between geometry and gravitation is defined by the law of gravitation. This law has been developed to show that Newton’s law of the inverse square is only an approximation, but it is so close to accurate that it explains almost all the movements of celestial bodies within observable limits. It has already been established that deviations from the Euclidean system become more pronounced with rapid motion, but the movements of these bodies are typically too slow for this deviation to be noticeable. However, for the planet Mercury, the motion is fast enough, and an irregularity in its movement that previously confounded astronomers has been clarified by the more general law.
Another deduction is that light is subject to gravitation. This has given rise to two predictions, one of which has been verified. The verification of the other is as yet uncertain, though the extreme difficulty of the necessary observations may account for this.
Another deduction is that light is affected by gravity. This has led to two predictions, one of which has been confirmed. The confirmation of the other is still uncertain, although the extreme difficulty of the required observations might explain this.
Since light is subject to gravitation it follows that the constancy of the velocity of light assumed in the earlier part of this paper does not obtain in a gravitational field. There is really no inconsistency. The velocity of light is constant in the absence of gravitation, a condition which unaccelerated motion implies. The special principle of relativity is therefore a limiting case of the general principle. [181]
Since light is affected by gravity, the constant speed of light we talked about earlier in this paper doesn't hold true in a gravitational field. There's no contradiction here. The speed of light remains constant when there's no gravity, which is what unaccelerated motion suggests. Therefore, the special principle of relativity is just a specific case of the broader principle. [__A_TAG_PLACEHOLDER_0__]
VIII
THE NEW CONCEPTS OF TIME AND SPACE
The Essay in Behalf of Which the Greatest Number of Dissenting Opinions Have Been Recorded
The Essay That Has the Most Dissenting Opinions Logged
We have all had experiences, on trains and boats, illustrating our inability to tell, without looking off to some external body, whether we are at rest or moving uniformly; and when we do so look, to tell, without reference to the ground or some other point external to both systems, whether ours or the other be the seat of motion. Uniform motion must be relative, because we find nowhere in the universe a body in the unique state of absolute rest from which alone absolute motion might be measured.
We’ve all had moments on trains and boats that show how hard it is to tell, without looking at something outside, whether we’re stationary or moving at a constant speed. And when we do look, it can be tricky to determine, without comparing to the ground or another external point, which of the two is actually moving. Uniform motion has to be relative, because we can't find any object in the universe that is in a unique state of absolute rest from which we could measure absolute motion.
True, the wave theory of light with its homogeneous space-filling ether seemed to provide a reference standard for the concept of absolute motion, and for its measurement by experiment with light rays. But when Michelson and Morley looked for this absolute motion they found no trace of it. To the physicist, observational student of the external world, nothing exists save observationally; what he can never observe is not there. So: I. By no [182]means whatever may we regard uniform straight-line motion as other than relative.
True, the wave theory of light, with its layer of uniform ether, seemed to set a standard for the idea of absolute motion and for measuring it through experiments with light rays. But when Michelson and Morley searched for this absolute motion, they found no evidence of it. For the physicist, an observational student of the external world, nothing exists except for what can be observed; anything that can’t be observed isn’t there. So: I. By no [__A_TAG_PLACEHOLDER_0__]means can we consider uniform straight-line motion as anything other than relative.
As a further direct consequence of the Michelson-Morley experiment we have: II. Light in a vacuum presents the same velocity, miles per second, to all observers whatever their velocity of relative motion. In addition to being experimentally established, this is necessary to support I,
for if light will distinguish between our velocities, its medium is necessarily a
universal standard for absolute motion. But it is contrary to common sense to suppose
that if I pass you at 100 miles per hour, the same light impulse can pass us both
at the same speed, C. We feel, instinctively, that space and time are not so constituted as to make this
possible. But the fact has been repeatedly demonstrated. And when common sense and
fundamental concepts clash with facts, it is not the facts that must yield. We have
survived such crises, notably one where we had to change the fundamental concept of
up-and-down; if another one is here, says Einstein, let us meet it.
As a further direct result of the Michelson-Morley Experiment experiment, we have: II. Light in a vacuum travels at the same speed, miles per second, for all observers, regardless of their relative motion. In addition to being proven through experiments, this is necessary to support I, because if light can differentiate between our speeds, its medium must serve as a universal standard for absolute motion. However, it goes against common sense to think that if I pass you at 100 miles per hour, the same light impulse can move past us both at the same speed, C. We instinctively feel that space and time aren’t structured in a way that makes this possible. But this fact has been repeatedly demonstrated. When common sense and fundamental concepts clash with facts, it’s the facts that must prevail. We have faced such crises before, particularly when we had to rethink the basic concept of up-and-down; if another challenge is here, says Einstein, let’s address it.
This the Special Theory of Relativity does. It accepts Postulates I and II above; their consequences it deduces and interprets. For extensive demonstration of these I lack space, and this has been satisfactorily done by others so it is not my chief duty; but clearly they will be startling. For the very ray of light which refuses to recognize our relative motion is the medium through which I must observe your system and you mine.
This is what the Special Theory of Relativity does. It accepts Postulates I and II above; it deduces and explains their consequences. I don’t have enough space for a detailed demonstration of these, and others have already done that satisfactorily, so it’s not my main focus; but it’s clear that they will be surprising. The very beam of light that doesn’t acknowledge our relative motion is the medium through which I have to observe your system and you have to observe mine.
It turns out that I get different values for lengths and time intervals in your system than you get, and [183]vice versa. And we are both right! For me to accept your “correction” were for me to admit that you are at absolute rest and I in absolute motion, that your measure of light velocity is right and mine wrong: admissions barred by the postulates. We have nothing to correct; we can only recognize the reason for the discrepancy; and knowing our relative velocity, each can calculate from his own results what the other’s will be. We find, of course, that at ordinary velocities the discrepancy is many times too small for detection; but at relative velocities at all comparable with that of light it rises above the observational horizon.
It turns out that I get different values for lengths and time intervals in your system than you do, and [__A_TAG_PLACEHOLDER_0__] vice versa. And we are both right! For me to accept your “correction” would mean admitting that you are at absolute rest and I am in absolute motion, that your measure of the speed of light is correct and mine is wrong: admissions that go against the principles. We have nothing to correct; we can only understand the reason for the discrepancy; and by knowing our relative speed, each of us can calculate from our own results what the other's will be. We find, of course, that at normal speeds the discrepancy is way too small to detect; but at relative speeds that are anywhere near the speed of light, it becomes noticeable.
To inquire the “true” length is meaningless. Chicago is east of Denver, west of Pittsburgh, south of Milwaukee; we do not consider this contradictory, or demand the “true” direction of Chicago. Einstein finds that the concept of length, between points in space or events in time, does not as we had supposed represent an intrinsic property of the points or the events. Like direction, it is merely a relation between these and the observer—a relation whose value changes with the observer’s velocity relative to the object. If our ideas of the part played in the world by time and space do not permit us to believe this, we must alter these ideas. Let us see how we may do this.
To ask about the “true” length is pointless. Chicago is east of Denver, west of Pittsburgh, and south of Milwaukee; we don’t see this as contradictory or demand the “true” direction of Chicago. Einstein shows that the idea of length, whether between points in space or events in time, doesn’t actually reflect an inherent quality of the points or events themselves. Like direction, it’s simply a relationship between them and the observer—a relationship that changes based on the observer’s speed relative to the object. If our understanding of the roles of time and space in the world doesn’t allow us to accept this, we need to change our understanding. Let’s explore how we can do that.
A Universe of Points
To deal with points in a plane the mathematician draws two perpendicular lines, and
locates any point, as P, by measuring its distances, X and Y, [184]from these “coordinate axes.” The directions of his axes acquire for him a peculiar
significance, standing out above other directions; he is apt to measure the distances
and
between the points P and Q in these directions, instead of measuring the single distance PQ. We do the same thing when we say that the railroad station is five blocks north
and two east.
To work with points in a plane, a mathematician draws two perpendicular lines and identifies any point, like P, by measuring its distances, X and Y, [__A_TAG_PLACEHOLDER_0__] from these "coordinate axes." The directions of his axes take on a special meaning for him, standing out from other directions; he tends to measure the distances and
between the points P and Q in these directions, rather than measuring the single distance PQ. We do the same when we say that the train station is five blocks north and two blocks east.

The mathematician visualizes himself as an observer, located on his coordinate framework.
For another observer on another framework, the horizontal and vertical distances and
between P and Q are different. But for both, the [185]distance from P direct to Q is the same. In each case the right triangle tells us that:
The mathematician sees himself as an observer in his own coordinate system. For another observer in a different system, the horizontal and vertical distances and
between P and Q are different. However, for both observers, the [__A_TAG_PLACEHOLDER_0__]distance from P to Q is the same. In each case, the right triangle shows us that:
Imagine an observer so dominated by his coordinate system that he knows no way of relating P with Q save by their horizontal and vertical separation. His whole scheme of things would be shattered by the suggestion that other observers on other reference frames find different horizontal and vertical components. We have to show him the line PQ. We have to convince him that this length is the absolute property enjoyed by his pair of points; that horizontals and verticals are merely relations between the points and the observer, result of the observer’s having analyzed the distance PQ into two components; that different observers effect this decomposition differently; that this seems not to make sense to him only because of his erroneous concept of a fundamental difference between verticals and horizontals.
Imagine an observer so focused on his own coordinate system that he can't see any way to relate P to Q except through their horizontal and vertical distances. His entire understanding would be thrown off by the idea that other observers in different reference frames could see different horizontal and vertical components. We need to show him the line PQ. We have to convince him that this length is a fundamental property of his pair of points; that horizontal and vertical measurements are just relations between the points and the observer, resulting from how the observer breaks down the distance PQ into two components; that different observers decompose this distance in various ways; and that his confusion only arises from his mistaken belief in a fundamental difference between vertical and horizontal measurements.
The Four-Dimensional World of Events
We too have created a distinction in our minds corresponding to no sufficient reality. Our minds seize on time as inherently separable from space. We see the world made up of things in a continuum of three space dimensions; to make this dead world [186]live there runs through it a one-dimensional time continuum, imposed from without, unrelated.
We have also created a distinction in our minds that doesn't truly reflect reality. Our minds perceive time as something separate from space. We view the world as consisting of objects in three dimensions of space; to give this lifeless world [__A_TAG_PLACEHOLDER_0__]some sense of life, a one-dimensional timeline runs through it, imposed from the outside and disconnected.
But did you ever observe anything suggesting the presence of time in the absence of space, or vice versa? No; these vessels of the universe always occur together. Association of the space dimensions into a manifold from which time is excluded is purely a phenomenon of the mind. The space continuum cannot begin to exist until the time dimension is supplied, nor can time exist without a place to exist in.
But have you ever noticed anything that hints at time existing without space, or the other way around? No; these elements of the universe always come together. The idea of space existing without time is just a mental construct. The space continuum can't exist unless there's a dimension of time, and time can't exist without somewhere for it to be.
The external world that we observe is composed, not of points, but of events. If a point lacks position in time it does not exist; give it this position and it becomes an event. This world of events is four-dimensional—which means nothing more terrifying than that you must make four measures to locate an event. It does not mean, at all, that you must visualize four mutually perpendicular lines in your accustomed three-space or in a four-space analogous to it. If this world of four dimensions seems to lack reality you will be able to exhibit no better reality for your old ideas. Time belongs, without question; and not as an afterthought, but as part of the world of events.
The external world we observe isn’t made up of points but of events. If a point doesn’t have a position in time, it doesn’t exist; give it that position, and it becomes an event. This world of events is four-dimensional—which just means you need to take four measurements to locate an event. It doesn’t mean you have to picture four lines that are perpendicular to each other in the three-dimensional space you’re used to or in a similar four-dimensional space. If this four-dimensional world feels less real to you, there's no better reality to be found for your old ideas. Time definitely belongs here, not as an afterthought but as part of the world of events.
To locate an event we use four measures: X, Y and Z for space, T for time. Using the same reference frame for time and space, we locate a second event
by the measures x, y, z, t. Minkowski showed that the quantity
is the same for all observers, no matter how different [187]their x’s, y’s, z’s and t’s; just as in the plane the quantity
To find an event, we use four measurements: X, Y, and Z for space, and T for time. By applying the same reference frame for time and space, we identify a second event using the measurements x, y, z, and t. Minkowski demonstrated that the quantity
is the same for all observers, regardless of how different [__A_TAG_PLACEHOLDER_0__] their x’s, y’s, z’s, and t’s are; similar to how in the plane the quantity
is the same for all observers, no matter how different their x’s and y’s.
is the same for all observers, regardless of how different their x’s and y’s are.
Such a quantity, having the same value for all observers, is absolute. In the plane it represents the true, absolute distance between the points—their intrinsic property. In dealing with events it represents the true, absolute “interval,” in time and space together between the events. It is not space, nor time, but a combination of the two. We have always broken it down into separate space and time components. In this we are as naive as the plane observer who could not visualize the distance PQ until it was split into separate horizontals and verticals. He understood with difficulty that another observer, employing a different reference frame because in different position, would make the decomposition differently. We understand with difficulty that another observer, employing a different reference frame because in uniform motion relative to us, will decompose the “interval” between events into time and space components different from ours. Time and space are relative to the observer; only the interval representing space-time is absolute. So common sense stands reconciled to the Special Theory of Relativity.
Such a quantity, which has the same value for all observers, is absolute. In the plane, it represents the true, absolute distance between the points—their intrinsic property. When dealing with events, it represents the true, absolute “interval,” in time and space together between those events. It's not just space or just time; it's a combination of both. We've always broken it down into separate space and time components. In this way, we’re as naïve as the plane observer who couldn't visualize the distance PQ until it was divided into separate horizontal and vertical components. He had a hard time understanding that another observer, using a different reference frame because they were in a different position, would break it down differently. We also find it difficult to grasp that another observer, moving uniformly relative to us, will decompose the “interval” between events into time and space components that differ from ours. Time and space are relative to the observer; only the interval that represents space-time is absolute. Therefore, common sense aligns with the Special Theory of Relativity.
Steps to Broader Understanding
Is then our laboriously acquired geometry of points in a three-dimensional space to go into the [188]discard? By no means. Jeans, investigating the equilibrium of gaseous masses, found the general case too difficult for direct attack. So he considered the case where the masses involved are homogeneous and incompressible. This never occurs; but it throws such light on the general case as to point the way toward attack on it.
Is our hard-earned understanding of points in three-dimensional space really going to be discarded? Absolutely not. Jeans, while studying the balance of gas masses, found the general case too challenging to tackle directly. Instead, he examined a scenario where the masses are uniform and incompressible. This situation never actually happens, but it provides significant insights into the general case and helps guide the approach to tackling it.
Euclidean geometry excludes motion, save that engineered by the observer; and then the time is immaterial. Time does not enter at all; the three space dimensions suffice. This simple case never occurs where matter exists; but its conclusions are of value in dealing with more general cases.
Euclidean geometry doesn't include motion, except for that caused by the observer; and in that case, time doesn't matter. Time isn't considered at all; the three dimensions of space are enough. This straightforward situation never happens when matter is present, but its conclusions are useful for addressing more general cases.
When we look into a world alleged to be that of Euclid and find motion, we may retain the Euclidean concept of what constitutes the world and invent a machinery to account for the motion; or we may abandon the Euclidean world, as inadequate, in favor of a more general one. We have adopted the second alternative.
When we examine a world that's supposedly like Euclid's and discover motion, we can either stick to the Euclidean idea of what the world is and create a system to explain the motion, or we can give up on the Euclidean world as insufficient and choose a more comprehensive one. We have chosen the second option.
Newton’s laws tells us that a body free to move will do so, proceeding in a straight line at uniform velocity until interfered with. We do not ask, nor does the theory tell us, whence comes the initial motion. There is no machinery to produce it; it is an inherent property of Newton’s world—assured by the superposition of the time continuum upon Euclid’s world to make Newton’s, accepted without question along with that world itself.
Newton’s laws tell us that a body free to move will do so, moving in a straight line at a constant speed until something disrupts it. We don’t ask, nor does the theory explain, where the initial motion comes from. There isn’t any machinery to create it; it’s an inherent property of Newton’s world—confirmed by layering the time continuum onto Euclid’s world to create Newton’s, accepted without question along with that world itself.
But Newton saw that his world of uniform motion, like Euclid’s, was never realized. In the neighborhood of one particle a second is interfered with, forced to give up its uniform motion and acquire a [189]constant acceleration. This Newton explained by employing the first of the alternatives mentioned above. He tells us that in connection with all matter there exists a force which acts on other matter in a certain way. He does not display the actual machinery through which this “force” works, because he could not discover any machinery; he had to stop with his brilliant generalization of the observed facts. And all his successors have failed to detect the slightest trace of a machinery of gravitation.
But Newton realized that his idea of a world with uniform motion, similar to Euclid's, was never fully achieved. In the vicinity of one particle, it is disrupted, forced to abandon its uniform motion and adopt a [__A_TAG_PLACEHOLDER_0__]constant acceleration. Newton explained this by using the first alternative mentioned earlier. He tells us that there is a force related to all matter that acts on other matter in a specific way. He didn't show the actual mechanism through which this "force" operates because he couldn’t find any mechanism; he had to settle for his brilliant generalization of the observed facts. And all his successors have failed to find even the slightest hint of a mechanism for gravitation.
Einstein asks whether this is not because the machinery is absent—because gravitation, like position in Euclid’s world and motion in Newton’s, is a fundamental property of the world in which it occurs. His point of attack here lay in precise formulation of certain familiar facts that had never been adequately appreciated. These facts indicate that even accelerated motion is relative, in spite of its apparently real and absolute effects.
Einstein questions whether this is due to the lack of machinery—because gravity, like location in Euclid’s world and movement in Newton’s, is a basic characteristic of the universe it exists in. His main argument focused on clearly defining certain well-known facts that had never been fully recognized. These facts show that even accelerated motion is relative, despite its seemingly real and absolute effects.
Gravity and Acceleration
An observer in a closed compartment, moving with constant acceleration through empty space, finds that the “bottom” of his cage catches up with objects that he releases; that it presses on his feet to give him the sensation of weight, etc. It displays all the effects that he would expect if it were at rest in a gravitational field. On the other hand, if it were falling freely under gravitational influence, its occupant would sense no weight, objects released would not leave his hand, the reaction from his every [190]motion would change his every position in his cage, and he could equally well assume himself at rest in a region of space free from gravitational action. Accelerated motion may always be interpreted, by the observer on the system, as ordinary force effects on his moving system, or as gravitational effects on his system at rest.
An observer in a sealed compartment, moving with constant acceleration through empty space, notices that the “floor” of his cage catches up with objects he releases; it presses on his feet, making him feel weight, and so on. It shows all the effects he would expect if it were stationary in a gravitational field. On the other hand, if it were falling freely due to gravity, the person inside wouldn’t feel any weight, released objects wouldn’t drift away from his hand, the reaction from any movement would change his position in the cage, and he could just as easily believe he was at rest in a space free from gravitational influence. Accelerated motion can be viewed, by the observer in the system, as ordinary force effects on his moving system or as gravitational effects on his system at rest.
An alternative statement of the Special Theory is that the observed phenomena of uniform motion may equally be accounted for by supposing the object in motion and the observer with his reference frame at rest, or vice versa. We may similarly state the General Theory: The observed phenomena of uniformly accelerated motion may in every case be explained on a basis of stationary observer and accelerated objective, or of stationary objective with the observer and his reference system in accelerated motion. Gravitation is one of these phenomena. It follows that if the observer enjoy properly accelerated axes (in time-space, of course), the absolute character of the world about him must be such as to present to him the phenomenon of gravitation. It remains only to identify the sort of world, of which gravitation as it is observed would be a fundamental characteristic.
An alternative way to express the Special Theory is that the observed effects of uniform motion can be explained by assuming that either the moving object and the observer with their reference frame are at rest, or the other way around. We can state the General Theory in a similar way: The observed effects of uniformly accelerated motion can always be explained based on a stationary observer and an accelerated object, or a stationary object with the observer and their reference frame in accelerated motion. Gravity is one of these effects. This means that if the observer has properly accelerated axes (in space-time, of course), the absolute nature of the world around them must present the phenomenon of gravity. We just need to identify what kind of world would make gravity as it is observed a fundamental characteristic.
Euclid’s and Newton’s systems stand as first and second approximations to that world. The Special Relativity Theory constitutes a correction of Newton, presumably because it is a third approximation. We must seek in it those features which we may most hopefully carry along, into the still more general case.
Euclid’s and Newton’s systems are the first and second attempts to explain that world. The Special Relativity Theory corrects Newton, as it is likely a third attempt. We should look for the aspects in it that we can confidently carry into the even more general case.
Newton’s system retained the geometry of Euclid. [191]But Minkowski’s invariant expression tells us that Einstein has had to abandon this;
for in Euclidean geometry of four dimensions the invariant takes the form:
Newton’s system kept Euclid's geometry. [__A_TAG_PLACEHOLDER_0__] But Minkowski's invariant formula shows that Einstein had to let go of this; in four-dimensional Euclidean geometry, the invariant is expressed as:
analogous to that of two and three dimensions. It is not the presence of the constant C in Minkowski’s formula that counts; this is merely an adjustment so that we may measure space in miles and time in the unit that corresponds to a mile. It is the minus sign where Euclidean geometry demands a plus that makes Minkowski’s continuum non-Euclidean.
analogous to that of two and three dimensions. It's not the presence of the constant C in Minkowski’s formula that matters; this is simply an adjustment so that we can measure space in miles and time in a unit that matches a mile. It's the minus sign where Euclidean geometry requires a plus that makes Minkowski’s continuum non-Euclidean.
The editor has told us what this statement means. I think he has made it clear that when we speak of the geometry of the four-dimensional world, we must not read into this term the restrictions surrounding the kind of geometry we are best acquainted with—that of the three-dimensional Euclidean continuum. So I need only point out that if we are to make a fourth (and we hope, final) approximation to the reality, its geometry must preserve the generality attained by that of the third step, if it goes no further.
The editor has explained what this statement means. I think he has clarified that when we talk about the geometry of the four-dimensional world, we shouldn’t impose the limitations associated with the type of geometry we’re most familiar with—that of the three-dimensional Euclidean space. So, I just need to emphasize that if we are to create a fourth (and hopefully final) approximation of reality, its geometry has to maintain the generality achieved in the third step, even if it doesn't go beyond that.
Einstein's Space-Time Universe
Einstein accordingly examined the possible non-Euclidean geometries of four dimensions, in search of one displaying fundamental characteristics which, interpreted in terms of space-time, would lead to the observed facts of gravitation. The mathematics of this investigation is that part of his work which, we are told, but twelve men can follow; so we may only outline his conclusions. [192]
Einstein then looked into the possible non-Euclidean geometries of four dimensions, hoping to find one that showed essential features that, when understood in the context of space-time, would explain the observed facts about gravity. The mathematics involved in this study is said to be so complex that only twelve people can fully understand it, so we will just summarize his conclusions. [__A_TAG_PLACEHOLDER_0__]
If we assume that in the neighborhood of matter the world of space-time is non-Euclidean, and that its curvature or distortion or non-Euclideanism is of a certain type already known to mathematicians; that the curvature of this world in the neighborhood of matter increases with the mass, and decreases as the distance from the matter increases; and that every particle of matter that is not interfered with travels through space-time in the most direct path possible in that continuum; then the observed facts of gravitation are accounted for as an inherent geometric property of this space-time world. We usually say that the presence of matter distorts this world, and that this distortion gives the track of particles through the region affected its non-uniform character.
If we accept that around matter the space-time world is non-Euclidean, and that its curvature or distortion or non-Euclidean qualities are of a certain type already recognized by mathematicians; that the curvature of this world near matter increases with mass and decreases as you move away from the matter; and that every particle of matter that isn’t interfered with travels through space-time in the most direct path possible in that continuum; then the observed facts of gravitation can be explained as a fundamental geometric property of this space-time world. We typically say that the presence of matter distorts this world, and that this distortion causes the paths of particles in the affected area to be non-uniform.
Gravitation then is not a force at all; it is the fundamental nature of things. A body free to move through the world must follow some definite path. Euclid says it will stand still; Newton that it will traverse a straight line in three-space at uniform time-rate; Einstein that it will move in a “geodesic” through time-space—in every-day language, that it will fall.
Gravitation isn’t a force; it’s the basic nature of things. An object that can move in the world has to follow a specific path. Euclid says it will stay still; Newton says it will move in a straight line through three-dimensional space at a constant speed; Einstein says it will travel along a "geodesic" through time-space—in simple terms, it will fall.
The numerical consequences of Einstein’s theory are, within the limits of observation, the same as those of Newton’s for all bodies save one—Mercury. This planet shows a small deviation from the path predicted by Newton’s law; Einstein’s theory gives its motion exactly. Again, when modern research showed that light must be affected by gravitation, Einstein’s theory, because of the extreme velocity of light, deviates from Newton’s, where the [193]speed is less a determining factor; and observations of starlight deflected by the sun during the eclipse were in much better accord with Einstein’s theory than Newton’s. Moreover, the Special Theory predicts that mass is an observational variable like length and duration. Radioactive emanations have a velocity high enough to give appreciable results here, and the prediction is verified, tending to support the general theory by supporting its limiting case.
The numerical outcomes of Einstein’s theory are, within the limits of observation, the same as Newton’s for all bodies except one—Mercury. This planet displays a slight deviation from the path predicted by Newton’s law; Einstein’s theory accurately describes its motion. Additionally, when modern research revealed that light is influenced by gravity, Einstein’s theory, due to the extreme speed of light, diverges from Newton’s, where speed is a less significant factor. Observations of starlight bending around the sun during an eclipse aligned much more closely with Einstein’s theory than with Newton’s. Furthermore, the Special Theory suggests that mass is an observational variable like length and time. Radioactive emissions move quickly enough to produce noticeable results in this context, and the prediction is confirmed, which supports the general theory by reinforcing its limiting case.
We like always to unify our science; and seldom, after effecting a unification, are we forced to give it up. Einstein for the first time brings mechanical, electromagnetic and gravitational phenomena within one structure. This is one reason why physicists are so open minded toward his theory—they want it to be true.
We always strive to unify our science, and rarely, after achieving a unification, do we have to abandon it. Einstein is the first to integrate mechanical, electromagnetic, and gravitational phenomena into a single framework. This is one reason why physicists are so receptive to his theory—they really want it to be true.
The Regular Person's Final Doubt
The final answer to any series of questions is inevitably “because the world is so constructed.” The things we are content to leave on that basis are those to which we are accustomed, and which we therefore think we understand; those for which this explanation leaves us unsatisfied are those which are new and unfamiliar. Newton told us that the world of three-dimensional space with one-dimensional time superposed was so constructed that bodies left to themselves would go on forever in a straight line at constant speed. We think we understand this, but our understanding consists merely of the unspoken query, “Why, of course; what is there to prevent?” [194]The Greeks, an intelligent people, looked at this differently; they would have met Newton with the unanimous demand “Why so; what is there to keep them going?” So if, in seeking an explanation of anything, we come sooner than we had expected to the finality “Because the world is so constructed,” let us not feel that we have been cheated. [195]
The final answer to any series of questions is always, “because that's how the world is made.” The things we're okay with leaving at that are those we're used to and think we understand; those for which this explanation doesn't satisfy us are the ones that are new and unfamiliar. Newton told us that the three-dimensional world with one-dimensional time was built in such a way that objects left on their own would keep moving in a straight line at a constant speed forever. We think we get this, but our understanding really just consists of the unspoken question, “Of course; what would stop them?” [__A_TAG_PLACEHOLDER_0__] The Greeks, who were smart, saw this differently; they would have responded to Newton with the collective question, “Why is that; what keeps them going?” So if, in trying to explain something, we unexpectedly reach the conclusion, “Because that's how the world is made,” let's not feel like we've been shortchanged. [__A_TAG_PLACEHOLDER_0__]
IX
THE PRINCIPLE OF RELATIVITY
A Statement of What it is All About, in Ideas of One Syllable
A Straightforward Explanation of What It All Means, Using One-Syllable Words
The invariance of the laws of nature was one of the most popular themes of nineteenth century philosophy. For it was not till last century that general acceptance was accorded to the doctrine of the “Uniformity of Law,” adumbrated in ancient times by Epicurus and Lucretius. It is now a cardinal axiom of science that the same cause in the same conditions is always followed by the same effect. There exists in nature no indeterminate element; all things are governed by fixed laws, and the discovery of these laws is the main business of science.
The consistency of the laws of nature was one of the most popular themes in nineteenth-century philosophy. It wasn't until the last century that the idea of the “Uniformity of Law,” hinted at in ancient times by Epicurus and Lucretius, was widely accepted. It's now a fundamental principle of science that the same cause under the same conditions always leads to the same effect. There’s no random element in nature; everything is governed by established laws, and uncovering these laws is the primary purpose of science.
It is necessary to guard against reading into this statement an erroneous idea of the content of a “law of nature.” Such a law is of course not an enactment of any sort; and it is not even to be thought of as an actual explanation of the how and why of the phenomena with which it has to do. It really is nothing but an expression of our belief in the pronouncement of the preceding paragraph, that like conditions do produce like results. It is a prediction based on [196]past experience, and is of value merely in that past experience leads us to credit its accuracy. The composite essay beginning on page 19 discusses this question of the reality of natural laws, and should be consulted in connection with the present contribution.—Editor.
It’s important to be careful not to misunderstand this statement as suggesting something incorrect about what a “law of nature” is. This law isn’t a formal rule of any sort; it shouldn’t even be viewed as a true explanation for how or why certain phenomena happen. Essentially, it’s just a reflection of our belief in the idea mentioned in the previous paragraph, that similar conditions lead to similar outcomes. It’s a prediction based on [__A_TAG_PLACEHOLDER_0__] past experiences, and its value comes solely from the fact that our past experiences make us trust its accuracy. The composite essay starting on page 19 discusses the reality of natural laws and should be read alongside this contribution.—Editor.
This great philosophic principle was derived of course from the study of natural science; i.e., from observations and experiments conducted upon the earth. Their comprehensiveness is therefore limited by the fact that the observer is always in a state of rest, or nearly so, as compared with the earth. All observers upon the earth are moving through space at the same velocity; and it was possible to argue that the uniformity of law might only hold good, when experiments were conducted at this velocity. An observer moving at very different velocity might discover that the laws of nature under these new conditions were somewhat different.
This important philosophical principle, of course, came from studying natural science; i.e., from observations and experiments carried out on the earth. Their comprehensiveness is therefore limited by the fact that the observer is always at rest, or nearly so, compared to the earth. All observers on the earth are moving through space at the same speed, and it could be argued that the uniformity of law might only apply when experiments are conducted at this speed. An observer moving at a very different speed might find that the laws of nature under these new conditions are somewhat different.
Such a view could indeed never be very plausible, for motion is only a relative conception. Imagine a universe consisting of infinite “empty” space, in which there is poised a single material body. How shall we determine whether this body is at rest, or whether it is moving at high or low velocity through space? It is never getting nearer to anything or farther from anything, since there is no other body for it to get nearer to or farther from. If we say it is moving at a uniform velocity of a thousand miles a second, our statement really has no significance. We have no more reason for affirming that it is in motion than we have for affirming that it is at rest. In short, there is no such thing as [197]absolute motion; the conception of motion only arises when there are two or more bodies changing their position relatively to one another. This is what is meant by the relativity of motion. It seemed therefore improbable that the laws of nature would be different if the observer were moving at high velocity; for the movement of the observer is not an absolute quantity, but merely a statement of his relation to other bodies, and if there were no other bodies, the statement itself would be meaningless.
Such a view could never really be believable because motion is only a relative concept. Imagine a universe filled with infinite "empty" space, with just one material object in it. How do we decide if this object is at rest or moving at a high or low speed through space? It's never getting closer to anything or farther away from anything since there are no other objects nearby. If we say it's moving at a constant speed of a thousand miles per second, that statement has no real meaning. We have just as much reason to say it’s in motion as we do to say it’s at rest. In short, there is no such thing as [__A_TAG_PLACEHOLDER_0__]absolute motion; the idea of motion only comes into play when two or more objects are changing their positions relative to each other. This is what we mean by the relativity of motion. It therefore seems unlikely that the laws of nature would be different if the observer were moving at high speed; the observer's movement is not an absolute measure but just a statement of their relationship to other objects, and if there were no other objects, that statement would be meaningless.
The Way Light Works
Now among the established laws of nature is that which specifies the velocity of light moving through a vacuum. If the laws of nature are invariable, this velocity will always be the same. But consider what would happen under the following circumstances: Suppose that we are at rest, and that an observer on another body flies past us at 150,000 miles a second. Suppose that at the moment he passes, a piece of flint projecting from him grazes a piece of steel projecting from us, giving rise to a spark; and that we both thereupon set about to measure the velocity of the light so produced. After one second, we should find that the light had traveled about 186,000 miles away, and since during this second the other observer had traveled 150,000 miles, we should infer that the light traveling in his direction was only about 36,000 miles ahead of him. We should also infer that he would find this out by his experiment, and that he would estimate the velocity of light as only 36,000 miles a second in his [198]own direction, and 336,000 miles a second in the opposite direction. But if this is so, then that law of nature which specifies the velocity of light is quite different for him and for us: the laws of nature must be dependent upon the observer’s motion—a conclusion which appears incompatible with the idea of the relativity of motion.
Now, one of the fundamental laws of nature states the speed of light traveling through a vacuum. If these laws are constant, this speed will always be the same. But think about what would happen in this scenario: Imagine we are stationary, and an observer on another object zips past us at 150,000 miles per second. When he flies by, a piece of flint sticking out from him lightly strikes a piece of steel sticking out from us, creating a spark; then we both try to measure the speed of the light generated. After one second, we would find that the light had traveled about 186,000 miles away, and since during this second the other observer had moved 150,000 miles, we would conclude that the light traveling in his direction was only about 36,000 miles ahead of him. We would also conclude that he would discover this through his experiment and that he would measure the speed of light as just 36,000 miles per second in his [__A_TAG_PLACEHOLDER_0__]own direction, and 336,000 miles per second in the opposite direction. But if that’s the case, then the law of nature that defines the speed of light is quite different for him and for us: the laws of nature must depend on the observer’s motion—a conclusion that seems to contradict the concept of the relativity of motion.
And it so happens that it is also contradictory to experimental conclusions. Experiments undertaken to settle the point show that each observer finds the same velocity for the light of the spark; and after one second, each observer finds that the light has traveled 186,000 miles from himself. But how is it possible that when it has traveled 186,000 miles in the same direction as the other observer who himself has moved 150,000 miles meanwhile, he should still think it 186,000 miles ahead of him? That is the initial paradox; and since there has been no room for error in the experiments, we are forced to conclude that there was something wrong in the assumptions and preconceptions with which we started.
And it turns out that it contradicts experimental findings. Experiments done to clarify the issue show that every observer measures the same speed for the light from the spark; after one second, each observer determines that the light has traveled 186,000 miles from them. But how is it possible that when it has traveled 186,000 miles in the same direction as another observer who himself has moved 150,000 miles during that time, he still perceives it as 186,000 miles ahead of him? That’s the initial paradox; and since there’s been no room for error in the experiments, we have to conclude that something was wrong with the assumptions and beliefs we started with.
Space and Time
There can in fact be only one interpretation. If we each find that the light has moved the same number of miles in the same number of seconds, then we must be meaning something different when we speak of miles and seconds. We are speaking in different languages. Some subsidence has occurred in the foundations of our systems of measurement. We are each referring to one and the same objective fact; but since we describe it quite differently, and at [199]first sight incompatibly, some profound alteration must have occurred in our perceptions—all unsuspected by ourselves. It has been shown precisely what this alteration is. A body moving at high velocity must become flattened in the direction of its motion; all its measuring apparatus, when turned in that direction, is shortened, so that no hint of the flattening can be obtained from it. Furthermore, the standards of time are lengthened out, and clocks go slower. The extent of this alteration in standards of space and time is stated in the equations of the so-called Lorentz transformation.
There can actually be only one interpretation. If we each observe that light has traveled the same number of miles in the same number of seconds, then we must be meaning something different when we talk about miles and seconds. We're speaking different languages. Some changes have happened in the foundations of our measurement systems. We are each referring to the same objective fact; however, since we describe it quite differently, and at [__A_TAG_PLACEHOLDER_0__]first glance incompatibly, some significant shift must have occurred in our perceptions—all unbeknownst to us. It has been clearly demonstrated what this shift is. An object moving at high speed gets flattened in the direction of its movement; all its measuring instruments, when aligned in that direction, become shortened, so no indication of the flattening can be detected from them. Additionally, the standards of time are stretched out, and clocks run slower. The degree of this change in standards of space and time is explained in the equations of the so-called Lorentz transformation.
Objection might be urged to the above paragraph on the ground that the connection of the observer with the variability of measured lengths and times is not sufficiently indicated, and that this variability therefore might be taken as an intrinsic property of the observed body—which of course it is not.—Editor.
Some might object to the above paragraph, claiming that the connection between the observer and the variations in measured lengths and times isn’t fully clarified. This could lead to the misunderstanding that this variability is an intrinsic feature of the observed body—which it definitely is not.—Editor.
We are accustomed to describe space as being of three dimensions, and time as being of one dimension. As a matter of fact, both space and time are “ideas,” and not immediate sense-perceptions. We perceive matter; we then infer a universal continuum filled by it, which we call space. If we had no knowledge of matter, we should have no conception of space. Similarly in the case of time: we perceive one event following another, and we then invent a continuum which we call time, as an abstraction based on the sequence of events. We do not see space, and we do not see time. They are not real things, in the sense that matter is real, and that events are real. They are products of imagination: useful [200]enough in common life, but misleading when we try to look on the universe as a whole, free from the artificial divisions and landmarks which we introduce into it for practical convenience. Hence it is perhaps not so surprising after all that in certain highly transcendental investigations, these artificial divisions should cease to be a convenience, and become a hindrance.
We usually think of space as having three dimensions and time as having one. In reality, both space and time are “concepts,” not direct sensory experiences. We observe matter and then infer a universal continuum filled with it, which we call space. Without any knowledge of matter, we wouldn’t have a concept of space. The same goes for time: we observe one event happening after another, and then we create a continuum we refer to as time, which is an abstract idea based on the sequence of events. We don’t actually see space or time. They aren’t real things like matter and events are. They’re products of our imagination—helpful in everyday life, but confusing when we try to understand the universe as a whole, free from the artificial divisions and markers we create for our own convenience. So, it’s not too shocking that in some advanced investigations, these artificial divisions can stop being helpful and turn into obstacles.
Take for instance our conception of time. It differs from our conception of space in that it has only one dimension. In space, there is a right and left, an up and down, a before and after. But in time there is only before and after. Why should there be this limitation of the time-factor? Merely because that is the verdict of all our human experience. But is our human experience based on a sufficiently broad foundation to enable us to say that, under all conditions and in all parts of the universe, there can be only one time-direction? May not our belief in the uniformity of time be due to the uniformity of the motion of all observers on the earth? Such in fact is the postulate of relativity. We now believe that, at velocities very different from our own, the standard of time would also be different from ours. From our point of view, that different standard of time would not be confined to the single direction fore and aft, as we know it, but would also have in it an element of what we might call right and left. True, it would still be of only one dimension, but its direction would differ from the direction of our time. It would still run like a thread through the universe, but not in the direction which we call straight forward. It would have a [201]slant in it, and the angle of the slant depends upon the velocity of motion. It does not follow that because we are all traveling in the same direction down the stream of time, therefore that stream can only flow in the direction which we know. “Before” and “after” are expressions which, like right and left, depend upon our personal situation. If we were differently situated, if to be precise we were moving at very high velocity, we should, so to speak, be facing in a new direction and “before” and “after” would imply a different direction of progress from that with which we are now familiar.
Consider our understanding of time. It differs from how we think about space because it has only one dimension. In space, there’s a right and left, an up and down, a before and after. But in time, there’s just before and after. Why is there this limitation in how we view time? Simply because that’s what all our human experience tells us. But is our human experience broad enough to claim that, under all circumstances and throughout the universe, there can only be one direction of time? Could our belief in the consistency of time come from the similar movements of everyone on Earth? This is actually the premise of relativity. We now understand that, at speeds much different from ours, the concept of time would also differ. From our perspective, that different standard of time wouldn’t be limited to just the forward and backward we know, but would also include what we might think of as right and left. True, it would still be one-dimensional, but its direction would vary from the direction we experience as time. It would still stretch like a thread through the universe, but not in the way we consider straight ahead. It would have a [__A_TAG_PLACEHOLDER_0__]slant to it, and the angle of that slant would depend on the speed of motion. Just because we’re all traveling in the same direction down the stream of time doesn’t mean that stream can only flow in the direction we know. “Before” and “after” are terms that, like right and left, depend on where we stand. If we were in a different position, specifically if we were moving at very high speeds, we would, in a sense, be facing a new direction, and “before” and “after” would indicate a different path of progress than the one we’re used to.
The Real World
But, after all, the objective universe is the same old universe however fast we are moving about in it, and whatever way we are facing. These details merely determine the way we divide it up into space and time. The universe is not affected by any arbitrary lines which we draw through it for our personal convenience. For practical purposes, we ascribe to it four dimensions, three in space and one in time. Clearly if the time direction is altered, all dimensions both of space and time must have different readings. If, for instance, the time direction slopes away to the left, as compared with ours, then space measurements to right and left must be correspondingly altered. An analogy will simplify the matter.
But in the end, the objective universe remains the same old universe, no matter how fast we move through it or which direction we face. These details just determine how we break it down into space and time. The universe isn’t influenced by any random lines we draw for our own convenience. For practical reasons, we attribute four dimensions to it: three in space and one in time. Clearly, if the direction of time changes, all dimensions of both space and time will have different values. For example, if the time direction tilts to the left compared to ours, then measurements of space to the right and left will need to be adjusted accordingly. An analogy will make this clearer.
Suppose we desire to reach a point ten miles off in a roughly northeasterly direction. We might do so by walking six miles due east and then eight miles [202]due north. We should then be precisely ten miles from where we started. But suppose our compass were out of order, so that its north pole pointed somewhat to the west of north. Then in order to get to our destination, we might have to walk seven miles in the direction which we thought was east, and a little more than seven miles in the direction which we thought was north. We should then reach the same point as before. Both observers have walked according to their lights, first due east and then due north, and both have reached the same point: the one observer is certain that the finishing point is six miles east of the starting-point, while the other is sure it is seven miles.
Suppose we want to get to a spot ten miles away in a roughly northeast direction. We could do this by walking six miles straight east and then eight miles straight north. We would then be exactly ten miles from where we started. But if our compass was malfunctioning, so that its north pointed a bit west of true north, we might need to walk seven miles in the direction we thought was east and just over seven miles in the direction we thought was north. We would then arrive at the same point as before. Both people walked based on their beliefs, first east and then north, and both ended up at the same location: one person is convinced that the endpoint is six miles east of the starting point, while the other is sure it’s seven miles.
Now we on the earth are all using a compass which points in the same direction as regards time. But other observers, on bodies moving with very different velocity, have a compass in which the time-direction is displaced as compared with ours. Hence our judgments of distances will not be alike. In our analogy, the northerly direction corresponds to time, and the easterly direction to space; and so long as we use the same compass we do not differ in our measurements of distances. But for any one who has a different notion of the time-direction, not only time intervals but space distances will be judged differently.
Now we on Earth are all using a compass that points in the same direction when it comes to time. But other observers, on bodies moving at very different speeds, have a compass where the time direction is shifted compared to ours. As a result, our judgments of distances will not be the same. In our analogy, the north direction corresponds to time, and the east direction corresponds to space; as long as we are using the same compass, our measurements of distances align. However, for anyone who has a different understanding of the time direction, both time intervals and space distances will be perceived differently.
In short, the universe is regarded as a space-time continuum of four dimensions. A “point” in space-time is called an “event”—that which occurs at a specified moment and at a specified place. The distance between two points in space-time is called their “interval.” All observers will agree as to the magnitude [203]of any interval, since it is a property of the objective universe; but they will disagree as to its composition in space and time separately. In short, space and time are relative conceptions; their relativity is a necessary consequence of the relativity of motion. The paradox named at the outset is overcome; for the two observers measuring the velocity of the light produced as they passed one another, were using different units of space and time. And hence emerges triumphant the Special Principle of Relativity, which states that the laws of nature are the same for all observers, whether they are in a state of rest or of uniform motion in a straight line.
In summary, the universe is seen as a four-dimensional space-time continuum. A “point” in space-time is referred to as an “event”—something that happens at a specific moment and place. The distance between two points in space-time is known as their “interval.” All observers will agree on the size [__A_TAG_PLACEHOLDER_0__] of any interval because it is a characteristic of the objective universe; however, they will have different views on its breakdown in terms of space and time separately. In short, space and time are relative concepts; their relativity is an inevitable result of the relativity of motion. The paradox mentioned at the beginning is resolved; the two observers measuring the speed of light as they passed each other were using different units for space and time. Thus, the Special Principle of Relativity emerges victorious, stating that the laws of nature are the same for all observers, whether they are at rest or in uniform motion in a straight line.
Motion at Speed
Uniform motion in a straight line is however a very special kind of motion. Our experience in ordinary life is of motions that are neither uniform nor in a straight line; both speed and direction of motion are altering. The moving body is then said to undergo “acceleration”: which means either that its speed is increasing or diminishing, or that its direction of motion is changing, or both. If we revert to our former supposition of a universe in which there is only a single body in “empty” space, we clearly cannot say whether it has acceleration any more than whether it is moving, there being no outside standard of comparison; and the General Principle of Relativity asserts the invariance of the laws of nature for all states of motion of the observer. In this case, however, a difference might be [204]detected by an observer on the moving body itself. It would be manifested to him as the action of a force; such for instance as we feel when a train in which we are traveling is increasing or reducing speed, or when, without changing speed, it is rounding a corner. The force dies away as soon as the velocity becomes uniform. Thus acceleration reveals itself to us under the guise of action by a force. Force and acceleration go together, and we may either say that the acceleration is due to the force, or the impression of force to the acceleration.
Uniform motion in a straight line is a very specific type of motion. In our everyday lives, we mostly experience motions that are neither uniform nor linear; both the speed and direction of movement change. When a moving object changes its speed or direction, we say it is experiencing “acceleration.” This means its speed is either increasing or decreasing, its direction is changing, or both. If we think about a hypothetical universe with just one object in “empty” space, we can’t determine if it has acceleration any more than we can tell if it is moving, since there’s no external standard to compare it to. The General Principle of Relativity states that the laws of nature remain the same, no matter the observer's state of motion. However, an observer on the moving body itself might notice a difference. This would be felt as a force acting on them, like when we feel a train speed up or slow down, or when it rounds a corner without changing speed. The force disappears as soon as the speed becomes uniform. Thus, we experience acceleration as the effect of a force. Force and acceleration are connected, and we can say that acceleration results from the force, or that the sensation of force comes from acceleration.
Now when we are traveling with accelerated motion, we have quite a different idea of what constitutes a straight line from that which we had when at rest or in uniform motion. If we are moving at uniform velocity in an airplane and drop a stone to the earth it will appear to us in the airplane to fall in a straight line downward, while to an observer on the earth it will appear to describe a parabola. This is due to the fact that the stone gathers speed as it falls; it is subject to the acceleration associated with gravity. Acceleration obliterates the fundamental difference between a straight and curved line. Unless we know what is the absolute motion of the stone, and the two observers, we cannot say whether the line is “really” a straight or a curved line. Since absolute motion is an illegitimate conception, it follows that there is no such thing as “really” straight or “really” curved. These are only appearances set up as a consequence of our relative motions with respect to the bodies concerned. If there were no such thing as acceleration—if the stone fell to the earth at uniform velocity—then an observer on the [205]earth or anywhere else would agree that it fell in a straight line; and straight lines would always be straight lines.
Now, when we're moving at a high speed, our understanding of what a straight line is changes compared to when we’re at rest or moving steadily. If we’re flying in an airplane at a constant speed and drop a stone, it will seem to fall straight down to us in the plane, while an observer on the ground will see it take a parabolic path. This happens because the stone picks up speed as it falls; it experiences the acceleration of gravity. Acceleration blurs the line between what we consider straight and curved. Unless we understand the absolute motion of the stone and the two observers, we can't really determine if the path is “truly” straight or curved. Since absolute motion is an invalid concept, it means there’s no such thing as “truly” straight or “truly” curved. These are just perceptions created by our different motions relative to the objects involved. If there were no acceleration—if the stone fell to the ground at a constant speed—then an observer on the [__A_TAG_PLACEHOLDER_0__]ground or anywhere else would agree that it fell in a straight line; and straight lines would always remain straight lines.
Under these circumstances, Euclidean geometry would be absolutely true. But if we are in a state of acceleration, then what we think are straight lines are “really” curved lines, and Euclidean geometry, based on the assumption that its lines are straight, must founder when tested by more accurate measurements. And in point of fact we are in a state of acceleration: for we are being acted upon by a force—namely, the force of gravitation. Wherever there is matter, there is gravitation; wherever there is gravitation there is acceleration; wherever there is acceleration Euclidean geometry is inaccurate. Hence in the space surrounding matter a different geometry holds the field; and bodies in general move through such space in curved lines.
Under these circumstances, Euclidean geometry would be completely accurate. But if we are experiencing acceleration, then what we perceive as straight lines are actually curved lines, and Euclidean geometry, which assumes its lines are straight, fails when examined with more precise measurements. In fact, we are in a state of acceleration: we are affected by a force—specifically, the force of gravity. Wherever there's matter, there's gravity; wherever there's gravity, there's acceleration; and wherever there's acceleration, Euclidean geometry doesn't hold up. Therefore, in the space surrounding matter, a different geometry applies; and objects generally move through that space along curved paths.
Different parts of space are thus characterized by different geometrical properties. All bodies in the universe proceed on their established courses through space and time. But when they come to distorted geometrical areas, their paths naturally seem to us different from when they were moving through less disturbed regions. They exhibit the difference by acquiring an acceleration; and we explain the acceleration by alleging the existence of a force, which we call the force of gravitation. But their motions can in fact be perfectly predicted if we know the geometry of the space through which they are traveling. The predictions so based have in fact proved more accurate than those based on the law of gravitation. [206]
Different areas of space have unique geometrical properties. All objects in the universe follow their designated paths through space and time. However, when they enter distorted geometrical areas, their paths seem different than when they were moving through less affected regions. They show this change by experiencing acceleration, and we explain the acceleration by suggesting the presence of a force that we call gravitational force. However, we can accurately predict their movements if we understand the geometry of the space they are moving through. Predictions based on this geometry have actually turned out to be more precise than those made using the law of gravitation. [__A_TAG_PLACEHOLDER_0__]
X
SPACE, TIME AND GRAVITATION
An Outline of Einstein’s Theory of General Relativity
An Overview of Einstein's Theory of General Relativity
“Henceforth space by itself and time by itself shall sink to mere shadows, and only a union of the two shall preserve reality.”
“From now on, space on its own and time on its own will just become shadows, and only a combination of the two will keep reality intact.”
The prophecy contained in the above-quoted words, spoken by Minkowski at the meeting of German “Naturforscher und Aerzte” at Cologne in 1908, has, however, only been completely fulfilled by Einstein’s “Allgemeine Relativitäts-theorie” of 1915, which incorporated gravitation into the union. In the following pages an attempt is made to set forth, without using any technical language, the leading ideas of that theory: I will confine myself to the theory as published by Einstein in November, 1915, which forms a consistent whole, complete in itself; and I will not refer to later developments, which are still more or less tentative, and not necessary for the understanding of the theory. The mathematics used by Einstein is the so-called Absolute Differential Calculus. It is not more difficult or recondite than that used in other branches of [207]theoretical physics, but it is somewhat unfamiliar to most of us, because it is not generally taught in the regular university courses. I will, however, in this essay abstain from using any mathematics at all, at least, I will not be using it openly. It is of course unavoidable to use at least the results of the mathematical reasoning, if not the reasoning itself; but so long as they are not put into formulas they will, it is hoped, not look so formidable to the reader.
The prophecy in the words quoted above, spoken by Minkowski at the meeting of German “Naturalists and doctors” in Cologne in 1908, has only been fully realized by Einstein’s “General Relativity” of 1915, which integrated gravitation into the theory. In the following pages, I will try to explain the main ideas of that theory without using technical terms: I will focus solely on the version that Einstein published in November 1915, which is a coherent and complete system on its own; I won't discuss later developments, which are still somewhat experimental and not necessary for understanding the theory. The mathematics Einstein used is known as Absolute Differential Calculus. It’s not any more difficult or obscure than what is used in other areas of [__A_TAG_PLACEHOLDER_0__]theoretical physics, but it may be unfamiliar to many, since it’s not typically covered in standard university courses. In this essay, though, I will avoid using any mathematics outright. It’s inevitable to reference at least the results of the mathematical reasoning, if not the reasoning itself; however, as long as they aren’t presented in formulas, they hopefully won’t seem too daunting to the reader.
Referring to the quoted words of Minkowski, we may ask what is meant by “reality.” Physical science, like common sense, takes for granted that there is a reality behind the phenomena, which is independent of the person by whom, and the particular methods by which it is observed, and which is also there when it is not observed. Strictly speaking, all talk about what is not observed is metaphysics. Nevertheless the physicist unhesitatingly believes that his laws are general, and that the phenomena continue to happen according to them when nobody is looking. And since it would be impossible to prove that they did not, he is fully entitled to his belief. The observed phenomena are the effects of the action of this reality, of which we assume the existence, on the observer’s senses—or apparatus, which are extended and refined sense-organs. The laws governing the phenomena therefore must convey some information regarding this reality. We shall never by any means be able to know anything else about it but just these laws. To all intents and purposes the laws are the reality, if we eliminate from them all that refers to the observer alone. What refers to the reality is called [208]“absolute,” and what involves reference to the observer “relative.” The elimination of the relative is one of the things the theory of relativity has set out to do.
Referring to Minkowski's words, we can ask what "reality" really means. Physical science, like common sense, assumes that there is a reality behind phenomena that exists independently of the observer and the specific methods used to observe it, and that remains even when it's not being observed. Strictly speaking, any discussion about what isn’t observed is metaphysics. Still, physicists confidently believe that their laws are universal and that phenomena continue to occur according to these laws even when no one is watching. And since it's impossible to prove otherwise, they are justified in their belief. The observed phenomena result from the influence of this assumed reality on the observer's senses—or their instruments, which are essentially enhanced versions of those senses. Therefore, the laws that govern these phenomena must provide some insight into this reality. We will never be able to know anything beyond these laws. For all practical purposes, the laws are the reality, assuming we remove everything that pertains only to the observer. What relates to the reality is termed “absolute,” and what pertains to the observer is “relative.” One of the goals of the theory of relativity is to eliminate the relative.
The Outside World and Its Geometry
To describe the phenomena and derive laws from them, we locate them in space and time. To do this we use geometry. Here it is that the part contributed by the observer comes in. There are an infinite number of geometries, and a priori there seems to be no reason to choose one rather than the other. Taking geometry of two dimensions as an example, we can draw figures on a piece of paper, and discuss their properties, and we can also do so on the shell of an egg. But we cannot draw the same figures on the egg as on the paper. The ones will be distorted as compared with the others: the two surfaces have a different geometry. Similarly it is not possible to draw an accurate map of the earth on a sheet of paper, because the earth is spherical and its representation on the flat paper is always more or less distorted. The earth requires spherical geometry, which differs from the flat, or Euclidean, geometry of the paper.
To describe phenomena and derive laws from them, we position them in space and time. For this, we use geometry. This is where the observer’s role comes into play. There are endless types of geometry, and at first glance, there doesn’t seem to be a reason to pick one over another. Taking two-dimensional geometry as an example, we can draw shapes on a piece of paper and discuss their properties, just like we can on the shell of an egg. However, we can’t draw the same figures on the egg as we do on the paper. The shapes will be distorted compared to each other: the two surfaces have different geometries. Likewise, it’s impossible to create an accurate map of the earth on a flat sheet of paper since the earth is spherical, and its representation on flat paper will always be somewhat distorted. The earth needs spherical geometry, which is different from the flat, or Euclidean, geometry of the paper.
Up to a few years ago Euclidean (i.e. flat) geometry of three dimensions had been exclusively used in physical theories. Why? Because it is the true one, is the one answer generally given. Now a statement about facts can be true or false, but a mathematical discipline is neither true nor false; it can only be correct—i.e. consistent in itself—or [209]incorrect, and of course it always is correct. The assertion that a certain geometry is the “true” one can thus only mean, that it is the geometry of “true” space, and this again, if it is to have any meaning at all, can only mean that it corresponds to the physical “reality.” Leaving aside the question whether this reality has any geometry at all, we are confronted with the more immediately practical consideration how we shall verify the asserted correspondence. There is no other way than by comparing the conclusions derived from the laws based upon our geometry, with observations. It thus appears that the only justification for the use of the Euclidean geometry is its success in enabling us to “draw an accurate map” of the world. As soon as any other geometry is found to be more successful, that other must be used in physical theories, and we may, if we like, call it the “true” one.
Until a few years ago, Euclidean (i.e. flat) geometry in three dimensions was the only geometry used in physical theories. Why? Because it's considered the "true" one, which is the common answer. A statement about facts can be true or false, but a mathematical system is neither true nor false; it can only be correct—i.e. consistent in itself—or [__A_TAG_PLACEHOLDER_0__]incorrect, and, of course, it is always correct. Saying that a certain geometry is the “true” one suggests that it describes the geometry of “true” space, and this must correspond to the physical “reality” for it to have any meaning. Setting aside whether this reality has any specific geometry, we face the more immediate question of how we can verify the claimed correspondence. The only way to do this is by comparing the conclusions drawn from our geometric laws with observations. It seems then that the only reason to use Euclidean geometry is its success in helping us “draw an accurate map” of the world. If another geometry proves to be more successful, that geometry must be used in physical theories, and we might choose to call it the “true” one.
Accurate observations always consist of measures, determining the position of material bodies in space. But the positions change, and for a complete description we also require measures of time. An important remark must be made here. Nobody has ever measured a pure space-distance, nor a pure lapse of time. The only thing that can be measured is the distance from a body at a certain point of space and a certain moment of time, to a body (either the same or another) at another point and another time. We can even go further and say that time cannot be measured at all. We profess to measure it by clocks. But a clock really measures space, and we derive the time from its space-measures by a fixed rule. This rule depends on the laws of motion of [210]the mechanism of the clock. Thus finally time is defined by these laws. This is so, whether as a “clock” we use an ordinary chronometer, or the rotating earth, or an atom emitting light-waves, or anything else that may be suggested. The physical laws, of course, must be so adjusted that all these devices give the same time. About the reality of time, if it has any, we know nothing. All we know about time is that we want it. We cannot adequately describe nature with the three space-coordinates alone, we require a fourth one, which we call time. We might thus say with some reason that the physical world has four dimensions. But so long as it was found possible adequately to describe all known phenomena by a space of three dimensions and an independent time, the statement did not convey any very important information. Only after it had been found out that the space-coordinates and the time are not independent, did it acquire a real meaning.
Accurate observations always consist of measurements, determining the position of physical objects in space. However, these positions change, so to fully describe them, we also need measurements of time. It’s essential to note that no one has ever measured a pure distance in space, nor a pure duration of time. The only thing we can measure is the distance from one object at a specific point in space and moment in time to another object (either the same or different) at another point and a different time. We can even go further to say that time can’t really be measured at all. We claim to measure it with clocks, but a clock actually measures space, and we derive time from its space measurements using a fixed rule. This rule is based on the laws of motion of [__A_TAG_PLACEHOLDER_0__]the clock mechanism. Ultimately, time is defined by these laws, whether we use a regular chronometer, the rotating Earth, an atom emitting light waves, or any other suggested method. The physical laws must be set up so that all these devices provide the same time. Regarding the reality of time, if it has any, we know nothing. All we know is that we want it. We cannot adequately describe nature using just three spatial coordinates; we need a fourth one, which we call time. So we might reasonably say that the physical world has four dimensions. But as long as it was possible to adequately describe all known phenomena with three spatial dimensions and independent time, this statement didn't convey very important information. It only gained real meaning once it was discovered that spatial coordinates and time are not independent.
As is well known the observation by which this was found out is the famous experiment of Michelson and Morley. It led to the “special” theory of relativity, which is the one referred to by Minkowski in 1908. In it a geometry of four dimensions is used, not a mere combination of a three-dimensional space and a one-dimensional time, but a continuum of truly fourfold order. This time-space is not Euclidean, since the time-component and the three space-components are not on the same footing, but its fundamental formula has a great resemblance to that of Euclidean geometry. We may call it “pseudo-Euclidean.” [211]
As is well known, the observation that led to this discovery is the famous experiment conducted by Michelson and Morley. It resulted in the “special” theory of relativity, which Minkowski referenced in 1908. This theory uses a four-dimensional geometry, not just a mix of three-dimensional space and one-dimensional time, but a true four-dimensional continuum. This time-space isn't Euclidean, as the time component and the three space components don't carry the same weight, but its fundamental formula closely resembles that of Euclidean geometry. We can call it “pseudo-Euclidean.” [__A_TAG_PLACEHOLDER_0__]
This theory, which we need not explain here, was very satisfactory so far as the laws of electromagnetism, and especially the propagation of light, were concerned, but it did not include gravitation, and mechanics generally. We then had this curious state of affairs, that physicists actually believed in two different “realities.” When they were thinking of light they believed in Minkowski’s time-space; when they were thinking of gravitation they believed in the old Euclidean space and independent time. This, of course, could not last. Attempts were made so to alter Newton’s law of gravitation that it would fit into the four-dimensional world of the special relativity-theory, but these only succeeded in making the law, which had been a model of simplicity, extremely complicated, and, what was worse, it became ambiguous.
This theory, which we don’t need to explain here, worked well for the laws of electromagnetism, especially the way light travels, but it didn’t account for gravity and mechanics in general. This led to a strange situation where physicists actually believed in two different “realities.” When they thought about light, they believed in Minkowski’s time-space; when they considered gravity, they relied on the old Euclidean space and independent time. Obviously, this couldn’t continue. Efforts were made to adjust Newton’s law of gravitation so it would fit into the four-dimensional world of the special relativity theory, but these efforts only complicated the previously simple law and, even worse, made it ambiguous.
It is Einstein’s great merit to have perceived that gravitation is of such fundamental importance, that it must not be fitted into a ready-made theory, but must be woven into the space-time geometry from the beginning. And that he not only saw the necessity of doing this, but actually did it.
It is Einstein’s great achievement to have recognized that gravity is so fundamentally important that it shouldn’t just be forced into an existing theory but should be integrated into the space-time geometry from the start. And he not only understood the need to do this but actually accomplished it.
Gravity and Its Role in the Universe
To see the necessity we must go back to Newton’s system of mechanics. Newton did two things (amongst others). He canonised Galileo’s system of mechanics into his famous “laws of motion,” the most important of which is the law of inertia, which says that: [212]
To understand the need for this, we need to revisit Newton’s system of mechanics. Newton did a couple of things (among others). He solidified Galileo’s system of mechanics into his well-known “laws of motion,” the most significant of which is the law of inertia, which states that: [__A_TAG_PLACEHOLDER_0__]
a body, that is not interfered with, moves in a straight line with constant velocity.
An object not influenced by external forces travels in a straight path at a constant speed.
The velocity, of course, can be nil, and the body at rest. This is a perfectly general law, the same for all material bodies, whatever their physical or chemical status. Newton took good care exactly to define what he meant by uniform motion in a straight line, and for this purpose he introduced the absolute Euclidean space and absolute time as an essential part of his system of laws at the very beginning of his great work. The other thing Newton did was to formulate the law of gravitation. Gravitation was in his system considered as an interference with the free, or inertial, motion of bodies, and accordingly required a law of its own.
The speed can definitely be zero, meaning the body is at rest. This is a completely universal rule that applies to all physical objects, regardless of their physical or chemical state. Newton carefully defined what he meant by uniform motion in a straight line, and to do this, he introduced the concepts of absolute Euclidean space and absolute time as a crucial part of his laws from the very start of his major work. Additionally, Newton formulated the law of gravitation. In his framework, gravitation was seen as a disruption to the free, or inertial, motion of bodies, and thus required its own law.
But gravitation has this in common with inertia, and in this it differs from all other interferences, that it is perfectly general. All material bodies are equally subjected to it, whatever their physical or chemical status may be. But there is more. Gravitation and inertia are actually indistinguishable from each other, and are measured by the same number: the “mass”. This was already remarked by Newton himself, and from his point of view it was a most wonderful accidental coincidence. If an apple falls from the tree, that which makes it fall is its weight, which is the gravitational attraction by the earth, diminished by the centrifugal force due to the earth’s rotation and the apple’s inertia. In Newton’s system the gravitational attraction is a “real” force, whereas the centrifugal force is only “fictitious”. But the one is as real as the other. The most refined experiments, already begun by Newton himself, have [213]not succeeded in distinguishing between them. Their identity is actually one of the best established facts in experimental physics. From this identity of “fictitious,” or inertial, and “real,” or gravitational, forces it follows that locally a gravitational field can be artificially created or destroyed. Thus inside a closed room which is falling freely, say a lift of which the cable has been broken, bodies have no weight: a balance could be in equilibrium with different weights in the two scales.
But gravity has this in common with inertia, and this is where it differs from all other forces: it is completely universal. All physical objects are equally affected by it, regardless of their physical or chemical makeup. But there's more. Gravity and inertia are actually indistinguishable from each other and are measured by the same quantity: "mass." Newton himself noted this, and from his perspective, it was a remarkable coincidence. When an apple falls from a tree, the reason it falls is its weight, which is the gravitational pull of the Earth, reduced by the centrifugal force from the Earth's rotation and the apple's inertia. In Newton's framework, gravitational attraction is a "real" force, while centrifugal force is only "fictitious." But both are equally real. The most sophisticated experiments, which began with Newton himself, have [__A_TAG_PLACEHOLDER_0__]failed to differentiate between them. Their similarity is actually one of the most established facts in experimental physics. From the similarity of "fictitious," or inertial, and "real," or gravitational, forces, it follows that a gravitational field can be artificially created or ruined in a localized area. For instance, inside a freely falling room, like an elevator with a severed cable, objects experience weightlessness: a balance could remain in equilibrium with different weights on either side.
Having thus come to the conclusion that gravitation is not an interference, but is identical with inertia, we are tempted to restate the law of motion, so as to include both, thus:
Having come to the conclusion that gravity is not an interference, but is the same as inertia, we are inclined to rephrase the law of motion to encompass both, like this:
Bodies which are not interfered with—do not move in straight lines, but—fall.
Undisturbed bodies—don’t travel in straight lines, but—drop.
Now this is exactly what Einstein did. Only the “falling” of course requires a precise mathematical definition (like the uniform motion in a straight line), and the whole gist of his theory is the finding of that definition. In our earthly experience the falling never lasts long, very soon something—the floor of the room, or the earth itself—interferes. But in free space bodies go on falling forever. The motion of the planets is, in fact, adequately described as falling, since it consists in nothing else but obeying Newton’s law of gravitation together with his law of inertia. A body very far removed from all other matter is not subjected to gravitation, consequently it falls with constant velocity in a straight line according to the law of inertia. The problem was thus to find a mathematical definition of “falling,” which would embrace the uniform straight-line motion [214]very far from all matter as well as the complex paths of the planets around the sun, and of an apple or a cannon-ball on earth.
Now this is exactly what Einstein did. But the term “falling” needs a precise mathematical definition (similar to uniform motion in a straight line), and the core of his theory revolves around finding that definition. In our everyday experience, falling doesn’t last long; something—like the floor or the ground—quickly gets in the way. However, in free space, objects continue to fall forever. The movement of the planets can actually be described as falling, since it’s just them following Newton’s law of gravitation along with his law of inertia. An object that is very far from any other matter isn't affected by gravitation, so it falls at a constant velocity in a straight line according to the law of inertia. The challenge was to come up with a mathematical definition of “falling” that would include both uniform straight-line motion [__A_TAG_PLACEHOLDER_0__] far from all matter and the complex paths of planets around the sun, as well as an apple or a cannonball on earth.
Gravity and Space-Time
For the definition of the uniform rectilinear motion of pure inertia Newton’s Euclidean space and independent time were sufficient. For the much more complicated falling under the influence of gravitation and inertia together, evidently a more complicated geometry would be needed. Minkowski’s pseudo-Euclidean time-space also was insufficient. Einstein accordingly introduced a general non-Euclidean four-dimensional time-space, and enunciated his law of motion thus:
For the definition of the uniform straight-line motion of pure inertia, Newton's Euclidean space and independent time were enough. However, for the much more complex scenario of falling under the combined influence of gravity and inertia, a more complicated geometry was clearly necessary. Minkowski's pseudo-Euclidean space-time was also not adequate. Einstein therefore introduced a general non-Euclidean four-dimensional space-time and stated his law of motion like this:
Bodies which are not interfered with move in geodesics.
Undisturbed bodies follow geodesics.
A geodesic in curved space is exactly the same thing as a straight line in flat space. We only call it by its technical name, because the name “straight line” would remind us too much of the old Euclidean space. If the curvature gets very small, or zero, the geodesic becomes very nearly, or exactly, a straight line.
A geodesic in curved space is basically the same as a straight line in flat space. We use the technical term because calling it a “straight line” makes us think too much about the old Euclidean space. If the curvature is very small or nonexistent, the geodesic is nearly, or exactly, a straight line.
The problem has now become to assign to time-space such curvatures that the geodesics will exactly represent the tracks of falling bodies. Space of two dimensions can just be flat, like a sheet of paper, or curved, like an egg. But in geometry of four dimensions there are several steps from perfect flatness, or “pseudo-flatness,” to complete curvature. Now the law governing the curvature of Einstein’s [215]time-space, i.e., the law of gravitation, is simply that it can never, outside matter, be curved more than just one step beyond perfect (pseudo-)flatness.
The issue now is to shape time-space in such a way that the geodesics accurately depict the paths of falling objects. Two-dimensional space can either be flat, like a sheet of paper, or curved, like an egg. However, in four-dimensional geometry, there are various degrees between complete flatness, or “pseudo-flatness,” and total curvature. The principle that governs the curvature of Einstein's [__A_TAG_PLACEHOLDER_0__] time-space, meaning the law of gravitation, is simply that it can never be curved more than one step beyond perfect (pseudo-)flatness, away from matter.
Since I have promised not to use any mathematics I can hardly convey to the reader an adequate idea of the difficulty of the problem, nor do justice to the elegance and beauty of the solution. It is, in fact, little short of miraculous that this solution, which was only adopted by Einstein because it was the simplest he could find, does so exactly coincide in all its effects with Newton’s law. Thus the remarkably accurate experimental verification of this law can at once be transferred to the new law. In only one instance do the two laws differ so much that the difference can be observed, and in this case the observations confirm the new law exactly. This is the well known case of the motion of the perihelion of Mercury, whose disagreement with Newton’s law had puzzled astronomers for more than half a century.
Since I've promised not to use any math, I can hardly give you a good idea of how difficult the problem is, nor can I fully appreciate the elegance and beauty of the solution. It's almost miraculous that this solution, which Einstein chose simply because it was the easiest he could find, aligns perfectly in all its effects with Newton's law. Therefore, the impressively accurate experimental confirmation of this law can directly apply to the new law. The two laws only differ noticeably in one case, and here, the observations perfectly support the new law. This is the well-known case of the motion of Mercury's perihelion, whose discrepancy with Newton's law puzzled astronomers for over fifty years.
Since Einstein’s time-space includes Minkowski’s as a particular case, it can do all that the other was designed to do for electro-magnetism and light. But it does more. The track of a pulse of light is also a geodesic, and time-space being curved in the neighborhood of matter, rays of light are no longer straight lines. A ray of light from a star, passing near the sun, will be bent round, and the star consequently will be seen in a different direction from where it would be seen if the sun had not been so nearly in the way. This has been verified by the observations of the eclipse of the sun of 1919 of May 29. [216]
Since Einstein’s time-space includes Minkowski’s as a specific case, it can do everything the other was meant to do for electromagnetism and light. But it goes further. The path of a light pulse is also a geodesic, and since time-space is curved around matter, light rays are no longer straight lines. A light ray from a star that passes close to the sun will be bent, so the star will appear in a different position than it would if the sun weren't so close. This was confirmed by the observations during the solar eclipse on May 29, 1919. [__A_TAG_PLACEHOLDER_0__]
There is one other new phenomenon predicted by the theory, which falls within the reach of observation with our present means. Gravitation chiefly affects the time-component of the four-dimensional continuum, in such a way that natural clocks appear to run slower in a strong gravitational field than in a weak one. Thus, if we make the hypothesis—which, though extremely probable, is still a hypothesis—that an atom emitting or absorbing light-waves is a natural clock, and the further hypothesis—still very probable, though less so than the former—that there is nothing to interfere with its perfect running, then an atom on the sun will give off light-waves of smaller frequency than a similar atom in a terrestrial laboratory emits. Opinions as yet differ as to whether this is confirmed or contradicted by observations.
There is one other new phenomenon predicted by the theory that we can observe with our current technology. Gravity mainly affects the time aspect of the four-dimensional continuum, causing natural clocks to appear to run slower in a strong gravitational field compared to a weak one. So, if we assume—though it’s still just a hypothesis—that an atom emitting or absorbing light waves acts like a natural clock, and we also assume—still quite likely, but a bit less so than the first—that nothing interferes with its perfect operation, then an atom on the sun will emit light waves of a lower frequency than a similar atom in a laboratory on Earth. There are still differing opinions about whether observations confirm or contradict this.
* * *
* * *
The great strength and the charm of Einstein’s theory do however not lie in verified predictions, nor in the explanation of small outstanding discrepancies, but in the complete attainment of its original aim: the identification of gravitation and inertia, and in the wide range of formerly apparently unconnected subjects which it embraces, and the broad view of nature which it affords.
The true strength and appeal of Einstein’s theory don’t come from confirmed predictions or solving minor inconsistencies, but from fully achieving its original goal: linking gravitation and inertia, covering a wide range of topics that once seemed unrelated, and offering a broad perspective on nature.
* * *
Below is a short piece of text (5 words or fewer). Modernize it into contemporary English if there's enough context, but do not add or omit any information. If context is insufficient, return it unchanged. Do not add commentary, and do not modify any placeholders. If you see placeholders of the form __A_TAG_PLACEHOLDER_x__, you must keep them exactly as-is so they can be replaced with links.
Outside matter, as has been explained, the law of gravitation restricts the curvature of time-space. Inside continuous matter the curvature can be of any arbitrary kind or amount; the law of gravitation then connects this curvature with measurable properties of the matter, such as density, velocity, stress, etc. Thus these properties define the curvature, or, [217]if preferred, the curvature defines the properties of matter, i.e. matter itself.
Outside of matter, as previously explained, the law of gravitation limits the curvature of space-time. Inside continuous matter, the curvature can be of any type or magnitude; the law of gravitation then links this curvature to measurable properties of the matter, such as density, velocity, stress, and so on. Therefore, these properties determine the curvature, or, [__A_TAG_PLACEHOLDER_0__]if you prefer, the curvature determines the properties of matter, i.e. matter itself.
From these definitions the laws of conservation of energy, and of conservation of momentum, can be deduced by a purely mathematical process. Thus these laws, which at one time used to be considered as the most fundamental ones of mechanics, now appear as simple corollaries from the law of gravitation. It must be pointed out that such things as length, velocity, energy, momentum, are not absolute, but relative, i.e. they are not attributes of the physical reality, but relations between this reality and the observer. Consequently the laws of conservation are not laws of the real world, like the law of gravitation, but of the observed phenomena. There is, however one law which, already before the days of relativity, had come to be considered as the most fundamental of all, viz: the principle of least action. Now action is absolute. Accordingly this principle retains its central position in Einstein’s theory. It is even more fundamental than the law of gravitation, since both this law, and the law of motion, can be derived from it. The principle of least action, so far as we can see at present, appears to be the law of the real world. [218]
From these definitions, the laws of conservation of energy and momentum can be derived using a purely mathematical approach. Therefore, these laws, once considered the most fundamental in mechanics, now seem to be simple consequences of the law of gravitation. It's important to note that concepts like length, velocity, energy, and momentum are not absolute; rather, they are relative, meaning they are not inherent qualities of physical reality, but rather relationships between this reality and the observer. As a result, the laws of conservation are not laws of the actual world, like the law of gravitation, but of observed phenomena. However, there is one law that had already been regarded as the most fundamental before the advent of relativity, namely the principle of least action. Action is absolute. Thus, this principle maintains its crucial role in Einstein’s theory. It is even more fundamental than the law of gravitation, as both this law and the law of motion can be derived from it. The principle of least action, based on our current understanding, appears to be the law of the real world. [__A_TAG_PLACEHOLDER_0__]
XI
THE PRINCIPLE OF GENERAL RELATIVITY
How Einstein, to a Degree Never Before Equalled, Isolates the External Reality from the Observer’s Contribution
How Einstein, like never before, differentiates the external world from the observer's perspective.
Einstein’s general relativity is of such vast compass, being coextensive with the realm of physical events, that in any brief account a strict selection from its numerous aspects is prescribed. The old, restricted principle being contained in the general, we shall treat the latter, its close relations with gravitation, and the significance of both for our knowledge of space and time. The essence of Einstein’s generalization is its final disentanglement of that part of any physical event which is contributed by the observer from that which is inherent in the nature of things and independent of all observers.
Einstein’s general relativity covers such a wide range, intersecting all physical events, that any brief overview requires a careful selection from its many aspects. The previous, limited principle is included in the general one, so we will focus on the latter, its close relationship with gravity, and the importance of both for our understanding of space and time. The core of Einstein’s idea is its ultimate separation of the part of any physical event that comes from the observer from what is inherent in the nature of things and independent of all observers.
The argument turns upon the fact that an observer must describe any event with reference to some framework from which he makes measurements of time and distance. Thus, suppose that at nine o’clock a ball is tossed across the room. At one second past nine the ball occupies a definite position which we can specify by giving the three distances from the [219]centre of the ball to the north and west walls and the floor. In this way, refining our measurements, we can give a precise description of the entire motion of the ball. Our final description will consist of innumerable separate statements, each of which contains four numbers corresponding to four measurements, and of these one will be for time and three for distances at the time indicated.
The argument revolves around the idea that an observer has to describe any event using a framework from which they measure time and distance. For example, let's say a ball is thrown across the room at nine o'clock. At one second past nine, the ball is at a specific position that we can define by giving the three distances from the [__A_TAG_PLACEHOLDER_0__] center of the ball to the north wall, the west wall, and the floor. By refining our measurements this way, we can provide a detailed description of the ball's entire motion. Our final description will consist of countless separate statements, each containing four numbers linked to four measurements—one for time and three for distances at that specific moment.
Imagine now that a man in an automobile looks in and observes the moving ball. Suppose he records the motion. To do so, he must refer to a timepiece and some body of reference. Say he selects his wrist-watch, the floor of his auto and two sides meeting in a corner. Fancy that just as he begins his series of observations his auto starts bucking and the main-spring of his watch breaks, so that he must measure “seconds” by the crazy running-down of his watch, and distances with reference to the sides of his erratic auto. Despite these handicaps he completes a set of observations, each of which consists of a time measured by his mad watch and three distances reckoned from the sides of his bucking machine. Let us assume him to have been so absorbed in his experiment that he noticed neither the disorders of his watch nor the motion of his auto. He gives us his sets of measurements. We remark that his seconds are only small fractions of ours, also his norths and wests are badly mixed. If we interpret his sets in terms of our stationary walls and sober clock we find the curious paradox that the ball zigzagged across the room like an intoxicated bee. He obstinately argues that we know no more than he about how the ball actually moved. For we got a smooth [220]description, he asserts, by choosing an artificially simple reference framework, having no necessary relations whatever to the ball. The crooked path plotted from his observations proves, he declares, that the ball was subject to varying forces of which we in the room suspected nothing. He contends that our room was being jarred by a system of forces which exactly compensated and smoothed out the real jaggedness of path observed by himself. But if we know all about his watch and auto we can easily apply necessary corrections to his measurements, and, fitting the corrected set to our reference-framework of walls and clock, recover our own smooth description.
Imagine a man in a car watching a moving ball. Suppose he records its motion. To do this, he needs a watch and a reference point. Let’s say he uses his wristwatch, the floor of his car, and two walls meeting at a corner. Imagine that just as he starts his observations, his car begins to bounce, and the spring in his watch breaks, so he ends up measuring “seconds” by the erratic ticking of his watch and distances based on the sides of his bouncing car. Despite these challenges, he completes a series of measurements, each consisting of a time measured by his malfunctioning watch and three distances based on his shaking vehicle. Let’s assume he’s so focused on his experiment that he doesn’t notice his watch acting up or the movement of his car. He presents us with his measurements. We notice that his seconds are just small fractions of ours, and his directions are all mixed up. If we interpret his data in terms of our stationary environment and accurate clock, we find it oddly appears that the ball zigzagged across the room like a drunken bee. He stubbornly argues that we know just as little as he does about how the ball really moved. He claims that we got a smooth description by choosing a simplified reference framework that has no real connection to the ball. The twisted path he plotted from his observations shows, he insists, that the ball was influenced by varying forces that we in the room didn't notice. He argues that our room was being shaken by a series of forces that perfectly balanced out and smoothed over the actual erratic path he observed. But if we understand everything about his watch and car, we can easily make the necessary corrections to his measurements and, fitting the corrected data into our framework of walls and clock, recover our own smooth description.
For consistency we must carry our readjustments farther. The path mapped from our measurements is a curve. Perhaps the curvature was introduced by some peculiarity of our reference framework? Possibly our own room is being accelerated upward, so that it makes the ball’s true path—whatever that may be—appear curved downward, just as the autoist’s zigzags made the path he mapped appear jagged. Tradition attributes the downward curving to the tug of gravity. This force we say accelerates the ball downward, producing the curved path. Is this the only possible explanation? Let us see.
For consistency, we need to take our adjustments further. The path derived from our measurements is curved. Could the curve be caused by something unusual in our reference framework? It’s possible that our room is accelerating upward, making the ball’s true path—whatever that may be—look like it’s curving downward, much like how a driver’s zigzags make the path they traced look jagged. Traditionally, we blame the downward curve on gravity's pull. We say this force accelerates the ball downward, creating the curved path. Is this the only explanation? Let’s explore.
Gravity and Acceleration
Imagine a man in a room out of which he cannot see. He notices that when he releases anything it falls to the floor with a constant acceleration. Further he observes that all his objects, independently [221]of their chemical and physical properties, are affected in precisely the same way. Now, he previously has experimented with magnets, and has remarked that they attract certain bodies in essentially the same way that the things which he drops are “attracted” to whatever is beneath the floor. Having explained magnetic attraction in terms of “forces,” he makes his first hypothesis: (A) He and his room are in a strong “field of force,” which he designates gravitational. This force pulls all things downward with a constant acceleration. Here he notes a singular distinction between magnetic and gravitational “forces”: magnets attract only a few kinds of matter, notably iron; the novel “force,” if indeed a force at all, acts similarly upon all kinds of matter. He makes another hypothesis: (B) His room and he are being accelerated upward.
Imagine a man in a room where he can't see outside. He notices that when he lets go of anything, it falls to the floor at a constant speed. He also sees that all his objects, regardless of their chemical and physical properties, are affected in exactly the same way. He has previously experimented with magnets and noticed that they attract certain materials in basically the same manner as the items he drops are “pulled” toward whatever is below the floor. After explaining magnetic attraction in terms of “forces,” he comes up with his first hypothesis: (A) He and his room are in a strong “field of force,” which he calls gravitational. This force pulls everything down at a constant speed. He observes a key difference between magnetic and gravitational “forces”: magnets attract only a few types of matter, primarily iron; the new “force,” if it really is a force, affects all types of matter equally. He proposes another hypothesis: (B) His room and he are being pushed upward.
* * *
* * *
Either (A) or (B) describes the facts perfectly. By no experiment can he discriminate between them. So he takes the great step, and formulates the Equivalence Hypothesis:
Either (A) or (B) describes the facts perfectly. No experiment can help him tell them apart. So he takes the big step and formulates the Equivalence Hypothesis:
A gravitational field of force is precisely equivalent in its effects to an artificial field of force introduced by accelerating the framework of reference, so that in any small region it is impossible to distinguish between them by any experiment whatever.
A gravitational field has the same effects as an artificial field created by accelerating the reference frame, making it impossible to distinguish between them through any experiment in a small area.
Next reconsidering his magnetic “forces,” he extends the equivalence hypothesis to cover all manifestations of force: The effects attributed to forces of any kind whatever can be described equally well by saying that our reference frameworks are accelerated; and moreover there is possible no experiment [222]which will discriminate between the descriptions.
Next, reconsidering his magnetic "forces," he broadens the equivalence hypothesis to include all forms of force: The effects we attribute to any kind of forces can be just as accurately described by stating that our reference frames are accelerating; furthermore, there is no experiment [__A_TAG_PLACEHOLDER_0__] that can differentiate between the two descriptions.
If the accelerations are null, the frameworks are at rest or in uniform motion relatively to one another. This special case is the “restricted” principle of relativity, which asserts that it is impossible experimentally to detect a uniform motion through the ether. Being thus superfluous for descriptions of natural phenomena, the ether may be abandoned, at least temporarily. The older physics sought this absolute ether framework to which all motions could be unambiguously referred, and failed to find it. The most exacting experiments, notably that of Michelson-Morley, revealed no trace of the earth’s supposed motion through the ether. Fitzgerald accounted for the failure by assuming that such motion would remain undetected if every moving body contracted by an amount depending upon its velocity in the direction of motion. The contraction for ordinary velocities is imperceptible. Only when as in the case of the beta particles, the velocity is an appreciable fraction of the velocity of light, is the contraction revealed. This contraction follows immediately from Einstein’s generalization constructed upon the equivalence hypothesis and the restricted relativity principle. We shall see that the contraction inevitably follows from the actual geometry of the universe.1
If the accelerations are zero, the frameworks are either at rest or moving at a constant speed relative to each other. This special case is known as the “restricted” principle of relativity, which states that it’s impossible to experimentally detect uniform motion through the ether. Since it’s unnecessary for explaining natural phenomena, the ether can be disregarded, at least for now. Older physics searched for this absolute ether framework to which all motions could be clearly referenced but failed to find it. The most precise experiments, especially the Michelson-Morley experiment, showed no evidence of the earth's supposed motion through the ether. Fitzgerald explained this failure by suggesting that such motion would go undetected if every moving object contracted by an amount based on its speed in the direction of motion. The contraction for ordinary speeds is too small to notice. It's only when, as with beta particles, the speed is a significant fraction of the speed of light that the contraction becomes apparent. This contraction directly follows from Einstein’s generalization based on the equivalence hypothesis and the restricted relativity principle. We’ll see that the contraction inevitably arises from the actual geometry of the universe.1
Let us return for a moment to the moving ball. Four measures, three of distances and one of time, are required in specifying its position with reference [223]to some framework at each point and at each instant. All of these measures can be summed up in one compendious statement—the equations of motion showed how in changing from our room to his accelerated auto we found a new summary, “transformed equations,” which seemed to indicate that the ball had traversed a strong, variable field of force. Is there then in the chaos of observational disagreements anything which is independent of all observers? There is, but it is hidden at the very heart of nature.
Let’s go back for a moment to the moving ball. To pinpoint its position in relation to some reference point, we need four measurements: three distances and one time, for every moment and location. All of these measurements can be summed up in one concise statement—the equations of motion revealed that as we moved from our room to his speeding car, we found a new summary, “transformed equations,” which seemed to suggest that the ball had moved through a strong, changing field of force. So, is there anything in the midst of all the conflicting observations that is independent of all observers? Yes, but it’s hidden deep within the essence of nature.
Paths Through the Four-Dimensional World
To exhibit this, we must recall a familiar proposition of geometry: the square on the longest side of a right-angled triangle is equal to the sum of the squares on the other two sides. It has long been known that from this alone all the metrical properties of Euclidean space—the space in which for 2,000 years we have imagined we were living—can be deduced. Metrical properties are those depending upon measurement. Now, in the geometry of any space, Euclidean or not, there is a single proposition of a similar sort which tells us how to find the most direct distance between any two points that are very close together. This small distance is expressed in terms of the two sets of distance measurements by which the end-points are located, just as two neighboring positions of our ball were located by two sets of four measurements each. We say by analogy that two consecutive positions of the ball [224]are separated by a small interval of time-space. From the formula for the very small interval of time-space we can calculate mathematically all the metrical properties of the time and space in which measurements for the ball’s motion must be made. So in any geometry mathematical analysis predicts infallibly the truth about all facts depending upon measurements from the simple formula of the interval between neighboring points. Thus, on a sphere the sum of the angles of any triangle formed by arcs of great circles exceeds 180°, and this follows from the formula for the shortest (“geodesic”) distance between neighboring points on the spherical surface.
To illustrate this, we need to remember a well-known principle of geometry: the square of the longest side of a right triangle is equal to the sum of the squares of the other two sides. It's been recognized for a long time that from this alone, we can deduce all the measurement properties of Euclidean space—the space we’ve thought we’ve lived in for 2,000 years. Measurement properties are those that are based on measurement. Now, in the geometry of any space, whether Euclidean or not, there is one similar principle that shows us how to find the most direct distance between any two points that are very close together. This small distance is expressed in terms of the two sets of distance measurements used to locate the endpoints, just like how two nearby positions of our ball were located using two sets of four measurements each. We can say by analogy that two consecutive positions of the ball [__A_TAG_PLACEHOLDER_0__] are separated by a small interval of time-space. From the formula for this very small interval of time-space, we can mathematically calculate all the measurement properties of the time and space in which the measurements for the ball's motion must be taken. So in any geometry, mathematical analysis reliably predicts the truth about all facts that depend on measurements from the simple formula for the interval between neighboring points. Thus, on a sphere, the sum of the angles of any triangle formed by arcs of great circles is greater than 180°, which follows from the formula for the shortest (“geodesic”) distance between neighboring points on the spherical surface.
We saw that it takes four measurements, one for time and three for distances, to fix an elementary event, viz., the position of the centre of our ball at any instant. A system of all possible such sets of four measurements each, constitutes what mathematicians call a four-dimensional space. The study of the four-dimensional time-space geometry, once its shortest-distance proposition is known, reveals all those relations in nature which can be ascertained by measurements, that is, experimentally. We have then to find this indispensable proposition.
We observed that it requires four measurements—one for time and three for distances—to define a basic event, namely, the position of the center of our ball at any given moment. A collection of all possible sets of these four measurements makes up what mathematicians refer to as a four-dimensional space. Exploring the four-dimensional time-space geometry, once we understand its shortest-distance principle, uncovers all the relationships in nature that can be determined through measurements, that is, experimentally. Therefore, we need to identify this essential principle.
Imagine the path taken by a particle moving solely under the influence of gravitation. This being the simplest possible motion of an actual particle in the real world, it is natural to guess that its path will be such that the particle moves from one point of time-space to another by the most direct route. This in fact is verified by forming the equations of the free particle’s motion, which turn out to be precisely those that specify a geodesic (most direct line) joining [225]the two points. On the (two-dimensional) surface of a sphere such a line is the position taken by a string stretched between two points on the surface, and this is the shortest distance on the surface between them. But in the time-space geometry we find a remarkable distinction: the interval between any two points of the path taken is the longest possible, and between any two points there is only one longest path. Translated into ordinary space and time this merely asserts that the time taken between any two points on the natural path is the longest possible.
Imagine the path taken by a particle moving only under the influence of gravity. Since this is the simplest type of motion for a real particle in the world, it makes sense to think that its path will be the most direct route from one point in time-space to another. This is confirmed by creating the equations of the particle’s motion, which turn out to be exactly those that define a geodesic (the most direct line) connecting [__A_TAG_PLACEHOLDER_0__]the two points. On a two-dimensional surface of a sphere, this line is represented by a string stretched between two points on that surface, showing the shortest distance between them. However, in the geometry of time-space, there’s an interesting difference: the interval between any two points on the path is the longest possible, and there is only one longest path between any two points. In simpler terms, this means that the time taken between any two points on the natural path is the longest possible.
Recall now that when the line-formula for any kind of space is known all the metrical properties of that space are completely determined, and combine with this what we have just found, namely, the equations of motion of a particle subject only to gravitation are the same equations as those which fix the line-formula for the four-dimensional time-space. Since gravitation alone determines the motion of the particle, and since this motion is completely described by the very equations which fix all the metrical properties of time-space, it follows that the metrical (experimentally determinable) properties of time-space are equivalent to those of gravitation, in the sense that each set of properties implies the other.
Recall that when the line formula for any type of space is known, all the measurable properties of that space are fully determined. Along with this, we just discovered that the equations of motion for a particle that is only influenced by gravity are the same equations that define the line formula for four-dimensional spacetime. Since gravity alone dictates the particle's motion, and this motion is completely described by the same equations that define all the measurable properties of spacetime, it follows that the measurable (experimentally determinable) properties of spacetime are equivalent to those of gravity, meaning each set of properties implies the other.
The Space-Time Universe
We have found the thing in nature which is independent of all observers, and it turns out to be the very structure of time-space itself. The motion of the free particle obviously is a thing unconditioned [226]by accidents of observation; the particle under the influence of gravitation alone must go a way of its own. And if some observer in an artificial field of force produced by the acceleration of his reference framework describes the path as knotted, he merely is foisting eccentricities of his own motion upon the direct path of the particle. The conclusion is rational, for we believe that time-space exists independently of any man’s way of perceiving it.
We have discovered something in nature that exists independently of any observers, and it turns out to be the very structure of space and time itself. The motion of a free particle is clearly something that isn’t affected [__A_TAG_PLACEHOLDER_0__]by the quirks of observation; the particle, influenced only by gravity, must follow its own path. If an observer in an artificial force field created by their own accelerated frame of reference describes the path as twisted, they are simply projecting their own motion onto the particle's direct path. The conclusion is logical because we believe that space and time exist independently of how any person perceives them.
Incidentally note that this space is that of the physical world. For only by measurements of distances and times can we become aware of our extension in time and space. If beyond this time-space geometry of measurements there is some “absolute geometry,” science can have no concern with it, for never can it be revealed by the one exploring device we possess—measurement.
Incidentally, keep in mind that this space refers to the physical world. We only become aware of our presence in time and space through measurements of distances and times. If there is some “absolute geometry” beyond this time-space measurement, science can't concern itself with it, because it can never be revealed by the only tool we have—measurement.
We have followed a single particle. Let us now form a picture of several. Any event can be analyzed into a multitude of coincidences in time-space. For consider two moving particles—say electrons. If they collide they both are in very approximately one place at the same time. We imagine the path of an electron through time-space plotted by a line (in four-dimensional space), which will deviate from a “most direct” (geodesic) path if the electron is subjected to forces. This is the “world-line” of the electron. If the world lines of several electrons intersect at one point in time-space, the intersection pictures the fact of their coincidence somewhere and somewhen; for all their world-lines having a time-space point in common, at some instant they must have been in collision. Each point of a world-line [227]pictures the position at a certain place at a certain time; and it is the intersections of world-lines which correspond to physical events. Of what lies between the intersections we have no experimental knowledge.
We have tracked a single particle. Now, let's picture several. Any event can be broken down into many coincidences in space-time. Consider two moving particles—like electrons. If they collide, they are both nearly in the same place at the same time. We visualize the path of an electron through space-time as a line (in four-dimensional space), which will veer off from the "most direct" (geodesic) path if the electron experiences forces. This is the electron's "world-line." If the world-lines of multiple electrons cross at a single point in space-time, that intersection represents their coincidence at some time and place; since all their world-lines share a space-time point, they must have collided at that moment. Each point on a world-line [__A_TAG_PLACEHOLDER_0__]represents its position at a specific location and time; and it's the intersections of world-lines that correspond to physical events. We have no experimental knowledge of what happens between those intersections.
Imagine the world-lines of all the electrons in the universe threading time-space like threads in a jelly. The intersections of the tangle are a complete history of all physical events. Now distort the jelly. Clearly the mutual order of the intersections will be unchanged, but the distances between them will be shortened or lengthened. To a distortion of the jelly corresponds a special choice (by some observer) of a reference framework for describing the order of events. He cannot change the natural sequence of events. Again we have found something which is independent of all observers.
Imagine the paths of all the electrons in the universe weaving through time and space like threads in jelly. The places where they intersect represent the full history of all physical events. Now, distort the jelly. Clearly, the overall order of the intersections remains unchanged, but the distances between them may become shorter or longer. To a distortion of the jelly corresponds a specific choice (by some observer) of a reference frame to describe the order of events. They cannot alter the natural sequence of events. Once again, we've identified something that depends on no observer.
We can now recapitulate our conclusions and state the principle of relativity in its most general form.
We can now summarize our conclusions and present the principle of relativity in its most general form.
(1) Observers describe events by measures of times and distances made with regard to their frameworks of reference.
(1) Observers describe events using measurements of time and distance based on their points of view.
(2) The complete history of any event is summarized in a set of equations giving the positions of all the particles involved at every instant.
(2) The full history of any event is captured in a set of equations that show the positions of all the particles involved at every moment.
(3) Two possibilities arise. (A) Either these equations are the same in form for all space-time reference frameworks, persisting formally unchanged for all shifts of the reference scheme; or (B), they subsist only when some special framework is used, altering their form as they are referred to different frameworks. If (B) holds, we naturally assume that the equations, and the phenomena which [228]they profess to represent, owe their existence to some peculiarity of the reference framework. They do not, therefore, describe anything which is inherent in the nature of things, but merely some idiosyncrasy of the observer’s way of regarding nature. If (A) holds, then obviously the equations describe some real relation in nature which is independent of all possible ways of observing and recording it.
(3) Two possibilities come up. (A) Either these equations are the same in form for all time-space reference frameworks, remaining unchanged through all shifts in the reference system; or (B), they only apply when a specific framework is used, changing their form when referenced to different frameworks. If (B) is true, we naturally think that the equations, and the phenomena that [__A_TAG_PLACEHOLDER_0__]they claim to represent, are due to some unique aspect of the reference framework. Therefore, they don’t describe anything inherent in the nature of things, but rather some quirks of the observer’s perspective on nature. If (A) is true, then clearly the equations describe some real relationship in nature that is independent of any way of observing or recording it.
(4) In its most general form the principle of relativity states that those relations, and those alone, which persist unchanged in form for all possible space-time reference frameworks are the inherent laws of nature.
(4) In its broadest sense, the principle of relativity says that only those relationships that remain unchanged in form across all possible space-time reference frames are the fundamental laws of nature.
To find such relations Einstein has applied a mathematical method of great power—the calculus of tensors—with extraordinary success. This calculus threshes out the laws of nature, separating the observer’s eccentricities from what is independent of him, with the superb efficiency of a modern harvester. The residue is a physical geometry—or geometrical physics—of time-space, in which it appears that the times and spaces contributed by the several observers’ reference frameworks are shadows of their own contrivings; while the real, enduring universe is a fourfold order of time and space indissolubly bound together. One observer separates this time-space into his own “time” and “space” in one way, determined by his path through the world of events; another, moving relatively to the first, separates it differently, and what for one is time shades into space for another.
To discover such relationships, Einstein has used a highly powerful mathematical method—the calculus of tensors—with remarkable success. This calculus extracts the laws of nature, distinguishing the observer’s quirks from what is independent of them, with the impressive efficiency of a modern harvester. The result is a physical geometry—or geometrical physics—of time-space, where it seems that the time and space provided by different observers' reference frames are just reflections of their own constructions; meanwhile, the true, lasting universe is a four-dimensional order of time and space inseparably linked together. One observer breaks this time-space into their own “time” and “space” in one way, based on their path through the world of events; another observer, moving relative to the first, perceives it differently, and what one person sees as time merges into space for another.
This time-space geometry is non-Euclidean. It is “warped” (curved), the amount of warping at [229]any place being determined by the intensity of the gravitational field there. Thus again gravitation is rooted in the nature of things. In this sense it is not a force, but a property of space. Wherever there is matter there is a gravitational field, and hence a warping of space. Conversely, as long ago imagined by Clifford, wherever there is a warping of space, there is matter; and matter is resolved ultimately into wrinkles in time-space.
This time-space geometry is non-Euclidean. It’s “warped” (curved), with the degree of warping at [__A_TAG_PLACEHOLDER_0__]any location being determined by the strength of the gravitational field there. So, once again, gravity is built into the nature of things. In this sense, it’s not a force, but a feature of space. Wherever there’s matter, there’s a gravitational field, and thus a warping of space. On the flip side, as Clifford imagined long ago, wherever there’s a warping of space, there’s matter; and matter ultimately breaks down into ripples in time-space.
To visualize a warped space, consider a simple analogy. A man walks away from a polished globe; his image recedes into the mirror-space, shortening and thinning as it goes, and thinning (in the direction of motion) faster than it shortens. Everything around him experiences a like effect. If he tries to discover this by a footrule it automatically shortens faster as he turns it into the horizontal position, so his purpose eludes him. The mirror-space is warped in the direction of the image’s motion. So is our own. For all bodies, as evidenced by the Fitzgerald contraction, shorten in the direction of motion. And just as the image can never penetrate the mirror-space a greater distance than half its radius, so probably time-space is curved in such a way that our universe, like the surface of a sphere, is finite in extent, but unbounded. [230]
To visualize a warped space, think of a simple analogy. A man walks away from a shiny globe; his reflection moves back into the mirror-space, getting shorter and thinner as it goes, and thinning (in the direction he's moving) faster than it shortens. Everything around him experiences a similar effect. If he tries to measure this with a ruler, it automatically shortens faster as he lays it horizontally, so he can't achieve his goal. The mirror-space is warped in the direction of the image’s motion. So is our own. All objects, as shown by the Fitzgerald contraction, shorten in the direction of motion. And just like the image can never penetrate the mirror-space more than half its radius, time-space is probably curved in such a way that our universe, like the surface of a sphere, is finite in size, but unbounded. [__A_TAG_PLACEHOLDER_0__]
XII
FORCE VS. GEOMETRY
How Einstein Has Substituted the Second for the First in Connection With the Cause of Gravitation
How Einstein Has Replaced the First with the Second Regarding the Cause of Gravitation
The theory of relativity represents a most strikingly original conception of time and space, which was suggested by Einstein in order to correlate with all our past experience certain observations made in recent years. It is therefore extremely comprehensive in its scope; it demands from us a radical revision in our notions of time and space; it throws new light on the nature of mass and energy, and finally, it furnishes a totally new conception of the old problem of gravitation.
The theory of relativity presents a uniquely original understanding of time and space, proposed by Einstein to connect our past experiences with recent observations. It is highly comprehensive in its reach; it requires us to completely rethink our ideas about time and space; it offers fresh insights into the nature of mass and energy, and ultimately, it provides a completely new perspective on the classic issue of gravity.
The starting point of the theory is the familiar observation that motion is always relative: that is, to define the motion of any object we must always use some point of reference. Thus we speak of the velocity of a train as 40 miles per hour with respect to the earth’s surface, but would find it impossible to determine its absolute speed, or motion in space, since we know of no star whose position can be spoken of as absolutely fixed. These and [231]similar considerations have led to the conclusion, pointed out by Newton and others, that it is impossible by any mechanical experiments on the earth to measure its velocity in space.
The starting point of the theory is the well-known observation that motion is always relative. To define the motion of any object, we need a point of reference. For example, we describe a train's speed as 40 miles per hour relative to the earth's surface, but it's impossible to determine its absolute speed or motion in space since we have no star that's considered absolutely fixed. These and [__A_TAG_PLACEHOLDER_0__] similar considerations have led to the conclusion, noted by Newton and others, that it's impossible to measure the Earth's velocity in space through any mechanical experiments conducted on it.
However, the results of observations on the phenomena of light and electricity led to the revival of the same problem under another form. As well known, there was evolved from these discoveries, the theory that light and electrical energy are of the same nature, and are in each case manifestations of wave-disturbances propagated through a hypothetical medium, the ether, with a velocity of 186,000 miles per second.
However, the observations of light and electricity brought the same issue back in a different form. As is widely known, these discoveries led to the theory that light and electrical energy are fundamentally the same and are both expressions of wave disturbances that travel through a hypothetical medium, the ether, at a speed of 186,000 miles per second.
The problem therefore arose as to whether the earth and all stellar bodies move through this ether. In that case it ought to be possible to measure the velocity of the earth with respect to this medium, and under these conditions we could speak, in a sense, of absolute motion.
The issue then became whether the Earth and all celestial bodies move through this ether. If so, it should be possible to measure the Earth's speed relative to this medium, and under these conditions, we could talk about absolute motion in a way.
A large number of experiments has been tried with this end in view. The most famous of these, and the one which stimulated the subsequent development of the theory of relativity, was that carried out by Michelson and Morley in 1887. To understand the significance of this experiment we shall refer briefly to an analogous observation which is quite familiar.
A lot of experiments have been conducted with this goal in mind. The most well-known of these, and the one that inspired the later development of the theory of relativity, was conducted by Michelson and Morley in 1887. To grasp the importance of this experiment, we’ll briefly mention a similar observation that is quite familiar.
Does it take longer to swim to a point 1 mile up a stream and back or to a point 1
mile across stream and back? The experienced swimmer will answer that the up-and-down
journey takes longer. If we assume that the swimmer has a speed of 5 miles an hour
in still water and that the current [232]is 3 miles an hour, we find that, while it requires five-eighths hour to make the
up-and-down journey, it takes only one-half hour for the trip across stream and back.
The ratio between the times required for the two journeys is thus five-fourths, and
if this is written in the form
it shows how the result depends upon the square of the ratio of the speeds of the
swimmer and the current.
Does it take longer to swim to a point 1 mile upstream and back or to a point 1 mile across the stream and back? An experienced swimmer would say that the upstream and downstream trip takes longer. If we assume that the swimmer has a speed of 5 miles per hour in still water and that the current is 3 miles per hour, we find that it takes five-eighths of an hour to complete the upstream and downstream journey, while it only takes half an hour for the trip across the stream and back. The ratio of the times needed for the two trips is five-fourths, and if this is presented as
it illustrates how the outcome is influenced by the square of the ratio of the swimmer's speed to the current.
Now the earth is moving in its orbit about the sun with a velocity of 18 miles per second. If the earth moves through the ether and a light-beam passes from one mirror to another and back again, the time taken for this journey ought to be longer when the light-path is in the direction of the earth’s motion than when it is at right angles to this direction. For we can consider the light as a swimmer having a speed of 186,000 miles per second and travelling in a stream whose current is 18 miles per second.
Now the Earth is orbiting the sun at a speed of 18 miles per second. If the Earth moves through the ether and a beam of light travels from one mirror to another and back, the time it takes for this journey should be longer when the light path is aligned with the Earth's motion compared to when it's perpendicular to that direction. We can think of light as a swimmer moving at 186,000 miles per second, swimming in a current that flows at 18 miles per second.
When Michelson and Morley tried the experiment they could not observe any difference in the velocity of light in the two directions. The experiment has since been repeated under various conditions, but always with negative results.
When Michelson and Morley conducted the experiment, they couldn't detect any difference in the speed of light in the two directions. The experiment has been repeated under various conditions since then, but it has consistently yielded negative results.
Einstein’s contribution to science consists in interpreting this result as being in accord with Newton’s ideas on mechanical relativity in that it demonstrates the impossibility of measuring absolute motion, not only by mechanical, but also by optical [233]or electrical experiments. Consequently the velocity of light must be regarded as constant and independent of the motion of either source or observer.
Einstein’s contribution to science lies in interpreting this result as aligning with Newton’s concepts of mechanical relativity, as it shows that measuring absolute motion is impossible, not just through mechanical methods, but also through optical [__A_TAG_PLACEHOLDER_0__] or electrical experiments. Therefore, the speed of light must be viewed as constant and unrelated to the movement of either the source or the observer.
The Relativity of Steady Motion
Let us consider some of the consequences which follow from this principle. An observer travelling with say one-half the velocity of light in the same direction as a ray of light would find that the latter has the usual velocity of 186,000 miles per second. Similarly an observer travelling in the opposite direction to that of the light-ray, with one-half the velocity of light, would obtain the same result.
Let’s look at some of the consequences that come from this principle. An observer moving at about half the speed of light in the same direction as a beam of light would see that beam moving at the typical speed of 186,000 miles per second. Likewise, an observer moving in the opposite direction of the light beam at half the speed of light would get the same result.
Einstein has shown that these conclusions can be valid only if the units of time and space used by the two observers depend upon their relative motions. A careful calculation shows that the unit of length used by either observer appears to the other observer contracted when placed in the direction of their relative motion (but not, when placed at right angles to this direction), and the unit of time used by either observer appears to the other too great. Moreover, the ratio of the units of length or of time varies with the square of the relative speed of the two observers, according to a relation which is similar to that mentioned above for the swimmer in the current. This relation shows that as the relative speed approaches that of light the discrepancy between the units increases.
Einstein demonstrated that these conclusions can only be valid if the units of time and space used by the two observers depend on their relative motions. A careful calculation reveals that the length unit used by either observer seems contracted to the other when it’s aligned with their relative motion (but not when it's perpendicular to that direction), and the time unit used by either observer seems too long to the other. Furthermore, the ratio of the units of length or time changes with the square of the relative speed of the two observers, according to a relationship similar to the one mentioned earlier for the swimmer in the current. This relationship shows that as the relative speed approaches that of light, the discrepancy between the units increases.
Thus, for an observer moving past our earth with a velocity which is nine-tenths that of light, a meter stick on the earth would be 44 centimeters as measured [234]by him, while a second on our clocks would be about two and a half seconds as marked by his clock. Similarly, what he calls a meter length would, for us, be only 44 centimeters and he would appear to us to be living about two and a half times slower than we are. Each observer is perfectly consistent in his measurements of time and space as long as he confines his observations to his own system, but when he tries to make observations on another system moving past his, he finds that the results which he obtains do not agree with those obtained by the other observer.
Thus, for someone observing our Earth while traveling at a speed that's 90% the speed of light, a meter stick on Earth would measure 44 centimeters according to them, while one second on our clocks would register as about two and a half seconds on their clock. Likewise, what they consider a meter would only be 44 centimeters for us, and they would seem to be living about two and a half times slower than we are. Each observer's measurements of time and space are completely consistent as long as they stick to their own system, but when they try to observe another system moving past them, they find that their results don't match those of the other observer.
It is not surprising that in accordance with this conclusion it also follows that the mass of a body must increase with its velocity. For low velocities the increase is so small that we cannot ever hope to measure it, but as the velocity of light is approached the difference becomes more and more appreciable and a body having the velocity of light would possess infinite mass, which simply means that such a velocity cannot be attained by any material object. This conclusion has been experimentally confirmed by observations on the mass of the extremely small negatively charged particles which are emitted by radioactive elements. Some of these particles are ejected with velocities which are over nine-tenths that of light, and measurements show that the increase in mass is in accord with this theory.
It’s not surprising that, based on this conclusion, a body’s mass has to increase with its speed. At low speeds, the increase is so slight that we can’t hope to measure it, but as we get closer to the speed of light, the difference becomes much more noticeable. A body traveling at the speed of light would have infinite mass, which simply means that no material object can reach that speed. This conclusion has been backed up by experiments observing the mass of tiny negatively charged particles emitted by radioactive elements. Some of these particles are released at speeds over ninety percent of the speed of light, and measurements show that the increase in mass aligns with this theory.
The relativity theory also throws new light on the nature of mass itself. According to this view, mass and energy are equivalent. The absolute destruction of 1 gram of any substance, if possible, would yield an amount of energy which is one [235]hundred million times as much as that obtained by burning the same mass of coal. Conversely, energy changes are accompanied by changes in mass. The latter are ordinarily so inappreciably small as to escape our most refined methods of measurements, but in the case of the radioactive elements we actually observe this phenomenon. From this standpoint, also, the laws of conservation of energy and of mass are shown to be intimately related.
The theory of relativity also sheds new light on the nature of mass. According to this perspective, mass and energy are equivalent. If it were possible to completely destroy 1 gram of any substance, it would produce an amount of energy that is one [__A_TAG_PLACEHOLDER_0__]hundred million times greater than what we get from burning the same mass of coal. On the other hand, changes in energy are accompanied by changes in mass. These changes are typically so tiny that they go unnoticed by even our most precise measurement techniques, but in the case of radioactive elements, we can actually observe this phenomenon. This perspective also shows that the laws of conservation of energy and mass are closely related.
Universal Relativity
So far we have dealt with what has been designated as the special theory of relativity. This, as we have seen, applies to uniform motion only. In extending the theory to include non-uniform or accelerated motion, Einstein has at the same time deduced a law of gravitation which is much more general than that of Newton.
So far, we've discussed what's known as the special theory of relativity. As we've seen, this only applies to constant motion. In expanding the theory to include non-constant or accelerated motion, Einstein has also derived a law of gravitation that is much broader than Newton's.
A body falling towards the earth increases in velocity as it falls. The motion is said to be accelerated. We ascribe this increase in velocity to a gravitational force exerted by the earth on all objects. As shown by Newton, this force acts between all particles of matter in the universe, and varies inversely as the square of the distance, and directly as the product of the masses.
A body falling toward the earth speeds up as it drops. This movement is called accelerated motion. We attribute this increase in speed to a gravitational force that the earth exerts on all objects. As Newton demonstrated, this force acts between all particles of matter in the universe and changes inversely with the square of the distance while being directly proportional to the product of the masses.
Of course, we have had a number of theories of gravitation, and none of them have proven successful. Einstein, however, was the first one to suggest a conception of gravitation which has proven extremely significant. He points out that a gravitational force is non-existent for a person falling [236]freely with the acceleration due to gravity. For this person there is no sensation of weight, and if he were in a closed box which is also falling with the same acceleration, he would be unable to decide as to whether his system were falling or situated in interplanetary space where there is no gravitational field. Furthermore, if he were to carry out any optical or electrical experiments in this box he would observe the same results as an experimenter on the earth. A ray of light would travel in a straight line so far as this observer can perceive, while an external observer would, of course, judge differently.
Sure, we’ve had several theories about gravity, and none of them have really worked out. However, Einstein was the first to offer a concept of gravity that turned out to be very important. He notes that there’s no gravitational force for someone falling [__A_TAG_PLACEHOLDER_0__] freely under the influence of gravity. For this person, there’s no feeling of weight, and if they were in a closed box that’s also falling at the same rate, they wouldn’t be able to tell if they were falling or if they were floating in interplanetary space where there’s no gravitational field. Additionally, if they conducted any optical or electrical experiments in that box, they would see the same results as someone doing the same experiments on Earth. A beam of light would appear to travel in a straight line as far as this observer can see, while an outside observer would obviously see it differently.
Einstein shows that this is equally true for all kinds of acceleration including that due to rotation. In the case of a rotating body there exists a centrifugal force which tends to make objects on the surface fly outwards, but for an external observer this force does not exist any more than gravity exists for the observer falling freely.
Einstein demonstrates that this is true for all types of acceleration, including that caused by rotation. When a body is rotating, there is a centrifugal force that tries to push objects on its surface outward, but for an outside observer, this force doesn’t exist any more than gravity exists for someone who is in free fall.
Thus we can draw the general conclusion that a gravitational field or any other field of force may be eliminated by choosing an observer moving with the proper acceleration. For this observer, however, the laws of optics and electricity must be just as valid as for an observer on the earth.
Thus we can draw the general conclusion that a gravitational field or any other type of force field can be eliminated by selecting an observer who is moving with the appropriate acceleration. For this observer, though, the laws of optics and electricity must be just as valid as they are for an observer on Earth.
In postulating this equivalence hypothesis Einstein merely makes use of the very familiar observation that, independently of the nature of the material, all bodies possess the same acceleration in a given field of force.
In proposing this equivalence hypothesis, Einstein is simply using the well-known observation that, regardless of the type of material, all objects experience the same acceleration in a specific force field.
The problem which Einstein now sets out to solve is that of determining the law which shall describe the motion of any system in a field of force in such [237]a general manner as to leave unaltered the fundamental relations of electricity and optics.
The problem that Einstein is now trying to solve is figuring out the law that will describe the motion of any system in a field of force in such [__A_TAG_PLACEHOLDER_0__]a general way that it doesn’t change the basic relationships of electricity and optics.
In connection with the solution of this problem he finds it necessary to discard the limitations placed on us by ordinary or Euclidean geometry. In this manner geometrical concepts as well as those of force are completely robbed of all notions of absoluteness, and the goal of a general theory of relativity is attained.
In solving this problem, he finds it necessary to set aside the restrictions imposed by traditional or Euclidean geometry. In doing so, both geometric concepts and those of force are stripped of any ideas of absoluteness, achieving the goal of a general theory of relativity.
The Geometry of Gravity
Let us consider a circular disc rotating with a uniform peripheral speed. According to the deductions from the “special theory” of relativity, an observer situated near the edge of this disc, but not rotating with it, will observe that units of length measured along the circumference of the disc are contracted. On the other hand, measurements along the diameter, which is at right angles to the direction of motion of the circumference, will show no contraction whatever, and, consequently the observer will find that the ratio of circumference to diameter has not the well known value 3.14159 … but exceeds this value, the difference being greater and greater as the peripheral speed approaches that of light. That is, the laws of ordinary geometry no longer hold true.
Let’s think about a circular disc that’s spinning at a steady speed around its edge. According to the findings from the “special theory” of relativity, an observer at the edge of this disc, who isn't spinning with it, will notice that lengths measured along the circumference of the disc are shorter. However, measurements taken along the diameter, which is perpendicular to the motion of the circumference, won’t show any contraction at all. As a result, the observer will see that the ratio of the circumference to the diameter is not the familiar value of 3.14159… but actually exceeds it, with the difference growing larger as the edge speed gets closer to the speed of light. This means that the usual rules of geometry no longer apply.
However, we know other cases in which the ordinary or Euclidean geometry is not applicable. Thus suppose that on the surface of a sphere we describe a series of concentric circles. Since the surface is curved, we are not surprised at finding [238]that the circumference of any one of these circles is less than 3.14159 … times the distance across the circle as measured on the surface of the sphere. What this means, therefore, is that we cannot use Euclidean geometry to describe measurements on the surface of a sphere, and every schoolboy knows this from comparing Mercator’s projection of the earth’s surface with the actual representation on a globe.
However, we know of other situations where regular or Euclidean geometry doesn't apply. For example, imagine that on the surface of a sphere, we draw a series of concentric circles. Because the surface is curved, it's not surprising to find [__A_TAG_PLACEHOLDER_0__] that the circumference of any of these circles is less than 3.14159 … times the distance across the circle as measured on the surface of the sphere. This means that we can't use Euclidean geometry to measure things on the surface of a sphere, and every school kid understands this by comparing Mercator’s projection of the earth’s surface with how it actually looks on a globe.
When we come to think of it, the reason we realize all this is because our sense of three dimensions enables us to differentiate flat surfaces from those that are curved. Let us, however, imagine a two-dimensional being living on the surface of a large sphere. So long as his measurements are confined to relatively small areas he will find it possible to describe all his measurements in terms of Euclidean geometry. As, however, his area of operation increases he will begin to observe greater and greater discrepancies. Being unfamiliar with the existence of such a three-dimensional object as a sphere, and therefore not realizing that he is on the surface of one, our intelligent two-dimensional being will conclude that the disturbance in his geometry is due to the action of a force, and by means of plausible assumptions on the “law” of this force he will reconcile his observations with the laws of plane geometry.
When we think about it, we understand all of this because our ability to perceive three dimensions allows us to tell flat surfaces apart from curved ones. Now, let’s imagine a two-dimensional being living on the surface of a large sphere. As long as its measurements are limited to relatively small areas, it can describe everything using Euclidean geometry. However, as its area of operation grows, it will start to notice more and more inconsistencies. Unfamiliar with the idea of a three-dimensional object like a sphere and not realizing it is on such a surface, our intelligent two-dimensional being will assume that the irregularities in its geometry are caused by some force. It will then come up with reasonable assumptions about the “law” of this force to align its observations with the rules of plane geometry.
Now since an acceleration in a gravitational field is identical with that due to centrifugal force produced by rotation, we concluded that the geometry in a gravitational field must also be non-Euclidean. That is, space in the neighborhood of matter is [239]distorted or curved. The curvature of space bears the same relation to three dimensions that the curvature of a spherical surface bears to two dimensions, and that is why we do not perceive it, any more than the intelligent two-dimensional being would be aware of the distortion of his space (or surface). Furthermore, like this being, we have assumed the existence of a gravitational force to account for discrepancies in our geometrical measurements.
Since acceleration in a gravitational field is the same as the centrifugal force from rotation, we concluded that the geometry in a gravitational field must also be non-Euclidean. This means that space around matter is [__A_TAG_PLACEHOLDER_0__]distorted or curved. The curvature of space relates to three dimensions the way the curvature of a sphere relates to two dimensions, which is why we don’t notice it, just like an intelligent two-dimensional being wouldn't be aware of the distortion in its space (or surface). Additionally, like this being, we have assumed that a gravitational force exists to explain the discrepancies in our geometric measurements.
The identification in this manner of gravitational effects with geometrical curvature of space enables Einstein to derive a general law for the path of any particle in a gravitational field, with respect both to space and to time. Furthermore, the law expresses this motion in terms which are independent of the relative motion and position of the observer, and satisfies the condition that the fundamental laws of physics be equally valid for all observers. The solution of the problem involved the use of a new kind of higher calculus, elaborated by two Italian mathematicians, Ricci and Levi-Civita. The result is a law of motion which is extremely general in its validity.
The identification of gravitational effects with the curvature of space allows Einstein to establish a general law for the trajectory of any particle in a gravitational field, considering both space and time. Additionally, this law describes motion in a way that is not affected by the observer's relative motion or position, ensuring that the fundamental laws of physics apply equally to all observers. Solving this problem required a new type of advanced calculus developed by two Italian mathematicians, Ricci and Levi-Civita. The outcome is a law of motion that is very broad in its applicability.
For low velocities it approximates to Newton’s solution, and in the absence of a gravitational field it leads to the same conclusions as the special theory of relativity. There are three deductions from this law which have aroused a great deal of interest, and the confirmation of two of these by actual observation must be regarded as striking proof of Einstein’s theory. [240]
At low speeds, it aligns with Newton's solution, and without a gravitational field, it comes to the same conclusions as the special theory of relativity. There are three deductions from this law that have sparked a lot of interest, and the confirmation of two of these through actual observation should be seen as strong evidence for Einstein's theory. [__A_TAG_PLACEHOLDER_0__]
XIII
AN INTRODUCTION TO RELATIVITY
A Treatment in Which the Mathematical Connections of Einstein’s Work are Brought Out More Strongly and More Successfully Than Usual in a Popular Explanation
A treatment that effectively and clearly showcases the mathematical connections in Einstein's work better than standard popular explanations.
One of the first questions which appears in philosophy is this: What is the great reality that underlies space and time and the phenomena of the physical universe? Kant, the philosopher, dismissed it as a subjective problem, affirming that space and time are “a priori” concepts beyond which we can say no more.
One of the first questions that comes up in philosophy is this: What is the fundamental reality that underpins space, time, and the occurrences in the physical universe? Kant, the philosopher, considered it a subjective issue, stating that space and time are “a priori” concepts that we can't go beyond.
Then the world came upon some startling facts. In 1905 a paper appeared by Professor Albert Einstein which asserted that the explanation of certain remarkable discoveries in physics gave us a new conception of this strange four-dimensional manifold in which we live. Thus, the great difference between the space and time of philosophy and the new knowledge is the objective reality of the latter. It rests upon an amazing sequence of physical facts, and the generalized theory, which appeared several years later, founded as it is upon the abstruse differential calculus of Riemann, Christoffel, Ricci and Levi-Civita, emerges from its maze of formulas [241]with the prediction of real phenomena to be sought for the in the world of facts.
Then the world discovered some surprising facts. In 1905, a paper was published by Professor Albert Einstein that claimed the explanation of certain extraordinary discoveries in physics provided us with a new understanding of the strange four-dimensional space we inhabit. Therefore, the main difference between the philosophical views of space and time and this new knowledge is the objective reality of the latter. It is based on an awesome series of physical facts, and the generalized theory, which was introduced a few years later, based as it is on the complex differential calculus of Riemann, Christoffel, Ricci, and Levi-Civita, emerges from its intricate formulas [__A_TAG_PLACEHOLDER_0__]with predictions of real phenomena to be observed in the world of facts.
We shall, therefore, approach the subject from this objective point of view. Let us go to the realm of actual physical events and see how the ideas of relativity gradually unfolded themselves from the first crude wonderings of science to the stately researches that first discovered the great ocean of ether and then penetrated in such a marvelous manner into some of its most mysterious properties.
We will, therefore, tackle the subject from this objective perspective. Let’s enter the world of actual physical events and observe how the concepts of relativity gradually emerged from the initial rough speculations of science to the sophisticated investigations that first uncovered the vast ocean of ether and then explored some of its most mysterious properties in an incredible way.
The Electromagnetic Theory of Light
Suppose that we go out on a summer night and look into the dark depths of the sky. A thousand bright specks are flashing there, blue, red, yellow against the dark velvet of space. And as we look we must all be impressed by the fact that such remote objects as the stars can be known to us at all. How is it that light, that curious thing which falls upon the optic nerve and transmits its pictures to the brain, can ever reach us through the black regions of interstellar space? That is the question which has for its answer the electromagnetic theory of light.
Suppose we go out on a summer night and gaze into the dark depths of the sky. A thousand bright dots are shining there, blue, red, yellow against the dark velvet of space. And as we look, we can’t help but be impressed by the fact that such distant objects as the stars can be known to us at all. How is it that light, that fascinating phenomenon that hits the optic nerve and sends images to the brain, can reach us through the vast blackness of interstellar space? That is the question answered by the electromagnetic theory of light.
The first theory to be advanced was Newton’s “corpuscular” theory which supposed that the stars are sending off into space little pellets of matter so infinitesimally small that they can move at the rate of 186,000 miles a second without injuring even so delicate a thing as the eye when they strike against it.
The first theory proposed was Newton’s “corpuscular” theory, which suggested that stars are emitting tiny particles of matter that are so minuscule that they can travel at 186,000 miles per second without harming even something as delicate as the eye when they collide with it.
But in 1801, when Thomas Young made the very important discovery of interference, this had to give way to the wave theory, first proposed by Huyghens [242]in the 17th century. The first great deduction from this, of course, was the “luminiferous ether,” because a wave without some medium for its propagation was quite unthinkable. Certain peculiar properties of the ether were at once evident, since we deduce that it must fill all space and at the same time be so extremely tenuous that it will not retard to any noticeable degree the motion through it of material bodies like the planets.
But in 1801, when Thomas Young made the crucial discovery of interference, this had to give way to the wave theory, which had been first proposed by Huyghens [__A_TAG_PLACEHOLDER_0__] in the 17th century. The first major conclusion from this was the concept of “luminiferous ether,” because the idea of a wave traveling without some medium was completely inconceivable. Certain unique properties of the ether became immediately apparent, since we concluded that it must fill all of space while also being extremely thin, so it doesn't significantly slow down the movement of solid objects like planets as they move through it.
But how light was propagated through the ether still remained a perplexing problem and various theories were proposed, most prominent among them being the “elastic solid” theory which tried to ascribe to ether the properties of an elastic body. This theory, however, laid itself open to serious objection on the ground that no longitudinal waves had been detected in the ether, so that it began to appear that further insight into the nature of light had to be sought for in another direction.
But how light traveled through the ether was still a puzzling issue, and various theories were suggested, with the most notable being the “elastic solid” theory, which attempted to give ether the characteristics of an elastic material. However, this theory faced significant criticism because no longitudinal waves had been found in the ether, leading to the conclusion that a deeper understanding of light's nature needed to be explored from a different angle.
This was soon forthcoming for in 1864 a new theory was proposed by James Clerk Maxwell which seemed to solve all of the difficulties. Maxwell had been working with the facts derived from a study of electrical and magnetic phenomena and had shown that electromagnetic disturbances were propagated through the ether at a velocity identical with that of light. This, of course, might have been merely a strange coincidence, but Maxwell went further and demonstrated the interesting fact that an oscillating electric charge should give rise to a wave that would behave in a manner identical with all of the known properties of a light wave. One particularly impressive assertion was that these waves, consisting [243]of an alternating electric field accompanied by an alternating magnetic field at right angles to it, and hence called electromagnetic waves, would advance in a direction perpendicular to the alternating fields. This satisfied the first essential property of light rays, i.e., that they must be transverse waves, and the ease with which it explained all of the fundamental phenomena of optics and predicted a most striking interrelation between the electrical and optical properties of material bodies, gave it at once a prominent place among the various theories.
This was soon to happen because in 1864, James Clerk Maxwell proposed a new theory that seemed to solve all the problems. Maxwell had been studying electrical and magnetic phenomena and showed that electromagnetic disturbances traveled through the ether at the same speed as light. While this could have just been a strange coincidence, Maxwell went further and demonstrated that an oscillating electric charge produces a wave that behaves just like a light wave. One particularly impressive claim was that these waves, consisting of an alternating electric field paired with an alternating magnetic field at right angles to it—hence called electromagnetic waves—would move in a direction perpendicular to the alternating fields. This fulfilled the essential property of light rays, i.e., that they must be transverse waves. The theory easily explained all fundamental optical phenomena and predicted a striking relationship between the electrical and optical properties of materials, giving it an important place among the various theories.
The electromagnetic theory, however, had to wait until 1888 for verification when Heinrich Hertz, in a series of brilliant experiments, succeeded in producing electromagnetic waves in the laboratory and in showing that they possessed all of the properties predicted by Maxwell. These waves moved with the velocity of light: they could be reflected, refracted, and polarized: they exhibited the phenomenon of interference and, in short, could not be distinguished from light waves except for their difference in wave length.
The electromagnetic theory had to wait until 1888 for confirmation when Heinrich Hertz, through a series of impressive experiments, managed to create electromagnetic waves in the lab and demonstrate that they had all the properties predicted by Maxwell. These waves traveled at the speed of light: they could be reflected, refracted, and polarized; they showed the phenomenon of interference and, in summary, could not be distinguished from light waves except for their difference in wavelength.
The Michelson-Morley Experiment
With the final establishment of the electromagnetic theory of light as a fact of physics, we have at last endowed the ether with an actual substantiality. The “empty void” is no longer empty, but a great ocean of ether through which the planets and the suns turn without ever being aware that it is there.
With the final establishment of the electromagnetic theory of light as a fact of physics, we have finally given the ether an actual substance. The “empty void” is no longer empty, but a vast ocean of ether through which the planets and suns move without ever realizing it’s there.
In 1881 A. A. Michelson undertook an experiment, [244]originally suggested by Maxwell, to determine the relative motion of our earth to the ether ocean and six years later he repeated it with the assistance of E. W. Morley. The experiment is now known as the Michelson-Morley experiment and since it is the great physical fact upon which the theory of relativity rests, it will be well for us to examine it in detail.
In 1881, A. A. Michelson conducted an experiment, [__A_TAG_PLACEHOLDER_0__] originally proposed by Maxwell, to find out the earth's motion relative to the ether ocean. Six years later, he repeated the experiment with E. W. Morley's help. This experiment is now known as the Michelson-Morley experiment, and since it is the key physical evidence supporting the theory of relativity, it’s important for us to look at it closely.
Since we can scarcely think that our earth is privileged in the universe and that it is at rest with respect to this great ether ocean that fills space, we propose to discover how fast we are actually moving. But the startling fact is that the experiment devised for this purpose failed to detect any motion whatever of the earth relative to the ether.1
Since it’s hard to believe that our planet is special in the universe and that it’s completely still in relation to the vast ether that fills space, we aim to find out how fast we are really moving. But the surprising thing is that the experiment designed for this purpose did not find any movement of the earth in relation to the ether.1
The explanation of this very curious fact was given by both H. A. Lorentz and G. F. Fitzgerald in what is now widely known under the name of the “contraction hypothesis.” It is nothing more nor less than this:
The explanation for this very interesting fact was provided by both H. A. Lorentz and G. F. Fitzgerald in what is now commonly referred to as the “contraction hypothesis.” It is simply this:
Every solid body undergoes a slight change in dimensions, of the order of (), when it moves with a velocity v through the ether.
Every solid object experiences a slight change in dimensions, on the order of (), when it moves at a speed v through the ether.
The reason why the experiment failed, then, was not because the earth was not moving through the ether, but because the instruments with which the experiment was being conducted had shrunk just enough to negative the effect that was being looked for.2 [245]
The reason the experiment failed wasn’t that the earth wasn’t moving through the ether, but rather that the instruments used for the experiment had shrunk just enough to cancel out the effect that was being sought. 2 [__A_TAG_PLACEHOLDER_0__]
The Lorentz Transformation
We can not at this point forebear introducing a little mathematics to further emphasize the theory and the very logical nature of this contraction hypothesis.
We can't hold back from introducing a bit of math right now to reinforce the theory and the logical nature of this contraction hypothesis.
Let us suppose that we were on a world that was absolutely motionless with respect to the ether and were looking at a ray of light. The magnetic and electric fields which form the ray can be described by means of four mathematical expressions which have come to bear the name of “Maxwell’s field equations.” Now suppose that we ask ourselves the question: How must these equations be changed so that they will apply to a ray of light which is being observed by people on a world that is moving with a velocity v through the ether?
Let’s assume we’re on a completely still world in relation to the ether and we observe a ray of light. The magnetic and electric fields that make up the ray can be described using four mathematical equations known as “Maxwell’s field equations.” Now, let’s ask ourselves: How do we need to adjust these equations so they apply to a ray of light being seen by people on a world moving at a velocity v through the ether?
The answer is immediate. From the Michelson-Morley experiment we know that we can not tell how fast or how slowly we are moving with respect to the ether. This means that no matter what world we may be upon, the form of the Maxwell field equations will always be the same, even though the second set of axes (or frame of reference) may be moving with high velocity with respect to the first.
The answer is straightforward. From the Michelson-Morley experiment, we know that we can't determine how fast or slow we're moving relative to the ether. This means that regardless of the world we find ourselves on, the structure of the Maxwell field equations will always remain the same, even if the second set of axes (or frame of reference) is moving at a high speed compared to the first.
Starting from this hypothesis (called in technical language the covariance of the
equations with respect to a transformation of coordinates), Lorentz found that the
transformation which leaves the field equations unchanged in form was the following:
Starting from this idea (known in technical terms as the covariance of the equations with respect to a transformation of coordinates), Lorentz discovered that the transformation that keeps the field equations unchanged in form was as follows:
where k is as on page 92. [246]
where k is as on page 92. [__A_TAG_PLACEHOLDER_0__]
And what, now, can be deduced from these very simple looking equations? In the first place we see that the space of x′, y′, z′, t′ is not our ordinary concept of space at all, but a space in which time is all tangled up with length. To put it more concretely, we may deduce from them the interesting fact that whenever an aviator moves with respect to our earth, his shape changes, and if he were to compare his watch with one on the earth, he would find that his time had changed also. A sphere would flatten into an ellipse, a meter stick would shorten up, a watch would slow down and all because, as H. Minkowski has shown us from these very equations, we are really living in a physical world quite different from the world of Euclid’s geometry in which we are accustomed to think we live.
And so, what can we figure out from these seemingly simple equations? First of all, we see that the space of x′, y′, z′, t′ isn't our usual idea of space at all, but a space where time is all mixed up with length. To be more specific, we can deduce from them the fascinating fact that whenever a pilot moves relative to our planet, his shape changes, and if he compares his watch with one on Earth, he would discover that his time has also changed. A sphere would become an ellipse, a meter stick would shrink, a watch would run slower, and all of this is because, as H. Minkowski has shown us from these very equations, we are actually living in a physical world that is quite different from the Euclidean geometry we think we're accustomed to.
A variety of objections has very naturally been made to this rather radical hypothesis in an attempt to discredit the entire theory, but it is easily seen that any result obtained through the field equations must necessarily be in conformity with the theory of contraction, since this theory is only the physical interpretation of that transformation which leaves the field equations unaltered. Indeed, it is even possible to postulate the Lorentz transformation together with the assumption that each element of charge is a center of uniformly diverging tubes of strain and derive the Maxwell field equations from this, which shows from another point of view the truly fundamental nature of the transformation.
A range of objections has understandably been raised against this quite radical hypothesis in an effort to undermine the entire theory, but it’s clear that any outcomes produced through the field equations must align with the theory of contraction, since this theory is merely the physical interpretation of the transformation that keeps the field equations unchanged. In fact, it’s even possible to suggest the Lorentz transformation along with the idea that each charge element acts as a center for evenly spreading tubes of strain, and derive the Maxwell field equations from this, demonstrating from an alternative perspective the truly fundamental nature of the transformation.
The First Theory of Relativity
The whole question of the ether had arrived at [247]this very interesting point when Professor Einstein in 1905 stated the theory of relativity.
He had noticed that the equations of dynamics as formulated by Newton did not admit
the Lorentz transformation, but only the simple Galilean transformation:
The whole issue of the ether reached a very interesting point when Professor Einstein introduced the theory of relativity in 1905. He observed that the equations of motion as defined by Newton did not allow for the Lorentz transformation, only the basic Galilean transformation:
Here, indeed, was a curious situation. Two physical principles, that of dynamics and that of electromagnetism, were coexistent and yet each one admitted a different transformation when the system of reference was transferred to axes moving with constant velocity with respect to the ether.
Here, indeed, was a curious situation. Two physical principles, that of dynamics and that of electromagnetism, existed together and yet each allowed a different transformation when the reference system was moved to axes moving at a constant speed relative to the ether.
Now the electromagnetic equations and their transformation had been shown to be in accord with experimental fact, whereas it had long been felt that Newton’s equations were only a first approximation to the truth. For example, the elliptic orbit of a planet had been observed by Leverrier to exhibit a disquieting tendency to rotate in the direction of motion. This precession, which in the case of Mercury was as large as 43″ per century, could not be accounted for in any way by the ordinary Newtonian laws and was, consequently, a very celebrated case of discordance in gravitational astronomy.
Now the electromagnetic equations and their transformation have been shown to match experimental results, while Newton’s equations have long been seen as just a rough approximation of the truth. For instance, Leverrier observed that the elliptical orbit of a planet had a troubling tendency to rotate in the direction of its motion. This precession, which for Mercury was as significant as 43″ per century, couldn't be explained by the usual Newtonian laws and was, therefore, a famous example of inconsistency in gravitational astronomy.
With this example clearly before him, Einstein took the great step and said that the laws of dynamics and all other physical laws had to be remade so that they, also, admit the Lorentz transformation. That is to say,
With this example in mind, Einstein made a significant leap and stated that the laws of dynamics and all other physical laws needed to be revised so that they would also include the Lorentz transformation. In other words,
The laws of physical phenomena, or rather the mathematical expressions for these laws, are covariant [248](unchanged in form) when we apply the Lorentz transformation to them.
The laws of physical phenomena, or the mathematical formulas that describe them, remain unchanged in form when we apply the Lorentz transformation.
The deductions from the Michelson-Morley experiment now seem to have reached their ultimate conclusion.
The conclusions drawn from the Michelson-Morley experiment now appear to have reached their final point.
One discordant fact in this new theory remained, however. That same precession of the perihelion of Mercury which had first lead Einstein to his theory remained unsettled. When the new approximations were applied to the formula of orbital motion, a precession was, indeed, obtained, but the computed value fell considerably below that of the observed 43″ per century.
One conflicting point in this new theory still stood out, though. The same precession of Mercury's perihelion that initially led Einstein to his theory was still unresolved. When the new approximations were applied to the formula for orbital motion, a precession was indeed achieved, but the calculated value fell significantly short of the observed 43″ per century.
The Inclusion of Gravity
With the idea of investigating the problem from the very bottom, Einstein now undertook a broader and more daring point of view. In the first place he said that there is no apparent reason in the great scheme of world events why any one special system of coordinates should be fundamental to the description of phenomena, just as in the special theory a ray of light would appear the same whether viewed from a fixed system or a system moving with constant velocity with respect to the ether. This makes the very broad assumption that no matter what system of coordinates we may use, the mathematical expressions for the laws of nature must be the same. In Einstein’s own words, then, the first principle of this more general theory of relativity must be the following:
With the goal of thoroughly exploring the problem, Einstein took a broader and bolder perspective. He first stated that there's no clear reason in the grand scheme of events why any specific coordinate system should be fundamental for describing phenomena. Just like in the special theory, a ray of light appears the same whether seen from a stationary system or one moving at a constant speed relative to the ether. This leads to the significant assumption that regardless of the coordinate system we use, the mathematical expressions for the laws of nature must remain the same. In Einstein’s own words, the first principle of this more general theory of relativity must be the following:
“The general laws of nature are expressed [249]through equations which hold for all systems of coordinates, that is, they are covariant with respect to arbitrary substitutions.”3
The basic laws of nature are shown [__A_TAG_PLACEHOLDER_0__]through equations that apply to all coordinate systems, meaning they remain the same under any changes.3
But this was not enough to include gravitation so Einstein next formulated what he was pleased to call his “equivalence hypothesis.” This is best illustrated by an example. Suppose that we are mounting in an elevator and wish to investigate the world of events from our moving platform. We mount more and more rapidly, that is with constant acceleration, and we appear to be in a strong gravitational field due to our own inertia. Suppose, on the other hand, that the elevator descends with an acceleration equal to that of gravity. We would now feel certain that we were in empty space because our own relative acceleration has entirely destroyed that of the earth’s gravitational field and all objects placed upon scales in an elevator would apparently be without weight.
But this wasn't enough to include gravity, so Einstein went on to develop what he called his “equivalence hypothesis.” This is best explained with an example. Imagine we're inside an elevator and want to observe the events around us from our moving platform. We ascend more and more quickly, meaning we're accelerating constantly, and it feels like we're in a strong gravitational field because of our own inertia. Now, suppose the elevator is descending at an acceleration equal to that of gravity. In that case, we would definitely feel like we were in empty space because our own acceleration would completely cancel out the earth's gravitational pull, and anything placed on scales in the elevator would seem to have no weight at all.
Applying this idea, then, Einstein decided to do away with gravitation entirely by referring all events in a gravitational field to a new set of axes which should move with constant acceleration with respect to the first. In other words we are going to deal with a system moving with uniform acceleration with respect to the ether, just as we considered a system moving with uniform velocity in the special theory.
Applying this idea, Einstein then decided to eliminate gravity altogether by referencing all events in a gravitational field to a new set of axes that would move with constant acceleration relative to the first. In other words, we will be looking at a system moving with uniform acceleration in relation to the ether, similar to how we examined a system moving with uniform velocity in the special theory.
The next step in the construction of this complicated theory is to reduce these two hypotheses to the language of mathematics and this was accomplished [250]by Einstein with the help of M. Grossmann by means of the theory of tensors.
The next step in building this complex theory is to translate these two hypotheses into mathematical language, and this was achieved [__A_TAG_PLACEHOLDER_0__]by Einstein with the assistance of M. Grossmann through the theory of tensors.
On account of the very great intricacy of the details, we must content ourselves with the mere statement that this really involved the generalization of the famous expressions known as Laplace’s and Poisson’s equations, on the explicit assumption that these two equations would still describe the gravitational field when we are content to use a first approximation to the truth. The set of ten differential equations which Einstein got as a result of his generalization he called his field equations of gravitation.4 [251]
Due to the complexity of the details, we'll just say that this really involved expanding upon the well-known expressions known as Laplace’s and Poisson’s equations, assuming these two equations would still represent the gravitational field when we settle for a basic approximation of the truth. The ten differential equations that Einstein derived from this generalization are what he referred to as his field equations of gravitation.4 [__A_TAG_PLACEHOLDER_0__]
1 Dr. Davis went rather fully into the algebra of the Michelson-Morley experiment. But Dr. Russell has covered the same ground in a form somewhat more advantageous from the typographical viewpoint, and the point is not one which it is profitable to discuss twice; so we eliminate this part of Dr. Davis’ text.—Editor. ↑
XIV
NEW CONCEPTS FOR OLD
What the World Looks Like After Einstein Has Had His Way With It
What the World Looks Like After Einstein's Influence
“The new-created world, which fame in heaven
“The newly created world, which is famous in heaven
Long had foretold, a fabric wonderful,
Long had foretold, a wonderful fabric,
Of absolute perfection.”
Of total perfection.”
Einstein’s Theory of Relativity has led to determining a key law of nature—the law of gravitation—which is also the basic law of mechanics. Thus it embraces a whole realm of physics, and promises, through the researches of Professor Weyl, to embrace another realm—electro-dynamics. Its limitations are not yet reached, for Einstein has already postulated therefrom a theory of a finite, yet unbounded, universe. This essay, however, is mainly concerned with mechanics, and electrical forces are not considered.
Einstein's Theory of Relativity has established a crucial law of nature—the law of gravitation—which is also the fundamental law of mechanics. Therefore, it covers a wide area of physics and, through Professor Weyl's research, aims to include another area—electrodynamics. Its limits have not yet been fully explored, as Einstein has already proposed a theory of a finite, but unbounded, universe. However, this essay primarily focuses on mechanics and does not address electrical forces.
To have synthesised Newton’s two great principles—his law of motion and law of gravitation—interpreting in the process the empirical law of equality of gravitational and inertial mass, is alone an immense achievement; but Einstein’s researches have opened up a new world to the physicist and [252]philosopher which is of greater importance. He has given us a vision of the immaterial world, a geometrical or mathematical vision, which is more satisfying than the “ether” conceptions hitherto presented. The fabric of his vision is not baseless. It is this fabric we shall consider, touching on certain aspects of the Einstein theory in the endeavor to present an image in miniature of his edifice of thought and to show the firmness of its foundations. That they are well and truly laid was demonstrated by the verification, from observations made during the solar eclipse in 1919, of Einstein’s prediction of the displacement of a wave of light in a gravitational field, showing light to have the property of weight.
To synthesize Newton's two major principles—his laws of motion and gravitation—and in doing so, interpret the empirical law of gravitational and inertial mass, is already a significant accomplishment. However, Einstein's research has opened up a new realm for physicists and philosophers that is even more important. He has provided us with a vision of the immaterial world, a geometrical or mathematical perspective, which is more fulfilling than the previous “ether” theories. The foundation of his vision is not unfounded. This foundation is what we will explore, touching on certain aspects of Einstein's theory in an effort to present a small-scale image of his thought structure and demonstrate the stability of its foundations. That these foundations are solidly established was confirmed by observations during the solar eclipse in 1919, which validated Einstein's prediction of the light wave's displacement in a gravitational field, indicating that light has weight.
The physical world is shown by Einstein to be a world of “relations.” Underlying it there is an absolute world of which physical phenomena are the manifestation. “Give me matter and motion,” says Descartes, “and I will construct the world.” “Give me a world in which there are ordered relations,” says the Relativist, “and I will show you the behavior of matter therein” (mechanics). We first view this underlying world as an abstraction, abstracting energy (“bound” as in matter and electrons, “free” as in light), and its attribute force. This abstraction we will call the “World-Frame.” Later, we will study the underlying world in connection with energy, and will call this absolute world the “World-Fabric.” The connection between the geometrical character of the World-Frame and the geometrical characters of the World-Fabric is the key to the law of gravitation. [253]
The physical world, as Einstein shows, is a world of “relations.” Beneath it lies an absolute world that physical phenomena represent. “Give me matter and motion,” Descartes claims, “and I will create the world.” “Give me a world with ordered relations,” says the Relativist, “and I will explain how matter behaves within it” (mechanics). We initially view this underlying world as an abstraction, separating energy (“bound” as in matter and electrons, “free” as in light), and its characteristic force. We will refer to this abstraction as the “World-Frame.” Later, we will examine the underlying world in relation to energy and call this absolute world the “World-Fabric.” The link between the geometrical nature of the World-Frame and the geometrical nature of the World-Fabric is the key to understanding the law of gravitation. [__A_TAG_PLACEHOLDER_0__]
The Global Framework
This is our conception of a world, if such were possible, entirely free from the influence of energy. We may conceive of it as an amorphous immaterial something containing “point-events” (a point-event being an instant of time at a point in space—a conception, not a definition). These point-events have a fourfold order and definite relation in this Frame, i.e. they can be specified by four variables or coordinates in reference to some base called a reference system, with respect to which they are forward or backward, right or left, above or below, sooner or later. This shows the World-Frame to be four-dimensional. Thus an aggregate of point-events (or an “event,” which implies limited extension in space and limited duration in time)1 would have what we familiarly describe as length, breadth, height and time. To express these metrical properties most simply we must choose a four-dimensional reference system having a particular form—rectilinear axes (Cartesian coordinates), and a particular motion—uniform and rectilinear, i.e. unaccelerated, and non-rotating with respect to the path of a light ray. We call this an inertial system [254]because Newton’s Law of Inertia holds for such a system alone. This system indicates how observers partition the World-Frame into space and time. It restricts observers to uniform rectilinear motion, and observations to bodies and light-pulses in such motion. Thus gravitational and other forces are discounted, and we obtain World-Frame conditions notwithstanding the fact that observers are in the presence of energy.
This is our idea of a world, if such a thing were possible, completely free from the influence of energy. We can imagine it as an formless, immaterial something containing "point-events" (a point-event being a moment in time at a specific location in space—a concept, not a definition). These point-events have a fourfold order and a specific relationship in this Frame, i.e. they can be pinpointed by four variables or coordinates relative to a base known as a reference system, concerning which they are ahead or behind, right or left, above or below, sooner or later. This demonstrates that the World-Frame is four-dimensional. Therefore, a collection of point-events (or an “event,” which suggests a limited extent in space and a limited duration in time) 1 would have what we usually describe as length, width, height, and time. To express these measurement properties in the simplest way, we need to choose a four-dimensional reference system with a specific form—straight axes (Cartesian coordinates)—and a particular type of motion—uniform and straight-line, i.e. not accelerating and not rotating in relation to the path of a light beam. We call this an inertial system [__A_TAG_PLACEHOLDER_0__] because Newton’s Law of Inertia applies only to such a system. This system shows how observers divide the World-Frame into space and time. It limits observers to uniform straight-line motion and observations to objects and light pulses in that motion. Thus, gravitational and other forces are ignored, and we determine World-Frame conditions despite the fact that viewers are in the presence of energy.
Now the separation between point-events which have a definite relation to each other must be absolute. The separation between two points in a plane is defined by the unique distance between them (the straight line joining them). Between point-events the analogue of this unique distance, which we call the “separation-interval” (to indicate its time-like and space-like nature), is also unique. Its unique and absolute character give it great importance as thereby it is the same for all observers regardless of their reference system.
Now, the separation between point-events that have a clear relationship with each other must be absolute. The separation between two points on a plane is defined by the unique distance between them (the straight line connecting them). For point-events, the equivalent of this unique distance, which we call the “separation-interval” (to show its time-like and space-like nature), is also unique. Its distinctive and absolute nature makes it very important, as it remains the same for all observers regardless of their reference system.
If, in place of the rather cumbersome expression to indicate the difference between the x-coordinates of two points, we employ the more compact expression dx; if for the benefit of readers who have a little algebra but no analysis we state
explicitly that this expression is a single symbol for a single quantity, and has
nothing to do with any product of two quantities d and x; and if we extend this notation to all our coordinates: then it is clear from previous
essays that the distance S between two points in a plane referred to a rectilinear system OX, OY, is given by the simple equation
. Einstein and Minkowski show [255]that the value for the separation interval
, the analogue of S, referred to an inertial system is given by the equation
If, instead of the somewhat awkward expression to show the difference between the x-coordinates of two points, we use the simpler expression dx; and if we clarify for readers who have a bit of algebra but no analysis that this expression is a single symbol representing a single quantity and isn’t related to any product of two quantities d and x; and if we apply this notation to all our coordinates: then it is clear from previous essays that the distance S between two points in a plane based on a straight coordinate system OX, Oh no, is given by the simple equation
. Einstein and Minkowski show [__A_TAG_PLACEHOLDER_0__] that the value for the separation interval
, which corresponds to S, in relation to an inertial system is given by the equation
which is seen to be a modified extension to four dimensions of the equation for S. We must measure t in the same units as x, y, z. By taking the constant velocity of light (300,000 kilometres per second) as unit velocity, we can measure in length or time indiscriminately.2
which is recognized as a modified extension into four dimensions of the equation for S. We need to measure t using the same units as x, y, z. By taking the constant speed of light (300,000 kilometers per second) as the unit velocity, we can measure either length or time interchangeably.2
We will analyse briefly this equation as it epitomizes the Special Theory of Relativity.
If the World-Frame had been Euclidean the equation would have been
We will briefly analyze this equation as it represents the Special Theory of Relativity. If the World-Frame had been Euclidean, the equation would have been
but this would not satisfy the “transformation equations” which resulted from the Special Theory. These transformation equations arose directly from a reconciliation between two observed facts; (a) the observed agreement of all natural phenomena with the “Restricted Principle of Relativity”—a principle which shows that absolute rectilinear motion cannot be established—(as regards mechanics this was recognized by Newton; the Michelson-Morley and other experiments showed this principle also applied [256]to optical and electro-dynamical phenomena); and (b) the observed disagreement of optical and electro-dynamical phenomena (notably the constancy of light velocity) with the laws of dynamics as given by classical mechanics, e.g., in regard to the compounding of relative velocities. Einstein effected this reconciliation by detecting a flaw in classical mechanics. He showed that by regarding space and time measurements as relative to the observer—not absolute as Newton defined them—there was nothing incompatible between the Principle of Relativity and the laws of dynamics so modified. Newton’s definitions were founded on conception. Einstein’s recognition of the relativity of space and time is based on observation.
but this wouldn't satisfy the “transformation equations” that came from the Special Theory. These equations came from reconciling two observed facts: (a) the observed agreement of all natural phenomena with the “Restricted Principle of Relativity”—which shows that absolute straight-line motion can’t be established (this was acknowledged by Newton; the Michelson-Morley and other experiments also demonstrated that this principle applied [__A_TAG_PLACEHOLDER_0__]to optical and electro-dynamical phenomena); and (b) the observed disagreement of optical and electro-dynamical phenomena (notably the constancy of the speed of light) with the laws of dynamics as defined by classical mechanics, e.g., regarding the combining of relative velocities. Einstein resolved this conflict by identifying a flaw in classical mechanics. He showed that by viewing space and time measurements as relative to the observer—not absolute as Newton defined them—there was nothing incompatible between the Principle of Relativity and the modified laws of dynamics. Newton’s definitions were based on conception. Einstein’s recognition of the relativity of space and time is rooted in observation.
Equation (1) shows that the geometry of the World-Frame referred to an inertial system is semi-Euclidean (hyperbolic), and that space and time measurements are relative to the observer’s inertial reference system. The equation shows that the World-Frame has a certain geometrical character which we distinguish as four-dimensional “flatness.” It is everywhere alike (homaloidal). Its flat character is shown by the straight line nature of the separation-interval and of the system to which it is most simply referred.
Equation (1) shows that the geometry of the World-Frame, when referenced to an inertial system, is semi-Euclidean (hyperbolic), and that measurements of space and time are relative to the observer’s inertial reference system. The equation indicates that the World-Frame has a specific geometric quality that we identify as four-dimensional “flatness.” It is uniform everywhere (homaloidal). Its flatness is evident in the straight-line nature of the separation-interval and the system to which it is most simply related.
Thus we have found two absolute features in the World-Frame—(1) Its geometrical character—”flatness”; (2) The separation-interval—which can be expressed in terms of measurable variables called space and time partitions, this partitioning being dependent on the observer’s motion.
Thus we have found two absolute features in the World-Frame—(1) Its geometrical character—"flatness"; (2) The separation-interval—which can be expressed in terms of measurable variables called space and time partitions, this partitioning being dependent on the observer’s motion.
We are now in a position to explore the World-Fabric. [257]Already we see that, studied under inertial conditions (free of force), it agrees with the World-Frame.
We are now ready to explore the World-Fabric. [__A_TAG_PLACEHOLDER_0__] Already, we notice that when studied under inertial conditions (free of force), it aligns with the World-Frame.
The Fabric of the World
The General Theory of relativity is largely concerned with the investigation of the World-Fabric. Consider the World-Frame to be disturbed. We may regard this disturbance, which manifests itself in physical phenomena, as energy, or more correctly “action.”
The General Theory of Relativity mainly focuses on studying the World-Fabric. Imagine the World-Frame being disturbed. We can see this disturbance, which shows up in physical events, as energy, or more accurately, “action.”
When energy is thwarted in its natural flow, force is manifested, with which are associated non-uniform motions such as accelerations and rotations. This disturbed World-Frame we distinguish as the World-Fabric. It is found to have various non-Euclidean characters differing from the simple “flat” character of the World-Frame according to the degree of disturbance (action) in the region. Disturbance gives the fabric a geometrical character of “curvature”; the more considerable the disturbance, the greater the curvature. Thus an empty region (not containing energy, but under its influence) has less curvature than a region in which free energy abounds.
When energy is blocked from flowing naturally, it creates force, which is linked to uneven movements like accelerations and rotations. This disrupted World-Frame is what we call the World-Fabric. It has different non-Euclidean characteristics that vary from the simple "flat" nature of the World-Frame depending on the level of disturbance (action) in the area. Disturbance gives the fabric a geometric quality of "curvature"; the more significant the disturbance, the greater the curvature. So, an empty area (not containing energy, but affected by it) has less curvature than an area filled with free energy.
Our problem, after showing the relativity of force (especially gravitational force), is to determine the law underlying the fabric’s geometrical character; to ascertain how the degree of curvature is related to the energy influencing a region, and how the curvature of one region is linked by differential equations to that of neighboring regions. Such a law will be seen to be the law of gravitation. [258]
Our challenge, after demonstrating the relativity of force (particularly gravitational force), is to figure out the principle behind the shape of the fabric; to understand how the amount of curvature is connected to the energy affecting a certain area, and how the curvature of one area is related through differential equations to that of nearby areas. This principle will be recognized as the law of gravitation. [__A_TAG_PLACEHOLDER_0__]
We study the World-Fabric by considering tracks on which material particles and light-pulses progress; we find such tracks regulated and defined by the Fabric’s curvature, and not, as hitherto supposed, by attractive force inherent in matter. As a track is measurable by summing the separation-intervals between near-by point-events on it, all observers will agree which is the unique track between two distant point-events. Einstein postulates that freely progressing bodies will follow unique tracks, which are therefore called natural tracks (geodesics).
We examine the World-Fabric by looking at the paths that material particles and light pulses take; we discover that these paths are shaped and determined by the curvature of the Fabric, rather than by the attractive forces that were previously thought to be inherent in matter. As a path is measurable by adding up the separation intervals between nearby events along it, all observers will agree on the one unique path connecting two distant events. Einstein proposes that freely moving bodies will follow these unique paths, which are referred to as natural paths (geodesics).
If material bodies are prevented from following natural tracks by contact with matter or other causes, the phenomenon of gravitational force is manifested relative to them. Whenever the natural flow of energy is interrupted force is born. For example, when the piston interrupts the flow of steam, or golf ball flow of club, force results—the interruption is mutual, and the force relative to both. Likewise when the earth interrupts the natural track of a particle (or observer) gravitational force is manifested relative to both.
If physical objects are blocked from following their natural paths due to contact with matter or other reasons, the phenomenon of gravitational force is shown relative to them. Whenever the natural flow of energy is disrupted, force is created. For instance, when a piston stops the flow of steam, or a golf club stops the movement of a golf ball, force occurs—the interruption is mutual, and the force is relative to both. Similarly, when the Earth blocks the natural path of a particle (or observer), gravitational force is displayed in relation to both.
So long as a body moves freely no force is appreciated by it. A falling aviator (neglecting air resistance) will not appreciate any gravitational force. He follows a natural track, thereby freeing himself from the force experienced in contact with matter. He acquires an accelerating motion with respect to an inertial system. By acquiring a particular accelerating motion an observer can annul any force experienced in any small region where the field of force can be considered constant.
As long as a body is moving freely, it doesn't feel any force acting on it. A falling pilot (ignoring air resistance) won't feel any gravitational force. He follows a natural path, which allows him to escape the forces he feels when in contact with matter. He gains an accelerating motion relative to an inertial system. By achieving a specific accelerating motion, an observer can cancel out any force experienced in any small area where the force field can be seen as constant.
Thus Einstein, interpreting the equality of gravitational [259]and inertial mass, showed that the same quality manifests itself according to circumstances as “weight” or as inertia, and that all force is purely relative and may be treated as one phenomenon (an interruption in energy flow). This “Principle of Equivalence” shows that small portions of the World-Fabric, observed from a freely moving particle (free of force), could be treated as small portions of the World-Frame.3
Thus, Einstein interpreted the equality of gravitational [__A_TAG_PLACEHOLDER_0__] and inertial mass, demonstrating that the same property appears differently depending on the situation, either as “weight” or as inertia. He argued that all force is entirely relative and can be viewed as a single phenomenon (a disruption in energy flow). This “Principle of Equivalence” indicates that small sections of the World-Fabric, viewed from a freely moving particle (one free of force), can be regarded as small sections of the World-Frame.3
If such observations were practicable, we could determine the Fabric curvature by referring point-event measurements to equation (1). We cannot observe from unique tracks but we can observe them from our restrained situation. Their importance is now apparent, because, by tracing them over a region, we are tracing something absolute in the Fabric—its geometrical character. We study this curvature by exploring separation-intervals on the tracks of freely moving bodies, relating these separation-intervals to actual measurements in terms of space and time components depending on the observer’s reference system. The law of curvature must be the law of gravitation. To illustrate the lines on which Einstein proceeded to survey the World-Fabric from the earth we will consider a similar but more simple problem—the survey of the sea-surface curvature from an airship. We study this curvature by exploring small distances on the tracks of ships (which we must suppose can only move uniformly [260]on unique tracks—arcs of great circles), relating such distances to actual measurements in terms of length and breadth components depending on the observer’s reference system. This two-dimensional surface problem can be extended to the four-dimensional Fabric one.
If we could make such observations, we could figure out the curvature of the Fabric by linking point-event measurements to equation (1). We can't observe from unique tracks, but we can see them from our limited perspective. Their significance is now clear, because tracing them across a region means we are identifying something absolute in the Fabric—its geometric nature. We examine this curvature by looking at separation intervals on the paths of freely moving bodies, connecting these separation intervals to actual measurements in terms of spatial and temporal components based on the observer's reference system. The law of curvature must align with the law of gravitation. To demonstrate the approach Einstein took to analyze the World-Fabric from Earth, we'll consider a similar but simpler problem—the curvature of the sea surface as viewed from an airship. We study this curvature by examining small distances on the paths of ships (which we must assume can only move uniformly on unique tracks—arcs of great circles), relating those distances to actual measurements of length and width based on the observer's reference system. This two-dimensional surface problem can be expanded to the four-dimensional Fabric problem.

We consider the surface to be covered by two arbitrarily drawn intersecting series
of curves: curves in one series not intersecting each other, vide figure. This Gaussian system of coordinates is appropriate only when the smaller the surface considered, the more nearly it approximates
to Euclidean conditions. It admits of defining any point on the surface by two numbers
indicating the curves intersecting at that point. P is defined by ,
.
(very near P) is defined by
,
. The equation for the minute distance s between two adjacent points in such a system is given by the general formula
[261]
The g’s may be constants or functions of
,
. Their value is dependent on the observer’s reference system and on the geometrical
character of the surface observed. The curves being arbitrary, the formula is appropriate for any reference system,
or even if the observer does not know exactly what his reference system is. (The Fabric
observer does not know what his space and time partitioning actually is because he
is in a gravitational field). It is the g’s which disclose the geometry of an observer’s partitions, and their values also
contain a reflection of the character of the region observed.
We think of the surface as being covered by two sets of intersecting curves drawn randomly: curves in one set that don’t intersect each other, see figure. This Gaussian system of coords is only suitable when considering smaller portions of the surface, which more closely resemble Euclidean conditions. It allows us to define any point on the surface using two numbers that indicate the curves intersecting at that point. P is defined by ,
.
(very close to P) is defined by
,
. The formula for the tiny distance s between two nearby points in this system is given by the general equation
[__A_TAG_PLACEHOLDER_0__]
The g values may be constants or functions of
,
. Their values depend on the observer’s reference system and the geometric characteristics of the observed surface. Since the curves are arbitrary, the formula works for any reference system, even if the observer isn’t exactly sure what their reference system is. (The Fabric observer doesn’t know how their space and time are organized because they are in a gravitational field). It’s the g values that reveal the geometry of an observer’s partitions, and their values also reflect the characteristics of the region being observed.
We find s by direct exploration with a moving ship ( is found by direct exploration with a freely moving particle);
,
are the observed length and breadth measurement differences which we have to relate
to s. By making sufficient observations in a small area and referring them to the general
formula we can find the values of the g’s for the observer’s particular reference system. Different values for g’s will be found if the observer changes his reference system, but there is a limitation
to the values so obtainable owing to the part played by the surface itself, which is diffidently expressing its intrinsic geometrical character in the g’s in each observation.
We find s through direct exploration with a moving ship ( is discovered through direct exploration with a freely moving particle);
,
are the observed differences in length and width measurements that we need to relate to s. By making enough observations in a small area and connecting them to the general formula, we can determine the values of the g’s for the observer’s specific reference system. Different values for g’s will be found if the observer changes their reference system, but there is a limit to the values that can be obtained due to the role played by the surface itself, which subtly reveals its intrinsic geometric character in the g’s in each observation.
Einstein's Findings
Thus we approach the absolute character of the surface through the relative nature of the observer’s reference system. There is a relationship common to all values of the g’s that belong to the same [262]curvature. This relationship is expressed by a differential equation. It is this equation of curvature that the airship’s observer must find. Einstein’s problem was similar, but he was concerned with four dimensions, which entailed a general formula with ten g’s, and he had to find a set of differential equations of the second order to determine the law of Fabric curvature. He divided the Fabric into regions: I. World-Frame—beyond influence of energy. II. Empty region—free of energy, but under its influence. III. Region containing free energy only. Each region has a characteristic curvature. By means of an absolute differential calculus—a wonderful mathematical scaffolding erected by Riemann, Christoffel and others—involving the theory of tensors, he succeeded in finding such a set of equations. He kept the following points in view: (1) The equations must not only give the character of region II, but must satisfy the special case of region I; (2) They must be independent of any partitioning system, because the General Theory of Relativity demands that a law of nature be in a form appropriate for all observers whatever their position and motion; (3) They must be concerned with energy which is conserved, not mass which the Special Theory showed dependent on velocity. This set of differential equations which shows how the curvature of the Fabric at any point links to the curvature at neighboring points is the law of gravitation, a law which has been severely tested by the practical observation of the solar eclipse already referred to. At a first approximation these equations degenerate into Newton’s Law. At a second [263]approximation they account for the motion of the perihelion of Mercury, which had hitherto baffled astronomers. All the laws of mechanics are deducible from this law of World-Fabric curvature, i.e. conservation of energy (which includes conservation of mass since we re-define mass as energy) and conservation of momentum (re-defined by a relativist). It must be noted that this law and the General Theory show that the velocity of light is not absolutely constant, but, like everything else, a light-pulse is affected by the Fabric curvature in a gravitational field. In conclusion we will contrast some conspicuous differences in the old world view of classical mechanics and the new view presented by Einstein.
Thus, we understand the true nature of the surface through the observer's relative perspective. There is a connection shared by all values of the g’s that belong to the same [__A_TAG_PLACEHOLDER_0__] curvature. This connection is expressed by a differential equation. It’s this curvature equation that the airship’s observer needs to determine. Einstein faced a similar challenge, but he dealt with four dimensions, which required a general formula with ten g’s, necessitating a set of second-order differential equations to define the law of Fabric curvature. He categorized the Fabric into regions: I. World-Frame—beyond energy influence. II. Empty region—free of energy but influenced by it. III. Region containing only free energy. Each region has a unique curvature. Through an absolute differential calculus—a remarkable mathematical framework developed by Riemann, Christoffel, and others—that involves tensor theory, he was able to derive such a set of equations. He kept the following points in mind: (1) The equations must not only describe region II but also satisfy the specific case of region I; (2) They must be independent of any partitioning system because the General Theory of Relativity requires that a natural law be applicable in a form suitable for all observers, regardless of their position and motion; (3) They must focus on conserved energy rather than mass, which the Special Theory showed to depend on velocity. This set of differential equations, which shows how the curvature of the Fabric at any point relates to the curvature at neighboring points, defines the law of gravitation, a law that has been rigorously tested through the practical observation of the solar eclipse mentioned earlier. At first approximation, these equations simplify to Newton’s Law. At second [__A_TAG_PLACEHOLDER_0__] approximation, they explain the motion of Mercury’s perihelion, which had puzzled astronomers until then. All the mechanics laws can be derived from this law of World-Fabric curvature, meaning conservation of energy (which includes mass conservation since we redefine mass as energy) and momentum conservation (redefined by relativists). It's important to note that this law and the General Theory demonstrate that the speed of light is not absolutely constant; rather, like everything else, a light pulse is influenced by Fabric curvature in a gravitational field. In conclusion, we will highlight some notable differences between the traditional worldview of classical mechanics and the new perspective introduced by Einstein.
1. A three-dimensional ether medium with variously conceived properties which communicated the supposed inherent attractive force in matter in some unexplained way, and transmitted electromagnetic waves, has been replaced by a four-dimensional external World-Fabric, the geometrical character of which controls the motion of matter (energy) and accounts for all mechanical laws.
1. A three-dimensional ether medium with different imagined properties that communicated the supposed inherent attractive force in matter in some unclear way and transmitted electromagnetic waves has been replaced by a four-dimensional external World-Fabric, whose geometric characteristics control the motion of matter (energy) and explain all mechanical laws.
2. After separating the observer’s subjective share in definitions from nature’s share in the things defined, space, time, and force, hitherto regarded as absolute, have been shown to be purely relative and dependent on the observer’s track. Mass has also proved to be relative to velocity unless re-defined as energy. As classical mechanics bases all definitions on space, time, and mass units, the relativity of such defined quantities is now apparent.
2. After distinguishing the observer's personal input in definitions from nature's part in the things being defined, space, time, and force, which were previously thought to be absolute, have been shown to be entirely relative and dependent on the observer’s perspective. Mass has also been found to be relative to velocity unless it's redefined as energy. Since classical mechanics relies on definitions based on space, time, and mass units, it's now clear that these defined quantities are relative.
3. Newton’s laws of motion, his law of gravitation, [264]and the laws of conservation, hitherto regarded as unrelated, are now synthesised in a basic law of mechanics.
3. Newton's laws of motion, his law of gravitation, [__A_TAG_PLACEHOLDER_0__] and the laws of conservation, which were previously seen as separate, are now combined into a fundamental law of mechanics.
Einstein has not disturbed the electric theory of matter, and both the old and new physics have in common the “Principle of Least Action.” We obtain a glimpse of this principle in the unique tracks pursued by freely moving bodies, which may be regarded as tracks of least effort, force only being manifested as an expression of the Fabric’s resentment when bodies depart from these natural tracks. Einstein has approached nearer to the truth in regard to the laws underlying nature, and, as always, this means a simplification. His theory, which entails a readjustment of such fundamental conceptions as space and time, opens up fresh fields to scientific investigation and to philosophic thought. It reveals a bridge uniting the domains of physics and philosophy, and it heralds a new era in the history of science. [265]
Einstein hasn't disrupted the electric theory of matter, and both the old and new physics share the “Principle of Least Action.” We catch a glimpse of this principle in the distinct paths taken by freely moving bodies, which can be seen as paths of least effort, with force only appearing as a response from the Fabric when bodies stray from these natural paths. Einstein has moved closer to understanding the laws that govern nature, and, as always, this leads to simplification. His theory, which requires a rethinking of fundamental concepts like space and time, opens up new areas for scientific exploration and philosophical inquiry. It establishes a connection between physics and philosophy, signaling a new era in the history of science. [__A_TAG_PLACEHOLDER_0__]
1 Commander McHardy uses the term “event” in a sense somewhat different from that seen in a majority of the essays. He reserves for the four-dimensional element—the instant of time at a point in space—the name “point-event”; and the term “event” he applies to a collection of these forming, together, an observable whole. An actual physical happening, like a railroad wreck or a laboratory experiment, it will be realized is of the latter sort, occupying an appreciable region of space rather than a single point, and an appreciable interval of time rather than a single second. To the element, the “point-event” of Commander McHardy’s essay, this bears the same relation that the geometer’s solid bears to his point. This comment is in no sense to be taken as criticism of Commander McHardy’s terminology, which rather appeals to us; we make it merely to guard against confusion in the reader’s mind.—Editor. ↑
1 Commander McHardy uses the term “event” in a way that’s a bit different from how most of the essays use it. He calls the four-dimensional element—the instant of time at a specific location in space—a “point-event.” The term “event” refers to a collection of these “point-events” that together form an observable whole. An actual physical occurrence, like a train wreck or a lab experiment, fits this latter definition, as it occupies a noticeable amount of space rather than a single point, and lasts for a significant period of time rather than just one second. The “point-event” in Commander McHardy’s essay is related to the element in the same way that a solid relates to a point in geometry. This note isn’t meant to criticize Commander McHardy’s terminology, which we actually find appealing; we just want to prevent any confusion for the reader.—Editor. ↑
2 This paragraph is the result of an editorial revision of the author’s text, designed to retain the substance of his presentation, while tying up what he has to say more definitely with the preceding essays, and eliminating the distinction between finite and infinitesimal intervals, which we believe to be out of place in an essay of this character. We will not apologize to our mathematical readers for having used finite and differential notation in the same equation, in violation of mathematical convention.—Editor. ↑
2 This paragraph is the result of an editorial revision of the author’s text, aimed at keeping the core of his presentation while better connecting it to the previous essays and removing the distinction between finite and infinitesimal intervals, which we think is unnecessary for an essay like this. We won’t apologize to our math readers for using finite and differential notation together in the same equation, even though it goes against mathematical convention.—Editor. ↑
3 Although gravitational force in a small region can be imitated or annulled by accelerating motion, there remains the disturbing influence of gravitational matter already referred to and expressed in the fabric curvature. It is this that defines how unique tracks run, or rather, how bodies progress.—Author. ↑
3 Even though gravitational force in a small area can be simulated or canceled out by acceleration, there is still the unsettling effect of gravitational matter mentioned earlier, which is shown in the bending of space. This is what determines the paths taken or, more accurately, how objects move.—Author. ↑
XV
THE NEW WORLD
A Universe in Which Geometry Takes the Place of Physics, and Curvature That of Force
A Universe Where Geometry Takes the Place of Physics, and Curvature Takes the Place of Force
It is familiar knowledge that the line, the surface and ordinary Euclidean space are to be regarded as spaces of one, two and three dimensions respectively and readers of this journal are aware that a hypothetical space of four dimensions has been closely investigated. The most convenient space to study is the surface or two-space, since we can regard it as embedded in a three-space. If a surface is curved it is generally impossible to draw a straight line on it, for as we see clearly, the “straightest” line is changing its direction at every point. To describe this property accurately it is necessary to ascribe to each point a magnitude which expresses what happens to the direction of a short line in the region when displaced a short distance parallel to itself. This is called the direction-defining magnitude. Different sets of values of this magnitude relate to surfaces of different curvatures.
It is common knowledge that lines, surfaces, and typical Euclidean space represent one, two, and three dimensions, respectively, and readers of this journal are aware that a theoretical four-dimensional space has been thoroughly explored. The easiest space to study is the surface or two-dimensional space since we can think of it as existing within three-dimensional space. If a surface is curved, it's usually not possible to draw a straight line on it, as we can see that the “straightest” line is changing direction at every point. To accurately describe this property, it's necessary to assign a value to each point that indicates how the direction of a short line in the area changes when it's moved a short distance parallel to itself. This is called the direction-defining magnitude. Different sets of values for this magnitude correspond to surfaces with different curvatures.
A second fundamental property has recently been pointed out. There is inherent in every part of a space a measure of length peculiar to that particular [266]region and which in general varies from region to region. To describe this variation accurately it is necessary to ascribe to each point another magnitude called the length-defining magnitude, which expresses the change from each point to the next of the unit of length. These two magnitudes define the surface completely.
A second fundamental property has recently been highlighted. Every part of a space has a unique measure of length specific to that region, and this generally varies from one area to another. To describe this variation accurately, we need to assign each point another value called the length-defining magnitude, which indicates the change in length from one point to the next. These two values fully define the surface.
Similarly, a space of any number of dimensions is defined completely by a similar pair of magnitudes. A space is the “field” of such a magnitude-pair and the nature of these magnitudes defines the dimensions of the space. The four-space usually described is the Euclidean member of an infinity of four-spaces.
Similarly, a space of any number of dimensions is completely defined by a comparable pair of magnitudes. A space is the “field” of that magnitude pair, and the characteristics of these magnitudes determine the dimensions of the space. The four-dimensional space typically described is the Euclidean version among an infinite number of four-dimensional spaces.
When we look into a mirror we see a space differing from ordinary space in that right
and left are interchanged and this is described mathematically by saying that if we
locate points as usual by specifying three distances ,
,
of the point from three mutually perpendicular planes, then a point
,
,
, in actual space corresponds with a point
,
,
in the mirrored space: in other words the mirrored space is derived from the real
space by multiplying the
coordinates by
. If we were to multiply by
instead of
we should derive a different space; in this case, however, we have no mirror to show
us what it looks like. Such a space is said to have one negative dimension and it
has the peculiar property that in the figure derived from the right triangle of ordinary
space the square of the “hypotenuse” equals the difference and not the sum of the
squares of the other two sides, so that the length of a line may sometimes have to
be represented by the square-root [267]of a negative number, a “complex” number.
When we look in a mirror, we see a space that is different from regular space because right and left are flipped. This can be expressed mathematically by saying that if we locate points by using three distances ,
,
of the point from three mutually perpendicular planes, then a point
,
,
in actual space matches a point
,
,
in the mirrored space. In other words, the mirrored space is derived from the real space by multiplying the
coordinates by
. If we were to multiply by
instead of
, we would create a different space. In this case, though, we don’t have a mirror to show us what it looks like. This kind of space is said to have one negative dimension and has the unique quality that in the figure formed from the right triangle of regular space, the square of the “hypotenuse” equals the difference—not the sum—of the squares of the other two sides. Therefore, sometimes the length of a line might need to be represented by the square root [__A_TAG_PLACEHOLDER_0__] of a negative number, which is a “complex” number.
In considering what at first sight may appear to be fantastic statements made by this theory, it must be borne in mind that all our knowledge of the external universe comes through our sense-impressions, and our most confident statements about external things are really of the nature of inferences from these sense-impressions and, being inferences, liable to be wrong. So that if the theory says that a stone lying on the ground is not a simple three-dimensional object, and that its substance is not the same as its substance a moment before, the matter is one for due consideration and not immediate disbelief.
When thinking about what might initially seem like unbelievable claims made by this theory, it's important to remember that all our knowledge of the outside world comes through our senses. Our strongest beliefs about external things are actually based on inferences from these senses, and since they are inferences, they can be incorrect. So, if the theory suggests that a stone resting on the ground isn’t just a straightforward three-dimensional object and that its substance isn’t the same as it was a moment ago, this is something that deserves careful thought rather than immediate rejection.
The idea that the universe extends in time as well as in space is not new, and fiction-writers have familiarized us with wonderful machines in which travellers journey in time and are present at various stages of the world’s history. This conception of the universe, to which the name “space-time” is usually applied, is adopted by the new theory and assigned the status of a physical reality.
The idea that the universe stretches through time as well as space isn't new, and writers have introduced us to amazing machines that let travelers move through time and witness different moments in history. This view of the universe, often called "space-time," is embraced by the new theory and recognized as a physical reality.
World Geometry
The fundamental creed of the new theory is that the space-time universe constitutes a true four-dimensional space of one negative dimension, this dimension being time. The variations from point to point of the direction-defining and length-defining magnitudes generate the geometrical properties of curvature, etc., and these are cognised by the human mind as physical phenomena: our sense-impressions are nothing more nor less than perceptions of the [268]geometry of a fourspace. So instead of inferring from our sense-impressions the existence of matter, motion and the like as we are accustomed to do, we should with equal justice infer the existence of a geometrical fourspace. Thus it becomes necessary to prepare a dictionary in which the familiar things of our world are identified with those geometrical properties of the four-space which really constitute them, and in so doing parts of our geometrical knowledge assume the guise of new physical knowledge.
The core belief of the new theory is that the space-time universe is actually a true four-dimensional space with one negative dimension, which is time. The changes from point to point in the direction-defining and length-defining measures create the geometric properties of curvature, etc., and these are perceived by the human mind as physical phenomena: our sense impressions are nothing more than perceptions of the [__A_TAG_PLACEHOLDER_0__]geometrical nature of a four-space. So instead of deducing the existence of matter, motion, and such from our sense impressions as we usually do, we should just as reasonably conclude that a geometrical four-space exists. Therefore, it becomes essential to create a dictionary that links the familiar aspects of our world with the geometrical properties of the four-space that truly make them up, and in doing so, some of our understanding of geometry appears as new physical knowledge.
Through the fourspace our consciousness travels, cognising a changing three-dimensional section of it as it goes and thus giving rise to time. It becomes aware that the fourspace is pleated or folded along lines all running roughly in the same direction, and possibly because this is the easiest direction to follow, it travels along the lines. The direction of this motion is the negative dimension. Thus consciousness is always aware of the nearly constant forms of the cross-sections of the pleats along which it travels. These unvarying forms constitute matter: matter is the form of a section through a uniform pleat of the fourspace—a three-dimensional aspect of a four-dimensional curvature; so that in strict accuracy we should say that a stone is the shape or form of a changing section of a four-dimensional object, the complete object being a long fold in the fourspace. The physical interpretation of this conservation of form of the cross-section is that matter is conserved. It is thus seen that the conscious mind, by following these pleats, has so determined time that the law of the conservation of matter must hold. The mathematical treatment of the [269]subject makes it clear that practically all other physical laws similarly follow as a direct result of this choice of time. The type of order prevailing in the physical universe, the laws of gravitation, heat, motion and the rest are not directly imposed by some external power, but are apparently chosen by mind itself.
Through the four-dimensional space our consciousness moves, recognizing a shifting three-dimensional slice of it along the way, which creates the concept of time. It becomes aware that the four-dimensional space is pleated or folded along lines that generally run in the same direction, likely because that’s the easiest path to follow. It travels along these lines, which represent the negative dimension. Therefore, consciousness is always aware of the nearly constant shapes of the cross-sections of the pleats it follows. These unchanging shapes make up matter: matter is the shape of a slice through a uniform pleat of the four-dimensional space—a three-dimensional aspect of a four-dimensional curvature. So, to be precise, we should say that a stone is the shape of a changing section of a four-dimensional object, the entire object being a long fold in the four-dimensional space. The physical interpretation of this consistency in the cross-section’s shape is that matter is conserved. It becomes clear that the conscious mind, by following these pleats, has determined time in such a way that the law of conservation of matter must apply. The mathematical treatment of the [__A_TAG_PLACEHOLDER_0__]subject clarifies that nearly all other physical laws similarly arise as a direct result of this understanding of time. The order that exists in the physical universe, including the laws of gravity, heat, motion, and others, are not directly imposed by some external force but appear to be chosen by the mind itself.
In the neighborhood of these pleats the fourspace is still curved, but to a smaller degree. This we cognise as energy or as a field of force. Thus energy is seen to be the same kind of thing as matter and would therefore be expected to have weight. This was experimentally demonstrated in 1919 when light was in effect actually weighed. Conversely, matter consists of energy; and it is calculated that one liter of water contains sufficient energy to develop a million horsepower for about four years. It is now believed that the sun’s energy is derived from the disintegration of the matter of which it is made.
In the area around these folds, the four-space is still curved, but to a lesser extent. We recognize this as energy or a field of force. So energy is understood to be the same as matter and would therefore be expected to have weight. This was actually demonstrated in 1919 when light was effectively weighed. On the flip side, matter is made up of energy; it’s estimated that one liter of water has enough energy to produce a million horsepower for about four years. It’s now thought that the sun’s energy comes from the breakdown of the matter it is composed of.
The method of establishing these identifications will be clear from the following: We already knew that matter is made up of electrons and that radiant energy is electromagnetic and before the advent of this theory it was regarded as certain that practically all observed physical phenomena except gravitation were manifestations of the electromagnetic field. The new theory has confirmed this belief. It is found that the gravitational and electromagnetic conditions of the universe are completely defined if to each point of space-time a gravitational and an electric potential are ascribed. These are magnitudes of the same nature as the direction-defining and length-defining magnitudes which must necessarily be [270]associated with every point of space-time if it is a true “space,” and they are therefore identified with these. By performing ordinary mathematical operations on these magnitudes statements of fact clothed in mathematical form are obtained, which are to be interpreted on the one hand as physical laws and on the other as geometrical properties of the fourspace. Nearly all our physical laws are derivable mathematically in this way, so that an extensive identification is effected which has been fruitful of results.
The way we establish these identifications will be clear from the following: We already knew that matter is made up of electrons and that radiant energy is electromagnetic. Before this theory came about, it was commonly accepted that almost all observed physical phenomena, except for gravity, were manifestations of the electromagnetic field. The new theory has confirmed this belief. It has been found that the gravitational and electromagnetic conditions of the universe can be fully defined by assigning a gravitational and an electric potential to each point in space-time. These are measurements similar to the direction and length measurements that must be associated with every point in space-time if it truly qualifies as "space," and so they are identified as such. By performing standard mathematical operations on these measurements, we obtain statements of fact expressed in mathematical form, which can be interpreted both as physical laws and as geometric properties of the four-dimensional space. Almost all of our physical laws can be mathematically derived in this way, resulting in a broad identification that has yielded significant results.
It has been mentioned that a slight curvature is sometimes cognised as force and as this identification appeared originally as a postulate its history is interesting.
It has been noted that a slight curve is sometimes recognized as force, and since this identification initially appeared as a postulate, its history is intriguing.
The Origin of the Theory
An experiment by Michelson and Morley (1887), on which the whole theory is based, made it appear that if a man measures the velocity at which light passes him he will get the same result whether he is stationary, rushing to meet the light, or moving in the same direction as the light. The solution was provided by Einstein in 1905. He suggested that since we know the results of these determinations ought not to agree, something must have happened to the clocks and measuring-rods used in measuring the velocity so that the standards of length and time were not the same in the three cases, the alterations being exactly such as to make the velocity of light constant. This solution is universally accepted as true and is the fundamental postulate. Thus the length of a stick and the rate at which time passes [271]will change as the velocity of the person observing these things changes. If a man measured the length of an aeroplane going past him at 161,000 miles per second it would measure only half the length observed when stationary. If the aeroplane were going with the velocity of light, its length would vanish though its breadth and height would be unaltered. Similarly, if of two twin brothers one were continually moving with reference to the other their ages would gradually diverge, for time would go at different rates for the two. If one moved with the velocity of light, time would stand still for him while for the other it would go on as usual. To get actually younger it would be necessary to move quicker than light which is believed to be impossible. The velocity of light is assumed to be the greatest velocity occurring in nature.
An experiment by Michelson and Morley (1887), on which the whole theory is based, showed that if a person measures the speed at which light passes them, they will get the same result whether they are stationary, moving toward the light, or traveling in the same direction as the light. Einstein provided the solution in 1905. He suggested that since we expect these measurements to not agree, something must have impacted the clocks and measuring devices used to determine the speed, so the standards of length and time were different in the three situations, with the changes being just right to keep the speed of light constant. This solution is widely accepted as true and is the fundamental principle. Therefore, the length of an object and the flow of time [__A_TAG_PLACEHOLDER_0__]will change as the speed of the observer changes. If a person measured the length of an airplane passing by at 161,000 miles per second, it would appear only half as long as when they are stationary. If the airplane were traveling at the speed of light, it would seem to have no length, though its width and height would remain unchanged. Similarly, if one twin brother is always moving in relation to the other, their ages would gradually differ because time would pass at different rates for the two. If one traveled at the speed of light, time would stop for them while it would continue normally for the other. To actually become younger, one would need to move faster than light, which is believed to be impossible. The speed of light is thought to be the maximum speed in nature.
Evidently then if the distance in space and the interval in time separating two given
events, such as the firing of a gun and the bursting of the shell, are measured by
two observers in uniform relative motion, their estimates will not agree. Consider
now the simple problem of measuring the distance between two points on an ordinary
drawing-board. If we draw two perpendicular axes, we can define this distance by specifying
the lengths of the projections on the two axes of the line joining the points. If
we choose two different axes the projections will not be the same but will define
the same length. Similarly, in a Euclidean four-space the distance between two points
will be defined by the projections on the four axes, but if these axes be rotated
slightly, the projections will be different, but [272]will define the same length. Now, returning to the two observers just mentioned, it
was noticed by Minkowski in 1908 that if the space measurements between the two events
are split into the usual three components, and if the time measurements are multiplied
by , the difference between the two sets of measurements is exactly the same as would
have occurred had these two events been points in a Euclidean fourspace, and two different
observations made of their distance apart using two sets of axes inclined to each
other. The velocity of light is made equal to 1 in this calculation by a suitable
choice of units. This discovery threw a vivid light on the problem of space-time,
showing that it is probably a true four-space of one negative dimension, a simple
derivative of the much-discussed and now familiar Euclidean four-space.
Clearly, if two observers in uniform relative motion measure the distance in space and the interval in time between two specific events, like the firing of a gun and the explosion of a shell, they will come up with different estimates. Let's look at a basic example of measuring the distance between two points on a regular drawing board. By drawing two perpendicular axes, we can define this distance by specifying the lengths of the projections on the two axes of the line connecting the points. If we choose two different axes, the projections won't be the same, but they'll measure the same length. Similarly, in a Euclidean four-space, the distance between two points will be defined by the projections on the four axes, but if these axes are tilted slightly, the projections will differ, yet [__A_TAG_PLACEHOLDER_0__]will measure the same length. Now, going back to the two observers mentioned earlier, Minkowski observed in 1908 that if the space measurements between the two events are divided into the usual three components, and if the time measurements are multiplied by , the difference between these two sets of measurements is exactly what would happen if these two events were points in a Euclidean four-space, and two different observations were made of their distance apart using two sets of axes that are tilted relative to each other. We set the speed of light to 1 in this calculation by choosing appropriate units. This finding shed light on the space-time issue, suggesting that it is likely a true four-space with one negative dimension, a straightforward derivative of the now well-known Euclidean four-space.
Although this discovery gave a tremendous impetus to the progress of the theory, it is probable that it holds a deeper significance not yet revealed. It is probably a statement of the “stuff” of which the four-space is made, and perhaps also of how it is made; but the problem remains unsolved.
Although this discovery greatly advanced the theory, it likely has a deeper significance that hasn't been uncovered yet. It probably describes the "material" that makes up four-dimensional space and maybe even how it's formed; however, the problem is still unresolved.
It thus becomes plain that our two observers are merely looking at the same thing from different viewpoints. Each has just as much right as the other to regard himself as being at rest in ordinary space (this is the postulate of the relativity of uniform motion) and to regard his time direction as a straight line in the four-space. The difference is merely that the two time axes are inclined to each other. If, however, one were moving with an acceleration with reference to the other his path in the four-space [273]will appear curved to the other, though he himself, since he regards it as his time axis, will still assume it to be straight. If there is a body moving in what one observer sees to be a straight line, the other will, of course, in general see it as curved, and following the usual custom, since this body, without apparent reason, deviates from the straight path, will say there must be some force acting on it. Thus the curvature of his time axis, due to his accelerated motion, makes it appear that there is round him a field of force, which causes freely moving bodies to deviate from the straight path. Now if space-time is itself inherently curved it is not generally possible for any line in it to be straight any more than it is possible for any line on the surface of a sphere to be straight. Hence, all axes must be curved, and all observers, whatever their states of motion, must experience fields of force which are of the same nature as those due to motion only. The extra force experienced when a lift begins to rise is an example of force due to pure motion: gravitation is the similar force due to an inherent curvature of the four-space, and it was the postulate that these forces were similar that made possible Einstein’s solution of the general problem of gravitation.
It becomes clear that our two observers are just looking at the same thing from different perspectives. Each has just as much right as the other to see themselves as at rest in regular space (this is the idea behind the relativity of uniform motion) and to view their time direction as a straight line in four-dimensional space. The difference is that the two time axes are tilted relative to each other. However, if one is accelerating compared to the other, their path in four-dimensional space [__A_TAG_PLACEHOLDER_0__] will seem curved to the other, while the one moving will still view it as straight since they consider it their time axis. If there is an object moving in what one observer perceives as a straight line, the other will generally see it as curved, and, following the usual pattern, since this object deviates from the straight path for no apparent reason, they will conclude that some force must be acting on it. Thus, the curvature of their time axis, due to their accelerated motion, creates the impression that there is a field of force around them, which causes freely moving objects to stray from a straight path. Now, if space-time is inherently curved, it’s generally impossible for any line in it to be straight, just like no line on the surface of a sphere can be straight. Therefore, all axes must be curved, and all observers, regardless of their motion, will experience fields of force that are similar to those caused by motion alone. The extra force felt when an elevator starts to rise is an example of a force from pure motion: gravity is a similar force resulting from the inherent curvature of four-dimensional space, and the idea that these forces are alike enabled Einstein to solve the general problem of gravitation.
The Timeline
The correlation of time with its geometrical analogue is of absorbing interest. Representing velocity by the common method of plotting a curve showing positions at various times and marking distances horizontally and times vertically, the velocity of [274]light being 1, MM′ and NN′ will both represent this velocity. Since this is assumed to be the greatest velocity occurring in nature, all other possible velocities
The link between time and its geometric counterpart is really intriguing. Using the usual method of plotting a curve that shows positions at different times, with distances along the horizontal axis and times along the vertical axis, the speed of [__A_TAG_PLACEHOLDER_0__]light is set at 1. Both MM′ and NN′ will represent this speed. Since this is considered the maximum speed possible in nature, all other speeds will

are represented by lines falling within the upper and lower V’s. Now this diagram
correctly represents two dimensions of Minkowski’s Euclidean four-space so, transmuting
to real but flat four-space by multiplying times by , it is seen that there is a region outside which no effect can be propagated from
O since that would involve the existence of a velocity greater than that of light.
This region represents the future of O. Similarly, O can only be affected by events within the region derived from the downward-opening
V, which therefore represents the past of O. The region between the two represents events which may be either simultaneous [275]with O or not, according to the velocity of the observer at O. Thus in this theory an event dictated by free-will, could affect points in its “future”
region, but not in any other, which agrees with experience and shows that the theory
is not essentially “determinist.” If “free-will” is really free, the future is not yet determined, and the fourspace
must be in some way formed by the will as time progresses.
are represented by lines within the upper and lower V shapes. This diagram accurately illustrates two dimensions of Minkowski’s Euclidean four-space, so when we transform it into real but flat four-space by multiplying by , we can see that there is an area outside of which no effect can be transmitted from O, as that would require a speed exceeding that of light. This area represents the future of O. Likewise, O can only be influenced by events within the area defined by the downward-opening V, which thus represents the past of O. The space between the two represents events that may be either simultaneous [__A_TAG_PLACEHOLDER_0__] with O or not, depending on the observer's velocity at O. Therefore, in this theory, an event driven by free will could influence points in its "future" region, but not in any other, which aligns with our experiences and demonstrates that the theory is not fundamentally “deterministic.” If “free will” is genuinely free, the future is not predetermined, and four-space must somehow be shaped by this will as time moves forward.
The trains of thought inspired by Einstein’s postulates have already carried us to a pinnacle of knowledge unprecedented in the history of man. On every hand, as we look out upon the universe from our new and lofty standpoint, unexpected and enthralling vistas open up before us, and we find ourselves confronting nature with an insight such as no man has ever before dared aspire to.
The ideas sparked by Einstein’s theories have taken us to an unmatched height of understanding in human history. Everywhere we look at the universe from our new and elevated perspective, surprising and exciting views unfold before us, and we face nature with a clarity that no one has ever before dreamed of.
It is completely unthinkable that this theory can ever be swept aside. Apart from experimental verifications which, in point of fact, lend it the strongest support, no one could work through the theory without feeling that here, in truth, the inner workings of the universe were laid bare before him. The harmony with nature is far too complete for any doubt to arise of its truth. [276]
It’s completely unimaginable that this theory could ever be ignored. Besides the experimental evidence that actually backs it up the most, no one can study the theory without realizing that, in reality, the inner workings of the universe are revealed. The harmony with nature is just too perfect for any doubt about its truth to exist. [__A_TAG_PLACEHOLDER_0__]
XVI
THE QUEST OF THE ABSOLUTE
Modern Developments in Theoretical Physics, and the Climax Supplied by Einstein
Recent Advances in Theoretical Physics and the Contribution of Einstein
We shall discuss the more important aspects of the theory popularly known as the “Einstein Theory of Gravitation” and shall try to show clearly that this theory is a natural outcome of ideas long held by physicists in general. These ideas are:
We will talk about the key aspects of the theory commonly referred to as the “Einstein Theory of Gravitation” and will aim to demonstrate clearly that this theory is a natural result of concepts that have been accepted by physicists for a long time. These concepts are:
(a) The impossibility of “action at a distance;” in other words we find an instinctive repugnance to admit that one body can affect another, remote from it, instantaneously and without the existence of an intervening medium.
(a) The impossibility of "action at a distance;" in other words, we have an instinctive aversion to accept that one body can instantly affect another body that is far away, without an intervening medium.
(b) The independence of natural, i.e., physical, laws of their mathematical mode of expression. Thus, when an equation is written down as the expression of a physical law it must be satisfied, no matter what units we choose in order to measure the quantities occurring in the equation. As our physics teacher used to say “the expression of the law must have in every term the same dimensions.” [277]More than this the choice of the quantities used to express the law—if there be a choice open—must have no effect on its correctness. As we were told—“all physical laws are capable of expression as relations between vectors or else as relations between magnitudes of the same dimensions.” We shall hope to make this clearer in its proper place in the essay, as its obvious generalization is Einstein’s cardinal principle of relativity.
(b) The independence of natural, i.e. physical, laws from how they are mathematically expressed. So, when an equation is written as the expression of a physical law, it must hold true regardless of the units we choose for measuring the quantities in the equation. As our physics teacher used to say, “the expression of the law must have in every term the same dimensions.” [__A_TAG_PLACEHOLDER_0__] Furthermore, the choice of the quantities used to express the law—if there’s a choice available—should not affect its accuracy. As we were told, “all physical laws can be expressed as relationships between vectors or as relationships between magnitudes of the same dimensions.” We hope to clarify this in its appropriate section of the essay, as its clear generalization is Einstein’s key principle of relativity.
The measurements which an experimental physicist makes are always the expression of a coincidence of two points in space at the same time. If we ask such an experimenter what he means by a point in space he tells us that, for him, the term has no meaning until he has a material body with reference to which he can locate the point by measurements; in general it requires three measurements and he expresses this by saying that space has three dimensions. He measures his distance, as a rule, parallel to three mutually perpendicular lines fixed in the material body—a Cartesian reference-frame so-called. So that a “point in space” is equivalent to a given material reference-frame and three numbers or coordinates. If, for any reason, we prefer to use a new material reference-frame the coordinates or measurements will change and, if we know the relative positions of the two material reference-frames, there is a definite relation between the two sets of three coordinates which is termed a transformation of coordinates. But which particular material reference-frame shall we use? The first choice would, we think, be that attached to the earth. But, even yet, we are in doubt as there are numberless Cartesian [278]frameworks attached to the earth (as to any material body) and it is here that our idea (b) begins to function. We say it must be immaterial which of these Cartesian frames we use. In each frame a vector has three components and when we change from one frame to another the components change in such a way that if two vectors have their three components equal in one framework they will be equal in any other attached to the same material system. So our idea (b), which says that our physical equations must be vector equations, is equivalent to saying that the choice of the framework attached to any given material body can have no effect on the mode of expression of a natural law.
The measurements that an experimental physicist takes always represent the coincidence of two points in space at the same time. If we ask such an experimenter what he means by a point in space, he’ll tell us that the term doesn’t have meaning for him until he has a material object to refer to, which allows him to locate the point through measurements; in general, it takes three measurements, which he expresses by saying that space has three dimensions. Typically, he measures distance parallel to three mutually perpendicular lines fixed in the material object—what's known as a Cartesian reference frame. Therefore, a “point in space” corresponds to a specific material reference frame and three numbers or coordinates. If we decide to use a different material reference frame for any reason, the coordinates or measurements will change, and if we understand the relative positions of the two material reference frames, there is a specific relationship between the two sets of three coordinates, known as a transformation of coordinates. But which specific material reference frame should we use? We would likely choose the one attached to the Earth. Yet, even then, we have uncertainty since there are countless Cartesian [__A_TAG_PLACEHOLDER_0__]frameworks linked to the Earth (or to any material body), and this is where our idea (b) starts to take shape. We assert that it doesn't matter which of these Cartesian frames we utilize. In each frame, a vector has three components, and when we shift from one frame to another, the components change in such a way that if two vectors have their three components equal in one framework, they will be equal in any other linked to the same material system. Thus, our idea (b), which states that our physical equations must be vector equations, is equivalent to saying that the choice of the framework connected to any given material body doesn’t influence how a natural law is expressed.
Shall we carry over our idea (b) to answer the next question: “To which material body shall we attach our framework?” To this question Newton gave one answer and Einstein another. We shall first consider Newton’s position and then we may hope to see clearly where the new theory diverges from the classical or Newtonian mechanics. Newton’s answer was that there is a particular material frame with reference to which the laws of mechanics have a remarkably simple form commonly known as “Newton’s laws of motion” and so it is preferable to use this framework which is called an absolute frame.
Shall we take our idea (b) and apply it to the next question: “Which physical body should we base our framework on?” Newton had one answer, and Einstein had another. We’ll first look at Newton’s viewpoint, and then we can hopefully see where the new theory differs from classical or Newtonian mechanics. Newton said there is a specific material frame that makes the laws of mechanics incredibly straightforward, commonly referred to as “Newton’s laws of motion,” so it’s better to use this framework, known as an absolute frame.
What is the essential peculiarity of an absolute frame? Newton was essentially an
empiricist of Bacon’s school and he observed the following facts. Let us suppose we
have a framework of reference attached to the earth. Then a small particle of matter under the gravitational influence of surrounding
[279]bodies, including the earth, takes on a certain acceleration . Now suppose the surrounding bodies removed (since we cannot remove the earth we
shall have to view the experiment as an abstraction), and another set introduced;
the particle, being again at its original position, will begin to move with an acceleration
. If both sets of surrounding bodies are present simultaneously the particle begins
to move with an acceleration which is approximately but not quite the sum of
and
. Newton postulated there there is a certain absolute reference frame in which the
approximation would be an equality; and so the acceleration, relative to the material
frame, furnishes a convenient measure of the effect of the surrounding bodies—which
effect we call their gravitational force. Notice that if the effect of the surrounding bodies is small the acceleration is
small and so we obtain as a limiting case, Newton’s law of inertia which says that a body subject to no forces has no acceleration; a law which, as
Poincaré justly observed, can never be subjected to experimental justification. The natural
questions then arise: which is the absolute and privileged reference-frame and how
must the simple laws be modified when we use a frame more convenient for us—one attached
to the earth let us say? The absolute frame is one attached to the fixed stars; and
to the absolute or real force defined as above, we must add certain terms, usually
called centrifugal forces. These are referred to as fictitious forces because, as it is explained, they are due to the motion of the reference-frame
with respect to the absolute frame and in no way depend on the distribution of the
[280]surrounding bodies. Gravitational force and centrifugal forces have in common the
remarkable property that they depend in no way on the material of the attracted body
nor on its chemical state; they act on all matter and are in this way different from
other forces met with in nature, such as magnetic or electric forces. Further Newton
found that he could predict the facts of observation accurately on the hypothesis
that two small particles of matter attracted each other, in the direction of the line
joining them, with a force varying inversely as the square of the distance between
them. This law is an “action at a distance” law and so is opposed to the idea (a).
What is the key feature of an absolute frame? Newton was primarily an empiricist in the tradition of Bacon, and he noted the following facts. Let's say we have a reference frame attached to the earth. A small particle of matter, influenced by the gravitational forces of surrounding bodies, including the earth, will experience a certain acceleration . Now, if we imagine removing the surrounding bodies (since we can't remove the earth, we'll treat this as a theoretical scenario) and introducing a different set; the particle, back at its original position, will start to move with an acceleration
. If both sets of surrounding bodies are present at the same time, the particle will begin moving with an acceleration that is approximately but not quite the sum of
and
. Newton proposed that there is a certain absolute reference frame where this approximation would actually hold true; thus, the acceleration relative to this material frame provides a useful measure of the effects of the surrounding bodies—what we refer to as their gravitational force. Notice that if the effect of the surrounding bodies is minimal, the acceleration is also minimal, leading us to the limiting case of Newton’s law of inertia, which states that a body with no forces acting on it has no acceleration; a law that, as Poincaré rightly pointed out, can never be validated through experimentation. This raises natural questions: which is the absolute and privileged reference frame, and how must we adjust the simple laws when using a more convenient frame—like one attached to the earth? The absolute frame is one connected to the fixed stars; to the absolute or true force defined earlier, we must add certain terms, often termed centrifugal forces. These are called fictitious forces because, as explained, they arise from the motion of the reference frame relative to the absolute frame and do not depend on how the surrounding bodies are arranged. Gravitational force and centrifugal forces share the notable feature that they do not depend on the material of the attracted body or its chemical state; they act on all matter in this respect, which distinguishes them from other forces in nature, such as magnetic or electric forces. Furthermore, Newton discovered that he could accurately predict observations based on the hypothesis that two small particles of matter attract each other along the line connecting them, with a force that varies inversely with the square of the distance between them. This law embodies the concept of “action at a distance” and thus opposes the notion (a).
We have tacitly supposed that the space in which we make our measurements is that made familiar to us by the study of Euclid’s elements. The characteristic property of this space is that stated by the theorem of Pythagoras that the distance between two points is found by extracting the square root of the sum of the squares of the differences of the Cartesian coordinates of the two points. Mathematicians have long recognized the possibility of other types of space and Einstein has followed their lead. He abandons the empiricist method and when asked what he means by a point in space replies that to him a point in space is equivalent to four numbers how obtained it is unnecessary to know a priori; in certain special cases they may be the three Cartesian coordinates of the experimenter (measured with reference to a definite material framework) together with the time. Accordingly he says his space is of four dimensions. Between any two [281]“points” we may insert a sequence of sets of four numbers, varying continuously from the first set to the second, thus forming what we call a curve joining the two points. Now we define the “length” of this curve in a manner which involves all the points on it and stipulate that this length has a physical reality, i.e., according to our idea (b) its value is independent of the particular choice of coordinates we make in describing the space. Among all the joining curves there will be one with the property of having the smallest length; this is called a geodesic and corresponds to the straight line in Euclidean space. We must now, for lack of an a priori description of the actual significance of our coordinates, extend the idea of vector so that we may speak of the components of a vector no matter what our coordinates may actually signify. In this way are introduced what are known as tensors; if two tensors are equal, i.e., have all their components equal, in any one set of coordinates they are equal in any other and the fundamental demand of the new physics is that all physical equations which are not merely the expression of equality of magnitudes must state the equality of tensors. In this way no one system of coordinates is privileged above any other and the laws of physics are expressed in a form independent of the actual coordinates chosen; they are written, as we may say, in an absolute form.
We have implicitly assumed that the space in which we take our measurements is the one made familiar to us through Euclid’s elements. The defining feature of this space is described by the Pythagorean theorem, which states that the distance between two points is found by taking the square root of the sum of the squares of the differences of the Cartesian coordinates of those points. Mathematicians have long recognized the possibility of different types of space, and Einstein has built on this idea. He drops the empiricist method and, when asked what he means by a point in space, responds that to him, a point in space is represented by four numbers how obtained it is unnecessary to know a priori; in certain special cases, these might be the three Cartesian coordinates of the experimenter (measured with respect to a specific material framework) plus the time. Thus, he states that his space is four-dimensional. Between any two [__A_TAG_PLACEHOLDER_0__] "points," we can insert a sequence of sets of four numbers that vary continuously from the first set to the second, creating what we call a curve connecting the two points. We then define the “length” of this curve in a way that involves all the points on it and assert that this length has a physical reality, i.e., according to our idea (b), its value is independent of the specific choice of coordinates we use to describe the space. Among all the connecting curves, there will be one that has the smallest length; this is called a geodesic and corresponds to the straight line in Euclidean space. Now, due to the lack of an a priori description of the actual significance of our coordinates, we need to expand the concept of a vector so we can discuss the components of a vector regardless of what our coordinates actually mean. This is how what are known as tensors are introduced; if two tensors are equal, i.e., have all their components equal in one set of coordinates, they are equal in any other set as well, and the fundamental requirement of the new physics is that all physical equations that are more than just expressions of equality of magnitudes must state the equality of tensors. This way, no single system of coordinates is favored over another, and the laws of physics can be expressed in a form independent of the specific coordinates chosen; they can be said to be written in an absolute form.
The Gravity Hypothesis
Einstein flatly denies Newton’s hypothesis that there is an absolute system (and, indeed, many others [282]before him had found it difficult to admit that so insignificant a part of the universe as our fixed star system should have such a privileged position as that accorded to it in the Newtonian Mechanics). In any system, he says, we have no reason to distinguish between the so-called real gravitational force and the so-called fictitious centrifugal forces—if we wish so to express it gravitational force is fictitious force.1 A particle moving in the neighborhood of material bodies moves according to a law of inertia—a physical law expressible, therefore, in a manner quite independent of the choice of coordinates. The law of inertia is that a particle left to itself moves along the geodesics or shortest lines in the space. If the particle is remote from other bodies the space has the Euclidean character and we have Newton’s law of inertia; otherwise the particle is in a space of a non-Euclidean character (the space being always the four-dimensional space) and the path of the particle is along a geodesic in that space. Einstein, in order to make the theory more concrete, makes a certain stipulation as to the nature of the gravitational space which stipulation is expressed, as are all physical laws, by means of a tensor equation—and this is sometimes called his law of gravitation.
Einstein outright rejects Newton’s idea that there is an absolute system (and, in fact, many others [__A_TAG_PLACEHOLDER_0__]before him struggled to accept that such a minor part of the universe as our solar system should have such a special status in Newtonian mechanics). He argues that in any system, there’s no reason to differentiate between the so-called real gravitational force and the so-called fictitious centrifugal forces—if we want to frame it that way, gravitational force is a fictitious force.1 A particle moving near material bodies behaves according to a law of inertia—a physical law that can be expressed independently of the choice of coordinates. The law of inertia states that a particle left on its own moves along the geodesics or shortest paths in space. If the particle is far from other bodies, the space has a Euclidean nature, and we observe Newton’s law of inertia; otherwise, the particle exists in a non-Euclidean space (which is always four-dimensional), and its path follows a geodesic in that space. To clarify the theory, Einstein makes a specific assumption about the nature of gravitational space, which is described, like all physical laws, through a tensor equation—and this is sometimes referred to as his law of gravitation.
Perhaps it will be well, in exemplification, to explain why light rays, which pass close to the sun, should be bent according to the new theory. It is assumed that light rays travel along certain geodesics known as minimal geodesics. The sun has an intense gravitational field near it—or, as we now say, the [283]departure of the four-dimensional space from the Euclidean is very marked for points near the sun—but for points so remote as the earth this departure is so small as to be negligible. Hence the form of the geodesics near the sun is different from that near the earth. If the space surrounding the sun were Euclidean the actual paths of the light rays would appear different from geodesics or straight-lines. Hence Einstein speaks of the curvature of the light rays due to the gravitational field of the sun; but we must not be misled by a phrase. Light always travels along geodesics (or straight lines—the only definition we have of a straight line is that it is a geodesic); but, owing to the “distortion” of the space they traverse, due to the sun, these geodesics reach us with a direction different from that they would have if they did not pass through the markedly non-Euclidean space near the sun.
Maybe it would be helpful to explain why light rays that pass close to the sun are bent according to the new theory. It is assumed that light rays travel along specific paths known as minimal geodesics. The sun has a strong gravitational field around it—or, as we now say, the [__A_TAG_PLACEHOLDER_0__]departure of four-dimensional space from Euclidean geometry is very noticeable near the sun—but for points as far away as the earth, this deviation is so small that it can be ignored. Therefore, the shape of the geodesics near the sun is different from those near the earth. If the space surrounding the sun were Euclidean, the actual paths of the light rays would look different from geodesics or straight lines. That’s why Einstein talks about the curvature of light rays caused by the sun's gravitational field; however, we shouldn't be misled by the wording. Light always travels along geodesics (or straight lines—the only definition we have of a straight line is that it's a geodesic); but due to the “distortion” of the space it moves through, caused by the sun, these geodesics reach us at a different angle than they would if they didn't pass through the significantly non-Euclidean space near the sun.
The consideration of the fundamental four-dimensional space as being non-Euclidean where matter is present gives a possibility of an answer to the world old question: Is space finite or infinite? Is time eternal or finite? The fascinating possibility arises that the space may be like the two-dimensional surface of a sphere which to a limited experience seems infinite in extent and flat or Euclidean in character. A new Columbus now asks us to consider other possibilities in which we should have a finite universe—finite not only as to space measurement but as to time (for the space may be such that all of the four coordinates of its points are bounded in magnitude). However, although Einstein speaks of the possibility of a finite universe, we do not, personally, think his [284]argument convincing. Points on a sphere may be located by the Cartesian coordinates of their stereographic projections on the equatorial plane and these coordinates, which might well be those actually measured, are not bounded.
The idea of viewing fundamental four-dimensional space as non-Euclidean in the presence of matter opens up possibilities for answering the age-old question: Is space finite or infinite? Is time eternal or finite? It is intriguing to consider that space could be like the two-dimensional surface of a sphere, which can seem infinite and flat or Euclidean based on limited experience. A new thinker now challenges us to explore other options where we could have a finite universe—not just in terms of spatial measurement but also regarding time (since space might be structured such that all four coordinates of its points are limited in value). However, although Einstein mentions the possibility of a finite universe, we personally do not find his argument convincing. Points on a sphere can be identified by the Cartesian coordinates of their stereographic projections on the equatorial plane, and these coordinates, which could very well be the ones actually measured, are not confined.
The Theory of Special Relativity
In our account of the Einstein theory we have not followed its historical order of development for two reasons. Firstly, the earlier Special Relativity Theory properly belongs to a school of thought diametrically opposed to that furnishing the “General Theory of Relativity” and, secondly, the latter cannot be obtained from the former by the process of generalization as commonly understood. Einstein, when proposing the earlier theory, adopted the position of the empiricist so that to him the phrase, a point in space, had no meaning without a material framework of reference in which to measure space distances. When he came to investigate what is meant by time and when he asked the question “what is meant by the statement that two remote events are simultaneous?” it became evident that some mode of communication between the two places is necessary; the mode adopted was that by means of light-signals. The fundamental hypothesis was then made that the velocity of such signals is independent of the velocity of their source (some hypothesis is necessary if we wish to compare the time associated with events, when one material reference-system is used, and the corresponding time when another in motion relative to the first is adopted). It develops [285]that time and space measurements are inextricably interwoven; there is no such thing as the length of a body or the duration of an event but rather these are relative to the reference-system.2 Minkowski introduced the idea of the space of events—of four dimensions—but this space was supposed Euclidean like the three-dimensional space of his predecessors. To Einstein belongs the credit of taking from this representation a purely formal mathematical character and of insisting that the “real” space—whose distances have a physical significance—is the four-dimensional space. But we cannot insist too strongly on the fact that in the gravitational space of the general theory there is no postulate of the constancy of velocity of a light-signal and accordingly no method of assigning a time to events corresponding to that adopted in the special theory. In this latter theory attention was confined to material systems moving with uniform velocity with respect to each other and it developed that the velocity of light was the ultimate velocity faster than which no system could move—a result surprising and a priori rather repugnant. It is merely a consequence of our mode of comparing times of events; if some other method—thought transference, let us say—were possible the velocity of this would be the “limiting velocity.”
In our discussion of Einstein's theory, we haven't followed the historical order of its development for two reasons. First, the earlier Special Relativity Theory is tied to a completely different school of thought than the one that developed the “General Theory of Relativity.” Second, you can't derive the latter from the former through the typical process of generalization. When Einstein proposed the earlier theory, he took an empiricist stance, meaning that to him, the term a point in space only had meaning within a material framework for measuring spatial distances. When he looked into the concept of time and questioned, “What does it mean for two distant events to be simultaneous?” it became clear that some form of communication between the two locations is essential; he chose light signals as that communication method. The central hypothesis then stated that the speed of these signals is constant, no matter the speed of their source (some hypothesis is necessary for comparing the time associated with events in one material reference system and the corresponding time when another system is in motion relative to the first). It turns out [__A_TAG_PLACEHOLDER_0__] that measurements of time and space are deeply interconnected; there is no absolute measure of the length of an object or the duration of an event; instead, these are relative to the reference system used. Minkowski introduced the concept of a four-dimensional space of events, but this space was assumed to be Euclidean, like the three-dimensional space of his predecessors. Einstein deserves credit for taking this representation and giving it a purely formal mathematical character, insisting that the “real” space, where distances have physical meaning, is four-dimensional space. However, it’s crucial to emphasize that in the gravitational space of the general theory, there is no assumption of constant velocity for a light signal, and thus no way to assign a time to events like in the special theory. In the latter, the focus was on material systems moving at a consistent speed relative to each other, leading to the conclusion that the speed of light is the ultimate speed—faster than which no system can move—a result that was surprising and, at first, quite uncomfortable to accept. This is simply a result of how we compare the timing of events; if another method—like thought transference—were possible, that speed would be the “limiting velocity.”
In conclusion we should remark that the postulated equivalence of “gravitational” and “centrifugal” forces demands that anything possessed of inertia will be acted upon by a gravitational field and [286]this leads to a possible identification of matter and energy. Further our guiding idea (a) will prompt us to say, following the example of Faraday in his electrical researches, that the geodesics of a gravitational space have a physical existence as distinct from a mere mathematical one. The four-dimensional space we may call the ether, and so restore this bearer of physical forces to the position it lost when, as a three-dimensional idea in the Special Relativity Theory, it had to bear an identical relation to a multitude of relatively moving material systems. The reason for our seemingly paradoxical title for an essay on Relativity will be clear when it is remembered that in the new theory we consider those space-time properties which are absolute or devoid of reference to any particular material reference-frame. Nevertheless, although the general characteristics of the theory are thus described, without reference to experiment, when the theory is to be tested it is necessary to state what the four coordinates discussed actually are—how they are determined by measurement. It is our opinion that much remains to be done to place this portion of the subject on a satisfactory basis. For example, in the derivation of the nature of the gravitational space, surrounding a single attracting body, most of the accounts use Cartesian coordinates as if the space were Euclidean and step from these to polar coordinates by the formulæ familiar in Euclidean geometry. But these details are, perhaps, like matters of elegance, if we shall be allowed to give Einstein’s quotation from Boltzmann, to be left to the “tailor and the cobbler.” [287]
In conclusion, we should note that the proposed equivalence of "gravitational" and "centrifugal" forces suggests that anything with inertia will be influenced by a gravitational field, and [__A_TAG_PLACEHOLDER_0__] this raises the possibility of identifying matter and energy. Furthermore, our guiding idea (a) leads us to say, following Faraday's example in his electrical studies, that the paths in a gravitational space have a physical existence distinct from just a mathematical one. We can refer to the four-dimensional space as the ether, thus restoring this carrier of physical forces to the role it lost when, as a three-dimensional concept in the Special Relativity Theory, it had to maintain an identical connection to many relatively moving material systems. The reason for our seemingly paradoxical title for an essay on Relativity will be clear when we remember that in the new theory, we look at those space-time properties that are absolute or not dependent on any specific material reference frame. However, even though the general characteristics of the theory are described in this way, independent of experiments, when it comes to testing the theory, it's essential to clarify what the four coordinates discussed actually represent—how they are defined through measurement. We believe there is still much work to be done to put this aspect of the topic on a solid foundation. For example, in deriving the nature of the gravitational space around a single attracting body, most explanations use Cartesian coordinates as if the space were Euclidean and then transition to polar coordinates using formulas common in Euclidean geometry. But these specifics are perhaps more a matter of elegance, if we may quote Einstein’s reference to Boltzmann, to be left to the “tailor and the cobbler.” [__A_TAG_PLACEHOLDER_0__]
XVII
THE PHYSICAL SIDE OF RELATIVITY
The Immediate Contacts Between Einstein’s Theories and Current Physics and Astronomy
The Direct Links Between Einstein's Theories and Today's Physics and Astronomy
The Theory of Relativity will be treated first from the physical side, leaving the three astronomical tests to which it has been put to be discussed later. There is one astronomical fact however that must be mentioned in this connection, and this is the discovery of the aberration of light by Bradley in 1726. It is found that every star in the heavens apparently describes a small annual ellipse, whose major axis is 41″ in length. This Bradley showed to be due to a combination of the velocity of the earth in its orbit, and the velocity of light; and it is so explained in all the elementary text-books on astronomy. It implies a stationary ether through which the earth is moving. The importance of this statement will appear presently.
The Theory of Relativity will first be discussed from the physical perspective, with the three astronomical tests it has undergone being addressed later. However, there is one astronomical detail that must be mentioned here: the discovery of light aberration by Bradley in 1726. It has been observed that every star in the sky seems to trace out a small annual ellipse, with a major axis measuring 41 seconds of arc. Bradley demonstrated that this effect is due to a combination of the Earth's velocity in its orbit and the speed of light, and this explanation is found in all basic astronomy textbooks. It suggests the existence of a stationary ether through which the Earth moves. The significance of this statement will become clear shortly.
The subject is usually illustrated by supposing a [288]man to go out in a rainstorm carrying a vertical tube. If the rain is falling vertically, and the man stands still, the sides of the tube will not be wet, save by an occasional drop, but if the tube is moved, it must then be inclined forward in order to keep it dry. The angle of inclination, which corresponds to aberration, will depend on the relative velocity of the tube, corresponding to the earth, and the rain drops which correspond to the waves of light.
The topic is typically explained by imagining a [__A_TAG_PLACEHOLDER_0__]man going out in a rainstorm with a vertical tube. If the rain is falling straight down and the man stays still, the sides of the tube won't get wet, except for an occasional drop. However, if the tube is moved, it will need to be tilted forward to stay dry. The angle of tilt, which relates to aberration, will depend on the tube's speed compared to the earth and the raindrops that correspond to light waves.
If three lines are dropped upon a point in space, each line being perpendicular to the plane containing the other two, we have what is known as a system of coordinates. Einstein’s original theory of relativity, which he now designates as the “special theory,” depends on two principles. The first is that “Every law of nature which holds good with respect to a coordinate system K must also hold good for any other system K′, provided that K and K′ are in uniform movement of translation.” The second principle is that “Light in a vacuum has a definite and constant velocity, independent of the velocity of its source.”
If three lines are dropped onto a point in space, with each line being perpendicular to the plane formed by the other two, we have what's called a coordinate system. Einstein's original theory of relativity, which he now refers to as the "special theory," is based on two principles. The first is that "Every law of nature that applies to a coordinate system K must also apply to any other system K′, as long as K and K′ are in uniform motion." The second principle is that "Light in a vacuum travels at a specific and constant speed, regardless of the speed of its source."
These two sentences may be considered as authoritative, being quoted in Einstein’s own words.1 The first of these principles need not greatly surprise us. The second is not well expressed, because it is ambiguous. He does not say how the first “velocity” is measured, whether relatively to the ether or relatively to the observer. In fact this is the very gist of the whole matter, as we shall presently see. In the case of sound the velocity is constant with regard to the medium, the air, in the case of light it is supposed to be constant with regard to the [289]observer. It reaches him with a constant velocity, no matter how he moves.
These two sentences can be seen as authoritative, quoted in Einstein’s own words.1 The first principle isn’t particularly surprising. The second is not clearly stated because it's ambiguous. He doesn’t specify how the first “velocity” is measured, whether in relation to the ether or to the observer. In fact, this is the crux of the entire issue, as we will soon see. For sound, the velocity is constant relative to the medium, which is air, while for light, it is assumed to be constant relative to the [__A_TAG_PLACEHOLDER_0__]observer. It reaches him at a constant velocity, regardless of his motion.
In order to understand this statement clearly let us consider the appended tabular diagram. On a calm day imagine a source of sound at S in line a. This may be either a gun or a bell. Imagine an observer 1,100 feet distant, located at O. The velocity of sound in air is 1,100 feet per second. This velocity we will take as unity, as indicated in the third column, and the velocity with which the sound reaches the observer is also 1, as shown in the fourth. It will reach him in a unit interval of 1 second, as shown in the fifth. If the bell is struck, it will give its normal pitch or frequency, which we will also call unity, in the sixth column.
To clearly understand this statement, let's look at the table below. On a calm day, imagine a sound source at S along line a. This could be either a gun or a bell. Picture an observer who is 1,100 feet away, positioned at O. The speed of sound in air is 1,100 feet per second. We will consider this speed as one unit, as indicated in the third column, and the speed at which the sound reaches the observer is also 1, as shown in the fourth column. The sound will reach him in a unit interval of 1 second, as noted in the fifth column. If the bell is struck, it will produce its normal pitch or frequency, which we will also refer to as one unit, in the sixth column.
Now imagine case b where the observer is on a train advancing toward S. When he is 1,100 feet distant, the gun is fired, but as he is advancing toward it, he hears it at O in rather less than a second, as shown in the fifth column. The velocity of the sound with regard to him is rather more than unity, as shown in the fourth column. If the bell is sounded, the pitch, that is the frequency, is raised, because he receives more sound waves per second than before.
Now imagine case b, where the observer is on a train moving toward S. When he is 1,100 feet away, the gun is fired, but as he is moving toward it, he hears it at O in just under a second, as shown in the fifth column. The speed of sound relative to him is slightly more than one, as indicated in the fourth column. If the bell rings, the pitch—that is, the frequency—increases because he receives more sound waves per second than before.
In case c the observer is stationary, but the source of sound is receding. At a distance of 1,100 feet the gun is fired, and the observer hears it after an interval of just one second, as in case a. The velocities with regard to the observer and through the medium are also unity. If the bell is struck the pitch is lowered, since he receives fewer sound waves per second, the reverse of case b. [290]
In case c, the observer is standing still, but the sound source is moving away. When the gun is fired 1,100 feet away, the observer hears it after just one second, similar to case a. The speeds relative to the observer and through the medium are also the same. When the bell is struck, the pitch drops because the observer receives fewer sound waves per second, which is the opposite of case b. [__A_TAG_PLACEHOLDER_0__]
Velocity | ||||||||
Source | in Medium | to Observer | Interval | Frequency | Observer | |||
Air | ||||||||
a | S | 1 | 1 | 1 | 1 | O | ||
b | S | 1 | 1 + | 1 - | 1 + | O | ||
c | S | 1 | 1 | 1 | 1 - | O | ||
d | S | 1 | 1 + | 1 - | 1 | O | ||
Ether | ||||||||
A | S | 1 | 1 | 1 | 1 | O | ||
B | S | 1 - | 1 | 1 | 1 + | O | ||
C | S | 1 | 1 | 1 | 1 - | O | ||
D | S | 1 - | 1 | 1 | 1 | O |
In case d imagine the source and the observer 1,100 feet apart, and advancing on the same train. When the gun is fired, the velocity of the sound waves will be greater with regard to the observer, and he will hear the sound in less than a second, as in case b. When the bell is struck it will have the normal pitch, the same as in case a.
In the scenario where d visualizes the source and the observer as being 1,100 feet apart and moving towards each other on the same train, when the gun is fired, the speed of the sound waves will be faster for the observer, and he will hear the sound in under a second, similar to case b. When the bell rings, it will have the normal pitch, just like in case a.
We find therefore that for sound the velocity with regard to the medium is always unity, while the velocity with regard to the observer, and the interval elapsed, depend only on the motion of the observer himself, and are independent of the motion of the source. The frequency of the vibrations, on the other hand, depends only on the relative motion of the observer and the source, but is independent of their common motion in any direction. Further, it makes no difference whether the source and the observer are moving on a train, or whether they are stationary, and a uniform wind is blowing past them.
We find that the speed of sound in relation to the medium is always constant, while the speed related to the observer and the time that has passed depend only on the motion of the observer and not on the motion of the source. The frequency of the vibrations, however, is determined solely by the relative motion between the observer and the source, and is not affected by their shared motion in any direction. Additionally, it doesn’t matter if the source and the observer are moving on a train or if they are stationary while a steady wind blows past them.
In the case of light waves we shall find a very different state of affairs, although the rules for frequency are the same as they are for sound. In case A we have the normal conditions, where both the [291]source and observers are stationary. In case B we have a representation of the Michelson-Morley experiment as supplemented by that of Majorana,2 where the source is stationary and the observer advances. Unlike the case of sound, the interval elapsed, as shown by the experiment, is now the same as in case A, and since the distance to the observer is less, the velocity of light with respect to the ether must also be less than unity. Since the observer is advancing against the light, this will permit the velocity of light with regard to the observer to remain unity, in conformity with the second principle of relativity. Compare with case b for sound. As Jeans expresses it, “The velocity of light in all directions is the same, whatever the motion of the observer.”3 That is to say it appears to be the same to him, however he moves.
In the case of light waves, we see a very different situation, although the rules for frequency are the same as they are for sound. In case A, we have the normal circumstances, where both the [__A_TAG_PLACEHOLDER_0__]source and observers are stationary. In case B, we have a representation of the Michelson-Morley experiment, supplemented by Majorana,2 where the source is stationary and the observer is moving forward. Unlike with sound, the time that has passed, as demonstrated by the experiment, is now the same as in case A, and since the distance to the observer is shorter, the speed of light relative to the ether must also be less than one. Since the observer is moving towards the light, this allows the speed of light relative to the observer to remain one, in accordance with the second principle of relativity. Compare this with case b for sound. As Jeans puts it, “The velocity of light in all directions is the same, whatever the motion of the observer.”3 In other words, it seems to be the same to him, no matter how he moves.
Case C represents Einstein’s statement, as confirmed by Majorana’s experiment. It does not differ from case c for sound. Case D is more complex, but accepting the statement above that the velocity is constant with regard to the observer, we see that the velocity through the medium must be less, and that the interval elapsed will be constant, as in case B. Could we use the brighter stars and planets as sources of light, several of these cases could be further tested.
Case C reflects Einstein’s assertion, as confirmed by Majorana’s experiment. It doesn’t differ from case c regarding sound. Case D is more complicated, but if we accept the previous statement that the velocity is constant in relation to the observer, we see that the velocity through the medium must be lower, and the time taken will remain constant, similar to case B. If we could use the brighter stars and planets as light sources, several of these cases could be tested further.
This brings us at once to statements that contradict our common sense. For instance, Jeans says “no matter what the velocity of the observer is, the light surface, as observed by that observer, is invariably a sphere having that observer as center.”3 That is to say the light surface, or wave front, is a contracting, [292]not an expanding, sphere. This, if confirmed, would go a long way toward making our universe a subjective rather than an objective phenomenon. Again imagine a flash of light, such as an explosion, to occur when an observer is in a given position. It makes no difference how the observer may move while the light is approaching him, whether several miles forward or backward, the light will reach him in exactly the same time, as is shown by Michelson’s experiment. Or if two observers are at the same spot when the explosion occurs, and one moves forward, and the other backward, they will both see the explosion at exactly the same instant.
This brings us directly to statements that go against our common sense. For example, Denim jeans says, “no matter what the observer's speed is, the light surface, as seen by that observer, is always a sphere with that observer at its center.”3 In other words, the light surface, or wave front, is a contracting, [__A_TAG_PLACEHOLDER_0__] not an expanding, sphere. If this is confirmed, it would significantly suggest that our universe is more subjective than objective. Imagine a flash of light, like an explosion, occurring while an observer is in a specific position. It doesn’t matter how the observer moves while the light is coming towards them—whether they move several miles forward or backward—the light will reach them at exactly the same time, as demonstrated by Michelson’s experiment. If two observers are at the same location when the explosion happens, and one moves forward while the other moves backward, they will both see the explosion at exactly the same moment.
This sounds ridiculous, but not only is it what Jeans says, but it is the logical interpretation of Einstein’s second principle, if Einstein means by velocity, velocity with regard to the observer. If he means velocity with regard to the medium, then the case is exactly the same as that of sound in air, and Michelson’s experiment as well as the Maxwell-Lorentz theory of light are contradicted. This theory is now universally accepted, and Michelson’s experiment has been carefully repeated by other observers, and fully confirmed. This is the very heart of the relativity question.
This may sound absurd, but it's not only what Jeans argues; it's also the logical interpretation of Einstein’s second principle, assuming Einstein is talking about velocity in relation to the observer. If he's referring to velocity in relation to the medium, then the situation is identical to that of sound in air, which would contradict Michelson’s experiment as well as the Maxwell-Lorentz theory of light. This theory is now widely accepted, and Michelson’s experiment has been thoroughly repeated by other observers and has been fully validated. This is the core of the relativity issue.
If we state the matter objectively it comes to this. The velocity of light with regard to the ether is a variable quantity, depending merely on where the observer chooses to go. As Eddington well says, “these relations to the ether have no effect on the phenomena and can be disregarded—a step which appears to divest the ether of the last remnants of substantiality.”4 [293]
If we put it plainly, this is what it boils down to. The speed of light in relation to the ether is a changing amount, depending only on where the observer decides to move. As Eddington rightly points out, “these connections to the ether don’t affect the phenomena and can be ignored—a move that seems to strip the ether of its last bits of substance.”4 [__A_TAG_PLACEHOLDER_0__]
The only way of avoiding this apparent absurdity seems to be to consider that the ether moves with the earth. Michelson’s result would then be fully explained. Of course this can only be true for a few miles above the earth’s surface. Beyond that the ether must either be stationary or move with the sun. The velocity of light with regard to the ether would then be a constant, just as the velocity of sound is constant with regard to the air. This would contradict Einstein’s second principle as it is generally understood. The trouble with this suggestion is that it fails to account for aberration, which, as already explained, appears to require that the earth should be moving through the ether. To meet this emergency would involve some modification of the undulatory theory of light, which apparently would not be impossible, but has not yet been made.
The only way to avoid this obvious contradiction seems to be to think of the ether moving with the Earth. This would fully explain Michelson’s results. Clearly, this can only be true a few miles above the Earth's surface. Beyond that, the ether must either be stationary or move with the sun. The speed of light concerning the ether would then be constant, similar to how the speed of sound is constant concerning air. This would go against Einstein’s second principle as it is commonly understood. The problem with this idea is that it doesn’t account for aberration, which, as previously explained, seems to require that the Earth is moving through the ether. Addressing this issue would necessitate some changes to the wave theory of light, which apparently isn’t impossible but hasn’t been made yet.
In 1915 Einstein brought out an extension of his first principle. This he calls the “general theory of relativity.” It states that in our choice of coordinate systems we “should not be limited in any way so far as their state of motion is concerned.”1 This leads to the three astronomical consequences mentioned later in this paper, two of which have been more or less confirmed, and the third practically contradicted as far as quantitative measures are concerned.5
In 1915, Einstein introduced an expansion of his first principle, which he refers to as the “general theory of relativity.” It asserts that we “should not be restricted in any way regarding their state of motion” when choosing coordinate systems.1 This leads to three astronomical consequences discussed later in this paper, two of which have been somewhat confirmed, while the third has been largely contradicted in terms of quantitative measurements.5
As is well known the kinetic energy of a moving body may be expressed as , but if the body is charged electrically, the fraction becomes
, where m′ is a quantity dependent on the square of the electrical charge. That is to say, [294]we have the normal mass of the body, and also what we may call its electrical mass.
If when in this condition a portion of the mass is electrical, the question at once
occurs to us, why may not the whole mass be electrical, in other words, a form of
energy? Although this has not been satisfactorily proved hitherto, yet such is the
general belief among physicists. As Einstein puts it “inert mass is nothing else than
latent energy.”1 The same idea is sometimes expressed as “the mass of ordinary matter is due to the
electromagnetic energy of its ultimate particles, and electromagnetic energy wherever
found must possess mass, i.e., inertia.”6 If that is so, since a ray of light on the undulatory theory is a form of electromagnetic
energy, it too must possess mass. Since all mass with which we are familiar is subject
to the attraction of gravitation, it seemed likely that a ray of light would be bent
out of its course in passing near the sun, and this as we have seen was proved to
be true at the recent solar eclipse.
As everyone knows, the kinetic energy of a moving object can be expressed as . However, if the object has an electric charge, the formula changes to
, where m′ is a value based on the square of the electric charge. In other words, [__A_TAG_PLACEHOLDER_0__] we have the standard mass of the object, plus what we can refer to as its electrical mass. If, under these conditions, part of the mass is electrical, it raises the question of why the entire mass couldn't be electrical, essentially a form of energy. Although this hasn't been conclusively proven yet, many physicists generally believe it to be true. As Einstein said, “inert mass is nothing else than latent energy.”1 This idea is sometimes paraphrased as “the mass of ordinary matter results from the electromagnetic energy of its fundamental particles, and electromagnetic energy, wherever it exists, must have mass, i.e., inertia.”6 If that's the case, since a beam of light is a kind of electromagnetic energy according to the wave theory, it too must have mass. Given that all familiar mass is affected by gravitational attraction, it seemed plausible that a beam of light would be deflected from its path when passing near the sun, which, as we have seen, was confirmed during the recent solar eclipse.
That portion of the mass of a body due to its electrical charge can be readily shown experimentally to vary with the velocity of the body. Einstein has shown the same to be true of the normal mass, as is illustrated in the advance of the perihelion of the orbit of Mercury. He has also pointed out that gravitation, inertia and centrifugal force are all closely related, and obey similar laws. Thus if we rise from the earth with accelerated velocity, we apparently increase our weight. Again if the velocity of rotation of the earth on its axis should be increased, our weight would be diminished. These [295]facts are suggestive when we come to consider the ultimate cause of gravitation.
That part of a body's mass that comes from its electrical charge can be easily shown through experiments to change with the body's speed. Einstein demonstrated that the same applies to normal mass, as seen in the advance of Mercury's orbit around the sun. He also pointed out that gravity, inertia, and centrifugal force are all closely connected and follow similar laws. So when we rise from the Earth with increased speed, we seemingly increase our weight. Conversely, if the Earth's rotation on its axis speeds up, our weight would decrease. These [__A_TAG_PLACEHOLDER_0__]facts are interesting when considering the root cause of gravity.
Another fact which must be rather startling to the older school of scientists is that
momentum is no longer simply mv, mass times velocity, but that the velocity of light c, comes into the question, and the formula for momentum now assumes the form of
Another fact that might be quite surprising to older scientists is that momentum is no longer just mv, which is mass times velocity. Now, we have to consider the speed of light c, and the formula for momentum has changed to
For ordinary velocities this correction is extremely small, but it has been shown to be necessary, both theoretically and experimentally, when dealing with the high velocities with which we are now familiar.
For typical speeds, this adjustment is really minor, but it's been proven to be essential, both in theory and in experiments, when dealing with the high speeds we know today.
The theory of relativity is so widespread in its application that several other theories have become more or less intimately combined with it, for which Einstein is in no way responsible. One of these is known as the Fitzgerald-Lorentz theory, that all bodies are subject to a contraction in the direction of their motions through space. This was first suggested in order to explain the Michelson-Morley experiment, but has proved inadequate to do so, particularly when the observer is receding from the source. This contraction is expressed by the same factor used in the denominator of the revised expression for momentum, given above. Again the quantity c is so enormous, that even for large bodies at planetary velocities the contraction amounts to very little. Thus the earth moving at a speed of eighteen miles per second in its orbit, is flattened only 1/200,000,000, or 2.5 inches. On the other [296]hand for high velocities of many thousand miles per second, such as we have become familiar with in the case of the radioactive substances, the flattening is a very considerable fraction of the diameter of the moving body, one-half or more, and in the case of the corpuscles of light, if that theory were adopted, this flattening becomes equal to the diameter, and their thickness is reduced to zero.
The theory of relativity is so widely applicable that several other theories have become more or less closely linked to it, and Einstein is not responsible for that. One of these is known as the Fitzgerald-Lorentz theory, which states that all objects experience a contraction in the direction of their movement through space. This was initially suggested to explain the Michelson-Morley experiment but has proven inadequate, especially when the observer is moving away from the source. This contraction is represented by the same factor that appears in the denominator of the updated momentum equation mentioned earlier. Again, the quantity c is so huge that even for large objects moving at planetary speeds, the contraction is very minimal. For instance, the Earth, moving at a speed of eighteen miles per second in its orbit, is flattened by only 1/200,000,000, or about 2.5 inches. On the other [__A_TAG_PLACEHOLDER_0__] hand, for extremely high speeds of thousands of miles per second, like those observed with radioactive substances, the flattening becomes a significant fraction of the object's diameter, sometimes half or more. In the case of light particles, if that theory were accepted, this flattening would equal the diameter, reducing their thickness to zero.
When we view Einstein’s theories from the astronomical standpoint, the earliest fact bearing on relativity that we need consider was the discovery of aberration, by Bradley, in 1726, as seen above. In 1872 Airy observed the star γ Draconis through a telescope filled with water. Since the velocity of light is less in water than in air, we should naturally expect to find the aberration appreciably increased. It was found, on the other hand, however, to be unaffected.
When we look at Einstein’s theories from an astronomical perspective, the first important fact related to relativity that we should consider is the discovery of aberration by Bradley in 1726, as mentioned above. In 1872, Airy observed the star γ Draconis through a water-filled telescope. Since light travels slower in water than in air, we would expect to see a noticeable increase in aberration. However, it was discovered that it remained unchanged.
In 1887 the results of the famous Michelson-Morley experiment were published.7 In this experiment the velocity of light was measured in various directions with regard to the motion of the earth in its orbit. If the ether were stationary, and the earth moving through it, different velocities should be obtained in different directions. Such was not the case however, and the experiment indicated that the ether moved with the earth. It thus flatly contradicted the conclusions founded on aberration.
In 1887, the results of the well-known Michelson-Morley experiment were published.7 In this experiment, the speed of light was measured in various directions relative to the Earth's motion in its orbit. If the ether were stationary while the Earth moved through it, different speeds should have been recorded in different directions. However, that was not the case, and the experiment showed that the ether moved along with the Earth. This contradicted the conclusions based on aberration.
Einstein’s Special Theory of Relativity, of 1905, as we have seen, resolves this contradiction. But as we shall presently see, it is the General Theory, of 1915, that leads to astronomical applications of broad scope. It indicates, for instance, that there is [297]no essential difference between gravitation and inertia. This idea may be crudely illustrated by our feelings of increased weight when an elevator starts rapidly upwards. A man while falling freely in space ceases to feel the pull of gravitation.
Einstein's Special Theory of Relativity from 1905, as we've discussed, resolves this contradiction. However, as we will soon see, it's the General Theory from 1915 that leads to a wide range of astronomical applications. For example, it shows that there is [__A_TAG_PLACEHOLDER_0__]no fundamental difference between gravity and inertia. This concept can be roughly illustrated by how we feel an increase in weight when an elevator starts moving quickly upwards. A person falling freely in space no longer feels the force of gravity.
But we must not as yet conceive of the theory of relativity as a universally accepted and unquestioned truth of science. Eddington is its leading English exponent, and he is supported by such men as Jeans, Larmor, and Jeffreys. On the other hand, the theory has been severely criticised by Lodge, Fowler, Silberstein, and Sampson. Few American scientists have expressed any opinions in print on the subject, and the recent eclipse observations, to which we shall refer later, are to be repeated with more suitable instruments for verification in 1922, in the hope of obtaining more accurate and accordant results.8
But we shouldn't think of the theory of relativity as a universally accepted and unquestionable truth in science just yet. Eddington is its main English proponent, and he has the backing of people like Jeans, Larmor, and Jeffreys. On the flip side, the theory has faced strong criticism from Lodge, Fowler, Silberstein, and Sampson. Few American scientists have shared their thoughts on the topic in print, and the recent eclipse observations, which we will discuss later, are set to be repeated with better instruments for verification in 1922, with the hope of getting more accurate and consistent results.8
An appurtenance of the Einstein theories which bears much the same relation to them as does the Lorentz-Fitzgerald contraction, mentioned above, is the idea, first clearly stated by Minkowski, that time is a kind of space—a fourth dimension. This the reader will doubtless find to be the most difficult portion of the theory to picture in his own mind. It is entirely unsupported by experiment or observation, necessarily so, and is based wholly on mathematical and philosophical conceptions. Our distinction between space and time seems to be that the direction in which we progress without effort is time; the other directions, in which we have to make an exertion to move ourselves, or in which we are carried, are space. How many dimensions empty space may have, we really have no means of knowing, because [298]we can neither see nor feel it. Matter we know has three, length, breadth, and thickness, also that it lies remote from us in three corresponding directions. These facts may have given us the erroneous impression that space too has only three dimensions. Now it is claimed that time is a fourth, and that there are also others.
An addition to Einstein's theories that relates to them similarly to the previously mentioned Lorentz-Fitzgerald contraction is the concept, first clearly articulated by Minkowski, that time is a form of space—a fourth dimension. Readers will likely find this to be the most challenging aspect of the theory to visualize. It has no experimental or observational support—this is expected—and is entirely grounded in mathematical and philosophical ideas. Our distinction between space and time seems to be that the direction we move in effortlessly is time; the other directions, where we have to exert effort to move or are carried, are space. We have no way of knowing how many dimensions empty space may have because we can neither see nor feel it. Matter, which we know has three dimensions—length, width, and height—also exists in three corresponding directions from us. These facts may have given us the misleading impression that space has only three dimensions. Now it's proposed that time is a fourth dimension, and there may be others as well.
In order to illustrate this, Eddington asks us to imagine a movie film taken of a man or of any moving object. Let the separate pictures be cut apart and piled on one another. This would form a sort of pictorial history of the individual for a brief interval in his life, in the form of a cube. If we attempt to pick it up, it falls apart, thus clearly showing the difference between time and space. But suppose it now all glued together in one solid cube, so that it is no easier to cut a section in one direction than in another. That is Minkowski’s idea of space and time, and further, that the direction in which we should cut it depends merely on the velocity with which we are moving through space. I should cut it parallel to the films, but a man on a rapidly moving star, in order to separate it into space and time, would cut it in an inclined direction. That is a thing which may be true, but it is one which we believe no mortal man can clearly picture to himself.
To illustrate this, Eddington asks us to imagine a film of a man or any moving object. If we cut apart and stack the individual frames, it would create a visual history of that person for a short period in the shape of a cube. If we try to pick it up, it falls apart, clearly showing the difference between time and space. Now, imagine it all glued together into one solid cube, making it equally difficult to slice through in any direction. This is Minkowski’s concept of space and time, where the direction we should cut depends purely on our speed through space. I would cut it parallel to the frames, but someone on a fast-moving star would cut it at an angle to separate it into space and time. While this might be true, it’s something we believe no human can clearly visualize.
On the other hand Turner has recently made a very interesting point,9 namely, that the fourth dimension as actually treated by the mathematicians is not time itself, but time multiplied by a constant—the velocity of light.10 Without affecting the astronomical proofs of relativity at all, this simplifies our
conceptions enormously. In ordinary everyday life [299]time and space cannot be identical, any more than a yard can be identical with a quart.
On what is known to physicists as the centimeter-gram-second system, distance is represented
by l, mass by m, and time by t. Velocity is then distance divided by time, , or as we say in English units, so many feet per second, and the fourth dimension
may be expressed as time multiplied by velocity,
. That is to say, it is simply distance, just like the other three dimensions. To
say that time is the fourth dimension from this point of view, appears to us just
as ridiculous as it would be to attempt to measure the velocity of a train in quarts.
It is quite correct, however, although unusual, to speak of a given train as moving
at a speed of 10 quarts per square inch per second,
. This would be equivalent to a velocity of 33 miles per hour.
On the other hand, Turner recently made a very interesting point,9 specifically that the fourth dimension as mathematicians actually handle it isn’t time itself, but time multiplied by a constant—the speed of light.10 This simplifies our understanding significantly without changing the astronomical proofs of relativity at all. In everyday life, [__A_TAG_PLACEHOLDER_0__] time and space cannot be the same, just like a yard can’t be the same as a quart.
In what physicists call the centimeter-gram-second system, distance is represented by l, mass by m, and time by t. Velocity is distance divided by time, , or in English units, so many feet per second, and the fourth dimension can be expressed as time multiplied by velocity,
. In other words, it is just distance, like the other three dimensions.
Saying that time is the fourth dimension from this perspective seems just as absurd as trying to measure a train’s speed in quarts. However, it’s technically correct, though unusual, to say a specific train is moving at a speed of 10 quarts per square inch per second,
. This would be equivalent to a speed of 33 miles per hour.
If I wish to give a complete dimensional description of myself in my four dimensions, I must give my length, my breadth, and my thickness, ever since I came into being, and also the course I have traversed through space since that time. This latter distance will be expressed in terms of a unit whose length is 186,000 miles, the distance traversed by light in one second. The distance which I travel through space annually is enormous, and very complex as to direction. It involves not merely my own motions as I cross the room, or take a train or steamer, but also those due to the rotation of the earth on its axis, its revolution round the sun, and the motion of the latter through the heavens. In general I travel, or in other words increase my length in the fourth dimension, by over 4,000 units [300]a year. The fourth dimension accordingly, if this view is accepted, is simply a distance like the other three, and perfectly easy to understand.
If I want to give a complete dimensional description of myself in my four dimensions, I need to provide my length, width, and depth since I came into existence, along with the path I’ve traveled through space during that time. This distance will be measured in a unit that is 186,000 miles, which is how far light travels in one second. The distance I move through space each year is massive and very complicated in terms of direction. It includes not just my own movements as I walk across a room or take a train or boat, but also those caused by the Earth rotating on its axis, its orbit around the sun, and the sun's movement through the galaxy. Overall, I travel—or in other words, increase my length in the fourth dimension—by over 4,000 units [__A_TAG_PLACEHOLDER_0__] each year. Thus, if this perspective is accepted, the fourth dimension is simply a distance like the other three and is perfectly easy to grasp.
We now come to the three actual tests by which the theory has been tried. The planets as is well known revolve about the sun in ellipses, with the sun in one of the foci. That is to say, the sun is not in the center, but a little on one side of it. The end of the ellipse where the planet comes nearest to the sun is called the perihelion, and here the planet is moving most rapidly. The other end is called the aphelion, and here the motion is slowest. According to Newton’s theory of gravitation, if a spherical sun possesses a single planet or companion, its orbit will be permanently fixed in space unless perturbed by some other body. If a second planet exist, it will cause the perihelion of the first slowly to advance. According to Einstein the mass of a planet depends in part on its velocity. It will therefore be less at aphelion where it is moving slowly than at perihelion where it is moving rapidly, consequently in addition to the Newtonian attraction we have another one which increases as we approach the sun. The effect of this will be to cause the perihelion of the orbit to advance, whether there is a second planet or not.
We now come to the three actual tests that have put the theory to the test. The planets, as we know, revolve around the sun in ellipses, with the sun positioned at one of the foci. This means the sun isn’t at the center but slightly off to one side. The end of the ellipse where the planet is closest to the sun is called the perihelion, and it’s at this point that the planet moves fastest. The other end is known as the aphelion, where the motion is slowest. According to Newton’s theory of gravitation, if a spherical sun has one planet or companion, its orbit will stay fixed in space unless affected by another body. If there’s a second planet, it will cause the perihelion of the first to gradually advance. According to Einstein, a planet’s mass partly depends on its speed. Therefore, its mass will be less at aphelion, where it moves slowly, than at perihelion, where it moves quickly. As a result, in addition to Newtonian attraction, there’s another force that increases as we get closer to the sun. This will cause the perihelion of the orbit to advance, whether there is a second planet or not.
Among the larger planets Mercury has the most eccentric orbit, and it also moves most rapidly, so that it is particularly well adapted to test the relativity theory. The observed advance of its perihelion is 574″ per century, instead of the theoretical figure 532″, due to the other planets—a difference of 42″.11 This has long been a puzzling discrepancy between observation and the law of gravitation. [301]Prior to Einstein, attempts were made to eliminate it by assuming a certain oblateness of the solar disk. If the equatorial diameter exceeded the polar by only 0″.5 the whole advance would be accounted for, but not only has this ellipticity failed of detection, but if it existed, it should produce a very noticeable and inadmissible change in the inclination of Mercury’s orbit, amounting to about 3″ per century, as has been demonstrated by both Herzer and Newcomb.12
Among the larger planets, Mercury has the most unusual orbit and moves the fastest, making it especially suitable for testing the theory of relativity. The observed advance of its perihelion is 574″ per century, compared to the theoretical figure of 532″, influenced by the other planets—a difference of 42″.11 This discrepancy between observation and the law of gravitation has puzzled scientists for a long time. [__A_TAG_PLACEHOLDER_0__]Before Einstein, people tried to resolve it by assuming a certain flattening of the solar disk. If the equatorial diameter was just 0″.5 larger than the polar diameter, the entire advance would be explained. However, this flattening has not been detected, and even if it did exist, it should lead to a noticeable and unacceptable change in the inclination of Mercury’s orbit, amounting to about 3″ per century, as shown by both Herzer and Newcomb.12
Einstein from computations alone, without introducing any new constants or hypotheses whatever, showed, if the theory of relativity be accepted, that the sun should produce an acceleration of 43″ per century, thus entirely accounting for the observed discrepancy, far within the limits of accuracy of the observations. The only other planet whose orbit has a large eccentricity, and that is suitable for investigation, is the planet Mars. Here the discrepancy between observation and theory is very slight, only 4″, and a portion of that may be due to the attraction of the asteroids. This deviation is so slight that it may well be due entirely to accidental errors of observation, but however that may be, Einstein’s theory reduces it to 2″.7.
Einstein demonstrated through calculations alone, without introducing any new constants or hypotheses, that if we accept the theory of relativity, the sun should cause an acceleration of 43″ per century. This completely explains the observed discrepancy, which is well within the limits of observation accuracy. The only other planet with a significant eccentric orbit that can be studied is Mars. Here, the difference between observation and theory is very minor, just 4″, and part of that could be due to the gravitational pull of the asteroids. This small deviation could very well be attributed entirely to random observational errors, but regardless, Einstein's theory reduces it to 2″.7.
This all seems very satisfactory and complete, but the trouble with it is that the coincidence for Mercury is rather too good. It is based on the assumption that the sun is a perfect sphere, and that the density of its surface is uniform from the equator to the poles. This would doubtless be true if the sun did not revolve on its axis. In point of fact it does revolve, in a period in general of about 26 [302]days. Consequently an object on its equator must experience a certain amount of centrifugal force. Therefore if its surface were of uniform density the shape of the sun would be an oblate spheroid.
This all seems very satisfactory and complete, but the issue with it is that the fit for Mercury looks a bit too perfect. It relies on the assumption that the sun is a perfect sphere and that its surface density is the same from the equator to the poles. This would probably hold true if the sun didn't rotate on its axis. In reality, it does rotate, roughly every 26 [__A_TAG_PLACEHOLDER_0__] days. As a result, an object at its equator must experience some centrifugal force. So, if its surface had uniform density, the shape of the sun would be an oblate spheroid.
It can be readily shown that the theoretical excess of the equatorial over the polar diameter, due to the centrifugal force, should amount to only 0″.04, an amount which could hardly be detected by observation, and might readily be concealed by a slight excess of equatorial over polar density. Any reasonable excess of density at the center would diminish this result but slightly. The molecular weight of the central material13 is probably about 2. This computed equatorial excess is one-twelfth of the amount necessary to cause the observed advance, and should therefore cause an advance of the perihelion of about 3″.5 per century, reducing the difference between the observed advance and that caused by gravitation to 38″.5. According to Einstein the advance due to relativity should be, as we saw, 43″, a discrepancy of 4″.5 per century, or 10 per cent. Jeffreys has remarked that any discrepancy such as 10″ “would be fatal to a theory such as Einstein’s, which contains no arbitrary constituent capable of adjustment to suit empirical facts.”14 It must be pointed out here however, that so far as known, this small correction to the motion of Mercury’s perihelion has not previously been suggested, so that there has been no opportunity hitherto for its criticism by others.
It can be easily demonstrated that the theoretical difference between the equatorial and polar diameters, caused by centrifugal force, should only be about 0″.04, a value that would be nearly impossible to detect through observation and could easily be masked by a small increase in equatorial density compared to polar density. Any reasonable increase in density at the center would only slightly affect this outcome. The molecular weight of the central material13 is likely around 2. This calculated equatorial excess is one-twelfth of what is needed to account for the observed advance, and would, therefore, contribute to a perihelion advance of about 3″.5 each century, bringing the difference between the observed advance and that explained by gravity down to 38″.5. According to Einstein, the advance due to relativity, as we observed, should be 43″, leading to a discrepancy of 4″.5 per century, or 10 percent. Jeffreys has noted that any discrepancy over 10″ “would be fatal to a theory like Einstein’s, which has no adjustable components to fit empirical data.”14 It should be emphasized, however, that as far as we know, this small correction to Mercury's perihelion motion has not been proposed before, and thus there has been no chance for others to critique it.

One of the eclipse photographs
An eclipse photo
The arrows pointing to the star-images have been inserted by hand; and the star-images themselves have had to be materially strengthened in order to make them show in the engraving at all.
The arrows pointing to the star images have been added by hand, and the star images themselves have had to be significantly reinforced to be visible in the engraving.
Photograph submitted by Dr. Alexander McAdie, Harvard University, by courtesy of the Royal Observatory, Greenwich.
Photo submitted by Dr. Alexander McAdie, Harvard University, courtesy of the Royal Observatory, Greenwich.
It was due largely to the success with Mercury that it was decided to put the relativity theory to another test. According to the Newtonian theory, [303]as stated by Newton himself, corpuscles as well as planets have mass, and must therefore be attracted by the sun. According to Einstein, owing to their high velocity, this attraction must be twice as great as it would be according to the theory of gravitation. If the ray of light proceeding from a star were to pass nearly tangent to the sun’s limb it should be deflected 0″.87 according to Newton. According to the theory of relativity it should be deflected 1″.75. Stars of course cannot usually be observed near the sun. It is therefore necessary to take advantage of a total solar eclipse, when the sun is completely hidden by the moon, in order to secure these observations.
It was mainly because of the success with Mercury that they decided to test the theory of relativity again. According to Newtonian theory, [__A_TAG_PLACEHOLDER_0__]as Newton himself stated, both particles and planets have mass and should be attracted by the sun. However, according to Einstein, due to their high speed, this attraction should be twice as strong as what gravity theory suggests. If a beam of light from a star passes very close to the edge of the sun, it should bend by 0.87 seconds of arc according to Newton. According to relativity, it should bend by 1.75 seconds of arc. Of course, stars are usually not visible near the sun, so it’s necessary to wait for a total solar eclipse when the moon completely covers the sun to make these observations.
Two expeditions, one to Africa, and one to South America, observed successfully the total eclipse of May 29, 1919. The former was located on the Island of Principe in the Gulf of Guinea. The latter was located at Sobral, Brazil. Their equipment and results are shown in the following table, where the successive columns give the location, the aperture in inches of the telescopes employed, their focus in feet, the number of plates secured, the number of stars measured, their mean deduced deflection from their true positions by the attraction of the sun, and the deviations from the theoretical results.15 In the first and last line of the table shown herewith, this
Two expeditions, one to Africa and one to South America, successfully observed the total eclipse on May 29, 1919. The first group was on the Island of Príncipe in the Gulf of Guinea, while the second was in Sobral, Brazil. Their equipment and results are outlined in the table below, where each column details the location, the aperture size in inches of the telescopes used, their focal length in feet, the number of plates taken, the number of stars measured, their average deflection from their true positions due to the sun's gravity, and the discrepancies from the theoretical results.15 In the first and last line of the following table, this
Location | Aperture | Focus | Plates | Stars | Defl. | Dev. |
Principe | 13 | 11 | 2 | 5 | 1″.60 | -0″.15 |
Sobral | 13 | 11 | 19 | 12 | 0 .93 | (+0 .06) |
Sobral | 4 | 19 | 8 | 7 | 1 .98 | +0 .23 |
deviation is taken from Einstein’s computed value of [304]1″.75. In the second line the difference shown is from the value required by the Newtonian theory, 0″.87. The results obtained with this telescope were rejected however, although they were much the most numerous, because it was found that for some reason, supposed to be the heating of the mirror by the sun before the eclipse, the star images were slightly out of focus, and were therefore considered unreliable. The results with the two other telescopes were not very accordant, but the 4-inch had the longer focus, secured the greater number of plates, and showed the greater number of stars. The results obtained with it therefore appear to have been the more reliable. They differ from Einstein’s prediction by 13 per cent. In future expeditions to test this question, the mirror in front of the telescope will be eliminated.
The deviation is taken from Einstein’s calculated value of [__A_TAG_PLACEHOLDER_0__]1″.75. In the second line, the difference shown is from the value required by Newtonian theory, 0″.87. However, the results obtained with this telescope were discarded, even though they were the most numerous, because it was discovered that, for some reason—thought to be due to the sun heating the mirror before the eclipse—the star images were slightly out of focus and therefore deemed unreliable. The results from the other two telescopes were not very consistent, but the 4-inch had a longer focus, secured more plates, and displayed a greater number of stars. Therefore, the results obtained with it seem to have been more reliable. They differ from Einstein’s prediction by 13 percent. In future expeditions to test this question, the mirror in front of the telescope will be removed.
We now come to the final test which has been applied to Einstein’s theory. Einstein showed that in the intense gravitational field of the sun, the theory of relativity required that all of the spectrum lines should be shifted slightly toward the red end. The shift however is exceedingly small, and can only be detected and measured with the most powerful modern instruments. Moreover only certain lines can be used, because owing to varying pressure in the solar atmosphere, which affects many lines, as well as to rapid motion in the line of sight, which may affect all of them, still larger displacements are liable to occur.
We now come to the final test applied to Einstein’s theory. Einstein demonstrated that in the strong gravitational field of the sun, the theory of relativity predicted that all the spectrum lines would be shifted slightly toward the red end. However, the shift is extremely small and can only be detected and measured with today’s most powerful instruments. Moreover, only certain lines can be used because varying pressure in the solar atmosphere affects many lines, and rapid motion along the line of sight can affect all of them, resulting in even larger displacements.
According to the theory of relativity the displacement of the lines should be . St. John at Mt. Wilson found a displacement for the cyanogen [305]lines of only
.16 Evershed at Kodaikanal found +0.0060 at the north pole of the sun, and +0.0080 at
the south pole. These latter values however were only for the stronger lines. The
weaker lines give much smaller shifts, as do those of calcium and magnesium.17 According to Einstein all lines should give nearly the same shift, an amount proportional
to the wave length. It therefore appears that we must conclude by saying that Einstein’s
theory of relativity has been partially, but not completely, verified.
According to the theory of relativity, the expected shift of the lines should be . St. John at Mt. Wilson measured a shift for the cyanogen [__A_TAG_PLACEHOLDER_0__] lines of only
.16 Evershed at Kodaikanal found +0.0060 at the north pole of the sun and +0.0080 at the south pole. However, these latter values were only for the stronger lines. The weaker lines show much smaller shifts, as do those of calcium and magnesium.17 According to Einstein, all lines should show nearly the same shift, proportional to the wavelength. Therefore, it seems that we must conclude that Einstein's theory of relativity has been partially, but not fully, confirmed.
The reference numbers in the above text have nothing to do with the numbers used in other parts of this volume to acknowledge the work of the various contestants; they refer to Dr. Pickering’s sources, as follows:
The reference numbers in the text above are not related to the numbers used in other sections of this volume to recognize the efforts of different contestants; they refer to Dr. Pickering’s sources, as follows:
1 Journ. Brit. Astron. Assoc., 1919, 30, 76.
1 Journal of the British Astronomical Association, 1919, 30, 76.
5 Astro-Physical Journal, 1917, 46, 249. Journ. Brit. Astron. Assoc., 1920, 30, 276.
5 Astro-Physical Journal, 1917, 46, 249. Journ. Brit. Astronomy. Assoc., 1920, 30, 276.
9 The Observatory 1920, April. From an Oxford Note Book.
9 The Observatory 1920, April. From an Oxford Note Book.
10 Monthly Notices, R. A. S., 1917. 78, 3 De Sitter, 1919, 80, 121, Jeans, 80, 146 Jeffreys.
10 Monthly Notices, R. A. S., 1917. 78, 3 De Sitter, 1919, 80, 121, Jeans, 80, 146 Jeffreys.
11 “Gravitation and the Principle of Relativity,” Eddington. Royal Institution of Great Britain, 1918.
11 “Gravitation and the Principle of Relativity,” Eddington. Royal Institution of Great Britain, 1918.
13 “The Interior of a Star,” Eddington. Scientia, 1918, 23, 15.
13 “The Interior of a Star,” Eddington. Scientia, 1918, 23, 15.
XVIII
THE PRACTICAL SIGNIFICANCE OF RELATIVITY
The Best Discussion of the Special Theory Among All the Competing Essays
The Top Discussion of the Special Theory Among All the Competing Essays
Can a small child catch a baseball moving sixty miles an hour without getting hurt? We should probably answer “No”—but suppose that the boy and his father were sitting side by side in an express train, and the ball was tossed lightly from one to the other. Then there would be no trouble about it, whether the train was standing still, or going at full speed. Only the relative motion of ball and boy would count.
Can a small child catch a baseball traveling at sixty miles an hour without getting hurt? We should probably say “No”—but imagine that the boy and his father were sitting next to each other in an express train, and the ball was tossed gently between them. In that case, it wouldn't be a problem, whether the train was stationary or moving at full speed. Only the relative motion of the ball and the boy would matter.
This every-day experience is a good illustration of the much discussed Principle of Relativity, in its simplest form. If there were no jolting, the motion of the train, straight ahead at a uniform speed, would have no effect at all upon the relative motions of objects inside it, nor on the forces required to produce or change these motions. Indeed, the motion of the earth in its orbit, which is free from all jar, but a thousand times faster, does not influence even the most delicate apparatus. We are quite unconscious of it, and would not know that the earth [307]was moving, if we could not see other bodies outside it. This sort of relativity has been recognized for more than two centuries and lies at the bottom of all our ordinary dynamical reasoning, upon which both science and engineering are based.
This everyday experience is a good example of the often-discussed Principle of Relativity, in its simplest form. If there were no jolting, the train's motion, moving straight ahead at a steady speed, wouldn't affect the relative movements of objects inside it, nor the forces needed to create or change those movements. In fact, the earth's motion in its orbit, which is smooth and a thousand times faster, doesn’t impact even the most sensitive instruments. We are completely unaware of it, and wouldn’t know that the earth [__A_TAG_PLACEHOLDER_0__]is moving if we couldn’t see other objects outside of it. This kind of relativity has been acknowledged for over two centuries and is fundamental to all our ordinary reasoning in dynamics, which both science and engineering rely on.
But there are other things in nature besides moving bodies,—above all, light, which is intimately related to electricity and magnetism, and can travel through empty space, between the stars. It moves at the enormous speed of 186,000 miles per second, and behaves exactly like a series of vibrations or “waves.” We naturally think of it as travelling through some medium, and call this thing, which carries the light, the “ether.”
But there are other things in nature besides moving objects—most importantly, light, which is closely linked to electricity and magnetism and can travel through the vacuum of space, between the stars. It moves at an incredible speed of 186,000 miles per second and behaves just like a series of vibrations or “waves.” We instinctively think of it as moving through some medium and refer to this substance that carries light as the “ether.”

Can we tell whether we are moving through this ether, even though all parts of our
apparatus move together, and at the same rate? Suppose that we have two mirrors, M and N, at equal distances, d, from a point O, but in directions at right angles to one another, and send out a flash of light
from O. If everything is at rest, the reflected flashes will evidently come back to O at the same instant, and the [308]elapsed time will be seconds if c is the velocity of light.
Can we tell if we are moving through this ether, even if all parts of our equipment are moving together and at the same speed? Imagine we have two mirrors, M and N, the same distance, d, from a point O, but positioned at right angles to each other, and we send out a flash of light from O. If everything is stationary, the reflected flashes will obviously return to O at the same moment, and the [__A_TAG_PLACEHOLDER_0__] elapsed time will be seconds if c is the speed of light.
But suppose that O, M, and N are fastened to a rigid frame work, and all moving in the direction , with velocity V. The light which goes from O toward M, at the speed c, will overtake it with the difference of their speeds,
, taking
seconds to reach M. On the way back, O will be advancing to meet it, and the return trip will occupy
seconds. The elapsed time for the round trip comes out
seconds, which is longer than when the system was at rest—the loss of time in the “stern chase” exceeding
the saving on the return.
But let's say that O, M, and N are attached to a solid framework, all moving in the direction , at a speed of V. The light traveling from O to M, at a speed of c, will catch up with it due to the difference in their speeds,
, taking
seconds to reach M. On the way back, O will be moving towards it, and the return trip will take
seconds. The total time for the round trip is
seconds, which is longer than when the system was at rest—the time lost in the “stern chase” outweighing the time saved on the return.
The light which is reflected from N has a different history. When it starts, O and N have certain positions in the ether, and
. By the time it reaches the mirror, this is at
, and O is at
, and when it returns, it finds O at
. The distances for the outward and inward journeys are now equal, but (as is obvious
from the figure), each of them is greater than d, or
, and the time for the round trip will be correspondingly increased. A simple calculation
shows that it is
.
The light reflected from N has a different story. When it begins, O and N are positioned in the ether at and
. By the time it reaches the mirror,
is where N is located, and O is at
. When it returns, it finds O at
. The distances for the outward and inward journeys are now the same, but (as is clear from the figure), each distance is greater than d, or
, meaning the time for the round trip will be increased accordingly. A simple calculation shows that it is
.
The increase above the time required when the system was at rest is less in this case than the preceding. Hence, if the apparatus is moving through the ether, the flashes reflected from M and N will not return at the same instant.
The increase in time needed when the system was at rest is smaller in this situation than before. So, if the apparatus is moving through the ether, the flashes reflected from M and N won't come back at the same moment.
For such velocities as are attainable—even the 18 miles per second of the earth in its orbit—the difference is less than a hundred-millionth of the [309]elapsed time. Nevertheless, Michelson and Morley tried to detect it in their famous experiment.
For speeds that can be achieved—even the 18 miles per second that the Earth travels in its orbit—the difference is less than a hundred-millionth of the [__A_TAG_PLACEHOLDER_0__]elapsed time. Still, Michelson and Morley attempted to measure it in their famous experiment.
A beam of light was allowed to fall obliquely upon a clear glass mirror (placed at O in the diagram) which reflected part of it toward the mirror, M, and let the rest pass through to the mirror N. By reuniting the beams after their round trips, it was possible to tell whether one had gained upon the other by even a small fraction of the time of vibration of a single light wave. The apparatus was so sensitive that the predicted difference, though amounting to less than a millionth part of a billionth of a second, could easily have been measured; but they actually found no difference at all—though the earth is certainly in motion.
A beam of light was directed at an angle onto a clear glass mirror (shown at O in the diagram) which reflected some of it toward the mirror, M, while allowing the rest to pass to the mirror N. By combining the beams after their round trips, it was possible to determine if one beam had gained on the other by even a tiny fraction of the time it takes for a single light wave to vibrate. The setup was so sensitive that the expected difference, although less than a millionth of a billionth of a second, could have easily been measured; however, they actually found no difference at all—even though the earth is definitely in motion.
Other optical experiments, more intricate, and even more delicate, were attempted, with the same object of detecting the motion of the earth through the ether; and they all failed.
Other optical experiments, which were more complex and even more sensitive, were conducted with the same goal of detecting the motion of the earth through the ether, and they all failed.
The Special Theory and Its Unexpected Outcomes
It was upon these facts that Einstein based his original, or “special” theory of Relativity. He assumed boldly that the universe is so constituted that uniform straight-ahead motion of an observer and all his apparatus will not produce any difference whatever in the result of any physical process or experiment of any kind. Granting this, it follows that if all objects in the visible universe were moving uniformly together in any direction, no matter how fast, we could not find this out at all. We cannot [310]determine whether the universe, as a whole, is at rest or in motion, and may as well make one guess as another. Only the relative motions of its parts can be detected or studied.
It was on these facts that Einstein developed his original, or “special” theory of Relativity. He boldly assumed that the universe is structured in such a way that uniform straight-line motion of an observer and all their equipment will not cause any difference in the outcomes of any physical process or experiment. Given this, it follows that if all objects in the visible universe were moving together uniformly in any direction, no matter how fast, we wouldn't be able to tell at all. We cannot [__A_TAG_PLACEHOLDER_0__]determine whether the universe as a whole is at rest or in motion, and we might as well make one guess as another. Only the relative motions of its components can be detected or studied.
This seems simple and easy enough to understand. But the consequences which follow from it are extraordinary, and at first acquaintance seem almost absurd.
This seems straightforward and easy to grasp. However, the consequences that come from it are remarkable and, at first glance, seem almost ridiculous.
In the first place, if an observer measures the velocity of light, he must always
get the same result, no matter how fast he and his apparatus are moving, or in what
direction (so long as the motion is uniform and rectilinear). This sounds harmless;
but let us go back to the Michelson-Morley experiment where the light came back in
exactly the same time from the two mirrors. If the observer supposes himself to be
at rest, he will say that the distances and
were equal. But if he fancies that the whole universe is moving in the direction
, he will conclude that M is nearer to O than N is—for if they were equidistant, the round-trip would take longer in the first case,
as we have proved. If once more he fancies that the universe is moving in the direction
, he will conclude that N is nearer to O than M is. His answer to the question which of the two distances,
or
, is the greater will therefore depend on his assumption about the motion of the universe
as a whole.
In the first place, if someone measures the speed of light, they'll always get the same result, no matter how fast they're moving or in what direction (as long as the motion is steady and linear). This seems straightforward; but let's return to the Michelson-Morley experiment, where the light returned in exactly the same amount of time from the two mirrors. If the observer thinks they're at rest, they'll say that the distances and
are equal. But if they think the whole universe is moving in the direction of
, they'll conclude that M is closer to O than N is—because if they were the same distance, the round-trip would take longer in the first case, as we've shown. If they imagine again that the universe is moving in the direction of
, they'll conclude that N is closer to O than M is. So, his answer to the question of which of the two distances,
or
, is greater will depend on his assumption about the motion of the universe as a whole.
Similar complications arise in the measurement of time. Suppose that we have two observers, A and B, provided with clocks which run with perfect uniformity, and mirrors to reflect light signals to one another. At noon exactly by his clock, A sends a [311]flash of light towards B. B sees it come in at 12:01 by his clock. The flash reflected from B’s mirror reaches A at 12:02 by A’s clock. They communicate these observations to one another.
Similar complications come up when measuring time. Imagine we have two observers, A and B, each with clocks that work perfectly and mirrors to reflect light signals back and forth. Exactly at noon by his clock, A sends a [__A_TAG_PLACEHOLDER_0__] flash of light toward B. B sees it arrive at 12:01 by his clock. The flash reflected from B’s mirror reaches A at 12:02 by A’s clock. They share their observations with each other.
If A and B regard themselves as being at rest, they will agree that the light took as long to go out as it did to come back, and therefore that it reached B at just 12:01 by A’s clock, and that the two clocks are synchronized. But they may, if they please, suppose that they (and the whole universe) are moving in the direction from A towards B, with half the speed of light. They will then say that the light had a “stern chase” to reach B, and took three times as long to go out as to come back. This means that it got to B at 1½ minutes past noon by A’s clock, and that B’s clock is slow compared with A’s. If they should assume that they were moving with the same speed in the opposite direction, they would conclude that B’s clock is half a minute fast.
If A and B consider themselves to be at rest, they'll agree that the light took as long to go out as it did to return, meaning it reached B at exactly 12:01 on A’s clock, and that their clocks are synchronized. However, they could choose to think that they (along with the entire universe) are moving toward B at half the speed of light. In that case, they would say the light had a “stern chase” to reach B, taking three times longer to go out than to come back. This means it arrived at B at 1½ minutes past noon on A’s clock, indicating that B’s clock is slow compared to A’s. If they were to assume that they were moving in the opposite direction at the same speed, they would conclude that B’s clock is half a minute fast.
Hence their answer to the question whether two events at different places happen at the same time, or at different times, will depend on their assumption about the motion of the universe as a whole.
Hence their answer to the question of whether two events occurring in different locations happen at the same time or at different times will depend on their assumption about the motion of the universe as a whole.
Once more, let us suppose that A and B, with their clocks and mirrors, are in relative motion, with half the speed of light, and pass one another at noon by both clocks. At 12:02 by A’s clock, he sends a flash of light, which reaches B at 12:04 by his clock, is reflected, and gets back to A’s clock at 12:06. They signal these results to each other, and sit down to work them out. A thinks that he is at rest, and B moving. He therefore concludes that the light had the same distance to go out as to return [312]to him and took two seconds each way, reaching B at 12:04 by A’s clock, and that the two clocks, which agreed then, as well as at noon, are running at the same rate.
Once again, let's imagine that A and B, with their clocks and mirrors, are moving relative to each other at half the speed of light and pass each other at noon according to both clocks. At 12:02 by A’s clock, he sends a flash of light, which reaches B at 12:04 according to his clock, gets reflected, and returns to A’s clock at 12:06. They communicate these results to one another and sit down to figure them out. A believes he is stationary while B is the one in motion. He concludes that the light had to travel the same distance to reach B as it did to return to him, taking two seconds each way, arriving at B at 12:04 by A’s clock, and that the two clocks, which matched at noon, are running at the same rate now as well.
B, on the contrary, thinks that he is at rest and A in motion. He then concludes that A was much nearer when he sent out the flash than when he got it back, and that the light had three times as far to travel on the return journey. This means that it was 12:03 by A’s clock at the instant when the light reached B and B’s clock read 12:04. Hence A’s clock is running slow, compared with B’s.
B, on the other hand, believes that he is at rest and A is in motion. He concludes that A was much closer when he sent out the flash than when he got it back, and that the light had three times as far to travel on the way back. This means that it was 12:03 on A’s clock at the moment when the light reached B, while B’s clock read 12:04. Therefore, A’s clock is running slow compared to B’s.
Hence the answer to the question whether two intervals of time, measured by observers who are in motion relative to one another, are of the same or of different durations, depends upon their assumptions about the motion of the universe as a whole.
Hence the answer to the question of whether two time intervals, measured by observers who are moving relative to each other, are the same or different in duration, depends on their beliefs about the motion of the universe as a whole.
Now we must remember that one assumption about the motion of the universe as a whole is exactly as good—or bad—as another. No possible experiment can distinguish between them. Hence on the Principle of Relativity, we have left no absolute measurement of time or space. Whether two distances in different directions are to be called equal or not—whether two events in different places are to be called simultaneous or not—depends on our arbitrary choice of such an assumption, or “frame of reference.” All the various schemes of measurement corresponding to these assumptions will, when applied to any imaginable experiment, predict exactly the same phenomena. But, in certain important cases, these predictions differ from those of the old familiar theory, and, every time that such experiments [313]have been tried, the result has agreed with the new theory, and not with the old.
Now we need to remember that one assumption about the motion of the universe as a whole is exactly as good—or bad—as another. No possible experiment can tell them apart. So, based on the Principle of Relativity, we have no absolute measure of time or space. Whether two distances in different directions are considered equal or not—whether two events in different places are considered simultaneous or not—depends on our arbitrary choice of such an assumption, or “frame of reference.” All the different measurement schemes based on these assumptions will, when applied to any conceivable experiment, predict exactly the same phenomena. But, in certain important cases, these predictions differ from those of the old familiar theory, and, every time such experiments [__A_TAG_PLACEHOLDER_0__] have been conducted, the results have supported the new theory, not the old.
We are therefore driven to accept the theory of relativity, strange as it is, as being more nearly “true to nature” than our older ideas. Fortunately, the difference between the results of the two become important only when we assume that the whole visible universe is moving together much faster than any of its parts are moving relatively to one another. Unless we make such an unwarranted assumption, the differences are so small that it takes the most ingenious and precise experiments to reveal them.
We are therefore compelled to accept the theory of relativity, odd as it seems, as being closer to “true to nature” than our previous ideas. Fortunately, the difference between the results of the two becomes significant only when we assume that the entire visible universe is moving together much faster than any of its parts are moving relative to one another. Unless we make such an unfounded assumption, the differences are so minor that it requires the most clever and precise experiments to uncover them.
The Generalization
Not content with all this, Einstein proceeded, a few years ago, to develop a “general” theory of relativity, which includes the effects of gravitation.
Not satisfied with all this, Einstein went on, a few years ago, to develop a “general” theory of relativity that includes the effects of gravity.
To make this idea clear, let us imagine two observers, each, with his measuring instruments, in a large and perfectly impervious box, which forms his “closed system.”
To clarify this idea, let’s picture two observers, each with their measuring tools, inside a large, completely sealed box, which serves as their “closed system.”
The first observer, with his box and its contents, alone in space, is entirely at rest.
The first observer, with his box and its contents, is completely still, alone in space.
The second observer, with his box and its contents, is, it may be imagined, near the earth or the sun or some star, and falling freely under the influence of its gravitation.
The second observer, with his box and what’s inside, is probably close to the earth, the sun, or some star, and is falling freely under the pull of its gravity.
This second box and its contents, including the observer, will then fall under the gravitational force, that is, get up an ever-increasing speed, but at exactly the same rate, so that there will be no tendency for their relative positions to be altered. [314]
This second box and what’s inside it, including the observer, will then be affected by gravity, meaning they will all accelerate at the same increasing speed, so their relative positions will remain unchanged. [__A_TAG_PLACEHOLDER_0__]
According to Newton’s principles, this will make not the slightest difference in the motions of the physical objects comprising the system or their attractions on one another, so that no dynamical experiment can distinguish between the condition of the freely falling observer in the second box and the observer at rest in the first.
According to Newton’s principles, this will make no difference at all in the motion of the physical objects in the system or how they attract each other, so no dynamical experiment can tell the difference between the freely falling observer in the second box and the observer at rest in the first.
But once more the question arises: What could be done by an optical experiment?
But once again the question comes up: What could an optical experiment accomplish?
Einstein assumed that the principle of relativity still applied in this case, so that it would be impossible to distinguish between the conditions of the observers in the two boxes by any optical experiment.
Einstein believed that the principle of relativity still applied here, meaning it would be impossible to tell the difference between the conditions of the observers in the two boxes through any optical experiment.
It can easily be seen that it follows from this new generalized relativity that light cannot travel in a straight line in a gravitational field.
It’s clear that this new generalized relativity shows that light can’t travel in a straight line in a gravitational field.
Imagine that the first observer sets up three slits, all in a straight line. A ray of light which passes through the first and second will obviously pass exactly through the third.
Imagine that the first observer sets up three slits, all in a straight line. A ray of light that passes through the first and second will clearly pass directly through the third.

Suppose the observer in the freely falling system attempts the same experiment, having
his slits P, Q, R, equally spaced, and placing them at right angles to the direction in which he is
falling. When the light passes through P, the slits will be in certain position (Figure). By the time [315]it reaches Q, they will have fallen to a lower level,
, and when it reaches R, they will be still lower,
. The times which the light takes to move from P to Q and Q to R will be the same: but, since the system is falling ever faster and faster the distance
will be greater than
. Hence, if the light which has passed through P and Q moves in a straight line, it will strike above R, as is illustrated by the straight line in the figure. But, on Einstein’s assumption,
the light must go through the third slit, as it would do in the system at rest, and
must therefore move in a curved line, like the curved line in the figure, and bend downward in the direction of the gravitational force.
Suppose the observer in the freely falling system tries the same experiment, having his slits P, Q, R, spaced evenly and positioned at right angles to the direction he is falling. When the light passes through P, the slits will be in a certain position (Figure). By the time it reaches Q, they will have dropped to a lower level,
, and when it reaches R, they will be even lower,
. The time it takes for the light to travel from P to Q and Q to R will be the same: however, since the system is falling faster and faster, the distance
will be greater than
. Therefore, if the light that has passed through P and Q travels in a straight line, it will hit above R, as shown by the straight line in the figure. But, according to Einstein's assumption, the light must go through the third slit, as it would in a system at rest, and must therefore travel in a curved path, like the curved line in the figure, and dip downward in the direction of the gravitational force.
The Exams
Calculation shows that the deviation of light by the moon or planets would be too small to detect. But for a ray which had passed near the sun, the deflection comes out 1.7″, which the modern astronomer regards as a large quantity, easy to measure. Observations to test this can be made only at a total eclipse, when we can photograph stars near the sun, on a nearly dark sky. A very fine chance came in May, 1919, and two English expeditions were sent to Brazil and the African coast. These photographs were measured with extreme care, and they show that the stars actually appear to be shifted, in almost exactly the way predicted by Einstein’s theory.
Calculation shows that the light deviation by the moon or planets would be too small to notice. However, for a ray that passes close to the sun, the deflection measures 1.7″, which today's astronomers consider significant and easy to measure. Observations to test this can only happen during a total eclipse when we can photograph stars near the sun against a nearly dark sky. A great opportunity arose in May 1919, and two British expeditions were sent to Brazil and the African coast. These photographs were meticulously measured, and they indicate that the stars actually appear to be shifted, almost exactly as predicted by Einstein's theory.
Another consequence of “general relativity” is that Newton’s law of gravitation needs a minute [316]correction. This is so small that there is but a single case in which it can be tested. On Newton’s theory, the line joining the sun to the nearest point upon a planet’s orbit (its perihelion) should remain fixed in direction, (barring certain effects of the attraction of the other planets, which can be allowed for). On Einstein’s theory it should move slowly forward. It has been known for years that the perihelion of Mercury was actually moving forward, and all explanations had failed. But Einstein’s theory not only predicts the direction of the motion, but exactly the observed amount.
Another outcome of “general relativity” is that Newton’s law of gravitation requires a tiny [__A_TAG_PLACEHOLDER_0__]correction. This correction is so small that it has only been possible to test it in one instance. According to Newton’s theory, the line connecting the sun to the closest point on a planet’s orbit (its perihelion) should stay fixed in direction, except for specific effects from the pull of other planets, which can be accounted for. However, in Einstein’s theory, this line should gradually shift forward. It’s been known for years that the perihelion of Mercury was indeed moving forward, and previous explanations had all fallen short. But Einstein’s theory not only predicts the direction of this movement but also matches the exact amount observed.
Einstein also predicts that the lines of any element in the solar spectrum should be slightly shifted towards the red, as compared with those produced in our laboratories. Different observers have investigated this, and so far they disagree. The trouble is that there are several other influences which may shift the lines, such as pressure in the sun’s atmosphere, motion of currents on the sun’s surface, etc., and it is very hard to disentangle this Gordian knot. At present, the results of these observations can neither be counted for or against the theory, while those in the other two cases are decisively favorable.
Einstein also predicts that the lines of any element in the solar spectrum should be slightly shifted toward the red compared to those produced in our labs. Different observers have looked into this, but so far they don't agree. The problem is that there are several other factors that might shift the lines, like pressure in the sun’s atmosphere, movement of currents on the sun’s surface, and so on, making it very difficult to untangle this complicated situation. Right now, the results of these observations can’t be counted for or against the theory, while those from the other two cases clearly support it.
The mathematical expression of this general relativity is intricate and difficult. Mathematicians—who are used to conceptions which are unfamiliar, if not incomprehensible, to most of us—find that these expressions may be described (to the trained student) in terms of space of four dimensions and of the non-Euclidean geometry. We therefore hear such phrases as “time as a sort of fourth dimension,” “curvature of space” and others. But these are [317]simply attempts—not altogether successful—to put mathematical relationships into ordinary language, instead of algebraic equations.
The math behind general relativity is complicated and challenging. Mathematicians—who are accustomed to ideas that most people find strange, if not completely confusing—describe these concepts (to those who have trained in the field) using four-dimensional space and non-Euclidean geometry. That's why we hear phrases like “time as a sort of fourth dimension” and “curvature of space.” However, these are [__A_TAG_PLACEHOLDER_0__] attempts—not entirely effective—to translate mathematical relationships into everyday language, instead of sticking to algebraic equations.
More important to the general reader are the physical bearings of the new theory, and these are far easier to understand.
More important to the average reader are the practical implications of the new theory, and these are much easier to grasp.
Various assumptions which we may make about the motion of the universe as a whole, though they do not influence the observed facts of nature, will lead us to different ways of interpreting our observations as measurements of space and time.
Various assumptions we might make about the motion of the universe as a whole, even though they don't affect the observed facts of nature, will lead us to different ways of interpreting our observations as measurements of space and time.
Theoretically, one of these assumptions is as good as any other. Hence we no longer believe in absolute space and time. This is of great interest philosophically. Practically, it is unimportant, for, unless our choice of an assumption is very wild, our conclusions and measurements will agree substantially with those which are already familiar.
Theoretically, one of these assumptions is as good as any other. So we no longer believe in absolute space and time. This is really interesting from a philosophical standpoint. Practically, it doesn't matter much, because unless our choice of assumption is really off, our conclusions and measurements will align closely with the ones we already know.
Finally, the “general” relativity shows that gravitation and electro-magnetic phenomena—(including light) do not form two independent sides of nature, as we once supposed, but influence one another (though slightly) and are parts of one greater whole. [318]
Finally, “general” relativity shows that gravity and electromagnetic phenomena—including light—aren't two separate aspects of nature as we used to think. Instead, they influence each other (even if only a little) and are parts of one larger whole. [__A_TAG_PLACEHOLDER_0__]
XIX
EINSTEIN’S THEORY OF RELATIVITY
A Simple Explanation of His Postulates and Their Consequences
A Simple Explanation of His Ideas and What They Mean
Einstein’s theory of relativity seeks to represent to us the world as it really is instead of the world of appearances which may be deceiving us. When I was in town last week to buy 5 yards of calico I watched the draper very carefully as he measured the cloth to make sure I was not cheated. Yet experiment can demonstrate, and Einstein’s theory can explain, that the draper’s yardstick became longer or shorter according to the direction in which it was held. The length of the yardstick did not appear to me to change simply because everything else in the same direction, the store, the draper, the cloth, the retina of my eye, changed length in the same ratio. Einstein’s theory points out not only this, but every case where appearances are deceptive, and tries to show us the world of reality.
Einstein’s theory of relativity aims to show us the world as it truly is, rather than the deceptive world of appearances. When I was in town last week to buy 5 yards of calico, I watched the draper closely as he measured the cloth to ensure I wasn’t getting cheated. However, experiments can show, and Einstein’s theory can clarify, that the draper’s yardstick could appear longer or shorter depending on how it was held. The length of the yardstick didn’t seem to change for me simply because everything else in that direction—the store, the draper, the cloth, the retina of my eye—changed in length at the same rate. Einstein’s theory highlights not only this phenomenon but also every situation where appearances can be misleading, attempting to reveal the true nature of reality.
Einstein’s theory is based on the principle of relativity and before we try to follow his reasoning we must spend a little time in understanding what he [319]means by “relativity” and in grasping how the idea arises. Suppose I wish to define my motion as I travel along in an automobile. I may be moving at the rate of 25 miles an hour relative to objects fixed on the roadside, but relative to a fellow-passenger I am not moving at all; relative to the sun I am moving with a speed of 18½ miles per second in an elliptical orbit, and again relative to the stars I am moving in the direction of the star Vega at a speed of 12 miles per second. Thus motion can only be defined relative to some object or point of reference. Now this is not satisfactory to the exact scientist. Scientists are not content with knowing, for example, that the temperature of boiling water is +100° C. relative to the temperature of freezing; they have set out to determine absolute temperatures and have found that water boils at 373° C. above absolute zero. Why should I not, therefore, determine the absolute motion of the automobile, not its motion relative to the road, earth, sun or stars, but relative to absolute rest?
Einstein’s theory is based on the principle of relativity. Before we dive into his reasoning, we need to spend some time understanding what he [__A_TAG_PLACEHOLDER_0__]means by “relativity” and how this idea comes about. Think about it: if I want to define my motion while driving in a car, I might be going 25 miles an hour compared to objects by the roadside, but to a passenger next to me, I’m not moving at all. Relative to the sun, I'm traveling at 18½ miles per second in an elliptical orbit, and compared to the stars, I'm heading toward the star Vega at 12 miles per second. This shows that motion can only be defined in relation to some object or reference point. However, this isn’t satisfactory to scientists who seek precision. They don’t just want to know that the temperature of boiling water is +100° C compared to freezing; they aim to find absolute temperatures, discovering that water boils at 373° C above absolute zero. So, why shouldn't I determine the absolute motion of the car, not just its motion relative to the road, earth, sun, or stars, but in relation to absolute rest?
Michelson and Morley set out in their famous experiment to measure the absolute velocity of their laboratory, which was, of course, fixed on the earth. The experiment consisted of timing two rays of light over two equal tracks at right angles to each other. When one track was situated in the direction of the earth’s motion they expected to get the same result as when two scullers of equal prowess are racing in a river, one up and down the stream and the other across and back; the winner will be the [320]sculler rowing across the stream, as working out an example will convince. Even if the earth had been stationary at the time of one experiment, the earth’s motion round the sun would have been reversed 6 months later and would then have given double the effect. They found, however, that the two rays of light arrived always an exact dead heat. All experimenters who have tried since have arrived at the same result and found it impossible to detect absolute motion.
Michelson and Morley embarked on their famous experiment to measure the absolute speed of their laboratory, which was, of course, stationary on Earth. The experiment involved timing two rays of light over two equal paths at right angles to each other. When one path was aligned with the Earth’s motion, they expected to get the same result as when two rowers of equal skill race in a river, one going upstream and downstream and the other going across the stream; the winner will be the rower moving across the stream, as a simple example will demonstrate. Even if the Earth had been stationary during one experiment, its motion around the sun would have reversed six months later, producing double the effect. However, they discovered that the two rays of light always arrived in a perfect tie. All subsequent experimenters have reached the same conclusion and found it impossible to detect absolute motion.
The principle of relativity has its foundation in fact on these failures to detect absolute motion. This principle states that the only motion we can ever know about is relative motion. If we devise an experiment which ought to reveal absolute motion, nature will enter into a conspiracy to defeat us. In the Michelson and Morley experiment the conspiracy was that the track in the direction of the earth’s absolute motion should contract its length by just so much as would allow the ray of light along it to arrive up to time.
The principle of relativity is based on the fact that we can't detect absolute motion. This principle says that the only motion we can recognize is relative motion. If we set up an experiment that should show absolute motion, nature will work against us. In the Michelson and Morley experiment, the issue was that the path in the direction of the earth’s absolute motion would shorten just enough to allow the light beam to arrive on time.
We see, therefore, that according to the principle of relativity motion must always remain a relative term, in much the same way as vertical and horizontal, right and left, are relative terms having only meaning when referred to some observer. We do not expect to find an absolute vertical and are wise enough not to attempt it; in seeking to find absolute motion physicists were not so wise and only found themselves baffled.
We see, therefore, that based on the principle of relativity, motion can only be understood in relation to something else, just like vertical and horizontal, right and left, only have meaning when connected to an observer. We don't expect to find an absolute vertical and are smart enough not to try; in the pursuit of absolute motion, physicists weren't as wise and ended up confused.
The principle that all motion is relative now requires to be worked out to all its consequences, as [321]has been done by Einstein, and we have his theory of relativity. Einstein conceives a world of four dimensions built up of the three dimensions of space, namely up and down, backwards and forwards, right and left, with time as the fourth dimension. This is an unusual conception to most of us, so let us simplify it into something which we can more easily picture but which will still allow us to grasp Einstein’s ideas. Let us confine ourselves for the present to events which happen on this sheet of paper, i.e., to space of two dimensions only and take time as our third dimension at right angles to the plane of the paper. We have thus built up a three dimensional world of space-time which is every bit as useful to us as a four dimensional representation so long as we only need study objects moving over the sheet of paper.
The idea that all motion is relative needs to be fully explored, as [__A_TAG_PLACEHOLDER_0__] has shown with Einstein's theory of relativity. Einstein envisions a four-dimensional world composed of the three dimensions of space—up and down, backward and forward, left and right—with time as the fourth dimension. This is an unusual concept for many of us, so let's simplify it into something that's easier to visualize while still capturing Einstein's ideas. For now, let's focus on events happening on this sheet of paper, meaning we’ll consider only two dimensions of space and take time as our third dimension, which is perpendicular to the plane of the paper. In this way, we've created a three-dimensional world of space-time that is just as useful as a four-dimensional representation as long as we only need to study objects moving across the sheet of paper.
Suppose a fly is crawling over this sheet of paper and let us make a movie record of it. If we cut up the strip of movie film into the individual pictures and cement them together one above another in their proper order, we shall build up a solid block of film which will be a model of our simplified world of space-time and in which there will be a series of dots representing the motion of the fly over the paper. Just as I can state the exact position of an object in my room by defining its height above the floor, its distance from the north wall and its distance from the east wall, so we can reduce the positions of the dots to figures for use in calculations by measuring their distances from the three faces intersecting in the lines OX, OY, and OT, where [322]OXAYTBCD represents the block of film. The mathematician would call the three lines OX, OY, OT the coordinate axes. Measuring all the dots in this way we shall obtain the motion of the fly relative to the coordinate axes OX, OY, OT. If we add a block OTDYEFGH of plain film we can use EX, EH, EF as coordinate axes and again obtain the motion of the fly relative to these new axes; or we
Suppose a fly is crawling across this sheet of paper, and let’s record it on video. If we cut the movie film into individual frames and stick them together in the right order, we'll create a solid block of film that represents our simplified world of space-time, showing a series of dots that depict the fly's movement across the paper. Just like I can pinpoint the exact location of an object in my room by stating how high it is from the floor, its distance from the north wall, and its distance from the east wall, we can simplify the dot positions into figures for calculations by measuring their distances from the three planes that intersect at the lines OX, OY, and OT, where [__A_TAG_PLACEHOLDER_0__]OXAYTBCD represents the film block. Mathematicians refer to the three lines OX, Oh no, and OT as the coordinate axes. By measuring all the dots this way, we can track the fly's motion relative to these coordinate axes OX, OMG, and OT. If we add a strip of plain film OTDYEFGH, we can use EX, EH, and EF as new coordinate axes and again track the fly's movement relative to these axes; or we...

can add block after block so as to keep the axes moving. We can conceive of other changes of axes. The operator making the movie record might have taken the fly for the hero of the piece and moved the camera about so as to keep the fly more or less central in the picture; or he might, by turning the handle first fast and then slow and by moving the camera, have made the fly appear to be doing stunts. Moving the camera would change the axes of x and y, and turning the handle at different speeds [323]would change the axis of time. Again, we might change the axes by pushing the block out of shape or by distorting it into a state of strain. Whatever change of axes we make, any dot in the block of film will signify a coincidence of the fly with a certain point of the paper at a certain time, and the series of dots will, in every case, be a representation of the motion of the fly. Maybe the representation will be a distorted one, but who is to say which is the absolutely undistorted representation? The principle of relativity which we laid down before says that no one set of coordinates will give the absolute motion of the fly, so that one set is as good as another. The principle that all motion is relative means, therefore, that no matter how we change our coordinates of space-time, the laws of motion which we deduce must be the same for all changes.
You can add block after block to keep the axes moving. We can imagine other changes to the axes. The operator recording the video might have chosen the fly as the main subject and moved the camera around to keep it somewhat centered in the frame; or by turning the handle quickly and then slowly while moving the camera, he could have made the fly seem like it was doing stunts. Moving the camera would change the axes of x and y, and adjusting the speed of the handle [__A_TAG_PLACEHOLDER_0__] would change the time axis. Additionally, we could alter the axes by deforming the block or distorting it under strain. Regardless of how we change the axes, any dot in the block of film will represent a moment when the fly coincides with a specific point on the paper at a certain time, and the series of dots will always depict the fly's motion. The representation might be distorted, but who can determine which representation is completely undistorted? The principle of relativity we've discussed states that no single set of coordinates will give the absolute motion of the fly, meaning any set is equally valid. The idea that all motion is relative implies that no matter how we adjust our space-time coordinates, the laws of motion we derive must remain consistent across all changes.
To use an analogy, the sculptured head of Shakespeare on my table may appear to have hollow cheeks when I admit light from the east window only, or to have sunken eyes with light from the skylight in the roof, but the true shape of the head remains the same in all lights.
To use an analogy, the sculpted head of Shakespeare on my table may look like it has hollow cheeks when I let in light from the east window only, or appear to have sunken eyes with light from the skylight in the roof, but the true shape of the head stays the same in all lighting.
Hence, if with reference to two consecutive dots in our block of film a mathematical quantity can be found which will not change no matter how we changes our axes of coordinates, that quantity must be an expression of the true law of motion of the fly between the two points of the paper and the two times represented by these two dots. Einstein has worked out such a quantity remaining constant [324]for all changes of coordinates of the four dimensional world of space-time.
Therefore, if we can find a mathematical quantity related to two consecutive dots in our film strip that stays the same regardless of how we change our coordinate axes, that quantity must represent the true law of motion of the fly between the two points on the paper and the two times indicated by these dots. Einstein developed a quantity that remains constant [__A_TAG_PLACEHOLDER_0__] for all changes in the coordinates of the four-dimensional world of space-time.
In passing we may notice a feature of Einstein’s world of space-time which we shall doubtless find it difficult to conceive, namely, that there is no essential difference between a time and a distance in space. Since one set of coordinates is as good as another, we can transform time into space and space into time according as we choose our axes. For example if we change OX, OT, the axes of x and time in Fig. 2, into OX′, OT′ by a simple rotation, the new time represented by OT′ consists partly of OA in the old time and partly of OB in the old x direction. Referring to our block of movie film again, it means that although I might separate the block into space and time by slicing it into the original pictures, I can just as readily slice it in any direction I choose and still get individual pictures representing the motion of the fly but with, of course, new time and space. So whilst I may be believe that a liner has travelled 3,000 miles in 4 days, an observer on a star who knows nothing of my particular axes in space-time may say, with equal truth, that it went 2,000 miles in 7 days. Thus, time and space are not two separate identities in Einstein’s view; there only exists a world of four dimensions which we can split up into time and space as we choose.
In passing, we can point out an aspect of Einstein's concept of space-time that we might find hard to grasp: there isn't a fundamental difference between time and distance in space. Since any set of coordinates works just as well as another, we can switch time for space and vice versa depending on how we set our axes. For example, if we change OX and OT, the axes of x and time in Fig. 2, to OX' and OT by simply rotating, the new time shown by OT' is partly made up of OA from the old time and partly of OB from the old x direction. Referring back to our film strip, this means that even though I might divide the strip into space and time by cutting it into the original frames, I can just as easily cut it in any direction I choose and still get individual frames showing the fly's motion but with, of course, new representations of time and space. So, while I might think that a liner has traveled 3,000 miles in 4 days, an observer on a star who knows nothing about my specific axes in space-time could truthfully say it traveled 2,000 miles in 7 days. Thus, in Einstein's view, time and space aren't two separate entities; there's only a four-dimensional world that we can divide into time and space however we like.
Let us see now how Einstein explains gravitation. When a body is not acted on by any forces (except gravitation) the quantity which remains constant for all changes of coordinates implies that the body will follow that path in the space of an outside [325]observer which takes the least time. It is an observed fact that one body attracts another by gravitation; that is, the path of one body is bent from its course by the presence of another. Now we can bend the path of the fly in our block of film by straining the block in some way. Suppose, therefore, that I strain the world so as to bend the path of a body exactly as the gravitation due to some other body bends it; i.e., by a change of coordinates I have obtained the same effect as that produced by gravitation. Einstein’s theory, therefore, explains gravitation as a distortion of the world of space-time due to the presence of matter. Suppose first that a body is moving with no other bodies near; according to Einstein it will take the path in space which requires the least time, i.e., a straight line as agrees with our experience. If now the world be strained by the presence of another body or by a change of coordinates it will still pursue the path of least time, but this path is now distorted from the straight line, just as in a similar way the path on a globe requiring the least time to travel follows a great circle. So, on Einstein’s view of gravitation, the earth moves in an elliptical path around the sun not because a force is acting on it, but because the world of space-time is so distorted by the presence of the sun that the path of least time through space is the elliptical path observed. There is, therefore, no need to introduce any idea of “force” of gravitation. Einstein’s theory explains gravitation only in the sense that he has explained it away as a force of nature and makes it a property of space-time, namely, a distortion not [326]different from an appropriate change of coordinates. He does not, however, explain how or why a body can distort space-time. It is noteworthy that whilst the law of gravitation and the law of uniform motion in a straight line when no force is acting were separate and independent laws under Newton, Einstein finds one explanation for both under the principle of relativity.1 [327]
Let’s see how Einstein talks about gravity. When an object isn’t influenced by any forces (except gravity), the one factor that stays constant for any changes in position means that the object will travel along the shortest path through space as seen by an outside observer. It’s a known fact that one object pulls on another through gravity; in other words, the path of one object changes because of another’s presence. Now we can alter the path of a fly in our film strip by manipulating the strip in some way. So, let’s assume I manipulate the world to bend an object’s path exactly as gravity from another object would bend it; that is, by changing the coordinates, I have achieved the same effect as gravity. Einstein’s theory explains gravity as a warp in the space-time fabric caused by the presence of matter. First, consider a body moving without any nearby objects; according to Einstein, it will take the path through space that requires the least amount of time, meaning a straight line, which matches our experience. If the world is distorted by the presence of another object or through a change in coordinates, it will still follow the path of least time, but this path is now warped from a straight line, just like on a globe where the shortest travel route follows a great circle. So, according to Einstein's view of gravity, the Earth moves in an elliptical orbit around the sun not because a force is acting on it, but because the space-time fabric is so warped by the sun’s presence that the shortest path through space is the elliptical path we observe. Thus, there’s no need to introduce the concept of a “force” of gravity. Einstein's theory explains gravity by effectively removing it as a natural force and instead describes it as a characteristic of space-time, specifically a distortion akin to a relevant change in coordinates. However, he doesn’t explain how or why an object can warp space-time. Interestingly, while Newton saw the law of gravity and the law of uniform motion in a straight line with no forces as separate and independent laws, Einstein finds a single explanation for both under the principle of relativity.
1 The balance of Dr. Royds’ essay is given to a discussion of the phenomena of Mercury’s perihelial advance, the deflection of light under the gravitational field of the sun, and the shift in spectral lines, in connection with which alone Einstein’s theory makes predictions which are sufficiently at variance with those of Newtonian science to be of value in checking up the theory observationally. In the interest of space conservation and in the presence of Dr. Pickering’s very complete discussion of these matters we omit Dr. Royds’ statement.—Editor. ↑
1 Dr. Royds’ essay primarily discusses the phenomena of Mercury’s advance during perihelion, the bending of light due to the sun's gravitational field, and the movement of spectral lines. These are the areas where Einstein’s theory makes predictions that differ enough from Newtonian science to be useful for observational verification of the theory. To save space and because Dr. Pickering has thoroughly covered these topics, we will omit Dr. Royds’ statement.—Editor. ↑
XX
EINSTEIN’S THEORY OF GRAVITATION
The Discussion of the General Theory and Its Most Important Application, from the Essay by
The Discussion of the General Theory and Its Most Important Application, from the Essay by
Newton’s great discovery regarding the motion of the planets consisted in his showing that these could all be summed up in the following statement: consider any planet in its relation to all particles in the universe. Write down, for the planet, in the line joining it to any particle, an acceleration proportional to the mass of the particle and to the inverse square of its distance from the planet. Then calculate the planet’s resultant acceleration by combining all the accelerations thus obtained.
Newton's major discovery about how planets move can be summed up like this: take any planet and look at its relationship with all the particles in the universe. For that planet, along the line connecting it to any particle, note an acceleration that’s proportional to the mass of that particle and inversely proportional to the square of the distance from the planet. Then, figure out the planet’s overall acceleration by adding up all the accelerations you just calculated.
We have here purposely avoided the use of the word “force,” for Newton’s law is complete as a practical statement of fact without it; and this word adds nothing to the law by way of enhancing its power in actual use. Nevertheless, the fact that the acceleration is made up as it were of non-interfering contributions from each particle in the line joining it to the planet strongly suggests to the mind something of the nature of an elastic pull for which the [328]particle is responsible, and to which the planet’s departure from a straight-line motion is due. The mind likes to think of the elastic; ever since the time of Newton people have sought to devise some mechanism by which these pulls might be visualized as responsible for the phenomena in the same way as one pictures an elastic thread as controlling the motion of a stone which swings around at its end.
We have intentionally avoided using the word “force,” because Newton’s law stands as a complete practical statement of fact without it; this word doesn’t really add anything to the law in terms of its application. Still, the fact that the acceleration seems to come from each particle along the line connecting it to the planet suggests something like an elastic pull that the [__A_TAG_PLACEHOLDER_0__] particle is responsible for, and this is what causes the planet to deviate from a straight-line motion. Our minds tend to think in terms of elastic connections; ever since Newton’s time, people have tried to come up with some way to picture these pulls as explaining the phenomena, similar to visualizing an elastic thread controlling the motion of a stone that swings at its end.
This search has been always without success; and now Einstein has found a rather different law which fits the facts better than Newton’s law. It is of such a type that it does not lend itself conveniently to expression in terms of force; the mind would gain nothing by trying to picture such forces as are necessary. It compensates for this, however, in being capable of visualization in terms of what is ultimately a much simpler concept.
This search has always been unsuccessful; and now Einstein has discovered a different law that better explains the facts than Newton’s law. It's of a type that doesn’t easily translate into terms of force; trying to visualize those necessary forces wouldn’t help our understanding. However, it compensates for this by being able to be visualized in terms of a concept that is ultimately much simpler.
In order to appreciate the fundamental ideas involved, suppose for a moment that gravitation could be annihilated, completely, and suppose I find myself upon this earth in empty space. You shall be seated at some point in space and shall watch my doings. If I am in the condition of mind of the people of the reign of King Henry VIII, I shall believe that the earth does not rotate. If I let go a stone, there being no gravity, I shall find that it flies away from me with an acceleration. You will know, however, that the stone really moves in a straight line with constant velocity, and that the apparent acceleration which I perceive is due to the earth’s rotation. If I have argued that acceleration is due to force, I shall say that the earth repels the stone, and shall try to find the law governing the [329]variation of this force with distance. I may go farther, and try to imagine some reason for the force, some pushing action transmitted from the earth to the stone through a surrounding medium; and, you will pity me for all this wasted labor, and particularly for my attempt to find a mechanism to account for the force, since you know that if I would only accept your measurements all would appear so simple.
To understand the basic concepts involved, imagine for a moment that gravity could be completely eliminated, and picture myself on this earth in empty space. You’ll be sitting somewhere in space, watching what I do. If I think like people did during the reign of King Henry VIII, I’ll believe that the earth isn’t spinning. If I drop a stone, without gravity, it will seem to fly away from me with acceleration. But you will know that the stone is actually moving in a straight line at a constant speed, and the "acceleration" I see is just because of the earth's rotation. If I argue that acceleration is caused by force, I might claim that the earth pushes the stone away and try to find the rule that describes how this force changes with distance. I might even go further and try to think of a reason for the force, imagining some kind of push from the earth to the stone through a surrounding medium. You’ll feel sorry for all this drunk effort, especially my attempts to find a way to explain the force, since you know that if only I would accept your measurements, everything would seem so straightforward.
Let us probe this matter a little farther, however, from the stand-point of myself. I must believe in the reality of the force, since I have to be tied to my chair to prevent my departure from the earth. I might wonder how this field of force would affect the propagation of light, chemical action and so forth. For, even though I had discovered that, by using your measures, I could transform away the apparent effects of my field of force as far as concerned its power to hurl stones about, I could still regard this as a mathematical accident, and believe that the force was really there. Although I might suspect that the same transformation of view-point that would annul the field’s effect as regards the stones would also annul its effect as regards light, etc., I should not be sure of this, as you would be; and my conscience would hardly allow me to do more than look upon the assumption of complete equivalence between the apparent field and a change in the system of measurement as a hypothesis. I should be strongly tempted to make the hypothesis, however.
Let’s look into this a bit more from my perspective. I have to believe in the reality of this force since I have to be strapped to my chair to stop myself from floating away. I might wonder how this force field would influence the travel of light, chemical reactions, and so on. Even though I’ve found that by using your measurements I could eliminate the visible effects of my force field when it comes to its ability to throw stones around, I could still see this as just a mathematical coincidence and think that the force is still actually there. While I might suspect that the same shift in perspective that removes the force's effect on stones would also remove its effect on light and such, I wouldn't be as sure of this as you would; and my conscience wouldn’t let me do anything more than regard the idea of complete equivalence between the apparent field and a change in the measurement system as a hypothesis. Still, I would be very tempted to propose that hypothesis.
Now the question raised by Einstein is whether the force of gravity, which we experience as a very [330]real thing, may be put upon a footing which is in some way analogous to that of the obviously fictitious centrifugal force cited above: whether gravitation may be regarded as a figment of our imagination engendered by the way in which we measure things. He found that it could be so regarded. He went still farther, and in his Principle of Equivalence, he postulated that the apparent effects of gravitation in all phenomena could be attributed to the same change in the system of our measurements that would account for the ordinary phenomena of gravitation. On the basis of this hypothesis he was able to deduce for subsequent experimental verification, the effects of gravitation on light. He did not limit himself to such simple changes in our measurements as were sufficient to serve the purpose of the problem of centrifugal force cited above; but, emboldened by the assumptions, in the older theory of relativity, of change in standards of length and time on account of motion, he went even farther than this, and considered the possibility of change of our measures due to mere proximity to matter.
Now the question raised by Einstein is whether gravity, which we experience as a very [__A_TAG_PLACEHOLDER_0__]real thing, can be understood in a way that is similar to the obviously imaginary centrifugal force mentioned earlier: whether we can think of gravitation as something created by how we measure things. He found that it could be seen this way. He went even further, and in his Principle of Equivalence, he suggested that the apparent effects of gravity in all phenomena could be traced back to the same change in our measurement system that explains the ordinary effects of gravity. Based on this idea, he was able to predict, for later experimental verification, the effects of gravity on light. He didn't just limit himself to simple changes in our measurements that would apply to the problem of centrifugal force mentioned earlier; instead, encouraged by the earlier theories of relativity that involved changes in standards of length and time due to motion, he considered even broader changes in our measurements caused by being near matter.
His problem amounted to an attempt to find some way in which it was possible to conceive our scales and clocks as altered, relatively to some more fundamental set, so as to allow of the planetary motions being uniform and rectilinear with respect to these fundamental measures, although they appear as they do to us. If we allow our imaginations perfect freedom as to how the scales may be altered, we shall not balk at assuming alterations varying in any way we please, Einstein does, however, introduce [331]restrictions for reasons which we will now discern.
His problem was basically about figuring out how we could imagine our scales and clocks being different compared to some more fundamental set, so that the motions of the planets could appear uniform and straight with respect to these basic measurements, even though they seem different to us. If we let our imaginations run wild with how the scales might be changed, we won't hesitate to assume any kind of alteration we want. However, Einstein does impose [__A_TAG_PLACEHOLDER_0__]restrictions for reasons that we will now understand.
If we imagine our whole universe, with its observers, planetary orbits, instruments, and everything else, embedded in a jelly, and then distort the jelly and contents in any way, the numbers at which our planetary orbits (or rather their telescopic images) intersect our scales will be unaltered. Moreover, we could vary, in any manner, the times at which all objects (including the clock hands) occupied their distorted positions, and the hand of some clock near the point where the planetary image crossed the scale would record for this occurrence the same dial reading as before. An inhabitant of this distorted universe would be absolutely unconscious of the change. Now the General Theory of Relativity which expresses itself in slightly varied forms, amounts to satisfying a certain philosophical craving of the mind, by asserting that the laws of nature which control our universe ought to be such that another universe like the above, whose inhabitants would be unconscious of their change, would also satisfy these laws, not merely from the standpoint of its own inhabitants, but also from the standpoint of our measurements. In other words, this second universe ought to appear possible to us as well as to its inhabitants.
If we picture our entire universe, complete with its observers, planetary orbits, instruments, and everything else, floating in a jelly, and then distort that jelly and its contents in any way, the measurements where our planetary orbits (or rather their telescopic images) intersect our scales will remain unchanged. Furthermore, we could change, in any way, the times when all objects (including clock hands) were in their distorted positions, and a clock close to the point where the planetary image crossed the scale would show the same reading as before. A person living in this distorted universe would be completely unaware of the change. Now, the General Theory of Relativity, which can be expressed in slightly different ways, fulfills a certain philosophical desire by proposing that the natural laws governing our universe should be such that another universe, like the one described, where its inhabitants would be oblivious to their changes, would also adhere to these laws—not only from its own inhabitants' perspective but also from our measurements. In other words, this alternate universe should appear plausible to us as well as to its inhabitants.
Einstein decides to make his theory conform to this philosophical desire, and this greatly limits the modifications of clocks and scales which he permits himself for the purpose of representing gravitation. Further, if we express the alterations of the measures as functions of proximity to matter, velocity [332]and so forth, our expressions for these alterations will include, as a particular case, that where matter is absent, although the scales and observer may still remain. Our alteration of the scales and clocks with velocity must thus revert, for this case, to that corresponding to the older theory of relativity, in order to avoid predicting that two observers, in uniform motion relative to each other in empty space, will measure different values for the velocity of light. In this way, the velocity of light comes to play a part in expressing the alterations of the measures.
Einstein decides to shape his theory to fit this philosophical wish, which significantly restricts the changes he allows for clocks and scales to represent gravity. Additionally, if we express the changes in measurements based on closeness to matter, speed [__A_TAG_PLACEHOLDER_0__], and so on, our expressions for these changes will include, as a specific case, the situation where there is no matter, even though the scales and observer may still exist. Therefore, our adjustments of the scales and clocks with speed must return, in this case, to what corresponds to the older theory of relativity, to prevent predicting that two observers, moving at a steady pace relative to each other in empty space, will measure different values for the speed of light. In this manner, the speed of light becomes integral to expressing the changes in measurements.
Even with these restrictions, Einstein was able to do the equivalent of finding an alteration of scales and clocks in the presence of matter which would account for our finding that the planetary motions take place very nearly in accordance with Newton’s law. The new law has accounted with surprising accuracy for certain astronomical irregularities for which Newton’s law failed to account, and has predicted at least one previously unknown phenomenon which was immediately verified.
Even with these limitations, Einstein managed to figure out a way to adjust scales and clocks in the presence of matter that explained why planetary motions closely follow Newton’s law. The new law has surprisingly accurately explained some astronomical irregularities that Newton’s law couldn’t, and it has predicted at least one previously unknown phenomenon that was quickly confirmed.
In conclusion, it may be of interest to state how the new law describes the motion
of a particle in the vicinity of a body like the earth. The law amounts to stating
that, if we measure a short distance, radially as regards the earth’s center, we must
allow for the peculiarity of our units by dividing by
In conclusion, it might be interesting to note how the new law explains the motion of a particle near a body like the Earth. The law essentially states that, when we measure a short distance from the center of the Earth, we need to account for the specifics of our measurement units by dividing by
where r is the distance from the earth’s center, m [333]the mass of the earth, c the velocity of light, and G the Newtonian gravitational constant. Tangential measurements require no correction, but intervals of time as measured by our clocks must be multiplied, for each particular place, by the above factor. Then, in terms of the corrected measures so obtained, the particle will be found to describe a straight line with constant velocity although, in terms of our actual measures, it appears to fall with an acceleration. [334]
where r is the distance from the Earth's center, m [__A_TAG_PLACEHOLDER_0__] is the mass of the Earth,
XXI
THE EQUIVALENCE HYPOTHESIS
The Discussion of This, With Its Difficulties and the Manner in Which Einstein Has Resolved Them, from the Essay by
The discussion of this, along with its challenges and how Einstein has addressed them, from the essay by
Having shown that, of several systems all moving with reference to one another with uniform motion, no one is entitled to any preference over the others, and having deduced the laws for such systems, Einstein was confronted with a difficulty which had long been felt. A body rotating, which is a special case of an accelerated body, can be distinguished from one at rest, without looking outside it, by the existence of the so-called centrifugal forces.
Having demonstrated that in a range of systems all moving uniformly in relation to each other, none deserves preference over the others, and derived the laws governing these systems, Einstein faced a challenge that had been long recognized. A rotating body, which is a specific example of an accelerated body, can be differentiated from a stationary one, without needing to look externally, by the presence of what are known as centrifugal forces.
This circumstance, which gives certain bodies an absolute or preferential motion, is unpalatable to the relativist; he would like there to be no difference as regards forces1 between the case when the earth rotates with reference to outside bodies (the stars) considered as fixed, and the case when the earth is considered fixed and all outside bodies rotate around it. This point cannot be investigated by direct [335]experiment; we can spin a top but we cannot keep a top at rest and spin the world round it, to see if the forces are same.
This situation, which allows certain objects to have a fixed or favored movement, is problematic for the relativist; they wish there were no differences in forces1 between the scenario where the Earth spins relative to distant objects (the stars) considered as stationary, and the scenario where the Earth is considered stationary and everything else rotates around it. This issue can't be explored through direct [__A_TAG_PLACEHOLDER_0__]experiment; we can spin a top, but we can't keep a top still and spin the world around it to determine if the forces are the same.
In considering the problem of how to devise laws which should make all rotations relative, Einstein conceived the brilliant yet simple idea that gravitation could be brought into the scheme as an acceleration effect, since both ordinary accelerational forces and gravitational forces are proportional to the same thing, the mass of a body. The impossibility of separating the two kinds of effect can be easily seen by considering the starting of an elevator. When the elevator is quickly accelerated upwards we feel a downward pull, just as if the gravitational pull had been increased, and if the acceleration continued to be uniform, bodies tested with a spring balance would all weigh more in the elevator than they did on firm ground. In a similar way the whole of the gravitational pull may be considered to be an accelerational effect, the difficulty being to devise laws of motion which will give the effects that we find by actual observation.
In thinking about how to create laws that would make all spins relative, Einstein came up with the brilliant yet straightforward idea that gravity could be viewed as an acceleration effect, since both regular accelerating forces and gravitational forces relate to the same factor: the mass of an object. It’s easy to see how these two effects are inseparable by looking at what happens when an elevator starts moving. When the elevator rapidly accelerates upward, we feel a downward force, just as if the pull of gravity had increased. If the acceleration remained consistent, objects measured with a spring scale would all weigh more in the elevator than they do on solid ground. Similarly, we can view the entire gravitational pull as an acceleration effect, with the challenge being to create motion laws that replicate the effects we observe in reality.
But it is obvious that we cannot, by ordinary mechanics, consider the earth as being accelerated in all directions, which we should have to do, apparently, to account for the fact that the gravitational pull is always toward the center. [It is obvious that we cannot explain gravitation by assuming that the earth’s surface is continually moving outward with an accelerated velocity.]227 So Einstein found that, as long as we treat the problem by Euclid’s geometry, we cannot reach a satisfactory solution. But he found that to the four-dimensional [336]space made up of the three ordinary dimensions of space, together with the time-dimension which we have already mentioned in discussing the special theory, may be attributed a peculiar geometry, the nature of which departs more and more from Euclidean geometry as we approach a gravitational body, and the net result of which is to make possible the universal correspondence of gravitation and acceleration.
But it's clear that we can't, using regular mechanics, think of the earth as being accelerated in all directions, which we would need to do to explain why gravity always pulls toward the center. [It's clear that we can't explain gravity by assuming that the earth's surface is constantly moving outward with increasing speed.]227 So Einstein discovered that, as long as we approach the problem using Euclid’s geometry, we won't find a satisfactory solution. He found that the four-dimensional [__A_TAG_PLACEHOLDER_0__]space, consisting of the three typical dimensions of space along with the time dimension we've already talked about in relation to the special theory, can have a unique geometry. This geometry increasingly differs from Euclidean geometry as we get closer to a gravitational body, resulting in a connection between gravity and acceleration that makes sense universally.
This modification of the geometry of space is often spoken of as the “curvature of space,” an expression which is puzzling, especially as the space which is “curved” is four-dimensional time-space. But we can get an idea of what is meant by considering figures, triangles say, drawn on the surface, of a sphere. These triangles, although drawn on a surface, will not have the same properties as triangles drawn on flat paper—their three angles will not together equal right angles. They will be non-Euclidean. This is only a rough analogy, but we can see that the curvature of the surface causes a departure from Euclidean geometry for plane figures, and consequently the departure from Euclidean laws extended to four dimensions may be referred to as caused by “curvature of space.”
This change in the geometry of space is often called the “curvature of space,” a term that can be confusing, especially since the space that is “curved” is four-dimensional time-space. However, we can better understand what it means by looking at shapes, like triangles, drawn on the surface of a sphere. These triangles, even though they are on a surface, won’t have the same properties as triangles drawn on a flat piece of paper—their three angles won’t add up to right angles. They will be non-Euclidean. This is just a basic analogy, but it shows that the curvature of the surface leads to a deviation from Euclidean geometry for flat shapes, and therefore, the deviation from Euclidean rules in four dimensions can be described as being caused by the “curvature of space.”
It is difficult to imagine a lump of matter affecting the geometry of the space round it. Once more we must use a rough illustration. Imagine a very hot body, and that, knowing nothing of its properties, we have to measure up the space round it with metal measuring-rods. The nearer we are to the body, the longer the rods will become, owing to the expansion of the metal. When we measure out a [337]square, one side of which is nearer the body than the opposite side, its angles will not be right angles. If we knew nothing of the laws of heat we should say that the body had made the space round it non-Euclidean.
It's hard to picture a mass influencing the shape of the space around it. Let's use a simple example. Imagine a very hot object, and we have to measure the space around it with metal measuring rods without knowing anything about its properties. The closer we get to the object, the longer the rods will stretch due to the metal expanding. When we measure out a [__A_TAG_PLACEHOLDER_0__]square, one side closer to the object than the other, its corners won't form right angles. If we didn't understand the laws of heat, we might conclude that the object has made the space around it non-Euclidean.
Einstein found, then, that by taking the properties of space, as given by measurement, to be modified in the neighborhood of masses of matter, he could devise general laws according to which gravitational effects would be produced, and there would be no absolute rotation. All forces will be the same whether a body rotates with everything outside it fixed, or the body is fixed, and everything rotates round it. All motion is then relative, and the theory is one of “general relativity.” The velocity of light is, however, no longer constant, and its path is not a straight line, if it is passing near gravitating matter. This does not contradict the special theory, which did not allow for gravitation. Rather, the special theory is a particular case to which the generalised theory reduces when there is no matter about, just as the Newtonian dynamics is a special case of the special theory, which we obtain when all velocities are small compared to that of light. [338]
Einstein discovered that by altering the properties of space, based on measurements, in the vicinity of massive objects, he could formulate general laws to explain how gravitational effects occur, and that there would be no absolute rotation. All forces are the same whether an object is rotating with everything else fixed, or the object is stationary while everything revolves around it. All motion is therefore relative, leading to the theory of “general relativity.” However, the speed of light is no longer constant, and its trajectory is not a straight line when it passes close to a gravitational body. This doesn't contradict the special theory, which didn't consider gravity. Instead, the special theory is a specific case of the generalized theory that applies when there is no matter present, just as Newtonian dynamics is a special case of the special theory, which we observe when all speeds are small compared to light. [__A_TAG_PLACEHOLDER_0__]
XXII
THE GENERAL THEORY
Fragments of Particular Merit on This Phase of the Subject
Key Sections on This Topic
When Dorothy was carried by the cyclone from her home in Kansas to the land of Oz, together with her uncle’s house and her little dog Toto, she neglected to lower the trap door over the hole in the floor which formerly led to the cyclone cellar and Toto stepped through. Dorothy rushed to the opening expecting to see him dashed onto the rocks below but found him floating just below the floor. She drew him back into the room and closed the trap.
When Dorothy was swept away by the tornado from her home in Kansas to the Land of Oz, along with her uncle’s house and her little dog Toto, she forgot to close the trapdoor over the hole in the floor that used to lead to the storm cellar, and Toto stepped through it. Dorothy hurried to the opening, fearing he would be thrown onto the rocks below, but found him floating just beneath the floor. She pulled him back into the room and shut the trapdoor.
The author of the chronicle of Dorothy’s adventures explains that the same force which held up the house held up Toto but this explanation is not necessary. Dorothy was now floating through space and house and dog were subject to the same forces of gravitation which gave them identical motions. Dorothy must have pushed the dog down onto the floor and in doing so must herself have floated to the ceiling whence she might have pushed herself back to the floor. In fact gravitation was apparently suspended and Dorothy was in a position to have tried certain experiments which Einstein has never [339]tried because he was never in Dorothy’s unique position.]188
The author of the chronicle of Dorothy’s adventures explains that the same force keeping the house up also kept Toto up, but this explanation isn't really needed. Dorothy was now floating through space, and both the house and the dog were affected by the same gravitational forces, making them move in the same way. Dorothy must have pushed the dog down onto the floor, and in doing so, she must have floated up to the ceiling, from which she could push herself back down to the floor. It seemed like gravity was actually suspended, and Dorothy had the chance to try out some experiments that Einstein never did because he was never in Dorothy’s unique situation. [__A_TAG_PLACEHOLDER_0__]188
* * *
Please provide the text you would like to modernize.
The Principle of Equivalence, of which Einstein’s suspended cage experiment is the usual illustration, and upon which the generalized theory of relativity is built, is thus stated by Prof. Eddington: “A gravitational field of force is precisely equivalent to an artificial field of force, so that in any small region it is impossible, by any conceivable experiment, to distinguish between them. In other words, force is purely relative.”
The Principle of Equivalence, commonly illustrated by Einstein's suspended cage experiment and the foundation of the generalized theory of relativity, is stated by Prof. Eddington: “A gravitational field of force is exactly equivalent to an artificial field of force, so in any small area, it’s impossible, through any imaginable experiment, to tell them apart. In other words, force is entirely relative.”
This may be otherwise stated by going back to our idea of a four-dimensional world, the points of which represent the positions and times of events. If we mark in such a space-time the successive positions of an object we get a line, or curve, which represents the whole history of the object, inasmuch as it shows us the position of the object at every time. The reader may imagine that all events happen in one plane, so that only two perpendicular dimensions are needed to fix positions in space, with a third perpendicular dimension for time. He may then conceive, if he may not picture, an analogous process for four-dimensional space-time. These lines, “tracks of objects through space-time,” were called by Minkowski “world-lines.” We may now say that all the events we observe are the intersections of world-lines. The temperature at noon was 70°. This means that if I plot the world-line of the top of the mercury column and the world-line of a certain mark on the glass they intersect in a certain point of space-time. All that we know are [340]intersections of these world-lines. Suppose now we have a large number of them drawn in our four-dimensional world, satisfying all known intersections, and let us suppose the whole embedded in a jelly. We may distort this jelly in any way, changing our coordinates as we please, but we shall neither destroy nor create intersections of world-lines. It may be proved that a change from one system of reference, to which observations are referred, to any other system, moving in any way with respect to the first system, may be pictured as a distortion of the four-dimensional jelly. The laws of nature, therefore, being laws that describe intersections, must be expressible in a form independent of the reference system chosen.
This can also be expressed by revisiting our concept of a four-dimensional world, where the points represent the positions and times of events. If we plot the successive positions of an object in this space-time, we create a line or curve that illustrates the entire history of the object, since it shows us the object's position at every moment. You can imagine that all events occur in one plane, meaning only two perpendicular dimensions are needed to specify positions in space, with a third perpendicular dimension for time. You can then think, if not visualize, a similar process for four-dimensional space-time. These lines, “tracks of objects through space-time,” were referred to by Minkowski as “world-lines.” We can now say that all the events we observe are the intersections of world-lines. For example, the temperature at noon was 70°F. This means that if I chart the world-line of the top of the mercury column and the world-line of a specific mark on the glass, they intersect at a specific point in space-time. All that we know are [__A_TAG_PLACEHOLDER_0__] intersections of these world-lines. Now, suppose we have a large number of them drawn in our four-dimensional world, fitting all known intersections, and let’s imagine the whole integrated in a jelly. We can distort this jelly in any way, changing our coordinates as we wish, but we won’t destroy or create any intersections of world-lines. It can be demonstrated that changing from one reference system, to which observations are related, to another system that moves relative to the first, can be visualized as a distortion of the four-dimensional jelly. So, the laws of nature, being laws that describe intersections, must be expressible in a way that is independent of the reference system chosen.
From these postulates, Einstein was able to show such a formulation possible. His law may be stated very simply:—All bodies move through space-time in the straightest possible tracks.
From these principles, Einstein demonstrated that such a formulation was possible. His law can be stated very simply:—All objects move through space-time along the straightest possible paths.
The fact that an easy non-mathematical explanation can not be given, of how this law is reached, or of just why the straightest track of Mercury through space-time will give us an ellipse in space after we have split space-time up into space and time, is no valid objection to the theory. Newton’s law that bodies attract with a force proportional to their masses and inversely proportional to the square of the distance is simple, but no one has ever given an easy non-mathematical proof of how that law requires the path of Mercury to be an ellipse, with the sun at a focus, instead of some other curve.]182
The fact that we can't provide a simple, non-mathematical explanation for how this law is derived or why Mercury’s path through space-time results in an ellipse after we separate space and time is not a valid criticism of the theory. Newton's law states that objects attract each other with a force that is proportional to their masses and inversely proportional to the square of the distance between them. It's straightforward, but no one has ever provided an easy, non-mathematical proof to show why Mercury's path is an ellipse, with the sun at one focus, instead of some other type of curve.182
* * *
Below is a short piece of text (5 words or fewer). Modernize it into contemporary English if there's enough context, but do not add or omit any information. If context is insufficient, return it unchanged. Do not add commentary, and do not modify any placeholders. If you see placeholders of the form __A_TAG_PLACEHOLDER_x__, you must keep them exactly as-is so they can be replaced with links. * * *
[341] One of the grave difficulties we have in gaining a satisfactory comprehension of Einstein’s conceptions, is that they do not readily relate themselves to our modes of geometrical thought. Within limits we may choose our own geometry, but it may be at the cost of unwieldy complication. If we think with Newton in terms of Euclidean geometry and consider the earth as revolving around the sun, the motions of our solar system can be stated in comparatively simple terms. If, on the other hand, we should persist in stating them, as Ptolemy would have done, from the earth as a relatively stationary center, our formulas will become complicated beyond ready comprehension. For this reason it is much simpler in applying the theory of relativity, and in considering and describing what actually happens in the physical universe, to use geometrical conceptions to which the actual conditions can be easily related. We find such an instrument in non-Euclidean geometry, wherein space will appear as though it were projected from a slightly concave mirror. It is in this sense that some speak of space as curved. The analogy is so suggestive it tempts one to linger over it. Unless there were material objects within the range of the mirror, its conformation would be immaterial; the thought of the space which the mirror, as it were, circumscribes, is dependent upon the presence of such material objects. The lines of light and of all other movement will not be quite “straight” from the view-point of Euclidean geometry. A line drawn in a universe of such a nature must inevitably return upon itself. Nothing therefore, can ever pass out of this unlimitedly great [342]but yet finite cosmos. But even now, since our imaginary mirror is only very slightly concave, it follows that for limited regions like the earth or even the solar system, our conception of geometry may well be rectilinear and Euclidean. Newton’s law of gravitation will be quite accurate with only a theoretical modification drawn from the theory of relativity.]82
[__A_TAG_PLACEHOLDER_0__] One of the major challenges we face in fully understanding Einstein’s ideas is that they don't easily connect with the way we typically think about geometry. While we can choose our own geometry to some extent, doing so may lead to cumbersome complexities. If we use Newton’s perspective based on Euclidean geometry and view the Earth as orbiting the sun, the movements of our solar system can be described in fairly straightforward terms. However, if we insist on describing them from a position where the Earth is seen as a relatively stationary center, like Ptolemy would have, our equations become overly complicated and hard to grasp. This is why applying the theory of relativity and explaining what actually happens in the physical universe is much simpler when we use geometric concepts that can be easily related to actual conditions. Non-Euclidean geometry serves as such a tool, where space appears to be projected from a slightly concave mirror. This is what some mean when they talk about space as being curved. The analogy is so evocative that it invites further exploration. If there were no physical objects within the mirror's range, its shape would have no significance; the concept of space that the mirror seems to outline relies on the presence of those objects. The paths of light and other forms of movement won't be perfectly "straight" when viewed through the lens of Euclidean geometry. A line drawn in a universe with this quality must ultimately loop back on itself. Thus, nothing can ever escape this boundless yet finite [__A_TAG_PLACEHOLDER_0__] cosmos. However, since our imagined mirror is only very slightly concave, it means that in limited areas like the Earth or even the solar system, our geometric understanding can still be largely straight and Euclidean. Newton’s law of gravitation will remain quite accurate with just a theoretical adjustment from the theory of relativity.82
* * *
* * *
The way in which a curvature of space might appear to us as a force is made plainer by an example. Suppose that in a certain room a marble dropped anywhere on the floor always rolled to the center of the room; suppose the same thing happened to a baseball, a billiard ball, and a tennis ball. These results could be explained in two ways; we might assume that a mysterious force of attraction existed at the center of the floor, which affected all kinds of balls alike; or we might assume that the floor was curved. We naturally prefer the latter explanation. But when we find that in the neighborhood of a large material body all other bodies move toward it in exactly the same manner, regardless of their nature or their condition, we are accustomed to postulate a mysterious attractive force (gravitation); Einstein, on the contrary, adopts the other alternative, that the space around the body is curved.]223
The way a curvature of space can seem like a force becomes clearer with an example. Imagine that in a certain room, a marble dropped anywhere on the floor always rolls toward the center; the same happens with a baseball, a billiard ball, and a tennis ball. We could explain these results in two ways: we might think there’s some mysterious force of attraction at the center of the floor that affects all types of balls equally, or we could assume that the floor is curved. We naturally lean toward the latter explanation. But when we notice that near a large mass, all other objects move toward it in exactly the same way, regardless of their type or state, we tend to believe in a mysterious attractive force (gravity); Einstein, however, chooses the other option, which is that the space around the mass is curved.223
* * *
* * *
In the ordinary “analytical geometry,” the position and motion of all the points considered is referred to a rigid “body” or “frame of reference.” This usually consists of an imaginary room of suitable [343]size. The position of any point is then given by three numbers, i.e., its distances from one side wall and from the back wall and its height above the floor. These three numbers can only give one point, every other point having at least one number different. In four-dimensional geometry a fourth wall may be vaguely imagined as perpendicular to all three walls, and a fourth number added, giving the distance of the “point” from this wall also. Since “rigid” bodies do not exist in gravitational fields the “frame of reference” must be “non-rigid.” The frame of reference in the Gaussian system need not be rigid, it can be of any shape and moving in any manner, in fact a kind of jelly. A “point” or “event” in the four dimensioned world is still given by four numbers but these numbers do not represent distances from anywhere; all that is necessary is that no two events shall have exactly the same four numbers to represent them, and that two events which are very close together shall be represented by numbers which differ only slightly from one another. This system assumes so little that it will be seen to be very wide in its scope; although to the ordinary mind, what is gained in scope seems more than that lost in concreteness. This does not concern the mathematician, however, and by using this system he gains his object, proving that the general laws of nature remain the same when expressed in any Gaussian coordinate system whatever.]220
In basic “analytical geometry,” the position and movement of all points considered are related to a rigid “body” or “frame of reference.” This typically involves an imaginary room of suitable [__A_TAG_PLACEHOLDER_0__] size. The position of any point is represented by three numbers, i.e., its distances from one side wall, from the back wall, and its height above the floor. These three numbers can only identify one point, with every other point having at least one number different. In four-dimensional geometry, a fourth wall can be vaguely imagined as perpendicular to all three walls, adding a fourth number to indicate the distance of the “point” from this wall as well. Since “rigid” bodies don’t exist in gravitational fields, the “frame of reference” must be “non-rigid.” The frame of reference in the Gaussian system doesn’t have to be rigid; it can take any shape and move in any way, essentially being like jelly. A “point” or “event” in the four-dimensional world is still represented by four numbers, but these numbers don’t depict distances from anywhere; all that’s needed is that no two events have exactly the same four numbers, and that two events which are very close together are represented by numbers that differ only slightly. This system makes minimal assumptions, making it very broad in scope; however, to most people, the vastness gained seems to outweigh the specificity lost. This isn’t an issue for mathematicians, though, and by using this system, they achieve their goal: proving that the general laws of nature remain consistent when expressed in any Gaussian organize system whatsoever.]220
* * *
* * *
Einstein enunciates a general principle that it is possible to find a transformation of coordinate axes which is exactly equivalent to any force, and in particular [344]one which is equivalent to the force of gravitation. That is by concentrating our attention on the transformation which is a purely mathematical operation we can afford to neglect the force completely. To get a better idea of this principle of equivalence as it is called, let us consider a relatively simple example (which actually has nothing to do with gravitation, but which will serve to make our notions clearer.) A person on the earth unconsciously refers all his experiences, i.e., the motions of the objects around him to a set of axes fixed in the earth on which he stands. However, we know that the earth is rotating about its axis, and his axes of reference are also rotating with respect to the space about him. From the point of view of general relativity it is exactly because we do refer motions on the surface of the earth to axes rotating with the earth that we experience the so-called centrifugal force of the earth’s rotation, with which everyone is familiar. If we could find it convenient to transform from moving axes to fixed axes, the force would vanish, since it is exactly equivalent to the transformation from one set of axes to the other. However, we find it unnatural to refer daily experiences to axes that are not placed where we happen to be, and so we prefer to take the force and rotating axes instead of no force and fixed axes.]272
Einstein states a general principle that it's possible to find a transformation of coordinate axes that is exactly equivalent to any force, particularly [__A_TAG_PLACEHOLDER_0__] one that represents the force of gravity. By focusing on the transformation, which is purely a mathematical operation, we can completely ignore the force. To better understand this principle of equivalence, let’s look at a relatively simple example (which doesn't actually relate to gravity, but will help clarify our ideas). A person on Earth unconsciously relates all their experiences, i.e., the movements of objects around them, to a set of axes fixed to the ground they're standing on. However, we know that the Earth is rotating on its axis, and their reference axes are also moving in relation to the space around them. From the perspective of general relativity, it's precisely because we relate movements on the Earth's surface to axes that rotate with the planet that we experience the so-called centrifugal force from the Earth's rotation, which everyone is familiar with. If we could find it convenient to shift from moving axes to fixed axes, the force would disappear, since it's exactly equivalent to the transformation from one set of axes to another. However, we find it unnatural to relate our daily experiences to axes that aren’t positioned where we are, so we choose to acknowledge the force and rotating axes instead of no force and fixed axes.]272
* * *
Please provide the text you'd like me to modernize.
We seem to have a direct experience of force in our muscular sensations. By pushing or pulling we can set bodies in motion. It is natural to assume, that something similar occurs, when Nature set bodies in motion. But is this not a relic of animism? [345]The savage and the ancients peopled all the woods and skies with Gods and demons, who carries on the activities of nature by their own bodily efforts. Today we have dispossessed the demons, but the ghost of a muscular pull still holds the planets in place.]141
We seem to have a direct experience of force through our muscle sensations. By pushing or pulling, we can set objects in motion. It’s natural to assume that something similar happens when Nature sets things in motion. But isn’t this just a leftover idea from animism? [__A_TAG_PLACEHOLDER_0__] Primitive people and ancient cultures filled the forests and skies with gods and demons, who performed nature's activities through their own physical efforts. Today, we've removed those demons, but the ghost of a muscular pull still keeps the planets in their orbits.]141
* * *
Understood! Please provide the text you'd like me to modernize.
The general theory is an extension of the special theory which enables the law of gravitation to be deduced. Not in Newton’s form, it is true, but in a better form, that is, one that accounts for two important facts otherwise not explained. But it is a far more general theory that indicated above. It is a complete study of the relations between laws expressed by means of any four coordinates (of which three space and one time is a special case), and the same laws expressed in the four coordinates of a system having any motion whatever with respect to the first system. By restricting this general study in accordance with certain postulates about the nature of the universe we live in, we arrive at a number of conclusions which fit more closely with observed facts that the conclusions drawn from Newton’s theory.]221
The general theory builds on the special theory to derive the law of gravitation. While it's not in Newton's form, it is presented in a more effective way that explains two important facts that were previously overlooked. However, this is just one aspect of a much broader theory. It thoroughly examines the relationships between laws described using any four coordinates (of which three are spatial and one is temporal) and how these same laws are described in the four coords of a system that is moving in any way compared to the original system. By narrowing this general study based on certain assumptions regarding the nature of the universe we live in, we can reach several conclusions that align more closely with observed facts than those derived from Newton's theory.]221
Colophon
Availability
The text was produced from the 1921 first edition. Except for the title page, the illustrations were taken from the 1922 second edition. Those are the same as those of the first edition, but were available in a significantly better scan-quality. Note that textual changes made in the second edition have not been applied to this transcription of the first edition.
The text was created from the 1921 first edition. Except for the title page, the illustrations were taken from the 1922 second edition. These are the same as those in the first edition but are available in a significantly better scan quality. Please note that textual changes made in the second edition have not been applied to this transcription of the first edition.
Metadata
Title: | Einstein’s Theories of Relativity and Gravitation: A Selection of Material from the Essays Submitted in the Competition for the Eugene Higgins Prize of $5,000 | |
Editor: | James Malcolm Bird (1886–1964) | Info |
Contributor: | A. d’Abro | |
Contributor: | L. M. Alexander | |
Contributor: | Prof. Joseph S. Ames | |
Contributor: | Prof. E. N. da C. Andrade | |
Contributor: | Lieut. W. Mark Angus | |
Contributor: | Paul M. Batchelder | |
Contributor: | E. T. Bell | |
Contributor: | Lyndon Bolton | |
Contributor: | Dr. George de Bothezat | |
Contributor: | Charles A. Brunn | |
Contributor: | Charles H. Burr | |
Contributor: | Dorothy Burr | |
Contributor: | Elwyn F. Burrill | |
Contributor: | Professor A. E. Caswell | |
Contributor: | Edward A. Clarke | |
Contributor: | Harold T. Davis | |
Contributor: | R. F. Deimel | |
Contributor: | C. E. Dimick | |
Contributor: | Saul Dushman | |
Contributor: | Hugh Elliot | |
Contributor: | Earl R. Evans | |
Contributor: | E. P. Fairbairn, M.C., B.Sc. | |
Contributor: | Francis Farquhar | |
Contributor: | Montgomery Francis | |
Contributor: | J. Elias Fries, Fellow A.I.E.E. | |
Contributor: | H. Gartelmann | |
Contributor: | James O. G. Gibbons | |
Contributor: | Norman E. Gilbert | |
Contributor: | Dean W. P. Graham | |
Contributor: | D. B. Hall | |
Contributor: | George Frederick Hemens, M.C., B.Sc. | |
Contributor: | C. W. Kanolt | |
Contributor: | Frank E. Law | |
Contributor: | R. Bruce Lindsay | |
Contributor: | Rev. George Thomas Manley | |
Contributor: | George F. Marsteller | |
Contributor: | John G. McHardy | |
Contributor: | L. F. H. de Miffonis. B.A., C.E. | |
Contributor: | Col. John Millis | |
Contributor: | Dr. M. C. Mott-Smith | |
Contributor: | Dr. Francis D. Murnaghan | |
Contributor: | Edward A. Partridge | |
Contributor: | Professor Andrew H. Patterson | |
Contributor: | Professor William H. Pickering | |
Contributor: | William Hemmenway Pratt | |
Contributor: | Kenneth W. Reed | |
Contributor: | James Rice | |
Contributor: | Edward Adams Richardson | |
Contributor: | Walter van B. Roberts | |
Contributor: | C. E. Rose, M.E. | |
Contributor: | T. Royds | |
Contributor: | Prof. Henry Norris Russell | |
Contributor: | Prof. Moritz Schlick | |
Contributor: | Leopold Schorsch | |
Contributor: | Prof. J. A. Schouten | |
Contributor: | Frederick W. Shurlock | |
Contributor: | W. de Sitter | |
Contributor: | Prof. William Benjamin Smith | |
Contributor: | Robert Stevenson | |
Contributor: | Prof. W. F. G. Swann | |
Contributor: | Prof. Arthur Gordon Webster | |
Contributor: | L. L. Whyte | |
Contributor: | Prof. R. W. Wood | |
Language: | English | |
Original publication date: | 1921 |
Revision History
- 2020-08-29 Started.
External References
Corrections
The following corrections have been applied to the text:
The following corrections have been made to the text:
Page | Source | Correction | Edit distance |
---|---|---|---|
vii | is made is made | is made | 8 |
viii | btween | between | 1 |
xiii | Chiselhurst | Chislehurst | 2 |
28 | attriuting | attributing | 1 |
38 | axis | axes | 1 |
46, 327 | [Not in source] | , | 1 |
48 | [Not in source] | ” | 1 |
59 | assums | assumes | 1 |
61 | II | III | 1 |
74 | follows | follow | 1 |
74 | parellel | parallel | 1 |
96 | inadmissable | inadmissible | 1 |
110 | apposite | opposite | 1 |
114 | synonomous | synonymous | 1 |
123 | indepedent | independent | 1 |
125 | Descartean | Cartesian | 6 |
126 | propostions | propositions | 1 |
127 | ’ | ” | 1 |
144 | incomphehensible | incomprehensible | 1 |
151 | hypothenuse | hypotenuse | 1 |
153 | dimension | dimensions | 1 |
156 | Betelguese | Betelgeuse | 2 |
169, 282, 282 | coördinates | coordinates | 1 / 0 |
170 | [Not in source] | . | 1 |
181 | Michelsen | Michelson | 1 |
182 | Michelsen-Morley | Michelson-Morley | 1 |
185 | that: | [Deleted] | 5 |
205 | gometry | geometry | 1 |
206 | and | und | 1 |
213 | distroyed | destroyed | 1 |
219 | wells | walls | 1 |
228 | indisolubly | indissolubly | 1 |
231 | signficance | significance | 1 |
240 | amazinging | amazing | 3 |
244 | dimsensions | dimensions | 1 |
249 | Relativitatstheorie | Relativitätstheorie | 1 / 0 |
254 | oberservers | observers | 2 |
260, 345 | co-ordinates | coordinates | 1 |
262 | or | of | 1 |
279 | Poincairé | Poincaré | 1 |
291 | Jean | Jeans | 1 |
298 | methematicians | mathematicians | 1 |
305, 305, 305 | Asso. | Assoc. | 1 |
305 | Astro. | Astron. | 1 |
312 | axactly | exactly | 1 |
315 | . | , | 1 |
319 | acrross | across | 1 |
322 | us | use | 1 |
327 | BY | [Deleted] | 3 |
329 | waste | wasted | 1 |
330 | analagous | analogous | 1 |
335 | rotatons | rotations | 1 |
340 | imbedded | embedded | 1 |
343, 343 | co-ordinate | coordinate | 1 |
344 | gravitaton | gravitation | 1 |
345 | line | live | 1 |
Download ePUB
If you like this ebook, consider a donation!